Simple Python script that extracts the medical text from every page of MTSamples automatically and writes the contents to text files.
To launch the code, in the command line write:
python web_extract.py <text_folder_name>
where <text_folder_name> is the folder into which you want the text to be written to.