3.2. Download and parsing

This block starts when clicking on Download 'n parse button in the main window. It downloads checked files shown in the articles list. It can be considered as a unique block because it is developed in different ways and orders depending on search part setup.
For local loaded files, only parsing is performed.
For searches done with Google Scholar they are downloaded on different html or pdf files, and after all articles are downloaded, parsing is performed, showing in the progress bar the number of articles left to parse.
PubMed Central articles are downloaded as an only xml file that contains all articles, thus parsing is performed on this file. In the progress bar is shown an extimated remaining time, depending on internet connection and CPU speed.
Articles from PubMed searches downloading is usually very fast, as it is actually performed in the search block.

After parsing is done, articles are saved in txt-pseudo-xml form, with tags (when available) for title, journal, authors, date, abstract and body. The user can save these temporary files for future use (see Local Articles chapter) clicking on 'Save txt' button. Txts are saved in a folder that can be changed from the menu Edit/Preferences....


< 3.1.3. Local articles Index 3.3. Summary >