-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding Feature to download also Zip-files with the script ? #3
Comments
Do you have an example of the magazine issues where this is happening, and can you tell me how exactly it fails please? |
Hi, after i execute the download_articles.sh script as: sudo ./download_articles.sh ct 1990 It gives the following output: Logging in... ####################100,0% [ct][1990/01][TRY 1/3] Server did not serve a valid pdf (instead application/zip; charset=ISO-8859-1). I know it find a Zip file and can´t proceed it, because there is not so further loop to proceed zip files to download and merge. So he retry after 80 seconds and hang on the same issue. I´m not a good programmer so i would ask you if you can add this to your script or as a new one e.g download_articles_ZIP.sh I tried to change the .pdf things in the download_articles.sh script where it makes sense, but i think it´s not that easy to fix. |
I see, thank you for providing the error message. Do you need to use If you will use
I only changed the |
Hi, thanks for your fast answer. I have tried to change the lines 104-107 as you suggested. I´m fine with any script which is working. Well it also ends in an error code which i think is occured while the download.sh script is downloading complete files, but the articles from 1990 - 2009 are zipped html files, which are a number of 20 or more per month. As the single pdf files in the download_articles.sh script. The error message: [ct][1990/01][SKIP] Magazine issue does not exist on the server, skipping. Hopefully you can find a fix for that. Would be great to have the possibility to download the complete huge archive per magazine publisher. |
@AlexanderMelde: The URL path was wrong, @Xx-Ylzakros-xX: please use
But be aware that the downloaded content may just be a "mockup": I also start to pull at year 1990 and for Issue 01 and 02, the HTML page just contain the title, the abstract and the statement
And of course, the code that will try to concat the downloaded PDF parts to a single volume will fail at the moment. |
Hi,
thanks for your great work to make it easy to download pdf files in the archive.
One issue remain in your scripts. If it is a zip-folder you can´t download them in the same automatic process.
Is it possible to change this by my own in an easy way or can you publish another script which do that great work ?
It would be great.
The text was updated successfully, but these errors were encountered: