This package contains some very rudimental functions which will allow you to download all mp3 and zip files from the Project Gutenberg (http://www.gutenberg.org).
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
EXAMPLE
gutenberg_pages
gutenberg_download_mp3
gutenberg_download_zip
Downloading all the archive might take --> SEVERAL DAYS <--
After such a huge download --> HUNDREDS OF GYGABYTES <-- please keep one copy for you and give one to a friend, or a library, or a school, or a passer-by.
CONTRIBUTE TO DISTRIBUTE.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
This is a very rudimental way for downloading files from the Project Gutenberg (http://www.gutenberg.org).
If you know of a more elegant/efficient way to obtain the same result and you want to share I would greatly appreciate it. If you can show me how to do the same (or better) in a different programming language (C, Java, etc.), I will greatly appreciate it.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
Please, ensure you are respecting the rules the web site requires for robots, in this case:
http://www.gutenberg.org/wiki/Gutenberg:Information_About_Robot_Access_to_our_Pages
and
http://www.gutenberg.org/robots.txt
Particularly ensure the bot makes a reasonable pause at each loop so that it doesn't affect other users' possibility to access the web site.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
IMPORTANT. Please, remember that gutenberg dvds can be downloaded for free from:
http://www.gutenberg.org/wiki/Gutenberg:The_CD_and_DVD_Project
The dvds do not contain the mp3 files
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
You can get an offline version of the project gutenber web site:
http://www.gutenberg.org/wiki/Gutenberg:Feeds
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
You can get the project gutenberg catalog data:
http://www.gutenberg.org/wiki/Gutenberg:Feeds
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
In order to generate a final catalog, after the download, function mmfileinfo is used: be aware that this works only on Windows machines. |