gallery-dl

Author	SHA1	Message	Date
Mike Fährmann	8f338347b6	[imagehosts] cleanup removed - chronos.to - unable to resolve hostname - coreimg.net - same - imgmaid.net - same - hosturimage.com - everything returns 404 - imageontime.org - redirects to some shady site - imgupload.yt - cloudflare error 522, host down - img4ever.net - read timeout	2018-02-23 01:05:42 +01:00
Mike Fährmann	e1e0668ca8	add option to set default replacement field value Missing or undefined keywords will now be replaced with the value set for 'keywords-default'. The default is Python's 'None', which is equivalent to setting this option to JSON's 'null'.	2018-02-23 00:59:20 +01:00
Mike Fährmann	ac3da8115e	[util] don't add text: URLs to list of downloaded URLs	2018-02-20 18:14:27 +01:00
Mike Fährmann	89440382ad	[tumblr] use separate API key for unit tests	2018-02-19 16:54:37 +01:00
Mike Fährmann	b50bdbf3d7	change config specifiers in input file format Instead of a dictionary/object, input file options are now specified by a 'key=value' pair starting with '-' for options only applying to the next URL or '-G' for Global options applying to all following URLs. See the docstring of parse_inputfile() for details. Example option specifiers: - filename = "{id}.{extension}" - extractor.pixiv.user.directory = ["Pixiv Users", "{user[id]}"] -spaces="are_optional" -G keywords = {"global": "option"}	2018-02-16 03:10:41 +01:00
Mike Fährmann	be3ea4425d	test archive-id creation and uniqueness	2018-02-12 23:02:09 +01:00
Mike Fährmann	b73b8b4f50	add OAuth unittests	2018-02-12 17:07:07 +01:00
Mike Fährmann	f5f2d29f56	[nijie] fix dojin extraction - correctly extract artist_id - set extension to "jpg" if it was empty and let filetype checks do the rest	2018-02-09 22:06:26 +01:00
Mike Fährmann	7a412f5c32	implement generic manga-chapter extractor	2018-02-04 22:02:04 +01:00
Mike Fährmann	aa38eab2be	allow not-defined fields in format strings ... and replace them with "None", for now	2018-02-03 22:28:41 +01:00
Mike Fährmann	619387cbb1	update extractor unittest results	2018-01-28 18:29:05 +01:00
Mike Fährmann	f94e3706a8	use logging module for error messages during downloads	2018-01-26 18:11:13 +01:00
Mike Fährmann	0dd48d644f	update test results nothing broke, but things got updated or changed	2018-01-23 21:38:29 +01:00
Mike Fährmann	1e93955170	[batoto] remove module Site officially shut down on 2018.01.18	2018-01-23 21:37:32 +01:00
Mike Fährmann	f10ffc0839	update extractor blacklist to also allow classes	2018-01-14 18:47:22 +01:00
Mike Fährmann	35e09869d1	[mangapark] fix image URLs and use HTTPS	2018-01-12 14:59:49 +01:00
Mike Fährmann	4edb25346e	[slideshare] support mobile URLs (closes #67 )	2018-01-10 14:15:00 +01:00
Mike Fährmann	b33efc99a4	[idolcomplex] add support for idol.sankakucomplex.com	2018-01-09 17:54:37 +01:00
Mike Fährmann	1a70857a12	update extractor-unittest capabilities - "count" can now be a string defining a comparison in the form of '<operator> <value>', for example: '> 12' or '!= 1'. If its value is not a string, it is assumed to be a concrete integer as before. - "keyword" can now be a dictionary defining tests for individual keys. These tests can either be a type, a concrete value or a regex starting with "re:". Dictionaries can be stacked inside each other. Optional keys can be indicated with a "?" before its name. For example: "keyword:" { "image_id": int, "gallery_id", 123, "name": "re:pattern", "user": { "id": 321, }, "?optional": None, }	2017-12-30 19:05:37 +01:00
Mike Fährmann	28cd78aae0	[kissmanga] extend chapter-string regex (closes #58 )	2017-12-24 22:53:10 +01:00
Mike Fährmann	fc7d165c97	[deviantart] add support for OAuth2 authentication Some user galleries [] require you to be either logged in or authenticated via OAuth2 to access their deviations. [] e.g. https://polinaegorussia.deviantart.com/gallery/ -------------- known issue: A deviantart 'refresh_token' can only be used once and gets updated whenever it is used to request a new 'access_token', so storing its initial value in a config file and reusing it again and again is not possible.	2017-12-18 01:16:46 +01:00
Mike Fährmann	0a9a07a6e1	[slideshare] improve metadata; flake8 - added 'views' and 'published' keywords - fixed longer titles and descriptions	2017-12-13 21:16:49 +01:00
Mike Fährmann	291369eab2	various smaller changes/additions	2017-12-06 21:45:56 +01:00
Mike Fährmann	300346ecdf	[mangazuki] remove extractors This site has been in "rebuild"-mode for a fairly long time and the current extractor code isn't going to work for the new version either.	2017-12-04 13:36:04 +01:00
Mike Fährmann	93482a1f88	implement 'util.advance()'	2017-12-03 01:38:24 +01:00
Mike Fährmann	a718c6c6cd	implement 'util.parse_bytes()'	2017-12-02 01:24:49 +01:00
Mike Fährmann	214972bc9a	[gelbooru] use manual extraction ... to compensate for their disabled API. (https://gelbooru.com/index.php?page=forum&s=view&id=3875) This also adds an extractor for image-pools.	2017-11-29 20:48:17 +01:00
Mike Fährmann	b14de6ffc2	[tumblr] small improvements - don't transform inline GIF URLs - set 'type' parameter for API calls if there is only one post type selected	2017-11-24 16:51:07 +01:00
Mike Fährmann	b8cdd42cab	[senmanga] fix extraction (again) this is basically a re-revert of 2ace5c7	2017-11-18 17:23:32 +01:00
Mike Fährmann	6913eeaa40	[powermanga] replace manga extractor unit test My Hero Academia is gone	2017-11-15 14:01:24 +01:00
Mike Fährmann	f72318e593	[seiga] support more than 200 images Due to API restrictions and/or missing knowledge about and documentation of API usage, it was only possible to retrieve the latest 200 images of a niconico seiga user with said API. The new approach manually visits each HTML page and gets its information from there.	2017-11-13 20:46:24 +01:00
Mike Fährmann	2457b71633	skip tests on 5xx status codes	2017-11-12 20:51:12 +01:00
Mike Fährmann	305da540c3	[mangahere] fix metadata extraction	2017-11-03 14:54:46 +01:00
Mike Fährmann	035ef655f1	[imagefap] update unit tests old gallery/image has been deleted	2017-10-27 12:22:16 +02:00
Mike Fährmann	caf26412dd	add option to set alternate location of .part files (#29 ) Note: The path set for 'downloader.*.part-directory' needs to point to an already existing directory.	2017-10-26 00:16:48 +02:00
Mike Fährmann	27c026543f	re-enable download unit tests	2017-10-25 12:55:36 +02:00
Mike Fährmann	b0353aa02d	rewrite download modules (#29 ) - use '.part' files during file-download - implement continuation of incomplete downloads - check if file size matches the one reported by server	2017-10-24 12:53:03 +02:00
Mike Fährmann	6af921a952	[sankaku] rewrite/improve (fixes #44 ) - add wait-time between HTTP requests similar to exhentai - add 'wait-min' and 'wait-max' options - increase retry-count for HTTP requests to 10 - implement user authentication (non-authenticated users can only view images up to page 25) - implement 'skip()' functionality (only works up to page 50) - implement image-retrieval for pages >= 51 - fix issue with multiple tags	2017-10-14 23:01:33 +02:00
Mike Fährmann	75d3a1f72f	[deviantart] always download original images Deviation-objects returned by the DeviantArt API don't always contain the URL and metadata of the original image ([1]). Getting this information requires an additional API call [2], which is indicated by the 'is_downloadable' and 'download_filesize' metadata within a deviation-object. [1] https://myria-moon.deviantart.com/art/Aime-Moi-part-en-vadrouille-261986576 [2] https://www.deviantart.com/developers/http/v1/20160316/deviation_download/bed6982b88949bdb08b52cd6763fcafd	2017-10-07 13:07:34 +02:00
Mike Fährmann	8e6a767109	[util] restructure formatter for better exception propagation	2017-10-06 17:10:35 +02:00
Mike Fährmann	0386503c80	fix (sub)category-transfer for DownloadJob instances (#41 ) ... and extend "parent" parameters to TestJob- and DataJob-classes as well.	2017-10-06 15:38:35 +02:00
Mike Fährmann	41adb99e9c	[pawoo] fix extraction - changed access_token - use account-search instead of general search	2017-10-02 18:33:52 +02:00
Mike Fährmann	b319f4bab3	smaller code and text changes	2017-10-01 18:23:40 +02:00
Mike Fährmann	c1f0afe4c6	add custom string formatter class	2017-09-28 17:12:39 +02:00
Mike Fährmann	85a2b2ae59	[khinsider] fix extraction	2017-09-28 11:47:26 +02:00
Mike Fährmann	8e14714c2b	[imgspice] fix extraction	2017-09-26 21:04:48 +02:00
Mike Fährmann	a85f06d2d1	[foolslide] restructure; convert suitable values to int	2017-09-24 16:57:47 +02:00
Mike Fährmann	9fc1d0c901	implement and use 'util.safe_int()' same as Python's 'int()', except it doesn't raise any exceptions and accepts a default value	2017-09-24 15:59:25 +02:00
Mike Fährmann	a9e7145651	[hbrowse] extract hmanga metadata & general maintenance	2017-09-20 16:25:25 +02:00
Mike Fährmann	84d4450410	[fallenangels] extract manga metadata	2017-09-15 20:51:40 +02:00

... 4 5 6 7 8

397 Commits