gallery-dl

Author	SHA1	Message	Date
Mike Fährmann	9e7dfc0cfc	[myportfolio] fix extraction of galleries without title	2020-04-08 21:08:05 +02:00
Mike Fährmann	88fca0a172	[mastodon] update OAuth credentials for pawoo.net (#665)	2020-04-06 00:50:30 +02:00
Mike Fährmann	4ae8a25567	[mastodon] use 'combine_dict()' to combine extractor info dicts	2020-04-05 21:45:00 +02:00
Mike Fährmann	220c06b86e	[mastodon] handle rate limits	2020-04-05 21:44:00 +02:00
Mike Fährmann	d02f7c1118	improve Extractor.wait() - allow 'until' to be a datetime object - do "time calculations" with UTC timestamps - set a default 'reason'	2020-04-05 21:23:05 +02:00
Mike Fährmann	5d7404ab58	[oauth] use the new name for 'DeviantartAPI' (fixes #670)	2020-04-04 20:34:47 +02:00
Mike Fährmann	762c758af4	[hiperdex] fix extraction	2020-04-03 21:25:25 +02:00
Mike Fährmann	f9a590f92b	[deviantart] apply HTTP request limits in more places "Request blocked" can also happen on sta.sh and for any HTTP request directed at deviantart.com	2020-04-03 21:21:59 +02:00
Mike Fährmann	2587296deb	[mastodon] add access tokens for mastodon.social and baraag.net (closes #665)	2020-04-02 22:34:32 +02:00
Mike Fährmann	ff7c0b7eff	[deviantart] handle "Request blocked" errors (#655) - add a 2 second wait time between requests to deviantart.com - catch 403 "Request blocked" errors and wait for 3 minutes until retrying	2020-04-02 22:14:02 +02:00
Mike Fährmann	c874684f05	[deviantart] retrieve all download URLs through OAuth API '/extended_fetch' as well as Deviation webpages now again contain Deviation UUIDs needed to grab Deviation info through the OAuth API, meaning cookies are no longer necessary to grab original files. The only case where cookies are still needed is scraps marked as "mature", since those entries are hidden for public users. (#655, #657, #660)	2020-04-02 22:10:33 +02:00
Mike Fährmann	5c27b25a8f	[deviantart] improve sta.sh extraction Extract all sta.sh items in a single extractor run. Don't spawn a new StashExtractor for each individual sta.sh item to preserve the current requests.Session and its opened TCP connections.	2020-04-01 03:17:25 +02:00
Mike Fährmann	e2fc4eaa6f	[deviantart] detect stash folders (fixes #659)	2020-04-01 01:59:03 +02:00
Mike Fährmann	c034159701	[piczel] fix extraction for single images	2020-03-31 22:47:23 +02:00
Mike Fährmann	699036ea0c	[weibo] accept status URLs with non-numeric IDs (#664)	2020-03-31 22:46:50 +02:00
Mike Fährmann	fe96f99e4b	[hentainexus] reduce line length (flake8) & update test	2020-03-31 22:08:43 +02:00
墨焓	6f81cac8fa	Add metadata to hentainexus: circle, event, title_conventional. (#661)	2020-03-31 21:59:02 +02:00
Mike Fährmann	6f911aeb1c	[deviantart] add error message for cloudFront blocks (#655)	2020-03-28 21:18:04 +01:00
Mike Fährmann	7499d71d02	[simplyhentai] ignore certificate errors in video test	2020-03-28 21:07:30 +01:00
Mike Fährmann	4203dc0bdc	[mangapark] fix metadata extraction	2020-03-28 03:00:26 +01:00
Mike Fährmann	1b82d36ab2	[deviantart] handle decode errors for extended_fetch results (#655) This isn't going to solve the underlying problem, but it should at least provide the server response when those errors happen.	2020-03-24 20:56:41 +01:00
Mike Fährmann	09f2271528	[35photo] add 'tag' extractor	2020-03-24 02:49:00 +01:00
Mike Fährmann	77fda8190c	[35photo] simplify/remove tests for the 'genre' extractor There is still a nice genre overview page (https://35photo.pro/genre/) but the individual sub-pages don't list photos anymore	2020-03-24 02:48:25 +01:00
Mike Fährmann	fb846c9ee5	[instagram] reduce line lengths and make flake8 happy	2020-03-23 22:56:43 +01:00
Mike Fährmann	ad2efa8509	[e621] derive from Danbooru extractors (#651) - use extractor implementations from 'danbooru' - use "page": "b[ID]" to paginate over results instead of "tags": "id:<[ID]", avoiding infinite loops with certain post orders - bump User-Agent version	2020-03-22 21:08:45 +01:00
Mike Fährmann	9b39e1cd7e	[e621] fix bug in API rate limiting (#651)	2020-03-22 14:01:23 +01:00
Mike Fährmann	b607d0ad7f	[twitter] fix typo in 'x-twitter-auth-type' header (#625)	2020-03-21 23:11:39 +01:00
Mike Fährmann	2c3b9e1450	[nozomi] support multiple images per post (#646) This changes the default filename format as well as archive IDs, since those assumed that each post would only have one image.	2020-03-19 21:07:31 +01:00
Mike Fährmann	c606d0c854	[instagram] update pattern for user profile URLs Allow for query parameters and fragments, for example https://www.instagram.com/instagram/?hl=en	2020-03-18 22:24:20 +01:00
Mike Fährmann	2530db3f4d	[mangadex] transform 'date' timestamps to datetime objects	2020-03-18 02:19:19 +01:00
Mike Fährmann	ae2a33243b	[newgrounds] catch general Exceptions	2020-03-18 02:17:43 +01:00
Mike Fährmann	32e36d8f02	[sexcom] replace tests	2020-03-17 22:47:45 +01:00
Mike Fährmann	33b42dc847	[nozomi] sort search results (fixes #646)	2020-03-17 22:28:23 +01:00
Mike Fährmann	eaa60a438b	[piczel] fix extraction - manually filter by folder_id - extract data for single posts from embedded JSON, since the '/api/gallery/image/<id>' endpoint is no longer available	2020-03-17 17:12:28 +01:00
Mike Fährmann	5bcc7184c9	[danbooru][e621] increase page limits	2020-03-17 15:53:28 +01:00
Mike Fährmann	90d15e3682	[instagram] use 'itertools.chain()'	2020-03-17 15:52:44 +01:00
Leonardo Taccari	160328d21c	[instagram] Add support for user's saved medias (#644) * [instagram] Gracefully handle possible 'HttpErrorPage' in _extract_page() `HttpErrorPage' is returned in shared_data at least when not authenticated or when trying to fetch other users saved medias (i.e. `instagram.com/<user>/saved/'). Gracefully handle it by returning nothing. * [instagram] Add support for user's saved medias (Please note that this need the user to be authenticated and they can only see their saved media (not other users ones).) Close #643. * [instagram] Bump copyright year	2020-03-16 21:09:14 +01:00
Mike Fährmann	d3482ace7f	[furaffinity] extract more metadata - views - favorites - comments - rating - fa_category (since 'category' is already in use) - theme - species - gender - width - height	2020-03-13 23:56:55 +01:00
Mike Fährmann	fdd2dd5136	[kabeuchi] add 'user' extractor (closes #561)	2020-03-13 16:45:42 +01:00
Mike Fährmann	59edcdc822	[hitomi] restore metadata fields from before f33b13a ... and add a 'metadata' option to disable visiting the gallery page and extracting data from it if this is not needed.	2020-03-12 23:43:41 +01:00
Mike Fährmann	2d5703c493	[twitter] use a simpler data structure to store cookies in cache Use a dict with name-value pairs instead of an entire RequestsCookieJar object.	2020-03-12 22:02:12 +01:00
Mike Fährmann	87d4f83597	[newgrounds] make post extraction nonfatal	2020-03-10 01:49:59 +01:00
Mike Fährmann	823fbeaae6	[newgrounds] add 'favorite' extractor (#394)	2020-03-10 01:07:09 +01:00
Mike Fährmann	a45fbc38ea	[pixiv] implement 'avatar' option (#595, #623)	2020-03-09 21:18:16 +01:00
Mike Fährmann	a63a376ad2	[mangoxo] fix login	2020-03-08 23:01:51 +01:00
Mike Fährmann	ebc70e87ce	[e621] update to new interface / API endpoints (closes #635)	2020-03-06 21:12:58 +01:00
Mike Fährmann	d1cf7ccdb3	[instagram] add 'post_shortcode' metadata field (#525)	2020-03-06 15:20:32 +01:00
Mike Fährmann	32df8d06fe	[twitter] add 'bookmark' extractor (closes #625)	2020-03-06 01:20:04 +01:00
Mike Fährmann	3fb41c34c8	[bcy] reduce requests to '/item/detail/<id>' (#613) The former implementation would try to use the embedded data from '/item/detail/' pages for every post, even if that wasn't really necessary. This commit also fixes some issues with posts only visible to logged in users.	2020-03-04 01:37:51 +01:00
Mike Fährmann	f33b13aacf	[hitomi] simplify metadata extraction Use the data from https://ltn.hitomi.la/galleries/<id>.js for both image URLs and metadata and ignore any gallery or reader pages. This removes 'artist', 'characters', 'group', and 'parody' metadata fields since this information is, as for now, only available in gallery pages.	2020-03-04 01:22:32 +01:00
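
Commit d02f7c1118 above describes letting 'until' be a datetime object, doing the time calculations with UTC timestamps, and setting a default 'reason'. The following is a minimal sketch of that idea only, not the actual Extractor.wait() implementation: the function names, the default reason text, the 'sleep' and 'now' parameters, and the treatment of naive datetimes as UTC are all assumptions made for illustration.

```python
import time
from datetime import datetime, timezone


def seconds_until(until, now=None):
    """Return how many seconds remain until 'until' (never negative).

    'until' may be a POSIX timestamp or a datetime object. Naive datetime
    objects are treated as UTC (an assumption of this sketch).
    """
    if isinstance(until, datetime):
        if until.tzinfo is None:
            until = until.replace(tzinfo=timezone.utc)
        until = until.timestamp()  # seconds since the epoch, UTC-based
    if now is None:
        now = time.time()
    return max(0.0, float(until) - now)


def wait(until, reason="rate limit reset", sleep=time.sleep, now=None):
    """Sleep until 'until' and return the number of seconds waited.

    The default 'reason' string is hypothetical; 'sleep' and 'now' are
    injectable so the function can be exercised without actually waiting.
    """
    seconds = seconds_until(until, now)
    if seconds > 0:
        print(f"Waiting {seconds:.0f} seconds for {reason}")
        sleep(seconds)
    return seconds
```

Converting a datetime to a UTC-based timestamp before subtracting keeps the arithmetic independent of the local time zone, which is presumably the motivation for the change described in the commit message.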


1910 Commits