2616 Commits

Author SHA1 Message Date
Mike Fährmann
a4fd620a25
[hiperdex] revert domain back to hiperdex.com 2020-04-27 20:42:31 +02:00
Mike Fährmann
233b6f93a2
[patreon] recognize URLs with creator IDs (#711)
e.g. https://www.patreon.com/user/posts?u=…
2020-04-26 22:19:10 +02:00
Mike Fährmann
38b6bd66b0
[500px] match 'web.500px.com' subdomains 2020-04-26 22:17:20 +02:00
Mike Fährmann
d3b3b30107
update test results 2020-04-26 22:14:28 +02:00
Mike Fährmann
5d7ca76885
retry Cloudflare challenges 2020-04-24 22:47:27 +02:00
Mike Fährmann
3eab07739f
[twitter] ensure videos have a 'filename'
This usually gets set when invoking the 'ytdl' downloader, but when
that fails, the error message would use 'None' as filename.
2020-04-24 22:34:19 +02:00
Mike Fährmann
c4371a6970
[twitter] add 'reply' metadata field (#705) 2020-04-24 22:31:24 +02:00
Mike Fährmann
12ff23b6cc
[mastodon] improve account searches (fixes #704)
Searching for just the username ("@NAME") can produce multiple
unrelated results, so we now search for username + mastodon instance
("@NAME@INSTANCE")
2020-04-23 20:23:10 +02:00
Mike Fährmann
400a0df661
[jaiminisbox] update decoding procedure (fixes #702) 2020-04-23 20:21:48 +02:00
Mike Fährmann
8fe858eb0e
improve parameter extraction when solving Cloudflare challenge 2020-04-22 22:08:17 +02:00
Mike Fährmann
fb98b567fa
[gelbooru] improve post ID extraction for pools 2020-04-22 21:28:18 +02:00
Mike Fährmann
d6facdee7b
[mastodon] add tests (#701) 2020-04-22 21:10:34 +02:00
Mike Fährmann
12eebb6f16
[xhamster] support xhamster.porncache.net domains (closes #700) 2020-04-22 18:31:05 +02:00
Mike Fährmann
e749402191
[mastodon] fix pagination (#701) 2020-04-22 17:58:55 +02:00
Mike Fährmann
921914141e
[imgbb] improve redirect handling 2020-04-20 23:36:57 +02:00
Mike Fährmann
6cc800aad4
[instagram] add 'post_id' and 'num' metadata fields (closes #698) 2020-04-20 22:22:29 +02:00
Mike Fährmann
a3de234e70
[hitomi] add extractor for tag searches (closes #697) 2020-04-20 21:55:19 +02:00
Mike Fährmann
456f6e8d05
[nozomi] move '_unpack()' method to global scope 2020-04-20 21:44:16 +02:00
Mike Fährmann
55ac408bdf
[hitomi] fix extraction of galleries without tags 2020-04-20 21:42:14 +02:00
Mike Fährmann
db6685eeae
[aryion] support downloading from folders (fixes #694) 2020-04-18 01:25:54 +02:00
Mike Fährmann
fa2952ac55
[furaffinity] add 'following' extractor (#515) 2020-04-17 22:18:39 +02:00
Mike Fährmann
9b194520db
[newgrounds] add 'following' extractor (closes #684) 2020-04-17 22:17:43 +02:00
Mike Fährmann
6386ee54e1
[deviantart] add extractor info to 'following' results 2020-04-16 23:20:07 +02:00
Mike Fährmann
d5273f9b0c
[hiperdex] update domain to hiperdex.net 2020-04-16 20:39:56 +02:00
Mike Fährmann
08674a91f3
[patreon] fix hash extraction from download URLs (closes #693)
The old method was assuming every URL path ends with '/1'. For URLs
where this is not the case, the segment containing the post ID was
used as file hash.
2020-04-15 23:28:57 +02:00
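The failure mode described in 08674a91f3 can be illustrated with a small sketch. The commit does not show the actual fix, so everything here is an assumption: the example URLs, the helper names `hash_by_position()` and `hash_by_pattern()`, and the idea that the file hash is a 32-digit lowercase hex string are all hypothetical. The sketch only shows why picking a segment by position breaks when the path does not end with '/1', and how matching on the segment's shape avoids that.

```python
import re
from urllib.parse import urlsplit


def hash_by_position(url):
    """Old-style approach (hypothetical): take the second-to-last path segment."""
    return url.rsplit("/", 2)[1]


def hash_by_pattern(url):
    """Return the first path segment that looks like a 32-digit hex hash.

    The 32-hex-digit format is an assumption made for this example.
    """
    for segment in urlsplit(url).path.split("/"):
        if re.fullmatch(r"[0-9a-f]{32}", segment):
            return segment
    return ""


# Hypothetical URLs: one ending in '/1', one without that trailing segment.
FILE_HASH = "0123456789abcdef0123456789abcdef"
url_with_suffix = "https://example.org/files/12345/" + FILE_HASH + "/1"
url_without_suffix = "https://example.org/files/12345/" + FILE_HASH
```

With these inputs, the positional approach returns the post ID segment ("12345") for the second URL, while the pattern-based approach returns the hash for both.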
Mike Fährmann
a31c1aae72
release version 1.13.4 2020-04-12 21:24:52 +02:00
Mike Fährmann
a6286bb551
[hiperdex] add 'artist' extractor (#606) 2020-04-12 02:32:37 +02:00
Mike Fährmann
291033720a
[hiperdex] fix manga extraction 2020-04-12 02:27:13 +02:00
Mike Fährmann
dfc0557807
[vsco] fix collection extraction 2020-04-11 23:06:29 +02:00
Mike Fährmann
fd438f0d78
update extractor test results 2020-04-11 23:00:42 +02:00
Mike Fährmann
bae1e8ed12
[deviantart] fix JPEG quality replacement pattern
'q_\d+' would sometimes also replace something in the 'token' query
parameter, invalidating the URL.
2020-04-11 02:37:06 +02:00
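A minimal sketch of the problem described in bae1e8ed12, under stated assumptions: the URL below is made up, `set_jpeg_quality()` is a hypothetical helper, and restricting the substitution to the part of the URL before '?' is just one possible way to keep the 'token' query parameter intact. The commit itself does not say which replacement pattern was adopted.

```python
import re


def set_jpeg_quality(url, quality=100):
    """Replace the 'q_<N>' quality marker in the URL path only.

    The query string (everything after '?') is left untouched so that a
    'token' parameter containing 'q_<digits>' by chance is not corrupted.
    """
    path, sep, query = url.partition("?")
    path = re.sub(r"\bq_\d+", "q_{}".format(quality), path)
    return path + sep + query


# Hypothetical URL whose token happens to contain 'q_12'.
url = ("https://example.org/v1/fill/w_800,h_600,q_75,strp/image.jpg"
       "?token=abcq_12xyz")

# Applying the bare pattern to the whole URL also rewrites the token.
naive = re.sub(r"q_\d+", "q_100", url)
fixed = set_jpeg_quality(url)
```

Here `naive` ends in `token=abcq_100xyz`, which no longer matches the signed token, while `fixed` changes only the `q_75` in the path.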
Mike Fährmann
cf4cef3d63
[aryion] adjust 'date' to UTC time 2020-04-11 02:08:05 +02:00
Mike Fährmann
a0f4c295c0
add optional 'utcoffset' argument to 'parse_datetime()' 2020-04-11 02:05:00 +02:00
Mike Fährmann
6c531be294
[aryion] fix malformed 'last-modified' headers (#390) 2020-04-10 23:08:52 +02:00
Mike Fährmann
38bc6430d3
[downloader:http] don't overwrite existing '_mtime' fields 2020-04-10 23:08:03 +02:00
Mike Fährmann
dc65f7d8dc
[aryion] use generic download URLs (#390)
i.e. /g4/data.php?id=…

- get filename & extension from Content-Disposition header
- handle all downloadable file types (docx, swf, etc.)
2020-04-10 22:08:45 +02:00
Mike Fährmann
96b78bcf04
[aryion] include path in default directory format (#390) 2020-04-10 21:58:46 +02:00
Mike Fährmann
406449b0d6
ensure keys for mastodon instances are available during tests
Calls to config.clear() from other tests are removing the API
credentials set when importing mastodon.py for the first time.
2020-04-08 21:56:14 +02:00
Mike Fährmann
300264f676
read config files from PyInstaller exe directory (closes #682) 2020-04-08 21:53:50 +02:00
Mike Fährmann
6143050980
[aryion] add gallery and post extractors (#390, #673) 2020-04-08 21:52:51 +02:00
Mike Fährmann
9e7dfc0cfc
[myportfolio] fix extraction of galleries without title 2020-04-08 21:08:05 +02:00
Mike Fährmann
3b50c4f49d
add tests for "Extractors" in oauth.py (#670) 2020-04-07 20:26:12 +02:00
Mike Fährmann
04bd0472de
add tests for Extractor.wait() 2020-04-07 20:24:56 +02:00
Mike Fährmann
88fca0a172
[mastodon] update OAuth credentials for pawoo.net (#665) 2020-04-06 00:50:30 +02:00
Mike Fährmann
4ae8a25567
[mastodon] use 'combine_dict()' to combine extractor info dicts 2020-04-05 21:45:00 +02:00
Mike Fährmann
220c06b86e
[mastodon] handle rate limits 2020-04-05 21:44:00 +02:00
Mike Fährmann
d02f7c1118
improve Extractor.wait()
- allow 'until' to be a datetime object
- do "time calculations" with UTC timestamps
- set a default 'reason'
2020-04-05 21:23:05 +02:00
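The three points listed in d02f7c1118 can be sketched as standalone functions. This is not the project's `Extractor.wait()` implementation: the names `seconds_until()` and `wait()`, their signatures, the default reason string, and the choice to treat naive datetime objects as UTC are assumptions made for this example.

```python
import time
from datetime import datetime, timezone


def seconds_until(until, now=None):
    """Return the non-negative number of seconds from 'now' to 'until'.

    'until' may be a datetime object or a numeric UTC timestamp.
    Naive datetime objects are interpreted as UTC (an assumption).
    """
    if isinstance(until, datetime):
        if until.tzinfo is None:
            until = until.replace(tzinfo=timezone.utc)
        until = until.timestamp()
    if now is None:
        now = time.time()
    return max(0.0, float(until) - now)


def wait(until, reason="rate limit reset", sleep=time.sleep):
    """Sleep until 'until' and return the number of seconds waited."""
    seconds = seconds_until(until)
    if seconds > 0:
        print("Waiting {:.0f}s for {}".format(seconds, reason))
        sleep(seconds)
    return seconds
```

Converting everything to UTC timestamps before subtracting keeps the calculation independent of the local time zone, and passing `sleep` as a parameter makes the function easy to exercise in tests without actually blocking.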
Mike Fährmann
5d7404ab58
[oauth] use the new name for 'DeviantartAPI' (fixes #670) 2020-04-04 20:34:47 +02:00
Mike Fährmann
762c758af4
[hiperdex] fix extraction 2020-04-03 21:25:25 +02:00
Mike Fährmann
f9a590f92b
[deviantart] apply HTTP request limits in more places
"Request blocked" can also happen on sta.sh and for *any* HTTP
request directed at deviantart.com
2020-04-03 21:21:59 +02:00