Mike Fährmann
43ab9572b4
[twitter] handle API rate limits ( #526 )
2020-01-04 23:46:29 +01:00
Mike Fährmann
569747a78d
implement extractor.wait()
2020-01-04 23:42:07 +01:00
Mike Fährmann
5532e9c158
[twitter] handle quoted tweets ( #526 )
… and categorize them as retweets
2020-01-04 21:26:55 +01:00
Mike Fährmann
0b4cb8e57a
[mangahere] send 'isAdult' cookie ( fixes #556 )
2020-01-04 21:25:35 +01:00
Mike Fährmann
025f6e3398
add fallback for missing WITHOUT ROWID support ( #553 )
2020-01-03 22:58:28 +01:00
Mike Fährmann
87c8b89ddd
[postprocessor:metadata] add 'directory' option ( #520 )
2020-01-02 20:58:10 +01:00
Mike Fährmann
1afb91363c
[imagefap] generalize URL patterns and add tests ( #552 )
2020-01-02 14:26:18 +01:00
Xope Totec
f701e9f33a
Handle beta.imagefap.com URLs ( #552 )
2020-01-02 14:22:00 +01:00
Mike Fährmann
ce54b8c04c
let extractors opt-out of cookie option usage
useful to avoid sending unnecessary cookies when all authentication
is done through OAuth tokens
2020-01-01 21:12:37 +01:00
Mike Fährmann
5ad92fc196
[newgrounds] fix tags metadata extraction
2020-01-01 16:06:58 +01:00
Mike Fährmann
82f7f4172a
update test results
2020-01-01 16:05:38 +01:00
Mike Fährmann
1f2a69f3c5
add '_extractor' information to redirect results
2019-12-29 23:37:34 +01:00
Mike Fährmann
2d4887b75b
improve KeywordJob output for "parent" extractors ( closes #548 )
2019-12-28 22:26:49 +01:00
Mike Fährmann
a27f43dad1
[pixiv] wait and retry after rate limit error ( closes #535 )
2019-12-28 22:06:58 +01:00
Mike Fährmann
6b373cb7e2
[exhentai] restrict default directory name length ( #545 )
2019-12-27 21:29:00 +01:00
Mike Fährmann
b347bf68c7
[deviantart] add extractor for followed users ( #515 )
2019-12-27 21:27:39 +01:00
Mike Fährmann
c0f391a4e2
[pixiv] support listing followed users ( #515 )
2019-12-26 23:42:42 +01:00
Mike Fährmann
2e2fc7f0ad
prevent infinite recursion when spawning extractors ( closes #489 )
2019-12-26 23:38:16 +01:00
Mike Fährmann
896896a490
[twitter] fix URLs forwarded to youtube-dl ( closes #540 )
Since commit 3bba763 data["user"] is an entire dict object
and no longer just the user nickname …
2019-12-25 17:28:55 +01:00
Mike Fährmann
1e2713b895
[artstation] fix search result pagination ( closes #537 )
2019-12-25 17:26:37 +01:00
Mike Fährmann
bf3df3d0b0
[directlink] send Referer headers ( closes #536 )
2019-12-25 17:17:07 +01:00
Mike Fährmann
83909ab5d4
release version 1.12.1
2019-12-22 22:26:33 +01:00
Mike Fährmann
9be7ff600e
[imagetwist] replace test image
the old one expired, it seems
2019-12-22 22:13:13 +01:00
Mike Fährmann
66905b1664
[foolslide] add fallback for chapter data extraction
2019-12-21 20:44:03 +01:00
Mike Fährmann
48e42e73fb
[reddit] change default value for 'comments' to '0'
2019-12-20 16:54:59 +01:00
Mike Fährmann
9c0928457a
[reddit] fix errors with 't1_…' submissions
2019-12-20 16:49:44 +01:00
Mike Fährmann
58391d492d
cache archive keys generated in __contains__() ( #524 )
To avoid writing a different key to the archive than what was checked
against before the file download.
2019-12-20 16:43:08 +01:00
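A minimal sketch of the idea behind commit 58391d492d, under stated assumptions: the class name `KeyCachingArchive`, the `keygen` callable, and the `"_archive_key"` dictionary entry are hypothetical names chosen for illustration, not taken from the project's code. The key computed during the membership check is stored and reused when the entry is written, so later changes to the metadata cannot produce a different key.

```python
class KeyCachingArchive:
    """Illustrative archive that remembers the key used in the 'in' check."""

    def __init__(self, keygen):
        self.keygen = keygen  # hypothetical: builds a key string from metadata
        self.keys = set()     # stands in for the on-disk archive

    def __contains__(self, kwdict):
        # Compute the key once and cache it on the metadata dict itself.
        key = kwdict["_archive_key"] = self.keygen(kwdict)
        return key in self.keys

    def add(self, kwdict):
        # Reuse the cached key so the written key matches the checked one.
        key = kwdict.get("_archive_key") or self.keygen(kwdict)
        self.keys.add(key)
```

In this sketch, a download job would test `kwdict in archive` before the download and call `archive.add(kwdict)` afterwards; both steps then refer to the same key even if the metadata was modified in between.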
Mike Fährmann
bf658fd84b
[vsco] implement 'videos' option
2019-12-19 18:34:19 +01:00
Mike Fährmann
95c90722ee
[instagram] implement 'videos' option ( closes #521 )
2019-12-19 18:34:06 +01:00
Mike Fährmann
1921c127a5
make OSErrors during file downloads nonfatal ( closes #512 )
… except ENOSPC (No space left on device), since there is no reason to
continue downloading in that case.
All other errors that would prevent downloading data and writing it to
disk are already raised during directory creation and are therefore not
checked here.
2019-12-19 18:34:05 +01:00
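The behavior described in commit 1921c127a5 can be sketched roughly as follows. This is an illustration only: the function name `write_file` and its `opener` parameter are assumptions made for the example and do not reflect the project's actual download code. An `OSError` during writing is treated as nonfatal unless its error code is `ENOSPC`.

```python
import errno


def write_file(path, chunks, opener=open):
    """Write chunks to path.

    Returns True on success and False on a nonfatal OSError.
    Re-raises the error if the device has no space left.
    """
    try:
        with opener(path, "wb") as fp:
            for chunk in chunks:
                fp.write(chunk)
    except OSError as exc:
        if exc.errno == errno.ENOSPC:
            # No point in continuing with further downloads.
            raise
        return False  # nonfatal: the caller can move on to the next file
    return True
```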
Mike Fährmann
d0920e84e9
update test results
2019-12-19 18:33:48 +01:00
Mike Fährmann
8c11e81c9f
Merge commit '63e6993716db8d8bedfb7b0d445c7161493046b6'
2019-12-16 17:25:23 +01:00
Mike Fährmann
63e6993716
merge 'bypost' functionality into metadata postprocessor
2019-12-16 17:19:23 +01:00
Mike Fährmann
31a29835ff
[realbooru] simplify extractors and update tests ( #514 )
2019-12-16 16:06:50 +01:00
The Oddball
9a4ce20b8e
[realbooru] Add Realbooru extractor ( #514 )
2019-12-16 15:49:30 +01:00
Mike Fährmann
f9e74320de
retain trailing zeroes in Cloudflare challenge answers
2019-12-16 15:32:49 +01:00
Mike Fährmann
72b8fbfbad
[instagram] make post-page extraction nonfatal
2019-12-15 22:56:27 +01:00
Mike Fährmann
922b8a9595
[weibo] raise NotFoundError for unavailable/deleted statuses
2019-12-14 22:10:02 +01:00
Mike Fährmann
0cd157300e
[patreon] fix regex pattern for posts
The previous one would match the first number in the URL slug as
post ID, which would fail for posts with numbers in their title.
2019-12-14 22:08:49 +01:00
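To illustrate the kind of problem commit 0cd157300e describes, the sketch below contrasts two hypothetical patterns. Neither is the project's actual regex, and the example URL is made up; the point is only that a pattern taking the first number in the slug breaks when the post title itself contains digits, while one anchored to the trailing number does not.

```python
import re

# Hypothetical "before" pattern: captures the first run of digits in the slug.
FIRST_NUMBER = re.compile(r"/posts/[^/?#]*?(\d+)")

# Hypothetical "after" pattern: captures only the number at the end of the slug.
TRAILING_NUMBER = re.compile(r"/posts/(?:[^/?#]*-)?(\d+)(?:[/?#]|$)")


def post_id(url, pattern):
    """Return the captured post ID as a string, or None if nothing matches."""
    match = pattern.search(url)
    return match.group(1) if match else None


url = "https://www.patreon.com/posts/top-10-of-2019-32045123"  # made-up URL
wrong = post_id(url, FIRST_NUMBER)     # picks up the "10" from the title
right = post_id(url, TRAILING_NUMBER)  # picks up the trailing ID
```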
Mike Fährmann
fe19e233f3
[xvideos] improve
- derive from GalleryExtractor
- match '…-channels' URLs
- "better" metadata structure
2019-12-13 19:03:56 +01:00
Mike Fährmann
d3e44e899d
raise NotFoundErrors for 404 responses in GalleryExtractors
2019-12-13 18:42:04 +01:00
Mike Fährmann
a4dd8b3dab
improve _check_cookies()
Only loop over all cookies once instead of calling
cookiejar._find() for each cookie name.
2019-12-13 15:51:20 +01:00
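The single-pass approach mentioned in commit a4dd8b3dab might look roughly like this. It is a sketch under assumptions: the function name `check_cookies` and its parameters are chosen for the example, and the cookie objects are only assumed to expose `name` and `domain` attributes, as the entries of an `http.cookiejar.CookieJar` do.

```python
def check_cookies(cookies, names, domain):
    """Return True if every cookie name in 'names' exists for 'domain'.

    Iterates over 'cookies' a single time instead of performing one
    lookup per cookie name.
    """
    missing = set(names)
    for cookie in cookies:
        if cookie.domain == domain:
            missing.discard(cookie.name)
            if not missing:
                return True  # all required cookies found, stop early
    return not missing
```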
Mike Fährmann
76e60d10a6
[patreon] raise proper exception if creator/post doesn't exist
2019-12-12 01:14:32 +01:00
Mike Fährmann
9e63804347
[patreon] make retrieving user info nonfatal ( #508 )
… and fall back to the included data if an error occurs.
2019-12-12 00:31:35 +01:00
Mike Fährmann
964dc57286
[vsco] improve image resolutions
https://im.vsco.co/ URLs redirect to the appropriate CDN server
and occasionally insert a '/1200x1600/' into the image path,
limiting image dimensions.
This commit constructs redirect targets out of the given
im.vsco.co URLs without sending extra HTTP requests
and without any "builtin" resolution restrictions.
2019-12-12 00:27:45 +01:00
Mike Fährmann
0629fe8fa4
[vsco] fix user profile extraction … again
Given the pattern from last time, collections will also change
in due time and use cursor-based pagination.
2019-12-11 23:57:56 +01:00
Mike Fährmann
ab17ea9632
[deviantart] only print warning if 'original' is enabled
2019-12-11 21:58:43 +01:00
Mike Fährmann
2188db6284
[gelbooru] fix non-API tag extraction
2019-12-10 21:31:55 +01:00
Mike Fährmann
c4702ec9b6
simplify some logging calls
2019-12-10 21:30:08 +01:00
Gio
c0b9ad678d
Separate metadata from handle_url into handle_metadata, commenting
2019-12-09 16:02:15 -06:00