Mike Fährmann
193dca2ce1
update extractor test results
2021-01-21 21:35:42 +01:00
Mike Fährmann
89ea1384fc
[unsplash] fix typo
2021-01-20 22:51:02 +01:00
Mike Fährmann
e5e591b848
[vipr] simplify and add test (#1258)
2021-01-20 21:40:04 +01:00
v-delta
e707e060cb
[vipr] add image extractor (#1258)
...
* [vipr] add image extractor
Adds support for images hosted on https://vipr.im
* Fix codestyle issues
2021-01-20 21:38:43 +01:00
Mike Fährmann
95e5911895
[twitter] match '/i/user/ID' URLs
2021-01-20 00:33:57 +01:00
Mike Fährmann
069b113cbf
[twitter] improve and fix retry after hitting rate limit
...
- replace recursive call with infinite loop
- fix function arguments for recursive call
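
The change from a recursive retry to a loop can be sketched as follows. This is a minimal illustration and not the extractor's actual code: `RateLimited`, `call_with_retry`, `send`, and the `sleep` parameter are hypothetical names introduced here.

```python
import time


class RateLimited(Exception):
    """Hypothetical error that carries the number of seconds to wait."""

    def __init__(self, wait):
        super().__init__("rate limit reached")
        self.wait = wait


def call_with_retry(send, endpoint, params, sleep=time.sleep):
    """Call send(endpoint, params) until it stops raising RateLimited."""
    while True:
        try:
            return send(endpoint, params)
        except RateLimited as exc:
            # Wait, then repeat the loop with the same arguments
            # instead of calling this function again.
            sleep(exc.wait)
```

A loop keeps the call stack flat however often the limit is hit, and the arguments cannot be passed on incorrectly because they are never passed on at all.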
2021-01-19 23:50:07 +01:00
Mike Fährmann
89a2bcbb2d
[furaffinity] add 'descriptions' option (#1231)
2021-01-19 19:09:29 +01:00
Mike Fährmann
36f281330a
[newgrounds] fix flash file extraction (closes #1257)
...
… and add a 'flash' option to choose between flash and video formats.
2021-01-19 17:48:14 +01:00
Mike Fährmann
534194bf92
[unsplash] add extractors (#1197)
...
for
- single photos (/photos/ID)
- user profiles (/@USER)
- user likes (/@USER/likes)
- search results (/s/photos/SEARCH)
2021-01-19 02:23:39 +01:00
Mike Fährmann
1fc16cb8c5
[instagram] fix regex for '/saved' URLs (fixes #1251)
...
The URL pattern erroneously had two '([^/?#]+)' capture groups,
which would split any username into 'usernam' for the first group
and 'e' for the ignored second group.
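
The effect is easy to reproduce with Python's `re` module. The two patterns below are simplified stand-ins; only the `([^/?#]+)` group is taken from the message above, the rest is assumed for illustration.

```python
import re

# Hypothetical, shortened patterns (not the extractor's real ones).
BROKEN = re.compile(r"/([^/?#]+)([^/?#]+)/saved")
FIXED = re.compile(r"/([^/?#]+)/saved")

url = "/username/saved"

# The first greedy group has to give back one character
# so that the second group can match at all.
print(BROKEN.search(url).groups())  # ('usernam', 'e')
print(FIXED.search(url).group(1))   # username
```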
2021-01-17 23:34:45 +01:00
Mike Fährmann
c008cb5100
[pixiv] add 'related' option (#1237)
2021-01-17 22:48:32 +01:00
Mike Fährmann
e9a75e27d9
[foolfuuka] stop search when results are exhausted (#1174)
2021-01-17 22:48:21 +01:00
Mike Fährmann
b0cf968115
[mangadex] update API URLs
...
https://mangadex.org/thread/351011/9/#post_4238014
2021-01-15 21:11:58 +01:00
Mike Fährmann
a6414c31d6
[kemonoparty] simplify (#1216)
...
Use metadata from API responses as is and
don't try to detect duplicates by their original filename.
2021-01-15 01:13:59 +01:00
Mike Fährmann
01b9ccd4de
[derpibooru] use the "Everything" filter by default (#1243)
...
when neither 'api-key' nor 'filter' are set
2021-01-14 18:29:09 +01:00
Mike Fährmann
91308140ec
make 'generate_token()' compatible with Python 3.4
2021-01-14 03:48:10 +01:00
Mike Fährmann
1fdecfa269
[kemonoparty] use API endpoints (#1216)
2021-01-14 03:23:21 +01:00
Mike Fährmann
318876e4dd
[nozomi] add 'num' enumeration index (closes #1239)
2021-01-12 22:32:52 +01:00
Mike Fährmann
2da9068ea8
[sankaku] simplify login process
2021-01-12 00:15:22 +01:00
Mike Fährmann
e07dfc4fe5
[kemonoparty] add 'user' and 'post' extractors (#1216)
2021-01-11 22:17:08 +01:00
Mike Fährmann
780b6adb91
rename 'generate_csrf_token()' to just 'generate_token()'
...
and add a 'size' argument
2021-01-11 22:12:40 +01:00
Mike Fährmann
f277e48c77
release version 1.16.3
2021-01-10 00:31:52 +01:00
Mike Fährmann
79501a356f
fix crash when 'path-restrict' is an object/dict
...
This basically reverts commit 5818c928
(#1234)
2021-01-10 00:13:48 +01:00
Mike Fährmann
0fdaea00a3
[postprocessor:metadata] sanitize filenames
2021-01-10 00:13:20 +01:00
Mike Fährmann
32fcc61b84
release version 1.16.2
2021-01-09 18:24:51 +01:00
Mike Fährmann
02bc59d75c
[hentainexus] fix extraction (fixes #1234)
...
hentainexus is now hosting its images on WordPress, or at least it is
using WordPress' servers as a cache:
https://i2.wp.com/images.hentainexus.com/gallery/2199754b23c191deb330c99c9bb43341/9576/002.png?filter=null
2021-01-09 15:12:05 +01:00
Mike Fährmann
5d4494b15f
add "ascii" as a special 'path-restrict' value
2021-01-09 02:41:20 +01:00
Mike Fährmann
5818c928c4
refactor 'path-restrict' parsing
2021-01-09 02:33:42 +01:00
Mike Fährmann
aac00a2024
add 'd' conversion for format strings
...
to convert a timestamp to a formattable 'datetime' object.
For example '{created_at!d:%Y-%m-%d}'
transforms the timestamp in 'created_at' into a 'datetime' object
and then formats its content using '%Y-%m-%d' as template.
1262304000 -> datetime(2010, 1, 1) -> "2010-01-01"
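
The same two steps in plain Python, as a minimal sketch: `convert_d` is a hypothetical helper name, and interpreting the timestamp as UTC is an assumption that matches the example values above.

```python
from datetime import datetime, timezone


def convert_d(timestamp):
    """Turn a Unix timestamp into a naive 'datetime' object (UTC assumed)."""
    aware = datetime.fromtimestamp(timestamp, tz=timezone.utc)
    return aware.replace(tzinfo=None)


created_at = 1262304000
dt = convert_d(created_at)

# 'datetime' objects accept strftime-style templates as format spec.
print("{:%Y-%m-%d}".format(dt))  # 2010-01-01
```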
2021-01-09 01:58:44 +01:00
Mike Fährmann
20bd9cd296
[wikiart] add extractor for single paintings (closes #1233)
...
There is no API endpoint for single paintings from what I can tell,
so this uses the site's search.
2021-01-08 23:19:00 +01:00
Mike Fährmann
e2d4ca4955
[deviantart] improve '--range' for favorites (closes #1226)
2021-01-08 22:57:35 +01:00
Mike Fährmann
56ccb9951a
[gfycat] add 'date' metadata field (#1138)
2021-01-08 17:45:09 +01:00
Mike Fährmann
f2b83b8578
[gfycat] convert IDs to lowercase
...
Redgifs expects all IDs and names to be lowercase
and throws a 404 if an ID contains an uppercase letter.
Gfycat on the other hand doesn't care about case,
so it's fine to just convert all IDs.
(#1138)
2021-01-08 17:41:45 +01:00
Mike Fährmann
b3bc646236
[redgifs] match embedded URLs
...
https://redgifs.com/ifr/<ID>
2021-01-08 16:01:01 +01:00
Mike Fährmann
98e0d21383
[instagram] categorize single highlight URLs as 'highlights'
...
They were categorized as 'stories' before.
(fixes #1222)
2021-01-08 15:56:27 +01:00
Mike Fährmann
1c9435e0df
add '-G' command-line option (#1217)
...
A "stronger" version of '-g', resolving all intermediate URLs.
2021-01-07 19:07:05 +01:00
Mike Fährmann
fa8ee6eac4
[derpibooru] add search and gallery extractors (#862)
2021-01-07 18:05:32 +01:00
Mike Fährmann
3759d0cb42
[redgifs] fix search results
...
The metadata for Redgifs search results got stripped down to a bare
minimum and no longer includes download URLs. (Clicking on search
results on the website itself is broken as well.)
As a workaround, we make an extra call to '/v1/gfycats/<ID>'
for each search result entry to fetch the missing data.
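
A rough sketch of that workaround, under stated assumptions: `fill_missing`, `fetch_details`, and the `"id"` key are hypothetical, and the shape of the API response is not taken from the source. Only the '/v1/gfycats/<ID>' path comes from the message above.

```python
def fill_missing(results, fetch_details, id_key="id"):
    """Yield each search result merged with its separately fetched details.

    'fetch_details' is any callable that takes an API path and returns
    a dict; 'id_key' names the (assumed) field holding the entry's ID.
    """
    for entry in results:
        details = fetch_details("/v1/gfycats/" + entry[id_key])
        merged = dict(entry)    # keep whatever the search result had
        merged.update(details)  # add or overwrite with the full data
        yield merged
```

This costs one additional request per search result, which is the trade-off the message describes.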
2021-01-06 18:16:06 +01:00
Mike Fährmann
8a88025dc4
[pinterest] support generic user URLs (#1205)
...
i.e. https://www.pinterest.com/USERNAME
also renames 'BoardsExtractor' to 'UserExtractor'
2021-01-02 02:36:53 +01:00
Mike Fährmann
56b460dcea
[foolfuuka] add 'search' extractors (#1174)
2021-01-02 02:34:06 +01:00
Mike Fährmann
fb64183d53
[foolfuuka] add 'board' extractors (closes #1044)
2021-01-01 19:33:35 +01:00
Mike Fährmann
0594821fcd
[downloader:http] add MIME type and signature for .ico files
...
(closes #1211)
2021-01-01 16:07:33 +01:00
Mike Fährmann
b0beed7a06
[sankaku] add support for book searches (closes #1204)
2020-12-29 17:36:37 +01:00
Mike Fährmann
6cdbab07b5
[pinterest] add support for getting all boards of a user
...
(#1205)
2020-12-29 16:57:03 +01:00
Mike Fährmann
25074aec47
[twitter] fetch media from pinned tweets (#1203)
2020-12-29 16:27:43 +01:00
Mike Fährmann
2475176d99
[twitter] fetch tweets from 'homeConversation' entries
...
When logged in, some entries returned by Twitter's API are so-called
'homeConversation's (they would be regular tweet entries otherwise).
Those weren't picked up before and resulted in missing files compared
to accessing a timeline as guest.
('/media' timelines and search results were not affected)
2020-12-29 00:42:46 +01:00
Mike Fährmann
3af9350648
[twitter] update API calls
...
- use 'https://twitter.com/i/api' for all requests
except '/guest/activate.json'
- update (default) URL parameters
- update GraphQL endpoints
2020-12-28 22:05:48 +01:00
Mike Fährmann
b656b829db
[twitter] fix login with username & password
...
It is no longer possible to get an 'authenticity_token' from Twitter's
JavaScript-free login form, which got disabled a few days ago.
Generating a random 16 byte hex string client-side and sending that as
a cookie alongside the regular login form works just as well.
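
Generating such a value could look like the sketch below. `random_hex_token` is a hypothetical name, and reading "16 byte" as 16 random bytes (32 hex characters) is an assumption; how the value is attached to the login request is not shown.

```python
import binascii
import os


def random_hex_token(size=16):
    """Return 'size' random bytes encoded as a lowercase hex string."""
    # binascii.hexlify() is used because bytes.hex() needs Python 3.5+.
    return binascii.hexlify(os.urandom(size)).decode()


token = random_hex_token()
print(len(token))  # 32
```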
2020-12-28 16:10:19 +01:00
Mike Fährmann
d1903589a5
release version 1.16.1
2020-12-27 18:28:33 +01:00
Mike Fährmann
912eea29bc
update extractor test results
2020-12-27 17:41:08 +01:00