2681 Commits

Author SHA1 Message Date
Mike Fährmann
fa33f13453
[mastodon] update
- inherit from BaseExtractor
- remove custom generate_extractors() and config()
- improve layout of MastodonAPI internals
2021-01-27 23:49:01 +01:00
Mike Fährmann
231bcad614
[shopify] use BaseExtractor 2021-01-27 23:45:22 +01:00
Mike Fährmann
2de8ebc098
[moebooru] use BaseExtractor 2021-01-27 01:33:40 +01:00
Mike Fährmann
0978c1e184
[booru] use BaseExtractor 2021-01-27 01:33:01 +01:00
Mike Fährmann
c6cc86d7d0
[deviantart] update parameters for '/browse/popular'
- limit results to 50 when also querying metadata (fixes #1267)
- remove deprecated 'category_path' parameter
2021-01-26 23:35:32 +01:00
Mike Fährmann
993856b866
[foolslide] use BaseExtractor 2021-01-26 18:50:32 +01:00
Mike Fährmann
671a95cae5
[foolfuuka] use BaseExtractor 2021-01-26 18:48:37 +01:00
Mike Fährmann
745a114c61
[common] implement BaseExtractor class
Should be used when the same extractor logic applies to different
instances/domains of several sites, e.g. FoolFuuka, Shopify, etc.

This will replace the functionality of 'generate_extractors()' in
a more efficient way, by condensing everything into 1 class and not
dynamically generating an extractor class for each instance.
2021-01-26 03:48:02 +01:00
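
The idea described in the commit message above (one class serving many instances, instead of one generated class per instance) could look roughly like the sketch below. It is an illustration only, not the project's actual BaseExtractor code: the instance table, the class name MultiInstanceExtractor, and the helper build_pattern() are all hypothetical.

```python
import re

# Hypothetical instance table: category name -> root URL (not real sites).
INSTANCES = {
    "siteA": "https://a.example.org",
    "siteB": "https://b.example.net",
}


def build_pattern(instances):
    """Build one regex with a capture group per instance root."""
    hosts = "|".join(
        "(" + re.escape(root.partition("://")[2]) + ")"
        for root in instances.values()
    )
    return re.compile(r"(?:https?://)?(?:" + hosts + r")(/.*)?$")


class MultiInstanceExtractor:
    """A single class for every instance; the instance is picked per URL."""

    pattern = build_pattern(INSTANCES)

    def __init__(self, match):
        groups = match.groups()
        # Exactly one of the per-instance groups is non-None for a match.
        for index, (category, root) in enumerate(INSTANCES.items()):
            if groups[index] is not None:
                self.category = category
                self.root = root
                break
        # The last group holds the path after the host, if any.
        self.path = groups[-1] or "/"

    @classmethod
    def from_url(cls, url):
        match = cls.pattern.match(url)
        return cls(match) if match else None
```

Under these assumptions, MultiInstanceExtractor.from_url("https://b.example.net/board/1") yields an object whose category and root are taken from the table entry that matched, so adding an instance means adding a table row rather than generating another class.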
Mike Fährmann
b549c53b36
add long option for '-G' 2021-01-26 01:01:22 +01:00
Mike Fährmann
c26de0929d
[deviantart] provide 'extension' for original file downloads
(#1272)
2021-01-25 19:11:00 +01:00
Mike Fährmann
24e8e398e0
[twitter] skip login if 'auth_token' cookie is present 2021-01-25 15:03:59 +01:00
Mike Fährmann
cdb0b02e30
[pillowfort] add 'reblogs' option (#846) 2021-01-25 15:03:59 +01:00
Mike Fährmann
7ca3bf7cb0
[pillowfort] add 'user' and 'post' extractors (#846) 2021-01-25 15:03:22 +01:00
Mike Fährmann
ebf417f31f
remove support for deprecated options
- instagram.highlights
- metadata.bypost
- exec.final
2021-01-24 22:30:16 +01:00
Mike Fährmann
477ed010c1
release version 1.16.4 2021-01-23 17:34:59 +01:00
Mike Fährmann
1d13e48512
[unsplash] implement 'skip()' 2021-01-23 17:24:18 +01:00
Mike Fährmann
6cdbfb79e9
[photovogue] update (#1253) 2021-01-22 19:38:10 +01:00
Federico Ravasio
25297815bc
[photovogue] added portfolio extractor (#1253) 2021-01-22 19:36:13 +01:00
Mike Fährmann
0265fbda61
[mangakakalot] fix extraction 2021-01-22 17:24:18 +01:00
Mike Fährmann
7a096c443f
[unsplash] add 'format' option (#1197) 2021-01-22 17:23:46 +01:00
Mike Fährmann
3188ac16d1
[unsplash] add 'collection' extractor (#1197) 2021-01-21 22:30:01 +01:00
Mike Fährmann
247cc73446
[derpibooru] update 'date' parsing 2021-01-21 21:51:04 +01:00
Mike Fährmann
193dca2ce1
update extractor test results 2021-01-21 21:35:42 +01:00
Mike Fährmann
89ea1384fc
[unsplash] fix typo 2021-01-20 22:51:02 +01:00
Mike Fährmann
e5e591b848
[vipr] simplify and add test (#1258) 2021-01-20 21:40:04 +01:00
v-delta
e707e060cb
[vipr] add image extractor (#1258)
* [vipr] add image extractor

Adds support for images hosted on https://vipr.im

* Fix codestyle issues
2021-01-20 21:38:43 +01:00
Mike Fährmann
95e5911895
[twitter] match '/i/user/ID' URLs 2021-01-20 00:33:57 +01:00
Mike Fährmann
069b113cbf
[twitter] improve and fix retry after hitting rate limit
- replace recursive call with infinite loop
- fix function arguments for recursive call
2021-01-19 23:50:07 +01:00
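
A minimal sketch of the change described above, replacing a recursive retry with a loop. The names RateLimited and call_with_retry are hypothetical and do not come from the twitter extractor. The loop forwards the same arguments on every attempt, which is the kind of mistake a hand-written recursive call can make.

```python
import time


class RateLimited(Exception):
    """Hypothetical signal that the caller must wait before retrying."""

    def __init__(self, retry_after):
        super().__init__(retry_after)
        self.retry_after = retry_after


def call_with_retry(request, *args, sleep=time.sleep, **kwargs):
    """Call request(*args, **kwargs) until it stops raising RateLimited.

    A loop keeps the stack depth constant no matter how many retries are
    needed, and every attempt receives exactly the same arguments.
    """
    while True:
        try:
            return request(*args, **kwargs)
        except RateLimited as exc:
            sleep(exc.retry_after)
```

The sleep parameter is injectable only so the behaviour can be exercised without actually waiting.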
Mike Fährmann
89a2bcbb2d
[furaffinity] add 'descriptions' option (#1231) 2021-01-19 19:09:29 +01:00
Mike Fährmann
36f281330a
[newgrounds] fix flash file extraction (closes #1257)
… and add a 'flash' option to choose between flash and video formats.
2021-01-19 17:48:14 +01:00
Mike Fährmann
534194bf92
[unsplash] add extractors (#1197)
for
- single photos  (/photos/ID)
- user profiles  (/@USER)
- user likes     (/@USER/likes)
- search results (/s/photos/SEARCH)
2021-01-19 02:23:39 +01:00
Mike Fährmann
1fc16cb8c5
[instagram] fix regex for '/saved' URLs (fixes #1251)
The URL pattern erroneously had two '([^/?#]+)' capture groups,
which would split any username into 'usernam' for the first group
and 'e' for the ignored second group.
2021-01-17 23:34:45 +01:00
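
The splitting behaviour described above can be reproduced with Python's re module. The patterns below are simplified stand-ins, not the extractor's real URL pattern; only the two adjacent '([^/?#]+)' groups are taken from the commit message.

```python
import re

# Simplified stand-in for the faulty pattern: two adjacent capture groups.
BROKEN = re.compile(r"/([^/?#]+)([^/?#]+)/saved")
# Simplified stand-in for the fix: a single capture group.
FIXED = re.compile(r"/([^/?#]+)/saved")

url_path = "/username/saved"

# The first group is greedy, but the second group needs at least one
# character, so the regex engine backtracks and hands it the final letter.
broken_groups = BROKEN.match(url_path).groups()
fixed_groups = FIXED.match(url_path).groups()

print(broken_groups)  # ('usernam', 'e')
print(fixed_groups)   # ('username',)
```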
Mike Fährmann
c008cb5100
[pixiv] add 'related' option (#1237) 2021-01-17 22:48:32 +01:00
Mike Fährmann
e9a75e27d9
[foolfuuka] stop search when results are exhausted (#1174) 2021-01-17 22:48:21 +01:00
Mike Fährmann
b0cf968115
[mangadex] update API URLs
https://mangadex.org/thread/351011/9/#post_4238014
2021-01-15 21:11:58 +01:00
Mike Fährmann
a6414c31d6
[kemonoparty] simplify (#1216)
Use metadata from API responses as is and
don't try to detect duplicates by their original filename.
2021-01-15 01:13:59 +01:00
Mike Fährmann
01b9ccd4de
[derpibooru] use the "Everything" filter by default (#1243)
when neither 'api-key' nor 'filter' is set
2021-01-14 18:29:09 +01:00
Mike Fährmann
91308140ec
make 'generate_token()' compatible with Python 3.4 2021-01-14 03:48:10 +01:00
Mike Fährmann
1fdecfa269
[kemonoparty] use API endpoints (#1216) 2021-01-14 03:23:21 +01:00
Mike Fährmann
318876e4dd
[nozomi] add 'num' enumeration index (closes #1239) 2021-01-12 22:32:52 +01:00
Mike Fährmann
2da9068ea8
[sankaku] simplify login process 2021-01-12 00:15:22 +01:00
Mike Fährmann
e07dfc4fe5
[kemonoparty] add 'user' and 'post' extractors (#1216) 2021-01-11 22:17:08 +01:00
Mike Fährmann
780b6adb91
rename 'generate_csrf_token()' to just 'generate_token()'
and add a 'size' argument
2021-01-11 22:12:40 +01:00
Mike Fährmann
f277e48c77
release version 1.16.3 2021-01-10 00:31:52 +01:00
Mike Fährmann
79501a356f
fix crash when 'path-restrict' is an object/dict
This basically reverts commit 5818c928

(#1234)
2021-01-10 00:13:48 +01:00
Mike Fährmann
0fdaea00a3
[postprocessor:metadata] sanitize filenames 2021-01-10 00:13:20 +01:00
Mike Fährmann
32fcc61b84
release version 1.16.2 2021-01-09 18:24:51 +01:00
Mike Fährmann
02bc59d75c
[hentainexus] fix extraction (fixes #1234)
hentainexus is now hosting its images on wordpress, or at least it is
using wordpress' servers as cache:
https://i2.wp.com/images.hentainexus.com/gallery/2199754b23c191deb330c99c9bb43341/9576/002.png?filter=null
2021-01-09 15:12:05 +01:00
Mike Fährmann
5d4494b15f
add "ascii" as a special 'path-restrict' value 2021-01-09 02:41:20 +01:00
Mike Fährmann
5818c928c4
refactor 'path-restrict' parsing 2021-01-09 02:33:42 +01:00