2302 Commits

Author SHA1 Message Date
Mike Fährmann
bdfcc9c4b1
update extractor test results 2021-04-18 20:28:15 +02:00
Hans Christian Gunawan
334d690687
[hentaicosplays] Add extractor (#1473) 2021-04-18 20:28:00 +02:00
Mike Fährmann
82c32d25af
[500px] update query hashes 2021-04-15 17:28:31 +02:00
Mike Fährmann
de14b7ad7a
[slideshare] fix extraction 2021-04-15 17:15:59 +02:00
Mike Fährmann
bef3105121
[komikcast] fix extraction 2021-04-15 17:04:53 +02:00
Mike Fährmann
086925e685
[shopify] support omgmiamiswimwear.com (closes #1280) 2021-04-13 23:54:03 +02:00
thatfuckingbird
224b883ff4
[danbooru] add option for extended metadata extraction (#1458)
* [danbooru] add option for extended metadata extraction

* appease linter

* [danbooru] update docs/configuration.rst

* [danbooru] rename extended-metadata -> metadata
2021-04-13 23:41:30 +02:00
thatfuckingbird
dff03a6605
[booru] add an option to extract notes (only gelbooru for now) (#1457)
* [booru] add an option to extract notes (currently implemented only for gelbooru)

* appease linter

* [gelbooru] rename "text" to "body" in note extraction

* add a code comment about reusing return value of _extended_tags
2021-04-13 23:40:24 +02:00
Mike Fährmann
78d7ee3ef4
[yuki] remove module for yuki.la 2021-04-12 21:42:32 +02:00
Mike Fährmann
5a98bcec3a
[deviantart] improve folder name matching (fixes #1451) 2021-04-11 20:39:40 +02:00
thatfuckingbird
918b0441fb
[gelbooru] fix tag category extraction (#1455) 2021-04-10 19:05:00 +02:00
Mike Fährmann
fe6ce5b8f8
[erome] skip deleted albums (fixes #1447) 2021-04-09 15:24:18 +02:00
Mike Fährmann
457abf0e71
[deviantart] fix pagination for Eclipse results (fixes #1444)
- don't crash on missing keys
- use fallback for invalid 'nextOffset' values
2021-04-09 15:16:56 +02:00
Mike Fährmann
dee540050f
[8muses] fix JSON unobfuscation
limit the characters that get modified,
leave non-ASCII characters alone
2021-04-09 01:49:54 +02:00
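The idea in the [8muses] commit above, touching only a limited character range and passing everything else through, can be sketched as follows. This is an illustration only: the commit message does not name the substitution used, so the ROT47-style shift and the function name `unobfuscate` are assumptions, not the extractor's actual code.

```python
def unobfuscate(text: str) -> str:
    """Rotate printable ASCII ('!'..'~') by 47 places; leave all other characters alone.

    Assumption: a ROT47-style substitution. The commit message only says that
    the set of modified characters is limited and non-ASCII is left untouched.
    """
    return "".join(
        chr(33 + (ord(c) + 14) % 94) if "!" <= c <= "~" else c
        for c in text
    )


sample = 'name: "日本語"'          # made-up input with non-ASCII content
encoded = unobfuscate(sample)     # ASCII is shifted, non-ASCII is unchanged
decoded = unobfuscate(encoded)    # ROT47 is its own inverse
```

Restricting the shift to the `'!'..'~'` range is what keeps spaces, control characters, and non-ASCII text intact.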
Mike Fährmann
b869b3a9eb
[instagram] fetch media for incomplete GraphSidecar posts
GraphSidecar results from /tagged pages don't contain
all media elements, only the first one.

(#1439)
2021-04-09 00:37:16 +02:00
Mike Fährmann
b0686d2174
[instagram] update query hashes 2021-04-09 00:37:15 +02:00
Mike Fährmann
e8e3717b71
[instagram] add extractor for /tagged posts (#1439) 2021-04-09 00:37:08 +02:00
Mike Fährmann
abafe71e04
[exhentai] fix image limit detection (closes #1437)
check for image limit message when downloading original files
2021-04-08 21:33:41 +02:00
Mike Fährmann
52a7913abe
[artstation] download /4k/ images (#1422) 2021-04-07 21:50:16 +02:00
Christian Paul
41fbc20020
[webtoons]: Add cookie rstagGDPR_DE=true (#1431) 2021-04-07 21:42:55 +02:00
FollieHiyuki
e3b9f88540
Add manganelo extractor (#1415) 2021-04-02 21:01:31 +02:00
Mike Fährmann
fd858eed7b
[twitter] add 'user_likes' metadata field for liked tweets
i.e. the 'screen_name' of the user whose liked tweets get extracted.

Ideally this would replace 'user' or at least be in the same format,
but that would break backwards compatibility or be impossible/too
complicated thanks to API result differences.

(#1421)
2021-04-02 03:41:41 +02:00
Mike Fährmann
8d124a3766
[twitter] rename variables 2021-04-02 02:49:53 +02:00
Mike Fährmann
105f3c9666
[twitter] add extractor for direct image links (closes #1417) 2021-04-02 02:45:23 +02:00
Mike Fährmann
ec3d5d58a8
[vk] improve extractor (#474)
- fetch all photos
- add 'metadata' option
- fix extracting photos without '?' in URL
2021-04-01 14:35:56 +02:00
Mike Fährmann
ebd142e2a8
[twitter] don't use youtube-dl for cards when videos are disabled
(#1416)
2021-04-01 14:26:08 +02:00
Mike Fährmann
d5aad999dc
[tapas] implement login with username & password (#692) 2021-03-30 01:45:28 +02:00
Mike Fährmann
e9ec91c811
[exhentai] improve image limits check
- check if current image is the '509 Bandwidth Exceeded' notification
  (https://ehgt.org/g/509.gif or https://exhentai.org/img/509.gif)
- remove 'limits' option
2021-03-29 19:01:13 +02:00
Mike Fährmann
387fe415d5
unescape items in text.split_html() 2021-03-29 02:12:29 +02:00
Mike Fährmann
36291176bc
[pinterest] add 'search' extractor (#1411) 2021-03-29 01:41:28 +02:00
Mike Fährmann
058cc47e9b
[bcy] improve pagination 2021-03-28 23:08:26 +02:00
Mike Fährmann
ddd48ceee5
update extractor test results 2021-03-28 23:06:44 +02:00
Mike Fährmann
1a540fbe00
[komikcast] fix extraction 2021-03-28 21:18:58 +02:00
Mike Fährmann
5aa30c3669
[tapas] add 'series' and 'episode' extractors (#692) 2021-03-27 18:28:16 +01:00
Mike Fährmann
ccfa5a8694
[twitter] better error message when logging in with 2FA (#1409) 2021-03-27 18:26:37 +01:00
Mike Fährmann
214ecf62ce
[deviantart] fix arguments for search/popular results (#1408) 2021-03-27 18:26:10 +01:00
Magnus Boman
522d0a834c
[aryion] Unescape paths too (#1414)
Without this you'll get paths like this:
  - Starcross - Ch. 2 &quot;The Ins and Outs of Sarah&quot;

This commit changes it to:
  - Starcross - Ch. 2 "The Ins and Outs of Sarah"
2021-03-27 18:25:38 +01:00
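The effect described in the [aryion] commit above can be reproduced with the standard library. This is a minimal sketch, assuming the raw path contains HTML entities such as `&quot;`; the variable names are illustrative and not taken from the extractor.

```python
import html

# Assumed raw value as it might appear in the page source (entities still escaped).
raw_path = "Starcross - Ch. 2 &quot;The Ins and Outs of Sarah&quot;"

# html.unescape() converts named and numeric character references to text.
clean_path = html.unescape(raw_path)
print(clean_path)
```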
beesdotjson
5ad615f0db
fix PixivFavoriteExtractor regex (#1405)
* fix PixivFavoriteExtractor regex

* do not use lookbehind
2021-03-25 14:59:33 +01:00
Mike Fährmann
62cfee4d28
[vk] initial support for albums (#474) 2021-03-23 19:02:16 +01:00
Mike Fährmann
0e601de67b
[sankaku] simplify 'pool' tags (#1388)
normalize 'tags' and 'artist_tags' to a string-list
2021-03-23 18:45:45 +01:00
Mike Fährmann
d085ade9d5
[sankaku] add 'tag_string' metadata field (#1388)
The 'join()'ed version of 'tags'.
Handling lists in format strings isn't properly supported yet.
2021-03-23 15:42:13 +01:00
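The 'tag_string' field from the [sankaku] commit above is a pre-joined string that can be dropped into a format string where a list cannot. A minimal sketch, assuming a space separator and a made-up metadata dict (neither is confirmed by the commit message):

```python
# Made-up sample metadata; real results contain many more fields.
post = {"id": 1, "tags": ["landscape", "sky", "sunset"]}

# Assumption: tags are joined with a single space.
post["tag_string"] = " ".join(post["tags"])

# A plain string field works directly in a format string.
filename = "{id}_{tag_string}".format_map(post)
```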
Mike Fährmann
2dffd231b7
[sankaku] add enumeration index for books (#1388) 2021-03-23 15:32:54 +01:00
Mike Fährmann
139fb84108
[deviantart] fix username for 'watch' results (#794)
before it'd use "/" as username
2021-03-22 22:14:21 +01:00
Mike Fährmann
91c2e15da9
[deviantart] add support for posts from watched users (#794) 2021-03-22 19:25:04 +01:00
Mike Fährmann
03c20d8c8e
[deviantart] update 'watch' URL pattern (#794) 2021-03-21 22:48:06 +01:00
Mike Fährmann
2846235669
[twitter] allow specifying a custom format for user results
(#1337)
2021-03-21 22:26:26 +01:00
Mike Fährmann
dc23cfd684
[deviantart] use fallback for /intermediary/ URLs
instead of checking availability with HEAD requests
2021-03-20 00:10:53 +01:00
Mike Fährmann
b0438c8f99
Revert "[deviantart] extend 'extra' option"
This reverts commit
5ad2b9c82bd9a92b80b935cb268cedb35008da86,
5c32a7bf58176bb5d2c5e22260cfe1d8a0844808, and
83f465faca3107c6406972d913d3f194412d9494.

(#1387, #1356)
2021-03-19 16:24:23 +01:00
Mike Fährmann
58b93635ee
[architizer] add 'firm' extractor (#1369) 2021-03-19 01:31:34 +01:00
Mike Fährmann
204523611c
[imgclick] use 'http://' for image URLs
The TLS certificate for main.imgclick.net is invalid.
2021-03-19 01:30:49 +01:00