2795 Commits

Author SHA1 Message Date
Mike Fährmann
9b52eb9bf1
[naverwebtoon] ignore non-comic images 2021-04-18 20:28:30 +02:00
Mike Fährmann
bdfcc9c4b1
update extractor test results 2021-04-18 20:28:15 +02:00
Hans Christian Gunawan
334d690687
[hentaicosplays] Add extractor (#1473) 2021-04-18 20:28:00 +02:00
Mike Fährmann
82c32d25af
[500px] update query hashes 2021-04-15 17:28:31 +02:00
Mike Fährmann
de14b7ad7a
[slideshare] fix extraction 2021-04-15 17:15:59 +02:00
Mike Fährmann
bef3105121
[komikcast] fix extraction 2021-04-15 17:04:53 +02:00
Mike Fährmann
086925e685
[shopify] support omgmiamiswimwear.com (closes #1280) 2021-04-13 23:54:03 +02:00
thatfuckingbird
224b883ff4
[danbooru] add option for extended metadata extraction (#1458)
* [danbooru] add option for extended metadata extraction

* appease linter

* [danbooru] update docs/configuration.rst

* [danbooru] rename extended-metadata -> metadata
2021-04-13 23:41:30 +02:00
thatfuckingbird
dff03a6605
[booru] add an option to extract notes (only gelbooru for now) (#1457)
* [booru] add an option to extract notes (currently implemented only for gelbooru)

* appease linter

* [gelbooru] rename "text" to "body" in note extraction

* add a code comment about reusing return value of _extended_tags
2021-04-13 23:40:24 +02:00
Mike Fährmann
78d7ee3ef4
[yuki] remove module for yuki.la 2021-04-12 21:42:32 +02:00
Mike Fährmann
a86ffb04bb
add 'output.fallback' option
to enable/disable fallback URLs for -g/--get-urls
2021-04-12 02:00:41 +02:00
Mike Fährmann
5a98bcec3a
[deviantart] improve folder name matching (fixes #1451) 2021-04-11 20:39:40 +02:00
thatfuckingbird
918b0441fb
[gelbooru] fix tag category extraction (#1455) 2021-04-10 19:05:00 +02:00
Mike Fährmann
fe6ce5b8f8
[erome] skip deleted albums (fixes #1447) 2021-04-09 15:24:18 +02:00
Mike Fährmann
457abf0e71
[deviantart] fix pagination for Eclipse results (fixes #1444)
- don't crash on missing keys
- use fallback for invalid 'nextOffset' values
2021-04-09 15:16:56 +02:00
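The two bullet points in the commit above describe a defensive pagination pattern. A minimal sketch of that idea follows. It is not the extractor's actual code: the function name `next_offset`, the `hasMore` key, and the `page_size` parameter are assumptions made for illustration, and only `nextOffset` comes from the commit message.

```python
def next_offset(response, current, page_size):
    """Return the offset of the next page, or None when paging should stop.

    'hasMore' and 'page_size' are hypothetical; only 'nextOffset'
    is named in the commit message.
    """
    # dict.get() avoids a KeyError when the key is missing
    if not response.get("hasMore"):
        return None

    offset = response.get("nextOffset")
    # fall back to a computed offset when the value is missing,
    # not a plain integer, or would not advance past the current page
    if type(offset) is not int or offset <= current:
        offset = current + page_size
    return offset
```

With this sketch, `next_offset({"hasMore": True, "nextOffset": None}, 24, 24)` falls back to the computed offset instead of raising.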
Mike Fährmann
dee540050f
[8muses] fix JSON unobfuscation
limit the characters that get modified,
leave non-ASCII characters alone
2021-04-09 01:49:54 +02:00
Mike Fährmann
b869b3a9eb
[instagram] fetch media for incomplete GraphSidecar posts
GraphSidecar results from /tagged pages don't contain
all media elements, only the first one.

(#1439)
2021-04-09 00:37:16 +02:00
Mike Fährmann
b0686d2174
[instagram] update query hashes 2021-04-09 00:37:15 +02:00
Mike Fährmann
e8e3717b71
[instagram] add extractor for /tagged posts (#1439) 2021-04-09 00:37:08 +02:00
Mike Fährmann
abafe71e04
[exhentai] fix image limit detection (closes #1437)
check for image limit message when downloading original files
2021-04-08 21:33:41 +02:00
Mike Fährmann
a75e485461
add archive format to InfoJob output (#875) 2021-04-07 21:50:16 +02:00
Mike Fährmann
52a7913abe
[artstation] download /4k/ images (#1422) 2021-04-07 21:50:16 +02:00
Mike Fährmann
37940193a6
build executables with SOCKS proxy support (closes #1424) 2021-04-07 21:50:03 +02:00
Christian Paul
41fbc20020
[webtoons]: Add cookie rstagGDPR_DE=true (#1431) 2021-04-07 21:42:55 +02:00
Mike Fährmann
583bee7725
release version 1.17.2 2021-04-02 21:16:44 +02:00
FollieHiyuki
e3b9f88540
Add manganelo extractor (#1415) 2021-04-02 21:01:31 +02:00
Mike Fährmann
fd858eed7b
[twitter] add 'user_likes' metadata field for liked tweets
i.e. the 'screen_name' of the user whose liked tweets get extracted.

Ideally this would replace 'user' or at least be in the same format,
but that would break backwards compatibility or be impossible/too
complicated thanks to API result differences.

(#1421)
2021-04-02 03:41:41 +02:00
Mike Fährmann
8d124a3766
[twitter] rename variables 2021-04-02 02:49:53 +02:00
Mike Fährmann
105f3c9666
[twitter] add extractor for direct image links (closes #1417) 2021-04-02 02:45:23 +02:00
Mike Fährmann
ec3d5d58a8
[vk] improve extractor (#474)
- fetch all photos
- add 'metadata' option
- fix extracting photos without '?' in URL
2021-04-01 14:35:56 +02:00
Mike Fährmann
ebd142e2a8
[twitter] don't use youtube-dl for cards when videos are disabled
(#1416)
2021-04-01 14:26:08 +02:00
Mike Fährmann
d5aad999dc
[tapas] implement login with username & password (#692) 2021-03-30 01:45:28 +02:00
Mike Fährmann
e9ec91c811
[exhentai] improve image limits check
- check if current image is the '509 Bandwidth Exceeded' notification
  (https://ehgt.org/g/509.gif or https://exhentai.org/img/509.gif)
- remove 'limits' option
2021-03-29 19:01:13 +02:00
Mike Fährmann
387fe415d5
unescape items in text.split_html() 2021-03-29 02:12:29 +02:00
Mike Fährmann
36291176bc
[pinterest] add 'search' extractor (#1411) 2021-03-29 01:41:28 +02:00
Mike Fährmann
058cc47e9b
[bcy] improve pagination 2021-03-28 23:08:26 +02:00
Mike Fährmann
ddd48ceee5
update extractor test results 2021-03-28 23:06:44 +02:00
Mike Fährmann
1a540fbe00
[komikcast] fix extraction 2021-03-28 21:18:58 +02:00
Mike Fährmann
78fd63b8f0
remove 'text.clean_xml()'
was not used anywhere
2021-03-28 04:05:16 +02:00
Mike Fährmann
8553b218d9
replace calls to 'os.path.splitext()' with 'str.rpartition()'
Makes functions that used it more than twice as fast,
and we can get rid of an import as well.
2021-03-28 04:01:27 +02:00
Mike Fährmann
5aa30c3669
[tapas] add 'series' and 'episode' extractors (#692) 2021-03-27 18:28:16 +01:00
Mike Fährmann
ccfa5a8694
[twitter] better error message when logging in with 2FA (#1409) 2021-03-27 18:26:37 +01:00
Mike Fährmann
214ecf62ce
[deviantart] fix arguments for search/popular results (#1408) 2021-03-27 18:26:10 +01:00
Magnus Boman
522d0a834c
[aryion] Unescape paths too (#1414)
Without this you'll get paths like this:
  - Starcross - Ch. 2 &quot;The Ins and Outs of Sarah&quot;

This commit changes it to:
  - Starcross - Ch. 2 "The Ins and Outs of Sarah"
2021-03-27 18:25:38 +01:00
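The effect of unescaping a path can be reproduced with the standard library alone. This is a sketch using `html.unescape()`; whether the extractor calls this function or a wrapper of its own is not stated in the commit, so treat the choice of function as an assumption.

```python
from html import unescape

# a path segment as it might appear in the page source, entities intact
escaped = "Starcross - Ch. 2 &quot;The Ins and Outs of Sarah&quot;"

# unescape() converts HTML character references back to characters
path = unescape(escaped)
print(path)
```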
beesdotjson
5ad615f0db
fix PixivFavoriteExtractor regex (#1405)
* fix PixivFavoriteExtractor regex

* do not use lookbehind
2021-03-25 14:59:33 +01:00
Mike Fährmann
62cfee4d28
[vk] initial support for albums (#474) 2021-03-23 19:02:16 +01:00
Mike Fährmann
0e601de67b
[sankaku] simplify 'pool' tags (#1388)
normalize 'tags' and 'artist_tags' to a string-list
2021-03-23 18:45:45 +01:00
Mike Fährmann
d085ade9d5
[sankaku] add 'tag_string' metadata field (#1388)
The 'join()'ed version of 'tags'.
Handling lists in format strings isn't properly supported yet.
2021-03-23 15:42:13 +01:00
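A joined field of this kind can be sketched in a few lines. The tag values and the single-space separator below are assumptions for illustration; the commit only says that 'tag_string' is the 'join()'ed version of 'tags'.

```python
# hypothetical metadata with 'tags' already normalized to a list of strings
metadata = {"tags": ["tag_a", "tag_b", "tag_c"]}

# the separator is an assumption; the commit does not name one
metadata["tag_string"] = " ".join(metadata["tags"])
print(metadata["tag_string"])
```

A plain string like this can be used directly in a format string, which is the limitation with lists that the commit message mentions.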
Mike Fährmann
2dffd231b7
[sankaku] add enumeration index for books (#1388) 2021-03-23 15:32:54 +01:00
Mike Fährmann
139fb84108
[deviantart] fix username for 'watch' results (#794)
before it'd use "/" as username
2021-03-22 22:14:21 +01:00