Mike Fährmann
bdfcc9c4b1
update extractor test results
2021-04-18 20:28:15 +02:00
Hans Christian Gunawan
334d690687
[hentaicosplays] Add extractor (#1473)
2021-04-18 20:28:00 +02:00
Mike Fährmann
82c32d25af
[500px] update query hashes
2021-04-15 17:28:31 +02:00
Mike Fährmann
de14b7ad7a
[slideshare] fix extraction
2021-04-15 17:15:59 +02:00
Mike Fährmann
bef3105121
[komikcast] fix extraction
2021-04-15 17:04:53 +02:00
Mike Fährmann
086925e685
[shopify] support omgmiamiswimwear.com (closes #1280)
2021-04-13 23:54:03 +02:00
thatfuckingbird
224b883ff4
[danbooru] add option for extended metadata extraction (#1458)
* [danbooru] add option for extended metadata extraction
* appease linter
* [danbooru] update docs/configuration.rst
* [danbooru] rename extended-metadata -> metadata
2021-04-13 23:41:30 +02:00
thatfuckingbird
dff03a6605
[booru] add an option to extract notes (only gelbooru for now) (#1457)
* [booru] add an option to extract notes (currently implemented only for gelbooru)
* appease linter
* [gelbooru] rename "text" to "body" in note extraction
* add a code comment about reusing return value of _extended_tags
2021-04-13 23:40:24 +02:00
Mike Fährmann
78d7ee3ef4
[yuki] remove module for yuki.la
2021-04-12 21:42:32 +02:00
Mike Fährmann
5a98bcec3a
[deviantart] improve folder name matching (fixes #1451)
2021-04-11 20:39:40 +02:00
thatfuckingbird
918b0441fb
[gelbooru] fix tag category extraction (#1455)
2021-04-10 19:05:00 +02:00
Mike Fährmann
fe6ce5b8f8
[erome] skip deleted albums (fixes #1447)
2021-04-09 15:24:18 +02:00
Mike Fährmann
457abf0e71
[deviantart] fix pagination for Eclipse results (fixes #1444)
- don't crash on missing keys
- use fallback for invalid 'nextOffset' values
2021-04-09 15:16:56 +02:00
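The two points in the commit above (tolerating missing keys, falling back when 'nextOffset' is unusable) could look roughly like the sketch below. This is not the extractor's actual code: only 'nextOffset' comes from the commit message, while the 'hasMore' key, the 'page_size' default, and the function name are assumptions made for illustration.

```python
def next_offset(page, current_offset, page_size=24):
    """Return the offset of the next page, or None if there is none.

    'page' is a decoded JSON response (a dict). Only 'nextOffset' is
    named in the commit message; 'hasMore' and 'page_size' are
    hypothetical.
    """
    # dict.get() returns None instead of raising KeyError on missing keys
    if not page.get("hasMore"):
        return None

    offset = page.get("nextOffset")

    # Treat missing, non-integer, or non-advancing values as invalid
    # and fall back to a locally computed offset
    if (isinstance(offset, bool) or not isinstance(offset, int)
            or offset <= current_offset):
        return current_offset + page_size
    return offset
```

With this shape, a response lacking 'nextOffset' still advances by one page instead of raising an exception or looping on the same offset.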
Mike Fährmann
dee540050f
[8muses] fix JSON unobfuscation
limit the characters that get modified,
leave non-ASCII characters alone
2021-04-09 01:49:54 +02:00
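As an illustration of "limit the characters that get modified", here is a minimal sketch that assumes the obfuscation is a ROT47-style rotation. The commit message does not name the scheme, so both the rotation and the function name are assumptions. Only printable ASCII ('!' through '~') is rotated; everything else, including non-ASCII characters, passes through unchanged.

```python
def rot47(text):
    """Rotate printable ASCII characters by 47 positions.

    Characters outside the range '!'..'~' (whitespace, control
    characters, non-ASCII) are returned unchanged.
    """
    return "".join(
        chr(33 + (ord(c) - 33 + 47) % 94) if "!" <= c <= "~" else c
        for c in text
    )
```

Because the range holds 94 characters, applying the function twice restores the original string, so the same function serves for both directions.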
Mike Fährmann
b869b3a9eb
[instagram] fetch media for incomplete GraphSidecar posts
GraphSidecar results from /tagged pages don't contain
all media elements, only the first one.
(#1439)
2021-04-09 00:37:16 +02:00
Mike Fährmann
b0686d2174
[instagram] update query hashes
2021-04-09 00:37:15 +02:00
Mike Fährmann
e8e3717b71
[instagram] add extractor for /tagged posts (#1439)
2021-04-09 00:37:08 +02:00
Mike Fährmann
abafe71e04
[exhentai] fix image limit detection (closes #1437)
check for image limit message when downloading original files
2021-04-08 21:33:41 +02:00
Mike Fährmann
52a7913abe
[artstation] download /4k/ images (#1422)
2021-04-07 21:50:16 +02:00
Christian Paul
41fbc20020
[webtoons]: Add cookie rstagGDPR_DE=true (#1431)
2021-04-07 21:42:55 +02:00
FollieHiyuki
e3b9f88540
Add manganelo extractor (#1415)
2021-04-02 21:01:31 +02:00
Mike Fährmann
fd858eed7b
[twitter] add 'user_likes' metadata field for liked tweets
i.e. the 'screen_name' of the user whose liked tweets get extracted.
Ideally this would replace 'user' or at least be in the same format,
but that would break backwards compatibility or be impossible/too
complicated thanks to API result differences.
(#1421)
2021-04-02 03:41:41 +02:00
Mike Fährmann
8d124a3766
[twitter] rename variables
2021-04-02 02:49:53 +02:00
Mike Fährmann
105f3c9666
[twitter] add extractor for direct image links (closes #1417)
2021-04-02 02:45:23 +02:00
Mike Fährmann
ec3d5d58a8
[vk] improve extractor (#474)
- fetch all photos
- add 'metadata' option
- fix extracting photos without '?' in URL
2021-04-01 14:35:56 +02:00
Mike Fährmann
ebd142e2a8
[twitter] don't use youtube-dl for cards when videos are disabled
(#1416)
2021-04-01 14:26:08 +02:00
Mike Fährmann
d5aad999dc
[tapas] implement login with username & password (#692)
2021-03-30 01:45:28 +02:00
Mike Fährmann
e9ec91c811
[exhentai] improve image limits check
- check if current image is the '509 Bandwidth Exceeded' notification
  (https://ehgt.org/g/509.gif or https://exhentai.org/img/509.gif)
- remove 'limits' option
2021-03-29 19:01:13 +02:00
Mike Fährmann
387fe415d5
unescape items in text.split_html()
2021-03-29 02:12:29 +02:00
Mike Fährmann
36291176bc
[pinterest] add 'search' extractor (#1411)
2021-03-29 01:41:28 +02:00
Mike Fährmann
058cc47e9b
[bcy] improve pagination
2021-03-28 23:08:26 +02:00
Mike Fährmann
ddd48ceee5
update extractor test results
2021-03-28 23:06:44 +02:00
Mike Fährmann
1a540fbe00
[komikcast] fix extraction
2021-03-28 21:18:58 +02:00
Mike Fährmann
5aa30c3669
[tapas] add 'series' and 'episode' extractors (#692)
2021-03-27 18:28:16 +01:00
Mike Fährmann
ccfa5a8694
[twitter] better error message when logging in with 2FA (#1409)
2021-03-27 18:26:37 +01:00
Mike Fährmann
214ecf62ce
[deviantart] fix arguments for search/popular results (#1408)
2021-03-27 18:26:10 +01:00
Magnus Boman
522d0a834c
[aryion] Unescape paths too (#1414)
Without this you'll get paths like this:
- Starcross - Ch. 2 &quot;The Ins and Outs of Sarah&quot;
This commit changes it to:
- Starcross - Ch. 2 "The Ins and Outs of Sarah"
2021-03-27 18:25:38 +01:00
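The effect described in the commit above can be reproduced with Python's standard library. This is only a sketch of the general technique: whether the extractor calls 'html.unescape' directly or goes through its own helper is not stated here, and the '&quot;' entity in the sample input is an assumption about what the escaped path looked like.

```python
import html

# Escaped path as it might appear in the page source (assumed entity)
escaped = 'Starcross - Ch. 2 &quot;The Ins and Outs of Sarah&quot;'

# html.unescape() converts named and numeric character references
unescaped = html.unescape(escaped)
print(unescaped)
```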
beesdotjson
5ad615f0db
fix PixivFavoriteExtractor regex (#1405)
* fix PixivFavoriteExtractor regex
* do not use lookbehind
2021-03-25 14:59:33 +01:00
Mike Fährmann
62cfee4d28
[vk] initial support for albums (#474)
2021-03-23 19:02:16 +01:00
Mike Fährmann
0e601de67b
[sankaku] simplify 'pool' tags (#1388)
normalize 'tags' and 'artist_tags' to a string-list
2021-03-23 18:45:45 +01:00
Mike Fährmann
d085ade9d5
[sankaku] add 'tag_string' metadata field (#1388)
The 'join()'ed version of 'tags'.
Handling lists in format strings isn't properly supported yet.
2021-03-23 15:42:13 +01:00
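A minimal sketch of what a joined 'tag_string' field amounts to, assuming 'tags' is a list of strings and a single space is the separator. The separator and the helper name are assumptions; the commit message only says the field is the 'join()'ed version of 'tags'.

```python
def add_tag_string(post, separator=" "):
    """Add a 'tag_string' field holding the joined 'tags' list.

    'post' is a metadata dict; a missing or empty 'tags' entry
    yields an empty string. The separator is an assumption.
    """
    tags = post.get("tags") or []
    post["tag_string"] = separator.join(tags)
    return post
```

A flat string like this can be used directly in a filename format string, which is the limitation the commit message points at.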
Mike Fährmann
2dffd231b7
[sankaku] add enumeration index for books (#1388)
2021-03-23 15:32:54 +01:00
Mike Fährmann
139fb84108
[deviantart] fix username for 'watch' results (#794)
before it'd use "/" as username
2021-03-22 22:14:21 +01:00
Mike Fährmann
91c2e15da9
[deviantart] add support for posts from watched users (#794)
2021-03-22 19:25:04 +01:00
Mike Fährmann
03c20d8c8e
[deviantart] update 'watch' URL pattern (#794)
2021-03-21 22:48:06 +01:00
Mike Fährmann
2846235669
[twitter] allow specifying a custom format for user results
(#1337)
2021-03-21 22:26:26 +01:00
Mike Fährmann
dc23cfd684
[deviantart] use fallback for /intermediary/ URLs
instead of checking availability with HEAD requests
2021-03-20 00:10:53 +01:00
Mike Fährmann
b0438c8f99
Revert "[deviantart] extend 'extra' option"
This reverts commit
5ad2b9c82bd9a92b80b935cb268cedb35008da86,
5c32a7bf58176bb5d2c5e22260cfe1d8a0844808, and
83f465faca3107c6406972d913d3f194412d9494.
(#1387, #1356)
2021-03-19 16:24:23 +01:00
Mike Fährmann
58b93635ee
[architizer] add 'firm' extractor (#1369)
2021-03-19 01:31:34 +01:00
Mike Fährmann
204523611c
[imgclick] use 'http://' for image URLs
The TLS certificate for main.imgclick.net is invalid.
2021-03-19 01:30:49 +01:00