1381 Commits

Author SHA1 Message Date
Mike Fährmann
946f2751e2
[reddit] add 'user' extractor (closes #350) 2019-09-22 22:18:17 +02:00
Mike Fährmann
c14abb9fb8
[reddit] improve URL parameter handling for subreddit links 2019-09-22 22:03:22 +02:00
Mike Fährmann
ee8b654464
[instagram] implement 'highlights' option (closes #329) 2019-09-21 23:38:20 +02:00
Mike Fährmann
f63c3097a9
[instagram] rework some code paths
- combine fetching an HTML page and extracting its 'shared_data'
- move 'shared_data' and field access info out of '_extract_page()'
- introduce a '_request_graphql()' method
2019-09-21 23:10:41 +02:00
Mike Fährmann
4330133114
[imgur] add 'favorite' extractor (closes #420)
… and use a newer site-internal API endpoint for user posts
2019-09-19 15:54:26 +02:00
Mike Fährmann
ee5e20221f
[imgth] fix image URLs 2019-09-19 14:56:48 +02:00
Mike Fährmann
b63b126808
[hentaicafe] extend URL pattern 2019-09-18 19:08:45 +02:00
Mike Fährmann
d780f0357e
[imgur] add user extractor 2019-09-17 22:58:18 +02:00
Mike Fährmann
11ea689013
[simplyhentai] fix image and video URLs 2019-09-16 21:37:16 +02:00
Mike Fährmann
15632a1570
[tsumino] fix extraction 2019-09-15 22:09:59 +02:00
Mike Fährmann
d92802fd37
[luscious] fix detection of unavailable galleries 2019-09-15 21:16:25 +02:00
Mike Fährmann
f99da2b866
[imgbb] detect invalid album and user profile links
and update test results, since the old album got deleted
2019-09-14 23:22:08 +02:00
Mike Fährmann
01bc7adadc
[deviantart] improve journal detection (#419)
Some journal-like posts are not reported to be journals (isJournal
is set to False), even though they have a textContent field.

https://www.deviantart.com/gliitchlord/art/brashstrokes-812942668
2019-09-14 22:45:22 +02:00
Mike Fährmann
6e12907de6
[deviantart] improve handling of private deviations (#414)
- don't try to call '/deviation/metadata' with an empty list of
  deviation ids
- print a warning when detecting private deviations without having
  a 'refresh-token'
2019-09-10 16:09:03 +02:00
Mike Fährmann
e7690ac694
[vsco] update URL pattern (closes #410) 2019-09-08 11:37:27 +02:00
Mike Fährmann
1848788970
update test results etc 2019-09-08 11:33:35 +02:00
Mike Fährmann
d5fbb2d9de
[tumblr] ignore audio links from Spotify etc. 2019-09-07 18:18:12 +02:00
Mike Fährmann
b1cddce865
Revert "[simplyhentai] fix extraction; remove image+video extractors"
This reverts commit d1db5180ab29a3c892d7fa5459cf7073ea029573.
2019-09-07 14:48:31 +02:00
Mike Fährmann
d23660c04d
[hentaicafe] restore default 'request()' behavior 2019-09-07 14:35:00 +02:00
Mike Fährmann
9ae58a6b3e
[exhentai] update image limit checks
- adjust cost of original images
- delay limit initialization until gallery and first image page have
  been requested and all cookies are available
2019-09-07 13:29:01 +02:00
Mike Fährmann
6fe9a134bf
[lineblog] add blog and post extractors (closes #404) 2019-09-06 22:16:42 +02:00
Mike Fährmann
4e8a548a61
[livedoor] update metadata extraction 2019-09-06 21:44:25 +02:00
Mike Fährmann
f9285f99e6
[pixiv] fix authentication 2019-09-02 22:38:56 +02:00
Mike Fährmann
6f3df3999a
[fuskator] add gallery and search extractor (closes #407) 2019-09-02 21:20:02 +02:00
Mike Fährmann
bc0ca66c99
[twitter] small improvements
- handle reply tweets (#403)
- unset cookies in Tweet extractor to "force" the legacy interface
2019-09-01 17:37:48 +02:00
Mike Fährmann
f02a768b5c
[danbooru] add 'ugoira' option (#406)
to choose between ZIP archives or converted video files
for Ugoira posts
2019-08-31 21:51:52 +02:00
Mike Fährmann
dedea3b4db
[deviantart] fix journal creation (#400) 2019-08-30 18:50:04 +02:00
Mike Fährmann
c6c5cb1898
improve 'deviantart.quality' description 2019-08-30 18:41:18 +02:00
Mike Fährmann
efb64ad031
[deviantart] generate filenames (#392, #400) 2019-08-29 10:09:21 +02:00
Mike Fährmann
b2151f3928
[seiga] support mobile URLs (closes #401) 2019-08-28 22:56:43 +02:00
Mike Fährmann
20fd2d8450
[flickr] skip unavailable images/videos (fixes #398) 2019-08-27 23:26:49 +02:00
Mike Fährmann
5cc7be2536
[piczel] update and improve
- use proper pagination (fixes #396)
- update API host and endpoints
- "fix" double slash // in image URLs
2019-08-24 20:37:33 +02:00
Mike Fährmann
49f6d7176d
[deviantart] restore filenames (#392)
<title>_by_<user>_<id> --> <title>_by_<user>-<id>
2019-08-23 22:02:03 +02:00
Mike Fährmann
63daa68d67
[deviantart] improvements (#392)
- consistent 'filename' entries, at least as far as possible
  - GIFs and SWFs don't have a <title>_by_<artist>_<id> anywhere in
    their metadata
  - Generating <id> (from 'deviationid'?) might be something that needs
    to be figured out, so we can build those filenames ourselves
- better code structure etc.
- tests for videos, archives, and flash animations
2019-08-23 12:27:19 +02:00
Mike Fährmann
d1db5180ab
[simplyhentai] fix extraction; remove image+video extractors 2019-08-22 23:56:41 +02:00
Mike Fährmann
30d6e284b0
[deviantart] use NAPI for artworks and scraps (#392)
TODO:
- journal downloads
- test for all media types
2019-08-21 23:56:06 +02:00
Mike Fährmann
7d6af936c5
[imgur] simplify gallery extraction 2019-08-20 20:00:43 +02:00
Mike Fährmann
51d10783fc
[patreon] include image info in API results (#383) 2019-08-18 23:28:47 +02:00
Mike Fährmann
7a5e78741c
[booru] build directory path for each file (#385) 2019-08-18 23:28:33 +02:00
Mike Fährmann
b1728f512d
[patreon] support multi image posts and post URLs (#383) 2019-08-17 23:24:46 +02:00
Mike Fährmann
c50d60a53d
[reactor] fix image URLs 2019-08-16 14:07:22 +02:00
Mike Fährmann
32447d0d24
[pixiv] simplify default filename format
(#366)
2019-08-15 13:32:47 +02:00
Mike Fährmann
829b1ccf04
[imgur] distinguish album and gallery URLs (#380)
A gallery can be either an album or a single image.
2019-08-14 21:40:14 +02:00
Mike Fährmann
23251356cb
require 'extension' data for each URL (#382) 2019-08-14 20:03:03 +02:00
Mike Fährmann
a67413d64f
[xhamster] use input URL domain
Don't rewrite all URLs as 'https://xhamster.com/...'
2019-08-14 00:21:30 +02:00
Mike Fährmann
423f68f585
[deviantart] fix scraps extraction (closes #376) 2019-08-11 16:06:15 +02:00
Mike Fährmann
3bf20ffb70
[instagram] add support for story highlights 2019-08-10 14:34:22 +02:00
Mike Fährmann
a732e9c430
[instagram] update query hashes and headers 2019-08-10 14:13:08 +02:00
Mike Fährmann
2ccf6a9e35
[instagram] make extractor tests happy (#373) 2019-08-08 18:50:26 +02:00
Leonardo Taccari
bc5eaf7746 [instagram] Add support for IGTV (#373)
Add support for IGTV profile (instagram.com/<username>/channel/)
and IGTV medias (instagram.com/tv/<short_id>).
2019-08-08 18:33:13 +02:00