204 Commits

Author SHA1 Message Date
Ivan Kozik
b7dfb14dd8 Upgrade to and require Python 3.7.0 2018-10-05 07:53:58 +00:00
Ivan Kozik
b32da83a0f Install ludios/pyre2, to be used soon for processing ignores 2018-10-04 14:13:05 +00:00
Ivan Kozik
837551c201 README: link to ludios/wpull 2018-10-04 13:40:28 +00:00
Ivan Kozik
bc512c696d README: fix pip3 install step 2018-10-04 11:27:53 +00:00
Ivan Kozik
7c14a909ef README: Python 3.4.8 -> 3.4.9 2018-10-04 11:22:41 +00:00
Ivan Kozik
ba960e0ea8 README: fix pip3 install step for new setup.py 2018-10-04 11:22:21 +00:00
Ivan Kozik
eaaf0ec06e Use ludios/wpull for html5-parser support 2018-10-04 11:13:35 +00:00
Ivan Kozik
e664e4fd54 README: mention cookies.txt extension for Firefox 2018-09-08 04:30:01 +00:00
Ivan Kozik
424e58a173 README: document DIR/scrape 2018-08-28 01:30:35 +00:00
Ivan Kozik
eabcf70141 README: tweak wording 2018-08-28 01:27:28 +00:00
Ivan Kozik
bf0d7d28a9 README: using Googlebot UA on tumblr no longer works 2018-08-24 00:41:35 +00:00
Ivan Kozik
ca8fd22c02 singletumblr igset: don't ignore non-tumblr domains; don't apply ignores to start URLs
https://github.com/ludios/grab-site/issues/126
2018-08-06 23:36:50 +00:00
Ivan Kozik
644260c479 README: document how to bypass tumblr's GDPR consent page 2018-07-07 12:05:34 +00:00
Ivan Kozik
a3537c7f2c Revert Googlebot UA to avoid breaking reddit crawls
With Googlebot in the UA, reddit says:

429 Too Many Requests https://www.reddit.com/...
2018-07-07 12:03:23 +00:00
Ivan Kozik
aa01eb8293 README: mention updated UA 2018-06-25 02:13:09 +00:00
Ivan Kozik
a2e751f9dc README: Ubuntu 17.10 -> 18.04; show newer-distro instructions first 2018-05-19 19:31:19 +00:00
Ivan Kozik
e79cbac070 README: fix macOS install steps for PyPI now requiring TLS 1.2 support
Fixes https://github.com/ludios/grab-site/issues/121
2018-05-15 20:57:38 +00:00
Ivan Kozik
b97414c5a4 README: Python 3.4.7 -> 3.4.8 2018-05-15 20:51:02 +00:00
Ivan Kozik
82de2f2b2b Add --import-ignores for starting with a non-empty DIR/ignores file 2017-12-27 13:48:20 +00:00
Ivan Kozik
6b6d5785e2 README: adjust logo size 2017-12-27 13:36:32 +00:00
Ivan Kozik
97caf59705 README: add BrowserStack logo per terms 2017-12-13 23:05:41 +00:00
Ivan Kozik
fe38081834 README: thank BrowserStack 2017-12-13 23:00:26 +00:00
Ivan Kozik
4699e581fc README: add install steps for Debian 8 (jessie) 2017-12-07 02:36:14 +00:00
Ivan Kozik
26655fb28c README: switch from PPA-based python3.4 install to pyenv-based install; add install steps for Debian 9 and 10 2017-12-07 02:28:45 +00:00
Ivan Kozik
95e98ecefe README: link to wpull v1.2.3 2017-11-22 18:34:50 +00:00
Ivan Kozik
b3c83f203c README: add note about gs-server listening on all interfaces by default 2017-11-22 18:09:49 +00:00
Ivan Kozik
62d4575b0c README: point to the newer ppa:deadsnakes/ppa PPA with Python 3.4.7 2017-11-22 17:57:36 +00:00
Ivan Kozik
2276adefe8 README: be less confusing about "start a new shell" 2017-11-22 17:25:31 +00:00
Ivan Kozik
fc09d22028 README: ask users to file issues 2017-11-19 04:11:57 +00:00
Ivan Kozik
90300f0f57 Document how to grab a website that requires login / cookies 2017-11-09 11:10:54 +00:00
Ivan Kozik
d9f75f5ae3 README: update "Install on a non-Ubuntu distribution" steps to also use a virtualenv 2017-10-24 17:55:48 +00:00
Ivan Kozik
ad5c4d2449 README: OS X -> macOS and update instructions to use virtualenv 2017-10-24 17:42:33 +00:00
Ivan Kozik
d5698bc08a README: fix TOC order 2017-10-24 17:25:33 +00:00
Ivan Kozik
d9b89f551b README: rework instructions to not require activating the virtualenv 2017-10-24 17:24:27 +00:00
Ivan Kozik
be5db3f397 README: rework the Ubuntu 14.04 install steps to use virtualenv; assume grab-site and related executables are in PATH 2017-10-24 17:14:57 +00:00
Ivan Kozik
0ad6bdf89f README: ancient non-LTS Ubuntu releases are not supported 2017-10-24 17:02:33 +00:00
Ivan Kozik
a954a0caca README: "Python 3.5 or newer" 2017-10-24 16:45:40 +00:00
Ivan Kozik
cd3931b5fc Add install instructions for Windows 10 2017-10-24 16:43:53 +00:00
Ivan Kozik
6680cf7e50 README: add install steps for Ubuntu 17.10 2017-10-24 01:02:42 +00:00
Ivan Kozik
25a19d1dc3 Update install instructions for Ubuntu 17.04 and fold Ubuntu 16.10 instructions into 16.04 instructions 2017-04-09 08:18:59 +00:00
Ivan Kozik
ae400137d3 README: update Help section 2017-03-09 07:58:28 +00:00
Ivan Kozik
69d1dab393 Mention grab-site 'URL' instead of grab-site URL to avoid issues with ? or & 2017-02-26 21:42:53 +00:00
Ivan Kozik
d88dccac27 Fix link to Python installer for OS X (there is no 3.4.5 installer) 2017-02-17 23:33:11 +00:00
Ivan Kozik
94e486c7cf Document --permanent-error-status-codes 2017-02-08 20:23:12 +00:00
Ivan Kozik
4fd740e815 Point to Python 3.4.5 instead of 3.4.3 2017-02-04 13:39:46 +00:00
Ivan Kozik
32544d096e Add install instructions for Ubuntu 16.10 2017-02-04 13:36:47 +00:00
Ivan Kozik
75452363d0 README: Tweak 2016-11-12 22:42:15 +00:00
Ivan Kozik
016a166f14 README: Advise downgrading tmux, not upgrading with some ppa 2016-08-02 18:53:07 +00:00
Ivan Kozik
d66011e86a README: Fix note; wpull 2.0.1 does work on Python 3.5 2016-07-15 05:41:27 +00:00
Ivan Kozik
ca7bc71045 README: Add warning about tmux 2.1 2016-06-21 06:42:46 +00:00