236 Commits

Author SHA1 Message Date
Ivan Kozik
4471660656 README: use spaces for indentation to avoid tab-completion on Tab paste 2018-10-22 15:25:57 +00:00
Ivan Kozik
e650278421 README: try to get people to stop installing grab-site as root and explain the pip options 2018-10-22 15:24:50 +00:00
Ivan Kozik
d22ea9ad78 README: add install steps for NixOS 18.09 2018-10-18 15:04:50 +00:00
Ivan Kozik
4f898d2b06 README: mention pkg-config 2018-10-09 18:32:52 +00:00
Ivan Kozik
9022a57f6f README: move pkg-config to the end because it's for grab-site's dependencies and not for Python 2018-10-09 18:09:02 +00:00
Ivan Kozik
db251482cf README: abbreviate webrecorder.io instructions 2018-10-09 17:49:15 +00:00
Ivan Kozik
73587696f2 README: http:// -> https:// links 2018-10-09 17:42:35 +00:00
Ivan Kozik
ab7e20eb4d README: update webrecorder.io instructions 2018-10-09 17:40:01 +00:00
Ivan Kozik
20156b1f76 README: write more about how to archive Twitter users 2018-10-09 17:26:22 +00:00
Ivan Kozik
ea94a3d925 README: fix macOS install instructions 2018-10-09 16:53:03 +00:00
Ivan Kozik
d737e09b3b README: fix link 2018-10-09 16:44:19 +00:00
Ivan Kozik
5b67d90cfb README: install instruction fixes 2018-10-09 16:00:57 +00:00
Ivan Kozik
ce80bcc8c1 wpull_hooks: when igon is enabled, print which ignore was responsible 2018-10-09 13:22:39 +00:00
Ivan Kozik
53370c78db README: Ubuntu 14.04 and Debian 8 (jessie) are no longer supported 2018-10-09 06:10:03 +00:00
Ivan Kozik
cc839e3468 README: fix TOC link 2018-10-09 04:10:31 +00:00
Ivan Kozik
8e6649d144 README: install libssl-dev instead of libssl1.0-dev now that we no longer use Python 3.4 2018-10-09 04:03:44 +00:00
Ivan Kozik
2a1f0b9548 README: update macOS instructions for lxml and pyre2 (untested)
Brew has a Python 3.7.0 right now, so there is no need to compile a Python with pyenv.
2018-10-08 07:09:50 +00:00
Ivan Kozik
7589d57e75 Thank falconkirtaran 2018-10-07 22:57:42 +00:00
Ivan Kozik
08acad3e1c README: move security note to the end 2018-10-05 16:22:19 +00:00
Ivan Kozik
fca0eaf432 README: wrap lines 2018-10-05 16:19:55 +00:00
Ivan Kozik
c186df0617 README: wrap lines 2018-10-05 16:19:18 +00:00
Ivan Kozik
d3da0899ad README: wrap lines 2018-10-05 16:19:18 +00:00
Ivan Kozik
ea4e4eff74 README: wrap lines 2018-10-05 16:19:18 +00:00
Ivan Kozik
65b4f5caca README: remove phantomjs mention because I removed support from ludios/wpull 2018-10-05 16:19:18 +00:00
Ivan Kozik
e8a2163dd3 README: wrap lines 2018-10-05 16:19:15 +00:00
Ivan Kozik
4473c7040e README: wrap lines 2018-10-05 16:16:04 +00:00
Ivan Kozik
469dfdd0b9 README: wrap lines 2018-10-05 16:13:37 +00:00
Ivan Kozik
e45f6f5b97 README: remove BrowserStack mention 2018-10-05 16:12:41 +00:00
Ivan Kozik
8bf22e410f README: thank JAA 2018-10-05 16:12:15 +00:00
Ivan Kozik
fc54bed43a README: don't tell users to file issues on chfoo/wpull 2018-10-05 16:10:25 +00:00
Ivan Kozik
ca10585c45 README: fix list of options 2018-10-05 16:09:11 +00:00
Ivan Kozik
3f14886435 README: pip3 -> pip 2018-10-05 08:15:23 +00:00
Ivan Kozik
b7dfb14dd8 Upgrade to and require Python 3.7.0 2018-10-05 07:53:58 +00:00
Ivan Kozik
b32da83a0f Install ludios/pyre2, to be used soon for processing ignores 2018-10-04 14:13:05 +00:00
Ivan Kozik
837551c201 README: link to ludios/wpull 2018-10-04 13:40:28 +00:00
Ivan Kozik
bc512c696d README: fix pip3 install step 2018-10-04 11:27:53 +00:00
Ivan Kozik
7c14a909ef README: Python 3.4.8 -> 3.4.9 2018-10-04 11:22:41 +00:00
Ivan Kozik
ba960e0ea8 README: fix pip3 install step for new setup.py 2018-10-04 11:22:21 +00:00
Ivan Kozik
eaaf0ec06e Use ludios/wpull for html5-parser support 2018-10-04 11:13:35 +00:00
Ivan Kozik
e664e4fd54 README: mention cookies.txt extension for Firefox 2018-09-08 04:30:01 +00:00
Ivan Kozik
424e58a173 README: document DIR/scrape 2018-08-28 01:30:35 +00:00
Ivan Kozik
eabcf70141 README: tweak wording 2018-08-28 01:27:28 +00:00
Ivan Kozik
bf0d7d28a9 README: using Googlebot UA on tumblr no longer works 2018-08-24 00:41:35 +00:00
Ivan Kozik
ca8fd22c02 singletumblr igset: don't ignore non-tumblr domains; don't apply ignores to start URLs
https://github.com/ludios/grab-site/issues/126
2018-08-06 23:36:50 +00:00
Ivan Kozik
644260c479 README: document how to bypass tumblr's GDPR consent page 2018-07-07 12:05:34 +00:00
Ivan Kozik
a3537c7f2c Revert Googlebot UA to avoid breaking reddit crawls
With Googlebot in the UA, reddit says:

429 Too Many Requests https://www.reddit.com/...
2018-07-07 12:03:23 +00:00
Ivan Kozik
aa01eb8293 README: mention updated UA 2018-06-25 02:13:09 +00:00
Ivan Kozik
a2e751f9dc README: Ubuntu 17.10 -> 18.04; show newer-distro instructions first 2018-05-19 19:31:19 +00:00
Ivan Kozik
e79cbac070 README: fix macOS install steps for PyPI now requiring TLS 1.2 support
Fixes https://github.com/ludios/grab-site/issues/121
2018-05-15 20:57:38 +00:00
Ivan Kozik
b97414c5a4 README: Python 3.4.7 -> 3.4.8 2018-05-15 20:51:02 +00:00