291 Commits

Author  SHA1  Message  Date
Ivan Kozik  e6f830764e  Allow changing concurrency using DIR/concurrency file  2015-07-28 14:21:28 +00:00
Ivan Kozik  47c9a20ba7  Bump version  2015-07-28 14:01:42 +00:00
Ivan Kozik  1198c88f2a  Document --delay in README  2015-07-28 14:01:28 +00:00
Ivan Kozik  7ac5b07a99  Add --delay option  2015-07-28 13:57:42 +00:00
Ivan Kozik  3c28b53620  Allow changing delay (in milliseconds) using DIR/delay file  2015-07-28 13:44:51 +00:00
Ivan Kozik  4f5fb8f108  Print IGNOR messages more nicely in the console  2015-07-28 13:26:09 +00:00
Ivan Kozik  cae516eb5d  Cache these control files for 3 seconds to reduce stat calls: ignores, igsets, igoff, stop  2015-07-28 13:23:00 +00:00
Ivan Kozik  4b174ee94f  Remove unused imports  2015-07-28 12:52:42 +00:00
Ivan Kozik  4c84312462  Undo my camelCase mistake  2015-07-28 12:51:59 +00:00
Ivan Kozik  4eb2805df0  Format DUPE/OF messages more nicely in terminal  2015-07-28 12:33:59 +00:00
Ivan Kozik  37d1f2e473  directory name gen: don't try and fail to create directory with > 255 chars when given a long URL  2015-07-28 12:16:32 +00:00
Ivan Kozik  a82e4017fe  directory name gen: whitelist instead of blacklist characters  2015-07-28 12:12:35 +00:00
Ivan Kozik  2418ea04e8  dashboard: don't include '!ig ID' in the context menu regexp helper, since these are designed to be pasted into a DIR/ignores file  2015-07-28 12:01:20 +00:00
Ivan Kozik  0f1bdfd738  Don't spawn wpull in a subprocess, just import it and call its main()  2015-07-28 11:53:47 +00:00
Ivan Kozik  7cf8db39d3  Mention pipe to sort | less -S  2015-07-28 11:40:14 +00:00
Ivan Kozik  0dc440ffd8  Tweak README  2015-07-28 11:34:58 +00:00
Ivan Kozik  975f328c95  Document gs-dump-urls  2015-07-28 11:33:51 +00:00
Ivan Kozik  6bbe9fb3bb  Fix formatting  2015-07-28 11:27:50 +00:00
Ivan Kozik  e506d6a103  Add gs-dump-urls, a utility to dump URLs from a wpull.db file  2015-07-28 11:26:10 +00:00
Ivan Kozik  991718b2e2  hooks: better ws:// connect messages, slow down reconnects exponentially  2015-07-27 14:01:22 +00:00
Ivan Kozik  36f24b03b3  hooks: print which ws:// server it can't connect to  2015-07-27 13:45:04 +00:00
Ivan Kozik  dbe1deb9f0  Clarify ignore sets  2015-07-27 13:28:49 +00:00
Ivan Kozik  41f7683d98  +1 is OK  2015-07-27 08:52:48 +00:00
Ivan Kozik  015df2a0df  Link yipdw  2015-07-27 08:06:33 +00:00
Ivan Kozik  a89ef4b22b  README: add Thanks and P.S.  2015-07-27 07:59:57 +00:00
Ivan Kozik  3b5f8b4be3  Clarify --concurrency  2015-07-27 07:38:06 +00:00
Ivan Kozik  d34c1c5f34  Fix formatting  2015-07-27 07:31:43 +00:00
Ivan Kozik  493971e2a9  Merge branch 'grab-site-py3-launcher'  2015-07-27 07:27:28 +00:00
Ivan Kozik  472edf5ebc  Put all temporary files in DIR/temp; don't let ctrl-c exit grab-site before wpull  2015-07-27 07:26:54 +00:00
Ivan Kozik  99dfbe2756  Add --version  2015-07-27 07:07:18 +00:00
Ivan Kozik  4c9a935bec  Make --igsets actually work  2015-07-27 06:58:17 +00:00
Ivan Kozik  b7c2f1d1bd  Add --sitemaps/--no-sitemaps  2015-07-27 06:55:20 +00:00
Ivan Kozik  2e7d928614  Update README  2015-07-27 06:50:48 +00:00
Ivan Kozik  84b183ec84  Write proper --help text and use aliased inputs too  2015-07-27 06:44:55 +00:00
Ivan Kozik  637929ab76  First take on converting grab-site to a Python program  2015-07-27 06:32:08 +00:00
Ivan Kozik  915ed0eeae  Use cchardet for faster encoding detection (imported by wpull/thirdparty/dammit.py)  2015-07-21 03:25:56 +00:00
Ivan Kozik  8d2acd669a  README: minor tweaks  2015-07-20 09:53:13 +00:00
Ivan Kozik  a7f2ee7684  Document webarchiveplayer for viewing your WARCs  2015-07-20 09:47:22 +00:00
Ivan Kozik  5e85e00201  README: document --concurrency=  2015-07-20 09:30:51 +00:00
Ivan Kozik  c35b388677  Allow archiving archive.org content despite it being in the global ignore set  2015-07-20 09:02:19 +00:00
Ivan Kozik  08933f60e2  Clarify ?host= dashboard option  2015-07-20 08:50:47 +00:00
Ivan Kozik  35d6d780bd  Bump version  2015-07-20 08:37:46 +00:00
Ivan Kozik  3f78e5f4bf  README: use an <h3>  2015-07-20 08:30:57 +00:00
Ivan Kozik  58b560257a  README: improve docs for options  2015-07-20 08:29:37 +00:00
Ivan Kozik  9af02f122b  Unbreak README  2015-07-20 08:25:33 +00:00
Ivan Kozik  1fce3af4a0  Add --1 option for turning off recursion; document options  2015-07-20 08:23:35 +00:00
Ivan Kozik  e83375382d  README: there are control files in DIR too  2015-07-20 08:04:14 +00:00
Ivan Kozik  210c3d03b5  README: include suggestions from @ethus3h (thanks!) and wrap long lines  2015-07-20 07:50:49 +00:00
Ivan Kozik  c7a272d7ba  Document how to fix your PATH for grab-site  2015-07-20 07:25:06 +00:00
Ivan Kozik  0e38441234  Add OS X support  2015-07-20 06:35:32 +00:00