41 Commits

| Author | SHA1 | Message | Date |
|---|---|---|---|
| Ivan Kozik | 0e38441234 | Add OS X support | 2015-07-20 06:35:32 +00:00 |
| Ivan Kozik | 3ffed7dfbb | Tell people to use GitHub issues | 2015-07-19 20:15:23 +00:00 |
| Ivan Kozik | 55e3507122 | Tweak README | 2015-07-18 12:09:51 +00:00 |
| Ivan Kozik | 9f872f4fae | Recommend starting gs-server first | 2015-07-18 12:06:00 +00:00 |
| Ivan Kozik | 210baaa156 | Tweak README | 2015-07-18 11:25:00 +00:00 |
| Ivan Kozik | b1d5f677b0 | Link to raw.githubusercontent.com for screenshot | 2015-07-18 11:22:28 +00:00 |
| Ivan Kozik | bec8615d46 | Add dashboard screenshot | 2015-07-18 11:19:07 +00:00 |
| Ivan Kozik | 47940fd09e | Explain how to stop a crawl | 2015-07-18 10:51:17 +00:00 |
| Ivan Kozik | dc7fe9ed06 | Update install and usage instructions | 2015-07-18 10:41:24 +00:00 |
| Ivan Kozik | 1266cf6c97 | Fix typo | 2015-07-18 10:02:10 +00:00 |
| Ivan Kozik | bcd29c1837 | Mention duplicate page detection | 2015-07-18 10:01:25 +00:00 |
| Ivan Kozik | 4aeb715c0f | Mention ignore sets | 2015-07-18 09:58:17 +00:00 |
| Ivan Kozik | 8e47415e83 | Tweak README | 2015-07-18 09:54:07 +00:00 |
| Ivan Kozik | 266cf34a23 | aiohttp is required as well | 2015-07-18 09:50:42 +00:00 |
| Ivan Kozik | f4f445b7dd | igoff by default | 2015-07-18 08:23:56 +00:00 |
| Ivan Kozik | 8c9ce8c24b | ignore_sets -> igsets | 2015-07-18 06:23:24 +00:00 |
| Ivan Kozik | 3965233862 | Tweak README | 2015-07-18 06:22:47 +00:00 |
| Ivan Kozik | 02502c5260 | Tweak README | 2015-07-18 06:21:03 +00:00 |
| Ivan Kozik | 804cb0a1ee | Document grab-site dashboard | 2015-07-18 06:16:46 +00:00 |
| Ivan Kozik | 53fd04a29e | Make reconnecting work | 2015-07-18 03:17:27 +00:00 |
| Ivan Kozik | 18a192739b | Make WebSocket client/server sort of work; rename ignore_sets to igsets | 2015-07-18 02:11:18 +00:00 |
| Ivan Kozik | cc43d39a8e | Remove License note in README | 2015-07-17 23:53:28 +00:00 |
| Ivan Kozik | 5229ddf5dc | Start work on websocket server for future dashboard integration | 2015-07-17 22:42:25 +00:00 |
| Ivan Kozik | 03d1efc2ce | Clarify argument order requirement | 2015-07-17 03:59:42 +00:00 |
| Ivan Kozik | f80df6944f | Describe arguments more | 2015-03-09 05:06:44 +00:00 |
| Ivan Kozik | 611a0be845 | Cleanup | 2015-03-09 04:53:38 +00:00 |
| Ivan Kozik | 820e2aeef4 | Mention WARC files; clarify | 2015-03-09 04:52:18 +00:00 |
| Ivan Kozik | a1cbcb9ea9 | Describe what this is | 2015-03-09 04:48:27 +00:00 |
| Ivan Kozik | ccaee25497 | Link to global ignore set | 2015-02-05 19:32:47 +00:00 |
| Ivan Kozik | e2118bbea4 | Clarify | 2015-02-05 19:31:50 +00:00 |
| Ivan Kozik | 4a22b4d593 | Tell user to install git as well | 2015-02-05 19:27:19 +00:00 |
| Ivan Kozik | 65e096a035 | Support --ignore-sets= instead of the space-separated version | 2015-02-05 06:05:54 +00:00 |
| Ivan Kozik | 2d7125951f | Link to pythex | 2015-02-05 05:39:44 +00:00 |
| Ivan Kozik | f815920a83 | Document file formats | 2015-02-05 05:37:34 +00:00 |
| Ivan Kozik | d73ee5ba27 | Make it real obvious | 2015-02-05 05:34:49 +00:00 |
| Ivan Kozik | 0699689a14 | Add igoff feature | 2015-02-05 05:19:34 +00:00 |
| Ivan Kozik | 2ccb8b4d6f | Add support for --no-offsite-links | 2015-02-05 05:15:46 +00:00 |
| Ivan Kozik | 979b843458 | Load changes from DIR/ignores and DIR/ignore_sets while the crawl is running | 2015-02-05 04:59:28 +00:00 |
| Ivan Kozik | 429b2032ff | Improve README | 2015-02-05 04:27:38 +00:00 |
| Ivan Kozik | 1705174fb2 | CRLF -> LF | 2015-02-05 04:25:49 +00:00 |
| Ivan Kozik | 91fd89be5d | Add a site-grabber based on ArchiveBot's use of wpull | 2015-02-05 03:43:50 +00:00 |