Ivan Kozik
|
450a5c394f
|
Bump Firefox UA
|
2016-06-01 02:16:36 +00:00 |
|
Ivan Kozik
|
febee9c85e
|
global igset: Add /%3Ca%20href= pattern
|
2016-05-29 14:00:15 +00:00 |
|
Ivan Kozik
|
fb6e01caa7
|
global igset: Add /%20https?:/ pattern
|
2016-05-27 14:03:28 +00:00 |
|
Ivan Kozik
|
aa366bbb27
|
Update grab-site URL in setup.py and dashboard
|
2016-05-27 13:53:35 +00:00 |
|
Ivan Kozik
|
842fab4b23
|
Stop listening on legacy ws port 29001
|
2016-05-22 11:06:38 +00:00 |
|
Ivan Kozik
|
f0bb696dc8
|
Actually install the favicon.ico
|
2016-05-22 11:01:53 +00:00 |
|
Ivan Kozik
|
b3c75b0ffb
|
dashboard: Add a favicon
|
2016-05-22 10:59:32 +00:00 |
|
Ivan Kozik
|
7df1761bf0
|
dashboard: Allow for another digit in the MB stat
|
2016-05-22 10:22:11 +00:00 |
|
Ivan Kozik
|
8d93776742
|
dashboard: Align the req/s stat properly
|
2016-05-22 10:18:20 +00:00 |
|
Ivan Kozik
|
38877106ef
|
Don't raise an exception if client lacks User-Agent
|
2016-05-03 07:16:48 +00:00 |
|
Ivan Kozik
|
fe2530e667
|
global igset: ignore amp%3Bamp%3Bamp%3B loops
|
2016-04-22 03:55:09 +00:00 |
|
Ivan Kozik
|
01ac84da06
|
global igset: tumblr serves 16px avatars on https now as well
|
2016-04-04 18:38:59 +00:00 |
|
Ivan Kozik
|
ffecfcabda
|
global igset: Ignore instapaper share links
|
2016-03-30 17:11:03 +00:00 |
|
Ivan Kozik
|
316db6eec4
|
grab-site 0.11
|
2016-02-25 01:08:17 +00:00 |
|
Ivan Kozik
|
bfb866dfed
|
s/started/listening/
|
2016-02-25 00:32:42 +00:00 |
|
Ivan Kozik
|
fcc3a7f904
|
Listen on port 29001 as well until everyone's old port-29001 crawls finish
|
2016-02-25 00:20:23 +00:00 |
|
Ivan Kozik
|
be8418f52c
|
Don't allow anyone to frame/iframe us
|
2016-02-25 00:01:27 +00:00 |
|
Ivan Kozik
|
0f5342b9c0
|
Factor out a sendPage
|
2016-02-24 23:58:43 +00:00 |
|
Ivan Kozik
|
25b3c9c076
|
Read bytes from file and send the correct Content-Length
|
2016-02-24 23:52:37 +00:00 |
|
Ivan Kozik
|
7486daa33f
|
Use \r\n instead of \x0d\x0a
|
2016-02-24 23:47:03 +00:00 |
|
Ivan Kozik
|
7024f1e232
|
Fix comment
|
2016-02-24 23:46:21 +00:00 |
|
Ivan Kozik
|
c20a0f76ba
|
Merge branch 'pr_76' into ws-http-same-port
|
2016-02-24 23:43:23 +00:00 |
|
12As
|
5da8e85d43
|
Update server.py
fixed errors
|
2016-02-24 16:44:13 -06:00 |
|
12As
|
e7e3030d4e
|
Update server.py based on comments in PR #76
Changed server to handle paths, added a send404 method and fixed letter case on variables
|
2016-02-24 16:03:34 -06:00 |
|
12As
|
3f0c433668
|
Update dashboard.html, per comments in PR.
|
2016-02-24 15:22:31 -06:00 |
|
12As
|
4a51ee1c83
|
Create 404 Not Found page
|
2016-02-24 15:10:33 -06:00 |
|
Ivan Kozik
|
506a7604ef
|
Rename --which-wpull-args-full to --which-wpull-command
|
2016-02-21 04:49:53 +00:00 |
|
Ivan Kozik
|
5805e4c155
|
Implement --which-wpull-args-partial and --which-wpull-args-full for figuring out which wpull arguments grab-site would use, without actually starting wpull
|
2016-02-21 04:33:14 +00:00 |
|
Ivan Kozik
|
bda4d8cf6d
|
Pass maybe_log_ignore and print_to_terminal as globals to custom_hooks.py as well
|
2016-02-21 00:53:03 +00:00 |
|
Ivan Kozik
|
c37b32bd1c
|
Implement --custom-hooks so that users can modify wpull_hook
|
2016-02-21 00:23:18 +00:00 |
|
12As
|
dec4150969
|
Change default port in wpull_hooks.py
|
2016-02-20 15:25:20 -06:00 |
|
12As
|
492625a09d
|
Change default port in dashboard.html
|
2016-02-20 15:23:31 -06:00 |
|
12As
|
53e079228d
|
Make gs-server use single port
Remove the need for gs-server to require multiple ports
|
2016-02-20 15:20:53 -06:00 |
|
Ivan Kozik
|
292682a48f
|
Bump version
|
2016-02-16 17:25:11 +00:00 |
|
Daniel Oaks
|
95012d1e0c
|
gs-server: Use env instead of py3 directly, makes virtualenvs nicer
|
2016-02-16 17:24:20 +00:00 |
|
Ivan Kozik
|
7c1afbefe0
|
Use wpull 1.2.3
|
2016-02-05 19:15:25 +00:00 |
|
Ivan Kozik
|
ef5137ae86
|
Update UA
|
2016-02-01 16:19:49 +00:00 |
|
Ivan Kozik
|
7ec2f90534
|
global igset: Ignore /CSI/CSI/ loops on blogspot
|
2016-01-12 22:44:57 +00:00 |
|
Ivan Kozik
|
0214558d5e
|
global igset: ignore bogus /search/label/CSI/ links on blogspot
|
2016-01-12 03:20:44 +00:00 |
|
Ivan Kozik
|
3b9f8c1a4c
|
global igset: ignore /CaptchaImage.axd
|
2016-01-09 21:46:35 +00:00 |
|
Ivan Kozik
|
dff87eba2f
|
global igset: also ignore www.digg.com/submit
|
2016-01-09 02:19:18 +00:00 |
|
Ivan Kozik
|
01bf6d527b
|
Bump version
|
2016-01-05 01:08:14 +00:00 |
|
Daniel Oaks
|
04179a2a99
|
server: Fix exception when clients visit ws port via a browser
|
2016-01-05 10:53:37 +10:00 |
|
Ivan Kozik
|
111ffca643
|
global igset: ignore two more loops
|
2016-01-03 02:00:26 +00:00 |
|
Ivan Kozik
|
5f14263070
|
global igset: ignore livejournal.com/identity/login.bml
|
2015-12-30 05:35:57 +00:00 |
|
Ivan Kozik
|
2acc826d56
|
lstrip '-' to avoid creating filenames that must be --'ed or quoted
|
2015-12-17 16:54:55 +00:00 |
|
Ivan Kozik
|
4ea80eec80
|
global igset: Ignore a loop on archive.org
|
2015-12-16 13:11:16 +00:00 |
|
Ivan Kozik
|
38f733f9d2
|
global igset: ignore /wp-admin/
|
2015-12-16 11:00:54 +00:00 |
|
Ivan Kozik
|
adb35ee4e3
|
Add --id=, --dir=, and --finished-warc-dir= options
|
2015-12-12 19:52:32 +00:00 |
|
Ivan Kozik
|
f8fece9ebb
|
Set NCR=1 cookie for .blogspot.com to avoid getting redirected
|
2015-12-12 18:29:53 +00:00 |
|