562 Commits

Author SHA1 Message Date
Ivan Kozik
40d22d140e Ignore more mp3 streaming sites 2015-07-29 07:33:49 +00:00
Ivan Kozik
4a97458138 Ignore weibo share links 2015-07-29 07:33:49 +00:00
Ivan Kozik
78c4a03a33 Remove anti-loop patterns that may result in false positives 2015-07-29 07:33:49 +00:00
Ivan Kozik
7bb3c512e5 Ignore share links on IPB 2015-07-29 07:33:49 +00:00
Ivan Kozik
a1d8b4a853 Ignore /?view=getlastpost 2015-07-29 07:33:48 +00:00
Ivan Kozik
d512e1d80e Add more forum ignores 2015-07-29 07:33:48 +00:00
Ivan Kozik
39704b6598 Ignore loops on gdcvault.com 2015-07-29 07:33:48 +00:00
Ivan Kozik
da77bfc1e5 Ignore more mp3 streaming sites 2015-07-29 07:33:48 +00:00
Ivan Kozik
3571689c25 Ignore another Icecast site 2015-07-29 07:33:48 +00:00
Ivan Kozik
9547fc463a Ignore another mp3 streaming site 2015-07-29 07:33:48 +00:00
Ivan Kozik
39fc0d5bc6 Fix very broken Google Finance ignore 2015-07-29 07:33:48 +00:00
Ivan Kozik
45e64d767a Support all Google TLDs in Google Finance regexp 2015-07-29 07:33:48 +00:00
Ivan Kozik
671328c4c3 Ignore more incorrect flickr URLs 2015-07-29 07:33:48 +00:00
Ivan Kozik
a5eb63cdf9 Fix google finance ignore 2015-07-29 07:33:48 +00:00
Ivan Kozik
561e775a65 Ignore another Icecast site 2015-07-29 07:33:48 +00:00
Ivan Kozik
3ffe37d057 Fix flickr rule 2015-07-29 07:33:48 +00:00
Ivan Kozik
ac8eb4bc30 Ignore incorrect flickr URLs found by wpull 2015-07-29 07:33:48 +00:00
Ivan Kozik
0f6dda82b3 Ignore webcam streams 2015-07-29 07:33:48 +00:00
Ivan Kozik
a22bafc481 Ignore Google finance pages that wpull finds
Consider removing this after page requisites of page requisites/linked pages are not grabbed
2015-07-29 07:33:48 +00:00
Ivan Kozik
f15ac24c7e Ignore another mp3 streaming site 2015-07-29 07:33:48 +00:00
Ivan Kozik
e3661f8a4a Ignore more share links 2015-07-29 07:33:48 +00:00
Ivan Kozik
06e9b19ebc Ignore another default gravatar 2015-07-29 07:33:48 +00:00
Ivan Kozik
f1e09893bb Add nosortedindex ignore set 2015-07-29 07:33:48 +00:00
Ivan Kozik
4233beaf10 Ignore linkedin loop
Remove this when wpull has dupe detection
2015-07-29 07:33:48 +00:00
Ivan Kozik
fecf82f069 Ignore another share link 2015-07-29 07:33:48 +00:00
Ivan Kozik
375f036c11 Ignore another Icecast site 2015-07-29 07:33:48 +00:00
Ivan Kozik
4bacfdc601 Ignore /navbar.g because wpull doesn't decode the URL properly 2015-07-29 07:33:48 +00:00
Ivan Kozik
1b44b59a47 Ignore addtoany.com/share_save 2015-07-29 07:33:48 +00:00
Ivan Kozik
f4423bde70 Ignore localhost 2015-07-29 07:33:48 +00:00
Ivan Kozik
23344a0d8c Ignore https as well 2015-07-29 07:33:48 +00:00
Ivan Kozik
175b24d789 Ignore frequently-encountered wikipedia thumbnails 2015-07-29 07:33:48 +00:00
Ivan Kozik
d5ec636cba Ignore pages on draft.blogger.com 2015-07-29 07:33:48 +00:00
Ivan Kozik
0bf569d29a Ignore some reddit wiki pages 2015-07-29 07:33:48 +00:00
Ivan Kozik
37f8aafa59 Ignore more radioscoop 2015-07-29 07:33:48 +00:00
Ivan Kozik
580a204b03 Ignore more js-agent.newrelic.com 2015-07-29 07:33:48 +00:00
Ivan Kozik
68ffb8932f Ignore another Icecast site 2015-07-29 07:33:48 +00:00
Ivan Kozik
85e557f44a Ignore &mobileaction= 2015-07-29 07:33:48 +00:00
Ivan Kozik
d1efcf1fbd Remove moved rule 2015-07-29 07:33:48 +00:00
Ivan Kozik
c0e443476c Copy tumblr rule from blogs set 2015-07-29 07:33:48 +00:00
Ivan Kozik
e1452e6e5b Ignore more & 2015-07-29 07:33:48 +00:00
Ivan Kozik
399ca7eaa7 Ignore Special:ListFiles.*&user= 2015-07-29 07:33:48 +00:00
Ivan Kozik
a34f824965 Ignore some Special:ListFiles
Note: & args in URL like

https://wiki.unrealengine.com/index.php?title=Special:ListFiles&dir=prev&sort=img_size&limit=50&user=ListFiles/ListFiles/Skins.vector.js

seem to be ignored, not treated as &
2015-07-29 07:33:48 +00:00
Ivan Kozik
5f46491a17 Ignore stumbleupon without www. as well 2015-07-29 07:33:48 +00:00
Ivan Kozik
2d3d04b790 Ignore per-section edit pages 2015-07-29 07:33:48 +00:00
Ivan Kozik
88c69effc2 Ignore Special:RecentChanges&from= 2015-07-29 07:33:48 +00:00
Ivan Kozik
c9bcacbeb6 Ignore Special:RecentChangesLinked 2015-07-29 07:33:48 +00:00
Ivan Kozik
2d206de1f5 Ignore a SHOUTcast site 2015-07-29 07:33:48 +00:00
Ivan Kozik
60e89778b3 Ignore www as well 2015-07-29 07:33:48 +00:00
Ivan Kozik
14f8f6aab9 Fix literal . 2015-07-29 07:33:48 +00:00
Ivan Kozik
6ca1e37b85 Ignore another Icecast site 2015-07-29 07:33:48 +00:00