README: document --no-global-igset

master
Ivan Kozik 2022-08-07 08:10:58 +00:00
parent d2bf2844dc
commit 5ee2132472
1 changed files with 3 additions and 1 deletions

View File

@ -282,10 +282,12 @@ Options can come before or after the URL.
regular expressions. See [the full list of available ignore sets](https://github.com/ArchiveTeam/grab-site/tree/master/libgrabsite/ignore_sets).
The [global](https://github.com/ArchiveTeam/grab-site/blob/master/libgrabsite/ignore_sets/global)
ignore set is implied and always enabled.
ignore set is implied and enabled unless `--no-global-igset` is used.
The ignore sets can be changed during the crawl by editing the `DIR/igsets` file.
* `--no-global-igset`: don't add the [global](https://github.com/ArchiveTeam/grab-site/blob/master/libgrabsite/ignore_sets/global) ignore set.
* `--no-offsite-links`: avoid following links to a depth of 1 on other domains.
grab-site always grabs page requisites (e.g. inline images and stylesheets), even if