README: wrap lines

This commit is contained in:
Ivan Kozik 2018-10-05 16:16:54 +00:00
parent 4473c7040e
commit e8a2163dd3

View File

@ -359,7 +359,11 @@ no recourse. Good alternatives include OVH ([OVH](https://www.ovh.com/us/dedica
#### Website requiring login / cookies
Log in to the website in Chrome or Firefox. Use the cookies.txt extension [for Chrome](https://github.com/daftano/cookies.txt) or [for Firefox](https://addons.mozilla.org/en-US/firefox/addon/cookies-txt/) extension to copy Netscape-format cookies. Paste the cookies data into a new file. Start grab-site with `--wpull-args=--load-cookies=ABSOLUTE_PATH_TO_COOKIES_FILE`.
Log in to the website in Chrome or Firefox. Use the cookies.txt extension
[for Chrome](https://github.com/daftano/cookies.txt) or
[for Firefox](https://addons.mozilla.org/en-US/firefox/addon/cookies-txt/)
extension to copy Netscape-format cookies. Paste the cookies data into a new
file. Start grab-site with `--wpull-args=--load-cookies=ABSOLUTE_PATH_TO_COOKIES_FILE`.
#### Static websites; WordPress blogs; Discourse forums
@ -369,7 +373,14 @@ The defaults usually work fine.
The defaults work fine except for blogs with a JavaScript-only Dynamic Views theme.
Some blogspot.com blogs use "[Dynamic Views](https://support.google.com/blogger/answer/1229061?hl=en)" themes that require JavaScript and serve absolutely no HTML content. In rare cases, you can get JavaScript-free pages by appending `?m=1` ([example](http://happinessbeyondthought.blogspot.com/?m=1)). Otherwise, you can archive parts of these blogs through Google Cache instead ([example](https://webcache.googleusercontent.com/search?q=cache:http://blog.datomic.com/)) or by using http://archive.is/ instead of grab-site. If neither of these options work, try [using grab-site with phantomjs](https://github.com/ludios/grab-site/issues/55#issuecomment-162118702).
Some blogspot.com blogs use "[Dynamic Views](https://support.google.com/blogger/answer/1229061?hl=en)"
themes that require JavaScript and serve absolutely no HTML content. In rare
cases, you can get JavaScript-free pages by appending `?m=1`
([example](http://happinessbeyondthought.blogspot.com/?m=1)). Otherwise, you
can archive parts of these blogs through Google Cache instead
([example](https://webcache.googleusercontent.com/search?q=cache:http://blog.datomic.com/))
or by using http://archive.is/ instead of grab-site. If neither of these options work,
try [using grab-site with phantomjs](https://github.com/ludios/grab-site/issues/55#issuecomment-162118702).
#### Tumblr blogs