diff --git a/README.md b/README.md index 1d92f0f..d941d9f 100644 --- a/README.md +++ b/README.md @@ -359,7 +359,11 @@ no recourse. Good alternatives include OVH ([OVH](https://www.ovh.com/us/dedica #### Website requiring login / cookies -Log in to the website in Chrome or Firefox. Use the cookies.txt extension [for Chrome](https://github.com/daftano/cookies.txt) or [for Firefox](https://addons.mozilla.org/en-US/firefox/addon/cookies-txt/) extension to copy Netscape-format cookies. Paste the cookies data into a new file. Start grab-site with `--wpull-args=--load-cookies=ABSOLUTE_PATH_TO_COOKIES_FILE`. +Log in to the website in Chrome or Firefox. Use the cookies.txt extension +[for Chrome](https://github.com/daftano/cookies.txt) or +[for Firefox](https://addons.mozilla.org/en-US/firefox/addon/cookies-txt/) +extension to copy Netscape-format cookies. Paste the cookies data into a new +file. Start grab-site with `--wpull-args=--load-cookies=ABSOLUTE_PATH_TO_COOKIES_FILE`. #### Static websites; WordPress blogs; Discourse forums @@ -369,7 +373,14 @@ The defaults usually work fine. The defaults work fine except for blogs with a JavaScript-only Dynamic Views theme. -Some blogspot.com blogs use "[Dynamic Views](https://support.google.com/blogger/answer/1229061?hl=en)" themes that require JavaScript and serve absolutely no HTML content. In rare cases, you can get JavaScript-free pages by appending `?m=1` ([example](http://happinessbeyondthought.blogspot.com/?m=1)). Otherwise, you can archive parts of these blogs through Google Cache instead ([example](https://webcache.googleusercontent.com/search?q=cache:http://blog.datomic.com/)) or by using http://archive.is/ instead of grab-site. If neither of these options work, try [using grab-site with phantomjs](https://github.com/ludios/grab-site/issues/55#issuecomment-162118702). +Some blogspot.com blogs use "[Dynamic Views](https://support.google.com/blogger/answer/1229061?hl=en)" +themes that require JavaScript and serve absolutely no HTML content. In rare +cases, you can get JavaScript-free pages by appending `?m=1` +([example](http://happinessbeyondthought.blogspot.com/?m=1)). Otherwise, you +can archive parts of these blogs through Google Cache instead +([example](https://webcache.googleusercontent.com/search?q=cache:http://blog.datomic.com/)) +or by using http://archive.is/ instead of grab-site. If neither of these options work, +try [using grab-site with phantomjs](https://github.com/ludios/grab-site/issues/55#issuecomment-162118702). #### Tumblr blogs