||2 years ago|
|snarfbot||2 years ago|
|.gitignore||2 years ago|
|README.md||2 years ago|
|requirements.txt||2 years ago|
|web2text.py||2 years ago|
This will eventually be a web crawler that saves websites in plaintext files. For now please enjoy a few cli tools, written as POC. Comments, compliments, complaints, and pull requests accepted.
Command line tool that does exactly what it says on the tin. Extract the content of a web document to plain text. With a choice of two scraping engines.