diff --git a/README.md b/README.md
new file mode 100644
index 0000000..474cf22
--- /dev/null
+++ b/README.md
@@ -0,0 +1,33 @@
+# Babble
+
+Standalone LLM crawler tarpit binary. Generates an endless stream of deterministic bollocks to be ingested by bots,
+with plenty of links.
+
+## Why?
+
+- Divert and slow down LLM crawler traffic, protecting your main site
+- Potentially poison LLM training data (likely not very effective)
+- Collective defence; the more time a scraper spends swallowing babble, the less time it'll spend bullying someone
+else's site
+- Do your bit to protect the public commons from those who would readily see it destroyed for the sake of an investment
+round
+
+## Usage
+
+```
+--pem-dir | Directory containing `key.pem` and `cert.pem` files; enables TLS support
+--sock    | Bind to the given socket address. Defaults to 0.0.0.0:3000.
+```
+
+Deploy it in a Docker environment. It's probably safe, but there's no reason to take chances.
+
+If you want to be nice to crawlers that *actually abide by `robots.txt`*, perhaps add an entry to warn search engines
+away from it.
+
+## Usage terms
+
+There are none, other than those implied by its dependencies. Use it whenever and wherever you want, and in any way.
+
+## Attribution
+
+Fuck you, Sam Altman.