33 lines
1.1 KiB
Markdown
33 lines
1.1 KiB
Markdown
# Babble
|
|
|
|
Standalone LLM crawler tarpit binary. Generates an endless stream of deterministic bollocks to be ingested by bots,
|
|
with plenty of links.
|
|
|
|
## Why?
|
|
|
|
- Divert and slow down LLM crawler traffic, protecting your main site
|
|
- Potentially poison LLM training data (likely not very effectuve)
|
|
- Collective defence; the more time a scraper spends swallowing babble, the less time it'll spend bulling someone
|
|
else's site
|
|
- Do your bit to protect the public commons from those who would readily see it destroyed for the sake of an investment
|
|
round
|
|
|
|
## Usage
|
|
|
|
```
|
|
--pem-dir <directory> | Directory containing `key.pem` and `cert.pem` files, enables TLS support
|
|
--sock <address> | Bind to the given socket. Defaults to 0.0.0.0:3000.
|
|
```
|
|
|
|
Deploy it in a docker environment. It's probably safe, but no reason to take chances.
|
|
|
|
If you want to be nice to crawlers that *actually abide by `robots.txt`*, perhaps add an entry to warn search engines
|
|
away from it.
|
|
|
|
## Usage terms
|
|
|
|
There are none, other than those implied by dependencies. Use it whenever and wherever you want, and in any way.
|
|
|
|
## Attribution
|
|
|
|
Fuck you, Sam Altman.
|