Added README
This commit is contained in:
parent
e9b8272706
commit
41a442ce0a
1 changed files with 33 additions and 0 deletions
33
README.md
Normal file
33
README.md
Normal file
|
@ -0,0 +1,33 @@
|
|||
# Babble
|
||||
|
||||
Standalone LLM crawler tarpit binary. Generates an endless stream of deterministic bollocks to be ingested by bots,
|
||||
with plenty of links.
|
||||
|
||||
## Why?
|
||||
|
||||
- Divert and slow down LLM crawler traffic, protecting your main site
|
||||
- Potentially poison LLM training data (likely not very effectuve)
|
||||
- Collective defence; the more time a scraper spends swallowing babble, the less time it'll spend bulling someone
|
||||
else's site
|
||||
- Do your bit to protect the public commons from those who would readily see it destroyed for the sake of an investment
|
||||
round
|
||||
|
||||
## Usage
|
||||
|
||||
```
|
||||
--pem-dir <directory> | Directory containing `key.pem` and `cert.pem` files, enables TLS support
|
||||
--sock <address> | Bind to the given socket. Defaults to 0.0.0.0:3000.
|
||||
```
|
||||
|
||||
Deploy it in a docker environment. It's probably safe, but no reason to take chances.
|
||||
|
||||
If you want to be nice to crawlers that *actually abide by `robots.txt`*, perhaps add an entry to warn search engines
|
||||
away from it.
|
||||
|
||||
## Usage terms
|
||||
|
||||
There are none, other than those implied by dependencies. Use it whenever and wherever you want, and in any way.
|
||||
|
||||
## Attribution
|
||||
|
||||
Fuck you, Sam Altman.
|
Loading…
Add table
Reference in a new issue