website/robots.txt

# Welcome to robots.txt, the place where shunning bots is encouraged.
#
# Humans are welcome to read. Bots are welcome to follow.
#
# Policy
#
# Allowed:
# - Search engine indexers (even Google, though I hate it)
# - RSS aggregators (unless too aggressive)
# - Archival services
# - Fediverse federation stuff
#
# Disallowed:
# - Marketing or SEO crawlers
# - Aggressive and annoying bots
#
# If your piece of sloppy code gets in this list, you contribute to the
# enshittification of the web and you should fuck off. Also stay the fuck
# away from me and my data, as well as from the users I host here.
#
# If your piece of shit doesn't respect robots.txt, your IP will be blocked.
#
# If you have any questions, reach out to getimiskon at disroot dot org.
# +-------------------+
# |                   |
# |   HALL OF SHAME   |
# |                   |
# +-------------------+
# Marketing/SEO cancer
User-agent: AhrefsBot
Disallow: /
# I swear, I have to block this one in my Nginx settings. Fuck you.
# Search crawler
User-agent: ImagesiftBot
Disallow: /
# Marketing/SEO cancer
User-agent: dotbot
Disallow: /
User-agent: DotBot
Disallow: /
# Image Search Crawler
User-agent: ByteSpider
Disallow: /
# Marketing/SEO cancer
User-agent: SemrushBot
Disallow: /
User-agent: SemrushBot-SA
Disallow: /
# Social media cancer
User-agent: facebookexternalhit
Disallow: /
# kill yourself zucc
# 'Threat hunting' bullshit
User-agent: CensysInspect
Disallow: /
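# Everyone else. A minimal sketch of the "Allowed" policy at the top, assuming
# that any bot not named in the Hall of Shame is welcome by default: an empty
# Disallow value blocks nothing, so this group permits the whole site.
User-agent: *
Disallow: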
#...................../´¯¯/)
#...................,/¯.../          +----------------------------------------+
#.................../..../           |                                        |
#.............../´¯/'..'/´¯¯`·¸      | To the creators of the shitbots above: |
#.........../'/.../..../....../¨¯\   |                                        |
#..........('(....´...´... ¯~/'..')  |        FUCK YOU. DEATH TO SEO.         |
#...........\..............'...../   |      TOTAL COMMERCIAL WEB DEATH.       |
#............\....\.........._.·´    |                                        |
#.............\..............(       +----------------------------------------+
#..............\..............\
# You may ask why I'm using this kind of language.
# The reason? The bots above. There are people who just don't give a damn about
# SEO shit. They make websites and host services by themselves because it's fun.
#
# I consider myself one of those people.
#
# The thing is, you know that online hosting is NOT free.
# Yet you send requests to our servers and scrape our data without consent.
# By doing so, you add a lot of unnecessary work for us to block your bots.
# You're a disgrace. You are the reason the web is shit.
# You made people afraid to express themselves online.
# Congratulations. Enjoy your enshittified web until it collapses.
# This file is loosely based on the robots.txt file of sr.ht