2 # robots.txt, based on the one for http://www.wikipedia.org/ and friends
4 # Please note: There are a lot of pages on this site, and there are
5 # some misbehaved spiders out there that go _way_ too fast. If you're
6 # irresponsible, your access to the site may be blocked.
9 # advertising-related bots:
10 User-agent: Mediapartners-Google*
13 # Wikipedia work bots:
17 User-agent: Orthogaffe
20 # Crawlers that are kind enough to obey, but which we'd rather not have
21 # unless they're feeding search engines.
22 User-agent: UbiCrawler
31 # Some bots are known to be trouble, particularly those designed to copy
32 # entire sites. Please obey robots.txt.
33 User-agent: sitecheck.internetseer.com
39 User-agent: MSIECrawler
42 User-agent: SiteSnagger
45 User-agent: WebStripper
54 User-agent: Offline Explorer
60 User-agent: TeleportPro
72 User-agent: Microsoft.URL.Control
87 User-agent: Download Ninja
91 # Sorry, wget in its recursive mode is a frequent problem.
92 # Please read the man page and use it properly; there is a
93 # --wait option you can use to set the delay between hits,
100 # The 'grub' distributed client has been *very* poorly behaved.
102 User-agent: grub-client
106 # Doesn't follow robots.txt anyway, but...
112 # Hits many times per second, not acceptable
113 # http://www.nameprotect.com/botinfo.html
117 # A capture bot, downloads gazillions of pages with no public benefit
118 # http://www.webreaper.net/
119 User-agent: WebReaper
123 User-agent: TurnitinBot
126 # Disable AI harvesting bots
130 User-agent: ChatGPT-User
136 User-agent: Google-Extended
139 User-agent: Omgilibot
142 User-agent: FacebookBot
146 # Don't allow the wayback-maschine to index user-pages
147 #User-agent: ia_archiver
148 #Disallow: /wiki/User
149 #Disallow: /wiki/Benutzer
152 # Friendly, low-speed bots are welcome viewing article pages, but not
153 # dynamically-generated pages please.
155 # Inktomi's "Slurp" can read a minimum delay between hits; if your
156 # bot supports such a thing using the 'Crawl-delay' or another
157 # instruction, please let us know.
160 Disallow: /mediawiki/
163 Disallow: /Special:Random
164 Disallow: /Special%3ARandom
165 Disallow: /Special:Search
166 Disallow: /Special%3ASearch
168 ## *at least* 1 second please. preferably more :D