# # Okay, change of plans. I haven't seen a new bot in years worth a damn, # but there are dozens of new bots each year that suck. # so I'm shutting the doors. If you aren't Google, go away. # User-agent: GoogleBot Disallow: /sweetie/ Disallow: /heather/photos/ Disallow: /janis/photos/ Disallow: /~scott/ Disallow: /~janis/photos/ Disallow: /~scott/bookcross.cgi Disallow: /wiki2/metric.cgi Disallow: /wiki2/reverse.cgi Disallow: /wiki2/assemble.cgi Disallow: /wiki2/spell.cgi Disallow: /wiki3/metric.cgi Disallow: /wiki3/reverse.cgi Disallow: /wiki3/assemble.cgi Disallow: /wiki3/spell.cgi Disallow: /finger.cgi Disallow: /metric.cgi Disallow: /reverse.cgi Disallow: /assemble.cgi Disallow: /spell.cgi Disallow: /documentation/db/ Disallow: /documentation/web/javascript_1.1/ Disallow: /sexcantwait/ Disallow: /~scott/Continuity/_darcs/ Disallow: /cal/cal.cgi Disallow: /index.cgi Disallow: /documentation/graphics/gl/ User-agent: ia_archiver Disallow: /sweetie/ Disallow: /heather/photos/ Disallow: /janis/photos/ Disallow: /~scott/ Disallow: /~scott/bookcross.cgi Disallow: /wiki2/metric.cgi Disallow: /wiki2/reverse.cgi Disallow: /wiki2/assemble.cgi Disallow: /wiki2/spell.cgi Disallow: /wiki3/metric.cgi Disallow: /wiki3/reverse.cgi Disallow: /wiki3/assemble.cgi Disallow: /wiki3/spell.cgi Disallow: /finger.cgi Disallow: /metric.cgi Disallow: /reverse.cgi Disallow: /assemble.cgi Disallow: /spell.cgi Disallow: /documentation/db/ Disallow: /documentation/web/javascript_1.1/ Disallow: /sexcantwait/ Disallow: /cal/cal.cgi Disallow: /index.cgi Disallow: /documentation/graphics/gl/ # robots - these .cgi's won't give you any additional information beyond what # index.cgi/wiki.cgi does, but they take a *lot* of cpu to generate! thank you... # Why are these excluded? # The CGIs are poison for bots that don't follow these rule, # or else they're expensive for me to compute and redundant with other # non-excluded pages. Regardless, you don't want to request or download them # anyway. You aren't missing anything. Trust me. User-agent: ZyBorg Disallow: / User-agent: TurnitinBot Disallow: / User-agent: FAST-WebCrawler Disallow: / User-agent: Mercator Disallow: / User-agent: psbot Disallow: / User-agent: YahooFeedSeeker Disallow: / # Why is your bot excluded? # You're reselling, commercially, information taken from my site without my # permission in violation of the license I've made the information available under. # You're wasting bandwidth, harvesting billions of web pages for some pissant # search engine that no one uses, and not even making the harvested pages available # to other pissant searchengines so that they don't have to. # In short, you're a drain on humanity, a mooch, a leech, a user, and I don't # want to help you make a quick buck atsocietys expense. User-agent: Slurp Disallow: / User-agent: msnbot Disallow: / # Why are these bots excluded? # I'm sick and tired of seeing them crawling files over and over that haven't # changed in 5 years. In the case of msnbot, they skew search results against # a bias designed to perpetuate their monopoly. Don't list me at all if you're # going to do that shit. # OmniExplorer is ignoring the catch-all rule. It thinks it's special. Fuck you, OmniExplorer. User-agent: OmniBot Disallow: / User-agent: OmniExplorer_Bot Disallow: / User-agent: LarbinWebCrawler Disallow: / # If you're not on the invite list, go away. Sorry. Too many bad apples... User-agent: * Disallow: /