# Last updated 2008-06-20, crk # # For scoop sites # Some user agents listen to crawl-delay, and we want them too # because unlike static pages, it is kind of easy to flood scoop. the value is in seconds. # # We dont want bots in the calendar area... they can get caught in a loop # Search isn't a good plan either # # /comments are gonna give us a duplicate content penalty (plus it can encourage comment spammers) # /print = duplicate content # /?count = caught some bot doing this... was using search # /?op= .... duh # /user .... no point # /css .... duh # /images .... duh # the adsense bot # we trust this enough that it will not trash the calendars User-Agent: Mediapartners-Google Disallow: User-Agent: * Crawl-Delay: 60 Disallow: /calendar Disallow: /my Disallow: /css Disallow: /images Disallow: /poll Disallow: /search Disallow: /user Disallow: /comments Disallow: /?op=poll_vote Disallow: /?op=search Disallow: /print Disallow: /?count Disallow: /~ Disallow: ~