source: web/trunk/www/robots.txt @ 4083

Last change on this file since 4083 was 4083, checked in by Sam Hocevar, 10 years ago

Rename web/static into web/www.

  • Property svn:keywords set to Id
File size: 1.1 KB
Line 
1# $Id: robots.txt 4083 2009-12-03 22:45:30Z sam $
2
3# Do not crawl CVS and .svn directories (they are 403 Forbidden anyway)
4User-agent: *
5Disallow: CVS
6Disallow: .svn
7
8# Prevent excessive search engine hits
9Disallow: /cgi-bin/trac.cgi
10Disallow: /log
11
12# "This robot collects content from the Internet for the sole purpose of
13# helping educational institutions prevent plagiarism. [...] we compare
14# student papers against the content we find on the Internet to see if we
15# can find similarities." (http://www.turnitin.com/robot/crawlerinfo.html)
16#  --> fuck off.
17User-Agent: TurnitinBot
18Disallow: /
19
20# "NameProtect engages in crawling activity in search of a wide range of
21# brand and other intellectual property violations that may be of interest
22# to our clients." (http://www.nameprotect.com/botinfo.html)
23#  --> fuck off.
24User-Agent: NPBot
25Disallow: /
26
27# "iThenticate® is a new service we have developed to combat the piracy
28# of intellectual property and ensure the originality of written work for
29# publishers, non-profit agencies, corporations, and newspapers."
30# (http://www.slysearch.com/)
31#  --> fuck off.
32User-Agent: SlySearch
33Disallow: /
34
Note: See TracBrowser for help on using the repository browser.