#
# $Id: robots.txt,v 1.8 2010/03/29 20:54:32 mertensb Exp $
#
# This file tells web-crawling spiders what not to crawl on infomart.ca
#
# We do this mostly to keep web crawlers from generating 404 errors
# for pages that don't exist anymore, since some web sites have
# outdated links to our site.
#

# Google Premium - ALLOW
User-agent: googlebot-pm
Disallow:

# Google AdSense - ALLOW
# User-agent: Mediapartners-Google*
# Disallow:

# Google standard - ALLOW everything but
# User-agent: Googlebot
# Disallow: /*/rss/

# All others
User-agent: *
Disallow: /ar/
Disallow: /clip/
Disallow: /cmt/
Disallow: /ce/
Disallow: /cnw/
Disallow: /cs/
Disallow: /doc/
Disallow: /doss/
Disallow: /download/
Disallow: /error/
Disallow: /fp/
Disallow: /fpan/
Disallow: /fpcr/
Disallow: /fpdd/
Disallow: /fpdv/
Disallow: /fphr/
Disallow: /fpir/
Disallow: /fpma/
Disallow: /fpni/
Disallow: /fppd/
Disallow: /go/
Disallow: /gp/
Disallow: /image/
Disallow: /img/
Disallow: /int/
Disallow: /ip/
Disallow: /ln/
Disallow: /login/
Disallow: /news/
Disallow: /pda/
Disallow: /pdf/
Disallow: /potp/
Disallow: /ppv/
Disallow: /prefs/
Disallow: /reg/
Disallow: /pr/
Disallow: /session/
Disallow: /blueshee
Disallow: /products
Disallow: /sla
Disallow: /support
Disallow: /sv/
Disallow: /sys/
Disallow: /telnet
Disallow: /test/
Disallow: /todays_news
Disallow: /tools/
Disallow: /*/rss/