User-Agent: * Allow: / Disallow: */_borders Disallow: */_derived Disallow: */_fpclass Disallow: */_overlay Disallow: */_private Disallow: */_themes Disallow: */_vti* Disallow: */_vti* # Google Image User-agent: Googlebot-Image Disallow: Allow: /* # Google AdSense User-agent: Mediapartners-Google* Disallow: Allow: /* # Lista de bots que suelen respetar el robots.txt pero rara # vez hacen un buen uso del sitio y abusan bastante... # Añadir al gusto del consumidor... User-agent: MSIECrawler Disallow: / User-agent: WebCopier Disallow: / User-agent: HTTrack Disallow: / User-agent: Microsoft.URL.Control Disallow: / User-agent: libwww Disallow: / # Internet Archiver Wayback Machine User-agent: ia_archiver Disallow: / # digg mirror User-agent: duggmirror Disallow: / User-agent: URL_Spider_Pro Disallow: / User-agent: CherryPicker Disallow: / # # Sitemap permitido, búsquedas no. # Sitemap: http://www.planasantich.org/sitemap.xml Disallow: /?s= Disallow: /search