• User Attivo

    URL limitati da Robots.txt

    Salve ragazzi, in questi giorni accedendo a Google Webmaster Tool ho notato che ho ben 4.009 file limitati da Robots

    Così mi sono subito apprestato a modificare il mio vecchio robots.txt che era questo:

    User-agent: *
        Disallow: /cgi-bin/
        Disallow: /wp-content/
        Disallow: /wp-admin/
        Disallow: /wp-includes/
        Disallow: /tag/
        Disallow: /category/
        Disallow: /user/
        Disallow: /author/
    
    Disallow: /trackback/
    Disallow: */trackback/
    
    User-agent: Mediapartners-Google
    Allow: /
     
    User-agent: Adsbot-Google
    Allow: /
     
    User-agent: Googlebot-Image
    Allow: /
     
    User-agent: Googlebot-Mobile
    Allow: /
     
    User-agent: ia_archiver
    Disallow: /
    
    User-agent: duggmirror
    Disallow: /
    
    User-agent: NetMechanic
    Disallow: /
    
    User-agent: EmailCollector
    Disallow: /
    
    User-agent: Teleport
    Disallow: /
    
    User-agent: UbiCrawler
    Disallow: /
    
    User-agent: DOC
    Disallow: /
    
    User-agent: Zao
    Disallow: /
    
    User-agent: sitecheck.internetseer.com
    Disallow: /
    
    User-agent: Zealbot
    Disallow: /
    
    User-agent: MSIECrawler
    Disallow: /
    
    User-agent: SiteSnagger
    Disallow: /
    
    User-agent: WebStripper
    Disallow: /
    
    User-agent: WebCopier
    Disallow: /
    
    User-agent: Fetch
    Disallow: /
    
    User-agent: Offline Explorer
    Disallow: /
    
    User-agent: Teleport
    Disallow: /
    
    User-agent: TeleportPro
    Disallow: /
    
    User-agent: WebZIP
    Disallow: /
    
    User-agent: linko
    Disallow: /
    
    User-agent: HTTrack
    Disallow: /
    
    User-agent: Microsoft.URL.Control
    Disallow: /
    
    User-agent: Xenu
    Disallow: /
    
    User-agent: larbin
    Disallow: /
    
    User-agent: libwww
    Disallow: /
    
    User-agent: ZyBORG
    Disallow: /
    
    User-agent: Download Ninja
    Disallow: /
    
    User-agent: wget
    Disallow: /
    
    User-agent: grub-client
    Disallow: /
    
    User-agent: k2spider
    Disallow: /
    
    User-agent: NPBot
    Disallow: /
    
    User-agent: WebReaper
    Disallow: /
    

    Con questo:

    User-agent: *
        Disallow: /cgi-bin/
        Disallow: /wp-content/
        Disallow: /wp-admin/
        Disallow: /wp-includes/
    
    Allow: /wp-content/uploads/
    
    Disallow: /trackback/
    Disallow: /feed/
    Disallow: /comments/
    Disallow: */trackback/
    Disallow: */feed/
    Disallow: */comments/
    
    Disallow: /*?*
    Disallow: /*?
    
    # Disallow: /tag/
    # Disallow: /category/
    
    Sitemap: mio sito . xml
    

    Come vi sembra, c'è qualcosa che può impedire l'indicizzazione?

    Saluti


  • User Attivo

    Analizzando meglio gli url limitati ho notato una cosa, sono tutti Feed e Tags..

    Ragazzi come risolvo?