Robots

Name

Robots --  defines whether to respect robots.txt and robot directives (in HTTP headers, meta tags, link attributes).

indexer.conf

Synopsis

Robots {yes | no | robotstxt | xrobotstag | meta | rel}...

Description

Robots defines which robot directives should be respected when crawling:

Setting Robots to no can be useful when running mnoGoSearch for site validation purposes, as well as when crawling your own Web site.

The default value is yes, for polite crawling purposes.

Scope

Robots can be used multiple times and affects on all following Server and Realm commands until the end of the configuration file, or until the next Robots command.

Examples


# Respect all directives except the rel attribute
Robots robotstxt xrobotstag meta

# Ignore all robot directives
Robots no
      

See also

CrawlDelay, Disallow, Index, IndexIf, NoIndexIf, UseSitemap.