HrefOnly

Name

HrefOnly -- scans matching documents for links only

indexer.conf

Synopsis

HrefOnly [Match | NoMatch] [Case | NoCase] [String | Regexp] {pattern...}

Description

The meaning of the first three optional parameters is exactly the same as in the Allow command.

HrefOnly makes indexer scan the documents with URL matching pattern for links only, without indexing the content of these document.

HrefOnly can be useful when indexing Web forums or mail archives. The pages with message lists should usually be scanned only for the links to messages, and only messages should be indexed.

Scope

HrefOnly has global effect for entire configuration file.

Examples


# When indexing mail list archives for example, the thread
# index pages (like mail.10.html, thread.21.html, etc.) should be scanned
# for links but should not be indexed:
HrefOnly */mail*.html */thread*.html
    

See also

Allow, CheckMP3, CheckMP3Only, CheckOnly, Disallow, FollowLinks, Skip, SkipIf.