Note: This documentation is for an old version of Webinator. The latest documentaion is here.

Ignore robots.txt (-r)

Syntax: -r

Ignore robots.txt. Normally gw will initially get /robots.txt from any site being indexed and respect its settings for what prefixes to ignore. This option will disable the use of robots.txt and retrieve everything. Use of this option in not generally recommended. Any URLs specified with -x will still be excluded when using this option.


Copyright © Thunderstone Software     Last updated: Tue Nov 6 10:58:47 EST 2007
Copyright © 2024 Thunderstone Software LLC. All rights reserved.