Thunderstone Software Document Search, Retrieval, and Management
Search:
Webinator 2 Manual
 

Ignore robots.txt (-r)

Syntax: -r

Ignore robots.txt. Normally gw will initially get /robots.txt from any site being indexed and respect its settings for what prefixes to ignore. This option will disable the use of robots.txt and retrieve everything. Use of this option in not generally recommended. Any URLs specified with -x will still be excluded when using this option.


Copyright © Thunderstone Software     Last updated: Tue Nov 6 10:58:47 EST 2007
 
Home   ::   Products   ::   Solutions   ::   How to Buy   ::   Support   ::   Contact Us   ::   News   ::   About
Copyright © 2012 Thunderstone Software LLC. All rights reserved.