Note: This documentation is for an old version of Webinator. The latest documentation is here.

Primer Type

"Primer URLs" are URLs that are fetched before a crawl actually starts. They are not stored in the search database; instead they are used to "prime" Webinator with any credentials (e.g. login cookies) needed to access the rest of the site. By default, the Base URL is used, in case any session/ASP cookies are needed.
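The priming idea can be sketched outside Webinator with Python's standard library. This is only an illustration of the concept, not Webinator's implementation: the function name `make_primed_opener` and the `timeout` parameter are assumptions made for the example. Each primer URL is fetched once, the response body is thrown away, and only the cookies it set are kept for later requests.

```python
import http.cookiejar
import urllib.request


def make_primed_opener(primer_urls, timeout=10.0):
    """Fetch each primer URL once and keep only the cookies it sets.

    Hypothetical helper for illustration; not part of Webinator.
    Returns the opener (which carries the cookies) and the cookie jar.
    """
    jar = http.cookiejar.CookieJar()
    opener = urllib.request.build_opener(
        urllib.request.HTTPCookieProcessor(jar)
    )
    for url in primer_urls:
        # The body is read and discarded: primer pages are never stored.
        with opener.open(url, timeout=timeout) as response:
            response.read()
    return opener, jar
```

Any request made afterwards through the returned opener sends the cookies collected during priming, which is what lets the rest of a cookie-protected site be fetched.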

The Primer Type setting specifies which (if any) URLs are used to prime the profile:

  • None - No primer URL is used. The Base URLs are crawled as normal.

  • Base URL - The Base URLs are used to prime the walk. This differs from None in that the Base URLs are submitted once and the results discarded, and then submitted again for crawling.

    This is useful when the Base URL contains login information and the page returns only "thank you for logging in", with no other content, until the page is requested again.

  • Custom - The URLs listed in the Custom Primer URLs setting are used, as described below.
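The three Primer Type values can be read as a rule for choosing which URLs to fetch and discard before the crawl begins. The sketch below models that rule under stated assumptions: `PrimerType`, `select_primer_urls`, `run_walk`, and the `fetch` callable are hypothetical names invented for the example, and the "crawl" is reduced to fetching each Base URL once rather than following links.

```python
from enum import Enum


class PrimerType(Enum):
    NONE = "none"
    BASE_URL = "base"
    CUSTOM = "custom"


def select_primer_urls(primer_type, base_urls, custom_primer_urls):
    """Return the URLs to fetch and discard before crawling."""
    if primer_type is PrimerType.NONE:
        return []
    if primer_type is PrimerType.BASE_URL:
        return list(base_urls)
    if primer_type is PrimerType.CUSTOM:
        return list(custom_primer_urls)
    raise ValueError(f"unknown primer type: {primer_type!r}")


def run_walk(primer_type, base_urls, custom_primer_urls, fetch):
    """Prime, then fetch the Base URLs.

    `fetch(url)` is assumed to return the page body and to share one
    cookie store across calls. Only the second loop's results are kept.
    """
    for url in select_primer_urls(primer_type, base_urls, custom_primer_urls):
        fetch(url)  # result discarded; only side effects such as cookies matter
    stored = {}
    for url in base_urls:
        stored[url] = fetch(url)
    return stored
```

With `PrimerType.BASE_URL`, each Base URL is requested twice: the first response (for example a login acknowledgement) is dropped, and the second is the one that gets stored.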

For web sites protected by HTTP Basic or NTLM authentication, the Login Info setting should be used instead.


Copyright © Thunderstone Software     Last updated: Thu Mar 11 16:13:32 EST 2010