Note: This documentation is for an old version of Webinator. The latest documentation is here.
Thunderstone Webinator
WWW Site Indexer Version 6.1.0
Thunderstone Software
Contents
Document Conventions
Overview
Features
Obtaining Webinator
Technical Support
Installation
Unix Download and Installation
Windows Download and Installation
Filesystem Layout
File Permissions and OS Specific Notes
Customizing Webinator's Appearance
Operation
Running the Administrative Interface
First Time Run: Quick Start
Step 1: Create an Account
Step 2: Create a Profile
Step 3: Walk the Profile
Last Step: Search
Administrative Interface Overview
Entry
Basic Walk Settings
All Walk Settings
Search Settings
Profile Tools
List/Edit URLs
List Duplicates
Test Fetch
Best Bet Groups
Walk Status
Now button
Pause/Auto button
STOP walk button
Pause walk and Make live button
Query Log
Test Search
Live Search
Profiles
Accounts
Add a User
Change Password
Delete
User Groups
Access Control
Maintenance
Documentation
Webinator Home
Logout
Basic Walk Settings
Database
Walk Summary
Notes
Base URL
Enterprise
Robots
Allow Extensions
All Extensions
Exclude Extensions
Exclusions
Crawl Delay
Parallelism
Verbosity
Rewalk Type
New
Refresh
Refresh in version 5 vs. 4
Rewalk Type Summary Table
Rewalk Schedule
Action Buttons
Advanced Walk Settings
Watch URL
Notify
Attach Logs
Categories
Categories Type
URL File
URL URL
Single Page
Page File
Page URL
Strip Queries
Ignore Case
Extra Domains
Extra Networks
Extra URLs REX
Exclusion REX
Exclusion Prefix
Exclude by Field
Additional Fields
Data from Field
Data From Field Example - Using Description for Title
Data From Field Example - using PublishDate for Modified Date
Data From Field Example - grabbing Price from meta
Data From Field Example - grabbing Price from Text
Required REX
Required Prefix
Max Page Size
Max Pages
Max Bytes
Max Depth
Max URL Size
Max Requests
Max Connection Lifetime
Page Timeout
Meta Tags
Standard Meta
All Meta
Storage Charset
Source Default Charset
XML UTF-8
Keep HTML
Keep Links
Remove Common
Ignore Tags
Keep Tags
Ignore Characters
Plugin Split
Language Analysis
CJK Mode
Word Definition
Text Search Mode
Attribute Compare Mode
Index Fields
Compound Index Fields
Extra Indexes
Spell-check Dictionaries
Primer Type
Primer URLs
Submitting the Form Directly: Custom Primer URL
Filling Out the Form: Custom Primer Variables
Checking for Bad Logins: Bad Login MM Query
Multiple Primers: Base URL MM Query
Login Info
Proxy
Proxy Login Info
Cookie Source Path
Off-Site Pages
Stay Under
Prevent Duplicates
Duplicate Check Fields
Store Refs
Inline Iframes
Max Frames
Execute JavaScript
Fetch JavaScript
JavaScript String Links
Debug JavaScript
JavaScript Memory
JavaScript Timeout
Protocols
HTTP Version
SSL Client Protocols
Authentication Schemes
Embedded Security
Entropy Source
Multiple Fetches
Follow Cross-Site Links
Max Redirects
Empty Form Redirects
Index Name
DNS Mode
Net Mode
User Agent
Mime Types
Respect Expires Header
Default Refresh Time
Minimum Refresh Time
Maximum Refresh Time
Maximum Process Size
Replication Settings
Debug Replication
Search Settings
Notes
Query Logging
Rotate Schedule
Email
Result Order
Results Style
Allow RSS
Format XSL Output
XSL File
Abstract Style
Abstract Length
Max Title Length
Max URL Display Length
Results per Page
Max User Results per Page
Page Links Shown
Results Width
Box Color
Show Advanced Search
Results Highlighting
Context Highlighting
PDF Query Highlighting
Font
Display Charset
Top HTML and Bottom HTML
CSS Stylesheet
Enable Sherlock
Top Best Bet Title
Right Best Bet Title
Top Best Bet Group
Right Best Bet Group
Top Best Bet Box Color
Right Best Bet Box Color
Top Best Bet Border Style
Right Best Bet Border Style
Right Best Bet Box Width
Authorization Method
Login Cookies
Login URL
Basic/NTLM/file Cookie Type
Login Verification URL
Unauthorized Result Query
Username Fixup
Examples
Max Docs to Auth-Check
Successful Auth Result Limit
Total Auth Timeout
Allow Authorization URL
Authorization Caching
Debug Results Authorization
Show Authorization Info
Enable Spell Check
Suggest Time Limit
Number of Suggestions
Synonyms
Main Thesaurus
Secondary Thesaurus
Translate Boolean
Allow the @ Operator
Allow Linear
Allow NOT Logic
Allow Post-Processing
Allow Wildcards
Allow Leading Wildcards
Single-Word Wildcards
Allow WITHIN Operators
Require All Words
Resolve Phrase Noise Words
Keep Noise Words
Noise List
Search Timeout
Show Error Messages
Debug SQL Level
Fast Result Counts
Proximity
Language Characters
Word Forms
Custom Suffix List
Custom Suffix Default Removal
Custom Suffix Min Length
Word Ordering
Word Proximity
Database Frequency
Document Frequency
Position in Text
Clicks from Home
Ranked Rows
XML Export Variables
Phishing Protection
Decode Displayed URLs
Visible
Results Authorization
Results Authorization Crawl Settings
Results Authorization Search Settings
Meta Search - Search multiple profiles as one
Profile Creation
Meta Search Walk Settings
Search Settings
Access Control
User Groups
Object hierarchy
Access Control Lists
Determining Effective Rights
Required Rights for Admin Actions
Walk and Search Settings
Starting and stopping a walk
Best Bets
List/Edit URLs
List Duplicates
Walk Status
Query Log
Profiles
Accounts
User Groups
Access Control
Maintenance
Running the Walker by Hand
Using dowalk
Running the Search Interface
Maintenance
Information
Thunderstone Information
Install/Upgrade
Apply a License
System Settings
System Wide Settings
Enable/Disable Access Control Lists, View/Edit Access Control Lists
Custom Thesaurus
Save Webinator Settings
Restore Webinator Settings
Test Network and Servers
Advanced Support Tools
Procedures and Examples
Searching your Index
Similarity Searching
Using the Thesaurus Feature
Page Exclusion, Robots.txt, and Meta-robots
Indexing Other Sites
Indexing Individual Pages
Reindexing on a Schedule
Checking for Web Server Errors
Removing Pages from the Database
Erasing the Entire Database
Using Multiple Databases
Integrating Webinator with your Site
Static Host
Dynamic Host and HTML
Issuing a Query Programmatically
dateSource: id vs modified
Processing Search Results
Dynamic Host and XML
Issuing a Query Programmatically
Processing Search Results
Sample ASP Code
Search Result RSS Feeds
OpenSearch Support
Using Best Bets
Quick Creation
Fully Customized
Using Access Control
Initial Lockdown
Example: User with Complete Control on One Profile
Example: User with Look and Feel Control on All Profiles
Replication
Replication Overview
Procedure
Set up the Sender Profile
Create the Receiver Profile
DataLoad API
Submission Format
Reply Format
Dataload SOAP API
Additional Fields
Overview
Populating
Sorting
Searching
SOAP API
SOAP Overview
SOAP API vs. XML Output
Getting the WSDL
Global vs. per-profile WSDLs
Configuring the SOAP Interface
Dataload SOAP API
C# example project
SOAP Links for Languages
SOAP API search Reference
search
moreLikeThis
showParents function
SOAP API admin Reference
login
listProfiles
getProfileStatus
addProfile
deleteProfile
getSettings
setSettings
getThesauruses
setThesaurus
deleteThesaurus
Thunderstone ISAPI Proxy Module
Overview
Requirements
Installing the Proxy Module
Post-Install Setup
Grant "Trust for Delegation" to the proxy machine
Configuring Internet Explorer for Passing Credentials
Configuring Webinator
Add the Proxy Machine to Cluster Members
Make the Target Profiles Visible
Enable Results Authorization for the Target Profile
Manually Configuring the Proxy Module
Troubleshooting the Proxy Module Authentication
Review Installation Steps
Machine names and SPNs
DelegConfig Diagnostic Tool
Launch IE as a different user
Reference
Database and File Usage
Walk Database Tables and Fields
Options Table Fields
Customizing the Search
Customizing the Walker
Texis ISAPI
Overview
How it Works
Settings for Texis ISAPI
Reading values from conf/texis.ini
Reading values from the Registry
IIS Manual Configuration
IIS 5.X or earlier
IIS 6 or later
CGI Mapping by Vortex File Extension
Microsoft IIS
Apache
Preferred Method: Redirect Handler
Alternate Method: Direct Execution
XML Elements in Search Results
Third-Party Software
Version Differences
Search Interface Help
Forming a Query
Query Rules of Thumb
Overview of Query Abilities
Controlling Proximity
Ranking Factors
Keywords Phrases and Wild-cards
Applying Search Logic
Natural Language Query
Using the Special Pattern Matchers
Invoking Thesaurus Expansion
Using Word Forms
Controlling Proximity
Interpreting Search Results
Viewing Match Info
Finding Similar Documents
Showing Document Parents
Copyright © Thunderstone Software
Last updated: Thu Dec 22 14:38:01 EST 2011
Copyright © 2024 Thunderstone Software LLC. All rights reserved.