Field | Description |
id | Unique record id |
Hash | Document hash for duplicate content detection |
Size | Size of the retrieved HTML document |
Visited | The date the page was modified (or fetched, if no modification date is set) |
Dlsecs | The number of seconds taken to fetch the page |
Depth | The number of URLs traversed to reach the page |
Url | The URL of the real HTML page |
Title | The title of the page |
Body | The textual content of the page |
Keywords | The keywords metadata from the page |
Description | The description metadata from the page |
Meta | Other metadata from the page, separated by newlines |
Catno | List of categories to which the URL belongs |
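
The Hash field above exists so that pages with identical content can be detected. A minimal sketch of that idea follows. It assumes SHA-256 as the hash function (the table does not say which hash is actually used), and the names `content_hash` and `find_duplicates` are hypothetical, chosen only for illustration.

```python
import hashlib


def content_hash(document: str) -> str:
    """Hash a retrieved document.

    SHA-256 is an assumption; the field table does not name the algorithm.
    """
    return hashlib.sha256(document.encode("utf-8")).hexdigest()


def find_duplicates(pages):
    """Given (url, document) pairs, return (duplicate_url, first_seen_url)
    pairs for every page whose hash matches an earlier page."""
    first_seen = {}  # hash -> URL of the first page stored with that hash
    duplicates = []
    for url, document in pages:
        digest = content_hash(document)
        if digest in first_seen:
            duplicates.append((url, first_seen[digest]))
        else:
            first_seen[digest] = url
    return duplicates
```

Comparing fixed-length hashes rather than full documents keeps the check cheap, at the cost of only catching byte-identical content.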

Field | Description |
Url | The URL of the HTML page |
Ref | The URL of a reference (link) on the HTML page |

Field | Description |
Catno | The number for the category |
Url | The URL pattern for the category |
Category | The name of the category |
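
The category table above pairs each category number with a URL pattern, which suggests how the list of categories for a page could be derived. The sketch below assumes shell-style wildcard patterns matched with Python's `fnmatch`; the actual pattern syntax is not stated in the table, and the sample rows and the function name `categories_for` are made up for illustration.

```python
from fnmatch import fnmatchcase

# Hypothetical rows: (Catno, Url pattern, Category name).
CATEGORIES = [
    (1, "http://www.example.com/products/*", "Products"),
    (2, "http://www.example.com/support/*", "Support"),
    (3, "http://www.example.com/*", "Whole site"),
]


def categories_for(url, categories=CATEGORIES):
    """Return the Catno of every category whose URL pattern matches url.

    Wildcard matching is an assumption about how patterns are interpreted.
    """
    return [catno for catno, pattern, _name in categories
            if fnmatchcase(url, pattern)]
```

A URL can match several patterns at once, which is consistent with a page record holding a list of categories rather than a single one.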

Field | Description |
Url | The URL of an HTML page that could not be retrieved |
Reason | The reason it could not be retrieved |
id | Unique record id (includes timestamp info) |

Field | Description |
id | Contains the date and time of the query (unique record id) |
Client | The hostname of the web client that performed the query |
Query | The user's query as entered |