Note: This documentation is for an old version of Webinator. The latest documentation is here.

Data from Field

 

Syntax: REX expression, Replace expression, field to search, where to store it

This provides an alternate means of setting both the HTML fields (Modify Date, Title, Description) and any Additional Fields. It allows page information to be taken from non-default places by searching a source field and optionally replacing the matched data. New blank rows are provided as existing rows are used. See below for examples.

REX Search - Allows you to specify a REX expression to narrow down what contents of the From Field will be used. Leave it empty to use the entire field.

Note that a REX Search must be specified for the following From field types:

  • HTML

  • Text
You can specify the entire field for these by using .* as the REX Search.

Replace - Replace can be used to specify a subset of the value to be stored in the To field (or a subset of the match, if you're using REX Search). It uses sandr replacement string syntax.
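
The search-then-replace flow described above can be sketched in Python. This is only an illustration of the idea: it uses Python's re module, not REX or sandr syntax, and the helper name data_from_field and the sample HTML are hypothetical. It assumes that an empty search means "use the whole field" and that the replacement is built from the match.

```python
import re

def data_from_field(source, search=None, replace=None):
    """Hypothetical helper: narrow `source` with a search pattern,
    then optionally build the stored value from the match."""
    # Assumption: an empty search selects the entire field.
    pattern = search if search else r"(?s).*"
    match = re.search(pattern, source)
    if match is None:
        return None  # nothing matched, so nothing is stored
    if replace is None:
        return match.group(0)  # store the whole match
    # Python-style template (\1, \2, ...), standing in for a sandr replacement string.
    return match.expand(replace)

# Take the first <h1> heading from the raw HTML and keep only its text.
title = data_from_field("<h1>Quarterly Report</h1>", r"<h1>(.*?)</h1>", r"\1")
print(title)
```

In Webinator itself the pattern would be written as a REX expression and the replacement as a sandr string, so the patterns here do not carry over verbatim.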

From Field - specifies what the source field is for the data.

  • HTML - the raw HTML source of the page.

  • Text - the text of the page, after HTML rendering has been applied.

  • Title - the HTML title of the page

  • All Meta - the contents of all meta headers specified in the HTML page.

  • Meta Field -> - the contents of a specific meta field, specified in the next input box, From Meta Field.

  • Keywords - the contents of the keywords meta header.

  • Description - the contents of the description meta header.

  • Mime Type - the MIME type of the page. This may have been derived from the Content-Type header, a <META HTTP-EQUIV> tag, or the URL extension, depending on what is available.

  • URL - the URL of the page.

  • URL Decoded - the decoded version of the URL. Any %XX 'URL-safe' sequences in the URL are replaced with their real characters. E.g. Pre%20%2D%20Expense%20Report.doc is decoded into Pre - Expense Report.doc.

  • URL Protocol - the URL's protocol, e.g. http.

  • URL Host - the host (without port number) from the URL.

  • URL Host and Port - the host (and port number if given) from the URL.

  • URL Path - the file path from the URL.

  • URL Path Decoded - the file path from the URL, URL-decoded.

  • URL Anchor - the anchor from the URL (if any), i.e. the part after the # (pound sign). May not be available if already stripped.

  • URL Query - the query string from the URL (if any), i.e. the part after the ? (question mark).

  • URL Query Var -> - the value of the URL query-string variable named in From Meta Field, URL-decoded.
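
As a rough illustration of how the URL-based From Field values relate to each other, the following Python sketch splits a sample URL with the standard urllib.parse module. The sample URL, the fields dictionary, and the query_var helper are assumptions made for this example; Webinator's own parsing and decoding may differ in edge cases.

```python
from urllib.parse import urlsplit, unquote, parse_qs

# Hypothetical sample URL, chosen to exercise every URL-based field.
url = "http://www.example.com:8080/docs/Pre%20%2D%20Expense%20Report.doc?id=42&lang=en#top"
parts = urlsplit(url)

fields = {
    "URL": url,
    "URL Decoded": unquote(url),            # %XX sequences replaced by real characters
    "URL Protocol": parts.scheme,
    "URL Host": parts.hostname,             # host without the port number
    "URL Host and Port": parts.netloc,      # host plus port, if one was given
    "URL Path": parts.path,
    "URL Path Decoded": unquote(parts.path),
    "URL Anchor": parts.fragment,           # part after the #
    "URL Query": parts.query,               # part after the ?
}

def query_var(url, name):
    """Return the URL-decoded value of one query-string variable, or None."""
    values = parse_qs(urlsplit(url).query).get(name)
    return values[0] if values else None

for label, value in fields.items():
    print(f"{label}: {value}")
print("URL Query Var -> id:", query_var(url, "id"))
```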

From Meta Field - If Meta Field -> or URL Query Var -> is given as the From Field, this field is used to specify which meta field's or query var's contents to use as data. Leave blank otherwise.

Entering text in this field will force the use of Meta Field ->, regardless of the From Field setting.
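
To make the idea of a named meta field concrete, here is a minimal Python sketch that collects meta name/content pairs from a page and looks one up by name. The MetaCollector class, the sample page, and the "author" field are hypothetical; this shows the concept using Python's html.parser, not how Webinator reads meta data internally.

```python
from html.parser import HTMLParser

class MetaCollector(HTMLParser):
    """Collect <meta name="..." content="..."> pairs from an HTML page."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        name = attributes.get("name")
        content = attributes.get("content")
        if name and content is not None:
            # Assumption: meta field names are matched case-insensitively.
            self.meta[name.lower()] = content

page = (
    '<html><head>'
    '<meta name="author" content="J. Smith">'
    '<meta name="keywords" content="expense, report">'
    '</head><body></body></html>'
)

collector = MetaCollector()
collector.feed(page)

# Equivalent in spirit to From Field "Meta Field ->" with From Meta Field "author".
author = collector.meta.get("author")
print(author)
```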

To Field - specifies where information should be stored. Modified Date, Title, and Description are the standard HTML fields. If you've defined any Additional Fields, they will also be listed as selections here.

If you just added an Additional Field, you will need to hit Update for the Additional Field to appear in the To Field list.



Copyright © Thunderstone Software     Last updated: Thu Dec 22 14:38:01 EST 2011