Syntax: REX expression, Replace expression, field to search, where to store it
This provides alternate means of setting both the HTML fields
(Modify Date
, Title
, Description
) and any
Additional Fields. It allows getting page information from
non-default places by searching and optionally replacing the data.
New blank rows will be provided as rows are used. See below for
examples.
REX Search - Allows you to specify a REX expression to narrow
down what contents of the From Field
will be used. Leave it
empty to use the entire field.
Note that a REX Search
must be specified for the following
From
field types:
HTML
Text
.*
as the
REX Search
.
Replace - Replace
can be used to specify a subset of the
value to be stored in the To
field (or subset of the match, if
you're using REX Search
. It uses sandr
replacement
string syntax.
From Field - specifies what the source field is for the data.
HTML
- the raw HTML source of the page.Text
- the text of the page, after HTML rendering has
been applied.Title
- the HTML title of the pageAll Meta
- the contents of all meta
headers
specified in the HTML page.Meta Field ->
- the contents of a specific meta
field, specified in the next input box, From Meta Field.Keywords
- the contents of the keywords
meta
header.Description
- the contents of the description
meta
header.Mime Type
the MIME type of the page. This may have been
derived from the Content-Type
header, a
<META HTTP-EQUIV>
tag, or the URL extension, depending on
what is available.URL
- the URL of the page.URL Decoded
- the decoded version of the URL. Any %XX
'URL-safe' sequences in the URL are replaced with their real
characters. E.g. Pre%20%2D%20Expense%20Report.doc
is decoded into
Pre - Expense Report.doc
.URL Protocol
- the URL's protocol, e.g. http
.URL Host
- the host (without port number) from the URL.URL Host and Port
- the host (and port number if given) from
the URL.URL Path
- the file path from the URL.URL Path Decoded
- the file path from the URL, URL-decoded.URL Anchor
- the anchor from the URL (if any), i.e. the part
after the #
(pound sign). May not be available if already stripped.URL Query
- the query string from the URL (if any), i.e.
the part after the ?
(question mark).URL Query Var ->
- the value of the URL query-string variable
named in From Meta Field, URL-decoded.
From Meta Field - If Meta Field ->
or
URL Query Var ->
is given as the From Field, this field is
used to specify which meta field's or query var's contents to use as
data. Leave blank otherwise.
Entering text in this field will force the use of Meta Field ->
,
regardless of the From Field setting.
To Field - specifies where information should be stored.
Modified Date
, Title
, and `verb`Description` are the
standard HTML fields. If you've defined any Additional Fields,
they will also be listed as selections here.
If you just added an Additional Field, you will need to hit
Update
for the Additional Field to appear in the
To Field
list.