|
These settings affect the way that text searches are performed. They are
equivalent to changing the corresponding parameter in the profile, or by
calling the Metamorph API function to set them (if there is an equivalent).
They are:
- minwordlen
- The smallest a word can get due to suffix and prefix
removal. Removal of trailing vowel or double consonant can make it a
letter shorter than this. Default 255.
- keepnoise
- Whether noise words should be stripped from the query
and index. Default off.
- suffixproc
- Whether suffixes should be stripped from the words to
find a match. Default on.
- prefixproc
- Whether prefixes should be stripped from the words to
find a match. Turning this on is not suggested when using a Metamorph
index. Default off.
- rebuild
- Make sure that the word found can be built from the root
and appropriate suffixes and prefixes. This increases the accuracy of
the search. Default on.
- useequiv
- Perform thesaurus lookup. If this is on then the word
and all equivalences will be searched for. If it is off then only the
query word is searched for. Default off. Aka
keepeqvs
in version 5.01.1171414736 20070213 and later.
- inc_sdexp
-
Include the start delimiter as part of the hit. This is not
generally useful in Texis unless hit offset information is being
retrieved. Default off.
- inc_edexp
-
Include the end delimiter as part of the hit. This is not
generally useful in Texis unless hit offset information is being
retrieved. Default on.
- sdexp
- Start delimiter to use: a regular expression to match
the start of a hit. The default is no delimiter.
- edexp
- End delimiter to use: a regular expression to match
the start of a hit. The default is no delimiter.
- hyphenphrase
- Controls whether a hyphen between words searches
for the phrase of the two words next to each other, or searches for
the hyphen literally. The default value of 1 will search for the two
words as a phrase. Setting it to 0 will search for a single term
including the hyphen. If you anticipate setting hyphenphrase to 0 then
you should modify the index expression to include hyphens.
- wordc
- Defines which characters consitute a word. When a
match is found the hit is expanded to include all surrounding word
characters. The value is specified as a REX character set.
The default setting is
[\alpha\'] which corresponds
to all letters and apostrophe. If you wanted to exclude apostrophe
and include digits you could say: set wordc='[\alnum]'
Added in version 3.00.942260000.
- langc
- Defines which characters consitute a language query.
This will allow suffix processing, as well as forcing the use of
wordc to qualify the hit. Normally langc should be set the same
as wordc with the addition of the phrase characters space and hyphen.
The default is
[\alpha\' \-]
Added in version 3.00.942260000.
- withinmode
-
Whether the ``within-
N'' operator (eg. w/5) denotes
within N characters (the default) or within N words.
The settings are char for character and word for words.
Added in version 4.04.1077930936 20040227.
- phrasewordproc
-
Which words of a phrase to do suffix/wildcard processing on. The
possible values are
mono to treat the phrase as a
monolithic word (ie. only last word processed, but entire phrase
counts towards minwordlen); none for no
suffix/wildcard processing on phrases; or last to process just
the last word.
Note that a phrase is multi-word, ie. a single word in double-quotes
is not considered a phrase, and thus phrasewordproc does not apply.
Added in version 4.03.1082000000 20040414. Mode none
supported in version 5.01.1127760000 20050926.
- mdparmodifyterms
-
If nonzero, allows the Metamorph query parset to modify search terms
by compression whitespace and quoting/unquoting. This is for
back-compatibility with earlier versions; enabling it will break the
information from bit 4 of
mminfo() (query offset/lengths of
sets). Added in version 5.01.1220640000 20080905.
Copyright © Thunderstone Software Last updated: Wed Sep 10 11:42:21 EDT 2008
|