Thunderstone Software Document Retreival and Management
Search:
Advanced Search
Home | Products | Company | News | Tech Support | Demos | Contact Us
Vortex Manual

Query Processing

These settings affect how a query is processed, eg. what documents it will match.

  • exactphrase (tri-state, off by default in Vortex, on by default in tsql)
    Whether to exactly resolve the noise words in phrases. If on, a phrase such as ``state of the art'' will only match those exact words; however this may require post-processing to resolve (potentially slower). If off, any word is permitted in place of the noise words, and no post-processing is needed: faster but potentially less accurate. In version 5.01.1178072161 20070501 and later, may be set to ignorewordposition: same as off, but non-noise words are permitted in any order or position; essentially emulates behavior of a non-inverted Metamorph index with no-postproc, but on a Metamorph inverted index too.

  • prefixproc (boolean, off by default)
    Whether to do prefix processing.

  • suffixproc (boolean, on by default)
    Whether to do suffix processing.

  • defsuffrm aka defsufrm (boolean, on by default)
    Whether to remove a trailing vowel, or one of a trailing double consonant pair, after normal suffix processing, and if the word is still minwordlen or greater. This only has effect if suffix processing is enabled (suffixproc on and the original word is at least minwordlen long). Added in version 3.0.941600000 19991102.

  • rebuild (boolean, on by default)
    Whether to do word rebuilding.

  • incsd (boolean, off by default)
    Whether to include the start delimiters in hits.

  • inced (boolean, on by default)
    Whether to include the end delimiters in hits.

  • withinproc (boolean, on by default)
    Whether to process the w/ operator in queries.

  • minwordlen (integer, 255 by default)
    The minimum word length for prefix/suffix processing to occur. Note that this is different from qminwordlen, which is the minimum word length allowed.

  • intersects (integer, -1 by default)
    The default number of intersections (if not given in a query).

  • see (boolean, off by default)
    Whether to look up ``see also'' references during equivalence lookup.

  • keepeqvs (boolean, off by default)
    Whether to use equivalences for words/phrases found in the equivalence file(s) or not.

  • keepnoise (boolean, off by default)
    Whether to remove noise words from the query during query processing.

  • sdexp (string, empty by default)
    The default start delimiter expression.

  • edexp (string, empty by default)
    The default end delimiter expression.

  • reqsdelim, reqedelim (boolean, on by default)
    Whether to require the start (reqsdelim) or end (reqedelim) delimiter to actually be present in a hit. If these are turned off, then the given delimiter need not be found for a hit to match; it's as if the delimiter were ``found'' at the start or end of the buffer if not present. (Added in version 3.0.950300000 20000211. Previous versions behave as if these settings were off.)

  • olddelim (boolean, off by default)
    Whether to emulate ``old'' delimiter behavior. If turned on, it is possible for a hit to occur outside dissimilar start and end delimiters, such as in this example text:

      start-delim ... end-delim ... hit ... start-delim ... end-delim

    Here the hit is ``within'' the outermost start and end delimiters, but it's not within the nearest delimiters. With olddelim off (the default), this hit now does not match: it would have to occur within the nearest delimiters, which would have to be in the correct order. (Added in version 3.0.950300000 20000211. Previous versions behave as if olddelim were on.)

  • eqprefix (string)
    The name of the equivalence file. Default is builtin, which uses the built-in equivalence list.

  • ueqprefix (string)
    The name of the user equivalence file. Default is empty, except for Webinator versions where it is the USROBJECTS.tbl file in the current database.

  • suffix (list)
    The suffix list used for suffix processing (if enabled) during search. The default suffix list is:

    ' (single quote) able age aged ager ages al ally ance anced ancer ances ant ary at ate ated ater atery ates atic ed en ence enced encer ences end ent er ery es ess est ful ial ible ibler ic ical ice iced icer ices ics ide ided ider ides ier ily ing ion ious ise ised ises ish ism ist ity ive ived ives ize ized izer izes less ly ment ncy ness nt ory ous re red res ry s ship sion th tic tion ty ual ul ward

  • suffixeq (list)
    The suffix list used for suffix processing during equivalence lookup. The default suffixeq list is:

    ' (single quote) ies s

  • prefix (list)
    The prefix list used for prefix processing (if enabled) during search. The default prefix list is:

    ante anti arch auto be bi counter de dis em en ex extra fore hyper in inter mis non post pre pro re semi sub super ultra un

  • noise (list)
    The noise word list used during query processing.

  • phrasewordproc (string)
    Which words of a phrase to do suffix/wildcard processing on. The possible values are mono to treat the phrase as a monolithic word (ie. only last word processed, but entire phrase counts towards minwordlen); none for no suffix/wildcard processing on phrases; or last to process just the last word. Note that a phrase is multi-word, ie. a single word in double-quotes is not considered a phrase, and thus phrasewordproc does not apply. Added in version 4.03.1082000000 20040414. Mode none supported in version 5.01.1127760000 20050926.

  • withinmode (string)
    Whether the ``within-N'' operator (eg. w/5) denotes within N characters (the default) or within N words. The settings are char for character and word for words. Added in version 4.03.1081200000 20040405.


DIAGNOSTICS
apicp returns nothing.


EXAMPLE

<apicp "alpostproc" "on">


CAVEATS
The apicp function was added Sep. 13 1996. Various settings were added since then and are unknown to previous versions.

Any apicp calls should take place after USER/PASS statements, but before SQL and fmt calls.

The ability to pass multiple $value arguments for string-list settings was added in version 3.0.996300000 20010728.


SEE ALSO
USER, PASS; Metamorph hit markup (p.  gif )

The Metamorph Linguistics chapter in the Texis manual


Copyright © Thunderstone Software     Last updated: Wed Sep 10 11:16:28 EDT 2008
 
Home   ::   Products   ::   Company   ::   News   ::   Tech Support   ::   Demos   ::   Contact Us
Copyright © 2008 Thunderstone Software LLC. All rights reserved.