Thunderstone Software Document Retreival and Management
Search:
Home Products News Support Contact Us About
Vortex Manual

Query Processing

These settings affect how a query is processed, eg. what documents it will match.

  • exactphrase (tri-state, off by default in Vortex, on by default in tsql)
    on by default in tsql)
    Whether to exactly resolve the noise words in phrases. If on, a phrase such as "state of the art" will only match those exact words; however this may require post-processing to resolve the noise words "of the" (potentially slower). If off, any word is permitted in place of the noise words, and no post-processing is done: faster but potentially less accurate. In version 5.01.1178072161 20070501 and later, may be set to ignorewordposition: same as off, but non-noise words are permitted in any order or position; essentially emulates behavior of a non-inverted Metamorph index with no post-processing, but on a Metamorph inverted index too.

  • prefixproc (boolean, off by default)
    Whether to do prefix processing.

  • suffixproc (boolean, on by default)
    Whether to do suffix processing.

  • defsuffrm aka defsufrm (boolean, on by default)
    Whether to remove a trailing vowel, or one of a trailing double consonant pair, after normal suffix processing, and if the word is still minwordlen or greater. This only has effect if suffix processing is enabled (suffixproc on and the original word is at least minwordlen long). Added in version 3.0.941600000 19991102.

  • rebuild (boolean, on by default)
    Whether to do word rebuilding.

  • incsd (boolean, off by default)
    Whether to include the start delimiters in hits. Ignored for w/N (within N chars or words) delimiters.

  • inced (boolean, on by default)
    Whether to include the end delimiters in hits. Ignored for w/N (within N chars or words) delimiters.

  • withinproc (boolean, on by default)
    Whether to process the w/ operator in queries.

  • minwordlen (integer, 255 by default)
    The minimum word length for prefix/suffix processing to occur. Note that this is different from qminwordlen, which is the minimum word length allowed.

  • intersects (integer, -1 by default)
    The default number of intersections (if not given in a query).

  • see (boolean, off by default)
    Whether to look up "see also" references during equivalence lookup.

  • keepeqvs (boolean, off by default)
    Whether to use equivalences for words/phrases found in the equivalence file(s) or not.

  • keepnoise (boolean, off by default)
    Whether to remove noise words from the query during query processing.

  • sdexp (string, empty by default)
    The default start delimiter expression.

  • edexp (string, empty by default)
    The default end delimiter expression.

  • reqsdelim, reqedelim (boolean, on by default)
    Whether to require the start (reqsdelim) or end (reqedelim) delimiter to actually be present in a hit. If these are turned off, then the given delimiter need not be found for a hit to match; it's as if the delimiter were "found" at the start or end of the buffer if not present. (Added in version 3.0.950300000 20000211. Previous versions behave as if these settings were off.)

  • olddelim (boolean, off by default)
    Whether to emulate "old" delimiter behavior. If turned on, it is possible for a hit to occur outside dissimilar start and end delimiters, such as in this example text:

      start-delim ... end-delim ... hit ... start-delim ... end-delim

    Here the hit is "within" the outermost start and end delimiters, but it's not within the nearest delimiters. With olddelim off (the default), this hit now does not match: it would have to occur within the nearest delimiters, which would have to be in the correct order. (Added in version 3.0.950300000 20000211. Previous versions behave as if olddelim were on.)

  • eqprefix (string)
    The name of the equivalence file. Default is builtin, which uses the built-in equivalence list.

  • ueqprefix (string)
    The name of the user equivalence file. Default is empty, except for Webinator versions where it is the USROBJECTS.tbl file in the current database.

  • suffix (list)
    The suffix list used for suffix processing (if enabled) during search. The default suffix list is:

    ' (single quote) able age aged ager ages al ally ance anced ancer ances ant ary at ate ated ater atery ates atic ed en ence enced encer ences end ent er ery es ess est ful ial ible ibler ic ical ice iced icer ices ics ide ided ider ides ier ily ing ion ious ise ised ises ish ism ist ity ive ived ives ize ized izer izes less ly ment ncy ness nt ory ous re red res ry s ship sion th tic tion ty ual ul ward

  • suffixeq (list)
    The suffix list used for suffix processing during equivalence lookup. The default suffixeq list is:

    ' (single quote) ies s

  • prefix (list)
    The prefix list used for prefix processing (if enabled) during search. The default prefix list is:

    ante anti arch auto be bi counter de dis em en ex extra fore hyper in inter mis non post pre pro re semi sub super ultra un

  • noise (list)
    The noise word list used during query processing. The default noise list is:

    a about after again ago all almost also always am an and another any anybody anyhow anyone anything anyway are as at away back be became because been before being between but by came can cannot come could did do does doing done down each else even ever every everyone everything for from front get getting go goes going gone got gotten had has have having he her here him his how i if in into is isn't it just last least left less let like make many may maybe me mine more most much my myself never no none not now of off on one onto or our ourselves out over per put putting same saw see seen shall she should so some somebody someone something stand such sure take than that the their them then there these they this those through till to too two unless until up upon us very was we went were what what's whatever when where whether which while who whoever whom whose why will with within without won't would wouldn't yet you your

  • phrasewordproc (string)
    Which words of a phrase to do suffix/wildcard processing on. The possible values are mono to treat the phrase as a monolithic word (ie. only last word processed, but entire phrase counts towards minwordlen); none for no suffix/wildcard processing on phrases; or last to process just the last word. Note that a phrase is multi-word, ie. a single word in double-quotes is not considered a phrase, and thus phrasewordproc does not apply. Added in version 4.03.1082000000 20040414. Mode none supported in version 5.01.1127760000 20050926.

  • withinmode (string)
    A space- or comma-separated unit and optional type for the "within-N" operator (eg. w/5). The unit is one of:

    • char for within-N characters

    • word for within-N words
    The optional type determines what distance the operator measures. It is one of the following:

    • radius (the default if no type specified when set) indicates all sets must be within a radius N of an "anchor" set, i.e. there is a set in the match such that all other sets are within N units right of its right edge or N units left of its left edge.

    • span indicates all sets must be within an N-unit span
    Added in version 4.03.1081200000 20040405. The optional type was added in version 5.01.1258712000 20091120; previously the only type was implicitly radius. The default setting for version 5 and earlier is char (i.e. char radius); in version 6 and later the default is word span.


DIAGNOSTICS
apicp returns nothing.


EXAMPLE

<apicp "alpostproc" "on">


CAVEATS
The apicp function was added Sep. 13 1996. Various settings were added since then and are unknown to previous versions.

Any apicp calls should take place after USER/PASS statements, but before SQL and fmt calls.

The ability to pass multiple $value arguments for string-list settings was added in version 3.0.996300000 20010728.


SEE ALSO
USER, PASS; Metamorph hit markup (p.  gif )

The Metamorph Linguistics chapter in the Texis manual


Copyright © Thunderstone Software     Last updated: Thu Mar 11 17:19:03 EST 2010
 
Home   ::   Products   ::   News   ::   Support   ::   Contact Us   ::   About
Copyright © 2010 Thunderstone Software LLC. All rights reserved.