Re: [htdig] word_list columns


Subject: Re: [htdig] word_list columns
From: Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Date: Thu Nov 25 1999 - 12:55:54 PST


According to Aaron Turner:
> there are 6 columns in the wordlist file. Obviously col1 is the word.
> What are the others? (i, l, w, c a)

First field: indexed word (lower case)
i: doc ID (to match up with records in db.docs.index)
l: location of word in doc (0-1000, i.e. tenth of a percent units)
w: weight of word in searches
c: no. of occurrences of word in document, if > 1
a: index into anchor list in db.docdb record, to indicate which
                anchor name, if any, preceded this word

Fields are tab separated. All of this info gets put into db.words.db by
htmerge, so htsearch doesn't actually look at db.wordlist.

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

------------------------------------ To unsubscribe from the htdig mailing list, send a message to htdig-unsubscribe@htdig.org You'll receive a message confirming the unsubscription.



This archive was generated by hypermail 2b25 : Thu Nov 25 1999 - 13:07:49 PST