Subject: Re: [htdig] word_list columns
From: Gilles Detillieux (email@example.com)
Date: Thu Nov 25 1999 - 12:55:54 PST
According to Aaron Turner:
> there are 6 columns in the wordlist file. Obviously col1 is the word.
> What are the others? (i, l, w, c a)
First field: indexed word (lower case)
i: doc ID (to match up with records in db.docs.index)
l: location of word in doc (0-1000, i.e. tenth of a percent units)
w: weight of word in searches
c: no. of occurrences of word in document, if > 1
a: index into anchor list in db.docdb record, to indicate which
anchor name, if any, preceded this word
Fields are tab separated. All of this info gets put into db.words.db by
htmerge, so htsearch doesn't actually look at db.wordlist.
-- Gilles R. Detillieux E-mail: <firstname.lastname@example.org> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
------------------------------------ To unsubscribe from the htdig mailing list, send a message to email@example.com You'll receive a message confirming the unsubscription.
This archive was generated by hypermail 2b25 : Thu Nov 25 1999 - 13:07:49 PST