Re: htdig: Possible htsearch bug


George Adams (learningapache@my-dejanews.com)
Wed, 25 Nov 1998 07:28:57 -0700


 
>db.words.db should be generated from scratch from db.wordlist by htmerge.
>I'm assuming the word is actually in db.wordlist?

No, actually that's not the case.

Here is the state after indexing the site while the file containing the keyword "dalek" DOES exist:

% ls -l
-rw-rw-r-- 1 adams users 83968 Nov 25 10:20 db.docdb
-rw-rw-r-- 1 adams users 6144 Nov 25 10:20 db.docs.index
-rw-rw-r-- 1 adams users 109895 Nov 25 10:20 db.wordlist
-rw-rw-r-- 1 adams users 117760 Nov 25 10:20 db.words.db

% grep -l "dalek" *
db.docdb
db.wordlist
db.words.db

Now I remove the file containing the word "dalek" and reindex the site by running "rundig".

% ls -l
-rw-rw-r-- 1 adams users 83968 Nov 25 10:21 db.docdb
-rw-rw-r-- 1 adams users 6144 Nov 25 10:21 db.docs.index
-rw-rw-r-- 1 adams users 109740 Nov 25 10:21 db.wordlist
-rw-rw-r-- 1 adams users 117760 Nov 25 10:21 db.words.db

% grep -l "dalek" *
db.words.db

Although it appears (from the timestamps) that each of the four files are being regenerated by "rundig", db.words.db still contains the word "dalek", though no other file does. (and read/write permissions shouldn't factor into this - everything is owned by me)

-----== Sent via Deja News, The Discussion Network ==-----
http://www.dejanews.com/ Easy access to 50,000+ discussion forums
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig-request@sdsu.edu containing the single word "unsubscribe" in
the body of the message.



This archive was generated by hypermail 2.0b3 on Sat Jan 02 1999 - 16:28:53 PST