htdig: 3.1.b2 -> 3.1.b3 performance degradation +


Joe R. Jah (jjah@cloud.ccsf.cc.ca.us)
Thu, 17 Dec 1998 10:45:06 -0800 (PST)


Hi Geoff,

Yesterday I installed htdig 3.1.b3 on my machine. I compiled it on a BSDI
box, everything was left as was except Retriever.cc, which was patched
with the old 3.0.b2 patch,
        ftp://sol.ccsf.cc.ca.us/htdig-patches/3.0.8b2/Retriever.cc.0
to exclude local duplicates, same as my 3.1.b2.

The results:

1. It takes considerably longer to search ( 10 to 20 times) than
   3.1.b2
2. Many of the pages present in 3.1.b2 results, are absent in
   3.1.b3 results.
3. I can not explain the size changes of the db.wordlist and db.words.db
   files.

   3.1.b2 DB files:

   -rw-r--r-- 1 jjah www 11360256 Dec 16 02:35 db.docdb
   -rw-r--r-- 1 jjah www 385024 Dec 16 02:35 db.docs.index
   -rw-r--r-- 1 jjah www 19231896 Dec 16 02:34 db.wordlist
   -rw-r--r-- 1 jjah www 16835584 Dec 16 02:34 db.words.db

   3.1.b3 DB files:

   -rw-r--r-- 1 jjah www 11515904 Dec 17 02:37 db.docdb
   -rw-r--r-- 1 jjah www 372736 Dec 17 02:37 db.docs.index
   -rw-r--r-- 1 jjah www 17188189 Dec 17 02:36 db.wordlist
   -rw-r--r-- 1 jjah www 17328128 Dec 17 02:36 db.words.db

I appreciate any pointer.

TIA,

Joe

     _/ _/_/_/ _/ ____________ __o
     _/ _/ _/ _/ ______________ _-\<,_
 _/ _/ _/_/_/ _/ _/ ......(_)/ (_)
  _/_/ oe _/ _/. _/_/ ah jjah@cloud.ccsf.cc.ca.us

----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig-request@sdsu.edu containing the single word "unsubscribe" in
the body of the message.



This archive was generated by hypermail 2.0b3 on Sat Jan 02 1999 - 16:29:53 PST