Re: [htdig3-dev] Re: htdig-3.1.4 prerelease


Subject: Re: [htdig3-dev] Re: htdig-3.1.4 prerelease
From: Joe R. Jah (jjah@cloud.ccsf.cc.ca.us)
Date: Tue Dec 07 1999 - 12:05:31 PST


On Tue, 7 Dec 1999, Geoff Hutchison wrote:

> Date: Tue, 7 Dec 1999 11:07:01 -0600
> From: Geoff Hutchison <ghutchis@wso.williams.edu>
> To: htdig3-dev <htdig3-dev@htdig.org>
> Subject: [htdig3-dev] Re: htdig-3.1.4 prerelease
>
> Hi,
>
> First off, many thanks to Gilles, who has been hard at work on
> cleaning up the 3-1-x tree! The results of that work are currently on
> the site, as a prerelease for the 3.1.4 release:
> <http://www.htdig.org/files/snapshots/htdig-3.1.4-prerelease.tar.gz>
>
> As I put in the htdoc/RELEASE.html file, I'd like to release this on
> Thursday evening my time (US Central, 12/9/99), so if people could
> take a look and make sure it compiles, etc. we'd both greatly
> appreciate it. Using CVS, it's the current state of the 3-1-x branch.

I downloaded and installed it on a BSDI 4.0 box; it compiled, but htsearch
dumped core. I followed the old BSDI/htdig fix:

 . make clean
 . Remove references to regex.o from htlib/Makefile
 . rm htlib/regex.h
 . make

Everything worked, except that my old local duplicate suppressor patch:
ftp://sol.ccsf.cc.ca.us/htdig-patches/3.0.8b2/Retriever.cc.0
did not quite do its job.
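
For anyone who wants to script the "remove references to regex.o" step
of the fix above rather than edit htlib/Makefile by hand, here is a
minimal sketch. The helper name strip_object_refs and the sample OBJS
line are made up for illustration; the real layout of htlib/Makefile is
not shown in this message, so check the result before running make.

```python
import re


def strip_object_refs(makefile_text: str, obj: str = "regex.o") -> str:
    """Return makefile_text with every whitespace-delimited `obj` token removed.

    Only whole tokens are dropped, so a name such as "myregex.o" is untouched.
    """
    pattern = re.compile(r"(?<!\S)" + re.escape(obj) + r"(?!\S)[ \t]*")
    return pattern.sub("", makefile_text)


if __name__ == "__main__":
    # Hypothetical object list, NOT copied from the real htlib/Makefile.
    sample = "OBJS = String.o regex.o List.o\n"
    print(strip_object_refs(sample), end="")
```

A rule whose target is written as "regex.o:" is left alone by this
sketch, since the token then carries a trailing colon.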

Here are some stats:
____________________________________________________________________
3.1.4:
Start dig: Tue Dec 7 11:16:16 PST 1999
End dig: Tue Dec 7 11:30:37 PST 1999

-rw-r--r-- 1 jjah www 16614400 Dec 7 11:30 db.docdb
-rw-r--r-- 1 jjah www 418816 Dec 7 11:30 db.docs.index
-rw-r--r-- 1 jjah www 18819167 Dec 7 11:30 db.wordlist
-rw-r--r-- 1 jjah www 19718144 Dec 7 11:30 db.words.db

htdig: Run complete
htdig: 1 server seen:
htdig: www.ccsf.cc.ca.us:80 7069 documents

htmerge: Total word count: 88711
htmerge: Total documents: 3727
htmerge: Total doc db size (in K): 29409

3.1.3:
Start dig: Tue Dec 7 11:32:31 PST 1999
End dig: Tue Dec 7 11:47:09 PST 1999

-rw-r--r-- 1 jjah www 16571392 Dec 7 11:47 db.docdb
-rw-r--r-- 1 jjah www 416768 Dec 7 11:47 db.docs.index
-rw-r--r-- 1 jjah www 18734111 Dec 7 11:46 db.wordlist
-rw-r--r-- 1 jjah www 19638272 Dec 7 11:46 db.words.db

htdig: Run complete
htdig: 1 server seen:
htdig: www.ccsf.cc.ca.us:80 7077 documents

htmerge: Total word count: 88507
htmerge: Total documents: 3727
htmerge: Total doc db size (in K): 29409
_______________________________________________________________

As you can see, the database sizes do not vary much, but the results pages
point to the same URL MULTIPLE times in the 3.1.4 case; baffling ;-/
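
To put numbers on "do not vary much", here is a small sketch that
compares the two runs. The figures are copied from the listings above;
the function names dig_seconds and size_deltas are just illustrative.

```python
from datetime import datetime

# Start/end times and file sizes (bytes) copied from the two listings.
runs = {
    "3.1.4": {
        "start": "11:16:16", "end": "11:30:37",
        "db.docdb": 16614400, "db.docs.index": 418816,
        "db.wordlist": 18819167, "db.words.db": 19718144,
    },
    "3.1.3": {
        "start": "11:32:31", "end": "11:47:09",
        "db.docdb": 16571392, "db.docs.index": 416768,
        "db.wordlist": 18734111, "db.words.db": 19638272,
    },
}


def dig_seconds(run):
    """Wall-clock duration of a dig in seconds (same-day times assumed)."""
    fmt = "%H:%M:%S"
    delta = datetime.strptime(run["end"], fmt) - datetime.strptime(run["start"], fmt)
    return int(delta.total_seconds())


def size_deltas(new, old):
    """Per-file size difference in bytes, new minus old."""
    return {name: new[name] - old[name] for name in new if name.startswith("db.")}


if __name__ == "__main__":
    new, old = runs["3.1.4"], runs["3.1.3"]
    print("dig time (s):", dig_seconds(new), "vs", dig_seconds(old))
    for name, diff in size_deltas(new, old).items():
        print(f"{name}: {diff:+d} bytes ({100.0 * diff / old[name]:+.2f}%)")
```

Every file grows by less than half a percent from 3.1.3 to 3.1.4, and
both digs take a little over 14 minutes, so the sizes alone do not
explain the repeated URLs.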

That reminds me; has the _promised_ duplicate suppression feature been
placed in 3.2.x yet?

Regards,

Joe

-- 
     _/   _/_/_/       _/              ____________    __o
     _/   _/   _/      _/         ______________     _-\<,_
 _/  _/   _/_/_/   _/  _/                     ......(_)/ (_)
  _/_/ oe _/   _/.  _/_/ ah        jjah@cloud.ccsf.cc.ca.us
