Joe R. Jah (jjah@cloud.ccsf.cc.ca.us)
Wed, 10 Jun 1998 22:05:46 -0700 (PDT)
On Wed, 10 Jun 1998, Peter Burden wrote:
> Date: Wed, 10 Jun 1998 21:33:38 +0100
> From: Peter Burden <jphb@scit.wlv.ac.uk>
> To: htdig@sdsu.edu
> Subject: htdig: What am I doing wrong
>
> Hello,
> We've been running htdig on a medium site (some 18000 pages)
> for some time and it's been quite OK (apart from the odd time the
> database build broke the disc partition). Recent analysis of results
> has identified one or two problems. Are these configuration issues?
> Are there patches available?
>
> 1. Duplicate URLs
>
> htdig doesn't seem too good at spotting multiple different
> URLs pointing to the same page. Host name duplication
You can apply the following patches:
http://sol.ccsf.cc.ca.us/ftp/htdig-patches/3.0.8b1/Docu-def-Retr-Serv.0
http://sol.ccsf.cc.ca.us/ftp/htdig-patches/3.0.8b1/Document.cc.0
http://sol.ccsf.cc.ca.us/ftp/htdig-patches/3.0.8b1/Retriever-def.0
http://sol.ccsf.cc.ca.us/ftp/htdig-patches/3.0.8b2/Retriever.cc.0
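
For illustration only: the host name duplication described in the quoted
message can be collapsed by canonicalizing each URL before comparing it with
URLs already seen. The sketch below is not taken from the patches listed above
and makes no claim about how they work; the alias table, the example.com host
names, and the function name canonical_url are all hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit

# Hypothetical alias table: each extra name the same server answers to,
# mapped to the single name that should appear in the index.
HOST_ALIASES = {
    "example.com": "www.example.com",
}

# Ports that can be dropped because they are the scheme's default.
DEFAULT_PORTS = {"http": 80, "https": 443}


def canonical_url(url, aliases=HOST_ALIASES):
    """Return a normalized form of url so equivalent URLs compare equal."""
    parts = urlsplit(url)
    scheme = parts.scheme.lower()
    # urlsplit already lowercases hostname; map aliases onto one name.
    host = parts.hostname or ""
    host = aliases.get(host, host)
    # Keep the port only when it differs from the scheme's default.
    port = parts.port
    if port is None or port == DEFAULT_PORTS.get(scheme):
        netloc = host
    else:
        netloc = f"{host}:{port}"
    # An empty path and "/" name the same resource.
    path = parts.path or "/"
    # Drop the fragment: it is never sent to the server.
    return urlunsplit((scheme, netloc, path, parts.query, ""))
```

A crawler could then keep a set of canonical_url() results and skip any URL
whose canonical form is already in the set. This only handles aliases that are
known in advance; it does not detect two unrelated names serving the same page.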
Joe
_/ _/_/_/ _/ ____________ __o
_/ _/ _/ _/ ______________ _-\<,_
_/ _/ _/_/_/ _/ _/ ......(_)/ (_)
_/_/ oe _/ _/. _/_/ ah jjah@cloud.ccsf.cc.ca.us