Re: htdig: What am I doing wrong


Joe R. Jah (jjah@cloud.ccsf.cc.ca.us)
Wed, 10 Jun 1998 22:05:46 -0700 (PDT)


On Wed, 10 Jun 1998, Peter Burden wrote:

> Date: Wed, 10 Jun 1998 21:33:38 +0100
> From: Peter Burden <jphb@scit.wlv.ac.uk>
> To: htdig@sdsu.edu
> Subject: htdig: What am I doing wrong
>
> Hello,
> We've been running htdig on a medium site (some 18000 pages)
> for some time and it's been quite OK (apart from the odd time the
> database build broke the disc partition). Recent analysis of results
> has identified one or two problems. Are these configuration issues?
> Are there patches available?
>
> 1. Duplicate URLs
>
> htdig doesn't seem too good at spotting multiple different
> URLs pointing to the same page. Host name duplication

You can apply the following patches:

  http://sol.ccsf.cc.ca.us/ftp/htdig-patches/3.0.8b1/Docu-def-Retr-Serv.0
  http://sol.ccsf.cc.ca.us/ftp/htdig-patches/3.0.8b1/Document.cc.0
  http://sol.ccsf.cc.ca.us/ftp/htdig-patches/3.0.8b1/Retriever-def.0
  http://sol.ccsf.cc.ca.us/ftp/htdig-patches/3.0.8b2/Retriever.cc.0

Joe

     _/ _/_/_/ _/ ____________ __o
     _/ _/ _/ _/ ______________ _-\<,_
 _/ _/ _/_/_/ _/ _/ ......(_)/ (_)
  _/_/ oe _/ _/. _/_/ ah jjah@cloud.ccsf.cc.ca.us



