Subject: Re: [htdig3-dev] 3.2.0-022000 / HP-UX 10.20
From: Geoff Hutchison (ghutchis@wso.williams.edu)
Date: Tue Feb 22 2000 - 06:15:11 PST


At 1:06 PM +0100 2/22/00, loic@ceic.com wrote:
> > htdig runs fine
> > htmerge seems to run fine, only I get a lot of these things
> >
> > htmerge: Discarding wwwsthhsnl <DEF> 334 64 11 0
> > htmerge: Discarding wwwsthhsnl <DEF> 383 64 4 0
> > htmerge: Discarding wwwsthhsnl <DEF> 488 64 11 0
> > htmerge: Discarding wwwsthhsnl <DEF> 489 64 11 0
> > htmerge: Discarding wwwsthhsnl <DEF> 620 64 4 0
> > htmerge: Discarding zuylen <DEF> 39 64 0 0
>
> This is normal: during the merge, words that belong to deleted documents
>are discarded. This happens even if you crawl for the first time, because
>text in href links is indexed as part of the document the link points
>to. If that document is not crawled for some reason (a 404, for instance),
>those words must be discarded.
> It may be a good idea to issue these messages only at a higher verbosity
>level, though. Currently they are visible at -v; maybe only at -vvv?

I agree that these are normal and, in fact, you'll see them in 3.1 too.
Personally, I'd be a bit unnerved by the part of the line after the
word, because it looks so odd. That part is new in 3.2.x.

Maybe this part should move up to -vvv, and at -v we could print some
sort of explanatory text as to why discarding the word is OK.
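
A rough sketch of that idea, in Python for brevity rather than anything
taken from htmerge itself. The function name, the (word, doc_id, raw)
tuple layout, and the wording of the -v message are all assumptions made
for illustration; the real record format is not reproduced here.

```python
def merge_words(word_entries, known_doc_ids, verbosity=0, log=print):
    """Keep entries whose document exists; report discarded ones by verbosity.

    word_entries  -- iterable of (word, doc_id, raw) tuples (hypothetical layout;
                     `raw` stands in for the rest of the record, kept opaque)
    known_doc_ids -- set of ids of documents that were actually crawled
    verbosity     -- number of -v flags given (assumed convention)
    """
    kept = []
    for word, doc_id, raw in word_entries:
        if doc_id in known_doc_ids:
            kept.append((word, doc_id, raw))
            continue
        if verbosity >= 3:
            # -vvv: show the full record, odd-looking fields included
            log(f"htmerge: Discarding {word} {raw}")
        elif verbosity >= 1:
            # -v: short, reassuring explanation only (placeholder wording)
            log(f"htmerge: Discarding {word} "
                "(its document was not indexed; this is normal)")
    return kept
```

With verbosity=1 a discarded word yields one explanatory line; with
verbosity=3 the raw record is printed instead; with verbosity=0 nothing
is printed at all.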

-Geoff
