[htdig3-dev] 3.2.0-022000 / HP-UX 10.20


Subject: [htdig3-dev] 3.2.0-022000 / HP-UX 10.20
From: loic@ceic.com
Date: Tue Feb 22 2000 - 04:06:07 PST


> htdig runs fine
> htmerge seems to run fine, only I get a lot of these things
>
> htmerge: Discarding wwwsthhsnl <DEF> 334 64 11 0
> htmerge: Discarding wwwsthhsnl <DEF> 383 64 4 0
> htmerge: Discarding wwwsthhsnl <DEF> 488 64 11 0
> htmerge: Discarding wwwsthhsnl <DEF> 489 64 11 0
> htmerge: Discarding wwwsthhsnl <DEF> 620 64 4 0
> htmerge: Discarding zuylen <DEF> 39 64 0 0

 It's normal, during the merge words that belong to deleted documents
are discarded. This happens even if you crawl for the first time because
text in href links are indexed as part of the the document the link points
to. If the document is not crawled for some reason (404 for instance), these
must be discarded.
 It may be a good idea to issue these messages only at a higher verbosity
level, though. Currently they are visible at -v maybe only -vvv ?

> After content-type .... nothing appears.
> "jesse" does exist in the word database.

 Any core ? Does the process finish ok ?

-- 
		Loic Dachary

24 av Secretan 75019 Paris Tel: 33 1 42 45 09 16 e-mail: loic@dachary.org URL: http://www.senga.org/

------------------------------------ To unsubscribe from the htdig3-dev mailing list, send a message to htdig3-dev-unsubscribe@htdig.org You will receive a message to confirm this.



This archive was generated by hypermail 2b28 : Tue Feb 22 2000 - 02:47:36 PST