Re: [htdig] Fwd: Help with "Missing Pages"

Subject: Re: [htdig] Fwd: Help with "Missing Pages"
From: Gilles Detillieux
Date: Wed Apr 26 2000 - 08:25:14 PDT

According to Danny Summers:
> >I am new to htdig and having problems with it dropping or losing
> >indexed pages on second or subsequent digs. I can trash the URL's
> >htdig databases, do a new run and everything is there, fully
> >searchable. The next scheduled run, it drops most of the indexed
> >pages and we're back to a very few pages being found on a search. I
> >have gone through as much documentation as possible, checked the
> >configs and still can't figure it out. Any ideas?

It would help to know what exactly is executed at "the next scheduled
run". I assume this is a cron job? Could you show us the crontab entry,
and the shell script that it runs? I assume it's a fairly straightforward
run of htdig followed by htmerge, but it might help to see exactly how
it's being run.
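
For comparison, here is a minimal sketch of the kind of wrapper script I
have in mind, assuming a plain update run. The config path, the script
name in the crontab comment, and the DRY_RUN switch are all hypothetical
and not taken from your setup; only the -c option to htdig and htmerge is
standard.

```shell
#!/bin/sh
# Hypothetical cron wrapper for an update dig (names and paths are assumptions).
# Example crontab entry (assumed schedule and location):
#   0 3 * * * /usr/local/bin/update-htdig.sh

CONF="${CONF:-/etc/htdig/htdig.conf}"   # assumed config file location
DRY_RUN="${DRY_RUN:-1}"                 # 1 = only print the commands

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = 1 ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Update dig followed by a merge; htmerge only runs if htdig succeeded.
update_index() {
    run htdig -c "$CONF" && run htmerge -c "$CONF"
}

update_index
```

If your script differs from this (extra options, a different config file
for the scheduled run, or databases being moved around afterwards), that
difference is the first place I would look.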

Gilles R. Detillieux
Spinal Cord Research Centre
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930
