Re: [htdig3-dev] Problem: same urls in doc database

Subject: Re: [htdig3-dev] Problem: same urls in doc database
From: Geoff Hutchison
Date: Sat Feb 26 2000 - 17:08:46 PST

At 7:59 PM +0200 2/21/00, Valdas Andrulis wrote:
>It is easier to set up a web server with one document yourself than for
>me to explain all the stuff. The document does not have to be dynamic; you
>can change the modification date by hand.
>I have tried a one-server, one-document setup with the latest CVS code and
>got the same weird behavior.

I'm having a lot of trouble reproducing this. Here's what I did. I
indexed my default web tree, then ran htmerge. This gave me the right
# of documents and it only weeded out one--an intentional 404 to test
other parts of the code.

Then I "modified" a file. Since I didn't want to throw things off
with the word count and so on, I just ran touch. Then I re-ran htdig.
It gave me a whole bunch of "retrieved but not changed" messages (hmm,
need to check if HtHTTP is sending If-Modified-Since correctly)
and one "changed" message.
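
One way to check the If-Modified-Since behavior independently of htdig is
to send a conditional GET by hand. The following is a minimal Python sketch
under stated assumptions, not HtHTTP's code: the function names and the
example URL are hypothetical, and it only assumes the server honors
If-Modified-Since. A file that has been touched since the given timestamp
should come back with a full 200 response, and an untouched one with
304 Not Modified.

```python
import urllib.error
import urllib.request
from email.utils import formatdate


def conditional_request(url, last_seen_epoch):
    """Build a GET request carrying If-Modified-Since.

    last_seen_epoch is the time of the previous fetch, in seconds since
    the Unix epoch. (Hypothetical helper, for illustration only.)
    """
    req = urllib.request.Request(url)
    # HTTP dates must be in GMT, e.g. "Thu, 01 Jan 1970 00:00:00 GMT".
    req.add_header("If-Modified-Since", formatdate(last_seen_epoch, usegmt=True))
    return req


def is_unchanged(url, last_seen_epoch):
    """Return True if the server answers 304 Not Modified."""
    try:
        with urllib.request.urlopen(conditional_request(url, last_seen_epoch)):
            return False  # 2xx: the server sent the document again
    except urllib.error.HTTPError as err:
        if err.code == 304:  # urllib reports 304 as an HTTPError
            return True
        raise
```

Calling is_unchanged("http://localhost/index.html", t) with t set to the
time of the first indexing run, once before and once after the touch,
would show whether the server side of the exchange behaves as expected.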

So far so good.

I re-ran htmerge and it killed off the previous file, leaving the
same number of documents and the same number of words.

So I think I need some help from you to work this out. Can you run
htmerge with "-v -s" before and after so we can see the results? It
should tell us exactly what documents it's purging (and why) and give
us the counts.

