Geoff Hutchison (email@example.com)
Tue, 14 Sep 1999 12:02:10 -0500
At 10:10 PM -0400 9/12/99, Patrick wrote:
>The document counter is being incremented even when a document
>is not found or is a redirect. I believe the document counter
>is in Server.cc (about line 274, reads: "_documents++;")
You're correct. The problem is that the Server class and the
Retriever class don't talk to each other much (if at all).
For another example, take the documents forbidden by robots.txt
files. The Server class blocks retrieval, but the Retriver class
doesn't have it in the limits and blindly adds it.
The result is an empty document--it wasn't retrieved since it was
forbidden, but a DocumentRef was added to the database for it.
I guess Server/Retriever code needs to be on the rewrite list...
To unsubscribe from the htdig3-dev mailing list, send a message to
firstname.lastname@example.org containing the single word "unsubscribe" in
the SUBJECT of the message.
This archive was generated by hypermail 2.0b3 on Tue Sep 14 1999 - 10:08:00 PDT