[htdig3-dev] Databases (was Re: Final Push)


Geoff Hutchison (ghutchis@wso.williams.edu)
Mon, 1 Feb 1999 18:28:16 -0400


>Seems to me that we need access by url to see if the document has
>already been indexed.

Right. But let's say we keep a (temporary) url -> docID list/database while
indexing...

Then a search request makes one lookup per document returned.
Now a search request makes two lookups per document returned...

Hmm. If given a choice between slowing down the indexing (if at all) to
speed up the search, or vice-versa, I'll choose the faster searching every
time. Besides, with a URL -> docID list, it's only needed when indexing so
you can delete it if pressed for space.

-Geoff

------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
htdig3-dev@htdig.org containing the single word "unsubscribe" in
the SUBJECT of the message.



This archive was generated by hypermail 2.0b3 on Thu Feb 04 1999 - 22:20:31 PST