Re: [htdig3-dev] Largest HtDig Database Size?


Jeff Hill (jhill@hronline.com)
Wed, 05 May 1999 10:33:53 -0400


Well, I don't have a TONNE of documents as we don't keep print
copies ;), but we do have over 55,000 indexed by HtDig (seems
like a lot for me). The HtDig db file sizes are:

-rw-r--r-- 1 root root 152830976 May 3 15:34 db.docdb
-rw-r--r-- 1 root root 6804480 May 3 15:34
db.docs.index
-rw-r--r-- 1 root root 115452539 May 3 15:17
db.wordlist
-rw-r--r-- 1 root root 82358272 May 3 15:17
db.words.db

I'm running this on an IDE disk and still get fine performance.
If you happen to be running Apache with PHP3 as a module,
performance should increase (as previously mentioned on this
list) by using PHP3 as a wrapper (something I'm looking at now).
For a start on that, you can see
http://www.devshed.com/Server_Side/PHP/Search_This/

Regards,

Jeff Hill

OCD Support wrote:
>
> Hi there... I know this question has been addressed before in the FAQ
> but what's the largest db that anyone has built using HtDig and how did
> it perform?
>
> We're looking at possibly using it to index a TONNE of documents for a
> client and just trying to obtain a benchmark to work with.
>
> Thanks very much.
>
> Paul Stewart
>
> ------------------------------------
> To unsubscribe from the htdig3-dev mailing list, send a message to
> htdig3-dev@htdig.org containing the single word "unsubscribe" in
> the SUBJECT of the message.

-- 

********* HR On-Line: The Network for Workplace Issues ******** ** Ph:416-604-7251 -- Fax:416-604-4708 ** http://www.hronline.com ** ------------------------------------ To unsubscribe from the htdig3-dev mailing list, send a message to htdig3-dev@htdig.org containing the single word "unsubscribe" in the SUBJECT of the message.



This archive was generated by hypermail 2.0b3 on Wed May 05 1999 - 07:40:35 PDT