Re: htdig: DB2 problem


Jeff Hill (jhill@hronline.com)
Tue, 17 Nov 1998 08:18:27 -0500


Thank you all, it was hard disk space. Just thought 167MB free after
running rundig meant I had enough space. Finally ran with no errors with
600MB free.

Total db directory size, 311MB. So, set to index all of every page on my
site, the db directory is about 6.5% larger than all of the files
indexed.

Now I'll have to try using contrib/wordfreq/ or Geoff's method. I assume
halving the database size would not only save disk space, but speed
searches.

>used "cut -f 1 db.wordlist | uniq -c | sort -r" to determine how many
>documents each word was in, then I took the top 500 and edited the list.
                                                                ^^^^^^^^
Edited db.worklist, I assume?

Thanks again,

Jeff Hill

Iosif Fettich wrote:
>
> It's just an idea, maybe someone knows better: as far as I know, htdig
> isn't indeed creating _explicitly_ other files than what you see. But I
> can imagine that some kind of sorting will need external files - and these
> could be larger than you would expect. Can you maybe just give-it a try,
> freeing a larger part of your /tmp or so during indexing...?

********* HR On-Line: The Network for Workplace Issues ********
** Ph:416-604-7251 -- Fax:416-604-4708 ** http://www.hronline.com **
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig-request@sdsu.edu containing the single word "unsubscribe" in
the body of the message.



This archive was generated by hypermail 2.0b3 on Sat Jan 02 1999 - 16:28:49 PST