Re: [htdig3-dev] db sizes with 3.1.5


Subject: Re: [htdig3-dev] db sizes with 3.1.5
From: C.H.Liddiard@qmw.ac.uk
Date: Tue Feb 29 2000 - 05:27:12 PST


>I just upgraded to 3.1.5 from 3.1.2; thanks for maintaining this
>software.
>
>With 3.1.5 I noticed the files in the db directory are about 3 times
>bigger than with 3.1.2; is this normal? I couldn't find any mention
>of this in the release notes on the web site.
>
>Here are the listings of the two directories:
>
>htdig-3.1.2/db:
>total 66704
>21888 -rw-r--r-- 1 root other 11191296 Feb 28 05:05 db.docdb
> 800 -rw-r--r-- 1 root other 401408 Feb 28 05:05 db.docs.index
>22704 -rw-r--r-- 1 root other 11611928 Feb 28 05:05 db.wordlist
>21312 -rw-r--r-- 1 root other 10903552 Feb 28 05:05 db.words.db
>
>htdig-3.1.5/db:
>total 163712
>38048 -rw-r--r-- 1 root other 19453952 Feb 28 12:23 db.docdb
>1104 -rw-r--r-- 1 root other 555008 Feb 28 12:23 db.docs.index
>65408 -rw-r--r-- 1 root other 33462478 Feb 28 12:23 db.wordlist
>59152 -rw-r--r-- 1 root other 30256128 Feb 28 12:23 db.words.db
>
>------------------------------------
>To unsubscribe from the htdig3-dev mailing list, send a message to
>htdig3-dev-unsubscribe@htdig.org
>You will receive a message to confirm this.

I noticed something similar when I was testing 3.1.5. I got the foolowing sizes

/info/htdig/sitea
total 34636
-rw-r--r-- 1 htdig htdig 2007040 Feb 29 00:30 db.docdb
-rw-r--r-- 1 htdig htdig 84992 Feb 29 00:30 db.docs.index
-rw-r--r-- 1 htdig htdig 2448955 Feb 29 00:30 db.wordlist
-rw-r--r-- 1 htdig htdig 3571712 Feb 29 00:30 db.words.db

/info/htdig/siteb
total 39040
-rw-r--r-- 1 htdig htdig 4835328 Feb 29 10:42 db.docdb
-rw-r--r-- 1 htdig htdig 482304 Feb 29 10:42 db.docs.index
-rw-r--r-- 1 htdig htdig 7287520 Feb 29 10:42 db.wordlist
-rw-r--r-- 1 htdig htdig 7333888 Feb 29 10:42 db.words.db

/info/htdig/sitec
total 26416
-rw-r--r-- 1 htdig htdig 4890624 Feb 29 13:00 db.docdb
-rw-r--r-- 1 htdig htdig 487424 Feb 29 13:00 db.docs.index
-rw-r--r-- 1 htdig htdig 3605764 Feb 29 13:00 db.wordlist
-rw-r--r-- 1 htdig htdig 4495360 Feb 29 13:00 db.words.db

These all represent digs of the same sites using identical config files. In
the case of sitea we had run an initial dig many months ago and were just
running updates on it nightly. Siteb represents the database sizes I got
when I ran 3.1.5 which show similar increases to Rusty Wright's database
sizes. As sitea was only updating and not an initial run I thought I would
do an initial run which I did as sitec. Imagine my surprise when the
database sizes changed drastically. Obviously there are bugs in updating
with 3.1.2 and the difference between 3.1.2 and 3.1.5 was, in fact, a lot
less than I had imagined. I will now be interested to run 3.1.5 on my other
sites which are far larger than the above but which I ran an initial dig on
at Christmas.

--
___________________________________________________________________________
Chris Liddiard                  TEL     +44(0) 20 7882 5364
Systems Maintenance             FAX	+44(0) 20 8980 2001
Computing Services
Queen Mary & Westfield College
University of London
Mile End Road                   email: C.H.Liddiard@qmw.ac.uk
London E1 4NS
UK
__________________________________________________________________________

------------------------------------ To unsubscribe from the htdig3-dev mailing list, send a message to htdig3-dev-unsubscribe@htdig.org You will receive a message to confirm this.



This archive was generated by hypermail 2b28 : Tue Feb 29 2000 - 05:31:21 PST