Re: [htdig] file size not a multiple of the pagesize


Subject: Re: [htdig] file size not a multiple of the pagesize
From: Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Date: Thu Mar 30 2000 - 13:16:31 PST


According to Josh Walton:
> Newbe here...
>
> Ran htsearch with no problems, gathered a rather large datafile, in the 2
> gig range.
>
> Ran htmerge and ended up with this:
>
> [snip]
> htmerge: 334800:zweit
> htmerge: 334900:zymography
>
> htmerge: Total word count: 334923
> DB2 problem...: /home/httpd/htdig/db/db.docdb: file size not a multiple of
> the p
> agesize
>
> htmerge: Total documents: 0
> htmerge: Total doc db size (in K): 0
>
>
> [root@beast db]# ls -l
> total 4807712
> drwxrwxrwx 2 root root 4096 Mar 28 13:19 bkup
> -rwxrwxrwx 1 root root 2147483647 Mar 28 10:49 db.docdb
> -rw-r--r-- 1 root root 2048 Mar 29 17:50 db.docs.index
> -rw-r--r-- 1 root root 1682985121 Mar 29 17:50 db.wordlist
> -rw-r--r-- 1 root root 1087792128 Mar 29 17:50 db.words.db
> [root@beast db]#
>
> system:
> VALinux dual P500 Redhat 6.1 (via VALinux)
>
> Any Ideas?

Linux has a maximum file size of 2 GB, i.e. 2^31 bytes. Your db.docdb
has reached exactly that size, so my guess is it needed more space, which
it couldn't get. You'll need to find a way to reduce your space requirements,
either by indexing less documents, or by limiting the amount of data indexed,
either via max_doc_size or max_head_length.

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

------------------------------------ To unsubscribe from the htdig mailing list, send a message to htdig-unsubscribe@htdig.org You will receive a message to confirm this.



This archive was generated by hypermail 2b28 : Thu Mar 30 2000 - 12:15:00 PST