Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Mon, 21 Dec 1998 10:31:54 -0600 (CST)
According to Rodger Zeisler:
> I indexed a 1400 page site. Each of the pages was only about 6-8K. The
> resulting database looks like this:
>
> -rw-rw-r-- 1 infoserv vuser 4023296 Dec 18 14:15 db.docdb
> -rw-rw-r-- 1 infoserv vuser 271360 Dec 18 14:15 db.docs.index
> -rw-rw-r-- 1 infoserv vuser 3435360 Dec 18 14:15 db.urls
> -rw-rw-r-- 1 infoserv vuser 4041919 Dec 18 14:15 db.wordlist
> -rw-rw-r-- 1 infoserv vuser 4072448 Dec 18 14:15 db.words.db
>
> When I run htsearch with a small result set (400-500 I'm guessing), it all
> works fine. When I pick a really common word, that would result in close to
> all the pages being retrieved (like the name of the company, which exists in
> the footer of each page), htsearch dies with an internal server error. In
> the error_log it says:
>
> Virtual memory exceeded in `new'
> [Fri Dec 18 15:52:56 1998] [error] Premature end of script headers:
> /usr/local/etc/httpd/cgi-bin/htsearch
>
> Any suggestions? If your answer is 'you need more virtual memory', how is
> this accomplished? I am running on a VServers.com virtual server account.
There is a memory leak in htsearch, introduced in 3.1.0b3. If that's what
you're running, give this patch a try and let us know whether it works for
you:
--- htsearch/Display.cc.membug  Tue Dec 15 10:58:13 1998
+++ htsearch/Display.cc Mon Dec 21 10:11:07 1998
@@ -852,6 +852,7 @@
                 links = 1; // It's a hack, but it helps...
             score += config.Double("backlink_factor")
                 * (thisRef->DocBackLinks() / (double)links);
+            delete thisRef;
         }
         thisMatch->setIncompleteScore(score);
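
To illustrate why one missing delete matters for a query that matches nearly every page, here is a minimal, self-contained sketch. It is not the htsearch source: DocRef, lookup() and score_matches() are hypothetical stand-ins, and it assumes (as the patch suggests) that the reference is heap-allocated and owned by the caller. Without the delete, one object stays allocated per matching document, so memory use grows with the size of the result set.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a document reference; NOT the ht://Dig class.
struct DocRef {
    static std::size_t live;            // objects currently allocated
    int backlinks;
    explicit DocRef(int b) : backlinks(b) { ++live; }
    ~DocRef() { --live; }
    int DocBackLinks() const { return backlinks; }
};
std::size_t DocRef::live = 0;

// Hypothetical lookup: returns a heap object that the caller must delete.
DocRef *lookup(int id) { return new DocRef(id % 5); }

// Scores `matches` documents and returns how many DocRef objects this call
// left allocated. With release == false the pointer is not deleted; in the
// real bug it would simply be dropped, but here it is recorded in
// `unreleased` so the demo can clean up after itself.
std::size_t score_matches(int matches, bool release,
                          std::vector<DocRef *> &unreleased,
                          double backlink_factor = 1.0) {
    assert(matches >= 0);
    const std::size_t before = DocRef::live;
    double score = 0.0;
    for (int id = 0; id < matches; ++id) {
        DocRef *thisRef = lookup(id);
        if (thisRef) {
            int links = 1;
            score += backlink_factor
                * (thisRef->DocBackLinks() / (double)links);
            if (release)
                delete thisRef;         // the one-line fix from the patch
            else
                unreleased.push_back(thisRef);
        }
    }
    (void)score;                        // the score itself is not the point here
    return DocRef::live - before;
}
```

Calling score_matches(1400, true, kept) leaves nothing allocated, while score_matches(1400, false, kept) leaves 1400 objects outstanding, one per match.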
--
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930