Subject: [htdig] Re: strange problems with files to be parsed externally
From: Gergely Madarasz (email@example.com)
Date: Tue Feb 29 2000 - 09:23:17 PST
On Tue, 29 Feb 2000, Gergely Madarasz wrote:
> I've just received a bug report from the Debian BTS saying that indexing PDF
> files causes htdig to hang. At first I thought the cause might be the slightly
> modified parse-doc.pl that is included in the .deb package, but it seems it is
> not. What happens is this: htdig downloads only the first 200000 bytes of
> the PDF:
> -rw-r--r-- 1 root root 200000 Feb 29 18:12 /tmp/htdext.28167
> Of course the parser can't handle this, since the original file is much
> larger and the parser expects additional data. What might cause this?
Argh, I got it... it is the default max_doc_size, which cuts the download off
at that limit. So do you have any suggestions on how to handle this case?
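
To illustrate the symptom, here is a minimal sketch of a check that a wrapper around the external parser could run before handing the temporary file to the PDF converter. It assumes that a file whose size has reached the configured limit was most likely cut off. The function name looks_truncated and the way the limit is passed in are hypothetical and not part of htdig.

```python
import os


def looks_truncated(path, max_doc_size):
    """Return True if the file at `path` has reached `max_doc_size` bytes.

    A file that is exactly as large as the limit (or larger) was probably
    cut off during download, so parsing it is likely to fail.
    """
    return os.path.getsize(path) >= max_doc_size
```

With the listing above, looks_truncated("/tmp/htdext.28167", 200000) would flag the file, and the wrapper could skip it instead of hanging. The other obvious route is to raise max_doc_size in the htdig configuration file above the size of the largest PDF that should be indexed.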
--
Madarasz Gergely firstname.lastname@example.org email@example.com
It's practically impossible to look at a penguin and feel angry.
Egy pingvinre gyakorlatilag lehetetlen haragosan nezni.
HuLUG: http://mlf.linux.rulez.org/