[htdig] strange problems with files to be parsed externally

Subject: [htdig] strange problems with files to be parsed externally
From: Gergely Madarasz (gorgo@sztaki.hu)
Date: Tue Feb 29 2000 - 09:18:57 PST


I've just got a bugreport from the debian BTS saying that indexing pdf
files causes htdig to hang. First I thought it might be the bit modified
parse-doc.pl which is included in the .deb package, but it seems it is
not. The following happens: htdig only downloads the first 200000 bytes of
the pdf:
-rw-r--r-- 1 root root 200000 Feb 29 18:12 /tmp/htdext.28167
of course the parser can't handle this since the file is originally much
larger and expects additional data. What might cause this ?

Madarasz Gergely           gorgo@sztaki.hu           gorgo@linux.rulez.org
     It's practically impossible to look at a penguin and feel angry.
         Egy pingvinre gyakorlatilag lehetetlen haragosan nezni.
                   HuLUG: http://mlf.linux.rulez.org/

------------------------------------ To unsubscribe from the htdig mailing list, send a message to htdig-unsubscribe@htdig.org You will receive a message to confirm this.

This archive was generated by hypermail 2b28 : Tue Feb 29 2000 - 09:23:15 PST