Re: [htdig] Problem with PDF files....


Subject: Re: [htdig] Problem with PDF files....
From: Elijah Kagan (elijah@netvision.net.il)
Date: Fri Jan 12 2001 - 04:13:58 PST


Gilles,

1. I run htdig with an explicit -c option, so it uses the correct conf
file.
2. I rewrote the external_parsers so it includes only one line...
3. ..and it is the first line in the file

Results are the same! It is still looking for an acroread!

Please, help. I am getting desperate...

-- elijah

On Thu, 11 Jan 2001, Gilles Detillieux wrote:

> According to Elijah Kagan:
> >
> > Dear Everyone
> >
> > Hope this is the correct list to send such questions. If not, accept my
> > apologies.
> >
> > When I run htdig on my files I get the following message when it comes to
> > a PDF document:
> >
> > 41:41:3:http://myserver/~elijah/document.pdf: PDF::parse: cannot find pdf
> > parser /usr/local/bin/acroread size = 1965732
> >
> > For some reason htdig looks for an Acrobat while its config file clearly
> > states:
> >
> > external_parsers: application/msword->text/html /usr/local/bin/conv_doc.pl \
> > application/postscript->text/html /usr/local/bin/conv_doc.pl \
> > application/pdf->text/html /usr/local/bin/conv_doc.pl
> >
> > The conv_doc.pl exists and working and the content type received from the
> > server is application/pdf.
> >
> > Any ideas?
> ...
> > P.S. I am running htdig 3.1.5 on a Debian system.
>
> There are a few possibilities:
>
> 1) htdig isn't looking at this config file, but another one, without
> the external_parsers definition;
> 2) there's a typo in the external_parsers definition that isn't showing up
> in the text you e-mailed above, e.g. a misspelled word or a space after
> one of the backslashes at the end of the first two lines; or
> 3) there's a definition right above your external_parsers definition that
> mistakenly ends with a backslash at the end of the line, causing your
> external_parsers definition to be swallowed up by the previous line.
>
> That htdig is attempting to invoke acroread confirms two things: a)
> the PDF file is correctly being tagged by the server as application/pdf,
> and b) htdig is not seeing a usable definition of an external parser
> for that content-type, for any of the reasons outlined above.
>
> --
> Gilles R. Detillieux E-mail: <grdetil@scrc.umanitoba.ca>
> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
> Dept. Physiology, U. of Manitoba Phone: (204)789-3766
> Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
>

------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig-unsubscribe@htdig.org
You will receive a message to confirm this.
List archives: <http://www.htdig.org/mail/menu.html>
FAQ: <http://www.htdig.org/FAQ.html>



This archive was generated by hypermail 2b28 : Fri Jan 12 2001 - 04:28:00 PST