Subject: Re: [htdig] Parsing PDF files.
From: Geoff Hutchison (ghutchis@wso.williams.edu)
Date: Thu Jun 15 2000 - 15:55:14 PDT


At 1:31 PM -0400 6/15/00, Wayne Fool wrote:
>I have tried parse_doc.pl, conv_doc.pl, and doc2html.pl. All of these give
>me 14 consecutive ":=command not found" error messages, then a
>"syntax error near unexpected token '( )' " error message, and finally a
>message stating "line 83: 'parts = ( );". This is an example of the error
>messages I get with all of the above scripts when I run them manually. I
>have checked the locations of ps2ascii and pdftotext in the script and
>they are correct. The script just shuts down when run with rundig -vvv.

What version of Perl are you using? What shell do you use?
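
(Editorial aside, not part of the original reply: messages such as "command not found" and "syntax error near unexpected token" look like ones a Bourne-style shell prints, so one plausible cause is that the script is being parsed by the shell instead of by Perl. The following is a minimal sketch in Python for inspecting a script's #! line. The function names shebang_command and diagnose are hypothetical helpers made up for this illustration; they are not part of ht://Dig or of the scripts mentioned above.)

```python
import os


def shebang_command(script_path):
    """Return the words of the script's #! line, or [] if it has none."""
    with open(script_path, "rb") as fh:
        first = fh.readline().decode("utf-8", errors="replace").strip()
    if not first.startswith("#!"):
        return []
    return first[2:].split()


def diagnose(script_path):
    """Describe, as a short string, how the #! line of the script looks."""
    words = shebang_command(script_path)
    if not words:
        # Without a #! line, a shell typically falls back to running the
        # file as a shell script, which fails on Perl syntax.
        return "no #! line: the shell will try to run the Perl source itself"
    interpreter = words[0]
    if not os.path.exists(interpreter):
        return "interpreter not found: " + interpreter
    # Accept both "#!/path/to/perl" and "#!/usr/bin/env perl".
    if not any("perl" in os.path.basename(w) for w in words[:2]):
        return "#! line does not name perl: " + interpreter
    return "ok: " + " ".join(words)
```

Calling diagnose() on the full path of parse_doc.pl (wherever it is installed on your system) returns one of the strings above. If the #! line turns out to be missing or wrong, running the script explicitly as "perl parse_doc.pl" is a quick way to confirm whether Perl itself can parse it.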

>It looks like it is reading the title. Is there a way to index those words
>along with 5095 lines of text? I don't get a file returned from the search
>when I search on any of the words in the file.

You said htmerge discards these files. What does it say? (Try htmerge
-vv or more verbosity.)

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/



