[htdig] indexing pdf files


Subject: [htdig] indexing pdf files
From: Sid Wilroy (swilroy@transmeta.com)
Date: Wed May 10 2000 - 16:11:15 PDT


I've spent the whole day trying...
This is what I have in my htdig.conf file:

external_parsers: application/pdf /export/home/htdig/bin/parse_doc.pl

This is what I have in my parse_doc.pl:

$CATPDF = /export/home/xpdf/bin/pdftotext";
$PDFINFO = "/export/home/xdpf/bin/pdfinfo";
$CATPDF = "/export/home/xpdf/bin/pdftotext";
$PDFINFO = "/export/home/xpdf/bin/pdfinfo";

I'm getting errors when I run $rundig:

String found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 114, near "$parsecmd = ""
  (Might be a runaway multi-line "" string starting on line 107)
        (Missing semicolon on previous line?)
Scalar found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 114, at end of line
        (Missing operator before ?)
Scalar found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 116, at end of line
  (Might be a runaway multi-line "" string starting on line 114)
        (Missing operator before ?)
Scalar found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 116, at end of line
        (Missing operator before ?)
Bareword found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 117, near "$type = "PDF"
  (Might be a runaway multi-line "" string starting on line 116)
        (Missing operator before PDF?)
String found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 119, near "if (open(INFO, ""
  (Might be a runaway multi-line "" string starting on line 117)
        (Missing semicolon on previous line?)
Scalar found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 119, at end of line
        (Do you need to predeclare if?)
Scalar found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 119, at end of line
        (Missing operator before ?)
Number found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 119, near "] 2"
        (Missing operator before 2?)
Bareword found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 119, near "/dev/null"
        (Missing operator before null?)
Scalar found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 136, at end of line
  (Might be a runaway multi-line "" string starting on line 119)
        (Missing operator before ?)
Scalar found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 136, at end of line
        (Missing operator before ?)
Bareword found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 137, near "$type = "WordPerfect"
  (Might be a runaway multi-line "" string starting on line 136)
        (Missing operator before WordPerfect?)
String found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 141, near "$parsecmd = ""
  (Might be a runaway multi-line "" string starting on line 137)
        (Missing semicolon on previous line?)
Scalar found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 141, at end of line
        (Missing operator before ?)
Scalar found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 141, at end of line
        (Missing operator before ?)
Bareword found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 142, near "$type = "RTF"
  (Might be a runaway multi-line "" string starting on line 141)
        (Missing operator before RTF?)
String found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 146, near "$parsecmd = ""
  (Might be a runaway multi-line "" string starting on line 142)
        (Missing semicolon on previous line?)
Scalar found where operator expected at
/export/home/htdig/bin/parse_doc.pl line 146, at end of line
        (Missing operator before ?)
Unrecognized file test: -a at /export/home/htdig/bin/parse_doc.pl line 146.
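
The repeated "Might be a runaway multi-line "" string" hints in the output above usually mean that a double quote is unbalanced somewhere earlier in the script, so that every later string is read inside out. As a rough aid for finding such a line, here is a minimal sketch in Perl. The helper name odd_quote_lines is hypothetical and not part of ht://Dig, and the check is only a heuristic: it counts double quotes per line and ignores escaped quotes, here-documents, regexes, and comments.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of ht://Dig): return the 1-based numbers of
# the lines in $text that contain an odd number of double-quote characters.
# Heuristic only: escaped quotes, here-docs, regexes and comments are ignored.
sub odd_quote_lines {
    my ($text) = @_;
    my @hits;
    my $n = 0;
    for my $line (split /\n/, $text) {
        $n++;
        # Count the double quotes on this line.
        my $count = () = $line =~ /"/g;
        push @hits, $n if $count % 2;
    }
    return @hits;
}

# Two assignment lines in the style of the parse_doc.pl settings quoted above;
# the first one lacks its opening quote.
my $sample = join "\n",
    '$CATPDF = /export/home/xpdf/bin/pdftotext";',
    '$PDFINFO = "/export/home/xpdf/bin/pdfinfo";';

print join(",", odd_quote_lines($sample)), "\n";
```

A flagged line is only a candidate to inspect by hand. Running perl -c on the script afterwards checks the syntax without executing it.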