[htdig] htdig ignores *.doc file extension


Subject: [htdig] htdig ignores *.doc file extension
From: Evelio Martinez (evelio.martinez@testanet.com)
Date: Fri Jan 12 2001 - 12:19:19 PST


Hello!

I have installed htdig under RH 6.2 and followed the instructions in the
README files.

1) conf/htdig.conf does not contain anything related to *.doc or *.pdf
   documents in bad_extensions.
2) external_parsers is set to:
   external_parsers: application/msword->text/html /opt/www/htdig/scripts/doc2html.pl \
                     application/pdf->text/html /opt/www/htdig/scripts/doc2html.pl
3) The variables in doc2html.pl point to the correct places:
   $CATDOC = "/usr/local/bin/catdoc";
   $CATPDF = "/usr/local/bin/pdftotext";
   $PDFINFO = "/usr/local/bin/pdfinfo";
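
The three points above can be double-checked mechanically. What follows is
only a rough sketch, not part of htdig: the function names parse_htdig_conf
and check are hypothetical, and it assumes the usual "name: value" layout of
htdig.conf with a trailing backslash for continuation lines and "#" for
comments. It reports whether .doc or .pdf appear in bad_extensions and
whether each program named in external_parsers is executable. Extensions are
compared case-sensitively here, which may not match how htdig itself
compares them.

```python
import os


def parse_htdig_conf(text):
    """Parse 'name: value' attributes, joining lines that end in a backslash."""
    attrs = {}
    pending = ""
    for raw in text.splitlines():
        line = raw.rstrip()
        # Skip comment lines, but only when not in the middle of a continuation.
        if not pending and line.lstrip().startswith("#"):
            continue
        if line.endswith("\\"):
            pending += line[:-1] + " "
            continue
        line = pending + line
        pending = ""
        if ":" not in line:
            continue
        name, value = line.split(":", 1)
        # Collapse runs of whitespace left over from the joined lines.
        attrs[name.strip()] = " ".join(value.split())
    return attrs


def check(attrs, extensions=(".doc", ".pdf")):
    """Return a list of human-readable problems found in the parsed attributes."""
    problems = []
    bad = attrs.get("bad_extensions", "").split()
    for ext in extensions:
        if ext in bad:
            problems.append(f"{ext} is listed in bad_extensions")
    tokens = attrs.get("external_parsers", "").split()
    # external_parsers is read as a flat list of (type mapping, program) pairs.
    for mapping, program in zip(tokens[0::2], tokens[1::2]):
        if not os.access(program, os.X_OK):
            problems.append(f"{program} (for {mapping}) is not executable")
    return problems
```

To use it, read conf/htdig.conf into a string, pass it to parse_htdig_conf,
and print whatever check returns; an empty list means none of these
particular problems were found, not that the configuration is correct.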

Even so, htdig is ignoring files with the .pdf and .doc extensions.

Did I miss something?
Any suggestions?

Thanks in advance

--
Evelio Martínez
Testanet. Dept. desarrollo software.
Av. Reino de Valencia, 15 - 5
46005 Valencia (Spain)
Tel: +34 96 395 90 00
Fax: +34 96 316 23 19


