AW: [htdig] irrelevant pages in search


Subject: AW: [htdig] irrelevant pages in search
From: Hartmut Steffin (h.steffin@abi-behoerden.de)
Date: Fri Nov 19 1999 - 04:50:10 PST


I have the same problem on our intranet site. it reaches a level of
unreliability that the whole search is useless. there must be a principle
error. the only errors in the log i have are about not being able to index
pdf-files:

/tmp/htdig12988.pdf: Could not repair file.
/tmp/htdig12988.pdf: Could not repair file.
/tmp/htdig12988.pdf: Could not repair file.
/tmp/htdig12988.pdf: Expected a dict object.
/tmp/htdig12988.pdf: This document requires a password.
/tmp/htdig12988.pdf: Could not repair file.
/tmp/htdig12988.pdf: Could not repair file.
/tmp/htdig12988.pdf: Could not repair file.
/tmp/htdig12988.pdf: Could not repair file.
/tmp/htdig12988.pdf: This document requires a password.
PDF::parse: cannot open acroread output
PDF::parse: cannot open acroread output
PDF::parse: cannot open acroread output
PDF::parse: cannot open acroread output
PDF::parse: cannot open acroread output
PDF::parse: cannot open acroread output
PDF::parse: cannot open acroread output
PDF::parse: cannot open acroread output
PDF::parse: cannot open acroread output
PDF::parse: cannot open acroread output

I don't understand what the problem with these files is. They work perfectly
from the browser.
Is there a connection between error in pdf-files and messing up the
database?

regards
Hardy

------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig-unsubscribe@htdig.org
You'll receive a message confirming the unsubscription.



This archive was generated by hypermail 2b25 : Fri Nov 19 1999 - 05:01:49 PST