Subject: [htdig] Re: is this a bug?
From: Gilles Detillieux (firstname.lastname@example.org)
Date: Wed Jul 26 2000 - 07:08:44 PDT
First of all, please send questions like this to the email@example.com mailing
list, and not to me personally. This isn't a one-man show!
According to inzli:
> i'm using ht://Dig 3.1.0b3 on a windows platform.
> seems it is working okay. but if i enter a search-term i receive the following message:
> "Documents 1-8 of 8 matches"
> but it only shows two matches.
> With another searchterm, it reveals that there are two matches, but it doesn't show one
> single result.
> actually, the search-term was in a *.pdf-document. i believe that pdf-docs can not be
> digged, therefore i converted it into a html-document and wwwoffled it for digging this
> is this a bug or just a bad configuration?
Your version is ancient. The current stable release is 3.1.5, so I
suggest you upgrade to it. Also, the behaviour you describe seems to
suggest a problem with a corrupt database, so you should rebuild your
database from scratch.
I don't understand enough about how you're using WWWoffle to substitute
an HTML document for a PDF, so I can't say whether or not you have a
configuration problem here as well.
PDF documents can be indexed, though. You'll need a current version of
htdig, as well as the pdftotext utility from the xpdf package, a Perl
interpreter, and the conv_doc.pl or doc2html.pl script. Under Windows,
you'll also need to patch htdig/ExternalParser.cc, so that it uses "wb",
rather than just "w", as the second argument to the fopen() call that
creates the temporary file. See http://www.htdig.org/FAQ.html#q4.9 for
pointers on setting up an external converter.
-- Gilles R. Detillieux E-mail: <firstname.lastname@example.org> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
------------------------------------ To unsubscribe from the htdig mailing list, send a message to email@example.com You will receive a message to confirm this.
This archive was generated by hypermail 2b28 : Tue Jul 25 2000 - 21:07:14 PDT