Re: [htdig3-dev] Bug#56721: htdig and locale de_DE peculiarities. (fwd)


Subject: Re: [htdig3-dev] Bug#56721: htdig and locale de_DE peculiarities. (fwd)
From: Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Date: Fri Feb 04 2000 - 12:58:00 PST


I think Geoff answered the first few of your questions.

According to Gergely Madarasz:
> On Fri, 4 Feb 2000, Gilles Detillieux wrote:
> > I did notice a problem with the debian/postinst script, though. The test
> > for the endings and synonyms databases is wrong - right now it's testing
> > for the document databases. Also, the message to the installer suggests
> > that /usr/sbin/htdigconfig will rebuild an existing endings database.
> > It won't. It will only rebuild the endings or the synonyms database if
> > it doesn't exist.
>
> That is also left over from the previous maintainer... it is wrong but works
> most of the time :) Nobody has complained about it yet anyway, so if it
> works, I won't break it :)

It works most of the time because if it finds a document database, chances
are the endings and synonyms databases have been built. It'll break if
you install an htdig update package that uses an incompatible endings or
synonyms database format. So, it's not likely to be an issue just yet,
but it could become one.

> > The code that was commented out of rundig probably
> > does a better test, as it will rebuild if the source files are newer than
> > the current synonyms or endings databases.
>
> Yes, but currently those files are in /usr/lib/htdig which may be mounted
> readonly, so can't be rebuilt on the fly... they could be moved, but the
> upgrade could be difficult (automatic changes of conffiles are not
> allowed, etc....)

What I was getting at is that it may make sense to use similar tests in your
postinst and htdigconfig scripts.
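
To make the idea concrete, here is a rough sketch of that kind of test,
written in Python only for illustration (the real scripts are shell). The
function name and the file names in the usage note are hypothetical; the
logic is simply "rebuild if a database is missing, or if any source file
is newer than the oldest database".

```python
import os


def needs_rebuild(db_paths, source_paths):
    """Return True if any database is missing or older than a source file.

    Hypothetical helper: db_paths would be the endings or synonyms
    databases, source_paths the files they are generated from.
    """
    # A missing database always calls for a rebuild.
    if not all(os.path.exists(db) for db in db_paths):
        return True

    # Sources that do not exist cannot make the databases stale.
    existing_sources = [s for s in source_paths if os.path.exists(s)]
    if not existing_sources:
        return False

    # Compare the newest source against the oldest database.
    oldest_db = min(os.path.getmtime(db) for db in db_paths)
    newest_source = max(os.path.getmtime(s) for s in existing_sources)
    return newest_source > oldest_db
```

A script could call something like needs_rebuild(["synonyms.db"],
["synonyms"]) with whatever paths the package actually uses, and only then
run the rebuild step. This tests the databases in question rather than the
document databases.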

> > I'd also highly recommend
> > conv_doc.pl over parse_doc.pl for 3.1.4.
>
> parse_doc.pl was modified a bit by a fellow Debian developer to handle all
> possible converters from .doc, .ps and .pdf available in Debian... I
> didn't actually go into it, so I just included the file...

The configuration section of conv_doc.pl is almost identical to
parse_doc.pl's, so the debian-specific stuff should migrate easily.
The only gotcha is that the new parse_doc.pl and conv_doc.pl use the -raw
option by default for the pdftotext command that comes with xpdf 0.90,
so you'd have to take that option out to work with the other PDF-to-text
filters that the Debian-specific code looks for.
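
As a sketch of the -raw point (in Python purely for illustration, since the
real scripts are Perl), the option can be made conditional on which
converter was found. The function name is hypothetical, and the assumption
that other filters take just the input file and write to standard output
is mine, not something taken from the Debian code.

```python
import os


def pdf_to_text_command(converter, pdf_path):
    """Build an argument list for a PDF-to-text converter.

    Hypothetical helper: -raw is added only when the converter is
    pdftotext; any other filter gets no extra options.
    """
    command = [converter]
    if os.path.basename(converter) == "pdftotext":
        # pdftotext: -raw option, then the input file, then "-" for stdout.
        command += ["-raw", pdf_path, "-"]
    else:
        # Assumption: other filters read the file and print to stdout.
        command.append(pdf_path)
    return command
```

With this arrangement, dropping or keeping -raw is decided in one place
instead of being hard-coded into the command string.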

The text parser in parse_doc.pl is really pretty crude, and doesn't parse
in a manner consistent with htdig's internal parsers.

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930
