Subject: Re: [htdig] problems with the "accent" patch
From: Joe R. Jah (jjah@cloud.ccsf.cc.ca.us)
Date: Thu Mar 02 2000 - 13:25:47 PST
On Thu, 2 Mar 2000, Eric van der Vlist wrote:
> Date: Thu, 02 Mar 2000 22:12:34 +0100
> From: Eric van der Vlist <vdv@dyomedea.com>
> To: Gilles Detillieux <grdetil@scrc.umanitoba.ca>
> Cc: Eric.Doutreleau@int-evry.fr, htdig@htdig.org,
> robert.marchand@UMontreal.CA
> Subject: Re: [htdig] problems with the "accent" patch
>
> Hi,
>
> I have applied this patch as well and noticed that it's working for most
> of the words, but not for others...
>
> Looking at the output of "htfuzzy -vv accents", I have noticed that all
> the words are truncated to 12 characters and that the words which are
> truncated are those for which there is a problem.
>
> For instance searching for "enchere" (not truncated) will return the
> matching for the correctly spelled word (with è) while searching
> for "specification" truncated to "specificatio" will not match
> specification with a é.
>
> If I search for "specificatio", I do get the matching for the
> accentuated word...
>
> I am trying to find where this truncation happens, but if anyone more
> familiar with the code can shed some light, it would help !
In the htdig.conf file set maximum_word_length attribute. It is by
default 12.
Regards,
Joe
--
_/ _/_/_/ _/ ____________ __o
_/ _/ _/ _/ ______________ _-\<,_
_/ _/ _/_/_/ _/ _/ ......(_)/ (_)
_/_/ oe _/ _/. _/_/ ah jjah@cloud.ccsf.cc.ca.us
------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig-unsubscribe@htdig.org
You will receive a message to confirm this.
This archive was generated by hypermail 2b28 : Thu Mar 02 2000 - 13:30:23 PST