Re: [htdig] problems with the "accent" patch


Subject: Re: [htdig] problems with the "accent" patch
From: Joe R. Jah (jjah@cloud.ccsf.cc.ca.us)
Date: Thu Mar 02 2000 - 13:25:47 PST


On Thu, 2 Mar 2000, Eric van der Vlist wrote:

> Date: Thu, 02 Mar 2000 22:12:34 +0100
> From: Eric van der Vlist <vdv@dyomedea.com>
> To: Gilles Detillieux <grdetil@scrc.umanitoba.ca>
> Cc: Eric.Doutreleau@int-evry.fr, htdig@htdig.org,
> robert.marchand@UMontreal.CA
> Subject: Re: [htdig] problems with the "accent" patch
>
> Hi,
>
> I have applied this patch as well and noticed that it's working for most
> of the words, but not for others...
>
> Looking at the output of "htfuzzy -vv accents", I have noticed that all
> the words are truncated to 12 characters and that the words which are
> truncated are those for which there is a problem.
>
> For instance searching for "enchere" (not truncated) will return the
> matching for the correctly spelled word (with &egrave;) while searching
> for "specification" truncated to "specificatio" will not match
> specification with a &eacute;.
>
> If I search for "specificatio", I do get the matching for the
> accentuated word...
>
> I am trying to find where this truncation happens, but if anyone more
> familiar with the code can shed some light, it would help !

In the htdig.conf file set maximum_word_length attribute. It is by
default 12.

Regards,

Joe

-- 
     _/   _/_/_/       _/              ____________    __o
     _/   _/   _/      _/         ______________     _-\<,_
 _/  _/   _/_/_/   _/  _/                     ......(_)/ (_)
  _/_/ oe _/   _/.  _/_/ ah        jjah@cloud.ccsf.cc.ca.us

------------------------------------ To unsubscribe from the htdig mailing list, send a message to htdig-unsubscribe@htdig.org You will receive a message to confirm this.



This archive was generated by hypermail 2b28 : Thu Mar 02 2000 - 13:30:23 PST