Re: [htdig]PB accent with french words


Terje Nagel (terje@kfumscout.dk)
Mon, 15 Mar 1999 09:42:32 +0000


Hello

It seems that locale support dosn't work proberly in RedHat 5.0 (se
danish-HOWTO section 5, http://www.sunsite.dk/ldp/HOWTO/Danish-HOWTO-5.html
. The danish-HOWTO also talks about locale support i general terms)

Try to do a (taken from the HOWTO):
localedef -c -i fr_FR -f ISO-8859-1 fr_FR

Then put
    locale: fr_FR
in your htdig.conf

Now run htdig and htmerge as usual.

Cheers Terje

-----Original Message-----
From: André LAGADEC <andre.lagadec@proto.education.gouv.fr>
To: htdig@htdig.org <htdig@htdig.org>
Date: 11. marts 1999 22:27
Subject: [htdig]PB accent with french words

>
>Hello,
>
>I get and install Htdig on a Web server with french document on Compaq
>Proliant 200 computer, with Linux Red Hat 5.0, kernel 2.0.33 and Apache
>1.2.5
>
>It work but I have a problem with accent. I can retrieve some word like
>"académie" in html pages but not in all pages where there is the word
>"académie". And if I search "acad", I can see the pages where there is
>the word "académies" because in the db.wordlist file this word is
>present.
>
>I suppose that when Htdig see "académie", he detect 2 word "acad" and
>"mie", because character 'é' or &eacute; but he detect also One word
>"académie" in other page !?
>
>I see in the mailing list, that other people have the same problem. I
>change my htdig file configuration (see follow) and add some directives
>preconised by different people, like
>locale fr, or locale fr_FR.ISO_8859-1, valid_punctuation
>
>But he doesn'y=t work correctly.
>
>HTDIG.CONF
>bad_word_list: ${common_dir}/mots_exclus
>locale: fr_FR.ISO_8859-1
>iso_8601: true
>valid_punctuation: "()!?,
>search_algorithm: exact:1 synonyms:0.5 endings:0.1
># Affix rules file
>endings_affix_file ${common_dir}/francais.aff
># Dictionary file
>endings_dictionary ${common_dir}/francais.0
>
>An idea ?
>
>Thanks.
>Andre.
>
>------------------------------------
>To unsubscribe from the htdig mailing list, send a message to
>htdig@htdig.org containing the single word "unsubscribe" in
>the SUBJECT of the message.
>
>

------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig@htdig.org containing the single word "unsubscribe" in
the SUBJECT of the message.



This archive was generated by hypermail 2.0b3 on Mon Mar 15 1999 - 08:57:47 PST