André LAGADEC (firstname.lastname@example.org)
Thu, 11 Mar 1999 21:18:11 +0100
I get and install Htdig on a Web server with french document on Compaq
Proliant 200 computer, with Linux Red Hat 5.0, kernel 2.0.33 and Apache
It work but I have a problem with accent. I can retrieve some word like
"académie" in html pages but not in all pages where there is the word
"académie". And if I search "acad", I can see the pages where there is
the word "académies" because in the db.wordlist file this word is
I suppose that when Htdig see "académie", he detect 2 word "acad" and
"mie", because character 'é' or é but he detect also One word
"académie" in other page !?
I see in the mailing list, that other people have the same problem. I
change my htdig file configuration (see follow) and add some directives
preconised by different people, like
locale fr, or locale fr_FR.ISO_8859-1, valid_punctuation
But he doesn'y=t work correctly.
search_algorithm: exact:1 synonyms:0.5 endings:0.1
# Affix rules file
# Dictionary file
An idea ?
To unsubscribe from the htdig mailing list, send a message to
email@example.com containing the single word "unsubscribe" in
the SUBJECT of the message.
This archive was generated by hypermail 2.0b3 on Mon Mar 15 1999 - 08:57:46 PST