Re: [htdig] Two languages and accentuated words

Subject: Re: [htdig] Two languages and accentuated words
From: Gilles Detillieux
Date: Wed Sep 20 2000 - 08:59:42 PDT

According to Manuel Monteiro:
> I've installed ht://dig 3.1.5 on my alphaev6-osf4.0f and it's working
> nicely.
> On my website I have 2 languages, English and Portuguese. I've added the
> following lines to the configuration file in order to support Portuguese:
> # Languages
> #
> locale: pt_PT
> lang_dir: ${common_dir}/portugues
> bad_word_list: ${lang_dir}/bad_words
> endings_affix_file: ${lang_dir}/portugues.aff
> endings_dictionary: ${lang_dir}/portugues.0
> endings_root2word_db: ${lang_dir}/root2word.db
> endings_word2root_db: ${lang_dir}/word2root.db
> The search in English works perfectly, but in Portuguese I can only
> search for words without accents.
> Just an example:
> * If I search for 'Monteiro' the results will show both English
> and Portuguese entries.
> * If I search for 'Mapa' (map in Portuguese) the results will
> show Portuguese entries.
> * If I search for 'Seminário' (seminar in Portuguese, written
> semin&aacute;rio in HTML) it reports: No matches were found for
> '(seminário or seminários)'

Hmm. My first guess was that the pt_PT locale isn't working on your system,
but the fact that your query is expanded to '(seminário or seminários)'
suggests that the endings algorithm is working correctly, and apparently
it is also treating accented letters as letters. Does the word seminário
appear in your db.wordlist file?
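
The two checks above (is the locale usable, and did the word reach the
word list) could be sketched in Python roughly as follows. This is only
an illustration, not part of ht://Dig: the function names are made up,
and it assumes that each db.wordlist line starts with the indexed word
followed by whitespace, and that the file is ISO-8859-1 encoded. Adjust
both assumptions to match your installation.

```python
import locale


def locale_available(name):
    """Return True if the C library accepts `name` for LC_CTYPE."""
    saved = locale.setlocale(locale.LC_CTYPE)  # query the current setting
    try:
        locale.setlocale(locale.LC_CTYPE, name)
        return True
    except locale.Error:
        return False
    finally:
        # Always restore whatever was active before the probe.
        locale.setlocale(locale.LC_CTYPE, saved)


def words_in_wordlist(path, wanted, encoding="iso-8859-1"):
    """Return the lowercased words from `wanted` that begin a line of the file.

    Assumption: the first whitespace-separated field of each line is the
    indexed word; any remaining fields are ignored.
    """
    targets = {w.lower() for w in wanted}
    found = set()
    with open(path, encoding=encoding) as fh:
        for line in fh:
            fields = line.split(None, 1)
            if fields and fields[0].lower() in targets:
                found.add(fields[0].lower())
    return found
```

For example, locale_available("pt_PT") reports whether the locale named
in the configuration can be selected at all, and
words_in_wordlist("db.wordlist", ["seminário", "seminários"]) returns
whichever of the two expanded query words actually appear in the file
(the path here is a placeholder for wherever your database directory
lives). An empty result would point at the indexing step rather than at
the search.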

Gilles R. Detillieux
Spinal Cord Research Centre
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

