Subject: [htdig] Foreign chars (Swedish)
From: Philippe Ramkvist-Henry (phira600@student.liu.se)
Date: Thu Nov 25 1999 - 05:51:19 PST
Hello!
I'm having problems with some foreign chars when using htdig to index and
search a Swedish site. The locale is set right (sv) and is working in
other applications. The problem I have is somewhat weird, maybe it has
something to do with "uppercase" "lowercase"?
Well, I can search words like "Åsa,åsa,Öl,öl" and get the same matches.
But when I try to search "bäst" I get no hits. With "bÄst" I get several
hits...
I asked a guy here a the University and he said that there might be
complications with "unsigned char" and "char". He gave me the example
below. Please answer at a novice level, my C++ and Unix knowledge is very
limited.
Thanks
Philippe Ramkvist-Henry
------------------------------------------------------------------------
htlib/StringMatch.cc
while ((unsigned char)string[pos])
{
new_state = table[trans[string[pos]]][state];
Should be? or?
while (string[pos])
{
new_state = table[trans[(unsigned
char)string[pos]]][state];
------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig-unsubscribe@htdig.org
You'll receive a message confirming the unsubscription.
This archive was generated by hypermail 2b25 : Thu Nov 25 1999 - 06:03:17 PST