Re: [htdig] Problem with upper case umlauts in HTML documents


Subject: Re: [htdig] Problem with upper case umlauts in HTML documents
From: Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Date: Thu Dec 16 1999 - 13:04:04 PST


According to Manfred Kunicke:
> I'm faced with the same problem as Jens Moellenhoff described in his mail of
> Tue Nov 30 1999 and found out it is a problem of upper case umlauts at
> searching.
>
> my ht://Dig 3.1.4 is running on AIX 4.3.2
>
> For test purpose star_url points only one HTML-page:
> <HTML>
> &Uuml;berfall
> </HTML>
>
> After digging and merging db.wordlist consists of
> überfall i:0 l:291 w:709
>
> i.e. the umlaut is recognized right
>
> Searching for Überfall
> gives the result:
> No matches were found for 'Überfall'...
>
> Searching for überfall
> gives the result:
> Search results for 'überfall'
> ...
> [umlaut.html]
> (None of the search words were found in the top of this document.)
> http://www.fz-rossendorf.de/FVTK/TEST/htdig/umlaut.html 12/16/99,
> 28 bytes.

It sounds to me like the locale is set correctly for htdig, but not
htsearch. If you're using separate config files for both, you must set
the locale the same way in both.

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

------------------------------------ To unsubscribe from the htdig mailing list, send a message to htdig-unsubscribe@htdig.org You will receive a message to confirm this.



This archive was generated by hypermail 2b28 : Thu Dec 16 1999 - 13:17:51 PST