Re: htdig: [Patch] non english text parser broken


Nuno Grilo (nmg@publico.publico.pt)
Wed, 4 Nov 1998 18:13:15 +0000 (WET)


On Wed, 4 Nov 1998, Geoff Hutchison wrote:

> At 9:08 AM -0500 11/4/98, Vadim Chekan wrote:
>
> >I found a bug in current (3.1.0.b2) release: I can't index text cyrillic
> >files. This is because of declare "char" instead of "unsigned char".
> >Function "isalpha" doesn't work with char>127.
>
> Is this just a problem with text files? In other words, is the problem with
> the Plaintext parser, or also with the HTML parser?
>
I get no matches for non-english words in html documents with or without
the patch. This is in Digital Unix 4.0

----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig-request@sdsu.edu containing the single word "unsubscribe" in
the body of the message.



This archive was generated by hypermail 2.0b3 on Sat Jan 02 1999 - 16:28:44 PST