Re: [htdig3-dev] Patch for accent translation (need help)


Geoff Hutchison (ghutchis@wso.williams.edu)
Fri, 2 Jul 1999 21:45:13 -0400 (EDT)


On Fri, 2 Jul 1999 alex@narwhal.cisti.nrc.ca wrote:

> I've made a quick little modification to allow me to translate accented
> characters down to their low ascii equivalent. Basically it just adds a
> second word to the database with all low ascii characters. This way I
> can still let my users search for accents if they want, but aren't forced
> to enter them.

Nice idea. The only "downside" is the word database gets a bit larger.
However, the solution isn't bad.

> The translation is also hard coded right now :( I was thinking of adding a
> config file paramater such as "translation_table: 123-162=a 183-184=d" etc
> if this feature is useful and so others could adapt it without actually
> changing the code =)

I was envisioning this sort of transformation through a subclass of
HtWordCodec. Then people could specify a translation table through a file.

> I'm by no means a very good c++ coder so any suggestions would be great.
> Is there a better file for me to be modifying instead of htdig/Retriever?

I think the best place for word-related transformations is
htcommon/WordList.cc

-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
htdig3-dev@htdig.org containing the single word "unsubscribe" in
the SUBJECT of the message.



This archive was generated by hypermail 2.0b3 on Fri Jul 02 1999 - 18:00:52 PDT