[htdig3-dev] Debugged excerpt/valid_punctuation


Geoff Hutchison (ghutchis@wso.williams.edu)
Wed, 27 Jan 1999 21:39:47 -0400


OK, I poked through the code and worked out the problem with finding words
with punctuation in the excerpt.

Basically, htsearch takes the user input and puts it into $WORDS. It then
does some parsing (applying fuzz and checking for boolean syntax) and puts
the result in $LOGICAL_WORDS. When it does this, it generates a StringMatch
with the parsed $LOGICAL_WORDS in it. This makes sure fuzzy matches are
included in the StringMatch, but it's already stripped out
valid_punctuation. :-(

So here's my proposed fix. In addition to the logicalWords currently placed
in searchWordsPattern in htsearch.cc, we should ALSO add the user's
original input. This should include the punctuation and ensure that these
words are considered when looking up the excerpt and doing hilighting.

Does this make sense?

-Geoff

------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
htdig3-dev@htdig.org containing the single word "unsubscribe" in
the SUBJECT of the message.



This archive was generated by hypermail 2.0b3 on Thu Feb 04 1999 - 22:24:20 PST