[htdig3-dev] Using Regex for htdig limits


Geoff Hutchison (ghutchis@wso.williams.edu)
Sat, 15 May 1999 13:08:30 -0400


OK, I finally had some time to sit down and write some escaping code for
the HtRegex code. I've turned on regex parsing for the limit_url_to,
limit_normalized, exclude_urls, and bad_querystr variables.

*PLEASE* test it. Try doing indexing and see if it's actually
backwards-compatible. Try adding a regex to these options and see if it
does what you expect. To make things backwards-compatible, regex must be
enclosed in [] and the 'escaping' brackets will be removed from the pattern.

Also, check out HtRegex::setEscaped() and tell me if I'm missing anything
horribly dangerous for escapes. Right now it escapes '.' '?' and '+' but I
don't think this is a comprehensive list--I just figured it would be a
useful subset to start testing.

-Geoff

------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
htdig3-dev@htdig.org containing the single word "unsubscribe" in
the SUBJECT of the message.



This archive was generated by hypermail 2.0b3 on Sat May 15 1999 - 10:22:48 PDT