[htdig] [PATCH] Regex URL Rewriting for 3.2.0b2


Subject: [htdig] [PATCH] Regex URL Rewriting for 3.2.0b2
From: Andy Armstrong (andy@tagish.com)
Date: Thu Aug 17 2000 - 07:50:39 PDT


Diggers,

Here's a patch for 3.2.0b2 that introduces a new configuration item,
"url_rewrite_rules", which lets you apply a number of regex rewrites to
parsed URLs. The rules are applied to each discovered URL before any
other processing is done on it.

Here's an example fragment from htdig.conf:

url_rewrite_rules: (.*)\\?JServSessionIdroot=.* \\1 \
                        (.*)\\&JServSessionIdroot=.* \\1 \
                        (.*)&context=.* \\1

As you can see from the example, we're using this to trim JServ session
IDs and some other extraneous stuff from the end of the URLs we're
processing, but there are likely to be other applications. Off the top
of my head, I can see how it might be useful for handling Lotus Domino
URLs too.
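
To make the effect of the example rules concrete, here is a rough
sketch in Python rather than the patch's own code. It assumes each rule
is a (pattern, replacement) pair, that rules are tried in the order
listed, and that each rule sees the output of the one before it; the
patch itself may differ on any of these points. The names RULES and
rewrite_url are made up for illustration, and the doubled backslashes
from htdig.conf are written as single backslashes here.

```python
import re

# Hypothetical (pattern, replacement) pairs mirroring the htdig.conf
# fragment; "\\?" and "\\1" in the config become "\?" and "\1" here.
RULES = [
    (r"(.*)\?JServSessionIdroot=.*", r"\1"),
    (r"(.*)&JServSessionIdroot=.*", r"\1"),
    (r"(.*)&context=.*", r"\1"),
]


def rewrite_url(url, rules=RULES):
    """Apply each rule in turn to the URL (assumed semantics)."""
    for pattern, replacement in rules:
        # Replace the first match only; a URL with no match is unchanged.
        url = re.sub(pattern, replacement, url, count=1)
    return url


if __name__ == "__main__":
    for u in (
        "http://example.com/servlet/page?JServSessionIdroot=abc123",
        "http://example.com/page?id=7&JServSessionIdroot=abc123",
        "http://example.com/page?id=7&context=foo&x=1",
        "http://example.com/plain.html",
    ):
        print(u, "->", rewrite_url(u))
```

Note that because each pattern ends in ".*", everything after the
matched parameter is dropped as well, so the rules only suit parameters
that appear at the end of the URL.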

If anyone's using 3.1.5 and would like the same functionality, I have a
separate patch that does the same for that version.

Comments welcome.

-- 
Andy Armstrong, Tagish


htdig-3.2.0b2.rewrite.tar.gz



