Subject: Re: [htdig3-dev] mixed case urls
From: Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Date: Wed Apr 12 2000 - 09:49:36 PDT
According to Geoff Hutchison:
> At 3:55 PM -0700 4/11/00, Rusty Wright wrote:
> >Htdig has a config file option to ignore the case of urls but it
> >applies to all sites indexed by htdig. Would it be possible to
> >convert all urls to lowercase on a per site basis?
>
> Yes. I believe the case_sensitive attribute is already turned on in
> server blocks for 3.2.0b2. If not, we can fix that pretty quickly.
I just checked the source code, and it appears that case_sensitive is only
used globally right now. It's used in htdig/Server.cc and in two places in
htcommon/URL.cc, so those are the locations that would need to be patched.
Using server block attributes in the URL class may be a bit tricky,
because the server name may be different before and after normalization.
You probably don't want to force URL normalization too early on, so it may
require building a temporary URL just to get the normalized server name,
which then tells you which server block holds the attribute setting you want.
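
To make the ordering problem concrete, here is a rough, standalone sketch of
the idea. It is not taken from the ht://Dig source: UrlParts, CaseSettings,
splitUrl and normalizeUrl are all made-up names, and lowercasing the host is
only a stand-in for whatever server-name normalization the real URL class
does. The point is just that the server name is normalized into a temporary
first, that temporary selects the per-server setting, and only then is the
path touched.

```cpp
#include <algorithm>
#include <cctype>
#include <map>
#include <string>

// Hypothetical stand-ins; none of these names come from the ht://Dig source.
struct UrlParts {
    std::string scheme;  // e.g. "http", empty if the URL had no "://"
    std::string host;    // server name as written, possibly mixed case
    std::string path;    // everything from the first '/' after the host
};

static std::string toLower(std::string s) {
    std::transform(s.begin(), s.end(), s.begin(),
                   [](unsigned char c) { return static_cast<char>(std::tolower(c)); });
    return s;
}

// Split "scheme://host/path" without normalizing anything yet.
static UrlParts splitUrl(const std::string &url) {
    UrlParts p;
    std::string::size_type hostStart = 0;
    std::string::size_type sep = url.find("://");
    if (sep != std::string::npos) {
        p.scheme = url.substr(0, sep);
        hostStart = sep + 3;
    }
    std::string::size_type slash = url.find('/', hostStart);
    if (slash == std::string::npos) {
        p.host = url.substr(hostStart);
        p.path = "/";
    } else {
        p.host = url.substr(hostStart, slash - hostStart);
        p.path = url.substr(slash);
    }
    return p;
}

// Assumed shape of the settings: a global default plus per-server
// overrides keyed by the *normalized* server name.
struct CaseSettings {
    bool globalCaseSensitive = true;
    std::map<std::string, bool> perServer;

    bool caseSensitiveFor(const std::string &rawHost) const {
        // Temporary normalized name, used only to pick the server block.
        const std::string key = toLower(rawHost);
        std::map<std::string, bool>::const_iterator it = perServer.find(key);
        return it == perServer.end() ? globalCaseSensitive : it->second;
    }
};

// Normalize the host first, then decide what to do with the path.
static std::string normalizeUrl(const std::string &url, const CaseSettings &cfg) {
    const UrlParts p = splitUrl(url);
    const bool sensitive = cfg.caseSensitiveFor(p.host);
    const std::string path = sensitive ? p.path : toLower(p.path);
    const std::string prefix = p.scheme.empty() ? "" : toLower(p.scheme) + "://";
    return prefix + toLower(p.host) + path;
}
```

With a per-server entry for "www.example.com" set to false, a URL on that
host has its path lowercased, while URLs on any other host keep their path
case under the global default. The real code would also have to deal with
ports, aliases and the like, which this sketch ignores.
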
Until all this is resolved, I guess the only option, in 3.1.5 or 3.2.0b2,
is to build two separate databases, one for case sensitive servers,
and one for case insensitive servers, and then htmerge the two together
afterward.
-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930