[htdig3-dev] url.lowercase() -- worked, but not really.


Neil Mansilla (neil@aol2.com)
Mon, 28 Jun 1999 12:54:41 -0400


A couple weeks ago we were discussing the case_sensitive issue.
Through the discussion, I gathered that making changes to the
htcommon/DocumentDB.cc file would help -- but it did not.

I have updated htcommon/DocumentDB.cc by uncommenting all of the
"url.lowercase()" statements. If you look at the htdig output,
it appears to be working:

11:11:2:http://www.foo.com/quick.html: size = 32
12:11:2:http://www.foo.com/QuIcK.html: retrieved but not changed

However, the document is NOT being indexed. So, whenver a
case-insensitiveiidentical URL/document is found, the entire
document is essentially tossed out. I have "proven" this theory
by re-commenting the url.lowercase() lines in DocumentDB.cc,
in which case, a query will match BOTH documents.

I hope that any of you can share some insight as to how to handle
case insensitive servers. To be honest, at this time, I don't
really care of case_insensitive affects the behavior -- leaving it
hard coded in C++ is fine with me.

Thanks for any help,
Neil
------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
htdig3-dev@htdig.org containing the single word "unsubscribe" in
the SUBJECT of the message.



This archive was generated by hypermail 2.0b3 on Mon Jun 28 1999 - 09:14:20 PDT