Re: htdig: Free BSD, local_urls


Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Tue, 24 Nov 1998 14:50:15 -0600 (CST)


According to Geoff Hutchison:
>
> At 6:47 PM -0500 11/23/98, Gilles Detillieux wrote:
> >Anything wrong there, or is this a bug?
>
> Dunno. Try running with "-vvv" and see what the out put is--it should tell
> you the filename when it tries files.

Here's what htdig -vvv told me...

pick: www.scrc.umanitoba.ca:80, # servers = 1
78:78:2:http://www.scrc.umanitoba.ca/~grdetil/: Trying local file /home/grdetil/public_html/
Local retrieval failed, trying HTTP
Retrieval command for http://www.scrc.umanitoba.ca/~grdetil/: GET /~grdetil/ HTTP/1.0

So it never tacked on the "index.html"! I looked at htdig/Retriever.cc,
and sure enough, Retriever::IsLocalUser() doesn't even use "local_default_doc".
Here's the patch:

--- htdig-3.1.0b2/htdig/Retriever.cc.lcldoc Mon Nov 2 18:21:51 1998
+++ htdig-3.1.0b2/htdig/Retriever.cc Tue Nov 24 14:14:33 1998
@@ -735,6 +735,8 @@
         }
         *local += *dir;
         *local += rest;
+ if (local->last() == '/' && config["local_default_doc"] != "")
+ *local += config["local_default_doc"];
         return local;
     }
     return 0;

Now, the only things it gets from the server are /robots.txt and
/manual/LICENSE. If I really wanted to run totally serverless, as some
people on this list were asking about, it would require further patches
to htdig/Server.cc (to get robots.txt locally) and htdig/Document.cc
(to allow files without suffixes, and .txt files, as text/plain type,
rather than returning Document_not_local). I'll leave it as an exercise
for the reader. :-)

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig-request@sdsu.edu containing the single word "unsubscribe" in
the body of the message.



This archive was generated by hypermail 2.0b3 on Sat Jan 02 1999 - 16:28:52 PST