Re: htdig: Free BSD, local_urls


Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Mon, 23 Nov 1998 17:47:39 -0600 (CST)


According to Geoff Hutchison:
>
> > which it doesn't know the type (non-HTML files). On my system, where
> > some users' web directories are also indexed, it also uses the server
> > for the main index page of each user's web pages. (For some reason,
> > the local_user_urls directive works for pages under the home page, e.g.
> > /~user/file.html, but not the home page itself, i.e. "/~user/".) I don't
>
> The local_default_doc should work for this. /~user/ ->
> /~user/$local_default_doc -> /home/user/www/$local_default_doc

Hmmm. I thought so too, but for every indexing run, I get these entries
in /var/log/httpd/access_log:

cliff.scrc.umanitoba.ca - - [23/Nov/1998:01:23:01 -0600] "GET /robots.txt HTTP/1.0" 200 308 "-" "htdig/3.1.0b2 (www@scrc.umanitoba.ca)"
cliff.scrc.umanitoba.ca - - [23/Nov/1998:01:23:09 -0600] "GET /~grdetil/ HTTP/1.0" 200 5739 "http://www.scrc.umanitoba.ca/SCRC/mugshots/preformatted/Gilles_Detillieux.html" "htdig/3.1.0b2 (www@scrc.umanitoba.ca)"
cliff.scrc.umanitoba.ca - - [23/Nov/1998:01:23:09 -0600] "GET /~matt/ HTTP/1.0" 200 1095 "http://www.scrc.umanitoba.ca/SCRC/mugshots/preformatted/Matt_Ellis.html" "htdig/3.1.0b2 (www@scrc.umanitoba.ca)"
cliff.scrc.umanitoba.ca - - [23/Nov/1998:01:23:09 -0600] "GET /~simon/ HTTP/1.0" 200 2068 "http://www.scrc.umanitoba.ca/SCRC/mugshots/preformatted/Simon_Gosgnach.html" "htdig/3.1.0b2 (www@scrc.umanitoba.ca)"
cliff.scrc.umanitoba.ca - - [23/Nov/1998:01:23:09 -0600] "GET /~kelvin/ HTTP/1.0" 403 155 "http://www.scrc.umanitoba.ca/SCRC/mugshots/preformatted/Kelvin_Jones.html" "htdig/3.1.0b2 (www@scrc.umanitoba.ca)"
cliff.scrc.umanitoba.ca - - [23/Nov/1998:01:23:09 -0600] "GET /manual/LICENSE HTTP/1.0" 200 2607 "http://www.scrc.umanitoba.ca/manual/" "htdig/3.1.0b2 (www@scrc.umanitoba.ca)"

Here's the relevant stuff from my htdig.conf:

database_dir: /var/lib/htdig/db
limit_urls_to: ${start_url}
exclude_urls: /cgi-bin/ .cgi
maintainer: www@scrc.umanitoba.ca
max_head_length: 50000
search_algorithm: exact:1 synonyms:0.5 endings:0.1
start_url: http://www.scrc.umanitoba.ca/ \
                http://www.scrc.umanitoba.ca/SCRC/mugshots/preformatted/index.html \
                http://www.scrc.umanitoba.ca/Physiology/mugshots/preformatted/index.html \
                http://www.scrc.umanitoba.ca/MedRehab/mugshots/preformatted/index.html
local_urls: http://www.scrc.umanitoba.ca/=/home/httpd/html/
local_user_urls: http://www.scrc.umanitoba.ca/=/home/,/public_html/

Anything wrong there, or is this a bug?

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig-request@sdsu.edu containing the single word "unsubscribe" in
the body of the message.



This archive was generated by hypermail 2.0b3 on Sat Jan 02 1999 - 16:28:51 PST