Re: [htdig] How to exclude symbolic links


Subject: Re: [htdig] How to exclude symbolic links
From: Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Date: Tue Feb 08 2000 - 07:17:16 PST


According to Geoff Hutchison:
> At 6:55 PM -0700 2/7/00, Fates wrote:
> >How to exclude symbolic links when using rundig?
>
> You can't. Well, that is to say if you're indexing through HTTP,
> there's no way for htdig to know the difference--if it has a
> different URL, it will be indexed twice presently. You can turn this
> off in most servers, however. For example in Apache, the directive is
> -FollowSymLinks set in an Options directive.

The other option might be to use a "find" command to get the pathnames
of all the symlinks, and convert them into their equivalent URLs (e.g.
using sed). Put that list into a file which would be fed into the
exclude_urls attribute. E.g.:

exclude_urls: /cgi-bin/ .cgi `/etc/htdig/urls_to_exclude`

...

find /home/httpd/html -type l -print | \
        sed 's|^/home/httpd/html|http://www.mydomain.org|' \
        > /etc/htdig/urls_to_exclude
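
The same idea can be wrapped in a small shell function, so the document root and base URL are not hard-coded. This is only a sketch: the function name symlink_urls is made up for illustration, and it swaps the sed substitution for the shell's own prefix stripping, so that characters in the document root are never treated as a regular expression. It assumes pathnames contain no newlines and need no URL escaping.

```shell
# Print one URL per symbolic link found under a document root.
# symlink_urls is a hypothetical helper name, not part of ht://Dig.
# Usage: symlink_urls DOCROOT BASEURL
symlink_urls() {
    docroot=${1%/}    # drop a trailing slash, if present
    baseurl=${2%/}
    # -type l matches the links themselves; find does not follow them by default.
    find "$docroot" -type l -print |
    while IFS= read -r path; do
        # Replace the leading document root with the base URL.
        printf '%s%s\n' "$baseurl" "${path#"$docroot"}"
    done
}

# Example, using the paths from the message above:
#   symlink_urls /home/httpd/html http://www.mydomain.org > /etc/htdig/urls_to_exclude
```

As far as I know, exclude_urls patterns are matched as substrings of each URL, so a symlink that points at a directory should also exclude everything beneath that URL. The list has to be regenerated before each rundig run if the links change.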

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930




This archive was generated by hypermail 2b28 : Tue Feb 08 2000 - 07:22:58 PST