Subject: Re: [htdig] new ask
From: Torsten Neuer (tneuer@inwise.de)
Date: Thu Sep 07 2000 - 08:57:29 PDT
Andoni Ayala wrote:
>
> Hello!
>
> Some directories have no "DirectoryIndex" file (normally index.html or index.htm).
>
> How can I keep ht://Dig from indexing the page that Apache generates automatically for them?
>
> I'm trying
> remove_default_doc: index.html index.htm
> but it does not work.
There are several ways of doing so:
1) In the robots.txt file, add a line
Disallow: /foo/bar/
Drawback: You cannot index the content of the directory either.
2) Put a dummy index.html in this directory.
Drawback: No client application will get a directory listing any
more, so none can reach the directory contents that way. ht://Dig
also cannot index the files in the directory unless you list them
in the start_url directive of your ht://Dig configuration file.
3) Hack Apache's mod_autoindex so that the automatically generated
index contains a <!--htdig_noindex--> comment or a
<META NAME="robots" CONTENT="noindex,follow"> tag in its header.
Sadly, Apache has no configuration directive for mod_autoindex
that would let you do this without hacking the code.
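
To see the drawback of option 1 concretely, here is a minimal sketch
using Python's standard urllib.robotparser module. The paths, the
"User-agent: *" line, and the "htdig" agent name are assumptions for
illustration only; /foo/bar/ is just the placeholder from above. It
shows that the Disallow rule blocks the generated listing and also
every file below the directory:

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; /foo/bar/ is the placeholder
# directory from option 1, not a real path.
ROBOTS_TXT = """\
User-agent: *
Disallow: /foo/bar/
"""


def allowed(path, agent="htdig"):
    """Return True if `agent` may fetch `path` under ROBOTS_TXT.

    The agent name "htdig" is an assumption; with "User-agent: *"
    the rule applies to every robot anyway.
    """
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    # The listing itself, a file inside the directory, and a page
    # outside it (all three paths are made up for this example).
    for path in ("/foo/bar/", "/foo/bar/report.html", "/foo/index.html"):
        print(path, "allowed" if allowed(path) else "blocked")
```

Any robot that honours robots.txt will skip both the listing and
the files inside the directory, which is why this option only helps
if you do not need the directory contents indexed.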
hth,
Torsten
--
InWise - Wirtschaftlich-Wissenschaftlicher Internet Service GmbH
Waldhofstraße 14    Tel: +49-4101-403605
D-25474 Ellerbek    Fax: +49-4101-403606
E-Mail: info@inwise.de    Internet: http://www.inwise.de