Re: [htdig] new ask


Subject: Re: [htdig] new ask
From: Torsten Neuer (tneuer@inwise.de)
Date: Thu Sep 07 2000 - 08:57:29 PDT


Andoni Ayala wrote:
>
> Hello!
>
> In a directory that has no "DirectoryIndex" files (normally index.html or index.htm),
>
> how can I keep the page automatically generated by Apache from being indexed?
>
> I'm trying with
> remove_default_doc: index.html index.htm
> but it does not work.

There are several ways of doing so:

1) In the robots.txt file, add a line
        Disallow: /foo/bar/
    Drawback: You cannot index the content of the directory either.

2) Put a dummy index.html in this directory.
    Drawback: No client application will get a directory listing
    any more, so none can reach the directory contents through it.
    ht://dig cannot index the files either, unless you list them
    in the start_url directive of your ht://dig configuration file.

3) Hack Apache's mod_autoindex so that it emits a
    <!--htdig_noindex--> comment or a
    <META NAME="robots" CONTENT="noindex,follow"> tag in
    the header of the automatically generated index.

    Sadly, Apache has no configuration attributes for mod_autoindex
    with which someone could do this without hacking the code.
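
To illustrate the drawback of option 1, here is a minimal sketch using
Python's standard urllib.robotparser. The "User-agent: *" line, the
"htdig" agent string and the file names are assumptions made up for the
example; only the Disallow line comes from the text above. It suggests
that a robots.txt-compliant crawler is shut out of the files in the
directory as well as the generated listing.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content built around the Disallow line from option 1.
ROBOTS_TXT = """\
User-agent: *
Disallow: /foo/bar/
"""


def is_allowed(path, agent="htdig"):
    """Return True if a compliant robot may fetch the given path.

    The agent name is an assumption; the rules above apply to every agent.
    """
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, path)


# The generated directory listing itself.
listing_allowed = is_allowed("/foo/bar/")
# A (made-up) file inside the directory: excluded too, which is the drawback.
file_allowed = is_allowed("/foo/bar/report.html")
# A (made-up) page outside the directory is unaffected.
sibling_allowed = is_allowed("/foo/index.html")

print(listing_allowed, file_allowed, sibling_allowed)
```

The rule matches by path prefix, so there is no way in this scheme to
exclude only the listing while keeping the files beneath it.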

hth,

  Torsten

-- 
InWise - Wirtschaftlich-Wissenschaftlicher Internet Service GmbH
Waldhofstraße 14                            Tel: +49-4101-403605
D-25474 Ellerbek                            Fax: +49-4101-403606
E-Mail: info@inwise.de            Internet: http://www.inwise.de




This archive was generated by hypermail 2b28 : Thu Sep 07 2000 - 08:59:47 PDT