Re: [htdig3-dev] htdig 3.1.5 indexing


Subject: Re: [htdig3-dev] htdig 3.1.5 indexing
From: Geoff Hutchison (ghutchis@wso.williams.edu)
Date: Wed Aug 30 2000 - 06:56:25 PDT


On 30 Aug 2000 ds@adq.de wrote:

> This directory doesnØt contain any index.htm. so i point htdig to this
> directory, and the html pages are indexed fine. however, as search results
> also the filenames of the html pages are considered. if you click such a
> link, you get the directory listing.

Sure. That's because the page generated for the directory index includes
the filenames. If you don't want the text of links indexed, set
description_factor: 0 in your config file:
<http://www.htdig.org/attrs.html#description_factor>

> Now, if i put an index.html in this directory, all the other pages
> (butcher.htm, etc..) are NOT indexed. I guess because there is no reference
> in index.htm to butcher.htm.

Right, you got it--htdig follows links. Put in an index.html and the
automatic directory-generation by the server stops. You could, of course,
use the index.html for links to the files or whatever you want and that
would also solve your question.

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

------------------------------------ To unsubscribe from the htdig3-dev mailing list, send a message to htdig3-dev-unsubscribe@htdig.org You will receive a message to confirm this.



This archive was generated by hypermail 2b28 : Wed Aug 30 2000 - 06:58:02 PDT