[htdig] Re: [htdig3-dev] question


Subject: [htdig] Re: [htdig3-dev] question
From: Geoff Hutchison (ghutchis@wso.williams.edu)
Date: Thu Oct 12 2000 - 12:13:08 PDT


On Thu, 12 Oct 2000, Iosif Fettich wrote:

> hope someone has a clue. Something seems to be wrong with the way htdig
> indexes. In the config file, I have
>
> start_url:http://www.pcconcrete.ro/phtml/
> limit_urls_to:${start_url}p0 ${start_url}r0
>
> What I'd like to have indexed are the files that have names starting with
> 'p0' or 'r0' - and _nothing_ else.
>
> However, when I search something, it happens that I get results that are
> in the main index of the directory /phtml/ - what shouldn't be indexed (if
> it would do it like I'd wish).
>
> Am I doing something wrong, missing something, or is it time for looking
> into the code for a patch...?

Of course my usual suggestion is to turn on some debugging with the -vvv
flag. ;-)

My concern here is that the start_url *itself* doesn't match the pattern.
So it's going to index the index.html file first (since that's the
start_ulr) and then take the links that match from there.

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

------------------------------------ To unsubscribe from the htdig mailing list, send a message to htdig-unsubscribe@htdig.org You will receive a message to confirm this. List archives: <http://www.htdig.org/mail/menu.html> FAQ: <http://www.htdig.org/FAQ.html>



This archive was generated by hypermail 2b28 : Thu Oct 12 2000 - 12:18:15 PDT