Re: [htdig] site scanning


Subject: Re: [htdig] site scanning
From: Michael Schulz (Mike.Schulz@gmx.de)
Date: Wed Aug 23 2000 - 00:06:50 PDT


Gilles Detillieux wrote:
>
> According to Michael Schulz:
> > i have a problem here while scanning a site:
> >
> > If i write a concrete url in the conf-file, means
> >
> >
> > start_url: http://www.blablabla.com/url_liste.html
> >
> > , htDig only put that page in the index!
> > No links, which are on that page will be put to te index!
> >
> > Is there a way to solve that problem?
> > (Because of JAVASCRIPT, start_url: http://www.blablabla.com is
> > no solution...)
>
> Change your limit_urls_to setting, which defaults to the value of
> start_url.
>
> http://www.htdig.org/attrs.html#limit_urls_to

So i set the config-file like below:

--------------------------------------------------------------
start_url: http://www.blablabla.com/xyz/url_list.html

limit_urls_to: www.blablabla.com
--------------------------------------------------------------

but it doesn´t work: Only the start-url is in the index...

Michael

------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig-unsubscribe@htdig.org
You will receive a message to confirm this.
List archives: <http://www.htdig.org/mail/menu.html>
FAQ: <http://www.htdig.org/FAQ.html>



This archive was generated by hypermail 2b28 : Wed Aug 23 2000 - 00:08:05 PDT