Re: [htdig] site scanning


Subject: Re: [htdig] site scanning
From: Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Date: Tue Aug 22 2000 - 09:16:02 PDT


According to Michael Schulz:
> I have a problem when scanning a site:
>
> If I put a specific URL in the conf file, i.e.
>
>
> start_url: http://www.blablabla.com/url_liste.html
>
> htDig puts only that page in the index!
> None of the links on that page are added to the index!
>
> Is there a way to solve that problem?
> (Because of JavaScript, start_url: http://www.blablabla.com is
> not an option...)

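Change your limit_urls_to setting, which defaults to the value of
start_url.  With start_url set to a single page, only URLs that contain
that full page URL are accepted, so none of the links on it qualify.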

http://www.htdig.org/attrs.html#limit_urls_to
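
As a rough sketch, assuming the pages linked from url_liste.html all
live on the same host, the two attributes might look like this in the
conf file.  The limit_urls_to value here is only an assumption; widen
it or list more patterns to cover whatever should be indexed.

```
# Start at the link list page, as in the original setup.
start_url:      http://www.blablabla.com/url_liste.html

# Assumed value: accept any URL containing this string, so links
# found on the start page that stay on this host can be followed.
limit_urls_to:  http://www.blablabla.com/
```

Links that point to other hosts would still be skipped unless their
patterns are added to limit_urls_to as well, separated by spaces.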

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930




This archive was generated by hypermail 2b28 : Tue Aug 22 2000 - 09:16:42 PDT