Re: [htdig] Altavista. CGI-BIN?


Subject: Re: [htdig] Altavista. CGI-BIN?
From: Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Date: Wed Jul 12 2000 - 09:45:38 PDT


According to Marco Ghirardi:
> I'd like to configure htdig to retrieve the list of URLs that
> result from a search on altavista.com. Is that possible?
>
> I used as start_url this one:
> http://www.altavista.com/cgi-bin/query?sc=on&hl=on&q=a_topic&kl=XX&pg=q
> that is, the URL that results from a search for the word
> a_topic, but I receive nothing.
>
> PS: I obviously removed "cgi" from exclude_urls.

You'll also need to set limit_urls_to. By default it takes on the value
of start_url, so htdig will reject any link that doesn't match the AltaVista
query URL itself, which is not what you want in this case. I expect you'll
also want to set max_hop_count to something small, so you don't attempt to
index the whole web.
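
As a rough sketch only, the relevant part of the config file might look
something like the fragment below. The values for limit_urls_to and
max_hop_count are illustrative assumptions rather than tested settings:
"http://" is used here as a deliberately broad pattern so that links leading
off the results page are accepted, and a hop count of 1 should keep the dig
to the results page plus the pages it links to directly.

```
# Start from the search results page (URL as given in the original question).
start_url:      http://www.altavista.com/cgi-bin/query?sc=on&hl=on&q=a_topic&kl=XX&pg=q

# Assumed value: accept any http URL, instead of the default (${start_url}),
# which would only accept URLs matching the query URL itself.
limit_urls_to:  http://

# Assumed value: follow links only one hop away from the start page,
# so the dig stops at the pages listed in the search results.
max_hop_count:  1

# exclude_urls must not contain a pattern that matches the start URL
# (e.g. /cgi-bin/), as already noted in the question.
```

You may well want a narrower limit_urls_to pattern, or a different hop
count, depending on how far from the results page you want htdig to go.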

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930



