Re: [htdig] Newbie indexing problems


Subject: Re: [htdig] Newbie indexing problems
From: Adam Rice (adam@newsquest.co.uk)
Date: Wed Oct 25 2000 - 07:22:15 PDT


Sara Rudd wrote:
> I am having some trouble with configuring htdig it
> seems to be ignoring some of the servers that I have
> listed using limit_urls dont know why. But if I
> include the main server name, under limit_urls is does
> index them
>
> www.csv.warwick.ac.uk
>
> but of course I get a double count.
>
> Heres hopefully enough of the conf file to maybe
> give a clue as to whats going wrong?
>
> start_url: http://www.warwick.ac.uk/
>
> limit_urls_to: ${start_url} http://www.astro.warwick.ac.uk/ \
> http://www.bio.warwick.ac.uk/ http://www.dcs.warwick.ac.uk/ \
> http://www.eng.warwick.ac.uk/ http://law.bio.warwick.ac.uk/ \
> http://www.maths.warwick.ac.uk/ http://www.phys.warwick.ac.uk/ \
> http://www.wbs.warwick.ac.uk/ http://www.hosp.warwick.ac.uk/ \
> http://www.conferences.warwick.ac.uk/ http://www.unitemps.warwick.ac.uk/ \
> #http://www.csv.warwick.ac.uk/ \

At a guess, some of those servers are only linked from pages that
themselves are linked via the www.csv.warwick.ac.uk hostname. You could
try

server_aliases: www.csv.warwick.ac.uk=www.warwick.ac.uk

Adam

------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig-unsubscribe@htdig.org
You will receive a message to confirm this.
List archives: <http://www.htdig.org/mail/menu.html>
FAQ: <http://www.htdig.org/FAQ.html>



This archive was generated by hypermail 2b28 : Wed Oct 25 2000 - 07:34:02 PDT