Subject: [htdig] Fw: [htdig] - Question for start_url and exclude_urls
From: David Adams (D.J.Adams@soton.ac.uk)
Date: Fri Jan 05 2001 - 01:37:56 PST
Your colleague Aditya got into the habit of emailing his Ht://Dig
problems to me rather than to the htdig mailing list.
As this latest query is not something I can answer immediately, I am
forwarding it to the list.
For authoritative answers to all queries, please always email
email@example.com and not me personally.
----- Original Message -----
From: "Mohai Wang" <firstname.lastname@example.org>
Sent: Thursday, January 04, 2001 7:39 PM
Subject: [htdig] - Question for start_url and exclude_urls
> Aditya started 3 weeks of vacation yesterday. I am going to take over the
> "htdig" search engine project.
> 1. start_url:
> When start_url = "http://stagsite.coreon.com/download/" and I run
> "rundig -vvv >log", I get this error message on screen: "DB2 problem...:
> or empty key value specified". I have also attached the debug-mode "log" and
> "htdig.conf" files; please take a look. Did I set an option wrong?
> If start_url = "http://stagsite.coreon.com/", it runs through and
> writes the index, but I only need to index everything under "download" and
> nothing else.
> 2. exclude_urls:
> I tried something different: I set start_url =
> "http://stagsite.coreon.com/" and added exclude_urls = "/cgi-bin/
> /calendar/ /coreonlib/". When I run "rundig -vvv >log3", it reads
> /coreonlib/ first and then stops. After I removed "coreonlib" from
> exclude_urls and reran "rundig -vvv >log2", everything was indexed and
> "cgi-bin" and "calendar" were rejected. Could you tell me why? Please take a look at log3.
> Mohai Wang
> Coreon Inc.,
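
For readers following the question above, here is a rough sketch of how a
crawl restricted to the "download" tree might filter URLs. It assumes that
limit_urls_to and exclude_urls are space-separated patterns matched as plain
substrings of each URL, and that a URL is kept only if it matches a limit
pattern and no exclude pattern. The function name is_accepted and the sample
URLs are hypothetical and are not part of ht://Dig; this is not its actual
implementation.

```python
def is_accepted(url, limit_urls_to, exclude_urls):
    """Return True if url passes both filters.

    Assumption: both attributes are space-separated lists of patterns,
    and a pattern matches when it occurs anywhere in the URL.
    """
    in_scope = any(p in url for p in limit_urls_to.split())
    excluded = any(p in url for p in exclude_urls.split())
    return in_scope and not excluded


# Values taken from the message above; the limit pattern is an assumed
# way of keeping the crawl under "download" only.
LIMIT = "http://stagsite.coreon.com/download/"
EXCLUDE = "/cgi-bin/ /calendar/ /coreonlib/"

# Hypothetical URLs, for illustration only.
for candidate in (
    "http://stagsite.coreon.com/download/readme.html",
    "http://stagsite.coreon.com/calendar/index.html",
    "http://stagsite.coreon.com/download/cgi-bin/get.cgi",
):
    print(candidate, is_accepted(candidate, LIMIT, EXCLUDE))
```

Under these assumptions, one thing worth checking is whether the start page
itself, or the only links leading away from it, contain an excluded pattern,
since the crawl would then have nothing left to follow.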
--
David Adams
Computing Services
University of Southampton
This archive was generated by hypermail 2b28 : Fri Jan 05 2001 - 01:49:55 PST