[htdig] how to ignore robots.txt


p0222 (p0222@odb.rhein-main.de)
Sun, 28 Mar 1999 14:28:11 +0200


Hello People !!!

How can I tell htdig to *ignore* robots.txt files, either on the whole web
or only on specified servers?

Here is my problem:
title: ReferateFundus
href: http://www.fundus.org/index1.htm ()
resolving 'http://www.fundus.org/index1.htm'

   pushing http://www.fundus.org/index1.htm
+href: http://www.fundus.org/indexrechts.htm ()
resolving 'http://www.fundus.org/indexrechts.htm'

   pushing http://www.fundus.org/indexrechts.htm
+A tag: pos = 2, position =
="http://www.fundus.org/cgi/ref_anz.cgi?Biographien">
href: http://www.fundus.org/cgi/ref_anz.cgi?Biographien (Biographien [290])

   Rejected: Item in the exclude list: item # 2 length: 4

^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^EXCLUDE LIST ?!?
How can I turn this exclude list *OFF* ?!?

Thank you,
Gunther Stammwitz

url rejected: (level 1)http://www.fundus.org/cgi/ref_anz.cgi?Biographien
A tag: pos = 2, position =
="http://www.fundus.org/cgi/ref_anz.cgi?Biologie">
href: http://www.fundus.org/cgi/ref_anz.cgi?Biologie (Biologie [238])

   Rejected: Item in the exclude list: item # 2 length: 4
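
The "Rejected: Item in the exclude list: item # 2 length: 4" lines look like the result of a substring match against a configured list of URL patterns rather than a robots.txt rule. The Python sketch below is only an illustration of that kind of check, not htdig's actual code. The two patterns and the function name check_url are assumptions: they are not shown in the log, and were chosen because the second one is 4 characters long, matching "item # 2 length: 4".

```python
# Hypothetical exclude list. These two patterns are an assumption;
# the log only tells us that item #2 is 4 characters long.
EXCLUDE_PATTERNS = ["/cgi-bin/", ".cgi"]


def check_url(url, patterns=EXCLUDE_PATTERNS):
    """Return (item_number, length) of the first pattern contained in url.

    Returns None when no pattern occurs anywhere in the URL.
    Item numbers start at 1, as in the log output.
    """
    for number, pattern in enumerate(patterns, start=1):
        if pattern in url:  # plain substring match, no wildcards
            return number, len(pattern)
    return None


for url in (
    "http://www.fundus.org/indexrechts.htm",
    "http://www.fundus.org/cgi/ref_anz.cgi?Biographien",
):
    hit = check_url(url)
    if hit is None:
        print("pushing", url)
    else:
        print("Rejected: Item in the exclude list: item # %d length: %d" % hit)
```

Under these assumptions the .htm page is pushed, while ref_anz.cgi?Biographien is rejected because ".cgi" occurs in it. If that is what is happening here, the thing to look at would be the exclude setting in the htdig configuration file, not robots.txt.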



