Re: htdig: htdig-8.1b2: Ignoring URLs?


Frank Richter (Frank.Richter@hrz.tu-chemnitz.de)
Tue, 8 Dec 1998 10:12:59 +0100 (MET)


Hi,
I applied your patch to htdig, now I get
Digging with max_hop_count 8: htdig-8.0.8b2 - ca. 55,000 documents
                              htdig-8.1.0b2 - ca. 13,000 documents
                     patched htdig-8.1.0b2 - 92,118 documents

A lot more documents! I detected 6127 lines with "level -1":

4201:588:-1:http://www.tu-chemnitz.de/chemnitz/: ** size = 497
4207:589:-1:http://www.tu-chemnitz.de/tu/impressum.html: ----*-*--* size = 3385
         ^^
What does this mean?

- Frank

PS: It would be nice to have a possiblity to configure a maximum document
count to dig, i.e.
max_hop_count: 8 # dig to level 8
max_doc_count: 60000 # but maximum this number of documents

-- 
Email: Frank.Richter@hrz.tu-chemnitz.de  http://www.tu-chemnitz.de/~fri/
Work:  Computing Services, Technical University, 09107 Chemnitz, Germany
-+# Es weihnachtet sehr ... http://www.tu-chemnitz.de/urz/advent98/  #+-

---------------------------------------------------------------------- To unsubscribe from the htdig mailing list, send a message to htdig-request@sdsu.edu containing the single word "unsubscribe" in the body of the message.



This archive was generated by hypermail 2.0b3 on Sat Jan 02 1999 - 16:29:48 PST