htdig: bug report (files attached)


Marek Sobol (bolec@chaosworks.com.pl)
Fri, 11 Dec 1998 12:31:04 +0100


HI,

I think I found another bug in htdig. Htdig probably has troubles parsing
HTML containing &gt and &lt.
You have several files attached below (packed in bug981211.zip file)

1. htdig.conf - my configuration file
2. ch25.html - a document which causes problem
3. htsearch.htm - the problem itself - look at first entry in this file
(veeeeeeeeeeeery long description). It is a htsearch result.

I noticed also another problem which is probably caused by the same bug.
File iisdir.html contains a directory listing generated by Microsoft IIS 4.0
HTP server (piece of shit, but many people use it). Htdig cannot dig
directory structure starting from this file. It is *probably* caused by

<dir>

Htdig may thnk it is a tag <dir> or something like that.

I hope it will help you.
Unfortunately I had no time so far to take a look in htdig source code
(installed htdig 3 days ago :-)

Marek Sobol
bolec@chaosworks.com.pl

----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig-request@sdsu.edu containing the single word "unsubscribe" in
the body of the message.



This archive was generated by hypermail 2.0b3 on Sat Jan 02 1999 - 16:29:50 PST