Re: [htdig] Problem in HTDig 3.2b2?


Subject: Re: [htdig] Problem in HTDig 3.2b2?
From: Geoff Hutchison (ghutchis@wso.williams.edu)
Date: Mon Jun 05 2000 - 12:08:34 PDT


On Mon, 5 Jun 2000, Ravindra Wankar wrote:

> In the db.worddump file, (created with htdig -t) the value in the
> "flags" column for
> words that appears in the meta description tags is the same as that for
> those in title.
>
> The flags column is "2". Shouldn't this be "16" as defined in
> HtWordReference.h?

Sounds like a bug, yes.

> BTW, what are the effects on htdig if the html files are not
> "clean" e.g. have missing tags etc?

Remember that the parser strips out most tags. For example, table tags are
completely ignored. So for these tags, it really doesn't matter if the
tags aren't closed properly.

In the case of tags that are important to the parser,
it implements the standard. In particular, tags are "closed" at the end of
the file, or at the beginning of a tag of that type (e.g. <a href="...">
will close for a missing </a>).

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

------------------------------------ To unsubscribe from the htdig mailing list, send a message to htdig-unsubscribe@htdig.org You will receive a message to confirm this.



This archive was generated by hypermail 2b28 : Mon Jun 05 2000 - 09:58:38 PDT