Re: [htdig] Leading reasons for htdig not finding known matches?


Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Wed, 27 Oct 1999 09:11:21 -0500 (CDT)


According to David J. Adams:
> I have checked back on our previous correspondence. The
> problem was indeed for meta descriptions, and may not have
> included the enter head section, but it was definitely
> htdig version 3.1.2.

Yes, the problem with punctuation in meta descriptions is still there.
It hasn't yet been fixed in either the 3.1.x or 3.2.x development source
trees. Someone will need to take the time to fix htdig/HTML.cc to do
proper parsing of words in meta descriptions. The current, simplistic
approach using strtok() just doesn't cut it. I think the same problem
exists with img alt text handling in 3.2 as well, so a general and
reusable fix is needed. Any takers?

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

------------------------------------ To unsubscribe from the htdig mailing list, send a message to htdig@htdig.org containing the single word unsubscribe in the SUBJECT of the message.



This archive was generated by hypermail 2.0b3 on Wed Oct 27 1999 - 07:20:40 PDT