Re: [htdig3-dev] Re: [Fwd: [htdig] Problem with &..; entities in meta tags]


Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Fri, 30 Jul 1999 12:07:26 -0500 (CDT)


According to Torsten Neuer:
> According to Gilles Detillieux:
> >OK, we do clearly have a problem with SGML entities in 3.1.2, as well
> >as 3.2. (3.2 has some more serious problems, which I was hoping to
> >tackle, but that's another story.) So, right now, it only translates
> >&foo; entities outside of any HTML tags. I think there are reasons
> >not to translate them in all tags, but where is it valid to do so?
> >Certainly in keywords text, alt text in img tags, and meta description
> >text. How about htdig-email-subject? Any others I've missed?
>
> - HTML 4.0 "title" attribute (not yet handled by ht://Dig, but would be
> nice to improve search results)
>
> - Most of Dublin Core META information contents (would be nice if ht://Dig
> could directly support this META standard).
>
> - Alt text in client side image maps.

None of these are currently handled in HTML.cc in either 3.1.2 or 3.2,
so we need not worry about them for this bug fix. However, they are
good suggestions to keep in mind for future 3.2 development. I guess
the rule of thumb is that any tag attribute containing plain text, as
opposed to URIs, numbers, or other "internals", ought to be processed
for SGML entities.
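
To make that rule of thumb concrete, here is a rough sketch of how it
might look in code. This is not taken from HTML.cc: the function names
decodeEntities() and holdsPlainText() are hypothetical, the entity table
is deliberately tiny, and tag/attribute names are assumed to be
lower-cased already. Including htdig-email-subject in the list is also
an assumption, since that one was only raised as a question.

```cpp
#include <map>
#include <set>
#include <string>

// Hypothetical helper (not ht://Dig code): translate a few named SGML
// entities in a plain-text value. Unknown entities are left untouched.
std::string decodeEntities(const std::string &in)
{
    static const std::map<std::string, char> named = {
        {"amp", '&'}, {"lt", '<'}, {"gt", '>'}, {"quot", '"'}
    };
    std::string out;
    std::string::size_type i = 0;
    while (i < in.size()) {
        if (in[i] == '&') {
            std::string::size_type semi = in.find(';', i + 1);
            if (semi != std::string::npos) {
                // Text between '&' and ';' is the candidate entity name.
                auto it = named.find(in.substr(i + 1, semi - i - 1));
                if (it != named.end()) {
                    out += it->second;
                    i = semi + 1;
                    continue;
                }
            }
        }
        out += in[i++];   // ordinary character, or an unrecognised '&'
    }
    return out;
}

// Hypothetical predicate: does this attribute hold plain text (so that
// entities should be translated), as opposed to a URI, number, etc.?
// metaName is the value of the "name" attribute for <meta> tags.
bool holdsPlainText(const std::string &tag, const std::string &attr,
                    const std::string &metaName = "")
{
    static const std::set<std::string> textMetas = {
        "keywords", "description", "htdig-email-subject"  // last one assumed
    };
    if (tag == "img" && attr == "alt")
        return true;
    if (tag == "meta" && attr == "content")
        return textMetas.count(metaName) > 0;
    return false;   // href, src, width, ... are "internals"
}
```

A parser would then call decodeEntities() on an attribute value only
when holdsPlainText() says so, which leaves URIs such as
href="a.cgi?x=1&amp;y=2" to whatever handling they already get.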

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930
