Subject: Re: [htdig] Request for new htdig META property: htdig-description
From: Gilles Detillieux (firstname.lastname@example.org)
Date: Fri Apr 07 2000 - 14:00:51 PDT
According to Patrick Jennings:
> Well, that was a nuance of my explicit setup and, given that I was
> rolling my own solution, it was the simplest way of getting the result
> I needed. (I love modifying one line of code to get significant new
> functionality.) A more generalised solution would be preferable, and
> I think you've hit the solution on the nose.
> A description_meta_tag_names attribute could do much the same--and
> more--so long as the associated "names" list implies precedence. E.g.:
>
>   description_meta_tag_names: htdig-description description
>
> which generates the behaviour:
>
>   if no htdig-description then look for description,
>   if no description then automatically generated
>
>   description_meta_tag_names: htdig-description
>
>   if no htdig-description, then automatically generated
>
> This is a more elegant and configurable implementation. I like it.
> It'll just take a bit more code to implement.
The precedence aspect would be more difficult to implement. The HTML.cc
code parses HTML in one pass, and deals with tags as they occur, so it
would require storing the description tags until you find the highest
precedence one, then ignoring subsequent tags, or something like that,
and only indexing the words in the tag after you know you have the final
one. I guess that makes sense in any case, as you probably don't want
to index more than one of these, which is what your quick fix does.
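
The single-pass approach described above could be sketched roughly as follows. This is only an illustration and not ht://Dig code: the class DescriptionSelector and its methods are hypothetical names, and it assumes the parser has already lowercased the META tag names and that the precedence list comes from the proposed description_meta_tag_names attribute, highest priority first.

```cpp
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical helper (not part of ht://Dig): remembers the
// highest-precedence description META tag seen so far while the
// document is parsed in a single pass.
class DescriptionSelector {
public:
    // names: precedence list, highest priority first, e.g.
    // {"htdig-description", "description"}.
    explicit DescriptionSelector(std::vector<std::string> names)
        : names_(std::move(names)), bestRank_(names_.size()) {}

    // Call once for each META tag as the parser encounters it.
    // Only a tag with strictly higher precedence than the current
    // best replaces it, so once the top-ranked name has been seen,
    // all subsequent tags are ignored.
    void offer(const std::string& name, const std::string& content) {
        for (std::size_t rank = 0; rank < bestRank_; ++rank) {
            if (names_[rank] == name) {
                bestRank_ = rank;
                best_ = content;
                return;
            }
        }
    }

    // True if any tag from the precedence list was seen.
    bool found() const { return bestRank_ < names_.size(); }

    // The stored description; meaningful only when found() is true.
    // When found() is false, the caller would fall back to the
    // automatically generated description.
    const std::string& text() const { return best_; }

private:
    std::vector<std::string> names_;
    std::size_t bestRank_;  // index into names_, or names_.size() if none
    std::string best_;
};
```

The parser would call offer() for every META tag and index the words from text() only after the whole document has been read, so at most one description is ever indexed.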
-- 
Gilles R. Detillieux              E-mail: <email@example.com>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930