Subject: [htdig3-dev] Re: max_keywords works as expected;)
From: Gilles Detillieux (firstname.lastname@example.org)
Date: Tue Dec 14 1999 - 11:22:26 PST
According to Joe R. Jah:
> On Mon, 13 Dec 1999, Gilles Detillieux wrote:
> > Date: Mon, 13 Dec 1999 13:17:01 -0600 (CST)
> > From: Gilles Detillieux <email@example.com>
> > To: "Joe R. Jah" <firstname.lastname@example.org>
> > Cc: email@example.com, firstname.lastname@example.org
> > Subject: Re: Non-zero max_keywords disables meta keywords;(
> > > keywords_meta_tag_names: keywords htdig-keywords description
> > Well that explains it! If those 26 words appear before your meta keywords
> > tag in the document, they'll be indexed and counted first.
> If some or all of those words do not appear in the document body, what
> would the order of keywords be?
I'm not talking about the document body here. The 26 words in questions
were words that you said were in the meta description tag. If you include
description in keywords_meta_tag_names, then the meta description tag will
get parsed just like any meta keywords tag (or for that matter, any meta
tag with a name that appears in keywords_meta_tag_names). That means the
words in that tag will get indexed and counted as keywords, and use up
part or all of the max_keywords quota for that document.
Words in the document body have no bearing on this, as they're not treated
> > I must say I don't understand the need to index meta description words as
> > keywords, as they are already indexed using the meta_description_factor
> > weight.
> Bingo;) I removed the description from keywords_meta_tag_name and
> everything works as expected; i.e. if I set the max_keywords to six I find
> the sixth meta keyword in the results, but if I set it to five or less, I
> don't find it. This, however, leaves an open question; if a document has
> both keywords and htdig-keywords, then how are the keywords ordered?
> which set comes first if some or all keywords do not appear in the boby?
I thought I had answered that question yesterday, but I guess I wasn't
clear enough. The HTML parser parses the HTML tags linearly, i.e. tags
are processed in the order in which they appear in the document.
It makes absolutely no difference in which order you list the names in
the keywords_meta_tag_names attribute - what matters is the order in
which they appear within any given document. Of course, one document
may use a completely different order than the next.
If you have keywords and htdig-keywords in keywords_meta_tag_names, then
the two different meta tags will be treated as completely equivalent.
If both appear in one document, the parser will parse both the same way.
It will index and count the words in the first meta tag (whichever one
that may be), then it will do the same with the second, just as though
all the contents of the second tag were appended to the first. In either
case, it counts and indexes the keywords until max_keywords is used up,
after which it won't look at another keyword in that document.
-- Gilles R. Detillieux E-mail: <email@example.com> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
------------------------------------ To unsubscribe from the htdig3-dev mailing list, send a message to firstname.lastname@example.org You will receive a message to confirm this.
This archive was generated by hypermail 2b28 : Tue Dec 14 1999 - 11:36:06 PST