Subject: Re: [htdig] Score differences
From: Gilles Detillieux (firstname.lastname@example.org)
Date: Fri Aug 25 2000 - 12:43:12 PDT
According to Jonas Larsson:
> I have noticed a strange behaviour in how htdig
> computes the score when you search for a keyword.
> If I index my site in one big htdig run and then
> search for a specific keyword in the generated database
> using htsearch I get one set of "scores" for the
> documents found.
> If I on the other hand index different parts of my site
> with several htdig runs, merge the databases together
> into one big database and then search again for the same
> keyword using htsearch I get a different set of "scores"
> for the documents found. The score for the same document
> is often different - strange, seems incorrect.
> Is there a good explanation to this behaviour?
This may be guesswork on my part, but one item that adds a lot of
weight to pages is the link description text from other documents
that link to it. When you index the site in separate parts, I think
you'll lose all the link descriptions for links from one part to
another, so that could have an impact on scores - quite possibly
a profound one.
-- Gilles R. Detillieux E-mail: <email@example.com> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
------------------------------------ To unsubscribe from the htdig mailing list, send a message to firstname.lastname@example.org You will receive a message to confirm this. List archives: <http://www.htdig.org/mail/menu.html> FAQ: <http://www.htdig.org/FAQ.html>
This archive was generated by hypermail 2b28 : Fri Aug 25 2000 - 12:43:58 PDT