Re: [htdig] Search results


Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Mon, 9 Aug 1999 14:29:55 -0500 (CDT)


According to peter karlsson:
> > I'm not sure what you mean. Do you want to turn off the match
> > highlighting? I can't think of a *direct* way to do this.
>
> No, I just don't want it to show results like this:
>
> [] *
> ... )<p> <TABLE> <TD align=left> <IMG SRC="../pic/li.gif"
> ALT="*"> <a
> href=http://www.stwing.upenn.edu/~jruspini/starwars/> Star Wars
> Home Page</a> at UPENN<br> <IMG SRC="../pic/li.gif" ALT="*"> <a
> href=http://www.toysrgus.com/> Star Wars Collectors Archive ...
> http://www.mds.mdh.se/~frv95pen/starwar/ 1999-08-01 02:45:10
> MET DST, 8186 bytes
>
> [] *
> ... /" target="_top"
> onMouseOver="window.status='http://www.blueharvest.net/sw-now/'
> ; return true"> <img SRC="images/sw-now-sm.gif" BORDER="0"
> ALT="Star Wars Now"></a> </th><TH></th></tr> <tr><th>
> </th><th><a HREF="/~dal98lsg/cgi-bin/ax.cgi ...
> http://www.mds.mdh.se/~dal98lsg/ 1999-08-01 02:20:32 MET DST,
> 4317 bytes

I grabbed copies of these two pages, indexed them with 3.1.2 on my system,
and I can't reproduce the error. One of these documents has unclosed
quotes in a few ALT text attributes in IMG tags, but that doesn't pose
a problem with indexing them.

Could you let us know:

1) Which version of ht://Dig is causing you this problem?

2) Have you applied any patches to it, or made any changes at all to
the source? If so, what?

3) What is your OS version?

4) What does your htdig.conf file look like? (You may strip out comments
and any attributes you don't want to post to the list, but I'm interested
in seeing any attributes that may affect what gets indexed in the
documents and what gets stripped out.)

5) Does the problem persist after reindexing?
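
For item 4, a minimal Python sketch of stripping comments from a config
file before posting it. It assumes that a comment is any line whose first
non-blank character is '#'. The function name and the sample values are
made up for illustration, and attributes you'd rather keep private still
have to be removed by hand.

```python
def strip_conf_comments(text: str) -> str:
    """Return config text with comment-only and blank lines removed."""
    kept = []
    for line in text.splitlines():
        stripped = line.strip()
        # Assumption: a comment is a line whose first non-blank character is '#'.
        if not stripped or stripped.startswith("#"):
            continue
        kept.append(line.rstrip())
    return "\n".join(kept)


# Hypothetical sample; the paths and URL are placeholders, not a real setup.
sample = """\
# Example only; these values are made up for illustration.
database_dir: /opt/htdig/db

# Where the dig starts.
start_url: http://www.example.com/
"""

print(strip_conf_comments(sample))
```

Only whole-line comments are dropped, so attribute values that happen to
contain a '#' are left alone.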

I've seen a user here inadvertently convert an HTML file into an
HTML-encoded text file, so that all the < and > symbols became &lt; and
&gt; entities. If that happened to the two documents above and they were
indexed in that state, the excerpts would show raw markup. The pages
were OK when I looked, so if the problem has since been corrected but
they haven't been reindexed, that could explain what you're seeing.
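
To illustrate the effect, here is a rough Python sketch. It is not
ht://Dig's parser: naive_excerpt is a hypothetical stand-in that strips
anything that looks like a tag and then decodes entities, and the sample
markup is made up. It only shows why an entity-encoded page could end up
with its markup in the excerpt.

```python
import html
import re

# Anything between '<' and '>' is treated as a tag (a deliberate simplification).
TAG_RE = re.compile(r"<[^>]*>")


def naive_excerpt(source: str) -> str:
    """Strip tags, decode entities, and collapse runs of whitespace."""
    text = TAG_RE.sub(" ", source)
    text = html.unescape(text)
    return " ".join(text.split())


# Hypothetical page fragment, and the same fragment after accidental encoding.
original = '<p><a href="http://www.example.com/">Example link</a> at a site</p>'
encoded = html.escape(original)

print(naive_excerpt(original))  # tags are recognized and removed
print(naive_excerpt(encoded))   # no real tags to remove, so the markup survives
```

In the encoded copy there are no literal < or > characters left, so
nothing is recognized as a tag, and decoding the entities puts the markup
straight back into the text.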

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930



