Re: [htdig] Indexing news articles ?


Subject: Re: [htdig] Indexing news articles ?
From: Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Date: Mon May 15 2000 - 12:23:53 PDT


According to Vincent Royer:
> As you can see above, there's an index.html file containing
> relatives links to news articles. The index.html page is
> correctly indexed but none of the articles. Moreover, apache
> use the MIME type message/news when news articles
> are browsed. Any idea ?

Yes, if the mime type is message/news, then you should define an external
converter for that mime type.

See http://www.htdig.org/attrs.html#external_parsers

E.g.:

    external_parsers: message/news->text/plain /usr/local/bin/convnews

where convnews is a script that converts the news message to plain text.
If it's already in that format, it could just be a simple shell script
that does a "cat $1". If it's already in HTML, or easily converted to
HTML, then you can use text/html instead of text/plain as the target
mime type in the external_parsers definition above.

See also http://www.htdig.org/FAQ.html#q4.8
and http://www.htdig.org/FAQ.html#q4.9
for further examples.

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

------------------------------------ To unsubscribe from the htdig mailing list, send a message to htdig-unsubscribe@htdig.org You will receive a message to confirm this.



This archive was generated by hypermail 2b28 : Mon May 15 2000 - 10:11:47 PDT