Subject: Re: [htdig] Indexing news articles ?
From: Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Date: Mon May 15 2000 - 12:23:53 PDT
According to Vincent Royer:
> As you can see above, there's an index.html file containing
> relative links to news articles. The index.html page is
> correctly indexed but none of the articles are. Moreover, Apache
> uses the MIME type message/news when news articles
> are browsed. Any ideas?
Yes, if the MIME type is message/news, then you should define an external
converter for that MIME type, so that htdig knows how to parse the articles.
See http://www.htdig.org/attrs.html#external_parsers
E.g.:
external_parsers: message/news->text/plain /usr/local/bin/convnews
where convnews is a script that converts the news message to plain text
and writes the result to standard output. If the article is already in
that format, it could just be a simple shell script that does a
cat "$1" (the file to convert is passed as the first argument). If it's
already in HTML, or easily converted to HTML, then you can use text/html
instead of text/plain as the target MIME type in the external_parsers
definition above.
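
As a rough sketch of what such a convnews script might look like (the
function name convert_news and the choice to keep only the Subject header
are my own assumptions, not anything htdig requires), something along these
lines would print the subject followed by the article body, dropping the
other headers:

```shell
#!/bin/sh
# convnews: hypothetical converter sketch for message/news -> text/plain.
# Assumes the article to convert is the file named by the first argument
# and that the converted text is expected on standard output.

convert_news() {
    # Headers end at the first blank line. Keep the Subject header text,
    # skip all other headers, then copy the body through unchanged.
    awk '
        body            { print; next }
        /^$/            { body = 1; next }
        /^[Ss]ubject:/  { sub(/^[Ss]ubject:[ \t]*/, ""); print; print "" }
    ' "$1"
}

# Only run the conversion when a file name was actually supplied.
if [ "$#" -gt 0 ]; then
    convert_news "$1"
fi
```

If the articles need no cleanup at all, the plain cat "$1" version is
simpler and does the same job.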
See also http://www.htdig.org/FAQ.html#q4.8
and http://www.htdig.org/FAQ.html#q4.9
for further examples.
-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930