Subject: Re: [htdig] Digging problem, probably css-related?
From: Thomas Rother (t.rother@gaia.de)
Date: Mon Sep 04 2000 - 08:57:54 PDT
"J. op den Brouw" wrote:
> > "text/css" not a recognized type. Assuming text
> > size = 943
>
> Htdig doesn't know what type text/css is. That is, it has no
> parser associated with it.
> By default it assumes text. If this is bad behaviour, it
> should be fixed.
What really drives me crazy is that I have
exclude_urls: /cgi-bin/ .cgi .css /css/ .htaccess suchdb/ Msgs mh.rsc suck htdig mhonarc
but the css is still parsed! Does that mean I need to add an extra
"external_parsers" statement, just for css? Hm, that doesn't make sense,
really...
And what parser would then be appropriate? /usr/bin/less ;-)?
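As a sanity check, here is a minimal Python sketch of how I understand exclude_urls to work. It assumes that a URL is rejected as soon as it contains any one of the space-separated patterns as a plain substring. This is only my reading of the attribute, not htdig's actual code. The host name intranet.example and both function names are made up for illustration.

```python
# Patterns copied from the exclude_urls line above.
EXCLUDE_PATTERNS = (
    "/cgi-bin/ .cgi .css /css/ .htaccess suchdb/ Msgs mh.rsc "
    "suck htdig mhonarc"
).split()


def matching_patterns(url, patterns=EXCLUDE_PATTERNS):
    """Return every pattern that occurs as a plain substring of url.

    Assumption: exclude_urls does simple substring matching.
    """
    return [p for p in patterns if p in url]


def is_excluded(url, patterns=EXCLUDE_PATTERNS):
    """True if at least one exclude pattern occurs in url."""
    return bool(matching_patterns(url, patterns))


if __name__ == "__main__":
    # intranet.example is a hypothetical host, not the real server.
    css_url = "http://intranet.example/css/gaia.css"
    print(css_url, "matches", matching_patterns(css_url))
    print(is_excluded("http://intranet.example/index.html"))
```

Under that assumption the stylesheet URL matches both ".css" and "/css/", so it should have been skipped, and the message in the log would have to come from somewhere else.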
The external parsers line has:
external_parsers: application/rtf->text/html /usr/local/scripts/doc2html.pl \
                  text/rtf->text/html /usr/local/scripts/doc2html.pl \
                  application/pdf->text/html /usr/local/scripts/doc2html.pl \
                  application/postscript->text/html /usr/local/scripts/doc2html.pl \
                  application/msword->text/html /usr/local/scripts/doc2html.pl \
                  application/Wordperfect5.1->text/html /usr/local/scripts/doc2html.pl
The html has the following code:
<html>
<head>
<base target="main_win">
<link href="css/gaia.css" rel="stylesheet" type="text/css">
<title>GAIA WEB INTERN Titelseite</title>
Does a simple css link really kill htdig?
Thomas
--
Thomas M. ROTHER -- 73728 Esslingen -- EU/Germany
mailto:t.rother@netzwissen.de - http://www.netzwissen.de
Public PGP Key at http://www.keyserver.net