Re: [htdig] Indexing news articles ?


Subject: Re: [htdig] Indexing news articles ?
From: Vincent Royer (vroyer@althes.fr)
Date: Mon May 15 2000 - 10:16:27 PDT


As you can see below, there's an index.html file containing
relative links to the news articles. The index.html page is
indexed correctly, but none of the articles are. Moreover, Apache
uses the MIME type message/news when the news articles
are browsed. Any ideas?

Thanks.

althes64:/var/spool/news/vuln/cli/ie # ls
. 1 11 13 15 17 19 20 22 24 4 6 8 index.html
.. 10 12 14 16 18 2 21 23 3 5 7 9
althes64:/var/spool/news/vuln/cli/ie #

althes64:/opt/www/htdig/bin # ./rundig -v

New server: althes64.althes.fr, 80
0:0:0:http://althes64.althes.fr/news/vuln/cli/ie: redirect
1:1:0:http://althes64.althes.fr/news/vuln/cli/ie/: ++++++++++++++++++++++++ size = 4627
2:2:1:http://althes64.althes.fr/news/vuln/cli/ie/5: not HTML
3:3:1:http://althes64.althes.fr/news/vuln/cli/ie/1: not HTML
4:4:1:http://althes64.althes.fr/news/vuln/cli/ie/10: not HTML
5:5:1:http://althes64.althes.fr/news/vuln/cli/ie/11: not HTML
6:6:1:http://althes64.althes.fr/news/vuln/cli/ie/12: not HTML
7:7:1:http://althes64.althes.fr/news/vuln/cli/ie/13: not HTML
8:8:1:http://althes64.althes.fr/news/vuln/cli/ie/14: not HTML
9:9:1:http://althes64.althes.fr/news/vuln/cli/ie/15: not HTML
10:10:1:http://althes64.althes.fr/news/vuln/cli/ie/16: not HTML
11:11:1:http://althes64.althes.fr/news/vuln/cli/ie/17: not HTML
12:12:1:http://althes64.althes.fr/news/vuln/cli/ie/18: not HTML
13:13:1:http://althes64.althes.fr/news/vuln/cli/ie/19: not HTML
14:14:1:http://althes64.althes.fr/news/vuln/cli/ie/2: not HTML
15:15:1:http://althes64.althes.fr/news/vuln/cli/ie/20: not HTML
16:16:1:http://althes64.althes.fr/news/vuln/cli/ie/21: not HTML
17:17:1:http://althes64.althes.fr/news/vuln/cli/ie/22: not HTML
18:18:1:http://althes64.althes.fr/news/vuln/cli/ie/23: not HTML
19:19:1:http://althes64.althes.fr/news/vuln/cli/ie/24: not HTML
20:20:1:http://althes64.althes.fr/news/vuln/cli/ie/3: not HTML
21:21:1:http://althes64.althes.fr/news/vuln/cli/ie/4: not HTML
22:22:1:http://althes64.althes.fr/news/vuln/cli/ie/6: not HTML
23:23:1:http://althes64.althes.fr/news/vuln/cli/ie/7: not HTML
24:24:1:http://althes64.althes.fr/news/vuln/cli/ie/8: not HTML
25:25:1:http://althes64.althes.fr/news/vuln/cli/ie/9: not HTML
htmerge: Sorting...
htmerge: Removing doc #0
htmerge: Removing doc #10
htmerge: Removing doc #11
htmerge: Removing doc #12
htmerge: Removing doc #13
htmerge: Removing doc #14
htmerge: Removing doc #15
htmerge: Removing doc #16
htmerge: Removing doc #17
htmerge: Removing doc #18
htmerge: Removing doc #19
htmerge: Removing doc #2
htmerge: Removing doc #20
htmerge: Removing doc #21
htmerge: Removing doc #22
htmerge: Removing doc #23
htmerge: Removing doc #24
htmerge: Removing doc #25
htmerge: Removing doc #3
htmerge: Removing doc #4
htmerge: Removing doc #5
htmerge: Removing doc #6
htmerge: Removing doc #7
htmerge: Removing doc #8
htmerge: Removing doc #9
htmerge: Merging...

Deleted, no excerpt: 0/http://althes64.althes.fr/news/vuln/cli/ie
Deleted, no excerpt: 3/http://althes64.althes.fr/news/vuln/cli/ie/1
Deleted, no excerpt: 4/http://althes64.althes.fr/news/vuln/cli/ie/10
Deleted, no excerpt: 5/http://althes64.althes.fr/news/vuln/cli/ie/11
Deleted, no excerpt: 6/http://althes64.althes.fr/news/vuln/cli/ie/12
Deleted, no excerpt: 8/http://althes64.althes.fr/news/vuln/cli/ie/14
Deleted, no excerpt: 9/http://althes64.althes.fr/news/vuln/cli/ie/15
Deleted, no excerpt: 10/http://althes64.althes.fr/news/vuln/cli/ie/16
Deleted, no excerpt: 11/http://althes64.althes.fr/news/vuln/cli/ie/17
Deleted, no excerpt: 12/http://althes64.althes.fr/news/vuln/cli/ie/18
Deleted, no excerpt: 13/http://althes64.althes.fr/news/vuln/cli/ie/19
Deleted, no excerpt: 14/http://althes64.althes.fr/news/vuln/cli/ie/2
Deleted, no excerpt: 15/http://althes64.althes.fr/news/vuln/cli/ie/20
Deleted, no excerpt: 16/http://althes64.althes.fr/news/vuln/cli/ie/21
Deleted, no excerpt: 17/http://althes64.althes.fr/news/vuln/cli/ie/22
Deleted, no excerpt: 18/http://althes64.althes.fr/news/vuln/cli/ie/23
Deleted, no excerpt: 19/http://althes64.althes.fr/news/vuln/cli/ie/24
Deleted, no excerpt: 20/http://althes64.althes.fr/news/vuln/cli/ie/3
Deleted, no excerpt: 21/http://althes64.althes.fr/news/vuln/cli/ie/4
Deleted, no excerpt: 2/http://althes64.althes.fr/news/vuln/cli/ie/5
Deleted, no excerpt: 22/http://althes64.althes.fr/news/vuln/cli/ie/6
Deleted, no excerpt: 23/http://althes64.althes.fr/news/vuln/cli/ie/7
Deleted, no excerpt: 24/http://althes64.althes.fr/news/vuln/cli/ie/8
Deleted, no excerpt: 25/http://althes64.althes.fr/news/vuln/cli/ie/9

althes64:/opt/www/htdig/bin #
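
The "not HTML" lines above suggest the articles are being rejected on their Content-Type rather than on their file names. Here is a minimal Python sketch of that kind of check. The accepted set and the function name are illustrative assumptions, not ht://Dig's actual code or its real list of parsable types:

```python
# Hypothetical set of media types that an HTML/plain-text-only indexer would parse.
# This is an assumption for illustration, not taken from ht://Dig itself.
INDEXABLE_TYPES = {"text/html", "text/plain"}


def is_indexable(content_type_header: str) -> bool:
    """Return True if the media type (ignoring parameters) is in INDEXABLE_TYPES."""
    # Drop parameters such as "; charset=..." and normalise case.
    media_type = content_type_header.split(";", 1)[0].strip().lower()
    return media_type in INDEXABLE_TYPES


if __name__ == "__main__":
    for header in ("text/html; charset=iso-8859-1", "message/news"):
        verdict = "index" if is_indexable(header) else "skip"
        print(f"{header} -> {verdict}")
```

Under this assumption, a document served as message/news would be skipped even though its body is mostly plain text, which matches the symptom in the log.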

______________________________________________________________
althes64:/opt/www/htdig/bin # netcat althes64 80
GET /news/vuln/cli/ie/8 HTTP/1.0

HTTP/1.1 200 OK
Date: Mon, 15 May 2000 19:58:31 GMT
Server: Apache/1.3.12 (Unix) (SuSE/Linux)
Last-Modified: Fri, 28 Apr 2000 18:01:46 GMT
ETag: "c2a4e-b45-3909d20a"
Accept-Ranges: bytes
Content-Length: 2885
Connection: close
Content-Type: message/news
Content-Encoding: 8bit

Path: mailix.althes.fr!not-for-mail
From: root <vroyer@althes.fr>
Newsgroups: alt.vuln.cli.ie
Subject: IE can read local files and spoof windows. Vulnerable: IE 4.0, 4.01
Date: Wed, 26 Apr 2000 19:17:01 +0200
Organization: Althes
Lines: 66
Message-ID: <3907248D.A59DCE7B@althes.fr>
NNTP-Posting-Host: mailix.althes.fr
Mime-Version: 1.0
Content-Type: multipart/mixed;
 boundary="------------AC045772918EC853BAC3DDFA"
X-Trace: mailix.althes.fr 956769421 18539 172.21.1.128 (26 Apr 2000 17:17:01 GMT)
X-Complaints-To: news@mailix.althes.fr
NNTP-Posting-Date: 26 Apr 2000 17:17:01 GMT
X-Mailer: Mozilla 4.7 [en] (X11; I; Linux 2.2.13 i586)
X-Accept-Language: en
Xref: mailix.althes.fr alt.vuln.cli.ie:8

This is a multi-part message in MIME format.
--------------AC045772918EC853BAC3DDFA
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
.....
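
Since the article is an RFC 822 style message with a MIME body, one possible workaround is to convert it to plain text before indexing. Below is a rough Python sketch of such a conversion using the standard `email` module. The function name and the sample article are made up for illustration, and how a script like this would be hooked into ht://Dig or Apache is not shown, since that depends on the versions in use:

```python
from email import message_from_string


def article_to_text(raw_article: str) -> str:
    """Return the Subject line plus all text/plain parts of a news article."""
    msg = message_from_string(raw_article)
    subject = msg.get("Subject", "")
    bodies = []
    # walk() visits the container and every nested part; keep only text/plain.
    for part in msg.walk():
        if part.get_content_type() == "text/plain":
            payload = part.get_payload(decode=True)  # bytes, transfer-decoded
            charset = part.get_content_charset() or "us-ascii"
            bodies.append(payload.decode(charset, errors="replace"))
    return subject + "\n\n" + "\n".join(bodies)


# Hypothetical sample, shaped like the article above but shortened.
SAMPLE = (
    "Newsgroups: alt.vuln.cli.ie\n"
    "Subject: Example subject\n"
    "Mime-Version: 1.0\n"
    'Content-Type: multipart/mixed; boundary="XYZ"\n'
    "\n"
    "This is a multi-part message in MIME format.\n"
    "--XYZ\n"
    "Content-Type: text/plain; charset=us-ascii\n"
    "Content-Transfer-Encoding: 7bit\n"
    "\n"
    "Body text of the article.\n"
    "--XYZ--\n"
)

if __name__ == "__main__":
    print(article_to_text(SAMPLE))
```

The output keeps only the subject and the readable body, so the NNTP headers (Path, Xref, X-Trace and so on) would not end up in the search excerpts.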

Geoff Hutchison wrote:

> At 3:42 PM +0300 5/15/00, Vincent Royer wrote:
> >Hi,
> >in files named by a number without any extension (1, 2, etc ...).
> >An index.html page contains links to these articles.
> >Although you can set valid and bad extensions in the configuration file,
> >is there a way to index files without any extension?
>
> If they are linked and you are indexing over HTTP (or if you don't
> have local_urls_only set), then they will be indexed.
>
> Most likely, your webserver will send them as text files.
>
> --
> -Geoff Hutchison
> Williams Students Online
> http://wso.williams.edu/
