Re: [htdig] update vs. initial digging


Joseph Cheek (joseph@cheek.com)
Sat, 29 May 1999 03:01:01 -0700


hello,

this is what htdig -vvv -h 0 gave me [which did not stop after the first url,
btw...] Date returned seemed to be current date of dig.

seattle:/var/spool/news# /opt/htdig/bin/htdig -a -s -c /opt/htdig/conf/linuxnews
.conf -vvv -h 0
Warning: unknown locale!
BAD TAG IN SERIALIZED DATA: 109
BAD TAG IN SERIALIZED DATA: 119
        0:0:http://linuxnews.cheek.com/
New server: linuxnews.cheek.com, 80
Retrieval command for http://linuxnews.cheek.com/robots.txt: GET /robots.txt HTT
P/1.0
User-Agent: htdig/3.1.1 (bogus@unconfigured.htdig.user)
Host: linuxnews.cheek.com

Header line: HTTP/1.1 404 Not Found
Header line: Date: Sat, 29 May 1999 09:52:19 GMT
Header line: Server: Apache/1.3.6 (Unix) FrontPage/3.0.4.2 PHP/3.0.7
Header line: Connection: close
Header line: Content-Type: text/html
Header line:
returnStatus = 1
 pushed
        0:0:http://linuxnews.cheek.com/a.biaies.mis/ pushed
        0:0:http://linuxnews.cheek.com/a.biaies.mis/269623.php pushed
        0:0:http://linuxnews.cheek.com/a.biaies.mis/269636.php pushed

[...snip... about 16000+ of these 'pushed' lines...]

        0:0:http://linuxnews.cheek.com/utah.linux/1409.php pushed
        0:0:http://linuxnews.cheek.com/utah.linux/1410.php pushed
        0:0:http://linuxnews.cheek.com/utah.linux/1411.php pushed
        1:0:http://linuxnews.cheek.com/ skipped
pick: linuxnews.cheek.com, # servers = 1
0:0:255:http://linuxnews.cheek.com/: Retrieval command for http://linuxnews.chee

Header line: HTTP/1.1 200 OK
Header line: Date: Sat, 29 May 1999 09:52:45 GMT
Header line: Server: Apache/1.3.6 (Unix) FrontPage/3.0.4.2 PHP/3.0.7
Header line: Connection: close
Header line: Content-Type: text/html
Header line:
returnStatus = 0
Read 8192 from document
Read 8192 from document
Read 7736 from document
Read a total of 24120 bytes
 retrieved but not changed
pick: linuxnews.cheek.com, # servers = 1
1:2:1:http://linuxnews.cheek.com/a.biaies.mis/: Retrieval command for http://lin

Header line: HTTP/1.1 200 OK
Header line: Date: Sat, 29 May 1999 09:52:46 GMT
Header line: Server: Apache/1.3.6 (Unix) FrontPage/3.0.4.2 PHP/3.0.7
Header line: Connection: close
Header line: Content-Type: text/html
Header line:
returnStatus = 0
Read 743 from document
Read a total of 743 bytes
 retrieved but not changed

======
so Date: returned is current time, plus i'm still getting the "retrieved but not
changed". 8-(

thanks!

joe

Geoff Hutchison wrote:

> On Fri, 28 May 1999, Joseph Cheek wrote:
>
> > so since all files are getting the "retrieved but not changed" message, does
> > that mean that apache is telling htdig that nothing has changed in the document
> > root of http://linuxnews.cheek.com/? if so, that is a blatant lie 8-). is
>
> That would be the implication--that Apache is sending a Last Modified
> header with the same date.
>
> > there any way to verify this, somehow by telnetting to port 80 and typing a
> > request in by hand?
>
> You can get HTTP headers by any number of means. You can use a tool like
> curl or pavuuk, or by running htdig -vvv -h 0 (which will limit to the
> first page).
>
> -Geoff
>
> ------------------------------------
> To unsubscribe from the htdig mailing list, send a message to
> htdig@htdig.org containing the single word "unsubscribe" in
> the SUBJECT of the message.

--
      ___            ___
   __ | |_   __   __ | |_      __   __   _____  * Joseph Cheek, Director
  / _)|   \ / _) / _)|  _)    / _) /  \ |     | * joseph@cheek.com or
 ( (_ | | |(  _)(  _)|  \  _ ( (_ ( () )| |_| | * (877) CHEEK.COM
  \__)|_|_| \__) \__)|_\_)(_) \__) \__/ |_| |_| * http://www.cheek.com/
    Cheek Consulting, Seattle, provides Linux and Internet solutions
   linux * web commerce * html * java * perl * php * informix * mysql

------------------------------------ To unsubscribe from the htdig mailing list, send a message to htdig@htdig.org containing the single word "unsubscribe" in the SUBJECT of the message.



This archive was generated by hypermail 2.0b3 on Sat May 29 1999 - 02:18:51 PDT