Subject: [htdig3-dev] FW: [htdig3-dev] HtDig Perl Modules on CPAN
From: Ben Parker (ben@ocha.unon.org)
Date: Wed May 03 2000 - 02:32:29 PDT
Could someone put the latest HtDig::Database or a link thereto on
sourceforge too?
This is the link:
ftp://ftp.halcyon.com/pub/users/wjones/HtDig-Database-0.52.tar.gz
It fixes a small bug, mentioned below, regarding file extensions.
I retract my alleged bug on dates with HtDig::Database - it's some Front
Page nonsense on the site I'm indexing.
Meanwhile I am working on a better version of newwhatsnew.pl using this
HtDig::Database 0.52.
Ben
-----Original Message-----
From: Warren Jones [mailto:wjones@tc.fluke.com]
Sent: Thursday, April 20, 2000 8:44 PM
To: Ben Parker
Subject: Re: [htdig3-dev] HtDig Perl Modules on CPAN
On Mon, Apr 03, 2000 at 09:36:55AM +0300, Ben Parker wrote:
> Please upload Warren's HtDig::Database there (but it does not have a perl
> Makefile.PL installation). I have been using it to fix the whatsnew.pl
> script and have found it extremely useful. IMHO so far, there are two
> possible bugs in it:
>
> 1. Files ending .htm are reported as ending in .gif
>
> 2. Some problems with correctly reporting modification dates
>
> It would be a useful basis for anyone trying to pull data from / parse the
> .db files in Perl.
>
> I hope if more people will use it, they can help me isolate these little
> problems.
Hi Ben,
Sorry to be so tardy in responding to your message. I haven't
been keeping up with the ht://Dig lists, and have had procmail
filing away mail from the list as it comes in. I just got around
to looking through the folder yesterday, and was chagrined to
find another question about the HtDig::Database module that you
sent way last February. I don't usually ignore mail like that.
Anyway, about the confusion of .htm and .gif ... If you haven't
figured it out already, the fix is to change the value of
@default_url_parts in Database.pm from this:
    my @default_url_parts = qw(
        http:// http://www. ftp:// ftp://ftp. /pub/
        .html .gif .jpg .jpeg /index.html /index.htm
        .com/ .com mailto:
    );
to this:
    my @default_url_parts = qw(
        http:// http://www. ftp:// ftp://ftp. /pub/
        .html .htm .gif .jpg .jpeg /index.html /index.htm
        .com/ .com mailto:
    );
This list needs to match the value of "common_url_parts" in
htcommon/defaults.cc. Thanks for catching the bug!
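To illustrate why a single missing entry produces exactly the .htm/.gif
confusion, here is a minimal sketch. It assumes each common URL part is
stored as a one-byte code derived from its position in the list; that
scheme, and the helper names encode_url and decode_url, are hypothetical
and are not taken from ht://Dig or from Database.pm. Only the two lists
come from this message.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# List assumed to be used when the database is written (includes .htm).
my @writer_parts = qw(
    http:// http://www. ftp:// ftp://ftp. /pub/
    .html .htm .gif .jpg .jpeg /index.html /index.htm
    .com/ .com mailto:
);

# List used by the module before the fix (missing .htm).
my @reader_parts = qw(
    http:// http://www. ftp:// ftp://ftp. /pub/
    .html .gif .jpg .jpeg /index.html /index.htm
    .com/ .com mailto:
);

# HYPOTHETICAL scheme: replace each common part with the byte whose
# value is (list position + 1). Not the real ht://Dig encoding.
sub encode_url {
    my ($url, @parts) = @_;
    my %code;
    @code{@parts} = map { chr($_ + 1) } 0 .. $#parts;
    # Try longer parts first so that .html is preferred over .htm.
    my $alt = join '|',
        map { quotemeta($_) }
        sort { length($b) <=> length($a) } @parts;
    $url =~ s/($alt)/$code{$1}/g;
    return $url;
}

# Reverse the hypothetical scheme: map each code byte back to the part
# at that position in whatever list the reader was given.
sub decode_url {
    my ($encoded, @parts) = @_;
    $encoded =~ s/([\x01-\x1f])/
        my $i = ord($1) - 1;
        $i < @parts ? $parts[$i] : $1
    /gex;
    return $encoded;
}

my $url     = 'http://www.example.org/page.htm';
my $encoded = encode_url($url, @writer_parts);

print decode_url($encoded, @writer_parts), "\n";  # matching lists
print decode_url($encoded, @reader_parts), "\n";  # mismatched lists
```

Under this assumption, .htm sits at the position where the shorter list
has .gif, so every entry from that point on is shifted by one when the
lists disagree.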
I haven't seen the problems you mention with modification
dates, but if you can provide any more details I'll look
into it.
I've just put together a new version of the module with your
bug fix and a few other minor changes, which you can get from:
ftp://ftp.halcyon.com/pub/users/wjones/HtDig-Database-0.51.tar.gz
However, the future of this module is unclear. Because of changes
in the database format, it won't work with ht://Dig 3.2. It really
should be replaced by an XS module, but I don't know when I'll
get time to work on that.
-- Warren