[htdig] ANNOUNCE: ht://Dig version 3.1.0 RPMs for Red Hat

Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Thu, 11 Feb 1999 18:06:42 -0600 (CST)

I'm attempting to upload source and binary RPMs for the ht://Dig 3.1.0
web site search engine to incoming.redhat.com, for eventual inclusion
on contrib.redhat.com. Their ftp server isn't being very co-operative
right now, but I'll keep trying. However, I'll issue the announcement
right away. In the interim, the RPMs can be downloaded from the SCRC web
site, at


The following RPMs were built on Red Hat Linux 4.2 and 5.0* respectively:

htdig-3.1.0-1glibc.i386.rpm * (see note below)
htdig-3.1.0-1glibc.src.rpm * (see note below)

If you've already installed one of the earlier 3.1.0b? releases, this release
is an update to the final 3.1.0 release, though the rpm command will think
it's an older version. You'll need to update with:

        rpm -Uvh --oldpackage htdig-3.1.0-0.{arch}.rpm

Run /usr/sbin/rundig after installing, to rebuild all your databases.


Name : htdig Distribution: (none)
Version : 3.1.0 Vendor: (none)
Release : 0 Build Date: Wed Feb 10 15:24:54 1999
Install date: Wed Feb 10 16:36:58 1999 Build Host: cliff.scrc.umanitoba.ca
Group : Networking/Utilities Source RPM: htdig-3.1.0-0.src.rpm
Size : 2976267
Packager : Gilles Detillieux <grdetil@scrc.umanitoba.ca>
URL : http://www.htdig.org/
Summary : A web indexing and searching system for a small domain or intranet
Description :
The ht://Dig system is a complete world wide web indexing and searching
system for a small domain or intranet. This system is not meant to replace
the need for powerful internet-wide search systems like Lycos, Infoseek,
Webcrawler and AltaVista. Instead it is meant to cover the search needs for
a single company, campus, or even a particular sub section of a web site.

As opposed to some WAIS-based or web-server based search engines, ht://Dig
can span several web servers at a site. The type of these different web
servers doesn't matter as long as they understand the HTTP 1.0 protocol.


* Note to Red Hat 5.0 & 5.1 users:

There's an obscure bug in vixie-cron on Red Hat 5.0 and 5.1 systems,
in its SIGCHLD signal handling. It causes htmerge to fail consistently
with a "Word sort failed" error, when run from a cron job. It could
potentially cause similar problems with other jobs. I recommend upgrading
to the latest vixie-cron from the 5.2 distribution:


Unfortunately, even though Red Hat discovered and fixed the problem back
in June, they did not mention it in their errata or issue update RPMs.
They can be obtained from any Red Hat Linux distribution mirror site, or
along with the htdig RPMs from my web site above.


   Release notes for htdig-3.1.0 9 Feb 1999
   This version marks the "full release" of version 3.1.0. Naturally,
   this version adds a few new features and fixes a large number of
   remaining bugs. This version is the latest stable release of ht://Dig
   and is recommended for all production servers for current bug-fixes
   and oft-requested features.

     NOTE: You must rebuild your databases from scratch after updating
     to this version. Several database-related bugs were fixed and will
     remain unless you rebuild from scratch. We're sorry for any

     * Fixed a variety of small memory leaks.
     * Fixed a bug that could duplicate documents in the document
     * Fixed a bug that would not remove documents marked as deleted.
     * Fixed a bug that could dump core with incorrectly defined
       template_map attributes.
     * Fixed a bug that could dump core or produce bogus dates when a
       server returns the date in an incorrect format.
     * Fixed a variety of string-matching bugs that caused problems with
       restricting indexing and searching.
     * Fixed a bug that could dump core if logging searches and CGI
       environment variables were not set.
     * Fixed a bug that would not hilight searches properly if they
       contained punctuation.
     * Fixed PDF parsing to support programs beyond acroread.
     * Fixed a bug that caused problems with large robots.txt files.
     * Fixed a bug in the sample rundig script from a non-portable test
       for the age of databases.
     * Fixed bugs in the fuzzy matching code that could prevent searches
       from completing if fuzzy databases were not present.
     * Fixed bugs in the soundex and metaphone algorithms that would only
       return the first word of several matching words. Note that to
       completely fix this bug, you must rebuild your soundex and
       metaphone databases.
     * Fixed up many compilation warnings and errors.
     * Fixed a performance slowdown in htsearch when backlink_factor
       and date_factor are zero and can be ignored.
     * Improved performance when a server ignores the If-Modified-Since
       request during update digs.
     * Added a warning message if the locale: option is set to a locale
       that is not present.
     * Some minor performance improvements.
     * Allow "include" keyword in config file to include other config
     * Uses latest (2.6.4) version of the Berkeley database.
     * Two databases may be merged together using htmerge.
     * The htdig program can be safely stopped and restarted in the
       middle of a dig. The dig will write the progress to the file
       specified by the new url_log option.
     * Added support for anchors in excerpts with the
       add_anchors_to_excerpt option and the ANCHOR template variable.
     * Added support for sorting results in increasing or decreasing
       order of document date, size, title and score using the search
       form. Note that changing sort from the default of score will
       result in a performance decrease.
     * Added config options sort and sort_names to change the
       default sort and names used in the SORT template variable.
     * Added the option compression_level to compress the document
       database if the zlib library is present.
     * Added the options noindex_start and noindex_stop to
       delimit sections of HTML documents to be ignored.
     * Added the option allow_in_form to allow specific config
       options to be set in the search form.
     * Added the option bad_querystr to ingore URLs containing
       specified CGI queries.
     * Added the option search_results_wrapper to replace separate
       header and footer files. For mor information, see the general
       htsearch documentation.
     * Added option no_title_text to allow configuration of the text
       used when no title is found.
     * Added option url_part_aliases to allow rewriting portions of
     * Added option common_url_parts to compress common portions
       of URLs. Requires rebuilding databases when changed.
     * Added option remove_default_doc to control whether ht://Dig
       strips off the default document in a folder. Set to empty will
       prevent problems with servers that treat / and /index.html as
       different URLs.
     * Of course there are many other bug-fixes and small enhancements.
       Many thanks to everyone who reported a bug or contributed code for
       this release!

   The RPMs also contain my patch to fix parse_date in htnotify.cc, so it
   works as documented, and Hans-Peter Nilsson's patch to DocumentRef.cc,
   to correct the alignment bug that plagued sparc users.

Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930
To unsubscribe from the htdig mailing list, send a message to
htdig@htdig.org containing the single word "unsubscribe" in
the SUBJECT of the message.

This archive was generated by hypermail 2.0b3 on Wed Feb 17 1999 - 10:10:02 PST