Re: [htdig] Valid punctuation

Subject: Re: [htdig] Valid punctuation
From: Gilles Detillieux
Date: Tue Nov 30 1999 - 08:17:59 PST

According to J. op den Brouw:
> As I see it, valid_punctuation merely deletes characters from
> words. But what about replacing them with spaces so they act
> like word boundaries?
> Andrew's will be Andrews, okay, but www.mydomain.com will be
> wwwmydomaincom. If you leave out the dot, the dot will be
> a character as in last_word_for_sentence.
> So what I want is to replace dot with space. Any clues on
> this subject?

As of 3.1.3, htdig not only strips valid_punctuation characters out of
words, but also breaks words up at the punctuation, so that post-doctoral
will put postdoctoral, post and doctoral in the words database.
If you can't upgrade to 3.1.3, look in the patch archive for 3.1.2
for my compound words patch. If you do upgrade, be sure to apply the
urlparmbug patch to 3.1.3. (BTW, 3.1.4 will be out before too long,
with several more fixes.)
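
As a rough illustration of the compound-word behaviour described above,
here is a small Python sketch. It is not htdig's actual code (htdig is
written in C++). The function name index_terms and the punctuation set
passed as the default are assumptions made for the example, and the
sketch ignores other indexing rules such as lowercasing or minimum word
length.

```python
def index_terms(word, valid_punctuation=".-'"):
    """Return the terms one word would contribute to the index.

    The first term is the word with all valid_punctuation characters
    stripped; the remaining terms are the pieces found by splitting
    the word at those same characters.
    """
    # Joined form: simply delete every punctuation character.
    joined = "".join(ch for ch in word if ch not in valid_punctuation)

    # Split form: treat each punctuation character as a word boundary.
    parts = []
    current = ""
    for ch in word:
        if ch in valid_punctuation:
            if current:
                parts.append(current)
            current = ""
        else:
            current += ch
    if current:
        parts.append(current)

    # Collect the terms in order, skipping duplicates (a word with no
    # punctuation yields just itself).
    terms = [joined] if joined else []
    for part in parts:
        if part not in terms:
            terms.append(part)
    return terms


print(index_terms("post-doctoral"))     # ['postdoctoral', 'post', 'doctoral']
print(index_terms("www.mydomain.com"))  # ['wwwmydomaincom', 'www', 'mydomain', 'com']
```

With this kind of splitting, a search for either the joined form or any
of the individual pieces can match the original word.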

Gilles R. Detillieux
Spinal Cord Research Centre
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

This archive was generated by hypermail 2b25 : Tue Nov 30 1999 - 08:30:15 PST