Re: [htdig] Valid Punctuation Question


Subject: Re: [htdig] Valid Punctuation Question
From: Reich, Stefan (Stefan.Reich@dgn-service.de)
Date: Thu Oct 26 2000 - 01:08:04 PDT


Hi Geoff,

With extra_word_characters I have a different problem.

On the one hand, I want 1998-10-11 to be treated like 19981011. So if the
document contains 1998-10-11, a search for 19981011 should give me a result
too. (-> Valid Punctuation)

On the other hand, I want a result only if I search for the full string, and
no match for a search on 1998 alone.

My dilemma: Valid Punctuation strips the - but splits the string too.
            Extra Word Characters doesn't split the string, but doesn't
            remove the - either.

So is there an option that combines both?

I have solved the problem in a different way for now, but it would be good to
know if there is another option.
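
To make the wanted behaviour concrete, here is a minimal Python sketch. It is
only a toy model of the three behaviours as I describe them above, not htdig's
real word parser, and all function names are hypothetical.

```python
import re

# Toy model only: these functions imitate the behaviours described in this
# message, not htdig's real word parser. All names are hypothetical.

def split_on_punct(text):
    """'-' ends a word, so 1998-10-11 yields 1998, 10 and 11."""
    return re.findall(r"[A-Za-z0-9]+", text)

def keep_punct(text):
    """'-' is part of the word, so 1998-10-11 stays one word, hyphens included."""
    return re.findall(r"[A-Za-z0-9-]+", text)

def strip_without_split(text):
    """Wanted combination: keep the word whole, then drop the '-' characters."""
    words = (w.replace("-", "") for w in keep_punct(text))
    return [w for w in words if w]  # discard words that were only hyphens

doc = "Report dated 1998-10-11"
print(split_on_punct(doc))       # ['Report', 'dated', '1998', '10', '11']
print(keep_punct(doc))           # ['Report', 'dated', '1998-10-11']
print(strip_without_split(doc))  # ['Report', 'dated', '19981011']
```

With the third variant an exact-word search for 19981011 would match and a
search for 1998 alone would not, provided the search string is normalised the
same way as the document text.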

-----Original Message-----
From: Geoff Hutchison [mailto:ghutchis@wso.williams.edu]
Sent: Thursday, October 26, 2000 05:20
To: Reich, Stefan
Cc: 'htdig@htdig.org'
Subject: Re: [htdig] Valid Punctuation Question

At 4:02 PM +0200 10/25/00, Reich, Stefan wrote:
>In my document I have a date string like 1998-10-05
>I set valid punctuation to -

This will treat the - character as a word separator. (For example,
periods in sentences are, by default, treated as whitespace.)

You want extra_word_chars instead:

<http://www.htdig.org/attrs.html#extra_word_chars>

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

------------------------------------ To unsubscribe from the htdig mailing list, send a message to htdig-unsubscribe@htdig.org You will receive a message to confirm this. List archives: <http://www.htdig.org/mail/menu.html> FAQ: <http://www.htdig.org/FAQ.html>




This archive was generated by hypermail 2b28 : Thu Oct 26 2000 - 01:15:30 PDT