htdig: Patches for htdoc - missing attributes/bad defaults


Gilles Detillieux (grdetil@scrc.umanitoba.ca)
Wed, 6 Jan 1999 14:58:27 -0600 (CST)


I had noticed some discrepancies between the default values listed in
htdoc/attrs.html and htcommon/defaults.cc, so I finally took the time
to compare them fairly extensively, and here are the corrections I came
up with. By the way, I always thought that .tgz and .rpm should be added
to the list of default bad_extensions, but I'll leave that to Geoff. :)
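
A comparison like this can also be scripted. What follows is only a rough sketch, not the procedure used for this patch: it assumes the attribute markup in attrs.html looks like the hunks below (an `<a name="...">` anchor, then a `<dd>` after the `<em>default:</em>` term), and it takes the compiled-in defaults as a plain dictionary, since the layout of htcommon/defaults.cc is not shown here. The names `doc_defaults`, `mismatches`, `attr_a` and `attr_b` are made up for illustration.

```python
import html
import re

# Anchor that opens each attribute entry, e.g. <a name="common_dir">.
ATTR_RE = re.compile(r'<a name="([^"]+)">')
# The <dd> after the "default:" term; it is optional because some
# entries were missing it entirely.
DEFAULT_RE = re.compile(
    r'<em>default:</em>\s*</dt>\s*(?:<dd>(.*?)</dd>)?', re.S)


def doc_defaults(page):
    """Map each attribute name to the default documented for it."""
    defaults = {}
    # With one capture group, split() yields [preamble, name, body, name, body, ...].
    parts = ATTR_RE.split(page)
    for name, body in zip(parts[1::2], parts[2::2]):
        match = DEFAULT_RE.search(body)
        if match is None:
            continue  # entry has no "default:" term at all
        raw = match.group(1) or ""
        # Drop tags, decode entities, collapse whitespace.
        text = html.unescape(re.sub(r"<[^>]+>", "", raw))
        text = " ".join(text.split())
        # Assumption: treat the "<empty>" marker as an empty string.
        defaults[name] = "" if text == "<empty>" else text
    return defaults


def mismatches(documented, compiled):
    """Return {name: (documented, compiled)} where the two disagree."""
    return {name: (documented.get(name), value)
            for name, value in compiled.items()
            if documented.get(name) != value}


# Hypothetical two-entry page; the second entry lacks its <dd>.
sample = """
<dt><strong><a name="attr_a">attr_a</a></strong></dt>
<dd><dl>
  <dt><em>default:</em></dt>
  <dd>
    or
  </dd>
</dl></dd>
<dt><strong><a name="attr_b">attr_b</a></strong></dt>
<dd><dl>
  <dt><em>default:</em></dt>
  <dt><em>description:</em></dt>
</dl></dd>
"""

documented = doc_defaults(sample)
compiled = {"attr_a": "and", "attr_b": ""}  # stand-in for defaults.cc values
print(mismatches(documented, compiled))
```

The dictionary of compiled defaults would still have to be filled in by hand or by a separate parser for defaults.cc, so this only covers the documentation side of the check.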

Wed Jan 6 14:49:47 1999 Gilles Detillieux <grdetil@scrc.umanitoba.ca>

        * attrs.html: Added four new attributes, fixed defaults & typos.
        * cf_byname.html: Added four new attributes.
        * cf_byprog.html: Added four new attributes.

--- htdoc/attrs.html.orig Tue Dec 22 19:53:13 1998
+++ htdoc/attrs.html Wed Jan 6 14:22:04 1999
@@ -226,7 +226,7 @@
           </dt>
           <dd>
             .wav .gz .z .sit .au .zip .tar .hqx .exe .com .gif .jpg
- .jpeg .aiff
+ .jpeg .aiff .class .map .ram
           </dd>
           <dt>
             <em>description:</em>
@@ -339,7 +339,6 @@
       </dd>
     </dl>
     <hr>
- <hr>
     <dl>
       <dt>
         <strong><a name="common_dir">common_dir</a></strong>
@@ -1151,7 +1150,7 @@
             <em>default:</em>
           </dt>
           <dd>
- cgi-bin
+ cgi-bin .cgi
           </dd>
           <dt>
             <em>description:</em>
@@ -1194,6 +1193,9 @@
           <dt>
             <em>default:</em>
           </dt>
+ <dd>
+ <em>&lt;empty&gt;</em>
+ </dd>
           <dt>
             <em>description:</em>
           </dt>
@@ -1810,7 +1812,7 @@
             <em>default:</em>
           </dt>
           <dd>
- none
+ <em>&lt;empty&gt;</em>
           </dd>
           <dt>
             <em>description:</em>
@@ -1857,7 +1859,7 @@
             <em>default:</em>
           </dt>
           <dd>
- .sdsu.edu/
+ ${start_url}
           </dd>
           <dt>
             <em>description:</em>
@@ -1950,7 +1952,7 @@
             <em>default:</em>
           </dt>
           <dd>
- None
+ <em>&lt;empty&gt;</em>
           </dd>
           <dt>
             <em>description:</em>
@@ -1996,7 +1998,7 @@
             <em>default:</em>
           </dt>
           <dd>
- None
+ <em>&lt;empty&gt;</em>
           </dd>
           <dt>
             <em>description:</em>
@@ -2133,7 +2135,7 @@
             <em>default:</em>
           </dt>
           <dd>
- badguy@localhost
+ bogus@unconfigured.htdig.user
           </dd>
           <dt>
             <em>description:</em>
@@ -2176,7 +2178,7 @@
             <em>default:</em>
           </dt>
           <dd>
- or
+ and
           </dd>
           <dt>
             <em>description:</em>
@@ -2436,6 +2438,49 @@
     <hr>
     <dl>
       <dt>
+ <strong><a name="max_meta_description_length">
+ max_meta_description_length</a></strong>
+ </dt>
+ <dd>
+ <dl>
+ <dt>
+ <em>type:</em>
+ </dt>
+ <dd>
+ number
+ </dd>
+ <dt>
+ <em>used by:</em>
+ </dt>
+ <dd>
+ <a href="htdig.html">htdig</a>
+ </dd>
+ <dt>
+ <em>default:</em>
+ </dt>
+ <dd>
+ 512
+ </dd>
+ <dt>
+ <em>description:</em>
+ </dt>
+ <dd>
+ While gathering descriptions from meta description tags,
+ <a href= "htdig.html">htdig</a> will truncate
+ descriptions which are longer than this length.
+ </dd>
+ <dt>
+ <em>example:</em>
+ </dt>
+ <dd>
+ max_meta_description_length: 1000
+ </dd>
+ </dl>
+ </dd>
+ </dl>
+ <hr>
+ <dl>
+ <dt>
         <strong><a name="max_prefix_matches">
         max_prefix_matches</a></strong>
       </dt>
@@ -2523,6 +2568,50 @@
     <hr>
     <dl>
       <dt>
+ <strong><a name="maximum_pages">
+ maximum_pages</a></strong>
+ </dt>
+ <dd>
+ <dl>
+ <dt>
+ <em>type:</em>
+ </dt>
+ <dd>
+ integer
+ </dd>
+ <dt>
+ <em>used by:</em>
+ </dt>
+ <dd>
+ <a href="htsearch.html" target="_top">htsearch</a>
+ </dd>
+ <dt>
+ <em>default:</em>
+ </dt>
+ <dd>
+ 10
+ </dd>
+ <dt>
+ <em>description:</em>
+ </dt>
+ <dd>
+ This value limits the number of page links that will be
+ included in the page list at the bottom of the search
+ results page. Note that this does not limit the number
+ of documents that are matched in any way.
+ </dd>
+ <dt>
+ <em>example:</em>
+ </dt>
+ <dd>
+ maximum_pages: 20
+ </dd>
+ </dl>
+ </dd>
+ </dl>
+ <hr>
+ <dl>
+ <dt>
         <strong><a name="meta_description_factor">
         meta_description_factor</a></strong>
       </dt>
@@ -2968,6 +3057,51 @@
     <hr>
     <dl>
       <dt>
+ <strong><a name="no_page_list_header">
+ no_page_list_header</a></strong>
+ </dt>
+ <dd>
+ <dl>
+ <dt>
+ <em>type:</em>
+ </dt>
+ <dd>
+ string
+ </dd>
+ <dt>
+ <em>used by:</em>
+ </dt>
+ <dd>
+ <a href="htsearch.html" target="_top">htsearch</a>
+ </dd>
+ <dt>
+ <em>default:</em>
+ </dt>
+ <dd>
+ <em>&lt;empty&gt;</em>
+ </dd>
+ <dt>
+ <em>description:</em>
+ </dt>
+ <dd>
+ This text will be used as the value of the PAGEHEADER
+ variable, for use in templates or the <a href=
+ "#search_results_footer">search_results_footer</a> file,
+ when all search results fit on a single page.
+ </dd>
+ <dt>
+ <em>example:</em>
+ </dt>
+ <dd>
+ no_page_list_header:
+ &lt;hr noshade size=2&gt;All results on this page.&lt;br&gt;
+ </dd>
+ </dl>
+ </dd>
+ </dl>
+ <hr>
+ <dl>
+ <dt>
         <strong><a name="no_prev_page_text">
         no_prev_page_text</a></strong>
       </dt>
@@ -3057,6 +3191,50 @@
     <hr>
     <dl>
       <dt>
+ <strong><a name="page_list_header">
+ page_list_header</a></strong>
+ </dt>
+ <dd>
+ <dl>
+ <dt>
+ <em>type:</em>
+ </dt>
+ <dd>
+ string
+ </dd>
+ <dt>
+ <em>used by:</em>
+ </dt>
+ <dd>
+ <a href="htsearch.html" target="_top">htsearch</a>
+ </dd>
+ <dt>
+ <em>default:</em>
+ </dt>
+ <dd>
+ &lt;hr noshade size=2&gt;Pages:&lt;br&gt;
+ </dd>
+ <dt>
+ <em>description:</em>
+ </dt>
+ <dd>
+ This text will be used as the value of the PAGEHEADER
+ variable, for use in templates or the <a href=
+ "#search_results_footer">search_results_footer</a> file,
+ when all search results fit on more than one page.
+ </dd>
+ <dt>
+ <em>example:</em>
+ </dt>
+ <dd>
+ page_list_header:
+ </dd>
+ </dl>
+ </dd>
+ </dl>
+ <hr>
+ <dl>
+ <dt>
         <strong><a name="pdf_parser">
         pdf_parser</a></strong>
       </dt>
@@ -3121,7 +3299,7 @@
             <em>default:</em>
           </dt>
           <dd>
- None
+ *
           </dd>
           <dt>
             <em>description:</em>
@@ -3206,7 +3384,7 @@
             <em>default:</em>
           </dt>
           <dd>
- false
+ true
           </dd>
           <dt>
             <em>description:</em>
@@ -3459,11 +3637,20 @@
                 A string of the search words with spaces in
                 between.
               </dd>
+ <dt>
+ <b>PAGEHEADER</b>
+ </dt>
+ <dd>
+ This expands to either the value of the <a href=
+ "#page_list_header">page_list_header</a> or <a href=
+ "#no_page_list_header">no_page_list_header</a>
+ attribute depending on how many pages there are.
+ </dd>
             </dl>
- Note that this file will <strong>NOT</strong>be output
+ Note that this file will <strong>NOT</strong> be output
             if no matches were found. In this case the <a href=
             "#nothing_found_file">nothing_found_file</a> attribute
- is used in stead.
+ is used instead.
           </dd>
           <dt>
             <em>example:</em>
@@ -3566,10 +3753,10 @@
                 between.
               </dd>
             </dl>
- Note that this file will <strong>NOT</strong>be output
+ Note that this file will <strong>NOT</strong> be output
             if no matches were found. In this case the <a href=
             "#nothing_found_file">nothing_found_file</a> attribute
- is used in stead.
+ is used instead.
           </dd>
           <dt>
             <em>example:</em>
@@ -3604,7 +3791,7 @@
             <em>default:</em>
           </dt>
           <dd>
- none
+ <em>&lt;empty&gt;</em>
           </dd>
           <dt>
             <em>description:</em>
@@ -3648,7 +3835,7 @@
             <em>default:</em>
           </dt>
           <dd>
- -1 (no limit)
+ -1 <em>(no limit)</em>
           </dd>
           <dt>
             <em>description:</em>
@@ -3878,6 +4065,9 @@
           <dt>
             <em>default:</em>
           </dt>
+ <dd>
+ <em>&lt;empty&gt;</em>
+ </dd>
           <dt>
             <em>description:</em>
           </dt>
@@ -3994,7 +4184,7 @@
             <em>default:</em>
           </dt>
           <dd>
- http://www/
+ http://www.htdig.org/
           </dd>
           <dt>
             <em>description:</em>
--- htdoc/cf_byname.html.orig Wed Jan 6 14:28:35 1999
+++ htdoc/cf_byname.html Wed Jan 6 14:36:47 1999
@@ -150,10 +150,15 @@
      <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#max_hop_count">max_hop_count</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=
+ "attrs.html#max_meta_description_length">
+ max_meta_description_length</a><br>
+ <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#max_prefix_matches">max_prefix_matches</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#max_stars">max_stars</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=
+ "attrs.html#maximum_pages">maximum_pages</a><br>
+ <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#meta_description_factor">meta_description_factor</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#metaphone_db">metaphone_db</a><br>
@@ -176,11 +181,15 @@
      <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#no_next_page_text">no_next_page_text</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=
+ "attrs.html#no_page_list_header">no_page_list_header</a><br>
+ <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#no_prev_page_text">no_prev_page_text</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#nothing_found_file">nothing_found_file</a><br>
     </font> <br>
      <b>P</b> <font face="helvetica,arial" size="2"><br>
+ <img src="dot.gif" alt="*"> <a target="body" href=
+ "attrs.html#page_list_header">page_list_header</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#pdf_parser">pdf_parser</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=
--- htdoc/cf_byprog.html.orig Tue Dec 22 19:53:13 1998
+++ htdoc/cf_byprog.html Wed Jan 6 14:44:21 1999
@@ -95,6 +95,9 @@
      <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#max_hop_count">max_hop_count</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=
+ "attrs.html#max_meta_description_length">
+ max_meta_description_length</a><br>
+ <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#meta_description_factor">meta_description_factor</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#minimum_word_length">minimum_word_length</a><br>
@@ -204,6 +207,8 @@
      <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#max_stars">max_stars</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=
+ "attrs.html#maximum_pages">maximum_pages</a><br>
+ <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#method_names">method_names</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#minimum_prefix_length">minimum_prefix_length</a><br>
@@ -218,9 +223,13 @@
      <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#no_next_page_text">no_next_page_text</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=
+ "attrs.html#no_page_list_header">no_page_list_header</a><br>
+ <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#no_prev_page_text">no_prev_page_text</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#nothing_found_file">nothing_found_file</a><br>
+ <img src="dot.gif" alt="*"> <a target="body" href=
+ "attrs.html#page_list_header">page_list_header</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=
     "attrs.html#prefix_match_character">prefix_match_character</a><br>
      <img src="dot.gif" alt="*"> <a target="body" href=

-- 
Gilles R. Detillieux              E-mail: <grdetil@scrc.umanitoba.ca>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930



This archive was generated by hypermail 2.0b3 on Thu Jan 07 1999 - 07:52:40 PST