Re: [htdig] Re: the mysterious "Deleted, no excerpt" problem


Subject: Re: [htdig] Re: the mysterious "Deleted, no excerpt" problem
From: Patrick Robinson (pgr@ramandu.ext.vt.edu)
Date: Tue May 23 2000 - 12:38:53 PDT


[regarding null chars in HTML files...]

Geoff wrote:
> Out of curiousity, how does it show in browsers? This may be one of those
> "real world" issues where the HTML parser just needs to be more liberal.

In Netscape 4.72 on WinNT:
Null bytes at the beginnings of documents seem to be ignored. But a
null byte in the middle of a line of text causes the rest of the line
(up to the next newline) to be truncated.

In IE5/WinNT:
Null bytes at the beginning ignored. Null bytes in the middle of lines
are displayed as little white boxes. Or, rather, a series of them are
treated as a unit, and displayed as a single little white box.

OmniWeb 3.0.2 on Mac OS X Server 1.2:
Ignores them at the beginning; treats series of them, midline, as whitespace
(i.e. looks like same amount of space as between any other two words).

Now --- here's what caused all this: The folks that created these files
were using BBEdit 5.0. As it turns out, there's a bug with this version.
According to the folks at Bare Bones:

| There is a memory bug in BEBdit 5.0 which can cause this sort of random
| corruption. Updating to the current release (5.1.1) will fix this. A free
| updater to 5.1.1 is available at
|
| ftp://ftp.barebones.com/pub/updaters/bbedit_5.0/

---
Patrick Robinson
AHNR Info Technology, Virginia Tech
pgr@ramandu.ext.vt.edu

------------------------------------ To unsubscribe from the htdig mailing list, send a message to htdig-unsubscribe@htdig.org You will receive a message to confirm this.



This archive was generated by hypermail 2b28 : Tue May 23 2000 - 10:26:18 PDT