Remove trailing whitespace.

Signed-off-by: Edward Z. Yang <edwardzyang@thewritingpot.com>
2025-10-14 21:54:24 +02:00 · 2008-12-06 02:28:20 -05:00
parent 3a6b63dff1
commit 2c955af135
476 changed files with 5595 additions and 5547 deletions
--- a/docs/dev-advanced-api.html
+++ b/docs/dev-advanced-api.html
@@ -164,7 +164,7 @@ to make things simpler. It can be a literal object or:</p>
        class with that function registered as a callback.</li>-->
    <li>String attribute type: We'll use <code>HTMLPurifier_AttrTypes</code>
        to resolve it for you. Any data that follows a hash mark (#) will
-        be used to customize the attribute type: in the example above, 
+        be used to customize the attribute type: in the example above,
        we specify which values for Enum to allow.</li>
 </ul>

--- a/docs/dev-config-schema.html
+++ b/docs/dev-config-schema.html
@@ -9,29 +9,29 @@
    <title>Config Schema - HTML Purifier</title>
  </head>
  <body>
-    
+
    <h1>Config Schema</h1>
-    
+
    <div id="filing">Filed under Development</div>
    <div id="index">Return to the <a href="index.html">index</a>.</div>
    <div id="home"><a href="http://htmlpurifier.org/">HTML Purifier</a> End-User Documentation</div>
-    
+
    <p>
      HTML Purifier has a fairly complex system for configuration. Users
      interact with a <code>HTMLPurifier_Config</code> object to
      set configuration directives. The values they set are validated according
      to a configuration schema, <code>HTMLPurifier_ConfigSchema</code>.
    </p>
-    
+
    <p>
      The schema is mostly transparent to end-users, but if you're doing development
      work for HTML Purifier and need to define a new configuration directive,
      you'll need to interact with it. We'll also talk about how to define
      userspace configuration directives at the very end.
    </p>
-    
+
    <h2>Write a directive file</h2>
-    
+
    <p>
      Directive files define configuration directives to be used by
      HTML Purifier. They are placed in <code>library/HTMLPurifier/ConfigSchema/schema/</code>
@@ -42,7 +42,7 @@
      <a href="http://qa.php.net/write-test.php">PHPT</a> tests. Here's a
      sample directive file, <code>Test.Sample.txt</code>:
    </p>
-    
+
    <pre>Test.Sample
 TYPE: string/null
 DEFAULT: NULL
@@ -54,11 +54,11 @@ This is a sample configuration directive for the purposes of the
 &lt;code&gt;dev-config-schema.html&lt;code&gt; documentation.
 --ALIASES--
 Test.Example</pre>
-    
+
    <p>
      Each of these segments has a specific meaning:
    </p>
-    
+
    <table class="table">
      <thead>
        <tr>
@@ -143,11 +143,11 @@ Test.Example</pre>
        </tr>
      </tbody>
    </table>
-    
+
    <p>
      Some notes on format and style:
    </p>
-    
+
    <ul>
      <li>
        Each of these keys can be expressed in the short format
@@ -162,11 +162,11 @@ Test.Example</pre>
        not rely on editor word-wrapping.
      </li>
    </ul>
-    
+
    <p>
      Also, as promised, here is the set of possible types:
    </p>
-    
+
    <table class="table">
      <thead>
        <tr>
@@ -233,7 +233,7 @@ Test.Example</pre>
        </tr>
      </tbody>
    </table>
-    
+
    <p>
      The examples represent what will be returned out of the configuration
      object; users have a little bit of leeway when setting configuration
@@ -242,7 +242,7 @@ Test.Example</pre>
      in <a href="http://repo.or.cz/w/htmlpurifier.git?a=blob;hb=HEAD;f=library/HTMLPurifier/VarParser.php">
      library/HTMLPurifier/VarParser.php</a>.
    </p>
-    
+
    <p>
      For more information on what values are allowed, and how they are parsed,
      consult <a href="http://repo.or.cz/w/htmlpurifier.git?a=blob;hb=HEAD;f=library/HTMLPurifier/ConfigSchema/InterchangeBuilder.php">
@@ -251,9 +251,9 @@ Test.Example</pre>
      library/HTMLPurifier/ConfigSchema/Interchange/Directive.php</a> for
      the semantics of the parsed values.
    </p>
-    
+
    <h2>Refreshing the cache</h2>
-    
+
    <p>
      You may have noticed that your directive file isn't doing anything
      yet. That's because it hasn't been added to the runtime
@@ -262,14 +262,14 @@ Test.Example</pre>
      If there were no errors, you're good to go! Don't forget to add
      some unit tests for your functionality!
    </p>
-    
+
    <p>
      If you ever make changes to your configuration directives, you
      will need to run this script again.
    </p>
-    
+
    <h2>Errors</h2>
-    
+
    <p>
      All directive files go through a rigorous validation process
      through <a href="http://repo.or.cz/w/htmlpurifier.git?a=blob;hb=HEAD;f=library/HTMLPurifier/ConfigSchema/Validator.php">
@@ -279,16 +279,16 @@ Test.Example</pre>
      can give some general tips for interpreting error messages.
      There are two types of errors: builder errors and validation errors.
    </p>
-    
+
    <h3>Builder errors</h3>
-    
+
    <blockquote>
      <p>
        <strong>Exception:</strong> Expected type string, got
        integer in DEFAULT in directive hash 'Ns.Dir'
      </p>
    </blockquote>
-    
+
    <p>
      You can identify a builder error by the keyword "directive hash."
      These are the easiest to deal with, because they directly correspond
@@ -298,28 +298,28 @@ Test.Example</pre>
      This particular error would occur if your default value is not the same
      type as TYPE.
    </p>
-    
+
    <h3>Validation errors</h3>
-    
+
    <blockquote>
      <p>
        <strong>Exception:</strong> Alias 3 in valueAliases in directive
        'Ns.Dir' must be a string
      </p>
    </blockquote>
-    
+
    <p>
      These are a little trickier, because we're not actually validating
      your directive file, or even the direct string hash representation.
      We're validating an Interchange object, and the error messages do
      not mention any string hash keys.
    </p>
-    
+
    <p>
      Nevertheless, it's not difficult to figure out what went wrong.
      Read the "context" statements in reverse:
    </p>
-    
+
    <dl>
      <dt>in directive 'Ns.Dir'</dt>
        <dd>This means we need to look at the directive file <code>Ns.Dir.txt</code></dd>
@@ -329,12 +329,12 @@ Test.Example</pre>
      <dt>Alias 3</dt>
        <dd>The value alias that is equal to 3 is the culprit.</dd>
    </dl>
-    
+
    <p>
      In this particular case, you're not allowed to alias integers values to
      strings values.
    </p>
-    
+
    <p>
      The most difficult part is translating the Interchange member variable (valueAliases)
      into a directive file key (VALUE-ALIASES), but there's a one-to-one
@@ -342,9 +342,9 @@ Test.Example</pre>
      will be described in <a href="http://repo.or.cz/w/htmlpurifier.git?a=blob;hb=HEAD;f=library/HTMLPurifier/ConfigSchema/InterchangeBuilder.php">
      library/HTMLPurifier/ConfigSchema/InterchangeBuilder.php</a>.
    </p>
-    
+
    <h2>Internals</h2>
-    
+
    <p>
      Much of the configuration schema framework's codebase deals with
      shuffling data from one format to another, and doing validation on this
@@ -352,7 +352,7 @@ Test.Example</pre>
      The keystone of all of this is the <code>HTMLPurifier_ConfigSchema_Interchange</code>
      class, which represents the purest, parsed representation of the schema.
    </p>
-    
+
    <p>
      Hand-writing this data is unwieldy, however, so we write directive files.
      These directive files are parsed by <code>HTMLPurifier_StringHashParser</code>
@@ -360,7 +360,7 @@ Test.Example</pre>
      are run through <code>HTMLPurifier_ConfigSchema_InterchangeBuilder</code>
      to construct the interchange object.
    </p>
-    
+
    <p>
      From the interchange object, the data can be siphoned into other forms
      using <code>HTMLPurifier_ConfigSchema_Builder</code> subclasses.
--- a/docs/dev-includes.txt
+++ b/docs/dev-includes.txt
@@ -91,14 +91,14 @@ use the stub for all cases (which might not be a bad idea).
    Deprecated
    ----------
    One of the things I'd like to do is have the code search for any classes
-    that are explicitly mentioned in the code. If a class isn't mentioned, I 
+    that are explicitly mentioned in the code. If a class isn't mentioned, I
    get to assume that it is "optional," i.e. included via introspection.
    The choice is either to use PHP's tokenizer or use regexps; regexps would
    be faster but a tokenizer would be more correct. If this ends up being
    unfeasible, adding dependency comments isn't a bad idea. (This could
    even be done automatically by search/replacing require_once, although
    we'd have to manually inspect the results for the optional requires.)
-    
+
    NOTE: This ends up not being necessary, as we're going to make the user
    figure out all the extra classes they need, and only include the core
    which is predetermined.
@@ -193,7 +193,7 @@ configuration directives would be used to generate our key (meta-directives!)
    mechanism works better. However, we can uniquely identify the
    schema based on the directories they loaded, so there's no need
    for a DefinitionId until we give them full programmatic control.
-    
+
    These variables should be directly incorporated into ConfigSchema,
    and ConfigSchema should handle serialization. Some refactoring will be
    necessary for the DefinitionCache classes, as they are built with
--- a/docs/dev-naming.html
+++ b/docs/dev-naming.html
@@ -35,7 +35,7 @@ help you find the correct functionality more quickly.  Here they are:</p>
 <dt>Harness and Test are reserved class names for unit tests</dt>
    <dd>The suffix <code>Test</code> indicates that the class is a subclass of UnitTestCase
    (of the Simpletest library) and is testable. "Harness" indicates a subclass
-    of UnitTestCase that is not meant to be run but to be extended into 
+    of UnitTestCase that is not meant to be run but to be extended into
    concrete test cases and contains custom test methods (i.e. assert*())</dd>

 <dt>Class names do not necessarily represent inheritance hierarchies</dt>
@@ -51,7 +51,7 @@ help you find the correct functionality more quickly.  Here they are:</p>
    all must be present in order for proper functioning.</dd>

 <dt>Abbreviations are avoided</dt>
-    <dd>We try to avoid abbreviations as much as possible, but in some cases, 
+    <dd>We try to avoid abbreviations as much as possible, but in some cases,
    abbreviated version is more readable than the full version. Here, we
    list common abbreviations:
    <ul>
--- a/docs/enduser-customize.html
+++ b/docs/enduser-customize.html
@@ -206,7 +206,7 @@ $def = $config->getHTMLDefinition(true);</pre>
 <h2>Turn off caching</h2>

 <p>
-  To make development easier, we're going to temporarily turn off 
+  To make development easier, we're going to temporarily turn off
  definition caching:
 </p>

@@ -401,8 +401,8 @@ $def = $config->getHTMLDefinition(true);

 <p>
  Adding attributes is really small-fry stuff, though, and it was possible
-  to add them (albeit a bit more wordy) prior to 2.0. The real gem of 
-  the Advanced API is adding elements. There are five questions to 
+  to add them (albeit a bit more wordy) prior to 2.0. The real gem of
+  the Advanced API is adding elements. There are five questions to
  ask when adding a new element:
 </p>

@@ -493,9 +493,9 @@ $def = $config->getHTMLDefinition(true);
 <p>
  The <code>(%flow;)*</code> indicates the allowed children of the
  <code>li</code> tag: <code>li</code> allows any number of flow
-  elements as its children. (The <code>- O</code> allows the closing tag to be 
-  omitted, though in XML this is not allowed.) In HTML Purifier, 
-  we'd write it like <code>Flow</code> (here's where the content sets 
+  elements as its children. (The <code>- O</code> allows the closing tag to be
+  omitted, though in XML this is not allowed.) In HTML Purifier,
+  we'd write it like <code>Flow</code> (here's where the content sets
  we were discussing earlier come into play). There are three shorthand
  content models you can specify:
 </p>
@@ -596,7 +596,7 @@ $def = $config->getHTMLDefinition(true);
  be nuked.  This is why there is are specific content model types like
  Optional and Required: while they could be implemented as <code>Custom:
  (valid | elements)*</code>, the custom classes contain special recovery
-  measures that make sure as much of the user's original content gets 
+  measures that make sure as much of the user's original content gets
  through. HTML Purifier's core, as a rule, does not use Custom.
 </p>

@@ -676,7 +676,7 @@ $def = $config->getHTMLDefinition(true);
  href="http://www.w3.org/TR/xhtml-modularization/abstract_modules.html#s_commonatts">abstract
  modules of the XHTML Modularization 1.1</a>. We believe this section
  to be in error, as <code>br</code> permits the use of the <code>style</code>
-  attribute even though it uses the <code>Core</code> collection, and 
+  attribute even though it uses the <code>Core</code> collection, and
  the DTD and XML Schemas supplied by W3C support our interpretation.
 </p>

--- a/docs/enduser-slow.html
+++ b/docs/enduser-slow.html
@@ -17,25 +17,25 @@
 <div id="index">Return to the <a href="index.html">index</a>.</div>
 <div id="home"><a href="http://htmlpurifier.org/">HTML Purifier</a> End-User Documentation</div>

-<p>HTML Purifier is a very powerful library. But with power comes great 
-responsibility, in the form of longer execution times.  Remember, this 
-library isn't lightly grazing over submitted HTML: it's deconstructing 
-the whole thing, rigorously checking the parts, and then putting it back 
+<p>HTML Purifier is a very powerful library. But with power comes great
+responsibility, in the form of longer execution times.  Remember, this
+library isn't lightly grazing over submitted HTML: it's deconstructing
+the whole thing, rigorously checking the parts, and then putting it back
 together. </p>

-<p>So, if it so turns out that HTML Purifier is kinda too slow for outbound 
+<p>So, if it so turns out that HTML Purifier is kinda too slow for outbound
 filtering, you've got a few options: </p>

 <h2>Inbound filtering</h2>

-<p>Perform filtering of HTML when it's submitted by the user. Since the 
-user is already submitting something, an extra half a second tacked on 
-to the load time probably isn't going to be that huge of a problem.  
-Then, displaying the content is a simple a manner of outputting it 
-directly from your database/filesystem. The trouble with this method is 
-that your user loses the original text, and when doing edits, will be 
-handling the filtered text.  While this may be a good thing, especially 
-if you're using a WYSIWYG editor, it can also result in data-loss if a 
+<p>Perform filtering of HTML when it's submitted by the user. Since the
+user is already submitting something, an extra half a second tacked on
+to the load time probably isn't going to be that huge of a problem.
+Then, displaying the content is a simple a manner of outputting it
+directly from your database/filesystem. The trouble with this method is
+that your user loses the original text, and when doing edits, will be
+handling the filtered text.  While this may be a good thing, especially
+if you're using a WYSIWYG editor, it can also result in data-loss if a
 user makes a typo. </p>

 <p>Example (non-functional):</p>
@@ -66,14 +66,14 @@ user makes a typo. </p>

 <h2>Caching the filtered output</h2>

-<p>Accept the submitted text and put it unaltered into the database, but 
-then also generate a filtered version and stash that in the database.  
-Serve the filtered version to readers, and the unaltered version to 
-editors.  If need be, you can invalidate the cache and have the cached 
-filtered version be regenerated on the first page view.  Pros? Full data 
-retention. Cons? It's more complicated, and opens other editors up to 
-XSS if they are using a WYSIWYG editor (to fix that, they'd have to be 
-able to get their hands on the *really* original text served in 
+<p>Accept the submitted text and put it unaltered into the database, but
+then also generate a filtered version and stash that in the database.
+Serve the filtered version to readers, and the unaltered version to
+editors.  If need be, you can invalidate the cache and have the cached
+filtered version be regenerated on the first page view.  Pros? Full data
+retention. Cons? It's more complicated, and opens other editors up to
+XSS if they are using a WYSIWYG editor (to fix that, they'd have to be
+able to get their hands on the *really* original text served in
 plaintext mode). </p>

 <p>Example (non-functional):</p>
@@ -108,9 +108,9 @@ plaintext mode). </p>
 <p>In short, inbound filtering is the simple option and caching is the
 robust option (albeit with bigger storage requirements). </p>

-<p>There is a third option, independent of the two we've discussed: profile 
-and optimize HTMLPurifier yourself. Be sure to report back your results 
-if you decide to do that! Especially if you port HTML Purifier to C++. 
+<p>There is a third option, independent of the two we've discussed: profile
+and optimize HTMLPurifier yourself. Be sure to report back your results
+if you decide to do that! Especially if you port HTML Purifier to C++.
 <tt>;-)</tt></p>

 </body>
--- a/docs/enduser-tidy.html
+++ b/docs/enduser-tidy.html
@@ -50,7 +50,7 @@ not need Tidy installed on your PHP to use these features!</strong></p>

 <h2>What are levels?</h2>

-<p>Levels describe how aggressive the Tidy module should be when 
+<p>Levels describe how aggressive the Tidy module should be when
 cleaning up HTML. There are four levels to pick: none, light, medium
 and heavy. Each of these levels has a well-defined set of behavior
 associated with it, although it may change depending on your doctype.</p>
--- a/docs/enduser-uri-filter.html
+++ b/docs/enduser-uri-filter.html
@@ -92,7 +92,7 @@
 </tbody></table>

 <p>
-  Because the URI is presented to us in this form, and not 
+  Because the URI is presented to us in this form, and not
  <code>http://bob@example.com:8080/foo.php?q=string#hash</code>, it saves us
  a lot of trouble in having to parse the URI every time we want to filter
  it. For the record, the above URI has the following components:
--- a/docs/enduser-utf8.html
+++ b/docs/enduser-utf8.html
@@ -106,7 +106,7 @@ there are now many character encodings floating around.</p>
    interpret raw zeroes and ones into real characters. It
    usually does this by pairing numbers with characters.</p>
    <p>There are many different types of character encodings floating
-    around, but the ones we deal most frequently with are ASCII, 
+    around, but the ones we deal most frequently with are ASCII,
    8-bit encodings, and Unicode-based encodings.</p>
    <ul>
        <li><strong>ASCII</strong> is a 7-bit encoding based on the
@@ -306,7 +306,7 @@ languages</a>. The appropriate code is:</p>

 <p>...replacing UTF-8 with whatever your embedded encoding is.
 This code must come before any output, so be careful about
-stray whitespace in your application (i.e., any whitespace before 
+stray whitespace in your application (i.e., any whitespace before
 output excluding whitespace within &lt;?php ?&gt; tags).</p>

 <h4 id="fixcharset-server-phpini">PHP ini directive</h4>
@@ -366,9 +366,9 @@ to send anything at all:</p>
 <pre><a href="http://httpd.apache.org/docs/1.3/mod/core.html#adddefaultcharset">AddDefaultCharset</a> Off</pre>

 <p>...making your internal charset declaration (usually the <code>META</code> tags)
-the sole source of character encoding 
-information. In these cases, it is <em>especially</em> important to make 
-sure you have valid <code>META</code> tags on your pages and all the 
+the sole source of character encoding
+information. In these cases, it is <em>especially</em> important to make
+sure you have valid <code>META</code> tags on your pages and all the
 text before them is ASCII.</p>

 <blockquote class="aside"><p>These directives can also be
@@ -443,9 +443,9 @@ Declarations. They look like:</p>
 <p>For XHTML, this XML Declaration theoretically
 overrides the <code>META</code> tag. In reality, this happens only when the
 XHTML is actually served as legit XML and not HTML, which is almost always
-never due to Internet Explorer's lack of support for 
+never due to Internet Explorer's lack of support for
 <code>application/xhtml+xml</code> (even though doing so is often
-argued to be <a href="http://www.hixie.ch/advocacy/xhtml">good 
+argued to be <a href="http://www.hixie.ch/advocacy/xhtml">good
 practice</a> and is required by the XHTML 1.1 specification).</p>

 <p>For XML, however, this XML Declaration is extremely important.
@@ -456,7 +456,7 @@ ISO-8859-1 encoding (you see this in garbled RSS feeds).</p>

 <p>In short, if you use XHTML and have gone through the
 trouble of adding the XML Declaration, make sure it jives
-with your <code>META</code> tags (which should only be present 
+with your <code>META</code> tags (which should only be present
 if served in text/html) and HTTP headers.</p>

 <h3 id="fixcharset-internals">Inside the process</h3>
@@ -489,7 +489,7 @@ it gets to the Content-Type tag, extract the character encoding
 tag, then re-parse the document according to this new information.</p>

 <p>Obviously this is complicated, so browsers prefer the simpler
-and more efficient solution: get the character encoding from a 
+and more efficient solution: get the character encoding from a
 somewhere other than the document itself, i.e. the HTTP headers,
 much to the chagrin of HTML authors who can't set these headers.</p>

@@ -578,7 +578,7 @@ files.</p>
 <a href="http://web.archive.org/web/20060427015200/ppewww.ph.gla.ac.uk/~flavell/charset/form-i18n.html">
 <code>FORM</code> submission and i18n</a>. That document contains lots
 of useful information, but is written in a rambly manner, so
-here I try to get right to the point. (Note: the original has 
+here I try to get right to the point. (Note: the original has
 disappeared off the web, so I am linking to the Web Archive copy.)</p>

 <h4 id="whyutf8-forms-urlencoded"><code>application/x-www-form-urlencoded</code></h4>
@@ -886,7 +886,7 @@ sure the page is saved WITHOUT the BOM.</p>
 </blockquote>

 <p>If you are reading in text files to insert into the middle of another
-page, it is strongly advised (but not strictly necessary) that you replace out the UTF-8 byte 
+page, it is strongly advised (but not strictly necessary) that you replace out the UTF-8 byte
 sequence for BOM <code>&quot;\xEF\xBB\xBF&quot;</code> before inserting it in,
 via:</p>

@@ -949,7 +949,7 @@ to known good Unicode fonts.</p>
 heavy lifting for you. Get the CSS from the horses mouth here:
 <a href="http://en.wikipedia.org/wiki/MediaWiki:Common.css">Common.css</a>,
 and search for &quot;.IPA&quot; There are also a smattering of
-other classes you can use for other purposes, check out 
+other classes you can use for other purposes, check out
 <a href="http://meta.wikimedia.org/wiki/Help:Special_characters#Displaying_Special_Characters">this page</a>
 for more details. For you lazy ones, this should work:</p>

--- a/docs/proposal-errors.txt
+++ b/docs/proposal-errors.txt
@@ -2,7 +2,7 @@ Considerations for ErrorCollection

 Presently, HTML Purifier takes a code-execution centric approach to handling
 errors. Errors are organized and grouped according to which segment of the
-code triggers them, not necessarily the portion of the input document that 
+code triggers them, not necessarily the portion of the input document that
 triggered the error. This means that errors are pseudo-sorted by category,
 rather than location in the document.

@@ -13,7 +13,7 @@ can report errors still process the document mostly linearly. Furthermore,
 not only do they process linearly, but the way they pass off operations to
 sub-systems mirrors that of the document. For example, AttrValidator will
 linearly proceed through elements, and on each element will use AttrDef to
-validate those contents. From there, the attribute might have more 
+validate those contents. From there, the attribute might have more
 sub-components, which have execution passed off accordingly.

 In fact, each strategy handles a very specific class of "error."
@@ -56,7 +56,7 @@ set it as a document-wide error. And actually, nothing needs to be done here.
 Something interesting to consider is whether or not we care about the locations
 of attributes and CSS properties, i.e. the sub-objects that compose these things.
 In terms of consistency, at the very least attributes should have column/line
-numbers attached to them. However, this may be overkill, as attributes are 
+numbers attached to them. However, this may be overkill, as attributes are
 uniquely identifiable. You could go even further, with CSS, but they are also
 uniquely identifiable.

@@ -80,12 +80,12 @@ cases).
    4. Setup ErrorCollector to use context information to setup hierarchies.
       This may require a different internal format. Use objects if it gets
       complex. [DONE]
-       
+
       ASIDE
            More on this topic: since we are now binding errors to lines
            and columns, a particular error can have three relationships to that
            specific location:
-            
+
            1. The token at that location directly
                RemoveForeignElements
                AttrValidator (transforms)
@@ -95,50 +95,50 @@ cases).
            3. A modification to that node (i.e. contents from start to end
               token) as a whole
                FixNesting
-            
+
            This needs to be marked accordingly. In the presentation, it might
            make sense keep (3) separate, have (2) a sublist of (1). (1) can
            be a closing tag, in which case (3) makes no sense at all, OR it
            should be related with its opening tag (this may not necessarily
            be possible before MakeWellFormed is run).
-            
+
            So, the line and column counts as our identifier, so:
-            
+
            $errors[$line][$col] = ...
-            
+
            Then, we need to identify case 1, 2 or 3. They are identified as
            such:
-            
+
            1. Need some sort of semaphore in RemoveForeignElements, etc.
            2. If CurrentAttr/CurrentCssProperty is non-null
            3. Default (FixNesting, MakeWellFormed)
-            
+
            One consideration about (1) is that it usually is actually a
            (3) modification, but we have no way of knowing about that because
            of various optimizations. However, they can probably be treated
            the same. The other difficulty is that (3) is never a line and
            column; rather, it is a range (i.e. a duple) and telling the user
            the very start of the range may confuse them. For example,
-            
+
            <b>Foo<div>bar</div></b>
            ^     ^
-            
+
            The node being operated on is <b>, so the error would be assigned
            to the first caret, with a "node reorganized" error. Then, the
            ChildDef would have submitted its own suggestions and errors with
            regard to what's going in the internals.  So I suppose this is
            ok. :-)
-            
+
            Now, the structure of the earlier mentioned ... would be something
            like this:
-            
+
            object {
                type = (token|attr|property),
                value, // appropriate for type
                errors => array(),
                sub-errors = [recursive],
            }
-            
+
            This helps us keep things agnostic. It is also sufficiently complex
            enough to warrant an object.

@@ -173,7 +173,7 @@ Then we setup suggestions.

    5. Setup a separate error class which tells the user any modifications
       HTML Purifier made.
-       
+
 Some information about this:

 Our current paradigm is to tell the user what HTML Purifier did to the HTML.
@@ -187,7 +187,7 @@ the correct version isn't a bad idea, but problems arise when:
 - The user has such bad HTML we do something odd, when we should have just
  flagged the HTML as an error. Such examples are when we do things like
  remove text from directly inside a <table> tag. It was probably meant to
-  be in a <td> tag or be outside the table, but we're not smart enough to 
+  be in a <td> tag or be outside the table, but we're not smart enough to
  realize this so we just remove it. In such a case, we should tell the user
  that there was foreign data in the table, but then we shouldn't "demand"
  the user remove the data; it's more of a "here's a possible way of
@@ -204,6 +204,6 @@ Don't forget to spruce up output.

    6. Output needs to automatically give line and column numbers, basically
       "at line" on steroids. Look at W3C's output; it's ok. [PARTIALLY DONE]
-       
+
       - We need a standard CSS to apply (check demo.css for some starting
         styling; some buttons would also be hip)
--- a/docs/proposal-new-directives.txt
+++ b/docs/proposal-new-directives.txt
@@ -16,7 +16,7 @@ implemented, give us a ring, and we'll move it up the priority chain.
    %Attr.ClassBlacklist. When it's Whitelist, only allow those in
    %Attr.ClassWhitelist.

-%Attr.MaxWidth, 
+%Attr.MaxWidth,
 %Attr.MaxHeight - caps for width and height related checks.
    (the hack in Pixels for an image crashing attack could be replaced by this)

--- a/docs/ref-css-length.txt
+++ b/docs/ref-css-length.txt
@@ -22,7 +22,7 @@ Relative:
    1 ex ~= 0.5 em, though Mozilla Firefox says 1 ex = 6px
    1 px ~= 1 pt

-Watch out: font-sizes can also be nested to get successively larger 
+Watch out: font-sizes can also be nested to get successively larger
 (although I do not relish having to keep track of context font-sizes,
 this may be necessary, especially for some of the more advanced features
 for preventing things like white on white).
--- a/docs/ref-html-modularization.txt
+++ b/docs/ref-html-modularization.txt
@@ -121,7 +121,7 @@ a proprietary system called ChildDef for performance and flexibility
 reasons, but this does not line up very well with W3C's notion of
 regexps for defining the allowed children of an element.

-HTMLPurifier->elements[$element]->content_model and 
+HTMLPurifier->elements[$element]->content_model and
 HTMLPurifier->elements[$element]->content_model_type store information
 about the final ChildDef that will be stored in
 HTMLPurifier->elements[$element]->child (we use a different variable
--- a/docs/ref-whatwg.txt
+++ b/docs/ref-whatwg.txt
@@ -18,7 +18,7 @@ committing ourselves till the spec stabilizes, though.

 More immediately speaking though, however, is the well-defined parsing
 behavior that HTML 5 adds. While I have little interest in writing
-another DirectLex parser, other parsers like ph5p 
+another DirectLex parser, other parsers like ph5p
 <http://jero.net/lab/ph5p/> can be adapted to DOMLex to support much more
 flexible HTML parsing (a cool feature I've seen is how they resolve
 <b>bold<i>both</b>italic</i>).
--- a/docs/specimens/windows-live-mail-desktop-beta.html
+++ b/docs/specimens/windows-live-mail-desktop-beta.html
@@ -4,71 +4,71 @@
 <STYLE></STYLE>

 <META content="MSHTML 6.00.6000.16414" name=GENERATOR></HEAD>
-<BODY id=MailContainerBody 
-style="PADDING-RIGHT: 10px; PADDING-LEFT: 10px; FONT-SIZE: 10pt; COLOR: #000000; PADDING-TOP: 15px; FONT-FAMILY: Arial" 
-bgColor=#ff6600 leftMargin=0 background="" topMargin=0 
+<BODY id=MailContainerBody
+style="PADDING-RIGHT: 10px; PADDING-LEFT: 10px; FONT-SIZE: 10pt; COLOR: #000000; PADDING-TOP: 15px; FONT-FAMILY: Arial"
+bgColor=#ff6600 leftMargin=0 background="" topMargin=0
 name="Compose message area" acc_role="text" CanvasTabStop="false">
-<DIV 
-style="BORDER-TOP: #dddddd 1px solid; FONT-SIZE: 10pt; WIDTH: 100%; MARGIN-RIGHT: 10px; PADDING-TOP: 5px; BORDER-BOTTOM: #dddddd 1px solid; FONT-FAMILY: Verdana; HEIGHT: 25px; BACKGROUND-COLOR: #ffffff"><NOBR><SPAN 
-title="View a slideshow of the pictures in this e-mail message." 
-style="PADDING-RIGHT: 20px"><A style="COLOR: #0088e4" 
-href="http://g.msn.com/5meen_us/171?path=/photomail/{6fc0065f-ffdd-4ca6-9a4c-cc5a93dc122f}&amp;image=47D7B182CFEFB10!127&amp;imagehi=47D7B182CFEFB10!125&amp;CID=323550092004883216">Play 
-slideshow </A></SPAN><SPAN style="COLOR: #909090"><SPAN>|</SPAN><SPAN 
-style="PADDING-LEFT: 20px"> Download the highest quality version of a picture by 
+<DIV
+style="BORDER-TOP: #dddddd 1px solid; FONT-SIZE: 10pt; WIDTH: 100%; MARGIN-RIGHT: 10px; PADDING-TOP: 5px; BORDER-BOTTOM: #dddddd 1px solid; FONT-FAMILY: Verdana; HEIGHT: 25px; BACKGROUND-COLOR: #ffffff"><NOBR><SPAN
+title="View a slideshow of the pictures in this e-mail message."
+style="PADDING-RIGHT: 20px"><A style="COLOR: #0088e4"
+href="http://g.msn.com/5meen_us/171?path=/photomail/{6fc0065f-ffdd-4ca6-9a4c-cc5a93dc122f}&amp;image=47D7B182CFEFB10!127&amp;imagehi=47D7B182CFEFB10!125&amp;CID=323550092004883216">Play
+slideshow </A></SPAN><SPAN style="COLOR: #909090"><SPAN>|</SPAN><SPAN
+style="PADDING-LEFT: 20px"> Download the highest quality version of a picture by
 clicking the + above it </SPAN></SPAN></NOBR></DIV>
-<DIV 
+<DIV
 style="PADDING-RIGHT: 5px; PADDING-LEFT: 7px; PADDING-BOTTOM: 2px; WIDTH: 100%; PADDING-TOP: 2px">
 <OL>
-  <LI><IMG title="Angry smile emoticon" 
-  style="FLOAT: none; MARGIN: 0px; POSITION: static" tabIndex=-1 
-  alt="Angry smile emoticon" src="cid:49F0C856199E4D688D2D740680733D74@wc" 
-  MSNNonUserImageOrEmoticon="true">Un ka <FONT style="BACKGROUND-COLOR: #800000" 
-  color=#cc99ff><STRONG>Tev</STRONG></FONT> iet, un ko tu dari? 
+  <LI><IMG title="Angry smile emoticon"
+  style="FLOAT: none; MARGIN: 0px; POSITION: static" tabIndex=-1
+  alt="Angry smile emoticon" src="cid:49F0C856199E4D688D2D740680733D74@wc"
+  MSNNonUserImageOrEmoticon="true">Un ka <FONT style="BACKGROUND-COLOR: #800000"
+  color=#cc99ff><STRONG>Tev</STRONG></FONT> iet, un ko tu dari?
  <LI>Aha!</LI></OL>

 <UL>
-  <LI>Buletets 
+  <LI>Buletets
  <LI>
-  <DIV align=justify><A title=http://laacz.lv/blog/ 
-  href="http://laacz.lv/blog/">http://laacz.lv/blog/</A> un <A 
+  <DIV align=justify><A title=http://laacz.lv/blog/
+  href="http://laacz.lv/blog/">http://laacz.lv/blog/</A> un <A
  title=http://google.com/ href="http://google.com/">gugle</A></DIV>
-  <LI>Sarakstucitis</LI></UL></DIV><SPAN><SPAN xmlns:canvas="canvas-namespace-id" 
-layoutEmptyTextWellFont="Tahoma"><SPAN 
-style="MARGIN-BOTTOM: 15px; OVERFLOW: visible; HEIGHT: 16px"></SPAN><SPAN 
+  <LI>Sarakstucitis</LI></UL></DIV><SPAN><SPAN xmlns:canvas="canvas-namespace-id"
+layoutEmptyTextWellFont="Tahoma"><SPAN
+style="MARGIN-BOTTOM: 15px; OVERFLOW: visible; HEIGHT: 16px"></SPAN><SPAN
 style="MARGIN-BOTTOM: 25px; VERTICAL-ALIGN: top; OVERFLOW: visible; MARGIN-RIGHT: 25px; HEIGHT: 234px">
 <TABLE style="DISPLAY: inline">
  <TBODY>
  <TR>

    <TD>
-      <DIV 
-      style="FONT-WEIGHT: bold; FONT-SIZE: 12pt; FONT-FAMILY: arial; TEXT-ALIGN: center"><A 
-      id=HiresARef 
-      title="Click here to view or download a high resolution version of this picture" 
-      style="COLOR: #0088e4; TEXT-DECORATION: none" 
+      <DIV
+      style="FONT-WEIGHT: bold; FONT-SIZE: 12pt; FONT-FAMILY: arial; TEXT-ALIGN: center"><A
+      id=HiresARef
+      title="Click here to view or download a high resolution version of this picture"
+      style="COLOR: #0088e4; TEXT-DECORATION: none"
      href="http://byfiles.storage.msn.com/x1pMvt0I80jTgT6DuaCpEMbprX3nk3jNv_vjigxV_EYVSMyM_PKgEvDEUtuNhQC-F-23mTTcKyqx6eGaeK2e_wMJ0ikwpDdFntk4SY7pfJUv2g2Ck6R2S2vAA?download">+</A></DIV>
-      <DIV 
-      title="Click here to view the full image using the online photo viewer." 
-      style="DISPLAY: inline; OVERFLOW: hidden; WIDTH: 140px; HEIGHT: 140px"><A 
-      href="http://g.msn.com/5meen_us/171?path=/photomail/{6fc0065f-ffdd-4ca6-9a4c-cc5a93dc122f}&amp;image=47D7B182CFEFB10!127&amp;imagehi=47D7B182CFEFB10!125&amp;CID=323550092004883216" 
-      border="0"><IMG 
-      style="MARGIN-TOP: 15px; DISPLAY: inline-block; MARGIN-LEFT: 0px" 
-      height=109 src="cid:006A71303B80404E9FB6184E55D6A446@wc" width=140 
+      <DIV
+      title="Click here to view the full image using the online photo viewer."
+      style="DISPLAY: inline; OVERFLOW: hidden; WIDTH: 140px; HEIGHT: 140px"><A
+      href="http://g.msn.com/5meen_us/171?path=/photomail/{6fc0065f-ffdd-4ca6-9a4c-cc5a93dc122f}&amp;image=47D7B182CFEFB10!127&amp;imagehi=47D7B182CFEFB10!125&amp;CID=323550092004883216"
+      border="0"><IMG
+      style="MARGIN-TOP: 15px; DISPLAY: inline-block; MARGIN-LEFT: 0px"
+      height=109 src="cid:006A71303B80404E9FB6184E55D6A446@wc" width=140
      border=0></A></DIV></TD></TR>
  <TR>
    <TD>
-      <DIV 
-      style="FONT-SIZE: 10pt; WIDTH: 140px; FONT-FAMILY: verdana; TEXT-ALIGN: center"><EM><STRONG>This 
-      <U>is </U></STRONG><U>tit</U>le</EM> fo<STRONG>r <FONT 
-      face="Arial Black">t<FONT color=#800000 size=7>h<U>i</U></FONT>s 
+      <DIV
+      style="FONT-SIZE: 10pt; WIDTH: 140px; FONT-FAMILY: verdana; TEXT-ALIGN: center"><EM><STRONG>This
+      <U>is </U></STRONG><U>tit</U>le</EM> fo<STRONG>r <FONT
+      face="Arial Black">t<FONT color=#800000 size=7>h<U>i</U></FONT>s
      </FONT>picture</STRONG></DIV></TD></TR></TBODY></TABLE></SPAN></SPAN></SPAN>

-<DIV 
+<DIV
 style="PADDING-RIGHT: 5px; PADDING-LEFT: 7px; PADDING-BOTTOM: 2px; WIDTH: 100%; PADDING-TOP: 2px; HEIGHT: 50px">
 <DIV>&nbsp;</DIV></DIV>
-<DIV 
-style="BORDER-TOP: #dddddd 1px solid; FONT-SIZE: 10pt; MARGIN-BOTTOM: 10px; WIDTH: 100%; COLOR: #909090; MARGIN-RIGHT: 10px; PADDING-TOP: 9px; FONT-FAMILY: Verdana; HEIGHT: 42px; BACKGROUND-COLOR: #ffffff"><NOBR><SPAN 
-title="Join Windows Live to share photos using Windows Live Photo E-mail.">Online 
-pictures are available for 30 days. <A style="COLOR: #0088e4" 
-href="http://g.msn.com/5meen_us/175">Get Windows Live Mail desktop to create 
+<DIV
+style="BORDER-TOP: #dddddd 1px solid; FONT-SIZE: 10pt; MARGIN-BOTTOM: 10px; WIDTH: 100%; COLOR: #909090; MARGIN-RIGHT: 10px; PADDING-TOP: 9px; FONT-FAMILY: Verdana; HEIGHT: 42px; BACKGROUND-COLOR: #ffffff"><NOBR><SPAN
+title="Join Windows Live to share photos using Windows Live Photo E-mail.">Online
+pictures are available for 30 days. <A style="COLOR: #0088e4"
+href="http://g.msn.com/5meen_us/175">Get Windows Live Mail desktop to create
 your own photo e-mails. </A></SPAN></NOBR></DIV></BODY></HTML>