1
0
mirror of https://github.com/ezyang/htmlpurifier.git synced 2025-01-16 13:39:02 +01:00

fix: non-substantive typos (#434)

Co-authored-by: Viktor Szépe <viktor@szepe.net>
Co-authored-by: Edward Z. Yang <ezyang@mit.edu>
This commit is contained in:
Pieter Oliver 2025-01-14 21:31:27 +00:00 committed by GitHub
parent b21a59101f
commit c2bc3549a3
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194
43 changed files with 64 additions and 64 deletions

View File

@ -99,7 +99,7 @@ Autoload compatibility
HTML Purifier attempts to be as smart as possible when registering an HTML Purifier attempts to be as smart as possible when registering an
autoloader, but there are some cases where you will need to change autoloader, but there are some cases where you will need to change
your own code to accomodate HTML Purifier. These are those cases: your own code to accommodate HTML Purifier. These are those cases:
AN __autoload FUNCTION IS DECLARED AFTER OUR AUTOLOADER IS REGISTERED AN __autoload FUNCTION IS DECLARED AFTER OUR AUTOLOADER IS REGISTERED
spl_autoload_register() has the curious behavior of disabling spl_autoload_register() has the curious behavior of disabling

16
NEWS
View File

@ -482,7 +482,7 @@ real release we decided to skip this version number.
3.3.0, released 2009-02-16 3.3.0, released 2009-02-16
! Implement CSS property 'overflow' when %CSS.AllowTricky is true. ! Implement CSS property 'overflow' when %CSS.AllowTricky is true.
! Implement generic property list classess ! Implement generic property list classes
- Fix bug with testEncodingSupportsASCII() algorithm when iconv() implementation - Fix bug with testEncodingSupportsASCII() algorithm when iconv() implementation
does not do the "right thing" with characters not supported in the output does not do the "right thing" with characters not supported in the output
set. set.
@ -522,7 +522,7 @@ real release we decided to skip this version number.
! %Core.AggressivelyFixLt is on by default. This causes more sensible ! %Core.AggressivelyFixLt is on by default. This causes more sensible
processing of left angled brackets in smileys and other whatnot. processing of left angled brackets in smileys and other whatnot.
! Test scripts now have a 'type' parameter, which lets you say 'htmlpurifier', ! Test scripts now have a 'type' parameter, which lets you say 'htmlpurifier',
'phpt', 'vtest', etc. in order to only execute those tests. This supercedes 'phpt', 'vtest', etc. in order to only execute those tests. This supersedes
the --only-phpt parameter, although for backwards-compatibility the flag the --only-phpt parameter, although for backwards-compatibility the flag
will still work. will still work.
! AutoParagraph auto-formatter will now preserve double-newlines upon output. ! AutoParagraph auto-formatter will now preserve double-newlines upon output.
@ -573,7 +573,7 @@ real release we decided to skip this version number.
. Added --quick option to multitest.php, which tests only the most recent . Added --quick option to multitest.php, which tests only the most recent
release for each series. release for each series.
. Added --distro option to multitest.php, which accepts either 'normal' or . Added --distro option to multitest.php, which accepts either 'normal' or
'standalone'. This supercedes --exclude-normal and --exclude-standalone 'standalone'. This supersedes --exclude-normal and --exclude-standalone
3.1.1, released 2008-06-19 3.1.1, released 2008-06-19
# %URI.Munge now, by default, does not munge resources (for example, <img src="">) # %URI.Munge now, by default, does not munge resources (for example, <img src="">)
@ -723,7 +723,7 @@ real release we decided to skip this version number.
# HTMLPurifier->addFilter is deprecated; built-in filters can now be # HTMLPurifier->addFilter is deprecated; built-in filters can now be
enabled using %Filter.$filter_name or by setting your own filters using enabled using %Filter.$filter_name or by setting your own filters using
%Filter.Custom %Filter.Custom
# Directive-level safety properties superceded in favor of module-level # Directive-level safety properties superseded in favor of module-level
safety. Internal method HTMLModule->addElement() has changed, although safety. Internal method HTMLModule->addElement() has changed, although
the externally visible HTMLDefinition->addElement has *not* changed. the externally visible HTMLDefinition->addElement has *not* changed.
! Extra utility classes for testing and non-library operations can ! Extra utility classes for testing and non-library operations can
@ -769,7 +769,7 @@ real release we decided to skip this version number.
. A couple of new historical maintenance scripts were added. . A couple of new historical maintenance scripts were added.
. HTMLPurifier/HTMLModule/Tidy/XHTMLAndHTML4.php split into two files . HTMLPurifier/HTMLModule/Tidy/XHTMLAndHTML4.php split into two files
. tests/index.php can now be run from any directory. . tests/index.php can now be run from any directory.
. HTMLPurifier_Token subclasses split into seperate files . HTMLPurifier_Token subclasses split into separate files
. HTMLPURIFIER_PREFIX now is defined in Bootstrap.php, NOT HTMLPurifier.php . HTMLPURIFIER_PREFIX now is defined in Bootstrap.php, NOT HTMLPurifier.php
. HTMLPURIFIER_PREFIX can now be defined outside of HTML Purifier . HTMLPURIFIER_PREFIX can now be defined outside of HTML Purifier
. New --php=php flag added, allows PHP executable to be specified (command . New --php=php flag added, allows PHP executable to be specified (command
@ -835,7 +835,7 @@ real release we decided to skip this version number.
- Fix warning emitted when a non-supported URI scheme is passed to the - Fix warning emitted when a non-supported URI scheme is passed to the
MakeAbsolute URIFilter, thanks NykO18 (again) MakeAbsolute URIFilter, thanks NykO18 (again)
- Further refine AutoParagraph injector. Behavior inside of elements - Further refine AutoParagraph injector. Behavior inside of elements
allowing paragraph tags clarified: only inline content delimeted by allowing paragraph tags clarified: only inline content delimited by
double newlines (not block elements) are paragraphed. double newlines (not block elements) are paragraphed.
- Buggy treatment of end tags of elements that have required attributes - Buggy treatment of end tags of elements that have required attributes
fixed (does not manifest on default tag-set) fixed (does not manifest on default tag-set)
@ -881,7 +881,7 @@ real release we decided to skip this version number.
! CSS property border-spacing implemented ! CSS property border-spacing implemented
- Fix non-visible parsing error in DirectLex with empty tags that have - Fix non-visible parsing error in DirectLex with empty tags that have
slashes inside attribute values. slashes inside attribute values.
- Fix typo in CSS definition: border-collapse:seperate; was incorrectly - Fix typo in CSS definition: border-collapse:separate; was incorrectly
accepted as valid CSS. Usually non-visible, because this styling is the accepted as valid CSS. Usually non-visible, because this styling is the
default for tables in most browsers. Thanks Brett Zamir for pointing default for tables in most browsers. Thanks Brett Zamir for pointing
this out. this out.
@ -932,7 +932,7 @@ real release we decided to skip this version number.
. HTMLPurifier_Config->getSerial() implemented, this is extremely useful . HTMLPurifier_Config->getSerial() implemented, this is extremely useful
for output cache invalidation for output cache invalidation
. ConfigForm printer now can retrieve CSS and JS files as strings, in . ConfigForm printer now can retrieve CSS and JS files as strings, in
case HTML Purifier's directory is not publically accessible case HTML Purifier's directory is not publicly accessible
. Introduce new text/itext configuration directive values: these represent . Introduce new text/itext configuration directive values: these represent
longer strings that would be more appropriately edited with a textarea longer strings that would be more appropriately edited with a textarea
. Allow newlines to act as separators for lists, hashes, lookups and . Allow newlines to act as separators for lists, hashes, lookups and

4
TODO
View File

@ -41,7 +41,7 @@ FUTURE VERSIONS
- Config: Add examples to everything (make built-in which also automatically - Config: Add examples to everything (make built-in which also automatically
gives output) gives output)
- Add "register" field to config schemas to eliminate dependence on - Add "register" field to config schemas to eliminate dependence on
naming conventions (try to remember why we ultimately decided on tihs) naming conventions (try to remember why we ultimately decided on this)
5.0 release [HTML 5] 5.0 release [HTML 5]
# Swap out code to use html5lib tokenizer and tree-builder # Swap out code to use html5lib tokenizer and tree-builder
@ -112,7 +112,7 @@ Neat feature related
Also, enable disabling of directionality Also, enable disabling of directionality
? Externalize inline CSS to promote clean HTML, proposed by Sander Tekelenburg ? Externalize inline CSS to promote clean HTML, proposed by Sander Tekelenburg
? Remove redundant tags, ex. <u><u>Underlined</u></u>. Implementation notes: ? Remove redundant tags, ex. <u><u>Underlined</u></u>. Implementation notes:
1. Analyzing which tags to remove duplicants 1. Analyzing which tags to remove duplicates
2. Ensure attributes are merged into the parent tag 2. Ensure attributes are merged into the parent tag
3. Extend the tag exclusion system to specify whether or not the 3. Extend the tag exclusion system to specify whether or not the
contents should be dropped or not (currently, there's code that could do contents should be dropped or not (currently, there's code that could do

View File

@ -392,7 +392,7 @@ Women practicing non-martial T'ai Chi in <a href="/wiki/Chinatown_%28Manhattan%2
<li><a href="http://www.scheele.org/lee/tcclinks.html" class="external text" title="http://www.scheele.org/lee/tcclinks.html">Lee Scheele's Links to T'ai Chi Ch'uan Web Sites</a></li> <li><a href="http://www.scheele.org/lee/tcclinks.html" class="external text" title="http://www.scheele.org/lee/tcclinks.html">Lee Scheele's Links to T'ai Chi Ch'uan Web Sites</a></li>
<li><a href="http://news.bbc.co.uk/1/hi/health/3543907.stm" class="external text" title="http://news.bbc.co.uk/1/hi/health/3543907.stm">BBC article</a></li> <li><a href="http://news.bbc.co.uk/1/hi/health/3543907.stm" class="external text" title="http://news.bbc.co.uk/1/hi/health/3543907.stm">BBC article</a></li>
<li><a href="http://www.acupuncturetoday.com/archives2004/jul/07taichi.html" class="external text" title="http://www.acupuncturetoday.com/archives2004/jul/07taichi.html">Tai Chi: Good for the Mind, Good for the Body</a></li> <li><a href="http://www.acupuncturetoday.com/archives2004/jul/07taichi.html" class="external text" title="http://www.acupuncturetoday.com/archives2004/jul/07taichi.html">Tai Chi: Good for the Mind, Good for the Body</a></li>
<li><a href="http://www.taichiunion.com/" class="external text" title="http://www.taichiunion.com/">Tai Chi Chuan Union for Great Britian: The largest collective of independent Tai Chi Chuan Instructors in the British Isles</a></li> <li><a href="http://www.taichiunion.com/" class="external text" title="http://www.taichiunion.com/">Tai Chi Chuan Union for Great Britain: The largest collective of independent Tai Chi Chuan Instructors in the British Isles</a></li>
</ul> </ul>

View File

@ -49,7 +49,7 @@ the properties:
AllowedFrameTargets -> heavily <a> specific, but also used by <area> AllowedFrameTargets -> heavily <a> specific, but also used by <area>
and <form>. Transitional DTD %FrameTarget, not present in strict, and <form>. Transitional DTD %FrameTarget, not present in strict,
HTML5 calls them "browsing contexts" HTML5 calls them "browsing contexts"
Default*Image* -> as a default parameter, is almost entirely exlcusive Default*Image* -> as a default parameter, is almost entirely exclusive
to <img> to <img>
EnableID -> global attribute EnableID -> global attribute
Name.UseCDATA -> heavily <a> specific, but has heavy other usage by Name.UseCDATA -> heavily <a> specific, but has heavy other usage by

View File

@ -122,7 +122,7 @@ thead th {text-align:left;padding:0.1em;background-color:#EEE;}
<tbody> <tbody>
<tr><th colspan="2">Table</th></tr> <tr><th colspan="2">Table</th></tr>
<tr class="impl-yes"><td>border-collapse</td><td>ENUM(collapse, seperate)</td></tr> <tr class="impl-yes"><td>border-collapse</td><td>ENUM(collapse, separate)</td></tr>
<tr class="impl-yes"><td>border-space</td><td>MULTIPLE</td></tr> <tr class="impl-yes"><td>border-space</td><td>MULTIPLE</td></tr>
<tr class="impl-yes"><td>caption-side</td><td>ENUM(top, bottom)</td></tr> <tr class="impl-yes"><td>caption-side</td><td>ENUM(top, bottom)</td></tr>
<tr class="feature"><td>empty-cells</td><td>ENUM(show, hide), No IE support makes this useless, <tr class="feature"><td>empty-cells</td><td>ENUM(show, hide), No IE support makes this useless,

View File

@ -163,7 +163,7 @@ smoketest</a>.</p>
<p>So you want HTML Purifier to clean up your HTML, but you're not <p>So you want HTML Purifier to clean up your HTML, but you're not
so happy about the br@clear implementation. That's perfectly fine! so happy about the br@clear implementation. That's perfectly fine!
HTML Purifier will make accomodations:</p> HTML Purifier will make accommodations:</p>
<pre>$config-&gt;set('HTML.Doctype', 'XHTML 1.0 Transitional'); <pre>$config-&gt;set('HTML.Doctype', 'XHTML 1.0 Transitional');
$config-&gt;set('HTML.TidyLevel', 'heavy'); // all changes, minus... $config-&gt;set('HTML.TidyLevel', 'heavy'); // all changes, minus...

View File

@ -60,7 +60,7 @@ These are special use tags, they should be enabled on a blanket basis.
Lists - dd, dl, dt, li, ol, ul ~ menu, dir Lists - dd, dl, dt, li, ol, ul ~ menu, dir
Tables - caption, table, td, th, tr / col, colgroup, tbody, tfoot, thead Tables - caption, table, td, th, tr / col, colgroup, tbody, tfoot, thead
Forms - fieldset, form, input, lable, legend, optgroup, option, select, textarea Forms - fieldset, form, input, label, legend, optgroup, option, select, textarea
XSS - noscript, object, script ~ applet XSS - noscript, object, script ~ applet
Meta - base, basefont, body, head, html, link, meta, style, title Meta - base, basefont, body, head, html, link, meta, style, title
Frames - frame, frameset, iframe Frames - frame, frameset, iframe
@ -91,7 +91,7 @@ attribute and put URI filtering higher up on the priority list.
== Attribute Risk Analysis == == Attribute Risk Analysis ==
We actually have a suprisingly small assortment of allowed attributes (the We actually have a surprisingly small assortment of allowed attributes (the
rest are deprecated in strict, and thus we opted not to allow them, even rest are deprecated in strict, and thus we opted not to allow them, even
though our output is XHTML Transitional by default.) though our output is XHTML Transitional by default.)

View File

@ -70,7 +70,7 @@ Backfills/Data integrity:
Type systems: Type systems:
- Flags: ReadOnly, Permanent, DontEnum - Flags: ReadOnly, Permanent, DontEnum
- Typed properties isn't that useful [It's also Not-PHP] - Typed properties isn't that useful [It's also Not-PHP]
- Seperate meta-list of directive properties IS useful - Separate meta-list of directive properties IS useful
- Duck typing is useful for systems designed fully around properties pattern - Duck typing is useful for systems designed fully around properties pattern
Trade-off: Trade-off:

View File

@ -195,7 +195,7 @@ class HTMLPurifier_AttrDef_CSS_FontFamily extends HTMLPurifier_AttrDef
// transforms don't pose a security risk (as \\ and \" // transforms don't pose a security risk (as \\ and \"
// might--these escapes are not supported by most browsers). // might--these escapes are not supported by most browsers).
// We could try to be clever and use single-quote wrapping // We could try to be clever and use single-quote wrapping
// when there is a double quote present, but I have choosen // when there is a double quote present, but I have chosen
// not to implement that. (NOTE: you can reduce the amount // not to implement that. (NOTE: you can reduce the amount
// of escapes by one depending on what quoting style you use) // of escapes by one depending on what quoting style you use)
// $font = str_replace('\\', '\\5C ', $font); // $font = str_replace('\\', '\\5C ', $font);

View File

@ -37,7 +37,7 @@ class HTMLPurifier_AttrDef_URI_IPv6 extends HTMLPurifier_AttrDef_URI_IPv4
} }
} }
// IPv4-compatiblity check // IPv4-compatibility check
if (preg_match('#(?<=:' . ')' . $this->ip4 . '$#s', $aIP, $find)) { if (preg_match('#(?<=:' . ')' . $this->ip4 . '$#s', $aIP, $find)) {
$aIP = substr($aIP, 0, 0 - strlen($find[0])); $aIP = substr($aIP, 0, 0 - strlen($find[0]));
$ip = explode('.', $find[0]); $ip = explode('.', $find[0]);

View File

@ -3,7 +3,7 @@
// this MUST be placed in post, as it assumes that any value in dir is valid // this MUST be placed in post, as it assumes that any value in dir is valid
/** /**
* Post-trasnform that ensures that bdo tags have the dir attribute set. * Post-transform that ensures that bdo tags have the dir attribute set.
*/ */
class HTMLPurifier_AttrTransform_BdoDir extends HTMLPurifier_AttrTransform class HTMLPurifier_AttrTransform_BdoDir extends HTMLPurifier_AttrTransform
{ {

View File

@ -135,7 +135,7 @@ class HTMLPurifier_AttrValidator
// we'd also want slightly more complicated substitution // we'd also want slightly more complicated substitution
// involving an array as the return value, // involving an array as the return value,
// although we're not sure how colliding attributes would // although we're not sure how colliding attributes would
// resolve (certain ones would be completely overriden, // resolve (certain ones would be completely overridden,
// others would prepend themselves). // others would prepend themselves).
} }

View File

@ -5,7 +5,7 @@ if (!defined('HTMLPURIFIER_PREFIX')) {
define('HTMLPURIFIER_PREFIX', realpath(dirname(__FILE__) . '/..')); define('HTMLPURIFIER_PREFIX', realpath(dirname(__FILE__) . '/..'));
} }
// accomodations for versions earlier than 5.0.2 // accommodations for versions earlier than 5.0.2
// borrowed from PHP_Compat, LGPL licensed, by Aidan Lister <aidan@php.net> // borrowed from PHP_Compat, LGPL licensed, by Aidan Lister <aidan@php.net>
if (!defined('PHP_EOL')) { if (!defined('PHP_EOL')) {
switch (strtoupper(substr(PHP_OS, 0, 3))) { switch (strtoupper(substr(PHP_OS, 0, 3))) {

View File

@ -66,7 +66,7 @@ class HTMLPurifier_ConfigSchema_Interchange_Directive
public $version; public $version;
/** /**
* ID of directive that supercedes this old directive. * ID of directive that supersedes this old directive.
* Null if not deprecated. * Null if not deprecated.
* @type HTMLPurifier_ConfigSchema_Interchange_Id * @type HTMLPurifier_ConfigSchema_Interchange_Id
*/ */

View File

@ -5,10 +5,10 @@ DEFAULT: ''
--DESCRIPTION-- --DESCRIPTION--
Temporary prefix for IDs used in conjunction with %Attr.IDPrefix. If you Temporary prefix for IDs used in conjunction with %Attr.IDPrefix. If you
need to allow multiple sets of user content on web page, you may need to need to allow multiple sets of user content on web page, you may need to
have a seperate prefix that changes with each iteration. This way, have a separate prefix that changes with each iteration. This way,
seperately submitted user content displayed on the same page doesn't separately submitted user content displayed on the same page doesn't
clobber each other. Ideal values are unique identifiers for the content it clobber each other. Ideal values are unique identifiers for the content it
represents (i.e. the id of the row in the database). Be sure to add a represents (i.e. the id of the row in the database). Be sure to add a
seperator (like an underscore) at the end. Warning: this directive will separator (like an underscore) at the end. Warning: this directive will
not work unless %Attr.IDPrefix is set to a non-empty value! not work unless %Attr.IDPrefix is set to a non-empty value!
--# vim: et sw=4 sts=4 --# vim: et sw=4 sts=4

View File

@ -8,6 +8,6 @@ converting all non-ASCII characters into decimal numeric entities before
converting it to its native encoding. This means that even characters that converting it to its native encoding. This means that even characters that
can be expressed in the non-UTF-8 encoding will be entity-ized, which can can be expressed in the non-UTF-8 encoding will be entity-ized, which can
be a real downer for encodings like Big5. It also assumes that the ASCII be a real downer for encodings like Big5. It also assumes that the ASCII
repetoire is available, although this is the case for almost all encodings. repertoire is available, although this is the case for almost all encodings.
Anyway, use UTF-8! Anyway, use UTF-8!
--# vim: et sw=4 sts=4 --# vim: et sw=4 sts=4

View File

@ -16,7 +16,7 @@ DEFAULT: NULL
</dd> </dd>
<dt><em>string</em> lexer identifier</dt> <dt><em>string</em> lexer identifier</dt>
<dd> <dd>
This is a slim way of manually overridding the implementation. This is a slim way of manually overriding the implementation.
Currently recognized values are: DOMLex (the default PHP5 Currently recognized values are: DOMLex (the default PHP5
implementation) implementation)
and DirectLex (the default PHP4 implementation). Only use this if and DirectLex (the default PHP4 implementation). Only use this if

View File

@ -5,7 +5,7 @@
// $config or $context to the callback functions. // $config or $context to the callback functions.
/** /**
* Handles referencing and derefencing character entities * Handles referencing and dereferencing character entities
*/ */
class HTMLPurifier_EntityParser class HTMLPurifier_EntityParser
{ {

View File

@ -4,7 +4,7 @@
* Represents a pre or post processing filter on HTML Purifier's output * Represents a pre or post processing filter on HTML Purifier's output
* *
* Sometimes, a little ad-hoc fixing of HTML has to be done before * Sometimes, a little ad-hoc fixing of HTML has to be done before
* it gets sent through HTML Purifier: you can use filters to acheive * it gets sent through HTML Purifier: you can use filters to achieve
* this effect. For instance, YouTube videos can be preserved using * this effect. For instance, YouTube videos can be preserved using
* this manner. You could have used a decorator for this task, but * this manner. You could have used a decorator for this task, but
* PHP's support for them is not terribly robust, so we're going * PHP's support for them is not terribly robust, so we're going

View File

@ -244,7 +244,7 @@ class HTMLPurifier_Generator
// whitespace (in fact, most don't, at least for attributes // whitespace (in fact, most don't, at least for attributes
// like alt, but an extra space at the end is barely // like alt, but an extra space at the end is barely
// noticeable). Still, we have a configuration knob for // noticeable). Still, we have a configuration knob for
// this, since this transformation is not necesary if you // this, since this transformation is not necessary if you
// don't process user input with innerHTML or you don't plan // don't process user input with innerHTML or you don't plan
// on supporting Internet Explorer. // on supporting Internet Explorer.
if ($this->_innerHTMLFix) { if ($this->_innerHTMLFix) {

View File

@ -28,7 +28,7 @@ class HTMLPurifier_HTMLModule_Edit extends HTMLPurifier_HTMLModule
// HTML 4.01 specifies that ins/del must not contain block // HTML 4.01 specifies that ins/del must not contain block
// elements when used in an inline context, chameleon is // elements when used in an inline context, chameleon is
// a complicated workaround to acheive this effect // a complicated workaround to achieve this effect
// Inline context ! Block context (exclamation mark is // Inline context ! Block context (exclamation mark is
// separator, see getChildDef for parsing) // separator, see getChildDef for parsing)

View File

@ -2,7 +2,7 @@
/** /**
* XHTML 1.1 Ruby Annotation Module, defines elements that indicate * XHTML 1.1 Ruby Annotation Module, defines elements that indicate
* short runs of text alongside base text for annotation or pronounciation. * short runs of text alongside base text for annotation or pronunciation.
*/ */
class HTMLPurifier_HTMLModule_Ruby extends HTMLPurifier_HTMLModule class HTMLPurifier_HTMLModule_Ruby extends HTMLPurifier_HTMLModule
{ {

View File

@ -1,7 +1,7 @@
<?php <?php
/** /**
* Name is deprecated, but allowed in strict doctypes, so onl * Name is deprecated, but allowed in strict doctypes, so only
*/ */
class HTMLPurifier_HTMLModule_Tidy_Name extends HTMLPurifier_HTMLModule_Tidy class HTMLPurifier_HTMLModule_Tidy_Name extends HTMLPurifier_HTMLModule_Tidy
{ {

View File

@ -1223,14 +1223,14 @@ class HTML5
'type' => self::COMMENT 'type' => self::COMMENT
); );
/* Otherwise if the next seven chacacters are a case-insensitive match /* Otherwise if the next seven characters are a case-insensitive match
for the word "DOCTYPE", then consume those characters and switch to the for the word "DOCTYPE", then consume those characters and switch to the
DOCTYPE state. */ DOCTYPE state. */
} elseif (strtolower($this->character($this->char + 1, 7)) === 'doctype') { } elseif (strtolower($this->character($this->char + 1, 7)) === 'doctype') {
$this->char += 7; $this->char += 7;
$this->state = 'doctype'; $this->state = 'doctype';
/* Otherwise, is is a parse error. Switch to the bogus comment state. /* Otherwise, it is a parse error. Switch to the bogus comment state.
The next character that is consumed, if any, is the first character The next character that is consumed, if any, is the first character
that will be in the comment. */ that will be in the comment. */
} else { } else {

View File

@ -20,7 +20,7 @@
* The second objective is to ensure that explicitly excluded elements of * The second objective is to ensure that explicitly excluded elements of
* an element do not appear in its children. Code that accomplishes this * an element do not appear in its children. Code that accomplishes this
* task is pervasive through the strategy, though the two are distinct tasks * task is pervasive through the strategy, though the two are distinct tasks
* and could, theoretically, be seperated (although it's not recommended). * and could, theoretically, be separated (although it's not recommended).
* *
* @note Whether or not unrecognized children are silently dropped or * @note Whether or not unrecognized children are silently dropped or
* translated into text depends on the child definitions. * translated into text depends on the child definitions.

View File

@ -641,7 +641,7 @@ class HTMLPurifier_Strategy_MakeWellFormed extends HTMLPurifier_Strategy
// Needless to say, we need to UN-skip the token so it gets // Needless to say, we need to UN-skip the token so it gets
// reprocessed. // reprocessed.
// //
// - Suppose that you successfuly process a token, replace it with // - Suppose that you successfully process a token, replace it with
// one with your skip mark, but now another injector wants to // one with your skip mark, but now another injector wants to
// process the skipped token with another token. Should you continue // process the skipped token with another token. Should you continue
// to skip that new token, or reprocess it? If you reprocess, // to skip that new token, or reprocess it? If you reprocess,

View File

@ -71,7 +71,7 @@ class HTMLPurifier_URIFilter_Munge extends HTMLPurifier_URIFilter
} // ignore unknown schemes, maybe another postfilter did it } // ignore unknown schemes, maybe another postfilter did it
if (!$scheme_obj->browsable) { if (!$scheme_obj->browsable) {
return true; return true;
} // ignore non-browseable schemes, since we can't munge those in a reasonable way } // ignore non-browsable schemes, since we can't munge those in a reasonable way
if ($uri->isBenign($config, $context)) { if ($uri->isBenign($config, $context)) {
return true; return true;
} // don't redirect if a benign URL } // don't redirect if a benign URL

View File

@ -29,7 +29,7 @@ class HTMLPurifier_URIParser
// Regexp is as per Appendix B. // Regexp is as per Appendix B.
// Note that ["<>] are an addition to the RFC's recommended // Note that ["<>] are an addition to the RFC's recommended
// characters, because they represent external delimeters. // characters, because they represent external delimiters.
$r_URI = '!'. $r_URI = '!'.
'(([a-zA-Z0-9\.\+\-]+):)?'. // 2. Scheme '(([a-zA-Z0-9\.\+\-]+):)?'. // 2. Scheme
'(//([^/?#"<>]*))?'. // 4. Authority '(//([^/?#"<>]*))?'. // 4. Authority
@ -43,7 +43,7 @@ class HTMLPurifier_URIParser
if (!$result) return false; // *really* invalid URI if (!$result) return false; // *really* invalid URI
// seperate out parts // separate out parts
$scheme = !empty($matches[1]) ? $matches[2] : null; $scheme = !empty($matches[1]) ? $matches[2] : null;
$authority = !empty($matches[3]) ? $matches[4] : null; $authority = !empty($matches[3]) ? $matches[4] : null;
$path = $matches[5]; // always present, can be empty $path = $matches[5]; // always present, can be empty

View File

@ -800,14 +800,14 @@ class HTML5
'type' => self::COMMENT 'type' => self::COMMENT
); );
/* Otherwise if the next seven chacacters are a case-insensitive match /* Otherwise if the next seven characters are a case-insensitive match
for the word "DOCTYPE", then consume those characters and switch to the for the word "DOCTYPE", then consume those characters and switch to the
DOCTYPE state. */ DOCTYPE state. */
} elseif(strtolower($this->character($this->char + 1, 7)) === 'doctype') { } elseif(strtolower($this->character($this->char + 1, 7)) === 'doctype') {
$this->char += 7; $this->char += 7;
$this->state = 'doctype'; $this->state = 'doctype';
/* Otherwise, is is a parse error. Switch to the bogus comment state. /* Otherwise, it is a parse error. Switch to the bogus comment state.
The next character that is consumed, if any, is the first character The next character that is consumed, if any, is the first character
that will be in the comment. */ that will be in the comment. */
} else { } else {

View File

@ -14,7 +14,7 @@ if(!defined("PHORUM_ADMIN")) exit;
// error reporting is good! // error reporting is good!
error_reporting(E_ALL ^ E_NOTICE); error_reporting(E_ALL ^ E_NOTICE);
// load library and other paraphenalia // load library and other paraphernalia
require_once './include/admin/PhorumInputForm.php'; require_once './include/admin/PhorumInputForm.php';
require_once (dirname(__FILE__) . '/htmlpurifier/HTMLPurifier.auto.php'); require_once (dirname(__FILE__) . '/htmlpurifier/HTMLPurifier.auto.php');
require_once (dirname(__FILE__) . '/init-config.php'); require_once (dirname(__FILE__) . '/init-config.php');

View File

@ -36,7 +36,7 @@ function formatCode($string)
<p>XSS attacks are from <p>XSS attacks are from
<a href="http://ha.ckers.org/xss.html">http://ha.ckers.org/xss.html</a>.</p> <a href="http://ha.ckers.org/xss.html">http://ha.ckers.org/xss.html</a>.</p>
<p><strong>Caveats:</strong> <p><strong>Caveats:</strong>
<tt>Google.com</tt> has been programatically disallowed, but as you can <tt>Google.com</tt> has been programmatically disallowed, but as you can
see, there are ways of getting around that, so coverage in this area see, there are ways of getting around that, so coverage in this area
is not complete. Most XSS broadcasts its presence by spawning an alert dialogue. is not complete. Most XSS broadcasts its presence by spawning an alert dialogue.
The displayed code is not strictly correct, as linebreaks have been forced for The displayed code is not strictly correct, as linebreaks have been forced for
@ -50,7 +50,7 @@ if (version_compare(PHP_VERSION, '5', '<')) exit('<p>Requires PHP 5.</p>');
$xml = simplexml_load_file('xssAttacks.xml'); $xml = simplexml_load_file('xssAttacks.xml');
// programatically disallow google.com for URI evasion tests // programmatically disallow google.com for URI evasion tests
// not complete // not complete
$config = HTMLPurifier_Config::createDefault(); $config = HTMLPurifier_Config::createDefault();
$config->set('URI.HostBlacklist', array('google.com')); $config->set('URI.HostBlacklist', array('google.com'));

View File

@ -864,7 +864,7 @@ As a side note, this was also effective against a real world XSS filter I came a
<attack> <attack>
<name>Extraneous Open Brackets</name> <name>Extraneous Open Brackets</name>
<code>&lt;&lt;SCRIPT&gt;alert(&quot;XSS&quot;);//&lt;&lt;/SCRIPT&gt;</code> <code>&lt;&lt;SCRIPT&gt;alert(&quot;XSS&quot;);//&lt;&lt;/SCRIPT&gt;</code>
<desc>(Submitted by Franz Sedlmaier http://www.pilorz.net/). This XSS vector could defeat certain detection engines that work by first using matching pairs of open and close angle brackets and then by doing a comparison of the tag inside, instead of a more efficient algorythm like Boyer-Moore (http://www.cs.utexas.edu/users/moore/best-ideas/string-searching/) that looks for entire string matches of the open angle bracket and associated tag (post de-obfuscation, of course). The double slash comments out the ending extraneous bracket to supress a JavaScript error.</desc> <desc>(Submitted by Franz Sedlmaier http://www.pilorz.net/). This XSS vector could defeat certain detection engines that work by first using matching pairs of open and close angle brackets and then by doing a comparison of the tag inside, instead of a more efficient algorithm like Boyer-Moore (http://www.cs.utexas.edu/users/moore/best-ideas/string-searching/) that looks for entire string matches of the open angle bracket and associated tag (post de-obfuscation, of course). The double slash comments out the ending extraneous bracket to suppress a JavaScript error.</desc>
<label>Embedded Character Attacks</label> <label>Embedded Character Attacks</label>
<browser>Browser support: [&lt;span class=&quot;s&quot;&gt;IE6.0&lt;/span&gt;|&lt;span class=&quot;s&quot;&gt;NS8.1-IE&lt;/span&gt;] [&lt;span class=&quot;s&quot;&gt;NS8.1-G&lt;/span&gt;|&lt;span class=&quot;s&quot;&gt;FF1.5&lt;/span&gt;] [&lt;span class=&quot;s&quot;&gt;O8.54&lt;/span&gt;]</browser> <browser>Browser support: [&lt;span class=&quot;s&quot;&gt;IE6.0&lt;/span&gt;|&lt;span class=&quot;s&quot;&gt;NS8.1-IE&lt;/span&gt;] [&lt;span class=&quot;s&quot;&gt;NS8.1-G&lt;/span&gt;|&lt;span class=&quot;s&quot;&gt;FF1.5&lt;/span&gt;] [&lt;span class=&quot;s&quot;&gt;O8.54&lt;/span&gt;]</browser>
@ -940,7 +940,7 @@ alert(a.source)&lt;/SCRIPT&gt;</code>
-onCut() (user needs to copy something or it can be exploited using the execCommand(&quot;Cut&quot;) command) -onCut() (user needs to copy something or it can be exploited using the execCommand(&quot;Cut&quot;) command)
-onDataAvailible() (user would need to change data in an element, or attacker could perform the same function) -onDataAvailable() (user would need to change data in an element, or attacker could perform the same function)
-onDataSetChanged() (fires when the data set exposed by a data source object changes) -onDataSetChanged() (fires when the data set exposed by a data source object changes)

View File

@ -34,12 +34,12 @@ class HTMLPurifier_AttrCollectionsTest extends HTMLPurifier_Harness
$modules['Module2'] = new HTMLPurifier_HTMLModule(); $modules['Module2'] = new HTMLPurifier_HTMLModule();
$modules['Module2']->attr_collections = array( $modules['Module2']->attr_collections = array(
'Core' => array( 'Core' => array(
0 => array('Brocolli') 0 => array('Broccoli')
), ),
'Soup' => array( 'Soup' => array(
'attribute-3' => 'Type3' 'attribute-3' => 'Type3'
), ),
'Brocolli' => array() 'Broccoli' => array()
); );
$collections->doConstruct($types, $modules); $collections->doConstruct($types, $modules);
@ -48,14 +48,14 @@ class HTMLPurifier_AttrCollectionsTest extends HTMLPurifier_Harness
$collections->info, $collections->info,
array( array(
'Core' => array( 'Core' => array(
0 => array('Soup', 'Undefined', 'Brocolli'), 0 => array('Soup', 'Undefined', 'Broccoli'),
'attribute' => 'Type', 'attribute' => 'Type',
'attribute-2' => 'Type2' 'attribute-2' => 'Type2'
), ),
'Soup' => array( 'Soup' => array(
'attribute-3' => 'Type3' 'attribute-3' => 'Type3'
), ),
'Brocolli' => array() 'Broccoli' => array()
) )
); );

View File

@ -59,7 +59,7 @@ extends HTMLPurifier_ChildDefHarness
); );
} }
public function testWrapComplicatedSring() public function testWrapComplicatedString()
{ {
$this->assertResult( $this->assertResult(
$var = 'He said<br />perhaps<br />we should <b>nuke</b> them.', $var = 'He said<br />perhaps<br />we should <b>nuke</b> them.',

View File

@ -184,7 +184,7 @@ class HTMLPurifier_ConfigTest extends HTMLPurifier_Harness
) )
); );
// grab a non-existant namespace // grab a non-existent namespace
$this->expectError('Cannot retrieve undefined namespace Constants'); $this->expectError('Cannot retrieve undefined namespace Constants');
$config->getBatch('Constants'); $config->getBatch('Constants');

View File

@ -331,7 +331,7 @@ Par1
); );
} }
public function testBlockNodeTextDelimeterInBlockNode() public function testBlockNodeTextDelimiterInBlockNode()
{ {
$this->assertResult( $this->assertResult(
'<div>Par1 '<div>Par1
@ -343,7 +343,7 @@ Par1
); );
} }
public function testBlockNodeTextDelimeterWithoutDoublespaceInBlockNode() public function testBlockNodeTextDelimiterWithoutDoublespaceInBlockNode()
{ {
$this->assertResult( $this->assertResult(
'<div>Par1 '<div>Par1
@ -351,7 +351,7 @@ Par1
); );
} }
public function testBlockNodeTextDelimeterWithoutDoublespace() public function testBlockNodeTextDelimiterWithoutDoublespace()
{ {
$this->assertResult( $this->assertResult(
'Par1 'Par1

View File

@ -10,7 +10,7 @@ class HTMLPurifier_Injector_PurifierLinkifyTest extends HTMLPurifier_InjectorHar
$this->config->set('AutoFormat.PurifierLinkify.DocURL', '#%s'); $this->config->set('AutoFormat.PurifierLinkify.DocURL', '#%s');
} }
public function testNoTriggerCharacer() public function testNoTriggerCharacter()
{ {
$this->assertResult('Foobar'); $this->assertResult('Foobar');
} }

View File

@ -669,7 +669,7 @@ class HTMLPurifier_LexerTest extends HTMLPurifier_Harness
return; return;
} elseif (LIBXML_VERSION < 20628) { } elseif (LIBXML_VERSION < 20628) {
// libxml's behavior is wrong prior to this version, so make // libxml's behavior is wrong prior to this version, so make
// appropriate accomodations // appropriate accommodations
$extra['DOMLex'] = $extra['DirectLex']; $extra['DOMLex'] = $extra['DirectLex'];
} }
$this->assertTokenization( $this->assertTokenization(

View File

@ -29,7 +29,7 @@ class HTMLPurifier_PercentEncoderTest extends HTMLPurifier_Harness
$this->assertDecode('%2B2B%3A3A'); // extra text $this->assertDecode('%2B2B%3A3A'); // extra text
$this->assertDecode('%2b2B%4141', '%2B2BA41'); // extra text, with normalization $this->assertDecode('%2b2B%4141', '%2B2BA41'); // extra text, with normalization
$this->assertDecode('%', '%25'); // normalize stray percent sign $this->assertDecode('%', '%25'); // normalize stray percent sign
$this->assertDecode('%5%25', '%255%25'); // permaturely terminated encoding $this->assertDecode('%5%25', '%255%25'); // prematurely terminated encoding
$this->assertDecode('%GJ', '%25GJ'); // invalid hexadecimal chars $this->assertDecode('%GJ', '%25GJ'); // invalid hexadecimal chars
// contested behavior, if this changes, we'll also have to have // contested behavior, if this changes, we'll also have to have

View File

@ -157,7 +157,7 @@ class HTMLPurifier_Strategy_MakeWellFormedTest extends HTMLPurifier_StrategyHarn
); );
} }
public function testNoAutocloseIfNoParentsCanAccomodateTag() public function testNoAutocloseIfNoParentsCanAccommodateTag()
{ {
$this->assertResult( $this->assertResult(
'<table><tr><td><li>foo</li></td></tr></table>', '<table><tr><td><li>foo</li></td></tr></table>',

View File

@ -1,6 +1,6 @@
<?php <?php
// needs to be seperated into files // needs to be separated into files
class HTMLPurifier_TagTransformTest extends HTMLPurifier_Harness class HTMLPurifier_TagTransformTest extends HTMLPurifier_Harness
{ {

View File

@ -101,7 +101,7 @@ if (!SimpleReporter::inCli()) {
} }
// initialize and load HTML Purifier // initialize and load HTML Purifier
// use ?standalone to load the alterative standalone stub // use ?standalone to load the alternative standalone stub
if ($AC['standalone']) { if ($AC['standalone']) {
require '../library/HTMLPurifier.standalone.php'; require '../library/HTMLPurifier.standalone.php';
} else { } else {