From ccae73c25a12affe7cdff50d137d67908e5f96d3 Mon Sep 17 00:00:00 2001 From: "Edward Z. Yang" Date: Sun, 21 Jan 2007 15:23:42 +0000 Subject: [PATCH] Update TODO, remove caching: it won't help our memory usage or speed. git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@674 48356398-32a2-884e-a903-53898d9a118a --- TODO | 43 ++++++++++++++++++++----------------------- 1 file changed, 20 insertions(+), 23 deletions(-) diff --git a/TODO b/TODO index 6d5cf64f..aa625d3a 100644 --- a/TODO +++ b/TODO @@ -7,16 +7,14 @@ TODO List ? At-risk ========================== -1.4 release - - Aggressive caching - ? Configuration profiles: sets of directives that get set with one func call - 1.5 release # Implement all non-essential attribute transforms # URI validation routines tighter (see docs/dev-code-quality.html) (COMPLEX) # Advanced URI filtering schemes (see docs/proposal-new-directives.txt) # Error logging for filtering/cleanup procedures - Requires I18N facilities to be created first (COMPLEX) + ? Configuration profiles: sets of directives that get set with one func call + - XSS-attempt detection 1.6 release # Add pre-packaged "levels" of cleaning (custom behavior already done) @@ -25,11 +23,26 @@ TODO List specification of elements that, when detected as foreign, trigger removal of children, although unbalanced tags could wreck havoc (or at least delete the rest of the document)). + - Allow specifying global attributes on a tag-by-tag basis in + %HTML.AllowAttributes + ? More user-friendly warnings when %HTML.Allow* attempts to specify a + tag or attribute that is not supported + - Parse TinyMCE whitelist into our %HTML.Allow* whitelists 1.7 release # Additional support for poorly written HTML - Microsoft Word HTML cleaning (i.e. MsoNormal, but research essential!) - Friendly strict handling of
(block ->
) + - Remove redundant tags, ex. Underlined. Implementation notes: + 1. Analyzing which tags to remove duplicants + 2. Ensure attributes are merged into the parent tag + 3. Extend the tag exclusion system to specify whether or not the + contents should be dropped or not (currently, there's code that could do + something like this if it didn't drop the inner text too.) + - Remove tags that don't do anything (no attributes) + - Remove empty inline tags + - Append something to duplicate IDs so they're still usable (impl. note: the + dupe detector would also need to detect the suffix as well) 2.0 release # Legit token based CSS parsing (will require revamping almost every @@ -46,6 +59,8 @@ TODO List - Hooks for adding custom processors to custom namespaced tags and attributes, offer default implementation - Lots of documentation and samples + - Allow tags to be "armored", an internal flag that protects them + from validation and passes them out unharmed - XHTML 1.1 support Ongoing @@ -56,38 +71,20 @@ Ongoing - more! (look for ones that use WYSIWYGs) Unknown release (on a scratch-an-itch basis) + - Upgrade SimpleTest testing code to newest versions - Fixes for Firefox's inability to handle COL alignment props (Bug 915) - Automatically add non-breaking spaces to empty table cells when empty-cells:show is applied to have compatibility with Internet Explorer - Convert RTL/LTR override characters to tags, or vice versa on demand. Also, enable disabling of directionality - - Append something to duplicate IDs so they're still usable (impl. note: the - dupe detector would also need to detect the suffix as well) - Have 'lang' attribute be checked against official lists ? Semi-lossy dumb alternate character encoding transformations, achieved by encoding all characters that have string entity equivalents - - Upgrade SimpleTest testing code to newest version - - Allow tags to be "armored", an internal flag that protects them - from validation and passes them out unharmed Requested ? Native content compression, whitespace stripping (don't rely on Tidy, make sure we don't remove from
 or related tags)
  ? Win32 Phalanger C# binaries
- - Remove redundant tags, ex. Underlined. Implementation notes:
-    1. Analyzing which tags to remove duplicants
-    2. Ensure attributes are merged into the parent tag
-    3. Extend the tag exclusion system to specify whether or not the
-    contents should be dropped or not (currently, there's code that could do
-    something like this if it didn't drop the inner text too.)
- ? More user-friendly warnings when %HTML.Allow* attempts to specify a
-   tag or attribute that is not supported
- - Allow specifying global attributes on a tag-by-tag basis in
-   %HTML.AllowAttributes
- - Parse TinyMCE whitelist into our %HTML.Allow* whitelists
- - XSS-attempt detection
- - Remove  tags that don't do anything (no attributes)
- - Remove empty inline tags
 
 Wontfix
  - Non-lossy smart alternate character encoding transformations (unless