1
0
mirror of https://github.com/ezyang/htmlpurifier.git synced 2025-07-14 19:16:35 +02:00
Commit Graph

1694 Commits

Author SHA1 Message Date
609977f9f5 Add CDATA support to the Lexers, as well as give PEARSax3 entity replacement.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@106 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 23:04:34 +00:00
5ce0ae7056 Implement EntityLookup and put in the Lexer. Some behavior was migrated, since it looks like it will have to be used in all Lexers, not just DirectLex (which is the only one that uses it).
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@105 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 21:07:30 +00:00
7268987846 Replace the HTML entity files with the XHTML files, they're the same, but being valid XML declarations they should be easier to parse.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@104 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 19:43:19 +00:00
5a18dad64f Typo, space between comma and word.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@103 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 18:57:13 +00:00
bcc2b09ac7 Finish documenting PEARSax3, touch up the other docs. Nuke the original lexer.txt document.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@102 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 18:56:00 +00:00
48cf55eae4 Switch benchmark to relative percents: makes comparisons easier.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@101 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 18:18:11 +00:00
2fa1161d3d - Implemented special entity conversion.
- Optimized and documented DirectLex.
- Rearranged test cases.

git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@100 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 18:13:04 +00:00
dc24215874 Token.php: Document the code.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@99 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 13:20:15 +00:00
728848c4c7 Begin adding Doxygen documentation.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@98 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 03:43:53 +00:00
14f481bcf6 svn:eol-style = native
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@97 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 00:11:03 +00:00
39c16f5cfd svn:keywords = Id
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@96 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 00:06:34 +00:00
042eda0463 Hook new test file in and delete duplicated tests.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@95 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 00:03:11 +00:00
6a5ff6d2a9 Branch the LexerTest in order to partition DirectLex specific unit tests.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@94 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 00:01:11 +00:00
cf403eb257 Add Lexer test directory.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@93 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 00:00:36 +00:00
ca1aefe271 Commit various optimizations to the Lexer, and add stub file for profiling the lexer.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@92 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 22:48:07 +00:00
de5ab5e6a0 Kick the auto-prepend habit.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@91 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 19:01:59 +00:00
ac1e62e043 Optimize next*() functions in DirectLex, add test for offset. Update Lexer documents.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@90 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 18:55:34 +00:00
eac83995e1 Remove dud test file.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@89 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 16:04:22 +00:00
1ab3ae160a Move classes into Zend style setup.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@88 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 15:38:41 +00:00
758e70dd94 Add test HTMLPurifier folder.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@87 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 15:27:03 +00:00
067a38f988 Add HTMLPurifier folder.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@86 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 15:22:48 +00:00
5da6634d86 We're adopting Zend's library organization, so create a library folder.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@85 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 15:19:40 +00:00
9180d6256f Make PHP5 lexer default DOMLex.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@84 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 15:07:53 +00:00
5bcb3c60cd Update docs, add lexer.txt
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@83 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 14:57:12 +00:00
d22140b9a6 Fix benchmarks to work in PHP4 by excluding DOMLex.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@82 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 14:28:51 +00:00
f0d74a3bf4 Fix parse error in PHP4.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@81 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 14:00:52 +00:00
4bf3305dff Build another lexer from PHP5's DOM library. Extremely fast!
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@80 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 13:50:05 +00:00
3e982c7f2c Rename Lexer, separate files. Also augmented benchmarks and benchmarker,
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@79 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 12:53:04 +00:00
e52890276b Branch out the SAX impl.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@78 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 00:28:28 +00:00
49fea8f8e9 Begin seperating Lexer implementations by copying the Lexer file to the Lexer directory.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@77 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 00:26:09 +00:00
a77a720377 Add Lexer folder to separate Lexer implementations.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@76 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 00:22:39 +00:00
cafa24482d Setup include web. Unit tests should now be able to be run independently.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@75 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 00:13:08 +00:00
1d8bfebb3a Update todo (rm renaming and refactoring)
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@74 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 00:04:27 +00:00
025b648c99 Split out code in Definition.php .
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@73 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 00:02:48 +00:00
f00a80c561 (Test) Branch Definition to ChildDef in order to split out code in the file.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@72 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 00:01:38 +00:00
6d72d061a9 Branch Definition to AttrDef in order to split out code in the file.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@71 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 23:58:45 +00:00
fb4efc52bf Branch Definition to ChildDef in order to split out code in the file.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@70 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 23:58:12 +00:00
404ca68c87 Rename PureHTMLDefinition -> HTMLPurifier_Definition, and internal classes too.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@69 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 23:38:44 +00:00
436873e227 Move tester, complete HTML_Purifier -> HTMLPurifier move.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@68 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 23:27:00 +00:00
1836b73c63 HTML_Purifier to HTMLPurifier. May want to rethink schematics for this though.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@67 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 23:13:22 +00:00
ccebe77e70 Rename HTML_Generator to HTMLPurifier_Generator.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@66 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 23:10:30 +00:00
6a6afaccc5 Rename HTML_Lexer to HTMLPurifier_Lexer. However, some more refactoring still needs to be done (namely making the lexer an interface).
Also fixed broken benchmarks.

git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@65 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 23:07:47 +00:00
cf4776cfbd Add a todo list.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@64 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 11:31:43 +00:00
23dba8b55e Rename MarkupFragment.php to Token.php, change internal class names and rewire the classes. We also started adding more dependence on the Lexer and Generator in unrelated tests.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@63 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 11:27:54 +00:00
8bde230c99 Renamed remotely
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@62 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 02:16:36 +00:00
0ad09e5d94 Commit entity information from the HTML 4.01 spec. This should be parsed into corresponding mapping files, even though they will be versioned.
git-svn-id: http://htmlpurifier.org/svnroot/html_purifier/trunk@61 48356398-32a2-884e-a903-53898d9a118a
2006-07-20 22:47:26 +00:00
3427c6c079 Begin getting parsing of character data into shape, not done yet.
git-svn-id: http://htmlpurifier.org/svnroot/html_purifier/trunk@60 48356398-32a2-884e-a903-53898d9a118a
2006-07-20 02:43:06 +00:00
dadfa87acc Update documentation (esp. revamp status)
git-svn-id: http://htmlpurifier.org/svnroot/html_purifier/trunk@59 48356398-32a2-884e-a903-53898d9a118a
2006-07-20 00:40:04 +00:00
ff8f24458d Finish implementing fixNesting(). Removed security-in-depth check for optimization reasons, since the info array will never cause such a condition.
git-svn-id: http://htmlpurifier.org/svnroot/html_purifier/trunk@58 48356398-32a2-884e-a903-53898d9a118a
2006-07-20 00:30:35 +00:00
3e6bcb7a0f Implemented good behavior structure for fixNesting. Goes into infinite loop if bad stuff is passed.
Remove dud test and note which tests need to be added. Also, we're only running one test at a time to ease debugging.

git-svn-id: http://htmlpurifier.org/svnroot/html_purifier/trunk@57 48356398-32a2-884e-a903-53898d9a118a
2006-07-19 00:21:21 +00:00