Efficiency: Entity handling

Previously, used a regexp to find and convert named entities in the content. Now use a more efficient algorithm. Similar tweak for converting NCRs before checking whether text is valid utf-8.
2008-05-17 01:43:11 -05:00 · 2008-05-17 01:43:11 -05:00 · 41346bf8bd
commit 41346bf8bd
parent 5ca0760f7c
7 changed files with 50 additions and 29 deletions
--- a/test/unit/page_renderer_test.rb
+++ b/test/unit/page_renderer_test.rb
@ -344,6 +344,13 @@ class PageRendererTest < Test::Unit::TestCase
        "</ins> and lovely morning<ins class='diffins'> today</ins></span></p>", test_renderer(@page.revisions.last).display_diff
  end
  
+  def test_nowiki_sanitization
+    assert_markup_parsed_as('<p>This sentence contains <span>a &amp; b</span> ' +
+     '&lt;script&gt;alert("XSS!");&lt;/script&gt;. Do not touch!</p>',
+      'This sentence contains <nowiki><span>a & b</span> <script>alert("XSS!");' +
+      '</script></nowiki>. Do not touch!')
+  end
+  
  def test_link_to_file
    assert_markup_parsed_as( 
      "<p><span class='newWikiWord'>doc.pdf<a href='../file/doc.pdf'>?</a></span></p>",