Sync with the latest html5lib. Having the Maruku unit tests on-hand may be useful for debugging; so let's include them.
Add some tests. Sync with latest HTML5lib (includes above sanitization improvements).