Description
I’d like to merge everything from the validator-nu branch to master, then delete the branch.
And then as far as the HTML checker, the parser integration for it could be built from the master branch here rather from from the validator-nu branch it has been getting built from.
So, looking at https://github.com/validator/htmlparser/compare/validator-nu, there are 19 commits in the validator-nu branch which aren’t in master. None have any merge conflicts with master.
(The validator-nu branch did previously have 20 commits; the additional commit was 2bbdc4d, which removed all the code for the XOM API — but that commit’s no longer in the branch.)
Among those 19 commits, 15 are changes to the error-reporting behavior of the parser.
And among those 15, the largest set are changes necessary to make the error-reporting behavior conform to current requirements in the HTML spec; those 7 commits are the following:
e03e56e Require UTF-8
9ce4bd4 Conform ampersand-error reporting to HTML spec
29a2645 Make consecutive hyphens in comments a non-error
f738d97 Drop parse error for missing end tag
1cd256d Emit error (not warning) for HTML4/XHTML1 doctype
da6c1ec Report error always for Transitional doctype
1ae6c63 Support “generate all implied end tags thoroughly”
The second-largest set of those error-reporting-behavior commits are 5 bug fixes:
003ad33 Fix typo in error message about over-deep tree
ef15099 Fix "non-space characters insided a table" typo
de301bc Fix grammar problem in HTML parser error message
f4f266c Correct error for EOF in “in template” state
7528857 Report 1024 as byte limit for meta charset sniff
So I think all 12 of those commits listed above could be considered necessary changes.
The remaining 3 changes to the error-reporting behavior, although not strictly necessary, are nonetheless pretty clearly improvements:
143a10b Improve message: bad start tag in noscript in head
5a209fd Remove warning about comments before doctype
5c8fe7a Stop reporting HTML4-specific parse errors
Then, along with the 15 changes above to the error-reporting behavior, there’s 1 change to the core code that, from discussion in #10, it seems we already clearly know we want in master too:
1357528 Ensure every Locator is also a Locator2
And then the remaining 2 changes to the code are minor ones that we don’t necessarily need but that, since the work’s already been done, I guess we should go ahead and merge:
1c66c6f Ensure Java8-runnable code even if Java9-compiled
895f606 Add suppress "unchecked" for a TreeBuilder method
Lastly, there’s one commit that’s not a code change but that just checks support files into the repo:
c5d11f9 Add .classpath, .settings, .project, .mailmap
So while that last change isn’t strictly necessary, it’s still generally useful — or at least it us for anybody using Eclipse. And in fact, I guess the .settings
bits of it are particularly useful in that they can help contributors adhere to a consistent coding style in additions/updates to the source.