Fix #19: lone surrogates should not be replaced by U+FFFD. #35

Merged
merged 1 commit into from
Aug 14, 2014

Conversation

gsnedders
Member

Fixes #19. See that bug for all the discussion thereof.

Lone surrogates can only reach the input stream when arbitrary Unicode
strings are passed into the parser (e.g., from script); no decoder
emits them nowadays. This change brings us in line with the current spec.
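
For readers who want to see the behavioral difference, here is a minimal sketch. It is not the code from this commit: `scan_surrogates` and `replace_surrogates` are hypothetical helpers written for illustration, and recording each surrogate's position as an error is an assumption about how a parser might report them. It relies on Python 3, where a `str` is a sequence of code points, so any surrogate found in one is unpaired.

```python
import re

# Matches any surrogate code point (U+D800..U+DFFF). The \u escapes are
# interpreted by the re module, so the source file itself stays pure ASCII.
SURROGATE_RE = re.compile(r"[\ud800-\udfff]")


def scan_surrogates(text):
    """Leave the text untouched and return it with a list of findings.

    Each finding is (index, "U+XXXX") for one surrogate code point.
    Hypothetical helper: illustrates passing surrogates through as-is.
    """
    errors = [
        (match.start(), "U+%04X" % ord(match.group()))
        for match in SURROGATE_RE.finditer(text)
    ]
    return text, errors


def replace_surrogates(text):
    """For contrast: substitute U+FFFD for every surrogate code point.

    Hypothetical helper: illustrates the behavior this change moves away from.
    """
    return SURROGATE_RE.sub("\ufffd", text)
```

With the input `"a\ud800b"`, `scan_surrogates` returns the string unchanged along with one finding at index 1, whereas `replace_surrogates` returns `"a\ufffdb"`.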
@hoppipolla-critic-bot

Critic review: https://critic.hoppipolla.co.uk/r/518

This is an external review system which you may optionally use for the code review of your pull request.

In order to help critic track your changes, please do not make in-place history rewrites (e.g. via git rebase -i or git commit --amend) when updating this pull request.

Ms2ger added a commit that referenced this pull request Aug 14, 2014
Fix #19: lone surrogates should not be replaced by U+FFFD; r=jgraham
Ms2ger merged commit e395ead into html5lib:master Aug 14, 2014
Development

Successfully merging this pull request may close these issues.

surrogate Unicode code points in input stream