Skip to content

Conversation

PeterCon
Copy link
Contributor

@PeterCon PeterCon commented Sep 7, 2024

The description of UTF-16 fails to distinguish between Unicode characters and UTF-16 code units. This oversight frequently leads to code bugs when handling text because developers end up treating individual surrogate code units as a complete character. Such bugs violate global readiness and internationalization best-practice guidelines.

This change adds a note explaining this distinction and points to another topic with more details.

The description of UTF-16 fails to distinguish between Unicode characters and UTF-16 code units. This oversight frequently leads to code bugs when handling text because developers end up treating individual surrogate code units as a complete character. Such bugs violate global readiness and internationalization best-practice guidelines.

This change adds a note explaining this distinction and points to another topic with more details.
Copy link
Contributor

@PeterCon : Thanks for your contribution! The author(s) have been notified to review your proposed change.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants