Fix Unicode byte order mark documentation #1911

aaronfranke · 2024-09-11T06:38:13Z

The statement "Unicode plain text is a sequence of 16-bit code values" is incorrect. Unicode can be encoded in several encodings including UTF-8, UTF-16, and UTF-32. Unicode itself is a mapping of numbers to code points, many of which cannot fit into 16 bits.
The statement "Microsoft uses UTF-16, little endian byte order." is incorrect. Some legacy Microsoft products such as Visual Studio use Windows-1252 by default. Some legacy Microsoft products use the name "Unicode" to refer to UCS-2, which is similar to UTF-16 but is restricted to the Basic Multilingual Plane and is a fixed-width 16-bit encoding. Modern Microsoft products such as Visual Studio Code and .NET use UTF-8 by default, and over 98% of websites use UTF-8, so note that this is recommended for new applications.
The statement "which informs an application receiving the file that the file is byte-ordered" is nonsense. All bytes in a file are in some order, there is no such thing as a file with unordered bytes. The byte order mark is useful for UTF-16 and UTF-32 to indicate whether their byte order is little endian or big endian, not whether they are byte-ordered in general.

This PR attempts to fix these problems. If further tweaks are required to the text, let me know and I can update the PR.

prmerger-automator · 2024-09-11T06:38:24Z

@aaronfranke : Thanks for your contribution! The author(s) have been notified to review your proposed change.

Fix Unicode byte order mark documentation

98b95d6

prmerger-automator bot added the do-not-merge label Sep 11, 2024

prmerger-automator bot requested a review from Karl-Bridge-Microsoft September 11, 2024 06:38

prmerger-automator bot assigned Karl-Bridge-Microsoft Sep 11, 2024

prmerger-automator bot added Change sent to author desktop-app-ui/subsvc windows-api-desktop-tech/svc labels Sep 11, 2024

Karl-Bridge-Microsoft approved these changes Sep 26, 2024

View reviewed changes

Karl-Bridge-Microsoft merged commit fe5d2b6 into MicrosoftDocs:docs Sep 26, 2024
1 check passed

aaronfranke deleted the fix-bom-doc branch September 26, 2024 22:50

Provide feedback