Skip to content

Commit 8b45c9c

Browse files
string docs: clarify handling of overlong and too-high sequences
1 parent 3b6773d commit 8b45c9c

File tree

1 file changed

+3
-2
lines changed

1 file changed

+3
-2
lines changed

doc/src/manual/strings.md

+3-2
Original file line numberDiff line numberDiff line change
@@ -384,8 +384,9 @@ a sequence of code units from left to right characters are formed by the longest
384384
* `10xxxxxx`;
385385
* `11111xxx`.
386386

387-
In particular this implies that overlong and too high code unit sequences are accepted.
388-
This rule is best explained by an example:
387+
In particular this means that overlong and too-high code unit sequences and prefixes thereof are treated
388+
as a single invalid character rather than multiple invalid characters.
389+
This rule may be best explained with an example:
389390

390391
```julia-repl
391392
julia> s = "\xc0\xa0\xe2\x88\xe2|"

0 commit comments

Comments
 (0)