Skip to content

Commit

Permalink
Clarify text parsing behaviour in docs
Browse files Browse the repository at this point in the history
  • Loading branch information
ullenius committed Apr 20, 2024
1 parent 7707878 commit a0c8c15
Showing 1 changed file with 4 additions and 4 deletions.
8 changes: 4 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -74,13 +74,13 @@ can't compile stuff on their platform.
## Character encodings
Supported character encodings are:

* utf-8
* UTF-8
* latin-1

The program validates utf-8 tags, invalid tags are parsed as latin-1. This may
result in mojibake.
The program auto-detect character encodings, if text is not valid UTF-8 then
latin-1 is used. This may result in mojibake.

Noncharacters in utf-8 are disallowed for security reasons.
Noncharacters in UTF-8 are disallowed for security reasons.

* [Which code points are noncharacters?](https://www.unicode.org/faq/private_use.html#noncharacters)

Expand Down

0 comments on commit a0c8c15

Please sign in to comment.