NormalizeMode
- class NormalizeMode
Defines how a Unicode string is transformed in a canonical form, standardizing such issues as whether a character with an accent is represented as a base character and combining accent or as a single precomposed character. Unicode strings should generally be normalized before comparing them.
Fields
- class NormalizeMode
- ALL
Beyond
DEFAULT
also standardize the “compatibility” characters in Unicode, such as SUPERSCRIPT THREE to the standard forms (in this case DIGIT THREE). Formatting information may be lost but for most text operations such characters should be considered the same
- DEFAULT
Standardize differences that do not affect the text content, such as the above-mentioned accent representation
- NFC
Another name for
DEFAULT_COMPOSE
- NFKC
Another name for
ALL_COMPOSE