GHC.IO.Encoding.Failure
Types for specifying how text encoding/decoding fails
Documentation
data CodingFailureMode Source #
The CodingFailureMode
is used to construct TextEncoding
s,
and specifies how they handle illegal sequences.
Constructors
ErrorOnCodingFailure | Throw an error when an illegal sequence is encountered |
IgnoreCodingFailure | Attempt to ignore and recover if an illegal sequence is encountered |
TransliterateCodingFailure | Replace with the closest visual match upon an illegal sequence |
RoundtripFailure | Use the private-use escape mechanism to attempt to allow illegal sequences to be roundtripped. |
Instances
Show CodingFailureMode Source # | Since: base-4.4.0.0 |
Defined in GHC.IO.Encoding.Failure |
isSurrogate :: Char -> Bool Source #
Some characters are actually "surrogate" codepoints defined for
use in UTF-16. We need to signal an invalid character if we detect
them when encoding a sequence of Char
s into Word8
s because they
won't give valid Unicode.
We may also need to signal an invalid character if we detect them
when encoding a sequence of Char
s into Word8
s because the
RoundtripFailure
mode creates these to round-trip bytes through
our internal UTF-16 encoding.