This function converts input strings to ASCII, i.e., to character strings consisting of bytes not greater than 127.
Details
All code points greater than 127 are replaced with the ASCII SUBSTITUTE
CHARACTER (0x1A).
R encoding declarations are always used to determine
which encoding is assumed for each input, see stri_enc_mark.
If ill-formed byte sequences are found in UTF-8 byte
streams, a warning is generated.
A bytes-marked string is assumed to be in an 8-bit encoding
extending the ASCII map (a common assumption in R itself).
Note that the SUBSTITUTE CHARACTER (\x1a == \032) may be interpreted
as the ASCII missing value for single characters.
See also
The official online manual of stringi at https://stringi.gagolewski.com/
Gagolewski M., stringi: Fast and portable character string processing in R, Journal of Statistical Software 103(2), 2022, 1-59, doi:10.18637/jss.v103.i02
Other encoding_conversion:
about_encoding,
stri_enc_fromutf32(),
stri_enc_tonative(),
stri_enc_toutf32(),
stri_enc_toutf8(),
stri_encode()
Author
Marek Gagolewski and other contributors