At 22:01 07/11/18, Wolfgang Nádasi-Donner wrote:
>Hi!
>
>I noticed a problem when using the special character-modifying escape
>sequences (\M-x and \C-x) in UTF-8 encoded strings...

Good catch!


>..., because it produces the leading byte 0x82 for a UTF-8 encoding
>sequence, which is invalid. The resulting String is an ill-formed
>UTF-8 sequence.
>
>My proposal is either not to allow these special modifiers for
>encodings other than ASCII (which might be complicated),

Strictly speaking, they should be allowed only for binary or
ASCII+binary strings.

>or to enforce
>ASCII encoding for Strings that contain these special modifiers.

Another alternative, in the case of UTF-8 (but not for Shift_JIS,
EUC-JP, and so on) is to interpret these as codepoints and transcode
to UTF-8.
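
As a rough sketch of what that alternative could look like (this is only
an illustration, not what Ruby currently does; the helper name
meta_bytes_to_utf8 and the sample literal "\M-a" are made up for the
example, and the string is assumed to contain only bytes produced by
such escapes):

```ruby
# Hypothetical helper, not part of Ruby: treat every byte of the string
# as a Unicode code point (U+0000..U+00FF) and encode it as UTF-8.
# Assumes the string holds only bytes produced by \M-x / \C-x escapes;
# genuine multi-byte UTF-8 characters would be mangled by this.
def meta_bytes_to_utf8(str)
  str.bytes.pack("U*")
end

# "\M-a" sets the high bit of "a" (0x61 | 0x80), giving the byte 0xE1.
raw = "\M-a".dup.force_encoding(Encoding::UTF_8)

p raw.bytes            # => [225]
p raw.valid_encoding?  # => false (0xE1 is a lead byte with no trail bytes)

fixed = meta_bytes_to_utf8(raw)

p fixed.bytes            # => [195, 161], i.e. U+00E1 encoded as UTF-8
p fixed.valid_encoding?  # => true
```

This works for UTF-8 only because the code points U+0080..U+00FF have a
natural meaning there; no such mapping exists for Shift_JIS or EUC-JP.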

Regards,    Martin.



#-#-#  Martin J. Du"rst, Assoc. Professor, Aoyama Gakuin University
#-#-#  http://www.sw.it.aoyama.ac.jp       mailto:duerst / it.aoyama.ac.jp