At 22:01 07/11/18, Wolfgang N˘Ždasi-Donner wrote: >Hi! > >I recognized a problem when using special character modifying sequences >(\M-x and \C-x) in utf-8 encoded strings... Good catch! >..., because it produces the leading byte 0x82 for an utf-8 encoding >sequence, which is invalid. The resulting String is an ill-formed >utf-8 sequence. > >My proposal is either not to allow these special modifiers for other >encodings than Ascii (which might be complicated), Strictly speaking, for binary or ASCII+binary. >or to enforce >Ascii encoding for Strings, that contain these special modifiers. Another alternative, in the case of UTF-8 (but not for Shift_JIS, EUC-JP, and so on) is to interpret these as codepoints and transcode to UTF-8. Regards, Martin. #-#-# Martin J. Du"rst, Assoc. Professor, Aoyama Gakuin University #-#-# http://www.sw.it.aoyama.ac.jp mailto:duerst / it.aoyama.ac.jp