TAKAHASHI Masayoshi wrote: > It's the reason why Mojikyo is not (should not) unified in > Unicode, I think. That sounds fair, but computers still need to process such symbols. IMO the ISO 10646 folk should be approached to allocate a 24-bit block inside the UCS-4 encoding, but outside the Unicode space. That way the UCS-4 Mojikyo characters can be encoded using either the 4/5/6 byte extension of UTF-8, or using the UTF-8 style of encoding with 1/2/3/4 bytes (i.e. with an assumed UCS-4 top byte that isn't zero, as with Unicode). I agree with Dan's comments, and think this would be the best way to resolve the issue. -- Clifford Heath