TAKAHASHI Masayoshi wrote:
> It's the reason why Mojikyo is not (should not) unified in
> Unicode, I think.

That sounds fair, but computers still need to process such symbols.
IMO the ISO 10646 folk should be approached to allocate a 24-bit
block inside the UCS-4 code space, but outside the Unicode range.
That way the Mojikyo characters can be encoded either with the
4/5/6-byte forms of UTF-8, or with a UTF-8-style encoding of
1/2/3/4 bytes (i.e. with an assumed UCS-4 top byte that is
non-zero, rather than zero as with Unicode).
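
To make the two options concrete, here is a rough sketch in Python.
It is only an illustration under assumptions of mine: the top byte
value 0x01 and the function names are hypothetical, and nothing here
has been allocated by ISO 10646. The first function is the original
1-to-6-byte UTF-8 scheme; the other two apply it to a full UCS-4
value and to the offset within the block. Note that a 4-byte
UTF-8-style sequence carries only 21 bits, so the short form in this
sketch rejects offsets of 0x200000 and above.

```python
# Hypothetical UCS-4 top byte for the proposed block (an assumption,
# not an allocated value).
MOJIKYO_TOP_BYTE = 0x01


def encode_utf8_style(value: int) -> bytes:
    """Encode a value of up to 31 bits with the 1-to-6-byte UTF-8 scheme."""
    if not 0 <= value <= 0x7FFFFFFF:
        raise ValueError("value must fit in 31 bits")
    if value < 0x80:
        return bytes([value])
    # (exclusive upper limit, lead-byte marker, continuation byte count)
    forms = (
        (0x800, 0xC0, 1),
        (0x10000, 0xE0, 2),
        (0x200000, 0xF0, 3),
        (0x4000000, 0xF8, 4),
        (0x80000000, 0xFC, 5),
    )
    for limit, lead, n_cont in forms:
        if value < limit:
            out = [lead | (value >> (6 * n_cont))]
            for i in range(n_cont - 1, -1, -1):
                out.append(0x80 | ((value >> (6 * i)) & 0x3F))
            return bytes(out)
    raise AssertionError("unreachable")


def encode_full(offset: int) -> bytes:
    """Option 1: encode the full UCS-4 value (top byte included)."""
    if not 0 <= offset <= 0xFFFFFF:
        raise ValueError("offset must fit in 24 bits")
    return encode_utf8_style((MOJIKYO_TOP_BYTE << 24) | offset)


def encode_short(offset: int) -> bytes:
    """Option 2: encode only the offset; the top byte is assumed."""
    if not 0 <= offset < 0x200000:
        raise ValueError("offset does not fit in 4 UTF-8-style bytes")
    return encode_utf8_style(offset)
```

With these assumptions, encode_full(0x1234) yields the five bytes
F9 80 81 88 B4, while encode_short(0x1234) yields E1 88 B4, which is
byte-for-byte what plain UTF-8 produces for U+1234. That overlap is
why the short form only works when the non-zero top byte is agreed
out of band.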

I agree with Dan's comments, and think this would be the best way
to resolve the issue.

--
Clifford Heath