On 16/09/2008, Tanaka Akira <akr / fsij.org> wrote:
> In article <6.0.0.20.2.20080916184943.08a281f0 / localhost>,
>
>   Martin Duerst <duerst / it.aoyama.ac.jp> writes:
>
>  >>> So ruby should treat it as a character.
>  >
>  > I don't think #each_character should do that, although
>  > its name may suggest so. What may happen (maybe for Ruby 2.0)
>  > is that we have a parameter to #each_character which, if present,
>  > leads to lumping the above two characters together.
>
>
> Unless doing that, single character in JIS X 0213 is mapped
>  to two characters in Unicode.
>
>  It is not desired result.

Can I ask what character(s) that would be?

I thought that every Kanji and Kana character has its codepoint in
either encoding so there is never any reason to split or join
characters while recoding into a different encoding.

Thanks

Michal