John Joyce wrote: > And yes, the overhead will be greater, but that's just a fact of > unicode and large character sets like chinese and japanese. > You will also want to check which chinese! > Chinese is split into two (politically safe) names : Traditional and > Simpllified. > If you were doing Japanese text, separating English or other western > languages wouldn't be so easy, since Japanese essentially includes a > number of other languages' character sets in its unicode set and in > everyday usage. You are right. And let alone the characters, there is a different set of punctuations! So, you don't think there is a doc about the number range string[0] return with a specified language? I wonder what those number mean... -- Posted via http://www.ruby-forum.com/.