On Sep 17, 2008, at 10:10 PM, Yukihiro Matsumoto wrote: > OK, now Ruby 1.9 has String#each_codepoint and understands \p{Lu} for > regular expression. I hope all Unicode whiners would complain no > longer. The community of Unicode whiners says "thank you" to the community of Ruby implementors. I'll grab this code and see how it works for XML parsing. -Tim