Brian Candler wrote: > Ryan Smith wrote: >>> The first is not valid UTF-8. I suppose it might be UTF-16: U+A3A4 or >>> U+A4A3 depending on little or big-endian. Or it could be some older >>> proprietary Asian encoding. >> >> [Ryan] How to correct this (to UTF-8), it is a English XP Pro with PRC >> as system locale. > > Sorry, I have no idea. Are you sure that \xa3\xa4 correponds exactly to > that one character? Is the rest of the encoding variable length or fixed > length? (e.g. are all characters two bytes long, even a western letter > "A"?) > > Questions about Microsoft operating systems and what encodings they use > really belong in a Microsoft users' forum, as it's not anything to do > with Ruby. I have no idea either, but I will upgrade to ruby 1.9 to leverage string.encoding feature. thank you. -- Posted via http://www.ruby-forum.com/.