Paul Battley wrote: > On 25/05/06, Wes Gamble <weyus / att.net> wrote: >> > I don't understand how to make my text, which now contains UTF-8 >> > characters, display correctly in say, Notepad. All of the entities are >> > preceded by the character A-circumflex. My guess is that Notepad >> > doesn't know how to handle UTF-8, for example. > > Windows Notepad does handle UTF-8, but requires the presence of a BOM > (the three bytes "\xef\xbb\xbf") at the start of the file to read it > properly. Other, more competent applications may allow you to select > the appropriate encoding, or may even automatically detect it. > >> OK I have found the iconv library, however, I am still having trouble. >> >> What is the default text encoding for Ruby? I assume it's gotten from >> the OS, right? So if I'm on Windows XP in the US, it's probably >> ISO-8859-1? > > Windows XP uses UTF-16 internally, I believe, but retains the concept > of a legacy code page to allow non-Unicode-aware applications to run. > English Windows uses Windows-1252 (mostly, but not completely the same > as ISO-8859-1). Ruby on Windows uses the legacy code page to > communicate with the operating system, so things like file names will > be in Windows-1252. > > Paul. I don't really care about Notepad. I can't get VIM to show this text correctly. I want to be able to convert (what I believe to be) UTF-8 into Windows-1252 succesfully. Wes -- Posted via http://www.ruby-forum.com/.