On Thu, Nov 25, 2010 at 3:48 PM, Philip Rhoades <phil / pricom.com.au> wrote: > James, > > > On 2010-11-26 00:55, James Edward Gray II wrote: >> >> On Nov 25, 2010, at 6:37 AM, Phillip Gawlowski wrote: >> >>> Thus, UTF-8 is a subset of UTF-16 is a subset of UTF-16. Thus, >>> also, the future-proofing, in case even more glyphs are needed. >> >> You are confusing us. >> >> UTF-8, UTF-16, and UTF-32 are encodings of Unicode code points. ¨Âèå>> are all capable of representing all code points. ¨Âïôèéîç éî ôèéó >> discussion is a subset of anything else. > > > This is all really interesting but I don't understand what you mean by "code > points" - is what you have said expressed diagrammatically somewhere? A "code point" is basically a unique identifier of a special symbol. http://en.wikipedia.org/wiki/Unicode http://www.unicode.org/ http://www.unicode.org/charts/About.html http://www.unicode.org/charts/ HTH robert -- remember.guy do |as, often| as.you_can - without end http://blog.rubybestpractices.com/