On Thu, Nov 25, 2010 at 3:48 PM, Philip Rhoades <phil / pricom.com.au> wrote:
> James,
>
>
> On 2010-11-26 00:55, James Edward Gray II wrote:
>>
>> On Nov 25, 2010, at 6:37 AM, Phillip Gawlowski wrote:
>>
>>> Thus, UTF-8 is a subset of UTF-16 is a subset of UTF-16. Thus,
>>> also, the future-proofing, in case even more glyphs are needed.
>>
>> You are confusing us.
>>
>> UTF-8, UTF-16, and UTF-32 are encodings of Unicode code points.  >> are all capable of representing all code points.    
>> discussion is a subset of anything else.
>
>
> This is all really interesting but I don't understand what you mean by "code
> points" - is what you have said expressed diagrammatically somewhere?

A "code point" is basically a unique identifier of a special symbol.

http://en.wikipedia.org/wiki/Unicode

http://www.unicode.org/
http://www.unicode.org/charts/About.html
http://www.unicode.org/charts/

HTH

robert

-- 
remember.guy do |as, often| as.you_can - without end
http://blog.rubybestpractices.com/