Curt Sampson wrote:
> Remember, Unicode uses 16-bit code values,

No. Unicode uses UCS-4 characters, 32 bits. It also provides UCS-2,
which has surrogates, which don't allow easy extension to encoding
all UCS-4 characters. However that's not a good argument why programs
should deal with characters as anything less than 32-bit. UCS-2 has
always been a broken encoding and should be avoided, but UTF-8 
resolves the issue (up to 31 bits anyway).

--
Clifford Heath