I beg your pardon, you have all the wonderful standards at your fingertips. What is meant by "representing" a character? How are glyphs mapped to code points? Is that a many to many mapping? What are the attributes of a glyph? What are the attributes of a code point? Outside of natural language text processing, are there areas where the parsing of non-Latin-1 strings is relevant? If so, what are they? Please help my ignorance Jan - Ecclesiastes 1:9The thing that hath been, it is that which shall be; and that which is done is that which shall be done: and there is no new thing under the sun. The King James Version (Authorized) __________________________________________________ Do You Yahoo!? Yahoo! Health - Feel better, live better http://health.yahoo.com