On Sep 25, 2007, at 18:55, F. Senault wrote: >> do u know of a way to replace diacritics by simple character (ie. : é >> -o-> e) >> >> the same with ligatures (ie. : -o-> AE ) >> >> using tables ? > > IConv can do that for you : An alternative approach is something like Sean M. Burke's Text::Unidecode: http://interglacial.com/~sburke/tpj/as_html/tpj22.html http://search.cpan.org/~sburke/Text-Unidecode-0.04/lib/Text/Unidecode.pm Here is an example of an implementation of Unidecode in Lua [1]: local Unidecode = require( 'Unidecode' ) print( Unidecode( '§®§à§ã§Ü§Ó§Ñ' ) ) print( Unidecode( 'Ë̵þ' ) ) print( Unidecode( 'â¾°Ð¥¯¥Û¥¥Û¥¹âÀ¥«' ) ) print( Unidecode( '' ) ) print( Unidecode( 'Åìµþ' ) ) print( Unidecode( 'µþÅÔ»Ô' ) ) print( Unidecode( '' ) ) print( Unidecode( '-' ) ) print( Unidecode( ' ' ) ) print( Unidecode( '' ) ) print( Unidecode( 'Géometrie Différentielle' ) ) > Moskva > beijing > Athena > seoul > dongjing > jingdushi > nepaal > te'labiyb-yapvo > tal 'abiyb yaafaa > thran > Geometrie Differentielle Cheers, PA. [1] http://dev.alt.textdrive.com/browser/HTTP/Unidecode.lua