> Question raised by this: Can OCPs output control sequences, or do they just > produce characters? They can output any token (IIRC there is a bug when \input is used, but I'm not sure). In fact, tokens are necessary when translating Unicode to, say, OT1. >Can one specify what catcode the characters should have? Unfortunately not. The catcodes used are the catcodes when the replacement is done. That means that "private" names containing @ cannot be used (in general, or if \csname is used). > Another question regarding OCPs: Is the OCP model general enough to support > a reasonable size OCP that translates UTF-8 to 32-bit Unicode, or is it > necessary to implement the entire translation as a gigantic (2^31 entries) > table? They can, because OCPs can do calculations. The utf8 OCP takes about ten lines of code. The uppercase OCP is also very short because most of uppercase variants follow some rule in the Unicode Standard (either -1 or -32). Javier ______________________________________________________________________________ Consigue tu cuenta de correo universal y gratuita en http://webmail.wanadoo.es