> Question raised by this: Can OCPs output control sequences, or do they just
> produce characters?
They can output any token (IIRC there is a bug
when \input is used, but I'm not sure). In fact,
tokens are necessary when translating Unicode to,
>Can one specify what catcode the characters should have?
Unfortunately not. The catcodes used are the
catcodes when the replacement is done. That means
that "private" names containing @ cannot be
used (in general, or if \csname is used).
> Another question regarding OCPs: Is the OCP model general enough to support
> a reasonable size OCP that translates UTF-8 to 32-bit Unicode, or is it
> necessary to implement the entire translation as a gigantic (2^31 entries)
They can, because OCPs can do
calculations. The utf8 OCP takes about ten lines
of code. The uppercase OCP is also very short
because most of uppercase variants follow some
rule in the Unicode Standard (either -1 or -32).
Consigue tu cuenta de correo universal y gratuita en http://webmail.wanadoo.es