LISTSERV - LATEX-L Archives - LISTSERV.UNI-HEIDELBERG.DE

LATEX-L Archives

Mailing list for the LaTeX3 project

LATEX-L@LISTSERV.UNI-HEIDELBERG.DE

	LISTSERV Archives
	LATEX-L Home

	Log In
	Register

	Subscribe or Unsubscribe

	Search Archives

Options:	Use Classic View Use Monospaced Font Show Text Part by Default Condense Mail Headers
Topic:	[<< First] [< Prev] [Next >] [Last >>]

Sender: Mailing list for the LaTeX3 project <[log in to unmask]>

Subject: Re: default inputenc/fontenc tight to language

From: "William F. Hammond" <[log in to unmask]>

Date: Tue, 6 Feb 2001 11:09:10 -0500

Reply-To: Mailing list for the LaTeX3 project <[log in to unmask]>

Parts/Attachments: text/plain (30 lines)

Just out of curiosity, I'm wondering what those here think about
unicode and, in particular:

1.  Is its concept of character -- basically unsigned 32 bit
    integer -- durable for, say, the next 100 years?

    (As I read the discussion here, I think not.)

2.  Do we think that 2^32 is a wise upper bound?

    (This question vanishes if we think that representing
    characters as integers, rather than as more complicated data
    structures, is inadequate.)

Unicode is directly relevant to the future of LaTeX to the extent that
LaTeX is going to be robust for formatting XML document types because
normal document content can consist of arbitary sequences of unicode
characters.  XML systems are designed to make decisions only where
markup occurs.  It is reasonable for an XML processor writing in a
typesetting language to know the markup ancestry of a character, e.g.,
whether it is within a math zone, but not reasonable -- unless the
processor, like David Carlisle's xmltex, is a TeX thing -- for it to
know that a particular character must have \ensuremath applied.

I note that in GNU Emacs these days characters can have property lists.

Thanks for your thoughts.

                                    -- Bill

ATOM RSS1 RSS2

LISTSERV.UNI-HEIDELBERG.DE
Universität Heidelberg \| Impressum \| Datenschutzerklärung