unicode internal

Stephen J. Turnbull stephen at xemacs.org
Thu Oct 6 03:10:21 EDT 2005


>>>>> "Ben" == Ben Wing <ben at xemacs.org> writes:

    Ben> we *don't* simply want to use some "charset" tag, because
    Ben> then we make artificial distinctions between iso-2022-jp,
    Ben> shift-jis and euc-jp, for example.

We'll have to see if charset -> language heuristic mapping is
sufficient, but if users/standards force charset -> font mapping on
us, so be it.

    Ben> unicode also has the possibility of embedding a language tag
    Ben> in it (forced down the unicode consortium's throat by the
    Ben> japanese), but it is highly dispreferred.

People find it distasteful, but there is no standard alternative
except ISO 2022 ... which is precisely what we're trying to escape
from.  I don't see any difference in principle from the BIDI markers,
which nobody seems to object to.


-- 
School of Systems and Information Engineering http://turnbull.sk.tsukuba.ac.jp
University of Tsukuba                    Tennodai 1-1-1 Tsukuba 305-8573 JAPAN
               Ask not how you can "do" free software business;
              ask what your business can "do for" free software.




More information about the XEmacs-Beta mailing list