Warning: Missing charsets in String to FontSet conversion

Nix nix
Sat Oct 21 18:55:54 EDT 2006


On Sat, 21 Oct 2006, stephen at xemacs.org noted:
> The other bug is either in gcc or in a wrapper supplied by Ubuntu; it
> is using non-ASCII characters (specifically, typographically correct
> balanced single quotes) in the output from the compiler.

This is an upstream change: GCC has done this since GCC 4.0 (and it was
documented in the release notes specifically because it might break
scripts).

>                                                           IMHO this is
> unwise at best.

It is conditionalized off the locale. GCC makes the (reasonable)
assumption that if you're in a UTF-8 locale, your tools, display
devices, and so on can handle UTF-8.

Wrapping GCC up in a script that sets LC_ALL to C or something like that
would fix it. Mind you, if Manuel's Emacs can't handle UTF-8
(non-MULE?), he should probably change LANG or LC_ALL before starting
it.

> If you are using XEmacs 21.5.6 or later, then you can probably put
> `(setq default-process-coding-system '(utf-8 . raw-text))' in your
> init file and get pretty output.  However, you will almost surely lose
> some or all of the facilities that use functions that parse the error
> output, as they depend on matching the ASCII backtick and apostrophe
> characters instead of the "beautiful" Unicode left and right single
> quote characters.  This will take a very long time to fully fix, as we
> can't just do a global replace of "`" with "[`?]".---we need to find
> out which occurances need to be changed, and which should not.

Almost everything GCC can emit should be quoted with Unicode quotes
where possible: everything using the core pretty-printing infrastructure
in gcc/pretty-print.c, and everything else Joseph Myers could find.

There are a few places in fixed strings where `' are still used, but
those are rare, and I can't imagine that much parsing code will
want to pick things out of the middle of *fixed* strings like that
anyway. (Variable components, language types, et seq, are all
users of pretty-print.c, and so all use Unicode quoting.)

-- 
`When we are born we have plenty of Hydrogen but as we age our
 Hydrogen pool becomes depleted.'



More information about the XEmacs-Beta mailing list