Path: utzoo!utgpu!news-server.csri.toronto.edu!cs.utexas.edu!uunet!hoptoad!gnu
From: gnu@hoptoad.uucp (John Gilmore)
Newsgroups: comp.std.c
Subject: Re: wchar_t values
Message-ID: <16968@hoptoad.uucp>
Date: 9 Apr 91 23:43:58 GMT
References: <keld.670436534@dkuugin> <15651@smoke.brl.mil> <661@taumet.com>
Organization: Cygnus Support, Palo Alto
Lines: 61

peter@ficc.ferranti.com (Peter da Silva) writes:
>Why can't the Danes define a character set, and preprocess it into trigraphs
>for compilation? There's no reason the editing character set needs to match
>the character set the compiler sees.

They don't even need to preprocess it, they can just use a compiler that
handles that input character encoding.  Nothing in the standard prohibits this!
If they use characters outside the standard C character set, their code
won't be portable to other character encodings, but what else is new?

steve@taumet.com (Stephen Clamage) wrote:
> This is the kind of attitude which annoys those in the world whose native
> language is not English...

Americans are not boors or uncivilized because they invent programming
languages that use the entire ASCII character set.  Not guilty!

> One member's name contains an umlaut (two horizontal dots above a vowel).
> He asked us to imagine how it feels NEVER to be able to see your name
> spelled correctly in any computer correspondence.

Why doesn't he get a better computer?  Surely some local company sells
machines that include umlauted letters.  I correspond with a Swede
whose name is Torbj|rn; it looks funny in the States (a vertical bar in
the middle of the name), but should look fine on his computer in
Sweden.

And my windows all use ISO Latin 1.  If Torbj|rn would send the
umlauted letter in that standardized character set, it would look right
in both the States and in Sweden.

> Another member asked how we would feel if, for example, the letters
> 'l' and 'r' would always be considered equivalent, and the letter 'f'
> was forbidden.

Why don't we stick to discussions of the C language rather than generic
character set guilt trips?  So far, NOBODY has proposed a way to change
ANSI/ISO C so that the full local character set could be used in
identifier names in portable programs.  So any language that adds some
alphabetics beyond ASCII's is going to have some words that just can't
appear in portable C programs.  Note I'm saying PORTABLE; on your
local compiler you can do what you want.

> So let's turn Peter's question around:  Why can't the Americans use
> a preprocessor to convert ASCII source into some international
> character set before compiling?

Americans *can* use a preprocessor to convert ASCII source into some
international character set before compiling.  What's the point?

Compiler vendors are free to choose whatever input character set and
encoding they want to implement -- including ASCII, Danish local
characters, JIS, ISO Latin 1, or others, as long as it contains the
required character set specified in the C standard.  All of these do.
What is your complaint?
-- 
John Gilmore   {sun,uunet,pyramid}!hoptoad!gnu   gnu@toad.com   gnu@cygnus.com
*  Truth :  the most deadly weapon ever discovered by humanity. Capable of   *
*  destroying entire perceptual sets, cultures, and realities. Outlawed by   *
*  all governments everywhere. Possession is normally punishable by death.   *
*      ..{amdahl|decwrl|octopus|pyramid|ucbvax}!avsd!childers@tycho          *