Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP
Path: utzoo!utgpu!water!watmath!clyde!rutgers!seismo!mcvax!hafro!askja!frisk
From: frisk@askja.UUCP
Newsgroups: comp.std.internat
Subject: Re: Character representation
Message-ID: <274@askja.UUCP>
Date: Tue, 18-Aug-87 05:57:04 EDT
Article-I.D.: askja.274
Posted: Tue Aug 18 05:57:04 1987
Date-Received: Thu, 20-Aug-87 01:27:58 EDT
References: <2171@enea.UUCP> <176@hobbes.UUCP>
Organization: University of Iceland (RHI)
Lines: 34
Summary: It ain't so easy

In article <176@hobbes.UUCP> root@hobbes.UUCP (John Plocher) writes:
>I will be posting (as soon as I finish this note) a routine which we use
>called stracmp() to the newsgroup comp.sources.misc.  It compares two
>strings of 8 bit characters while taking into account the correct collating
>sequence and precedence (if any) of accented letters.  The routine is
>designed to drop in in place of the common strcmp().

Now - the problem with this is that there is no "correct" collating
sequence for all languages. One example is the position of character 197
in Latin-1 (A with a circle above). In some languages it is the first
letter of the alphabet, in others one of the last.

The method described in this article would work partially with Icelandic,
but not quite. To see why, consider the Icelandic alphabet.

A  'A  B  D  (ETH)  E  'E  F  G  H  I  'I  J  K  L  M  N  O  'O  P  R  S  T
U  'U  V  X  Y  'Y (THORN) (AE) (o with two dots above)
  
Of these 32 characters, the last two would end up in the wrong places.

What is needed is either:

    a strcmp(string1,string2,LANGUAGE) function

or

    a strcmp_LANGUAGE(string1,string2)
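
To make the first option concrete, here is a rough sketch in C, assuming the
strings are Latin-1. The names strcmp_lang, collate_lang, LANG_BYTE_ORDER and
LANG_ICELANDIC are invented for this example and do not come from stracmp()
or any existing library. Letters outside the Icelandic alphabet (C, Q, W, Z)
are simply ranked by byte value here, which a real routine would have to
handle properly.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical language tags, invented for this sketch. */
enum collate_lang { LANG_BYTE_ORDER, LANG_ICELANDIC };

/* The 32 Icelandic letters in alphabetical order, as uppercase Latin-1
 * bytes: A 'A B D ETH E 'E F G H I 'I J K L M N O 'O P R S T
 *        U 'U V X Y 'Y THORN AE O-with-two-dots */
static const unsigned char icelandic_upper[] = {
    'A', 0xC1, 'B', 'D', 0xD0, 'E', 0xC9, 'F', 'G', 'H', 'I', 0xCD,
    'J', 'K', 'L', 'M', 'N', 'O', 0xD3, 'P', 'R', 'S', 'T',
    'U', 0xDA, 'V', 'X', 'Y', 0xDD, 0xDE, 0xC6, 0xD6
};

/* Map one Latin-1 byte to its sort weight under the Icelandic table. */
static int icelandic_weight(unsigned char c)
{
    size_t i;

    /* Fold lowercase to uppercase. In Latin-1, a-z and 0xE0-0xFE
     * (except 0xF7, the division sign) sit 0x20 above the uppercase. */
    if ((c >= 'a' && c <= 'z') || (c >= 0xE0 && c <= 0xFE && c != 0xF7))
        c -= 0x20;

    for (i = 0; i < sizeof icelandic_upper; i++)
        if (icelandic_upper[i] == c)
            return 256 + (int)i;    /* table letters rank above other bytes */

    return c;   /* simplification: everything else by byte value */
}

/* Compare like strcmp(), but in the order of the given language. */
int strcmp_lang(const char *s1, const char *s2, enum collate_lang lang)
{
    const unsigned char *a = (const unsigned char *)s1;
    const unsigned char *b = (const unsigned char *)s2;

    if (lang == LANG_ICELANDIC) {
        for (; *a && *b; a++, b++) {
            int wa = icelandic_weight(*a);
            int wb = icelandic_weight(*b);
            if (wa != wb)
                return wa < wb ? -1 : 1;
        }
        if (*a != *b)               /* one string is a prefix of the other */
            return *a ? 1 : -1;
    }
    /* Byte order, also used to break ties between strings that differ
     * only in case. */
    return strcmp(s1, s2);
}
```

With this table, strcmp_lang("\xD6", "\xDE", LANG_ICELANDIC) puts O with two
dots after THORN, while plain byte order puts it before. Supporting another
language would mean adding another table, which is the whole point: the
ordering has to be chosen per language, not fixed once for everybody.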
 

-- 
Fridrik Skulason  Univ. of Iceland, Computing Center
       UUCP  ...mcvax!hafro!askja!frisk                BIX  frisk

                     "This line intentionally left blank"