Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP Path: utzoo!utgpu!water!watmath!clyde!rutgers!seismo!mcvax!hafro!askja!frisk From: frisk@askja.UUCP Newsgroups: comp.std.internat Subject: Re: Character representation Message-ID: <274@askja.UUCP> Date: Tue, 18-Aug-87 05:57:04 EDT Article-I.D.: askja.274 Posted: Tue Aug 18 05:57:04 1987 Date-Received: Thu, 20-Aug-87 01:27:58 EDT References: <2171@enea.UUCP> <176@hobbes.UUCP> Organization: University of Iceland (RHI) Lines: 34 Summary: It ain't so easy In article <176@hobbes.UUCP> root@hobbes.UUCP (John Plocher) writes: >I will be posting (as soon as I finish this note) a routine which we use >called stracmp() to the newsgroup comp.sources.misc. It compares two >strings of 8 bit characters while taking into account the correct collating >sequence and precedence (if any) of accented letters. The routine is >designed to drop in in place of the common strcmp(). Now - the problem with this is that there is no "correct" collating sequence for all languages. One example is the position of character 197 in Latin-1 (A with a circle above). In some languages it is the first letter in the alphabet, in other one of the last. The method described in this article would work partially with Icelandic, but not quite. To see why, consider the Icelandic alphabet. A 'A B D (ETH) E 'E F G H I 'I J K L M N O 'O P R S T U 'U V X Y 'Y (THORN) (AE) (o with two dots above) Ot these 32 characters, the last two would end up in the wrong places. What is needed is either: a strcpy(string1,string2,LANGUAGE) function or a strcpy_LANGUAGE(string1,string2) -- Fridrik Skulason Univ. of Iceland, Computing Center UUCP ...mcvax!hafro!askja!frisk BIX frisk "This line intentionally left blank"