Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP Path: utzoo!utgpu!utcsri!utegc!utai!musocs!mcgill-vision!iros1!fortin From: fortin@iros1.UUCP Newsgroups: can.francais,can.general Subject: Re: Accents et objectifs Message-ID: <251@Mannix.iros1.UUCP> Date: Thu, 13-Aug-87 23:24:17 EDT Article-I.D.: Mannix.251 Posted: Thu Aug 13 23:24:17 1987 Date-Received: Fri, 21-Aug-87 00:39:23 EDT References: <424@pembina.UUCP> <228@Mannix.iros1.UUCP> <232@Mannix.iros1.UUCP> <1202@cognos.UUCP> Sender: news@iros1.UUCP Reply-To: fortin@iros1.UUCP (Denis Fortin) Distribution: can Organization: Universite de Montreal Lines: 173 Keywords: ISO, Latin/1, accents Xref: utgpu can.francais:52 can.general:696 Summary: The ISO-Latin/1 8-bit code Ce message (un peu long) est forme a partir de deux messages ayant paru recemment dans le groupe comp.std.internat. Il decrit un code 8-bits qui contient les caracteres accentues utilises par plusieurs pays de l'Europe de l'ouest. Ce code, connu sous le nom d'ISO-Latin/1 est maintenant un standard international. (Enfin, lire le texte qui suit pour avoir plus de details!) (PS. Et pas d'EBCDIC!) Denis Fortin fortin@zap.UUCP fortin@iros1.UUCP From: sommar@enea.UUCP (Erland Sommarskog) (in Stockholm) * ISO-Latin/1 * The byte value is in the document represented by a notation xx/yy, where xx is the upper nibble (four bits), and yy is the lower nibble (in decimal). The lower part of the table, i.e. positions 02/00 to 07/14 is exactly the same as ASCII. The upper part of the table contains the characters we can't live without in large parts of the world. Since I do not know how to send pictures in a standardised way (is macpaint documents OK?), I here include a table from ISO No.1: (Note: This was the draft. See below for more info) 10/00 NO-BREAK SPACE 10/01 INVERTED EXCLAMATION MARK 10/02 CENT SIGN 10/03 POUND SIGN 10/04 CURRENCY SIGN 10/05 YEN SIGN 10/06 BROKEN BAR 10/07 PARAGRAPH SIGN, SECTION SIGN 10/08 DIAERESIS 10/09 COPYRIGHT SIGN 10/10 FEMININE ORDINAL INDICATOR 10/11 LEFT ANGLE QUOTATION MARK 10/12 NOT SIGN 10/13 SOFT HYPHEN 10/14 REGISTERED TRADE MARK SIGN 10/15 MACRON 11/00 DEGREE SIGN 11/01 PLUS-MINUS SIGN 11/02 SUPERSCRIPT TWO 11/03 SUPERSCRIPT THREE 11/04 ACUTE ACCENT 11/05 SMALL GREEK LETTER MU, MICRO SIGN 11/06 PILCROW SIGN 11/07 MIDDLE DOT 11/08 CEDILLA 11/09 SUPERSCRIPT ONE 11/10 MASCULINE ORDINAL INDICATOR 11/11 RIGHT ANGLE QUOTATION MARK 11/12 VULGAR FRACTION ONE QUARTER 11/13 VULGAR FRACTION ONE HALF 11/14 VULGAR FRACTION THREE QUARTERS 11/15 INVERTED QUESTION MARK 12/00 CAPITAL LETTER A WITH GRAVE ACCENT 12/01 CAPITAL LETTER A WITH ACUTE ACCENT 12/02 CAPITAL LETTER A WITH CIRCUMFLEX ACCENT 12/03 CAPITAL LETTER A WITH TILDE 12/04 CAPITAL LETTER A DIAERESIS 12/05 CAPITAL LETTER A WITH RING ABOVE 12/06 CAPITAL DIPHTHONG A WITH E 12/07 CAPITAL LETTER C WITH CEDILLA 12/08 CAPITAL LETTER E WITH GRAVE ACCENT 12/09 CAPITAL LETTER E WITH ACUTE ACCENT 12/10 CAPITAL LETTER E WITH CIRCUMFLEX ACCENT 12/11 CAPITAL LETTER E WITH DIAERESIS 12/12 CAPITAL LETTER I WITH GRAVE ACCENT 12/13 CAPITAL LETTER I WITH ACUTE ACCENT 12/14 CAPITAL LETTER I WITH CIRCUMFLEX ACCENT 12/15 CAPITAL LETTER I WITH DIAERESIS 13/00 CAPITAL ICELANDIC LETTER ETH 13/01 CAPITAL LETTER N WITH TILDE 13/02 CAPITAL LETTER O WITH GRAVE ACCENT 13/03 CAPITAL LETTER O WITH ACUTE ACCENT 13/05 CAPITAL LETTER O WITH TILDE 13/06 CAPITAL LETTER O WITH DIAERESIS 13/07 (This position shall not be used) 13/08 CAPITAL LETTER O WITH OBLIQUE STROKE 13/09 CAPITAL LETTER U WITH GRAVE ACCENT 13/10 CAPITAL LETTER U WITH ACUTE ACCENT 13/11 CAPITAL LETTER U WITH CIRCUMFLEX ACCENT 13/12 CAPITAL LETTER U WITH DIAERESIS 13/13 CAPITAL LETTER Y WITH ACUTE ACCENT 13/14 CAPITAL ICELANDIC LETTER THORN 13/15 SMALL GERMAN LETTER SHARP s 14/00 SMALL LETTER a WITH GRAVE ACCENT 14/01 SMALL LETTER a WITH ACUTE ACCENT 14/02 SMALL LETTER a WITH CIRCUMFLEX ACCENT 14/03 SMALL LETTER a WITH TILDE 14/04 SMALL LETTER a WITH DIAERESIS 14/05 SMALL LETTER a WITH RING ABOVE 14/06 SMALL DIPHTHONG a WITH e 14/07 SMALL LETTER c WITH CEDILLA 14/08 SMALL LETTER e WITH GRAVE ACCENT 14/09 SMALL LETTER e WITH ACUTE ACCENT 14/10 SMALL LETTER e WITH CIRCUMFLEX ACCENT 14/11 SMALL LETTER e WITH DIAERESIS 14/12 SMALL LETTER i WITH GRAVE ACCENT 14/13 SMALL LETTER i WITH ACUTE ACCENT 14/14 SMALL LETTER i WITH CIRCUMFLEX ACCENT 14/15 SMALL LETTER i WITH DIAERESIS 15/00 SMALL ICELANDIC LETTER ETH 15/01 SMALL LETTER n WITH TILDE 15/02 SMALL LETTER o WITH GRAVE ACCENT 15/03 SMALL LETTER o WITH ACUTE ACCENT 15/04 SMALL LETTER o WITH CIRCUMFLEX ACCENT 15/05 SMALL LETTER o WITH TILDE 15/06 SMALL LETTER o WITH DIAERESIS 15/07 (This position shall not be used) 15/08 SMALL LETTER o WITH OBLIQUE STROKE 15/09 SMALL LETTER u WITH GRAVE ACCENT 15/10 SMALL LETTER u WITH ACUTE ACCENT 15/11 SMALL LETTER u WITH CIRCUMFLEX ACCENT 15/12 SMALL LETTER u WITH DIAERESIS 15/13 SMALL LETTER y WITH ACUTE ACCENT 15/14 SMALL ICELANDIC LETTER THORN 15/15 SMALL LETTER y WITH DIAERESIS End of table -------------------------------- Note from lasko@video.dec.com (Tim Lasko) about ISO-Latin/1: ISO Latin-1, or more completely ISO Latin Alphabet No 1, is now an international standard as of February 1987 (IS 8859, Part 1). For those American USEnet'rs that care, the 8-bit ASCII standard, which is essentially the same code, is going through the final administrative processes prior to publication. The code table that was posted earlier by Mr. Sommarskog to the net is from an earlier draft of the standard, the following changes have been made: OLD DRAFT: 13/07 (This position shall not be used) 15/07 (This position shall not be used) FINAL STANDARD: 13/07 MULTIPLICATION SIGN 15/07 DIVISION SIGN Those two characters were added mainly out of the fear that individual vendors would use the positions for non-interchangeable and incompatible purposes, thus defeating the idea of the standard. The two symbols chosen were more or less a compromise from a large list of eligible characters. ISO Latin-1 (IS 8859/1) is actually one of an entire family of eight-bit one-byte character sets, all having ASCII on the left hand side, and with varying repertoires on the right hand side: Pt 1. Latin Alphabet No 1 (caters to Western Europe - now approved) Pt 2. Latin Alphabet No 2 (caters to Eastern Europe - now approved) Pt 3. Latin Alphabet No 3 (caters to SE Europe + others - in draft ballot) Pt 4. Latin Alphabet No 4 (caters to Northern Europe - in draft ballot) Pt 5. Latin-Cyrillic alphabet (right half all Cyrillic - processing currently suspended pending USSR input) Pt 6. Latin-Arabic alphabet (right half all Arabic - now approved) Pt 7. Latin-Greek alphabet (right half Greek + symbols - in draft ballot) Pt 8. Latin-Hebrew alphabet (right half Hebrew + symbols - proposed) I expect to update this list shortly, because next week I'm attending the meeting of the ISO Working Group concerned with these standards is being held. (ISO TC97/SC2/WG3 for those that can decipher that.)