Path: utzoo!attcan!uunet!jarthur!elroy.jpl.nasa.gov!ncar!zaphod.mps.ohio-state.edu!rpi!uupsi!sunic!ugle.unit.no!nuug!ifi!enag From: enag@ifi.uio.no (Erik Naggum) Newsgroups: comp.text Subject: Re: ISO 8859-1 (Latin-1) Message-ID: Date: 2 Mar 91 02:52:03 GMT References: <8775@exodus.Eng.Sun.COM> Sender: enag@ifi.uio.no (Erik Naggum) Organization: Naggum Software, Oslo, Norway Lines: 51 In-Reply-To: tut@cairo.Eng.Sun.COM's message of 27 Feb 91 22:52:24 GMT In article <8775@exodus.Eng.Sun.COM>, Bill Tuthill writes: Does the ISO 8859 standard designate any function or behavior for control codes between hex 80 and 9F? Are they supposed to be just like the control codes between hex 00 and 1F, except with the 8th bit turned on? My 8859 sample pages just show gray boxes in all these positions, and the Unicode standard just has CTRL in boxes 80-9F (though it has specific names in 00-1F). ISO 8859 only covers the areas named G0 and G1 in ISO 2022, according to the specifications in ISO 4873. Control codes are found in C0 and C1. C0 is specified in ISO 646 and further qualified in ISO 2022. You will find a complete set of definitions for the control characters in C1 in ISO 6429. C1 contains at least Single-Shift 2 (SS2) [08/14] and Single-Shift 3 (SS3) [08/15] defined in ISO 2022 and ISO 4873. ISO DIS 10646 introduces three control characters in C1: PAD [08/00], Single Graphic Character Introducer (SGCI) [09/09], High Octet Preset (HOP) [08/01]. There exists a New Line control character in C1, the code for which I don't know, not having received my copy of ISO 6429, yet. ------- References: [1] ISO 646: Information processing -- ISO 7-bit coded character set for information interchange [2] ISO 2022: Information processing -- ISO 7-bit and 8-bit coded character set -- Code extension techniques [3] ISO 4873: Information processing -- ISO 8-bit code for information interchange -- Structure and rules for implementation [4] ISO 6429: Information processing -- ISO 7-bit and 8-bit coded character sets -- Additional control functions for character imaging devices [5] ISO 6937: Information processing -- Coded character sets for text communication (consists at present of two parts) [6] ISO 8859: Information processing -- ISO 8-bit coded character set for information interchange (consists at present of nine parts, part 1 containing rules of implementation, in addition to ISO Latin 1). [6] ISO DIS 10646: Information technology -- Universal Coded Character Set (UCS) [probably a preliminary title, as it does not follow ISO standard naming rules] If you are rich, you will not necessarily remain so if you order all of the above from your local ISO outlet. ------- -- [Erik Naggum] Naggum Software, Oslo, Norway