Path: utzoo!censor!comspec!lethe!torsqnt!news-server.csri.toronto.edu!cs.utexas.edu!bcm!dimacs.rutgers.edu!seismo!uunet!ceco!garry From: garry@ceco.ceco.com (Garry Garrett) Newsgroups: comp.os.msdos.programmer Subject: Re: Unicode: Details Please Summary: Unicode Keywords: multilingual Unicode character set Message-ID: <405@ceco.ceco.com> Date: 15 Feb 91 00:43:10 GMT References: <1991Feb14.001842.24415@progress.com> Organization: Commonwealth Edison Co., Chicago, IL Lines: 63 In article <1991Feb14.001842.24415@progress.com>, neil@progress.COM (Neil Galarneau) writes: > I have heard of a new multi-lingual character set called Unicode. > > It is supposed to give one all the character sets in the world in 16-bit > charcters. It is supposed to be backed by several Unix companies, Apple, > and the DOS companies. > > ... I hope not. I was working on this myself, and I was also going to include several other features as well. Well, I they are working on it I hope that they were smart enough to make character 0 = '0', char 1 = '1'... char 9 = '9', char 10 = 'A', char 11 = 'B' ... This would make conversions to Hexidecimal (or Octal) much much easier. You see there is no real reason that the control characters HAD to occupy the first 32 characters in ASCII. (They could have just as easily made them the last 32 and used positive logic circuts rather than negitve logic) IMHO, many of the problems that programmers face with character sets (ASCII & EBCDIC) are that they were designed by engineers who were (naturally) more concerned with what was going to be easy to build hardware-wise. You only build hardware once, but people write software for it for years. If it's a little bit harder to build, but easier to program, it's worth it. As for my thoughts on a NEW character set, there is no reason why all written languages could not be included. There are also a wealth of special characters that could be included, making people of various professions that use computers, jobs easier. Symbols like less-than-or-equal-to and not-equal-to, could make every programmer's job easier. Meteorology has alot of symbols that could be included in the character set that would simplify the excange of weather data, for example. We also need not limit a new character set to today's technology. (ASCII was designed for teletype machines) What I mean by this is that we should include characters that represent colors and music. granted, not everyone's computer has these capablities today, but why limit the character set? If your computer doesn't have a speaker, ignore the music characters. If you don't have the capabilities that some given character implies, then take an appropriate action that is within your hardware's realm. I think that there are alot of special characters that would help to unify word processing files (like a character for Boldface-on, Itallics-off...) If these characters existed in the character set, word processors would not need to make up their own representation for these things, and thus they could use "standard" unicode files. Immagine, having a file of "music" characters: you could "print" it to your synthizer and listen to it, or you could "print" it to you printer, and get out sheet music. (I realize that this is a bit idealized, but I think that it is possible). Joe Musician could write his new song on a computer, upload it to the studio, Record it, (most likely his record label will sell the theme to it to a video game maker to include as background to a game), and the record company will put it on a CD ROM with it's other Top 40 songs of the month to distribute to record stores so that you can come in and buy a copy of the sheet music (which the music store prints off on it's laser printer from the file on the CD ROM). I am not saying that this form of marketing is my goal, but I am only trying to show how much time & effort can be saved for members of a certain profession, if they are kept in mind when a new code is developed. I certainly hope that if there is a Unicode, that it's makers have had such a far reaching outlook on it's possiblities. It would be a shame for a new "standard" to emerge that is outdated about the time that it is accepted. If any of you out there have some ideas for things that may be included in my character set, please e-mail them to me. I still plan on working on this unless I get some more info on Unicode, and it does have some forthought to it. Garry Garrett garry@ceco.ceco.com