Path: utzoo!utgpu!news-server.csri.toronto.edu!cs.utexas.edu!usc!zaphod.mps.ohio-state.edu!rpi!uupsi!sunic!dkuug!dkuugin!keld From: keld@login.dkuug.dk (Keld J|rn Simonsen) Newsgroups: comp.text Subject: Re: ASCII digraphs for ISO 8859-1 requested Keywords: troff, dwb Message-ID: Date: 11 Jan 91 22:44:50 GMT References: <1991Jan7.223914.19056@cbnewsl.att.com> <7726@castle.ed.ac.uk> Sender: news@slyrf.dkuug.dk Lines: 47 yfcw14@castle.ed.ac.uk (K P Donnelly) writes: >npn@cbnewsl.att.com (nils-peter.nelson) writes: >>To the point, are there any more proposals out there for how to >>represent 8 bit characters in a 7 bit subset? >>My goal is to have the troff convention be the same as the >>keyboard convention most familiar to people, if such a thing exists. >Surely by far the most common keyboard convention for ISO 8859-1 is that >on used VT320 terminals, in which for example you generate the 8 bit >character "half" type pressing "Compose-character" then "1" then "2". Well, well. I did some other work on this, and also had a look at the VT320 names. What I wanted to do was to have ASCII encodings of all the ISO 8859 character sets and also other character sets. I found that the VT320 codes were fine for ISO 8859-1, but when all the parts of ISO 8859 (there is about 10 parts) should be coded, there were conflicts in the naming. I now have a set of more than 1300 character names in 2 character ASCII (or actually invariant ISO 646), which is used for definition of POSIX locales and used in email. I also have tables of the encoding of about 60 character sets with these two-char names. Unfortunately these names are incompatible with Ossanna/Kernighan titroff. And thus the names in K/S/A are incompatible with this extended list. And I was involved in both lists.... The 1300 character list was a lot bigger than the K/S/A list and therefore had to be designed more consistently and carefully. One fundamental design decision was to shift around the letter position in letter names, so titroff *a is now a* in the 1300 char list. The 1300 char list includes the following: extended latin, greek, cyrillic, hebrew, arabic, mathematics, hiragana, katakana, bopomofo. Two ideas for this: 1. A titroff list of the 1300 characters (or the like) could be done - which could be as compatible as possible with the big list I have now. 2. the new titroff could have a specification of input character set - thus more markets could be opened up for this product. The 1300 char list (and some code to handle it) is available by anon ftp in dkuug.dk:pub/ch.shar* Keld Simonsen