Xref: utzoo comp.sys.ibm.pc:16894 comp.binaries.ibm.pc.d:508 comp.emacs:3748 Path: utzoo!attcan!uunet!husc6!purdue!i.cc.purdue.edu!j.cc.purdue.edu!nwd From: nwd@j.cc.purdue.edu (Daniel Lawrence) Newsgroups: comp.sys.ibm.pc,comp.binaries.ibm.pc.d,comp.emacs Subject: Re: US PC programmers still live in a 7-bit world! Summary: 8 bit ascii Keywords: 8bit uEMACS Message-ID: <7350@j.cc.purdue.edu> Date: 2 Jul 88 16:30:35 GMT References: <1988Jun22.223158.1366@LTH.Se> <126@dcs.UUCP> <920@infbs.UUCP> <345@krafla.rhi.hi.is> Reply-To: nwd@j.cc.purdue.edu.UUCP (Daniel Lawrence) Organization: Purdue University Lines: 29 There has been a lot of talk about problems with 8 bit characters in various programs and editors. I had many comments about this problem in the last couple of years about MicroEMACS 3.9 (although it was 3.6 and 3.7 at the time) and I fixed it to handle transparently 8 bit characters quite some time ago. So, entering high byte characters from the keyboard using the alternet key works properly. Also keys can be set up like the following: store-macro-21 insert-string &chr 130 !endm bind-to-key execute-macro-20 FN^R This causes an e with an acute accent to be inserted when the alt-e combination is struck. My problem now is this.... How do I determine how such a character should be treated when converted to uppercase. Not knowing the different languages involved, could someone in the know post such info? Does it vary from language to language? How are characters like this treated on UNIX machines of different sorts. Rather than bemoaning the USA's programmers lack of attention, could someone tell us what is the right way to handle things like this? Daniel Lawrence (317) 742-5153 UUCP: {ihnp4!pur-ee!}j.cc.purdue.edu!nwd ARPA: nwd@j.cc.purdue.edu FIDO: 1:201/2 The Programmer's Room (317) 742-5533