Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP Path: utzoo!mnetor!uunet!husc6!cmcl2!beta!unm-la!unmvax!turing!mike From: mike@turing.unm.edu (Michael I. Bushnell) Newsgroups: comp.std.internat Subject: Chinese words Message-ID: <615@unmvax.UNM.EDU> Date: Wed, 12-Aug-87 14:51:58 EDT Article-I.D.: unmvax.615 Posted: Wed Aug 12 14:51:58 1987 Date-Received: Sat, 15-Aug-87 01:24:49 EDT Sender: news@unmvax.UNM.EDU Reply-To: mike@turing.UUCP (Michael I. Bushnell) Distribution: world Organization: University of New Mexico, Albuquerque Lines: 36 I think the discussion about Chinese words would benefit from some interesting knowledge I gleaned from a book on writing systems and the alphabet. In Chinese, every word is one syllable. Needless to say, there is lots of overloading here, but the multiple meanings of a word are usually quite different and can be easily distinguished from context. There is not one character per word(== syllable), rather, there is one character per word meaning. Representations of Chinese in data-processing is always assumed to be like writing Chinese. In that case, you need enough bits to hold the large lexicon. But is it not possible to represent the syllables? There may be problems (many computer "things" have little context) but it might be workable. As for the size of the lexicon, a recent article here said that the OED had about 1,000,000 words, and English slightly more than that. From this, the poster derived a figure of 1,000,000 for the size of the Chinese lexicon. But English is a remarkable language. For most types of things, we have TWO words, one Latinic, one Germanic. For example: teeth/dental dead/mortal car/automobile. The list of such pairs is huge. In no other language to my knowledge is there such a phenomenon. My estimate, from this and other reasons for the large English lexicon, is about 500,000 words in Chinese. Unfortunately, this means that each character would not fit in 16 bits. But the number of syllables is MUCH less. That could probably fit. Michael I. Bushnell a/k/a Bach II mike@turing.UNM.EDU --- Where do your SOCKS go when you lose them in th' WASHER? -- Zippy the Pinhead