Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP Posting-Version: version B 2.10.2 9/17/84; site utopia.UUCP Path: utzoo!linus!philabs!cmcl2!seismo!hao!nbires!utopia!sorflat From: sorflat@utopia.UUCP (John Sorflaten) Newsgroups: net.research,net.unix,net.nlang Subject: Seeking word lists with frequency of use Message-ID: <107@utopia.UUCP> Date: Tue, 12-Mar-85 23:05:13 EST Article-I.D.: utopia.107 Posted: Tue Mar 12 23:05:13 1985 Date-Received: Sun, 17-Mar-85 23:15:41 EST Distribution: net Organization: G.I.T. Inc., Fairfield IA Lines: 22 Xref: linus net.research:88 net.unix:3299 net.nlang:2373 After reading about Webster's 2a being available, I am inspired to dare ask for a word list of any language. The first 5000 words are about all I need, with data reflecting their percentage of use in standard text. (For instance, "the" will reflect a high percentage of usage in English.) For starters, I'm really interested in English, Korean, Chinese, Arabic, Hebrew and Hindi. For non-roman languages such as Chinese, there exist standard identification numbers for each word (character) in the vocabulary. Other non-roman languages such as Arabic, Hebrew and Hindi are another issue. I don't know if standard ASCII methods of text transmission is possible for those instances. (Let me know.) English is our current high priority! Book references or hard copies are ok. John Sorflaten c/o G.I.T. PO Box 628 Fairfield, IA 52556 tel (515) 472-5979 Thanks in advance.