Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP Path: utzoo!watmath!clyde!burl!ulysses!mhuxl!houxm!vax135!cornell!uw-beaver!tektronix!hplabs!sri-unix!AFB%MIT-OZ@MIT-MC.ARPA From: AFB%MIT-OZ@MIT-MC.ARPA Newsgroups: net.ai Subject: Understanding speech versus hearing words Message-ID: <13053@sri-arpa.UUCP> Date: Wed, 29-Aug-84 10:12:00 EDT Article-I.D.: sri-arpa.13053 Posted: Wed Aug 29 10:12:00 1984 Date-Received: Mon, 3-Sep-84 11:25:28 EDT Lines: 46 From: Aaron F. Bobick About speech recognition: From: Sidney Markowitz It turns out that even to separate the syllables in continuous speech you need to have some understanding of what the speaker is talking about! You can discover this for yourself by trying to hear the sounds of the words when someone is speaking a foreign language. You can't even repeat them correctly as nonsense syllables. What this implies is an approach to speech recognition that goes beyond pattern recognition to include understanding of utterances. This in turn implies that the system has some understanding of the "world view" of the speaker, i.e., common sense knowledge and the probable intentions of the speaker..... Many psycho-lingists would dispute this. The problem with the foreign language example is that you don't recognize WORDS, not that you don't understand the utterance (for now let us define understanding as building some sort of SEMANTIC model, the details don't matter). Consider the classic: "Green ideas sleep furiously." I doubt one can "understand" this in any plausible way yet it's encoding is easy. Even if one removes grammar and is listening to a randomized listing of Websters dictionary, one can easily parse the string into syllables and words. In fact, *except under noise conditions much worse than normal conversation*, there is psycho-linguistic evidence that context does not greatly affect word recognition by humans in terms of the parsing of the input signal. .... (I am over simplifying a little; there is also evidence that context can help you make judgements about incoming words and syllables. However, this may be a post-access phenomena, sort of a surprise effect when an anomalous word or syllable is encountered; the jury is still out. Regardless, it is certainly reasonable to consider a context independent word recognition system. ) ..... Therefore, it is clearly possible to consider speech *recognition* as separate from understanding. Hearsay (I or II) does not; some psychologists (and, by the way, many AI speech hackers) do. Stuck in the middle again ...... aaron bobick (afb%mit-oz@mit-mc)