Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP Path: utzoo!mnetor!seismo!rutgers!princeton!mind!harnad From: harnad@mind.UUCP (Stevan Harnad) Newsgroups: comp.cog-eng,comp.ai Subject: Re: The symbol grounding problem Message-ID: <917@mind.UUCP> Date: Fri, 26-Jun-87 15:41:11 EDT Article-I.D.: mind.917 Posted: Fri Jun 26 15:41:11 1987 Date-Received: Sat, 27-Jun-87 10:11:11 EDT References: <764@mind.UUCP> <768@mind.UUCP> <770@mind.UUCP> <6174@diamond.BBN.COM> <8310@ut-sally.UUCP> Organization: Cognitive Science, Princeton University Lines: 159 Summary: Critique of the Roschian approach to category representation Xref: mnetor comp.cog-eng:148 comp.ai:575 berleant@ut-sally.UUCP (Dan Berleant) of U. Texas CS Dept., Austin, Texas writes: > Are you saying that the categorical representations are to be > nonsymbolic? The review of human concept representation I recently read > (Smith and Medin, Categories and Concepts, 1981) came down... hard on > the holistic theory of concept representation... The alternative > nonsymbolic approach would be the 'dimensional' one. It seems a > strongish statement to say that this would be sufficient, to the > exclusion of symbolic properties... However, the metric > hypothesis -- that a concept is sufficiently characterized by a point > in a multi-dimensional space -- seems wrong, as experiments have shown. Categorical representations are the representations of purely SENSORY categories, and I am indeed saying that they are to be NONsymbolic. Let me also point out that the theory I am putting forward represents a direct challenge to the Roschian line of category research in which the book you cite belongs. To put it very briefly, I claim that that line of experimental and theoretical work is not really investigating the representations underlying the capacity to categorize at all; it is only looking at the fine tuning of category judgments. The experiments are typically not addressing the question of how it is that a device or organism can successfully categorize the inputs in question in the first place; instead it examines (1) how QUICKLY or EASILY subjects do it, (2) how TYPICAL (of the members of the category in question) subjects rate the inputs to be and (3) what features subjects INTROSPECT that they are using. This completely bypasses the real question of how anyone or anything actually manages to accomplish the categorization at all. Let me quickly add that there is nothing wrong with reaction-time experiments if they suggest hypotheses about the basic underlying mechanism, or provide ways of testing them. But in this case -- as in many others in experimental cognitive psychology -- the basic mechanisms are bypassed and the focus is on fine-tuning questions that are beside the point (or premature) -- if, that is, the objective is to explain how organisms or devices actually manage to generate successful categorization performance given the inputs in question. As an exercise, see where the constructs you mention above -- "holistic," "dimensional," or "metric" representations -- are likely to get you if you're actually trying to get a device to categorize, as we do. There is also an "entry point" problem with this line of research, which typically looks willy-nilly at higher-order, abstract categories, as well as "basic level" object categories (an incoherent concept, in my opinion, except as an arbitrary default level), and even some sensory categories. But it seems obvious that the question of how the higher-order categories are represented is dependent on how the lower-order ones are represented, the abstract ones on the concrete ones, and perhaps all of these depend on the sensory ones. Moreover, often the inputs used are members of familiar, overlearned categories, and the task is a trivial one, not engaging the mechanisms that were involved in their acquisition. In other experiments, artificial stimuli are used, but it is not clear how representative these are of the category acquisition process either. Finally, and perhaps most important: In bypassing the problem of categorization capacity itself -- i.e., the problem of how devices manage to categorize as correctly and successfully as they do, given the inputs they have encountered -- in favor of its fine tuning, this line of research has unhelpfully blurred the distinction between the following: (a) the many all-or-none categories that are the real burden for an explanatory theory of categorization (a penguin, after all, be it ever so atypical a bird, and be it ever so time-consuming for us to judge that it is indeed a bird, is, after all, indeed a bird, and we know it, and can say so, with 100% accuracy every time, irrespective of whether we can successfully introspect what features we are using to say so) and (b) true "graded" categories such as "big," "intelligent," etc. Let's face the all-or-none problem before we get fancy... > To discuss "invariant features... sufficient to guide reliable > categorization" sounds like the "classical" theory (as Smith & Medin > call it) of concept representation: Concepts are represented as > necessary and sufficient features (i.e., there are defining features, > i.e. there is a boolean conjunction of predicates for a concept). This > approach has serious problems, not the least of which is the inability > of humans to describe these features for seemingly elementary concepts, > like "chair", as Weinstein and others point out. I contend that a > boolean function (including ORs as well as ANDs) could work, but that > is not what was mentioned. An example might be helpful: A vehicle must > have a steering wheel OR handlebars. But to remove the OR by saying, > a vehicle must have a means of steering, is to rely on a feature which > is symbolic, high level, functional, which I gather we are not allowing. It certainly is the "classical" theory, but the one with the serious problems is the fine-tuning approach I just described, not the quite reasonable assumption that if 100% correct, all-or-none categorization is possible at all (without magic), then there must be a set of features in the inputs that is SUFFICIENT to generate it. I of course agree that disjunctive features are legitimate -- but whoever said they weren't? That was another red herring introduced by this line of research. And, as I mentioned, "the inability of humans to describe these features" is irrelevant. If they could do it, they'd be cognitive modelers! We must INFER what features they're using to categorize successfully; nothing guarantees they can tell us. (If by "Weinstein" you mean "Wittgenstein" on "games," etc., I have to remind you that Wittgenstein did not have the contemporary burden of speaking in terms of internal mechanisms a device would have to have in order to categorize successfully. Otherwise he would have had to admit that "games" are either (i) an all-or-none category, i.e., there is a "right" or "wrong" of the matter, and we are able to sort accordingly, whether or not we can introspect the basis of our correct sorting, or (ii) "games" are truly a fuzzy category, in which membership is arbitrary, uncertain, or a matter of degree. But if the latter, then games are simply not representative of the garden-variety all-or-none categorization capacity that we exercise when we categorize most objects, such as chairs, tables, birds. And again, there's nothing whatsoever wrong with disjunctive features.) Finally, it is not that we are not "allowing" higher-order symbolically described features. They are the goal of the whole grounding project. But the approach I am advocating requires that symbolic descriptions be composed of primitive symbols which are in turn the labels of sensory categories, grounded in nonsymbolic (iconic and categorical) representations. > [Concerning model-theoretic "grounding":] The more statements > you have (that you wish to be deemed correct), the more the possible > meanings of the terms will be constrained. To illustrate, consider > the statement FISH SWIM. Think of the terms FISH and SWIM as variables > with no predetermined meaning -- so that FISH SWIM is just another way > of writing A B. What variable bindings satisfy this? Well, many do... > Now consider the statement FISH LIVE, where FISH and LIVE are variables. > Now there are two statements to be satisfied. The assignment to the > variable LIVE restricts the possible assignments to the variable SWIM... > Of course, we have many many statements in our minds that must be > simultaneously satisfied, so the possible meanings that each word name > can be assigned is correspondingly restricted. Could the restrictions be > sufficient to require such a small amount of ambiguity that the word > names could be said to have intrinsic meaning?... footnote: This > leaves unanswered the question of how the meanings themselves are > grounded. Non-symbolically, seems to be the gist of the discussion, > in which case logic would be useless for that task even in an > "in principle" capacity since the stuff of logic is symbols. I agree that there are constraints on the correlations of symbols in a natural language, and that the degrees of freedom probably shrink, in a sense, as the text grows. That is probably the basis of successful cryptography. But I still think (and you appear to agree) that even if the degrees of freedom are close to zero for a natural language's symbol combinatons and their interpretations, this still leaves the grounding problem intact: How are the symbols connected to their referents? And what justifies our interpretation of their meanings? With true cryptography, the decryption of the symbols of the unknown language is always grounded in the meanings of the symbols of a known language, which are in turn grounded in our heads, and their understanding of the symbols and their relation to the world. But that's the standard DERIVED meaning scenario, and for cognitive modeling we need INTRINSICALLY grounded symbols. (I do believe, though, that the degrees-of-freedom constraint on symbol combinations does cut somewhat into Quine's claims about the indeterminacy of radical translation, and ESPECIALLY for an intrinsically grounded symbol system.) -- Stevan Harnad (609) - 921 7771 {bellcore, psuvax1, seismo, rutgers, packard} !princeton!mind!harnad harnad%mind@princeton.csnet harnad@mind.Princeton.EDU