Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP Path: utzoo!mnetor!seismo!husc6!rutgers!princeton!mind!harnad From: harnad@mind.UUCP (Stevan Harnad) Newsgroups: comp.cog-eng,comp.ai Subject: Re: The symbol grounding problem: Against Rosch & Wittgenstein Message-ID: <972@mind.UUCP> Date: Fri, 3-Jul-87 23:55:19 EDT Article-I.D.: mind.972 Posted: Fri Jul 3 23:55:19 1987 Date-Received: Sat, 4-Jul-87 17:19:46 EDT References: <.... <6174@diamond.BBN.COM> <917@mind.UUCP> <6885@diamond.BBN.COM> Organization: Cognitive Science, Princeton University Lines: 318 Summary: And against Frege and Quine and Putnam too... Xref: mnetor comp.cog-eng:182 comp.ai:613 aweinste@Diamond.BBN.COM (Anders Weinstein) of BBN Laboratories, Inc., Cambridge, MA writes: > [It's] tempting to suppose that all complex concepts *must* have implicit > definitions in terms of some atomic ones, even if...largely > unconscious... [but] philosophy has spent two thousand years searching > for implicit definitions of concepts without any conspicuous success. First of all, let me say that this rejoinder of Weinstein's is excellent. It portrays the standard Quinean view on these matters, so far as I can tell, faithfully and resourcefully. It will be a pleasure attempting to refute this sophisticated position, and if I succeed, I hope that the outcome cannot fail to be informative to all who have been following these exchanges, particularly in view of the influential status of the Quinean view. In replying, however, I have been obliged to quote extensively from Weinstein's articulate statements, despite Ken Laws's valid request that we minimize quotes (and my sincere efforts to comply with it). The facility of quoting is one of the unique powers of electronic communication, though, and I think in this case paraphrase or cross-reference would have caused more confusion and discontinuity than it was worth. Now my response: It is important to note that -- even in this ecumenical age of "cognitive science" -- the concerns of philosophy and of empirical psychology are not the same. In other words, it may be that philosophy was searching for 2000 years in the wrong way or for the wrong thing. Probably both. This will become clearer in my response, but what I claim is that (i) the only way to find out how far a bottom-up approach to concepts grounded in sensory features can get you is actually to model it -- to see what performance you can get out of a device that functions that way. (Even what I'm doing is just prolegomena to such modeling, by the way, but I think I've got the methodological constraints right and some hints as to how one might start.) Philosophy has certainly not been doing that. (ii) "Definitions" (implicit or otherwise) are not what we're looking for in modeling our use of concepts. We're looking for what kinds of internal structures and processes a device must have in order to be able to do what we can do. The fact that philosophers have failed to introspect exhaustive definitions of concepts is not evidence about what internal representations may or may not actually underlie concepts. Not only is it possible that explicit, verbalizable definitions are not what these representations consist of, but it is unlikely that even their "implicit" counterparts will be "definitions" at all. According to my own model, for example, the features that pick out a category named "X" will never be the real, "essential" features that define X's ontologically, from the eternal, omniscient point of view. They will only be the local, context-dependent features that allow a categorizer to sort X's and non-X's CORRECTLY (sic -- I'll return to this) on the basis of the sample of interconfusable X's he has encountered to date. We're not defining X's. We're picking out the features available from the sensory projection that will reliably sort the X's and non-X's we encounter. This provisional, approximate, context-dependent representation then allows us to use "X" in grounded composite symbolic descriptions of higher-order objects not so closely tied to our sense experience. These descriptions too merely provisionally pick out rather than definitively define. Nor is it some exact object that's being picked out; just the best current approximation on the data available. > [Why the psychology of categorization won't dent the problem of meaning:] > "Angry gods nearby" is composite in *English*, but it need not be > composite in native, or, more to the point, in the supposed inner > language of the native's categorical mechanisms. They may have a single > word, say "gog", which we would want to translate as "god-noise" or some> such. Perhaps they train their children to detect gog in precisely the > same way we train children to detect thunder -- our internal > thunder-detectors are identical. Nevertheless, the output of their > thunder-detector does not *mean* "thunder". Besides the obvious rejoinder that -- in a very real sense -- "gog" and "thunder" ARE picking out the same thing to an approximation, I think you are understimating the complexity and resources of compositeness versus atomicity (i.e., descriptions versus names). Utterances are not infinitely decomposable; there are elementary labels that simply refer to an object or a state of affairs, rather than predicate something more complex about it, and some of these objects will be sensory, and picked out by sensory attributes alone. The rest, I claim, can be grounded in combinations of these labels (stating category inclusion relations, to begin with). If "gog" is really a holophrastic description rather than an atomic name, then there must be a way of decomposing it into its components ("angry," "gods," etc.), which will then themselves either be composite or atomic, and if atomic (and sensory), then the grounding can start there. "Thunder," on the other hand, need not be decomposable in that way, and its representation need not presuppose a similar set of interrelations with other representations. (This is not to say that it does not have interrelations with other names and descriptions and their underlying features; just that it does not have the ones the composite holophrastic "gog" must have.) I'll return to the issue of training below. For now, let me say that although I myself introduced the problem of "meaning" ("intentionality" etc.) in formulating the symbol grounding in the first place, I was appealing mainly to the informal, intuitive "folk-psychological" meaning of meaning. We all know what "meaningful" vs. "meaningless" means. We all know what it's like to mean something, and what it's like not to know what something means. That's really all I want to take on. On the other hand, the long line of intentionality conundrums -- beginning with Frege's "morning star" and "evening star" and passing through puzzles about referential opacity and culminating in Putnam's "water/twin-water" koans (and related Quinean "gavagai" problems and even Goodmanian "green/grue" and Kuhnian incommensurability) -- I would rather keep my distance from, as not a helpful legacy from philosophers' 2000-year unsuccessful struggle with meaning. > there are two reasons why meaning resists explication by this kind > of psychology: (1) holism: the meaning of even a "grounded" symbol will > still depend on the rest of the cognitive system; and (2) normativity: > meaning is dependent upon a determination of what is a *correct* > response, and you can't simply read such a norm off from a description > of how the mechanism in fact performs. (1) "Holism" is a vague notion, but I take it that Quine has in mind that the meanings of words are intimately interrelated, and that a change of meaning in one may require adjustments, perhaps even radical ones, throughout the entire system. I think this is something that the kind of bottom-up grounding scheme I'm proposing is particularly well suited to handle, and I discuss it explicitly in the theoretical chapter of the book under discussion here ("Categorical Perception"). One of the most important features of this approach is what I've dubbed "approximationism": All category representations are provisional and approximate, depending on the confusable alternatives sampled to date. This means that feature-sets are open to revision, perhaps even radical revision, if the existing context is too narrow or unrepresentative. The only constraint is that all prior contexts must be subsumed as special cases; in other words, the updating is convergent. Grounding is itself a "holistic" relation, and any ground-level change in the representation will ramify bottom-up to everything that's grounded in it (for example, gog's meaning changes if gods turn out not to exist). This is not to say, however, that incoherencies can't make their way into such a system, or that it will always behave optimally or rationally. (2) "Normativity" is no problem for an approximationist device whose internal principles of function have nothing at all to do with questions about what things "really" are (or "really mean"). These principles only concern what you can reliably sort and label on the evidence available: Every category learning task has a source of feedback about "right" and "wrong." If you are in the wild and you're hungry and only mushrooms are available, there's a distinct ecological constraint to guide you in sorting "edibles" from "inedibles." Less radically, most of our transactions with objects and events that require categorization are attended by feedback from the consequences of MIScategorization (otherwise why bother?). And often the feedback source is good old-fashioned instruction, some of it based on preemptive ecological experience, some of it just based on arbitrary convention. The trick for the theorist is to forget about what a label "really" picks out and just worry about the actually sample a device sorts, and how. So neither holism nor "norms" seem to be a problem for the categorization model I am describing. And whether some of its internal representations are justifiably interpreted as "meanings" depends ultimately on whether or not its performance is TTT-indistinguishable (Total Turing Test) from ours. (Let's not get into another round about whether this is the ONLY criterion again...) > The fact that a subject's brain reliably asserts the symbol "foo" when > and only when thunder is presented in no way "fixes" the meaning of > "foo". Of course it is obviously a *constraint* on what "foo" may > mean: it is in fact part of what Quine called the "stimulus meaning" > of "foo", his first constraint on acceptable translation. Nevertheless, > by itself it is still way too weak to do the whole job, for in different > contexts the positive output of a reliable thunder-detector could mean > "thunder", something co-extensive but non-synonymous with "thunder", > "god-noise", or just about anything else. Indeed, it might not *mean* > anything at all, if it were only part of a mechanical thunder-detector > which couldn't do anything else... I wonder if you disagree with this? I agree with most of this. I certainly agree about context-dependence and what sounds like approximateness. I don't really know what Quine's "stimulus meaning" is, but perhaps it could be cashed in by coming up with the right performance model. That theoretical task, however, is anything but trivial, and the real work seems to begin where Quine's vague descriptor leaves off. (Same for "behavioral dispositions.") I also agree that a sub-TTT device may have nothing worthy of being interpreted as "meaning" at all. Hence much of meaning must have to do with the interrelations among the representations subserving our total sorting, labeling and describing capacity; and it of course depends on the context of interconfusable alternatives that any given device can successfully sort and describe -- the "compared to what?" factor. Widen the context and you narrow the options on what an isolated act of stimulus-naming (and the underlying structures and processes generating it) might "mean." (I've always felt that radical alternative translations are unlikely to exist because of constraints on the permutations and combinations that will still yield a coherently decryptable story. In the propositional calculus, conjunction/negation and disjunction/negation may be "duals," but it's not clear that more complex alternative permutations are possible in the semantics of natural language. They may leave no degrees of freedom. See also the contributions of Dan Berleant to this discussion on that topic. I think similar considerations may apply to inverted-spectrum thought-experiments regarding qualia, i.e., swapping red and green, etc.) > As to normativity, the force of problem (2) is particularly acute when > talking about the supposed intentionality of animals, since there aren't > any obvious linguistic or intellectual norms that they are trying to > adhere to. Although the mechanics of a frog's prey-detector may be > crystal clear, I am convinced that we could easily get into an endless > debate about what, if anything, the output of this detector really > *means*. I agree, although that may partly be a problem with the weakness of our ecological knowledge and cross-species intuitions with respect to the infrahuman-TTT. It may also be a consequence of the preeminent role language plays in our judgments (as perhaps it should). But one can certainly speak about "right" and "wrong" in an animal's categorization performance, both with respect to evolutionary adaptation and learning. And approximationism relieves us of having to decide the fact of the matter about what EXACTLY the frog's bug-detector is picking out. To an approximation it might be the same thing ours is picking out... But, not being TTT-equivalent to us, frogs may well be "meaning" nothing at all. > in doing this sort of psychology, we probably won't care about the > difference between correctly identifying a duck and mis-identifying > a good decoy -- we're interested in the perceptual mechanisms that are > the same in both cases. In effect, we are limiting our notion of > "categorization" to something like "quick and largely automatic > classification by observation alone". Whether duck/decoy is a good enough approximation for duck depends on context and consequences. (For the unfortunate hunted duck, it matters.) But there are big differences between innate and learned categories (the former are not revisable in an indivdual lifetime) and not all categories are sensory. They're simple all GROUNDED in sensory categories. > We pretty much *have* to restrict ourselves in this way, because, in the > general case, there's just no limit to the amount of cognitive activity > that might be required in order to positively classify something. > Consider what might go into deciding whether a dolphin ought to be > classified as a fish, whether a fetus ought to be classified as a > person, etc. These decisions potentially call for the full range of > science and philosophy, and a psychology which tries to encompass such > decisions has just bitten off more than it can chew: it would have to > provide a comprehensive theory of rationality, and such an ambitious > theory has eluded philosophers for some time now... we seem committed > to the notion that we are limiting ourselves to particular *modules* > as explained in Fodor's modularity book. Unfortunately... these > normative distinctions *are* significant for the *meaning* of symbols. > ("Duck" doesn't *mean* the same thing as "decoy"). I'd like to try having that bite and chewing it too. As I suggested before, philosophers may have failed because they never really tried. And holism is not a problem for my kind of model, for example, because there's no restriction on how much of a grounded hybrid system is used to form one categorization, concrete or abstract. And, as I mentioned, the current provisional approximation is always open to updating (say, on the basis of new scientific findings) by widening the context. Nor are "norms" a problem; category formation is always guided by feedback -- either ecological or social -- about what labels and descriptions are right and wrong. I disagree, though, that a successful model calls for a comprehensive theory of rationality (any more than it needs a periscope on ontic reality): It need only be able to make the fallible practical inferences we can and do make. I also see nothing that commits a grounded bottom-up system of the kind I'm describing to any kind of modularity, on the contrary. And "duck" doesn't mean the same as "decoy" only because there are ways we can and do tell them apart. > I think there's some confusion as to whether Harnad's claim [about > the necessity of a sufficient feature-set] is just an empty tautology > or a significant empirical claim. To wit: it's clear that we can > reliably recognize chairs from sensory input, and we don't do this by > magic. Hence, we can perhaps take it as trivially true that there are > some "features" of the input that are being detected. If we are taking > this line however, we have to remember that it doesn't really say > *anything* about the operation of the mechanism -- it's just a fancy > way of saying we can recognize chairs. I agree that I haven't provided a feature-learning mechanism (although I've suggested some candidates, such as connectionist nets or some other inductive statistical algorithm). I've just argued that one must exist. But those who were disagreeing were suggesting that category membership is really graded, not all-or-none, and that sufficient feature-sets do not and need not exist. Be it ever so fancy, it matters whether we categorize chairs as chairs on an all-or-none featural basis or as a matter of degree (of similarity to a "template," say). I think the whole line of research based on "family resemblances" and protoype-matching is wrong-headed and based on misunderstandings about what features and feature-detectors are; moreover, it begs most of the questions involved in trying to get a device to perform successful all-or-none categorization at all. If the existence and use of sufficient feature-sets is so certain that it's tautologous, tell that to the ones who seem to be denying it! > On the other hand, it might be taken as a significant claim about the > nature of the chair-recognition device, viz., that we can understand > its workings as a process of actually parsing the input into a set of > features and actually comparing these against what is essentially some > logical formula in featurese. This *is* an empirical claim, and it is > certainly dubitable: there could be pattern recognition devices > (holograms are one speculative suggestion) which cannot be > interestingly broken down into feature-detecting parts. In another response I argue that holograms and other iconic representations cannot do nontrivial categorization (i.e., problems in which there are no obvious gaps in the variation and the feature-set is complex and underdetermined). I also do not favor "logical formulas in featurese" (which sounds as if it has gone symbolic prematurely). A disjunctive feature-detector need not have any explicit formulas. It could be a selective filter that only passes input that is, say, red or green; i.e., it could be "micro-iconic," -- invertible only in red-or-green-ness. I also don't think the representation of "chair" is likely to be purely sensory; it's probably a higher-order category grounded in sensory categories. I think there's plenty in what I claim that is dubitable (hence empirical), if not dubious. -- Stevan Harnad (609) - 921 7771 {bellcore, psuvax1, seismo, rutgers, packard} !princeton!mind!harnad harnad%mind@princeton.csnet harnad@mind.Princeton.EDU