Relay-Version: version B 2.10 5/3/83; site utzoo.UUCP
Path: utzoo!mnetor!seismo!husc6!rutgers!princeton!mind!harnad
From: harnad@mind.UUCP (Stevan Harnad)
Newsgroups: comp.cog-eng,comp.ai
Subject: Re: The symbol grounding problem: Against Rosch & Wittgenstein
Message-ID: <972@mind.UUCP>
Date: Fri, 3-Jul-87 23:55:19 EDT
Article-I.D.: mind.972
Posted: Fri Jul  3 23:55:19 1987
Date-Received: Sat, 4-Jul-87 17:19:46 EDT
References: <.... <6174@diamond.BBN.COM> <917@mind.UUCP> <6885@diamond.BBN.COM>
Organization: Cognitive Science, Princeton University
Lines: 318
Summary: And against Frege and Quine and Putnam too...
Xref: mnetor comp.cog-eng:182 comp.ai:613


aweinste@Diamond.BBN.COM (Anders Weinstein)
of BBN Laboratories, Inc., Cambridge, MA writes:

>      [It's] tempting to suppose that all complex concepts *must* have implicit
>	definitions in terms of some atomic ones, even if...largely
>	unconscious... [but] philosophy has spent two thousand years searching
>	for implicit definitions of concepts without any conspicuous success.

First of all, let me say that this rejoinder of Weinstein's is
excellent. It portrays the standard Quinean view on these matters, so
far as I can tell, faithfully and resourcefully. It will be a pleasure
attempting to refute this sophisticated position, and if I succeed, I
hope that the outcome cannot fail to be informative to all who have
been following these exchanges, particularly in view of the
influential status of the Quinean view. In replying, however, I have
been obliged to quote extensively from Weinstein's articulate
statements, despite Ken Laws's valid request that we minimize quotes
(and my sincere efforts to comply with it). The facility of quoting is
one of the unique powers of electronic communication, though, and I
think in this case paraphrase or cross-reference would have caused
more confusion and discontinuity than it was worth. Now my response:

It is important to note that -- even in this ecumenical age of
"cognitive science" -- the concerns of philosophy and of empirical
psychology are not the same. In other words, it may be that philosophy
was searching for 2000 years in the wrong way or for the wrong thing.
Probably both. This will become clearer in my response, but what I
claim is that (i) the only way to find out how far a bottom-up
approach to concepts grounded in sensory features can get you is
actually to model it -- to see what performance you can get out of a
device that functions that way. (Even what I'm doing is just
prolegomena to such modeling, by the way, but I think I've got the
methodological constraints right and some hints as to how one might
start.) Philosophy has certainly not been doing that.

(ii) "Definitions" (implicit or otherwise) are not what we're looking
for in modeling our use of concepts. We're looking for what kinds of
internal structures and processes a device must have in order to be able to do
what we can do. The fact that philosophers have failed to introspect
exhaustive definitions of concepts is not evidence about what internal
representations may or may not actually underlie concepts. Not only is it
possible that explicit, verbalizable definitions are not what these
representations consist of, but it is unlikely that even their
"implicit" counterparts will be "definitions" at all.

According to my own model, for example, the features that pick out a category
named "X" will never be the real, "essential" features that define X's
ontologically, from the eternal, omniscient point of view. They will
only be the local, context-dependent features that allow a categorizer
to sort X's and non-X's CORRECTLY (sic -- I'll return to this) on the
basis of the sample of interconfusable X's he has encountered to
date. We're not defining X's. We're picking out the features available
from the sensory projection that will reliably sort the X's and
non-X's we encounter. This provisional, approximate, context-dependent
representation then allows us to use "X" in grounded composite
symbolic descriptions of higher-order objects not so closely tied to our
sense experience. These descriptions too merely provisionally pick out
rather than definitively define. Nor is it some exact object that's
being picked out; just the best current approximation on the data
available.

>      [Why the psychology of categorization won't dent the problem of meaning:]
>	"Angry gods nearby" is composite in *English*, but it need not be
>	composite in native, or, more to the point, in the supposed inner
>	language of the native's categorical mechanisms. They may have a single
>	word, say "gog", which we would want to translate as "god-noise" or some>	such. Perhaps they train their children to detect gog in precisely the
>	same way we train children to detect thunder -- our internal
>	thunder-detectors are identical. Nevertheless, the output of their
>	thunder-detector does not *mean* "thunder".

Besides the obvious rejoinder that -- in a very real sense -- "gog"
and "thunder" ARE picking out the same thing to an approximation, I
think you are understimating the complexity and resources of
compositeness versus atomicity (i.e., descriptions versus names).
Utterances are not infinitely decomposable; there are elementary
labels that simply refer to an object or a state of affairs, rather
than predicate something more complex about it, and some of these
objects will be sensory, and picked out by sensory attributes alone.
The rest, I claim, can be grounded in combinations of these labels
(stating category inclusion relations, to begin with).

If "gog" is really a holophrastic description rather than an atomic
name, then there must be a way of decomposing it into its components
("angry," "gods," etc.), which will then themselves either be
composite or atomic, and if atomic (and sensory), then the grounding can
start there. "Thunder," on the other hand, need not be decomposable in that
way, and its representation need not presuppose a similar set of
interrelations with other representations. (This is not to say that it does
not have interrelations with other names and descriptions and their
underlying features; just that it does not have the ones the
composite holophrastic "gog" must have.)

I'll return to the issue of training below. For now, let me say that
although I myself introduced the problem of "meaning" ("intentionality" etc.)
in formulating the symbol grounding in the first place, I was
appealing mainly to the informal, intuitive "folk-psychological"
meaning of meaning. We all know what "meaningful" vs. "meaningless"
means. We all know what it's like to mean something, and what it's
like not to know what something means. That's really all I want to
take on. On the other hand, the long line of intentionality
conundrums -- beginning with Frege's "morning star" and "evening star"
and passing through puzzles about referential opacity and culminating in
Putnam's "water/twin-water" koans (and related Quinean "gavagai"
problems and even Goodmanian "green/grue" and Kuhnian
incommensurability) -- I would rather keep my distance from, as not a helpful
legacy from philosophers' 2000-year unsuccessful struggle with meaning.

>	there are two reasons why meaning resists explication by this kind
>	of psychology:  (1) holism: the meaning of even a "grounded" symbol will
>	still depend on the rest of the cognitive system; and (2) normativity:
>	meaning is dependent upon a determination of what is a *correct*
>	response, and you can't simply read such a norm off from a description
>	of how the mechanism in fact performs.

(1) "Holism" is a vague notion, but I take it that Quine has in mind
that the meanings of words are intimately interrelated, and that a
change of meaning in one may require adjustments, perhaps even radical
ones, throughout the entire system. I think this is something that the
kind of bottom-up grounding scheme I'm proposing is particularly well
suited to handle, and I discuss it explicitly in the theoretical chapter
of the book under discussion here ("Categorical Perception").

One of the most important features of this approach is what I've dubbed
"approximationism": All category representations are provisional and
approximate, depending on the confusable alternatives sampled to date.
This means that feature-sets are open to revision, perhaps even
radical revision, if the existing context is too narrow or
unrepresentative. The only constraint is that all prior contexts must
be subsumed as special cases; in other words, the updating is
convergent. Grounding is itself a "holistic" relation, and any
ground-level change in the representation will ramify bottom-up to
everything that's grounded in it (for example, gog's meaning changes
if gods turn out not to exist). This is not to say, however, that
incoherencies can't make their way into such a system, or that it will
always behave optimally or rationally.

(2) "Normativity" is no problem for an approximationist device 
whose internal principles of function have nothing at all to
do with questions about what things "really" are (or "really mean").
These principles only concern what you can reliably sort and label on
the evidence available: Every category learning task has a source of
feedback about "right" and "wrong." If you are in the wild and you're
hungry and only mushrooms are available, there's a distinct
ecological constraint to guide you in sorting "edibles" from "inedibles."
Less radically, most of our transactions with objects and events that
require categorization are attended by feedback from the consequences of
MIScategorization (otherwise why bother?). And often the feedback source
is good old-fashioned instruction, some of it based on preemptive
ecological experience, some of it just based on arbitrary convention.
The trick for the theorist is to forget about what a label "really" picks
out and just worry about the actually sample a device sorts, and how.

So neither holism nor "norms" seem to be a problem for the
categorization model I am describing. And whether some of its internal
representations are justifiably interpreted as "meanings" depends
ultimately on whether or not its performance is TTT-indistinguishable
(Total Turing Test) from ours. (Let's not get into another round about
whether this is the ONLY criterion again...)

>	The fact that a subject's brain reliably asserts the symbol "foo" when
>	and only when thunder is presented in no way "fixes" the meaning of
>	"foo". Of course it is obviously a *constraint* on what "foo" may
>	mean: it is in fact part of what Quine called the "stimulus meaning"
>	of "foo", his first constraint on acceptable translation. Nevertheless, 
>	by itself it is still way too weak to do the whole job, for in different
>	contexts the positive output of a reliable thunder-detector could mean
>	"thunder", something co-extensive but non-synonymous with "thunder",
>	"god-noise", or just about anything else. Indeed, it might not *mean*
>	anything at all, if it were only part of a mechanical thunder-detector
>	which couldn't do anything else... I wonder if you disagree with this?

I agree with most of this. I certainly agree about context-dependence
and what sounds like approximateness. I don't really know what Quine's
"stimulus meaning" is, but perhaps it could be cashed in by coming up with
the right performance model. That theoretical task, however, is anything but
trivial, and the real work seems to begin where Quine's vague descriptor
leaves off. (Same for "behavioral dispositions.") I also agree that a
sub-TTT device may have nothing worthy of being interpreted as "meaning" at
all. Hence much of meaning must have to do with the interrelations
among the representations subserving our total sorting, labeling and
describing capacity; and it of course depends on the context of interconfusable
alternatives that any given device can successfully sort and describe -- the
"compared to what?" factor. Widen the context and you narrow the
options on what an isolated act of stimulus-naming (and the underlying
structures and processes generating it) might "mean."

(I've always felt that radical alternative translations are unlikely to
exist because of constraints on the permutations and combinations that will
still yield a coherently decryptable story. In the propositional
calculus, conjunction/negation and disjunction/negation may be "duals,"
but it's not clear that more complex alternative permutations are
possible in the semantics of natural language. They may leave no degrees
of freedom. See also the contributions of Dan Berleant to this discussion
on that topic. I think similar considerations may apply to
inverted-spectrum thought-experiments regarding qualia, i.e., swapping
red and green, etc.)

>	As to normativity, the force of problem (2) is particularly acute when
>	talking about the supposed intentionality of animals, since there aren't
>	any obvious linguistic or intellectual norms that they are trying to
>	adhere to. Although the mechanics of a frog's prey-detector may be
>	crystal clear, I am convinced that we could easily get into an endless
>	debate about what, if anything, the output of this detector really
>	*means*.

I agree, although that may partly be a problem with the weakness of our
ecological knowledge and cross-species intuitions with respect to the
infrahuman-TTT. It may also be a consequence of the preeminent
role language plays in our judgments (as perhaps it should). But one
can certainly speak about "right" and "wrong" in an animal's
categorization performance, both with respect to evolutionary
adaptation and learning. And approximationism relieves us of having to
decide the fact of the matter about what EXACTLY the frog's bug-detector
is picking out. To an approximation it might be the same thing ours is
picking out... But, not being TTT-equivalent to us, frogs may well be
"meaning" nothing at all.

>	in doing this sort of psychology, we probably won't care about the
>	difference between correctly identifying a duck and mis-identifying
>	a good decoy -- we're interested in the perceptual mechanisms that are
>	the same in both cases. In effect, we are limiting our notion of
>	"categorization" to something like "quick and largely automatic
>	classification by observation alone".

Whether duck/decoy is a good enough approximation for duck depends on
context and consequences. (For the unfortunate hunted duck, it
matters.) But there are big differences between innate and learned
categories (the former are not revisable in an indivdual lifetime) and
not all categories are sensory. They're simple all GROUNDED in sensory
categories.

>	We pretty much *have* to restrict ourselves in this way, because, in the
>	general case, there's just no limit to the amount of cognitive activity
>	that might be required in order to positively classify something.
>	Consider what might go into deciding whether a dolphin ought to be
>	classified as a fish, whether a fetus ought to be classified as a
>	person, etc.  These decisions potentially call for the full range of
>	science and philosophy, and a psychology which tries to encompass such
>	decisions has just bitten off more than it can chew: it would have to
>	provide a comprehensive theory of rationality, and such an ambitious
>	theory has eluded philosophers for some time now... we seem committed
>	to the notion that we are limiting ourselves to particular *modules*
>	as explained in Fodor's modularity book. Unfortunately... these
>	normative distinctions *are* significant for the *meaning* of symbols.
>	("Duck" doesn't *mean* the same thing as "decoy").

I'd like to try having that bite and chewing it too. As I suggested before,
philosophers may have failed because they never really tried. And holism is
not a problem for my kind of model, for example, because there's no restriction 
on how much of a grounded hybrid system is used to form one categorization,
concrete or abstract. And, as I mentioned, the current provisional approximation
is always open to updating (say, on the basis of new scientific findings)
by widening the context. Nor are "norms" a problem; category formation
is always guided by feedback -- either ecological or social -- about what
labels and descriptions are right and wrong.

I disagree, though, that a successful model calls for a comprehensive theory
of rationality (any more than it needs a periscope on ontic reality): It
need only be able to make the fallible practical inferences we can and do
make. I also see nothing that commits a grounded bottom-up system of the
kind I'm describing to any kind of modularity, on the contrary. And "duck"
doesn't mean the same as "decoy" only because there are ways we can and do
tell them apart.

>	I think there's some confusion as to whether Harnad's claim [about
>	the necessity of a sufficient feature-set] is just an empty tautology
>	or a significant empirical claim. To wit: it's clear that we can
>	reliably recognize chairs from sensory input, and we don't do this by
>	magic. Hence, we can perhaps take it as trivially true that there are
>	some "features" of the input that are being detected. If we are taking
>	this line however, we have to remember that it doesn't really say
>	*anything* about the operation of the mechanism -- it's just a fancy
>	way of saying we can recognize chairs.

I agree that I haven't provided a feature-learning mechanism (although
I've suggested some candidates, such as connectionist nets or some
other inductive statistical algorithm). I've just argued that one
must exist. But those who were disagreeing were suggesting that category
membership is really graded, not all-or-none, and that sufficient
feature-sets do not and need not exist. Be it ever so fancy, it
matters whether we categorize chairs as chairs on an all-or-none featural
basis or as a matter of degree (of similarity to a "template," say). I
think the whole line of research based on "family resemblances" and
protoype-matching is wrong-headed and based on misunderstandings about
what features and feature-detectors are; moreover, it begs most of the
questions involved in trying to get a device to perform successful
all-or-none categorization at all. If the existence and use of sufficient
feature-sets is so certain that it's tautologous, tell that to the
ones who seem to be denying it!

>	On the other hand, it might be taken as a significant claim about the
>	nature of the chair-recognition device, viz., that we can understand
>	its workings as a process of actually parsing the input into a set of
>	features and actually comparing these against what is essentially some
>	logical formula in featurese.  This *is* an empirical claim, and it is
>	certainly dubitable: there could be pattern recognition devices
>	(holograms are one speculative suggestion) which cannot be
>	interestingly broken down into feature-detecting parts.

In another response I argue that holograms and other iconic
representations cannot do nontrivial categorization (i.e.,
problems in which there are no obvious gaps in the variation and the
feature-set is complex and underdetermined). I also do not favor
"logical formulas in featurese" (which sounds as if it has gone
symbolic prematurely). A disjunctive feature-detector need not have any
explicit formulas. It could be a selective filter that only passes input
that is, say, red or green; i.e., it could be "micro-iconic," -- invertible
only in red-or-green-ness. I also don't think the representation of
"chair" is likely to be purely sensory; it's probably a higher-order
category grounded in sensory categories. I think there's plenty in
what I claim that is dubitable (hence empirical), if not dubious.
-- 

Stevan Harnad                                  (609) - 921 7771
{bellcore, psuvax1, seismo, rutgers, packard}  !princeton!mind!harnad
harnad%mind@princeton.csnet       harnad@mind.Princeton.EDU