| |
|
|
Language as Vision: The Ocularcentrism of Chomskyan Linguistics
Chris Werry, Rhetoric & Writing Studies, SDSU
Note: this paper
is based on material in my dissertation, Rhetoric and Reflexivity in
Chomskyan and Cognitive Linguistics (Carnegie Mellon University, 2002). Please do not quote from this
paper. If you would like to read or quote from my dissertation, email
me at cwerry@mail.sdsu.edu.
1.0 Introduction
References to vision pervade Chomsky’s work.
They are a key component of the figures he uses, the examples he provides,
the analogies he makes, and the argumentative warrants supporting his
central claims. When dealing with opponents Chomsky repeatedly exploits
the rhetorical potential of visual analogies and metaphors in order to
construct rebuttals. References to vision and to spatio-visual phenomenon
constitute a key component of the most characteristic rhetorical moves
Chomsky makes, and are central to the way Chomsky defines the project
of linguistics. From Syntactic Structures to his most recently
published texts, Chomsky’s writing is permeated by a constellation of
terms centered on space, vision, optics and form. This is perhaps not
altogether surprising, given that Chomsky is a thinker who identifies
so strongly with Descartes, and who describes his theoretical project
as “Cartesian”. In Modernity and the Hegemony of Vision David
Levin argues that Descartes is the modern philosopher most obviously indebted
to the metaphor of knowledge as spatio-visual, a writer whose work most
clearly exemplifies a discourse that is dominated by an ocular metaphoric.
Levin writes:
For Descartes, darkness is a nightmare. There is
nothing to be learned from entering its domain. He is the philosopher
obsessed with clarity and light. If a discourse in which light, vision,
and its metaphorics are constitutive of its very logic may be called
ocularcentric, then it would be difficult to deny that Descartes’ philosophy
exemplifies ocularcentrism.
Correspondingly, Chomsky is perhaps the post-war linguist most focused
on clarity, light, space and vision. While a range of figurative expressions
characterize generative discourse, ocular metaphors are assigned a place
of particular importance. It might be said that the generativist vision
of language has been articulated primarily in the language of vision.
It has tended to be heavily “ocularcentric”, to use the term coined by
Martin Jay. The assumption that
language exists primarily as something visible; that it can be represented
in terms of comparisons with various forms of visual phenomenon, and that
linguistic analysis can be understood as a kind of visual perception,
has been integral to the objects, concepts, data and methodologies that
characterize generative inquiry, and is deeply embedded in its conceptual
logic. This tendency has shaped the field’s predominant “ways of seeing”
language, and in particular its treatment of grammar.
This paper explores the ocular metaphoric that characterizes generative
discourse. It demonstrates the degree to which vision constitutes the
central analogical figure in Chomsky’s writings. The paper examines the
role visual imagery plays in Chomsky’s main arguments, analyzes why it
is so important to his theoretical framework, and considers the rhetorical
work it is used to carry out. I discuss how ocularcentrism shapes Chomsky’s
understanding of what is central to linguistic inquiry and what is peripheral;
what is “visible” and “invisible” within his theoretical framework, and
what the nature and limits of linguistic inquiry are. The first section
of the paper provides examples that show how pervasively Chomsky makes
use of visual references.
1.1 Examples of Chomsky’s Vision-Centered Vision of Language
Chomsky’s discussions of linguistic theory often begin with references
to the rationalist tradition, and make reference to figures such as Plato,
Descartes, Leibniz and Cudsworth. The rationalist framework upon which
Chomsky’s work is built is articulated and justified largely in terms
of references to vision. When arguing for the basic soundness of rationalism,
and by extension his own theories of language, Chomsky regularly cites
the nature of the visual faculty as the central piece of evidence for
rationalism. For example, in a discussion of rationalism in Knowledge
of Language Chomsky writes that “the Platonistic conception would
suggest that knowledge of a particular language grows and matures along
a course that is in part intrinsically determined, with modifications
reflecting observed usage, rather in the manner of the visual system”
(Knowledge of Language, page 2). In Reflections on Language
Chomsky states that his views of language are based on rationalist
presuppositions, and that while certain aspects of rationalism must be
updated, the overall approach is valid. As evidence for the plausibility
of rationalism, Chomsky cites research indicating that the visual system
is “wired in”, and that human infants are equipped from birth with an
inbuilt ability to perceive and process three-dimensional space. Chomsky
argues that the same is likely true of the linguistic system, and the
idea that language can be considered in terms analogous to a bodily organ
such as the visual system is “quite natural and plausible” (11). The
plausibility of a rationalist theoretical framework is articulated primarily
through references to vision and the visual system.
Chomsky’s discussions of rationalism are typically used to introduce
one of the central claims of generative linguistics, namely that language
is an innate faculty of the human mind. Chomsky’s arguments about the
innate character of linguistic knowledge turn repeatedly on references
to and analogies with vision. His argument for the plausibility of innateness
rests on establishing that vision constitutes a particularly compelling
example of an innate, biologically based cognitive faculty, and that important
similarities exist between language and vision. From
his earliest works to his latest writings, Chomsky uses ocular metaphors
and references to support the innateness hypothesis. In Aspects of
a Theory of Syntax he writes that “on the basis of the best information
now available, it seems reasonable to suppose that a child cannot help
constructing a particular sort of transformational grammar to account
for the data presented to him, any more than he can control his perception
of solid objects or his attention to line and angle.” (Aspects,
59.) In Reflections on Language Chomsky’s arguments for innateness
are based on a series of references to the nature of visual perception:
the “wired-in” character of the visual system; the inbuilt ability of
the infant perceptual system to understand various aspects of 3-dimensional
space; the notion that an innate “grammar of vision” may be built into
the human nervous system (Reflections, page 8.) In Rules and
Representations Chomsky states that the language faculty may be thought
of as an innate “mental organ” analogous to the human visual system (39).
In Language and Thought he asserts, “the child’s language ‘grows
in the mind’ as the visual system develops the capacity for binocular
vision.” (page 29). In a recent interview in which Chomsky outlines the
main components of his theory of language, most of his major points are
made through examples, comparisons, or research having to do with visual
perception. On the issue of innateness, Chomsky writes:
If I say, "He thinks that John is intelligent," you know
that "he" is somebody other than John. If I say, "his
mother thinks John is intelligent," "his" may refer to
John. Those are things we aren't taught. The basic features of language
structure are built into our biological nature. They're like the principles
that determine why mammals see things in a particular way and insects
see things in a different way.
Supporters and expositors of Chomsky’s works frequently use similar language
when discussing innateness. For example, throughout Smith’s book on Chomskyan
linguistics innateness is supported through appeals to the similarity
between language and vision. Comments such as the following are typical:
“in vision as in language our innate endowment is such that minimal input
can give rise to great richness of knowledge. Arguments from the poverty
of the stimulus are comparable in the two domains.” (Smith, page 94.)
Smith writes that development of the language faculty does not proceed
through learning or interaction, but is instead “more like the development
in the child of stereoscopic vision”. (Smith, 117).
Chomsky’s “innateness hypothesis” leaves little room for learning,
experience or practice in the development of linguistic abilities (the
marginal status of learning is signified by the fact that it is often
put in quotation marks.) The
acquisition of language is represented in terms of the unfolding of a
biological program. The nature of this program, and Chomsky’s explanation
of why learning has such a minor role in the overall process, is typically
discussed in terms of analogies with visual perception. For example,
in Rules and Representations Chomsky writes:
If the general structure of binocular vision is genetically determined,
then naturally we must seek to explain its origin in terms of biological
(evolutionary) development rather than in terms of learning. Exactly
the same is true when we turn to cognitive structures or the (unknown)
physical mechanisms that underlie them. If, say, we find extensive
evidence that the principles that underlie the wh-island constraint
belong to universal grammar and are available to the language learner
without experience, then it would only be rational to suppose that these
mechanisms are genetically determined and to search for a further account
in terms of biological development. (209)
The development of linguistic ability is described as activated through
the triggering effects of experience, or, in Chomsky’s later work, as
the setting of parameters. This process of development is also described
and supported largely in terms of parallels with visual development.
For example, in Reflections on Language Chomsky argues that development
takes place via the triggering effects of external stimuli on innate faculties.
Chomsky’s argument for this position is constructed through references
to work on the development of visual perception. He writes that “work
of the past years has shown that much of the detailed structure of visual
experience is ‘wired in’, though triggering experience is required to
set the system in operation”. (Reflections, page 8.) In Aspects
of the Theory of Syntax Chomsky uses figures and examples that center
on visual perception to describe the role that exposure to the environment
has on the triggering of language acquisition. For example, when arguing
that exposure to language is entirely “peripheral” to linguistic development,
Chomsky draws on parallels between the role learning plays in the acquisition
of language, and the role it plays in the “acquisition” of vision in humans
and in animals. He writes:
This distinction [that acquisition requires exposure to the environment,
yet this exposure plays no determining role in development] is quite
familiar outside of the domain of language acquisition. For example,
Richard Held has shown in numerous experiments that reafferent stimulation
(that is stimulation resulting from voluntary activity) is a prerequisite
to the development of a concept of visual space, although it may not
determine the character of this concept…or, to take one of innumerable
examples from studies of animal learning, it has been observed (Lemmon
and Patterson, 1964) that depth perception in lambs is considerably
facilitated by mother-neonate contact, although again there is no reason
to suppose that the nature of the lamb’s ‘theory of visual space’ depends
on this contact. 33-34.
And when Chomsky takes up the issue of the triggering effects of the
environment in Rules and Representations, he turns once again to
the example of visual development. He writes that the factors that shape
the development of language “are on a par with the factors that determine
that a child will have binocular vision”, in the sense that both language
and vision:
develop in the individual along an intrinsically determined course
under the triggering effect of appropriate social interaction and partially
shaped by the environment - English is not Japanese, just as the distribution
of horizontal and vertical receptors in the visual cortex can be modified
by early experience. (Rules, page 44-5.)
The description of language development in terms of visual analogies
is common not just in Chomsky’s work, but also in the wider generative
literature. Consider, for example, Smith’s argument for the viability
of a Chomskyan theory of linguistic development:
We do not ‘learn’ how to see in color or in 3-D…. We obviously need
some visual input for the process whereby we become able to see in this
way to be triggered: being brought up in total darkness will guarantee
that one sees not at all, hence not in 3-D or in colour. Similarly,
we need to be exposed to faces for the appropriate development of the
face recognition module to be triggered; and we need to be exposed to
examples of language in order to acquire normal knowledge of a language.
Smith, page 26.
Smith’s discussions of acquisition, linguistic development, and learning
are couched in similar terms. Smith argues we no more “learn” language
than we “learn” how to see, and uses a series of analogies with visual
perception to construct a distinction between the development of innate,
internal abilities, and the development of abilities derived from external
stimulation. In a statement that typifies Smith’s treatment of language,
he writes: “In vision, the contrast between developing stereoscopic vision
and learning to identify different kinds of moth is clear; in language
the contrast between acquiring the principles of binding theory and learning
the vocabulary of biochemistry is similarly not in dispute.” (133)
Visual references and examples are used not just to advance points concerning
the object of inquiry, but are also used when talking about the theoretical
approach to language adopted by generative linguistics. For example,
when explaining how generative theories of language can simultaneously
consider physical mechanisms, theoretical abstractions, and several other
levels of analysis, Chomsky frequently explains this practice in terms
of a hypothetical approach to the study of vision. For example, after
describing how generative theories of language refer to both physical
mechanisms and theoretical abstractions, Chomsky argues that a similar
situation exists in theories of vision. He writes:
In the same way, a theory of vision might be formulated in concrete
terms, referring, say, to specific cells in the visual cortex, and their
properties; or it might be formulated abstractly in terms of certain
modes of representation (say, images or stick sketches), computations
on such representations, organizing principles that determine the nature
of such representations and rules, and so on. (Rules & Representations,
page 5.)
Chomsky frequently supports the theoretical approach taken by generative
linguistics via analogies with hypothetical theories of vision. For example,
Chomsky states that a generative theory of language is similar to a theory
of visual perception in that both do not concern themselves with evidence
taken from the intuitions or self-report of particular subjects, but must
instead focus on what the subject “actually” knows or sees:
A generative grammar attempts to specify what the speaker actually
knows, not what he may report about his knowledge. Similarly, a theory
of visual perception would attempt to account for what a person actually
sees and the mechanisms that determine this rather than his statements
about what he sees and why. Aspects, page 8.
Furthermore, Chomsky supports key aspects of his theoretical framework
(idealization, abstraction, “systematic ambiguity” between theoretical
and physical objects, the use of small amounts of data taken from introspection)
via analogies and examples of how various studies of vision might, in
principle, proceed. For example, in Rules and Representations he
writes:
There is a familiar morass of problems about just what is meant when
we take a theory to be true: what is the status of its theoretical entities,
its principles, its idealizations…Consider again the study of vision.
Suppose that some series of experiments leads to the conclusion that
particular cells are sensitive to lines with certain orientations.
In this case, no special problems are held to arise, though of course
the conclusion is underdetermined by evidence, the cell is abstracted
from its environment, nothing is said (at this level) about the mechanisms
that might be responsible for what the cell is alleged to be doing,
the results are obtained under highly idealized conditions built into
the experimental situation and apparatus, and so on. (p. 105)
Chomsky presents hypothetical studies of vision in order to legitimate
the methodological principles adopted by generative linguistics. He assumes
that the study of vision is a natural analogue to the study of language,
and that methodological choices that are legitimate in the study of vision
therefore apply also to language. When defending the practice of abstraction
and idealization Chomsky turns to visual analogies particularly often.
The quotation below captures this rhetorical move nicely:
Consider the matter of ‘defining’ such notions as ‘human language’
or ‘French’. The former task poses questions of a familiar sort, as
when we speak of the ‘human visual system’ as distinct from that of
cats or bees. In speaking of these terms, we abstract away from individual
differences and from interconnections among systems, focusing on one
(idealized) element of a complex integrated whole…We might postulate
that some principle for identifying the structure of a body in motion
is a property (perhaps innate) of the human visual system, and that
some principle (say, opacity or locality) is a property of the human
language system. In this respect, it seems to me useful and appropriate
to think of the human visual system or human language as analogous to
an organ or bodily system, and to try to characterize their properties.
Throughout Rules and Representations, the text from which the
above two quotations are taken, issues concerning what is properly scientific,
what are adequate theoretical assumptions, and what the proper methodological
framework for the study of language ought to be, are discussed in terms
of figures, comparisons and hypothetical studies concerning vision.
One of the most characteristic aspects of Chomskyan linguistics is the
way it constructs a fundamental isomorphism between language and vision,
an alignment that works to superimpose research on vision with generative
research on language, and vice versa. This alignment takes many forms.
To begin with, Chomsky singles out research on vision as the only other
area of the cognitive sciences that has had success that is comparable
to generative linguistics’. Thus he writes: “In the specific empirical
areas of the so-called cognitive sciences, some have been doing pretty
reasonably, like the study of vision and the study of language”, and “there
are pretty successful computational theories of, say, vision, and language…they
achieve some pretty surprising things”. In
considering whether “there other areas of human competence where one might
hope to develop a fruitful theory, analogous to generative grammar”, Chomsky
writes that one might “consider the problem of how a person comes to acquire
a certain concept of three-dimensional space.” (Language & Mind,
page 73.) Chomsky argues that visual processing is a prime candidate
for being studied in a “similar way” to language, because the visual system
operates in a way that is fundamentally akin to language (that is, vision
is an innate system that develops “in a more or less uniform way on the
basis of restricted data.”) Chomsky often talks
about vision as if it is amenable to the kind of analyses carried out
by generative linguists. Chomsky is thus particularly interested in the
work of researchers such as Richard Gregory, who have argued that language
and vision are based on common cognitive ground, and thus must be considered
together. Chomsky cites Gregory’s work, and states that this work suggests
that:
There may be a ‘grammar of vision’, rather like the grammar of human
language, and possibly related to the latter in the evolution of the
species. Employing this ‘grammar of vision’ – largely innate – higher
animals are able to ‘read from retinal images even hidden features of
objects, and predict their immediate future states’, thus ‘to classify
objects according to an internal grammar, to read reality from their
eyes’. (Reflections, page 8)
One of the most frequent analogies Chomsky’s makes involves drawing parallels
between language and the faculty (proposed to exist by some cognitive
scientists) that infants have for visually identifying faces. Chomsky
proposes that children possess an innate “grammar of faces,” and writes:
Consider recognition of faces. A person can recognize an enormous
number of human faces and identify a presentation of a single face with
various orientations. This is a remarkable feat that cannot be duplicated
with other figures of comparable complexity. It might therefore be
interesting to try to develop a ‘grammar of faces’, or even ‘a universal
grammar of faces’, to explain these abilities. Perhaps, at some stage
of maturation, some part of the brain develops an abstract theory of
faces and a system of projection that allows it to determine how an
arbitrary human face will appear in a given presentation. Rules and
Representations, page 248.
Some of the most explicit and extended comparisons between vision and
language occur in Chomsky’s discussions of the computational neurophysiologist
David Marr. Chomsky describes Marr’s work on vision as highly compatible
with his own research, as evidence for generative linguistics, and argues
that generative work on grammar and Marr’s work on vision constitute the
most advanced research in cognition. Chomsky reserves his highest praise
for Marr, calling his work “interesting”, “real science”, and “analogous
to what we are doing.” Chomsky
writes:
They [Marr and his group] are interested in developing systems of representation
and levels of representation which will on the one hand have a basis
in physiology, if they can find it, and on the other hand will account
for important perceptual phenomena…what they are doing is, in a sense,
artificial intelligence, but what we are doing is in the same sense
artificial intelligence…what they point out is that there are a number
of different levels of investigation that you might imagine. First
there is the level at which you deal with particular elements like diodes
and neurons, and that is biochemistry, or physiology. Second, there
is a level at which you talk about bigger units, assemblies or groupings
of these units, and that is more abstract. Third, there is a level
at which you talk about algorithms, actual procedures, parsing procedures,
etc. And finally, there is what they call the theory of the computation,
where you try to abstractly characterize the general properties of the
system and how it functions and what its nature is. They argue, plausibly
I think, that the theory of the computation is the most crucial and
also the most neglected level of research, and that the most fundamental
work will be at the level of the theory of that computation. Chomsky
1982, pp. 8 – 10
Chomsky argues that Marr correctly differentiates between levels of conceptual
analysis, just as his own theory does (and in stark contrast to Chomsky’s
critics, who Chomsky suggests unfairly criticize him in this regard.)
Chomsky notes that Marr’s research proposes that vision is innate, algorithmic,
and that the same kinds of restrictions on reflexive access to innate
knowledge characterize both vision and language. Both research programs
are strongly computational and focus on “real systems and their nature.”
(11) Chomsky often expresses a sense of kinship between his work and Marr’s.
He applauds Marr’s criticism of competing areas of AI that concentrate
on too “concrete a level” of analysis, or on human behavior rather than
on more conceptual/systemic considerations; he sympathizes with Marr’s
claim that scientific work on vision has neglected more abstract levels
of analysis, and in particular theoretical approaches that are computational
in character. Chomsky argues that the four levels of analysis described
by Marr provide an important methodological model for generative linguistics,
one that parallels and validates Chomsky’s own methodological choices.
Chomsky writes:
He [Marr] and his colleagues distinguish
four levels:
At the lowest level there, there
is basic component and circuit analysis – how do transistors (or neurons),
diodes (or synapses) work? The second level is the study of particular
mechanisms: adders, multipliers and memories, these being assemblies
made from basic components. The third level is that of the algorithm,
the scheme for a computation; and the top level contains the theory
of the computation.
They suggest that the ‘top level is the most neglected [and] also
the most important’, and that current research is misguided in constructing
algorithms without appropriate prior understanding of the top level.
Adopting this framework, we may consider the study of grammar and
UG to be at the level of the theory of the computation.
Marr’s research is represented as homologous with Chomsky’s in terms
of scientific goals, approach, categories, concepts and methodology.
Marr’s work on vision thus functions as key exemplar, analogy and support
for generative linguistics.
The references Chomsky makes to Marr, along with the various other examples
cited above, help indicate just how central vision is to generative linguistics,
and the degree to which Chomsky constructs language in terms of an equivalence
with vision. One area of Chomsky’s writing that illustrates particularly
well just how powerfully this equivalence functions can be seen in the
sections devoted to rebuttals. Chomsky continually draws on references
to vision and visual perception in the counterarguments and refutations
he constructs (Chomsky is justly famous for his skill in dealing with
opponents, and for his ability to construct rhetorically powerful rebuttals
and refutations. Yet one of the most distinctive characteristics of these
rebuttals – his use of visual analogies, metaphors and examples - appears
to have gone unnoticed by both critics and admirers of Chomsky.) This
is a standard move, a rhetorical strategy that Chomsky returns to with
extraordinary frequency. For
example, in Reflections on Language Chomsky engages in a sustained
debate with the communication theorist P.F Strawson, in which Strawson’s
criticisms are answered with rebuttals based on visual analogies. Chomsky
cites Strawson’s argument that linguistic rules are social, conventional,
public phenomena, and Strawson’s claim that generative linguistics ignores
this. Chomsky considers Strawson’s argument that ‘the function of communication
remains secondary, derivative, conceptually inessential’ in generative
linguistics, and a proper account of learning is not given. Chomsky responds:
The question whether communication is ‘primary’ and ‘conceptually
essential’ is begged throughout Strawson’s counterarguments. Furthermore,
the picture that Strawson rejects as perverse and arbitrary seems quite
reasonable and probably correct…The organism is so constituted that
it acquires a system of language that includes ‘meaning-determining
rules’ (again, perhaps, in interaction with other faculties of mind.)
These rules are then used by the speaker to express his beliefs (inter
alia). The learner has no ‘reason’ for acquiring the language;
he does not choose to learn, and cannot fail to learn under normal conditions,
any more than he chooses (or can fail) to organize visual space in a
certain way…As for the fact that the rules of language are ‘public rules’,
this is indeed a contingent fact. Having acquired the system of language,
the person can (in principle) choose to use it or not, as he can choose
to keep to or disregard his judgments concerning the position of objects
in space. He cannot choose to have sentences mean other than what they
do, any more than he can choose to have objects distributed in perceptual
space otherwise than the way they are…. It is a fact of nature that
the cognitive structures developed by people in similar circumstances,
within cognitive capacity, are similar, by virtue of their similar innate
constitution. Thus we share rules of language with others as we share
an organization of visual space with them. (Reflections on Language,
page 71.)
As is often the case, the force of Chomsky’s counterargument rests largely
on an assumed equivalence between language and vision, and his critics’
failure to appreciate this. He consistently replies to criticism concerning
the methodological adequacy of generative linguistics (the focus on abstraction,
idealized conditions, data based on introspection, etc.) by comparing
the study of linguistic and visual systems. Chomsky argues that the same
issues arise in the study of vision, yet this isn’t considered a problem.
Since language is the same kind of object as the visual system, such criticisms
have no force for generative linguistics. Such a strategy is used throughout
the debates Chomsky conducts in Rules and Representations. For
example, Chomsky outlines the objections commonly leveled at generative
linguistics, and then argues for their irrelevance via an extended comparison
with vision. He concludes his discussion
with the following statement:
Consider again the study of vision. Suppose that some series of experiments
leads to the conclusion that particular cells are sensitive to lines
with certain orientations. In this case, no special problems are held
to arise, though of course the conclusion is underdetermined by evidence,
the cell is abstracted from its environment, nothing is said (at this
level) about the mechanisms that might be responsible for what the cell
is alleged to be doing, the results are obtained under highly idealized
conditions built into the experimental situation and apparatus, and
so on. Suppose next that it is proposed that identification of objects
involves analysis into stick-figures or geometrical structures, though
nothing is said or known about neural mechanisms that might carry out
such analysis. Is the situation fundamentally different in some way,
apart from the physical and chemical properties of the brain? It is
not clear why one should assume so.
(p. 105)
Chomsky’s rebuttal is predicated on the notion that the study of language
and vision are largely equivalent when it comes to methodological considerations.
Similarly, in response to Putnam’s objections to the generative concept
of innateness and the marginal role assigned learning, Chomsky writes
that ‘He [Putnam] makes the tacit assumption that language is cognitive
in a way that vision is not, and hence that discussions of language have
to meet additional criteria of adequacy.’ (172) In Knowledge of Language
Chomsky engages the critiques of cognitivists who take a connectionist
approach to the study of language and mind. Chomsky opposes the connectionist
argument that preexisting structure and innate mechanism are not required
in order to explain linguistic and cognitive structures. His frequent
response to connectionist opponents consists of the counterargument that
they cannot explain vision without reference to innate mechanisms and
preexisting structure, yet they try to do with language. Since language
is a system that is analogous to vision, such an approach must therefore
be flawed. Chomsky
argues against the relevance of functional approaches to linguistic inquiry
with this memorable riposte: ‘The child does not acquire the rule by virtue
of its function any more than he learns to have an eye because of the
advantage of sight.’ (Rules
and Representations, page 231.) And when arguing against the notion
that language is something constructed by its users, Chomsky often fashions
rebuttals in terms of analogies between language and the visual system.
Thus in Rules and Representations Chomsky opposes constructivist
theories (as suggested by Vico and Rorty) in the following terms:
Have we, as individuals, ‘made’ our language? That is, have you or
I ‘made’ English? That seems either senseless or wrong. We had no
choice at all as to the language we acquired...there is no more reason
to think of language as ‘made’ than there is to think of the human visual
system and the various forms that it assumes as ‘made by us’. (Rules
& Representations, page 11.)
For Chomsky, language is self-evidently the same kind of object as the
visual system, and since we do not ‘make’ our visual faculties, it makes
no sense to imagine language as socially constructed.
In general, vision is a central analogic figure in Chomsky’s writings.
His work is full of expressions that suggest fundamental parallels between
the visual and verbal domains. Often these parallels are broad (as for
example when Chomsky writes that research on vision “is highly suggestive
for the study of cognitive structures such as language”, or states that
when it comes to the biological development of the visual system, “comparable
conclusions seem to hold in the case of human language”.) Sometimes they are
specific (as for example when he outlines his computational approach to
grammar, then writes that “by parity of argument, a scientist studying
vision might develop a theory involving certain types of computation and
representation for the identification of objects in motion.”)
Ocular metaphors, examples, and analogies pervade Chomsky’s work, and
references to actual or hypothetical research on visual systems function
as important sources of support, authority and evidence for generative
linguistics. However, Chomsky’s comparisons between language and vision
rest on a very distinctive and selective understanding of vision. In
the sections that follow I examine Chomsky’s vision of vision, consider
where it comes from, and how it is used.
1.2 Chomsky’s Construction of Vision
It is worth looking in some detail at precisely how Chomsky understands
vision, for Chomsky’s comparisons between language and vision rest on
a very particular, and rather idiosyncratic understanding of vision.
He is very selective in the examples and research he cites, and he constructs
a model of vision that affords specific rhetorical uses. In this section
I look at the particular representation of vision Chomsky constructs,
how this relates to Chomsky’s theory of grammar, and consider some of
the rhetorical uses this model of vision is put to.
1.2.1 Chomsky’s Understanding
of Vision is Cartesian and Computational
Descartes’ theory of vision plays an important role in Chomsky’s linguistics.
Chomsky uses it as evidence for innateness, and describes the Cartesian
theory of vision as crucial for the development of computational approaches
to mind. Describing how his work is prefigured by the “Cartesian Revolution”
of the seventeenth century, Chomsky writes:
Inner mechanisms and inner processes appear to be computational systems,
mentally representative and, in some unknown manner, physically instantiated.
But that again is highly reminiscent of something that took place in
the seventeenth century – in particular, Descartes’ theory of vision,
which was a crucial breakthrough and developed a kind of representational,
computational theory of mind.
Descartes’ theory of vision is presented as a “crucial breakthrough”,
and as forerunner to the ‘cognitive turn’ that Chomsky has become synonymous
with. For Chomsky, the Cartesian model of vision is the foundation of
an internalist, computational approach to language and mind. Chomsky
is so interested in Descartes’ theory of vision, and draws on it so frequently
in his descriptions of grammar, because it so neatly fits Chomsky’s heavily
mathematized, computational account of language. The above quotation nicely illustrates
an important chain of equivalences that exist within Chomskyan discourse.
The focus of linguistic science is “inner processes”, which are essentially
“computational systems”.
And computational systems can be likened to the operation of vision, since
vision (in its Cartesian form) is itself computational in character.
Language, vision, and computational systems constitute a set of foundational
concepts in generative linguistics. The generative object of knowledge
is evoked and given shape through a range of figurative comparisons, parallels,
and substitutions involving language, vision and computation. Throughout
Chomsky’s works these three domains are equated with each other to such
an extent that Chomsky often forgoes the use of overt verbal cues signaling
comparison, and will instead simply juxtapose references to language,
vision and computation when constructing a line of argument, as if their
equivalence can be taken for granted. Consider, for example, an argument
about innateness in Language and Mind. Chomsky introduces the
familiar rationalist distinction between inner and outer vision, between
the “eye of the mind” and the “eye of the body,” in order to argue that
knowledge proceeds from internal principles rather than exposure to external
objects. Chomsky endorses Descartes’ notion that when we see a triangle,
our knowledge of the triangle proceeds not from the “figure depicted on
paper,” but from the internal process of mentally envisioning “the authentic
triangle”. Chomsky’s
discussion of innateness connects internal vision of a triangle with computational
processing and generative grammar:
In this sense the idea of the triangle is innate. Surely the notion
is comprehensible; there would be no difficulty, for example, in programming
a computer to react to stimuli along these lines…Similarly, there is
no difficulty in programming a computer with a schematism that sharply
restricts the form of a generative grammar, with an evaluation procedure
for grammars of the given form, with a technique for determining whether
given data are compatible with a grammar of the given form, with a fixed
substructure of entities (such as distinctive features), rules, and
principles, and so on – in short, with a universal grammar of the sort
that has been proposed in recent years. For reasons that have already
been mentioned, I believe that these proposals can be properly regarded
as a further development of classical rationalist doctrine, as an elaboration
of some of its main ideas regarding language and mind.
Chomsky’s discussion of innateness proceeds through a contrast between
‘outer’ and ‘inner’ vision, and its analogue in a hypothetical computer
system. Chomsky suggests that a computer could be made to simulate the
relationship between the exterior (peripheral) and interior (innate) aspects
of visual perception. Correspondingly, a computer could be programmed
to simulate many of the characteristics of universal grammar. Vision,
language and computers are comparable in that each consists of an internal,
formal-symbolic, computational center. The plausibility of this set of
equivalences rests on the assumption that vision should be understood
in neo-Cartesian terms as a computational system, and that vision and
language share fundamental similarities.
Another good example of the way Chomsky equates language, vision, and
computation occurs in Rules and Representations. In the passage
below, in which Chomsky distinguishes between different levels of theoretical
analysis, a fundamental correspondence is assumed between these three
domains. Chomsky writes:
Ultimately, the study of language is a part of human biology. In the
study of any organism or machine, we may distinguish between the abstract
investigation of the principles by which it operates and the study of
the physical realization of the processes and components postulated
in the abstract investigation. Thus, the study of visual perception
might lead to the hypothetical construction of certain abstract components
– for example, feature detectors – that enter in this system. A further
inquiry might reveal the physical mechanisms that meet the abstract
conditions postulated. In studying some automaton, we might attempt
to determine its program at an abstract level, then proceed to inquire
into the circuitry or mechanical principles by which this abstract program
is realized. We may say that the same program is represented in devices
of very different design and constitution. Rules and Representations,
page 226.
The passage is organized around the conceptual dualism “abstract/physical.”
This dualism is developed through a set of parallel linguistic constructions
that align language, vision and computation. Chomsky writes that language
can be studied at two different levels (“abstract investigation of principles”,
and “physical realization of processes”). He follows this with the assertion
that vision can be studied in terms of “abstract components” and “physical
mechanisms”, and that an automaton can be studied in terms of the “abstract
program”, or in terms of “circuitry or mechanical principles”. The correspondence
between language, vision and computation is constructed without any explicit
comparison, but is achieved through parataxis. The degree to which Chomsky
equates language, vision and computation is signaled by the fact that
after introducing the topic of studying language at different levels of
analysis, he then describes potentially parallel studies of vision and
automata, without giving any explanation as to how these three realms
are logically related. The examination of each object is merely juxtaposed,
as if it is obvious that they are conceptually equivalent. Passages such
as the one above, in which language, vision and computation are equated,
can be seen as enthymemes, in both the Isocratean and Aristotelian senses
of the word. Fahnstock
notes that the Isocratean sense of the word “enthymeme” connotes a compressed
recapitulation of assertions already explicit in previously argued material.
Chomsky’s references to language, vision and computation often function
this way – as a kind of short hand for a much larger, more explicit argument
about language, mind, science, and methodology. The passage also functions
as an enthymeme in terms of the more traditional meaning associated with
the term. An Aristotelian “enthymeme” suggests a line of argument constructed
from a truncated syllogism in which a mediating premise is left out.
In the passage above, the missing premise involves the notion that language,
vision and computational systems can in fact be treated as isomorphic.
Many of the central arguments of generative linguistics can be formulated
in the following terms: language is equivalent to vision; vision is a
form of computation, therefore language can be understood as a computational
system and all three objects can be studied in the same way.
1.2.2 Chomsky is Influenced by Marr’s Neo-Cartesian Account
of Vision
Another way in which vision is equated with computation, and both are
connected to language, occurs in Chomsky’s use of the work of David Marr.
As mentioned above, Marr’s work on vision is a central point of reference
for Chomsky. Of all the work done in cognitive science and neuroscience,
Chomsky focuses particular attention on the study of vision. And of all
the work on vision carried out within cognitive science and neuroscience,
Marr’s is selected for particular emphasis. Marr’s influence is apparent
in the explicit references Chomsky makes to him, but it can also be seen
in the many examples and turns of phrase that have their origin in Marr’s
work on vision.
Chomsky is attracted to the work on vision by his colleague at M.I.T.
in large part because it so closely echoes key aspects of the generative
paradigm. Marr’s approach to vision is heavily formal, algorithmic and
computational. Marr
“embarked on a vigorous research program seeking computational insights
into the working of the visual system, putting them to the test of implementation
as computer models”. He
treats vision as an autonomous cognitive system, and maintains that within
this system there are different modules for computing different aspects
of visual information (Marr’s arguments about the modularity of cognitive
systems have been influential in cognitive science). His approach parallels
Chomsky’s, in that it focuses on the level of the theory of computation,
on internal representations and inner processing, and seeks an idealized,
purified version of the object of inquiry. Gardner writes that “just
as Chomsky wished to examine syntax in its pristine form (uncontaminated
by semantics or pragmatics), Marr wanted his analysis of visual processing
insulated as far as possible from the intrusion of ‘real world’ knowledge.”
Marr is heavily Cartesian; he emphasizes the abstract coordinate geometry
of visual perception, and focuses more on the computations that underlie
vision than their material manifestation in either physical or organic
“hardware”. And like Descartes, he equates external perception with mechanical
devices, assuming that both embody the same logic and can be studied in
similar ways. Marr’s work on vision thus functions as an important exemplar,
analogy and support for generative linguistics. Vision is the central
analogy, the root metaphor in generative linguistics, however it is vision
understood in largely computational terms, based on the example of writers
such as Descartes and Marr.
1.2.2 Problems with Chomsky’s Vision of Vision
A problem for Chomsky, and for generative linguistics, is that the
computational model of vision that Chomsky spends so much time comparing
language to has had a relatively brief shelf life in the cognitive sciences.
Marr’s work has not fared well in neurophysiological research and in the
cognitive sciences over the last 15 years. More
generally, the computational-representational model of vision that Chomsky
advances has seen declining support across a range of disciplines. Edelman
and Vaina note that Marr’s computational theory of vision has been associated
with the idea that “constructing an internal model of the world is a prerequisite
for carrying out any visual task. The accumulation of findings to the
contrary in neurobiology and in the behavioral sciences gradually brought
to the fore the possibility that vision does not require geometric reconstruction.
This encouraged researchers to seek alternative theories”. Many of the
alternative theories that have since appeared propose that vision is distributed,
emergent, and nonrepresentational. These theories reject the notion that
vision is an autonomous system that can be understood in terms of traditional
information processing models. Myin
summarizes some of the main precepts shared by nonrepresentationalist
theories of vision:
The heart of the nonrepresentational theory is that vision is seen
as a process of active exploration of the environment, rather than the
elaboration of an internal replica of the external world. The force
of the perceptual poverty of the stimulus argument is denied in this
approach, because vision is seen as a property of the whole animal (in
its environment) and as extended in time. From the point of view of
the whole animal, the stimulus is no longer fragmented. Neither is it
ambiguous, in this theory, because movement in time disambiguates stimuli
that are ambiguous when seen from a stationary position.
More generally, the view of “the nervous system as a modular, information-processing
machine” has come under attack from connectionist and “dynamical systems”
approaches to cognition. For example, Thelen and Smith write that “recent
neurophysiology and developmental studies have turned this eminently plausible
[the modular-information processing paradigm] view on its head…studies
show vast and previously unimagined networks of interconnections both
within and among anatomically distinct areas. There is the primary sensory
organization within modalities, for example for color, form, and motion
in the visual system.” They
write that there is
strong evidence of convergence of sensory information…multiple areas
where projections from the visual, auditory, and somatosensory cortices
converge…Indeed, some of the neural structures showing intersensory
integration are subcortical, early maturing, and phylogenetically old.
In their elegant book, The Merging of the Sense, Stein and Meredith
(1993) build a compelling case for sensory convergence as a fundamental
and enduring characteristic of animal nervous systems, universal in
all phyla, and occurring at many levels of the neuraxis.
Chomsky has been consistently hostile to connectionist and dynamic accounts
of vision, cognition and language, continuing to ally his theory of language
with modular, information processing models.
Chomsky has received a certain amount of criticism from linguists, psychologists
and cognitive scientists for hitching his theoretical wagon to what might
be deemed “outdated” paradigms. As far back as 1980, in a roundtable
discussion of Chomsky’s work in The Behavioral and Brain Sciences,
critics attacked what they saw as an anachronistic account of biology
and cognition. A number of the respondents argued that Chomsky’s focus on separate, modular systems, rather than
the more integrated terms of modern biological research, risked
resurrecting Gall’s organology.
Lakoff’s charges are among the most severe. He argues that the biological
analogies and metaphors Chomsky uses, and in particular the analogies
with vision he constructs, focus exclusively on “the biology of separate
systems”. Lakoff argues for the
relevance of modern biological models that foreground interconnection,
particularly between vision and language. He writes: “For me, the most
exciting question at present is what similarities there are in the various
cognitive faculties - language, thought, vision, motor control, and so
on.” (page 23) He also argues that Chomsky tends to base his biological
analogies on scientific work that is rather antiquated, since such work
tends to treat biological systems as isolated (Lakoff writes: “It is particularly
striking that the biological analogies Chomsky uses come not from contemporary
biology – molecular biology, genetics, and so on – but from earlier biology
– the biology of separate systems – the circulatory system, the visual
system, and so on.”) While Chomsky’s opponents
make some important points, much of their criticism consists of charges
that generative linguistics lacks proper scientific legitimacy (ironically,
this is a rhetorical strategy used often by generativists against opponents
within linguistics.) What is ultimately of most interest to me is not
so much whether Chomsky has accurately based his arguments on the most
up to date research on vision and cognition. What does seem worth examining
is the selectivity with which Chomsky draws on scientific work
on vision (along with other areas of biology). Chomsky draws heavily
on Marr’s work, ignoring most other work on the neurophysiology of vision.
And as research on the neurophysiology of vision has moved away from Marr’s
approach, Chomsky’s analogies to the study of vision have not lessened,
but have become more general and abstract. References
to research on vision are much more likely to be formulated in terms of
hypothetical cases and generalized examples. A more relevant criticism
of Chomsky’s use of biological and visual analogies can be found in the
way he reconstructs research to fit his particular agenda. This brings
us to the topic of how vision is used as a rhetorical strategy in generative
linguistics.
1.2.4 Constructing
Vision as ‘Generative’
When describing how language is comparable to vision, Chomsky constructs
vision in terms of the language and categories of generative linguistics. A
good example of this can be found in Aspects of the Theory of Syntax,
where Chomsky argues that parallels can be drawn between vision and language
with respect to the way development proceeds in infants. Citing the work
of Held, Lemmon and Patterson on the visual development of children and
newborn lambs, Chomsky writes:
Richard Held has shown in numerous experiments that reafferent stimulation
(that is stimulation resulting from voluntary activity) is a prerequisite
to the development of a concept of visual space, although it may not
determine the character of this concept…or, to take one of innumerable
examples from studies of animal learning, it has been observed (Lemmon
and Patterson, 1964) that depth perception in lambs is considerably
facilitated by mother-neonate contact, although again there is no reason
to suppose that the nature of the lamb’s “theory of visual space” depends
on this contact. 33-34.
Chomsky describes the work of Held, and Lemmon and Patterson as confirming
his own hypotheses about the development of language. Yet he does this
by extrapolating from their research, and by redescribing it in terms
of the concepts and categories of generative linguistics. For example,
nowhere do these researchers discuss “the development of a concept of
visual space”, or of an animal’s “theory of visual space”. Nor do they
argue that environmental stimulation exerts no influence on the development
of a ‘concept’ of visual space. In a critique of Chomsky’s use of this
research on vision, De Beaugrande argues that the three researchers in
fact merely state that they observed the “facilitation” of development
by external stimulation, and “not its irrelevance to some theory of visual
space.” De
Beaugrande argues further that “the scientific, biological finding does
not support his [Chomsky’s] thesis except through his own non-biological
interpretation,” and he accuses Chomsky of distorting the work he cites.
While there is much truth to De Beaugrande’s charge, what is more interesting
from a rhetorical perspective is the way Chomsky’s treatment of these
researchers illustrates several key argumentative strategies. Chomsky
constructs vision and the study of the visual system in terms of the conceptual
framework of generative linguistics in order to be able to argue several
things: that the study of vision confirms his research on language; that
language is a cognitive faculty just like vision, and is amenable to the
same kind of analyses; and that linguistics ought to model itself on the
study of vision (which just happens to be a mirror-image of the generative
approach to language). One can identify such
rhetorical moves in a range of Chomsky’s writings. Consider, for example,
this passage from Rules and Representations. Chomsky proposes that
the study of language be modeled on the study of vision. He writes:
How can we proceed to investigate the properties of language? To clarify
the issue, we might think about the less controversial task of studying
the physical structure of the body. A rational approach would be to
select some reasonably self-contained physical system of the body…Consider
the kinds of questions we might ask about an organ of the body – say
the eye, or more broadly the visual system regarded as an organ. We
might organize our inquiry along the following lines:
(1) (a) function
(b) structure
(c) physical basis
(d) development in the individual
(e) evolutionary development.
Chomsky quickly rejects (1) (a) as a viable topic of study, and proceeds
to talk about an analysis of the visual system based on the other questions,
spending most of his time on 1 (b). Chomsky writes that we ought to begin
with “some characterization of the structure of the visual system at the
abstract level”, and might then proceed to a set of questions about the
“structure of the visual system”, how physical mechanisms relate to structure,
development in the individual, etc. This is all couched in terms of the
language of generative linguistics, as can be seen in the passage below:
The organism begins in some genetically determined initial state common
to the species with variations we may ignore at the outset. It passes
through a sequence of states until it attains a mature final state which
then undergoes only marginal further change. This ‘steady state’ is,
it seems, attained at some relatively early stage in life. But though
the organ of vision is essentially fixed in structure at that time,
we may still ‘learn to see’ in new ways throughout our lives, for example,
by applying knowledge gained later in life or through exposure to some
new form of visual representation in the arts, say cubism.
The development of the visual faculty is discussed in terms of an “initial
state”, “steady state,” and “final state,” familiar entries in the generative
lexicon. It is said that “variation” in visual processing can be ignored.
While the visual faculty is largely fixed, a person may produce novel
visualizations and “learn” to see in new ways. Having described the study
of vision in terms of the approach, methodology, categories, and questions
posed within generative linguistics, Chomsky then proposes that linguistic
inquiry ought to be modeled on the study of vision. That is, Chomsky
argues that we should model linguistics on the study of visual systems
represented in terms of the generative framework. After outlining what
might be dubbed a “Chomskyan theory of vision”, Chomsky writes: “Suppose
that we attempt to study language on the model of a bodily organ [the
visual system], raising the questions (1a) – (1e).” (Rules and Representations,
page 229.) Chomsky then outlines how the study of language might be based
on the “model” provided by research on visual systems – that is, he describes
the project of generative linguistics.
1.3 Vision, Knowledge & the Study of Grammar
One of the main
reasons that a visual metaphoric plays such an important role in generative
linguistics is that Chomsky conceptualizes grammar largely in terms of
knowledge, and he associates knowledge with vision. There is a long tradition
in the West of representing knowledge in ocular terms. For Plato, true knowledge is modeled on an act
of vision – that of ascending to a realm in which the essence of things
can be clearly seen, or of seeing behind the surface of things in order
to discover the underlying regularities. (Eidos and idea,
the words Plato gives for the Forms, both derive from the verb idein,
‘to see’.) In the Phaedrus Plato describes true knowledge as “seeing
that which is well ordered and ever unchangeable”; in the Meno Plato
describes knowledge as anamnesis, or the recollection of the soul’s experience
of seeing the Forms in their pure state. Aristotelian philosophy is
similarly visualist. In the first sentence of the Metaphysics,
Aristotle writes: “Of all the senses, trust only the sense of sight”,
and he defines vision as the “most noble sense” because it is the faculty
that most closely resembles and guarantees true knowledge. Dewey argues
that in Western philosophy since Plato and Aristotle “the theory of knowing
is modeled after what was supposed to take place in the act of vision”. Rorty writes that the “ocular metaphor seized
the imagination of the founders of Western thought”, and a focus on knowledge
of universal concepts contributed to make “the Eye of the mind the inescapable
model for the better sort of knowledge.” (Rorty 1979, p. 38-39) Arendt
echoes these claims, stating that ‘from the very outset, in formal philosophy,
thinking has been thought of in terms of seeing…The predominance of sight
is so deeply embedded in Greek speech, and therefore in our conceptual
language, that we seldom find any consideration bestowed upon it, as though
it belonged among things too obvious to be noticed.’ Martin Jay argues that vision is “the master trope
of the modern era”. And in Wittgenstein’s later writings he considers
how our language may predispose us to think about the world in visualist
terms (“a picture held us captive. And we could not get outside it, for
it lay in our language and language seemed to repeat it to us inexorably.”)
Nowhere is the representation
of knowledge and mind more strongly visualist than in Cartesian philosophy.
The metaphor of the mind as inner vision is central to Descartes’ conception
of knowledge and cognition, and figures prominently in Chomsky’s writings
also. Descartes states, "We shall learn how to employ our mental
intuition by comparing it with the way that we employ our eyes",
and “when the mind understands, it in some way turns towards itself and
inspects one of the ideas which are within it.”.
Descartes represents knowledge as a purified internal vision situated
in the mind that stands in contrast to an impure, external vision that
operates through the body. Rorty notes that this distinction dates back
to classical Greek philosophy, where knowledge of universals and particulars
are talked about primarily in terms of two forms of vision. The “eye of
the mind” is associated with “knowledge of the highest and purest things:
mathematics, philosophy itself, theoretical physics, anything which contemplates
universals.” By contrast, “the eye of the body knows particulars by internalizing
their individual colors and shapes.”
Chomsky draws often
on the work of Descartes and other rationalist writers to explain his
understanding of knowledge and language. Chomsky writes that the rationalist
account of knowledge, once updated and “purged of the error of preexistence”
provides a fruitful and plausible framework for understanding cognition. Chomsky talks of the rationalist assumption that
the ‘natural light’ of common sense can lay bare the basic elements of
human reasoning. (Reflections on Language, page 244).
Chomsky’s discussions of knowledge and the mind are strongly Cartesian,
and even include the familiar Cartesian distinction between the “Eye of
the Mind” and the “Eye of the Body”. Consider, for example, the following
statement in which Chomsky describes the mind in strongly visualist terms,
and in which he contrasts “inner” and “outer” vision in a way that closely
resembles the schema described by Rorty above. Chomsky writes:
The eye perceives, but the mind can compare, analyze, see cause-and-effect
relations, symmetries, and so on, giving a comprehensive idea of the
whole, with its parts, relations, and proportions.
Chomsky’s concept
of knowledge is modeled on the visual perception of conceptual objects
in a field of ordered, geometric, Euclidian space. The mind “sees” objects,
parts and relations, and is predisposed to view these objects in terms
of relationships of symmetry and proportion. Chomsky follows Descartes
in assuming that there exists a “natural
geometry of the mind”. He endorses Descartes’ notion that “the mind is
so constituted that it constructs regular geometric figures as “exemplars”
for the interpretation of experience.” (Rules & Representations,
page 38). Chomsky writes:
Descartes and Cudworth believed the mind to be endowed with the principles
of Euclidian geometry as an a priori property. We see a presented irregular
figure as a (possibly distorted) triangle, straight line, circle, and
so forth, because our minds produce these figures as ‘exemplars’, just
as ‘the intelligible essences of things’ are produced by ‘the innate
cognoscitive power’. In Kant’s phrase, objects conform to our ‘modes
of cognition.’ Rules and Representations, page 246
There are thus as
Chomsky puts it “first principles of geometry” that are internal to the
mind – the mind both constructively envisions and makes visible objects
of a certain orderly character, and inclines us to understand the objects
we perceive with our external senses as ideals. Chomsky sometimes elaborates
on this metaphor of the mind as a space of inner vision by drawing on
another strongly optical metaphor, that of visually deciphering a piece
of writing. For example, citing Cudsworth, Chomsky states in Reflections
on Language:
The ‘book of nature’, then, is ‘legible only to an intellectual eye’….just
as a man who reads a book in a language that he knows can learn something
from the ‘inky scrawls’.
The ocularcentrism
of rationalist philosophy is frequently expressed in terms of analogies
with written language, and Chomsky follows figures like Descartes, Cudsworth
and Berkeley in this regard (I take up this aspect of Chomsky’s ocularcentrism
in more detail in chapter 5 of my dissertation, Rhetoric and Reflexivity
in Chomskyan and Cognitive Linguistics.)
In the sections that follow
I examine three aspects of the way Chomsky’s epistemological premises
are grounded in visual analogies. First, I will consider how Chomsky
constructs both the subject and object of knowledge in terms of vision.
Second, I will examine how language is defined as a mirror of knowledge,
with both language and knowledge represented in terms of visual paradigms.
Third, I will address how Chomsky’s visualist representation of knowledge
defines some approaches to and aspects of language as knowable/visible,
and others as unknowable/invisible.
1.3.1 Chomsky’s
Ocularcentric Construction of the Subject and Object of Knowledge
Generative linguistics shares many of the tendencies that characterize
ocularcentric disciplinary discourses. This includes the epistemological
ideals of objectivity, certainty, transparency, universality, subject-object
duality, and corporeal transcendence. These ideals are embedded in the
visualist language and imagery Chomsky uses to discuss knowledge, language
and mind, and profoundly shape Chomsky’s understanding of the subject
and object of linguistic study. In Chomskyan linguistics knowing is seeing,
and the object to be known is best understood not only in terms of visual
perception, as a visually manifest phenomenon that can be observed with
clarity, determinacy and certainty, but ultimately as itself a kind of
vision. Chomsky’s ocularcentric construction of the subject and object
of knowledge is evident in the key analogies he uses to describe the perspective
from which an idealized knower could see/know language in its totality,
in its true essential form.
Chomsky frequently argues that while on the surface language seems ‘complex and defective’, mired in ‘difficult and
murky contingencies,’ this is only due to the distorting influence of
its physical realization in the body, and our inability to adopt a properly
scientific and objective perspective. Chomsky
writes that ‘this apparent variety
and complexity is superficial, reducing to minor parametric differences.’ The reason language
appears ‘imperfect’ is that it must be materialized in the ‘sensorimotor
system’. Chomsky states:
A large range of imperfections may have to do with the need to "externalize"
language. If we could communicate by telepathy, they would not arise.
The phonological component is in a certain sense "extrinsic"
to language, and the locus of a good part of its imperfection. (Chomsky,
1997b)
Chomsky associates ‘imperfection’, and deviation from
the ideal, with the body, and with the requirement that language be voiced.
He argues that the imperfections of language caused by its embodiment
would disappear if humans communicated by telepathy - that is, without
a body. The distorting effects of our physical embodiment mask the unity
and singularity of the underlying language system. In ‘Minimalist Explorations’
Chomsky states that the sensorimotor system is ‘extraneous to language,’
a ‘nuisance’ that is ‘imposed by external systems,’ and that ‘the one
unique computational process’ would become apparent ‘if we could think
and communicate by telepathy.’ He jokes that ‘if
you were God’ it would be obvious that the ‘imperfections’ of language
result from the vagaries of its physical realization.
That is, if one could occupy the perspective of God, for whom the perfection and design of human cognition
is visible in its totality, or if we were unshackled from the distorting
influence of the body (a condition often associated with the divine),
we would be able to perceive the essence of human language. Later
in this same text, Chomsky states
that if we could approach language with the objectivity afforded a Martian,
a being that has not experienced the cultural conditioning humans undergo,
a comparable clarity of vision might be achieved. He writes that
‘Martians looking at humans would say there’s one language with a bunch
of lexical exceptions.’ Similarly, in Language and Thought Chomsky
writes that the ‘computational system’ underlying language is invariant,
and it follows that ‘there is only one human language, as a rational Martian
observing humans would have assumed’. (page 50.) In “Language and Mind:
Current Thoughts on Ancient Problems,” Chomsky writes:
In their essential properties, languages are cast to the same mold.
The Martian scientist might reasonably conclude that there is a single
human language, with differences only at the margins. For our lives,
the slight differences are what matter, not the overwhelming similarities
which we unconsciously take for granted. No doubt frogs look at other
frogs the same way. But if we want to understand what kind of creature
we are, we have to adopt a very different point of view, basically that
of the Martian studying humans.
And in “Models, Nature
and Language” Chomsky states:
There is fairly good reason now to believe that in a certain, rather
deep sense, there is only one human language. If a Martian scientist
looked at us the way that we look at frogs he might well conclude that
with marginal, minor modifications, there is only one language. You
and I might say "tree," and a German would say "baum,"
but we're using basically the same concepts from the same inventory,
which is both rich and restrictive. (Chomsky 1994, p. 173.)
Chomsky argues that we are confused by the
existence of different languages and by variation in language use because
we cannot adopt the proper perspective that will allow us to see the object
as it really is. The ‘view from outer
space’ presumably enables the hypothetical Martian to see through the
surface irregularity and variation of language. A Non-terrestrial
Being that is outside human history, culture, and social life, free of
earthbound attitudes and prejudices, able to observe human language from
a privileged, culturally ‘uncontaminated’ perspective, would see that
there is only one underlying language system.
Chomsky also equates the perspective of the Martian with the vantage point
afforded the child acquiring language. He writes that ‘from the Martian
point of view,’ as well as from ‘a child’s point of view,’ languages are
‘essentially identical’ (Chomsky 1995b). Chomsky argues that “languages
must look identical from the child’s point of view”, since ‘otherwise
it’s impossible to learn any.’ The speed and precision with which children
pick up new words “leaves no real alternative to the conclusion that the
child somehow has the concepts available before experience with language
and is basically learning labels for concepts that are already part of
his or her conceptual apparatus.” Like God and
the Martian, the child is represented as encountering language in an idealized,
pristine form, his ‘knowledge’ of language untainted by experience and
bodily realization. In each case a universal, totalizing vision constitutes
idealized knowledge of language, and connects us to the truth as it distances
us from the body, history, culture and society. Objectivity, certainty
and knowledge of the ideal are associated with the visual apprehension
of an object by a transcendental subjectivity. Chomskyan
linguistics thus privileges what Nietzsche describes as ‘an eye outside
time and history, an eye that no living being can imagine, an eye required
to have no direction, to abrogate its active and interpretive powers’. Rorty notes that Western
philosophy is characterized by the visualist ‘wish to see the world from
above… as spectator of time and eternity,’ and has sought to constitute
itself as a ‘discipline which lets us stand over and against the world
of everyday practice by seeing it as God sees it, as a limited whole.” Chomskyan
linguistics is characterized by similar aspirations, and is animated by
a similar set of root metaphors. Chomsky’s descriptions of idealized
knowers are permeated with the language of spectatorial epistemology –
‘observation’, ‘perspective,’ ‘point of view,’ ‘look,’ ‘see,’ ‘focus,’
etc. In the examples above, and in much of the rest of his work, knowledge
is figuratively characterized as a visual enterprise. The object of inquiry,
language, is represented in correspondingly ocular terms, and is assigned
many of the same characteristics, including disengagement, objectivity,
disembodiment, certainty and universality.
Chomsky’s investment in optical metaphors, analogies, evidence and examples
shows up at every level of his work. His philosophical commitments are
closely connected to the way vision has been culturally constructed, and
to the specific material properties assigned it in the context of its
use as a metaphor for knowledge.
For example, vision is typically assumed to be the most immaterial of
the senses.
Visual interaction with an object is thought not to involve the kind of
incorporation associated with touch or taste, taking place as it does
via the seemingly immaterial medium of light.
Philosophers such as Hans Jonas who study the phenomenology of vision
have suggested that sight is preeminently the sense of simultaneity, enabling
the subject to instantly survey a wide visual field. Furthermore,
while each mode of perception maintains objects at a distinctive distance
from the body, vision is typically understood as the most distancing.
Echoing Jonas’ arguments about vision, Fabian writes:
Vision requires distance from its objects;
the eye maintains its ‘purity’ as long as it is not in close contact
with ‘foreign objects’. Visualism, by instituting distance as that
which enables us to know, and purity or immateriality as that which
characterizes true knowledge, aimed to remove all the other senses and
thereby the body from knowledge production. Fabian 1993, p. 99.
Ong makes a congruent argument,
stating that “of all the senses sight is the most distancing sense: it
requires always that eye and object be removed to a considerable extent
from one another.’
Sight is also considered the most disengaged sense, the least attuned
to temporality and the body.
The externality of sight allows the observer to avoid contact with the
object of perception. Vision is thus the sense most associated with objectivity
and disengagement. Keller and Grontkowski state:
Although itself one of the senses, by virtue of its apparent incorporeality,
it [vision] is that sense which most readily promotes the illusion of
disengagement and objectification. At the same time it provides a compelling
model for intangible communication. (p. 213)
Houlgate notes that Dewey’s
critique of the spectator theory of knowledge centers precisely on the
assumption of disengagement, on the idea that ‘the aim of philosophical
or scientific inquiry is to come to know reality - as we seem to do in vision - without in any way interfering with it or
modifying it through practical activity.’
In Chomskyan linguistics the ideal knowing subject is represented as an
all-seeing, all-knowing observer who occupies a ‘view from nowhere,’ and
above all, who is disengaged and objective. The perspective of God, a
Martian, and the child is deemed perfect precisely because it is knowledge
that is removed from the world.
Vision has been constructed
as the sense which registers objects of perception instantaneously. Visual
metaphors for knowing suggest immediacy, unity, and unmediated access
to reality. Rorty states that ocular theories of knowledge seem
to offer the hope of an ‘immediacy which would make discourse and description
superfluous.’ (Rorty 1993, p. 375) Visual metaphors for knowing are frequently
used to repress the role of language in knowledge production and suggest
that understanding can proceed in an essentially unmediated way. Ocular
metaphors efface the importance of rhetoric and reflexivity by suggesting
that knowledge occurs instantly, at a glance, unmediated by language.
Ocularcentric discourses tend to background the metaphorical status of
vision (Derrida writes that with ‘photocentric’ discourses description
is conflated with, or projected onto, the object it is supposed to understand,
and that as a result ‘structure becomes the object itself.’)
Correspondingly, Chomskyan linguistics is characterized by a constant
drive to establish immediacy and efface signs of constructedness and mediation.
It seeks to escape the need for language to talk about language. This
rhetorical imperative depends on the conflation of the theoretical object
and the ‘natural’ object, on closing the gap between the object and the
language used to represent it. Chomsky represents knowledge in terms
of inner vision and in terms of the act of observation by a transcendental
subjectivity. In each case knowledge is immediate, disengaged, disembodied
and unmediated. Furthermore, the subject and object are once more aligned
by making the act of knowing and the object of knowledge similar in many
key areas. Knowledge and the object of knowledge are described primarily in terms of
vision, and both in their ideal form are dematerialized, immediate, unmediated
and noncorporeal. This tendency is also identifiable in Chomsky’s discussions
of the essence of language understood as a form of telepathy - which if
not exactly a form of vision, is associated with the same qualities attributed
to vision. As telepathy, communication is unmediated, transparent, disembodied
and immediate. The properties typically assigned vision are associated
with pure communication understood in terms of pure thought. In generative
linguistics perfect knowledge thus mirrors perfect communication - a disembodied
eye can perceive disembodied communication in its essential form.
Vision has also been associated
with certainty, universality, and with what is self evidently true. It
has been depicted as atop the hierarchy of senses with respect to reliability,
and is often presented as the most universal sense (in the sense that
perceiving subjects are assumed to see the same thing). Rorty talks of
how Western philosophy has sought to associate knowledge with the certainty
that appears to be provided ‘when staring at an object,’ (Rorty
1993, p. 159) and Descartes writes
that “sight is the most universal and the most noble of the senses.’
Chomsky consistently associates the certainty attributed to vision
with knowledge of language (references
to vision in generative linguistics thus play a role somewhat akin to
Dr Johnson’s stone.) When confronted with skeptics or dissenters, Chomsky
argues that knowledge of language (as well as language as knowledge) is
as certain and well-grounded as knowledge of the visual system. For example,
when arguing against the notion that linguistic rules are cultural and
variable, and that linguistic meaning is conventional and context-dependent,
Chomsky writes that a language user
cannot choose to have sentences mean other than what they do, any more
than he can choose to have objects distributed in perceptual space otherwise
than the way they are…. It is a fact of nature that the cognitive structures
developed by people in similar circumstances, within cognitive capacity,
are similar, by virtue of their similar innate constitution. Thus we
share rules of language with others as we share an organization of visual
space with them.
In passages like
the one above the universality and certainty commonly attributed to visual
perception is associated with language in order to authorize generative
linguistics and naturalize the object of inquiry. Chomsky’s frequent
appeals to the certainty of visual perception and the visual system thus
function as what Potter et al call an “undeniability device.” That is,
references to vision constitute rhetorical strategies designed to establish
beyond any doubt the existence of a brute reality, something external
to talk, discursively unmediated. Like hitting the furniture, kicking
a rock, invoking death, power or the Holocaust, Chomsky’s statements about
the distribution of objects in ‘perceptual space’ (in other words, the
undeniability of what we see) are used to establish that which cannot
be denied, a bedrock nonverbal reality that is present, self-evidently
true, an ultimately trustable form of experiential realism. Potter et
al analyze some of the most common undeniability devices, arguing that
each inevitably fails in its attempt to escape representation. For example,
they examine the ‘furniture argument’ (“The realist thumps the table.
What a loud noise! Much louder than talk. Much more gritty. Much more
real. And yet we insist that this noise, being produced in this
place, at this time, in the course of this argument, is
an argument, is talk”)
sometimes invoked by realists. They write:
The Furniture argument, as the argument of no argument, purports to be the
one that ends the rhetoric, is above rhetoric, and demonstrates its
limits: it is "the naked truth," unconstructed, unsupported,
unclothed, needing no allies. The counter to this is to name it as a
device, a rhetorical construct, occasioned and deployed. For example,
we can place it amongst a set of similar devices for accomplishing undeniability.
In discourse, the Furniture device shares features with other rhetorical
ploys that, difficult to undermine in themselves, are deployed as shields
behind which some rather more vulnerable entity is placed; thus positioned,
they lend their robustness to some more contentious issue. Page 3.
Chomsky’s appeals
to the undeniability of visual perception might similarly be dubbed “the
Vision argument.” References to vision function to shield the ‘more vulnerable
entity’ of language, to establish its certainty, its status as a bedrock
nonverbal reality that is present, self-evidently true, discursively unmediated.
Idealized knowledge of language is depicted in terms of inner vision that
delivers universal understanding, or in terms of idealized observers who
occupy a vantage point that enables universal understanding. The linguistic
examples that accompany Chomsky’s description of ideal knowledge, and
which Chomsky uses to establish the universality and certainty of the
object of inquiry, also depend on visualist assumptions about language
and knowledge. Consider, for example, the passage cited earlier describing
a Martian view of language:
If a Martian scientist looked at us the way that we look at frogs he
might well conclude that with marginal, minor modifications, there is
only one language. You and I might say "tree," and a German
would say "baum," but we're using basically the same concepts
from the same inventory, which is both rich and restrictive. (Chomsky
1994, p. 173.)
Chomsky asserts that objective knowledge
would reveal a single, universal conceptual system that underlies and
is reflected in language. The universality of the conceptual and linguistic
system is suggested through the example of German and English words for
the concept “tree.” What is interesting about the example is that like
many other arguments for conceptual universals, it focuses on an individual
noun, one denoting a physical object with concrete physical characteristics,
and which in many contexts of use connotes availability to visual perception.
The plausibility of Chomsky’s example rests on an implicit appeal to the
universality of thought and language understood in terms of visual perception.
Chomsky does not consider a word expressing process, sensation, or some
culturally specific practice or expression.
His example presupposes that when an English speaker hears the word “tree,”
s/he has in mind a mental image that can be compared and found equivalent
to the German word “baum”. Chomsky thus assumes that words and sentences
function primarily as pictures, rather than as the later Wittgenstein
would argue, as tools integrated into the fabric of human action and life.
His example presupposes Aristotle’s visualist dictum that “words are the
image of thought”,
and Wittgenstein’s remark that “the proposition is a picture of reality.” Such a model of language
clearly depends on an assumed equivalence between thought, knowledge and
vision – if, for example, we imagine thought and knowledge in terms of
an alternative set of modal metaphors, the plausibility of such a universal
linguistic and conceptual system appears less convincing.
Generally speaking, philosophical and linguistic appeals to universality
depend at some level on the epistemological certainty and universality
that has come to surround visual metaphors. Much of the suspicion toward
visual figures identifiable in poststructural and postmodern theories
centers on the claims of universality that often accompany visualist language.
Becker has produced an interesting interrogation of arguments for conceptual
and linguistic universals, one that can also be read as a critique of
visualist ontology. Becker examines the se |