|
To appear in J.W. Minett & W.S.-Y. Wang
(Eds.), Language acquisition, change and emergence: Essay in evolutionary linguistics.
Hong Kong: City University of Hong Kong Press.
Multiple-cue integration in language acquisition: A connectionist
model of speech segmentation and rule-like behavior
Morten H. Christiansen
Christopher M. Conway
Suzanne Curtin
Introduction
Considerable research in language acquisition has addressed the extent
to which basic aspects of linguistic structure might be identified on
the basis of probabilistic cues in caregiver speech to children. In
this chapter, we examine systems that have the capacity to extract and
store various statistical properties of language. In particular, groups
of overlapping, partially predictive cues are increasingly attested to
in research on language development (e.g., Morgan & Demuth, 1996). Such
cues tend to be probabilistic and violable, rather than categorical or
rule-governed. Importantly, these systems incorporate mechanisms for
integrating different sources of information, including cues that may
not be very informative when considered in isolation. We explore the
idea that conjunctions of these cues provide evidence about aspects of
linguistic structure that is not available from any single source of
information, and that this process of integration reduces the potential
for making false generalisations. Thus, we argue that there are
mechanisms for efficiently combining cues of even very low validity,
that such combinations of cues are the source of evidence about aspects
of linguistic structure that would be opaque to a system insensitive to
such combinations, and that these mechanisms are used by children
acquiring languages (for a similar view, see Bates & MacWhinney, 1987).
These mechanisms also play a role in skilled language comprehension and
are the focus of socalled constraint-based theories of sentence
processing (Cottrell, 1989; MacDonald, Pearlmutter & Seidenberg, 1994;
Trueswell & Tanenhaus, 1994) that emphasise the use of probabilistic
sources of information in the service of computing linguistic
representations. Since the learners of a language grow up to use it,
investigating these mechanisms provides a link between language
learning and language processing (Seidenberg, 1997).
In the standard learnability approach, language acquisition is viewed
in terms of the task of acquiring a grammar (e.g., Pinker, 1994; Gold,
1967). This type of learning mechanism presents classic learnability
issues: there are aspects of language for which the input is thought to
provide no evidence, and the evidence that does exist tends to be
unreliable. Following Christiansen, Allen & Seidenberg (1998), we
propose an alternative view in which language acquisition can be seen
as involving several simultaneous tasks. The primary task -- the language
learner's goal -- is to comprehend the utterances to which she is exposed
for the purpose of achieving specific outcomes. In the service of this
goal the child attends to the linguistic input, picking up different
kinds of information, subject to perceptual and attentional
constraints. There is a growing body of evidence that as a result of
attending to sequential stimuli, both adults and children incidentally
encode statistically salient regularities of the signal (e.g.,
Cleeremans, 1993; Saffran, Aslin & Newport, 1996; Saffran, Newport &
Aslin, 1996). The child's immediate task, then, is to update its
representation of these statistical aspects of language. Our claim is
that knowledge of other, more covert aspects of language is derived as
a result of how these representations are combined through multiple cue
integration. Linguistically relevant units (e.g., words, phrases, and
clauses) emerge from statistical computations over the regularities
induced via the immediate task. On this view, the acquisition of
knowledge about linguistic structures that are not explicitly marked in
the speech signal -- on the basis of information that is -- can be seen as a
third derived task. We address these issues in the specific context of
learning to identify individual words in speech. In the research
reported below, the immediate task is to encode statistical
regularities concerning phonology, lexical stress and utterance
boundaries. The derived task is to integrate these regularities in
order to identify the boundaries between words in speech.
The remainder of this chapter presents our work on the modelling of
early infant speech segmentation in connectionist networks trained to
integrate multiple probabilistic cues. We first describe past work
exploring the segmentation abilities of our model (Allen &
Christiansen, 1996; Christiansen, 1998; Christiansen et al., 1998).
Although we concentrate here on the relevance of combinatorial
information to this specific aspect of acquisition, our view is that
similar mechanisms are likely to be relevant to other aspects of
acquisition and to skilled performance. Next, we present results from a
new set of simulationsi that extends the coverage of the model to
include recent controversial data on purported rule-learning by infants
(Marcus, Vijayan, Rao & Vishton, 1999). New empirical predictions
concerning the role of segmentation in rule-like behavior is derived
from the model, and confirmed by artificial language learning
experiments with adult participants. Finally, we discuss how multiple
cue integration works and how this approach may be extended beyond
speech segmentation.
Click here to
download a PDF version.

|