Invited paper presented at a symposium on New Directions in Statistical Learning at the 2002 International Conference on Infant Studies, Toronto, Canada.

Integration Multiple Probabilistic Cues: Going beyond Distributional Statistics



Morten H. Christiansen


Abstract

The statistical learning of distributional information undoubtedly plays an important role in many areas of cognition. However, it also seems clear that distributional learning is likely to be insufficient to explain complex cognitive development. For example, even though transitional probabilities are useful for segmenting speech, such distributional learning cannot provide a complete explanation of infant speech segmentation. Likewise, sensitivity to distributional statistics at the syntactic level (e.g., n-grams) can only partially explain the acquisition of grammatical regularities. In this talk, I suggest that the integration of multiple probabilistic cues may help overcome the limitations of simple distributional learning, providing evidence from a series of connectionist simulations of early syntax acquisition.

When acquiring the syntactic structure of their native language children face a difficult "bootstrapping" problem. Discovering syntactic constraints requires knowledge about grammatical categories; but, conversely, grammatical categories are only useful for acquisition insofar as they support syntactic constraints. Recent work in developmental psycholinguistics suggests that children may solve the bootstrapping problem by exploiting distributional, phonological, and prosodic cues. However, these cues are probabilistic and individually unreliable. The simulations presented in this talk demonstrate how a statistical learning device attuned to phonological and prosodic cues can overcome some of the limitations of a simple distributional learning, resulting in significant improvements in syntactic acquisition.

A phrase-structure grammar was constructed based on previous research and independent corpus analyses. This fairly complex grammar was intended to capture the general syntactic trends prevalent in child-directed speech, including declarative, imperative, and interrogative sentences. Five groups of simple recurrent networks were trained on sentences generated from this grammar. Each group of networks integrated a different constellation of three probabilistic cues: word length (in syllables), stress, and pitch. A baseline network relied entirely on distributional information, with cues added to the remaining groups, including a three-cue network that took advantage of all three cues.

The multiple-cue integration networks all performed better than corresponding trigram models — a standard computational linguistics benchmark for distributional syntax learning. Multiple-cue integration also resulted in faster and more uniform learning when the performance of baseline networks was compared with the cue-augmented networks. Additional simulations demonstrated that the presence of distractor cues uncorrelated with syntactic structure does not prevent the networks from successfully integrating the partially reliable cues. These simulations (and others) underscore the computational feasibility of multiple-cue approach to the acquisition of syntax, and, more generally, the advantage of going beyond simple distributional information to statistical learning through the integration of multiple probabilistic information sources.




rule.gif (155 bytes)

Home | People | Research | Links | Contact | Publications | Presentations
Cognitive Neuroscience Laboratory

Please email suggestions/errors to mhc27@cornell.edu