Tid: Torsdag 21 maj 2015, kl. 15:00–17:00
Plats: C307, Södra huset, Frescati

Postseminarium följer direkt efter seminariet i institutionens pentry.

Abstract
The child learning language is faced with a difficult problem: given a set of specific linguistic observations, the learner must infer some abstract representation (a grammar) that generalizes correctly to novel observations and productions. In this talk, I argue that Bayesian computational models provide a principled way to examine the kinds of representations, biases, and sources of information that lead to successful learning. As an example, I discuss my work on modelling word segmentation. I first present a computational study exploring the effects of context on statistical word segmentation.  In this study, a model that assumes words are statistically independent (as in the stimuli used in many human experiments) is compared to a model that defines words as units that help to predict following words. I show that the context-independent model undersegments the data, while the contextual model yields much more accurate segmentations, outperforming previous models on realistic corpus data. This difference suggests the need to consider contextual effects in infant word segmentation.

Simulations using corpus data provide insight into the kinds of information that are useful for learning, but it is also important to address the question of whether model predictions are consistent with human learning patterns.  In the second part of this talk, I present results from experiments inspired by the well-known statistical segmentation studies of Saffran et al. (1996), but where the stimuli were varied between subjects to modify the difficulty of the task. The Bayesian model described above correlates better with human patterns of difficulty than any other model tested, suggesting that this model does indeed capture important properties of human segmentation.

Hjärtligt välkomna!

Mats Wirén och Ljuba Veselinova

Vidare information om föreläsaren:
Sharon Goldwater (Institute for Language, Cognition and Computation, School of Informatics at University of Edinburgh)

Fredagen 22 maj 2015 är Sharon Goldwater opponent vid Robert Östlings disputation: "Bayesian Models for Multilingual Word Alignment"