Psycholinguistics Seminar


Jeff Lidz & Colin Phillips

Tuesday 2-5pm

MMH 1108

Spring 2006


The problem

Learning and parsing share a certain abstract structure. Learners must determine the structure of their language on the basis of partially ambiguous data. Likewise, parsers must determine the structure of a sentence on the basis of incomplete or ambiguous data. This kind of "reasoning under uncertainty" lends itself to solutions from the domain of probability theory. Looked at from another angle, learners and parsers have been shown to be sensitive to statistical information under various experimental conditions. Many have argued that the successes of probabilistic methods in NLP and the demonstrations of statistical learning in young children fundamentally change the nature of long-standing debates about the structure of the language acquisition device. However, there have been very few attempts to show how these methods can actually solve the problems that have motivated approaches such as the Principles and Parameters approach to syntax. At the same time, parameter-based learning models have also had only limited successes in explaining the kinds of problems that they were designed to solve. In this seminar we aim to explore what, if anything, distributional learning and parsing models have to offer for the problems of learning and parsing grammatically rich representations. We will start by outlining the learning and parsing challenges that we aim to address. This will include selecting a series of 'model problems' that we will bear in mind as we read relevant experimental and computational literature. We will then review first a selection of experimental studies on statistical learning and parsing, and then some of the methods that the NLP community considers to be most useful, and evaluate whether these tools provide what we need in order to solve our problems.


How to find us:

Jeff Colin

MMH 1417D MMH 1413F

301.405.8220 301.405.3082

jlidz AT umd dot edu colin AT umd dot edu


Student responsibilities

Class Participation (10%):  Do all of the readings and come to class prepared with questions, comments, things you didn’t understand, things you think are wrong-headed, things you think are beautiful, etc. Being an active participant in class discussions will increase your understanding and will help you to develop your own research projects. As a part of class participation, you are responsible for ensuring that a learning or parsing problem that you 'adopt' gets sufficient attention during the class discussions. 


Presentations (30%):  Each week, some number of you will be responsible for some part of the discussion. In the week(s) in which you are presenting, come prepared with either a handout or a computer presentation.  Your presentations should (a) cover the main arguments of the papers, discussing the methods used, the hypotheses tested and how the results bear on the hypotheses (30 min. max); and, (b) evaluate the conclusions reached, identify open questions and ways of testing them, integrate the conclusions with other readings or other things you know.  'Book reports' are not acceptable in this kind of class. We encourage you to discuss your presentation with at least one of the instructors beforehand.


Commentaries (40%):  You will also be responsible for writing up two commentaries on the material that you present, expanding upon the issues in your presentation and reacting to class discussion. Your two commentaries must be based on different aspects of the class (i.e., you can't write two commentaries on experimental studies of language acquisition, although you could write on an experimental and a computational approach to language learning). 


Short Paper (20%): In the first several weeks, you will identify a problem in either learning or parsing that you think fits into the overall problem investigated in the seminar. As we progress, you will keep track of how what we are learning bears your particular problem. At the end of the semester, you will write up some ideas concerning novel possible solutions to your problem.


Schedule of activities: This schedule will almost certainly change, depending on class interests, discussion, etc.


Date

Topic

Reading

Presenters

Due Dates

1/31

Overview of Course

Jeff & Colin

 

2/7

Defining the Problem

Gibson & Wexler 1994; Fodor 1998; Yang 2002

Jeff & Colin

 

2/14

Defining the problem

Pinker 1989, ch.1; Han et al 2005. Pinker 1989, ch.2-4, optional.

Jeff & Colin

 

2/21

Infants: Segmentation

Statistics & Algebra

Saffran et al. 1996; Gamble & Yang 2005; Marcus et al 1998; Gerken 2006; Pena et al 2002; Bonatti et al 2005; Newport & Aslin 2004.

Phil & Yuval

Consult with us about your topic

2/28

Infants: syntactic categories

Mintz et al 2002; Redington, Chater & Finch 1998; Mintz 2003; Mintz 2006; Gomez & Lakusta 2004

Clare & Rebecca

 

3/7

Infants: artificial syntax

Gerken & Gomez 1999; Gomez 2002; Saffran 2002; Hudson-Kam & Newport 2005

Michael & Eri

3/14

Children's Parsers

Snedeker & Trueswell 2004; Thothathiri & Snedeker 2006; Huttenlocher et al. 2004; Savage et al. 2006

Akira & Stacey

3/21

SPRING BREAK

 

3/28

Adults: Lexical Frequency

MacDonald et al. 1994, Frazier 1995, Pickering & Traxler 2003; Pickering et al. 2000; Gibson 2006.

Matt & Phil

4/4

Adults: Nonlexical Frequency

Boland et al 2004; Spivey-Knowlton & Sedivy 1995; Desmet et al., in press; Desmet & Declerq 2006; Scheepers 2003.

Akira & Eri

4/11

Statistics and Grammatical Description

Bresnan et al., 2005; Bresnan & Nikitina 2003; Newmeyer 2003; Clark 2005; Gahl & Garnsey 2004

Matt & Rebecca

4/18

Basics of Probability and Statistics

Goldsmith 2001, Manning & Schutze 1999, Zettlemoyer & Collins 2006

Michael, Clare, & Stacey

4/25

Computational Results

Klein & Manning 2001; Jurafsky 2002

SKIP

5/2

Computational Techniques and Human Parsing

Hale 2003, Levy 2006

BONUS: FODOR 1966

Yuval

5/9

Overflow

5/16

FINALS WEEK

Papers due by Thursday 5/18