|
|
Computational Syntax and Semantics SyllabusLinguistics 582
We will begin with a review of some classic all paths parsing algorithms, topdown and bottum-up, introducing Chart parsing as a framework in which various approaches to parsing can be understood and implemented. The Natural Language toolkit (NLTK) will provide us with a rich set of reference implementations. Python, the language NLTK is written in, will be the programming language of choice. We will also introduce unification, a general framework for introducing arbitrary constraints into the parser, and eyeball some of the practical consequences of introducing this computationally expensive mechanism into the parsing process. Finally we will look at an NLTK implementation of chunk parsing, a form of shallow parsing and apply it to a simple information extraction task.
The text for the class will be Jurafsky and Martin, Speech and Natural Language Processing.
Prequisite: Some computer science or some linguistics; preferably Ling 581. Grading will be based on exercises/projects a take-home midterm and final.
Tu Th 11:00-12:30
http://www-rohan.sdsu.edu/~gawron/parsing
Mailing address:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||