|
|
Advanced Statistical Methods SyllabusLinguistics 696
For parsing we will begin a review Context free parsing, focusing on CKY, the bottum up algorithm that has the most direct application to PCFGs (Probabilistic Context-Free Grammars). We will continue with some enhancements to PCFGs which are loosenings of its context freeness assumptions which have come to be called Generalized PCFGs, which actually don't change theor computational properties but greatly enhance their predictive power. We will look at issues of parser evaluation, search, lexicalizing PCFGs, and discriminative parsing. For MT, we will review some classic MT systems, move on to the Noisy Channel model that has been so influential in statistical MT (SMT), and then cover basic components of a modern system, the word-alignment training, phrase alignment training, target language-modeling, and decoding. We will look at the contributions made by introducing classes and hierarchical synatctic information. Then we will read some papers and do some simple experiments with sense-disambiguation.
The text for the class will be Jurafsky and Martin, Speech and Natural Language Processing, with some material from the 2nd Edition, focusing on Chapters 19 and 24, the word sense and MT chapters. There are also additional readings available online (see course outline).
Prequisite: Some computer science or some linguistics; preferably Ling 581. Grading will be based on exercises/projects a take-home midterm and final.
Wed 4:00-6:40
http://www-rohan.sdsu.edu/~gawron/ling696
Mailing address:
|
||||||||||||||