CS 682 Speech Processing
Class meets: MW, 16:00 - 17:15, GMCS-308,
On-line syllabus
Text: Spoken Language Processing, Huang, Acero & Hon,
Prentice Hall 2002.
Slides
- Motivation and architecture
- Sound, speech, and perception
- Sound, speech, and perception (convolution demo)
- Classifiers Part I, Part II
- Cepstral features
- Language modeling
- hidden Markov models (HMMs) Part I, Part II
- Decisions for acoustic modeling
- Search
Assignments
Due dates are posted on the calendar.
Problem sets:
PS01(see Gockenbach's Intro to Matlab), PS02, PS03, PS04, PS05, PS06/Lab 3
Labs:
L01, L02, PS06/Lab 3
Readings: See blackboard, most recent R01
Calendar
An on-line calendar with dates relevant to the class.
Additional texts on speech recognition or audio
If you have difficulties with a presentation in Huang, Acero, and Hon, the simplest method is to simply ask me to explain it during office hours. If you so choose, you might want to consult another text. Here are a few relevant texts:- Speech and Language Processing, Jurafsky & Martin, 2009.
- Digital Speech Processing, Synthesis, and Recognition, Second edition, Furui, 2000.
- Fundamentals of Speech Recognition, Rabiner and Juang, 1993.
- Speech Communications: Human and Machine, Second edition, O'Shaughnessy, 2000.
- Statistical Methods for Speech Recognition, Jelinek, 1998.
- The following texts are not on speech recognition, but offer accesible
presentations to relevant material:
- The Science of Musical Sounds, Sundberg, 1991.
- Fundamentals of Hearing, Third edition, Yost, 1994.
- Signals and Systems Made Ridiculously Simple, Karu, 1994.
- A Course in Phonetics, Ladefoged, Heinle & Heinle, 2001. Ladefoged has nice point and click to listen chart of the IPA vowels and consonants.
Frequently Asked Questions
- rohan accounts - How to obtain one? How to transfer files to or from?
- How can I submit soft-copies of code for assignments?
- How can I access a GUI program (e.g. Matlab) on rohan using XWin-32 from GMCS 425 or the library?
- How can I make using Matlab more pleasant over a slow X
Window connection?
- How can I listen to or records speech (Wavesurfer)?
- Instructions for mapping Windows networked drives can be found in the course documents section of this course's Blackboard site.
- IPA/TIMIT/CMU phone mappings
- CMU Pronunciation Dictionary
- How can I set the PATH for Windows or UNIX?
- Guide to ciations (IEEE style).
