SDSU logo

Marie A. Roch

Associate Professor of Computer Science



CS 682 Speech Processing

Class meets: MW, 16:00 - 17:15, GMCS-308,

On-line syllabus
Text: Spoken Language Processing, Huang, Acero & Hon, Prentice Hall 2002.

Slides

  1. Motivation and architecture
  2. Sound, speech, and perception
  3. Sound, speech, and perception (convolution demo)
  4. Classifiers Part I, Part II
  5. Cepstral features
  6. Language modeling
  7. hidden Markov models (HMMs) Part I, Part II
  8. Decisions for acoustic modeling
  9. Search

Assignments

 

Due dates are posted on the calendar.


Problem sets:

PS01(see Gockenbach's Intro to Matlab), PS02, PS03, PS04, PS05, PS06/Lab 3


Labs:

L01, L02, PS06/Lab 3


Readings: See blackboard, most recent R01


Calendar

An on-line calendar with dates relevant to the class.

 

Additional texts on speech recognition or audio

If you have difficulties with a presentation in Huang, Acero, and Hon, the simplest method is to simply ask me to explain it during office hours. If you so choose, you might want to consult another text. Here are a few relevant texts:

Frequently Asked Questions