[Back]

Voice Recognition


[ Up to Voice Recognition]

Phonemes


[ Up to Phonemes]

Consonants

[Jump to page by Molly Overholt at Dartmouth University]

[ Up to Phonemes]

Vowels

[Jump to page by Molly Overholt at Dartmouth University]

[ Up to Voice Recognition]

Parameters


QUIZ...

[ Up to Voice Recognition]

Typical Process


QUIZ...

[ Up to Typical Process]

Extract a spectrogram from the raw signal

QUIZ...

[ Up to Extract a spectrogram from the raw signal]

sine-waves

[Jump to page by Key Curriculum Press, Inc.]

[ Up to Extract a spectrogram from the raw signal]

Additive sine waves

[Jump to page by David Eisenberg]

[ Up to Extract a spectrogram from the raw signal]

Line Spectra


QUIZ...

[ Up to Extract a spectrogram from the raw signal]

Sound decomposition


QUIZ...

[ Up to Extract a spectrogram from the raw signal]

Envelopes


QUIZ...

[ Up to Extract a spectrogram from the raw signal]

Formants

[Jump to page by Kevin Russell] QUIZ...

[ Up to Extract a spectrogram from the raw signal]

Vowel sounds

[Jump to page by Kevin Russell] QUIZ...

[ Up to Extract a spectrogram from the raw signal]

Spectrograms


QUIZ...

[ Up to Extract a spectrogram from the raw signal]

Stops


QUIZ...

[ Up to Extract a spectrogram from the raw signal]

Fricatives


QUIZ...

[ Up to Typical Process]

Map the spectrogram to a stream of sub-phonetic symbols ('codes')


[ Up to Map the spectrogram to a stream of sub-phonetic symbols ('codes')]

The spectrogram is a sequence of vectors

QUIZ...

[ Up to Map the spectrogram to a stream of sub-phonetic symbols ('codes')]

From each segment we can extract yet another vector which gives us more informative features.

QUIZ...

[ Up to Map the spectrogram to a stream of sub-phonetic symbols ('codes')]

Example: a toy 'dad' recognizer


[ Up to Example: a toy 'dad' recognizer]

Extracting the codes

QUIZ...

[ Up to Example: a toy 'dad' recognizer]

The finite-state recognizer

QUIZ...

[ Up to Typical Process]

Feed the codes into a statistically trained model


[ Up to Feed the codes into a statistically trained model]

Hidden Markov Models (HMM)


QUIZ...

[ Up to Hidden Markov Models (HMM)]

Example: computing the next probability of state 'A'

[ Up to Typical Process]

Consult models of higher-up levels of language to resolve ambiguities.

QUIZ...

[ Up to Voice Recognition]

Ling. 354. Assignment 1 - Voice Recognition.

In this assignment you will this this spectrogram.

It is a spectrogram of 'funny money' Note that since it rhymes, the 'unny' sound occurs twice while there's a contrast between the f- and m- sounds.

Print out the file on a laser printer. You may want to make several copies.

Use a highlighter pen to mark off alternate segments where the spectrogram changes abruptly.

Indicate the probable boundaries between the phonemes on the bottom of the transcript. Do this by drawing brackets with a pencil along the bottom of the spectrogram.

Look here for an example on transcript of 'Jack Sprat'.

For each of the phonemes in your sample (F, M, U, N, Y), describe the segments which map onto it.