SDSU logo

Marie A. Roch

Associate Professor of Computer Science



 

Audio Processing Lab Overview

 

Lab picnic photo


Summer 2005 picnic - lab members, alumni, & families

The audio processing lab has several active projects with collaborators both internal and external to San Diego State University. Current and prior supporters of the lab include: The Office of Naval Research, Chief of Naval Operations N45 Environmental Readiness Division, and The Scripps Institution of Oceanography.

Bioacoustics

Pacific white-sided dolphinWe work with the Marine Physical Lab's Whale Acoustics group led by Dr. John Hildebrand at The Scripps Institution of Oceanography. We use passive acoustic monitoring to learn about populations of marine mammals. Much of our recent work has focused on using echolocation clicks produced by toothed whales and dolphins (odontocetes) to determine species identity. Some of our efforts include development of real-time systems for platforms such as the waveglider. Other projects include the analysis of tonal calls in blue whales and isolating individual individual whistles in groups of vocalizing odontocetes.

Providing automated analysis of sounds produced by animals helps biologists to better understand the ecosystem by providing information about population dynamics (abundance, seasonality, etc.) and may eventually yield information about how they communicate with one another.

Current students: Chris Marsh, Johanna Stinner-Sloan, and Jonathan Walton.

A note to students interested in joining the lab

It is extremely rare to accept a student without the student having successfully completed CS 682, Speech Processing, which provides a graduate level introduction to speech and speaker recognition. Typically, we accept one or two new students per year, so successful completion of the course is not a guarantee that there will be an open position. Students who are interested in taking CS 682 (offered once per year) should endeavor to make sure they meet the minimum prerequisites for the course: CS 310 or COMPE 260 (Data Structures, higher level CS/COMPE typically expected). Although not required, a background in statistics is highly desireable, and knowledge of basic linear algebra is helpful.

Lab Alumni

Previous areas of research

In general, the lab is no longer accepting students in these areas:

Speaker recognition

Speaker recognition is a form of biometric where the goal is to construct algorithms to determine a person's identity based upon their voice. Projects we have worked on include recognition in the presence of noise, speaker segmentation, extensions to Gaussian selection to set thresholds independently of the number of dimensions, and high level speaker recognition.

Control systems for hearing aids

uncompressed and compressed spectrumIn collaboration with the Assistive Devices Lab led by Dr. Richard Hurtig in The Dept. of Speech Pathology and Audiology at The University of Iowa, we design dynamic control systems for hearing aids. Our current system parameterizes a frequency compression algorithm which reduces the frequency bandwidth of a signal while maintaining formant ratios. Formants are the harmonic frequencies which are reinforced by our vocal tract and the ratio between formants is known to be important for perception. By remapping the frequency domain, we move the formants to an audible range for a severely hearing impaired listener.