OEAW, ARI OEAW ARI

LocaPhoto

Virtual Acoustics: Localization Model & Numeric Simulations (LocaPhoto)

Localization of sound sources is an important task of the human auditory system and much research effort has been put into the development of audio devices for virtual acoustics, i.e. the reproduction of spatial sounds via headphones. Even though the process of sound localization is not completely understood yet, it is possible to simulate spatial sounds via headphones by using head-related transfer functions (HRTFs). HRTFs describe the filtering of the incoming sound due to head, torso and particularly the pinna and thus they strongly depend on the particular details in the listener's geometry. In general, for realistic spatial-sound reproduction via headphones, the individual HRTFs must be measured. The currently available acoustic measurement is a technically-complex process, involves placing microphones into the listener's ears, and lasts for tens of minutes.overview

As a first step towards an easily accessible method for virtual acoustics, we visually scanned listeners' heads, obtained a geometrical representation of the heads (3-D mesh), and calculated HRTFs using numerical algorithms (part of the FWF-project P18401-B15). While our results showed that the numerical method is generally capable to calculate HRTFs, several questions remained open, especially in the interpretation from the perceptual perspective. The perceptual evaluation of the effect of the systematic parameter variation turned out to be a challenging task because of the large number of tests conditions and the lengthy psychoacoustic tests. It became clear that a sound-localization model predicting the performance is required to reduce the number of conditions for further behavioral evaluations in experiments. In LocaPhoto, we propose to develop such a sound-localization model and to continue our work on numerical HRTF calculations.

Thus the first goal of LocaPhoto is to create a functional localization model which is able to predict localization performance for sound sources positioned in 3-D free field. The model will include recent neuro-physiological findings and is aimed as a tool for further research on modeling the mechanisms behind spatial hearing by the scientific community outside of LocaPhoto.

In LocaPhoto, the model will be used to evaluate HRTFs from the perceptual perspective. This will allow to achieve the second goal of LocaPhoto: the development of a method to numerically calculate perceptually valid HRTFs based on the visually retrieved geometry of a listener. Based on the results from our previous project, we plan to simplify the geometry acquisition and to increase the mesh quality by generating 3-D meshes from 2-D photos with photogrammetric-reconstruction algorithms. Given the huge amount of parameters in the numerical calculations, we expect hundreds of calculated HRTF sets. The localization model will be used to select the promising HRTF candidates, all of which will be subjectively tested in localization experiments.

Photos & Graphics

Predictions localization model Predictions based on the localization model from Langendijk and Bronkhorst (2002). Left panel: incoming sound with frequencies up to 16 kHz. Middle panel: incoming sound with frequencies up to 8 kHz. Right panel: incoming sound as for the middle panel, the prediction improved by modifying the model to consider the incoming signal bandwidth only. Bright colors denote regions where the model predicts sound localization, dots denote the actual sound localization performance of the subject measured in a psychoacoustic experiment.
mesh
Mesh from Kreuzer et al. (2009)
Amplitude Spectra

Results preliminary sound localization experiment

LEFT: The amplitude spectra of the measured (top) and calculated (bottom) HRTFs as a function of elevation angle.

TOP: Results of a preliminary sound localization experiment with the corresponding listener; Measured: acoustically measured DTFs; smoothed: measured and spectrally smoothed DTFs (only first 32 cepstral coefficients used, compare Kulkarni and Colburn (1998)); Strongly smoothed: only the first 8 cepstral coefficients used; Calculated: calculated DTFs (from the left panel); KEMAR: DTFs of a dummy head Gardner and Martin (1995).

Results preliminary sound localization experiment Results of a preliminary sound localization experiment with the corresponding listener; Measured: acoustically measured DTFs; smoothed: measured and spectrally smoothed DTFs (only first 32 cepstral coefficients used, compare Kulkarni and Colburn (1998)); Strongly smoothed: only the first 8 cepstral coefficients used; Calculated: calculated DTFs (from the left panel); KEMAR: DTFs of a dummy head Gardner and Martin (1995).
Photographic reconstruction of a person

Photographic reconstruction of a person2

 

 

 

 

 

 

 

 

 

 

 

 

LEFT & TOP: Photogrammetric reconstruction of a person (preliminary).

© 2014 - Austrian Academy of Sciences - Acoustics Research Institute
Wohllebengasse 12-14, A-1040, Vienna

Thursday, 28 08 2014