Neural and Statistical Processing of Spatial Cues for Sound Source Localisation

Jorge Dávila-Chacón , Sven Magg , Jindong Liu , Stefan Wermter
Proceedings of International Joint Conference on Neural Networks (IJCNN 2013), pages 1274--1281, doi: 10.1109/IJCNN.2013.6706886 - Aug 2013
Associated documents :  
When confronting binaural sound source localisation (SSL) algorithms with different environments and robotic platforms, there is an increasing need for non-linear integration methods of spatial cues. Based on interaural time and level differences, we compare the performance of several SSL systems. The architecture has three degrees of freedom, i.e. each tested architecture employs a different combination of representation of binaural cues, clustering and classification algorithms. The heuristic for the selection of methods is the same at each degree of freedom: to compare the impact of traditional statistical techniques versus machine learning algorithms with different degrees of biological inspiration. The overall performance is evaluated in the analysis of each system, including the accuracy of its output, training time and adequateness for life-long learning. The results support the use of hybrid systems, consisting different kinds of artificial neural networks, as they present an effective compromise between the characteristics evaluated.

 

@InProceedings{DMLW13, 
 	 author =  {Dávila-Chacón, Jorge and Magg, Sven and Liu, Jindong and Wermter, Stefan},  
 	 title = {Neural and Statistical Processing of Spatial Cues for Sound Source Localisation}, 
 	 booktitle = {Proceedings of International Joint Conference on Neural Networks (IJCNN 2013)},
 	 editors = {},
 	 number = {},
 	 volume = {},
 	 pages = {1274--1281},
 	 year = {2013},
 	 month = {Aug},
 	 publisher = {IEEE},
 	 doi = {10.1109/IJCNN.2013.6706886}, 
 }