Observatorio de I+D+i UPM

Memorias de investigación
Other publications:
Classification of Acoustic Scenes Based on Modulation Spectra and Position-Pitch Maps
Year:2018
Research Areas
  • Electronic technology and of the communications
Information
Abstract
A system for the automatic classification of acoustic scenes is proposed that uses the stereophonic signal captured by a binaural microphone. This system uses one channel for calculating the spectral distribution of energy across auditory-relevant frequency bands. It further obtains some descriptors of the envelope modulation spectrum (EMS) by applying the discrete cosine transform to the logarithm of the EMS. The availability of the two-channel binaural recordings is used for representing the spatial distribution of acoustic sources by means of position-pitch maps. These maps are further parametrized using the two-dimensional Fourier transform. These three types of features (energy spectrum, EMS and position pitch maps) are used as inputs for a standard multilayer perceptron with two hidden layers.
International
Si
Entity
DCASE2018 Challenge
Place
Surrey (Reino Unido)
Pages
Reference/URL
http://dcase.community/documents/challenge2018/technical_reports/DCASE2018_Fraile_84.pdf
Publication type
Technical Report
Participants
  • Autor: Ruben Fraile Muñoz (UPM)
  • Autor: Elena Blanco Martin (UPM)
  • Autor: Juana Maria Gutierrez Arriola (UPM)
  • Autor: Nicolas Saenz Lechon (UPM)
  • Autor: Victor Jose Osma Ruiz (UPM)
Research Group, Departaments and Institutes related
  • Creador: Grupo de Investigación: Aplicaciones Multimedia y Acústica
  • Centro o Instituto I+D+i: Centro de Investigación en Tecnologías del Software y Sistemas Multimedia para la Sostenibilidad (CITSEM)
  • Departamento: Teoría de la Señal y Comunicaciones (Provisional)
S2i 2020 Observatorio de investigación @ UPM con la colaboración del Consejo Social UPM
Cofinanciación del MINECO en el marco del Programa INNCIDE 2011 (OTR-2011-0236)
Cofinanciación del MINECO en el marco del Programa INNPACTO (IPT-020000-2010-22)