UPM R&D&I Observatory

Research reports
Journal articles:
Resource2Vec: Linked Data distributed representations for term discovery in automatic speech recognition
Year: 2018
Research areas
  • Electronic and communications technology
  • Electrical, electronic and automatic engineering
Data
Description
In this work we present a neural network embedding we call Resource2Vec, which is able to represent the resources that make up some Linked Data (LD) corpora. A vector representation of these resources allows more advantageous processing (in computational terms) as is the case with known word or document embeddings. We give a quantitative analysis for their study. Furthermore, we employ them in an Automatic Speech Recognition (ASR) task to demonstrate their functionality by designing a strategy for term discovery. This strategy permits out-of-vocabulary (OOV) terms in a Large Vocabulary Continuous Speech Recognition (LVCSR) system to be discovered and then put into the final transcription. First, we detect where a potential OOV term may have been uttered in the LVCSR output speech segments. Second, we carry out a candidate OOV search in some LD corpora. This search is oriented by distance measurements between the transcription context around the potential-OOV speech segment and the resources of the LD corpora in Resource2Vec format, obtaining a set of candidates. To rank them, we mainly depend on the phone transcription of that segment. Finally, we decide whether or not to incorporate a candidate into the final transcription. The results show we are able to improve the transcription in Word Error Rate (WER) terms significantly, after our strategy is used on speech in Spanish.
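The search, rank and decide steps described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not name the distance measure, the ranking score or the acceptance rule, so cosine similarity, phone-level edit distance and a fixed threshold are assumptions, as are all function names, parameters and the toy data.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors (assumed measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def edit_distance(a, b):
    """Levenshtein distance between two phone sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def discover_term(context_vec, segment_phones, resources, top_k=2, max_phone_dist=1):
    """Hypothetical term-discovery step.

    resources maps a resource name to (embedding, phone sequence).
    1. Keep the top_k resources closest to the transcription context.
    2. Rank those candidates by phone edit distance to the segment.
    3. Accept the best candidate only if it is close enough (assumed rule).
    """
    candidates = sorted(
        resources,
        key=lambda name: cosine_similarity(context_vec, resources[name][0]),
        reverse=True,
    )[:top_k]
    if not candidates:
        return None
    ranked = sorted(
        candidates,
        key=lambda name: edit_distance(segment_phones, resources[name][1]),
    )
    best = ranked[0]
    if edit_distance(segment_phones, resources[best][1]) <= max_phone_dist:
        return best
    return None


# Toy data: 3-dimensional embeddings and rough phone strings, invented for illustration.
resources = {
    "Getafe": ([0.8, 0.2, 0.1], list("xetafe")),
    "Madrid": ([0.9, 0.1, 0.0], list("madrid")),
    "Paella": ([0.0, 0.1, 0.9], list("paeya")),
}
context = [1.0, 0.1, 0.0]
result = discover_term(context, list("xetaf"), resources)
```

In this toy run the context vector shortlists the two place names, the phone comparison prefers the candidate whose pronunciation is nearest to the segment, and a segment with no phonetically close candidate yields `None`, so the original transcription would be left unchanged.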
International
Yes
ISI JCR
Yes
Journal title
Expert Systems With Applications
ISSN
0957-4174
JCR impact factor
3.768
Impact information
Volume
112
DOI
Issue number
From page
301
To page
320
Month
JUNE
Ranking
Journal Rank in Category 20/132
This activity belongs to research reports
Participants
  • Author: Alejandro Coucheiro Limeres (UPM)
  • Author: Javier Ferreiros Lopez (UPM)
  • Author: Ruben San Segundo Hernandez (UPM)
  • Author: Ricardo de Cordoba Herralde (UPM)
Related R&D&I research groups, departments, centres and institutes
  • Creator: Research group: Grupo de Tecnología del Habla (Speech Technology Group)
  • Department: Electronic Engineering
S2i 2021 Research Observatory @ UPM, with the collaboration of the UPM Consejo Social
Co-funded by MINECO under the INNCIDE 2011 Programme (OTR-2011-0236)
Co-funded by MINECO under the INNPACTO Programme (IPT-020000-2010-22)