Observatorio de I+D+i UPM

| Otras actividades
HOME

Proyectos Internacionales Art�culos Patentes UPM Software UPM Empresas UPM Otras actividades Memorias de investigaci�n

Memorias de investigación

Art�culos en revistas:

Resource2Vec: Linked Data distributed representations for term discovery in automatic speech recognition

A�o:2018

�reas de investigaci�n

Tecnolog�a electr�nica y de las comunicaciones,
Ingenier�a el�ctrica, electr�nica y autom�tica

Datos

Descripci�n
In this work we present a neural network embedding we call Resource2Vec, which is able to represent the resources that make up some Linked Data (LD) corpora. A vector representation of these resources allows more advantageous processing (in computational terms) as is the case with known word or doc ument embeddings. We give a quantitative analysis for their study. Furthermore, we employ them in an Automatic Speech Recognition (ASR) task to demonstrate their functionality by designing a strategy for term discovery. This strategy permits out-of-vocabulary (OOV) terms in a Large Vocabulary Continuous Speech Recognition (LVCSR) system to be discovered and then put into the ?nal transcription. First, we detect where a potential OOV term may have been uttered in the LVCSR output speech segments. Second, we carry out a candidate OOV search in some LD corpora. This search is oriented by distance measure ments between the transcription context around the potential-OOV speech segment and the resources of the LD corpora in Resource2Vec format, obtaining a set of candidates. To rank them, we mainly depend on the phone transcription of that segment. Finally, we decide whether or not to incorporate a candidate into the ?nal transcription. The results show we are able to improve the transcription in Word Error Rate (WER) terms signi?cantly, after our strategy is used on speech in Spanish.
Internacional	Si
JCR del ISI	Si
T�tulo de la revista	Expert Systems With Applications
ISSN	0957-4174
Factor de impacto JCR	3,768
Informaci�n de impacto
Volumen	112
DOI
N�mero de revista
Desde la p�gina	301
Hasta la p�gina	320
Mes	JUNIO
Ranking	Journal Rank in Category 20/132

Esta actividad pertenece a memorias de investigaci�n

Participantes

Autor: Alejandro Coucheiro Limeres UPM
Autor: Javier Ferreiros Lopez UPM
Autor: Ruben San Segundo Hernandez UPM
Autor: Ricardo de Cordoba Herralde UPM

Grupos de investigaci�n, Departamentos, Centros e Institutos de I+D+i relacionados

Creador: Grupo de Investigaci�n: Grupo de Tecnolog�a del Habla
Departamento: Ingenier�a Electr�nica