Descripción
|
|
---|---|
This paper describes a novel approach to phonotactic LID, where instead of using soft-counts based on phoneme lattices, we use posteriogram to obtain n-gram counts. The high-dimensional vectors of counts are reduced to low-dimensional units for which we adapted the commonly used term i-vectors. The reduction is based on multinomial subspace modeling and is designed to work in the total-variability space. The proposed technique was tested on the NIST 2009 LRE set with better results to a system based on using soft-counts (Cavg on 30s: 3.15% vs 3.43%), and with very good results when fused with an acoustic i-vector LID system (Cavg on 30s acoustic 2.4% vs 1.25%). The proposed technique is also compared with another low dimensional projection system based on PCA. In comparison with the original soft-counts, the proposed technique provides better results, reduces the problems due to sparse counts, and avoids the process of using pruning techniques when creating the lattices. | |
Internacional
|
Si |
Nombre congreso
|
InterSpeech 2012, 13th Annual Conference of the International Speech Communication Association |
Tipo de participación
|
960 |
Lugar del congreso
|
Portland, Oregon |
Revisores
|
Si |
ISBN o ISSN
|
1990-9772 |
DOI
|
|
Fecha inicio congreso
|
09/09/2012 |
Fecha fin congreso
|
13/09/2012 |
Desde la página
|
1 |
Hasta la página
|
4 |
Título de las actas
|
InterSpeech 2012, 13th Annual Conference of the International Speech Communication Association |