Repositorio Dspace

Speech recognition in a dialog system: from conventional to deep processing A case study applied to Spanish

Mostrar el registro sencillo del ítem

dc.contributor 31249 es_ES
dc.contributor.other 0000-0002-7337-8974 es_ES
dc.contributor.other https://orcid.org/0000-0002-7337-8974
dc.contributor.other https://orcid.org/0000-0002-8060-6170
dc.coverage.spatial Global es_ES
dc.creator Becerra, Aldonso
dc.creator De la Rosa Vargas, José Ismael
dc.creator González Ramírez, Efrén
dc.date.accessioned 2020-04-16T19:09:17Z
dc.date.available 2020-04-16T19:09:17Z
dc.date.issued 2018-08
dc.identifier info:eu-repo/semantics/publishedVersion es_ES
dc.identifier.issn 1380-7501 es_ES
dc.identifier.issn 1573-7721 es_ES
dc.identifier.uri http://ricaxcan.uaz.edu.mx/jspui/handle/20.500.11845/1713
dc.identifier.uri https://doi.org/10.48779/2d22-9s79
dc.description.abstract The aim of this paper is to illustrate an overview of the automatic speech recognition (ASR) module in a spoken dialog system and how it has evolved from the conventional GMM-HMM (Gaussian mixture model - hidden Markov model) architecture toward the recent nonlinear DNN-HMM (deep neural network) scheme. GMMs have dominated for a long time the baseline of speech recognition, but in the past years with the resurgence of artificial neural networks (ANNs), the former models have been surpassed in most recognition tasks. An outstanding consideration for ANNs-based acoustic model is the fact that their weights can be adjusted in two training steps: i) initialization of the weights (with or without pre-training) and ii) fine-tuning. es_ES
dc.language.iso eng es_ES
dc.publisher Springer es_ES
dc.relation https://doi.org/10.1007/s11042-017-5160-5 es_ES
dc.relation.uri generalPublic es_ES
dc.rights Atribución-NoComercial-SinDerivadas 3.0 Estados Unidos de América *
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/3.0/us/ *
dc.source Multimedia Tools Applications, Vol. 77, No. 12, pp. 15875-15911 es_ES
dc.subject.classification INGENIERIA Y TECNOLOGIA [7] es_ES
dc.subject.other Speech recognition es_ES
dc.subject.other Neural networks es_ES
dc.subject.other Gaussian mixture models es_ES
dc.subject.other Hidden Markov models es_ES
dc.subject.other Deep learning es_ES
dc.subject.other Spoken dialog system es_ES
dc.title Speech recognition in a dialog system: from conventional to deep processing A case study applied to Spanish es_ES
dc.type info:eu-repo/semantics/article es_ES


Ficheros en el ítem

El ítem tiene asociados los siguientes ficheros de licencia:

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem

Atribución-NoComercial-SinDerivadas 3.0 Estados Unidos de América Excepto si se señala otra cosa, la licencia del ítem se describe como Atribución-NoComercial-SinDerivadas 3.0 Estados Unidos de América

Buscar en DSpace


Búsqueda avanzada

Listar

Mi cuenta

Estadísticas