Speech recognition in a dialog system: from conventional to deep processing A case study applied to Spanish

Becerra, Aldonso; De la Rosa Vargas, José Ismael; González Ramírez, Efrén

DSpace Principal
→
Maestría en Ciencias del Procesamiento de la Información
→
*Documentos Académicos*-- M. en Ciencias del Proc. de la Info.
→
Ver ítem

dc.contributor	31249	es_ES
dc.contributor.other	0000-0002-7337-8974	es_ES
dc.contributor.other	https://orcid.org/0000-0002-7337-8974
dc.contributor.other	https://orcid.org/0000-0002-8060-6170
dc.coverage.spatial	Global	es_ES
dc.creator	Becerra, Aldonso
dc.creator	De la Rosa Vargas, José Ismael
dc.creator	González Ramírez, Efrén
dc.date.accessioned	2020-04-16T19:09:17Z
dc.date.available	2020-04-16T19:09:17Z
dc.date.issued	2018-08
dc.identifier	info:eu-repo/semantics/publishedVersion	es_ES
dc.identifier.issn	1380-7501	es_ES
dc.identifier.issn	1573-7721	es_ES
dc.identifier.uri	http://ricaxcan.uaz.edu.mx/jspui/handle/20.500.11845/1713
dc.identifier.uri	https://doi.org/10.48779/2d22-9s79
dc.description.abstract	The aim of this paper is to illustrate an overview of the automatic speech recognition (ASR) module in a spoken dialog system and how it has evolved from the conventional GMM-HMM (Gaussian mixture model - hidden Markov model) architecture toward the recent nonlinear DNN-HMM (deep neural network) scheme. GMMs have dominated for a long time the baseline of speech recognition, but in the past years with the resurgence of artificial neural networks (ANNs), the former models have been surpassed in most recognition tasks. An outstanding consideration for ANNs-based acoustic model is the fact that their weights can be adjusted in two training steps: i) initialization of the weights (with or without pre-training) and ii) fine-tuning.	es_ES
dc.language.iso	eng	es_ES
dc.publisher	Springer	es_ES
dc.relation	https://doi.org/10.1007/s11042-017-5160-5	es_ES
dc.relation.uri	generalPublic	es_ES
dc.rights	Atribución-NoComercial-SinDerivadas 3.0 Estados Unidos de América	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/us/	*
dc.source	Multimedia Tools Applications, Vol. 77, No. 12, pp. 15875-15911	es_ES
dc.subject.classification	INGENIERIA Y TECNOLOGIA [7]	es_ES
dc.subject.other	Speech recognition	es_ES
dc.subject.other	Neural networks	es_ES
dc.subject.other	Gaussian mixture models	es_ES
dc.subject.other	Hidden Markov models	es_ES
dc.subject.other	Deep learning	es_ES
dc.subject.other	Spoken dialog system	es_ES
dc.title	Speech recognition in a dialog system: from conventional to deep processing A case study applied to Spanish	es_ES
dc.type	info:eu-repo/semantics/article	es_ES