Becerra, Aldonso; De la Rosa Vargas, José Ismael; González Ramírez, Efrén; Pedroza, David; Escalante, Iracemi; Santos, Eduardo
Resumen:
Training procedures of a deep neural network are still an area with ample research possibilities and constant improvement either to increase its efficiency or its time performance. One of the lesser-addressed components is its objective function, which is an underlying aspect to consider when there is the necessity to achieve better error rates in the area of automatic speech recognition. The aim of this paper is to present two new variations of the frame-level cost function for training a deep neural network with the purpose of obtaining superior word error rates in speech recognition applied to a case study in Spanish.