Speaker Identification in Noisy Environments for Forensic Purposes

Rodarte Rodríguez, Armando; Becerra Sánchez, Aldonso; De La Rosa Vargas, José I.; Escalante García, Nivia I.; Olvera González, José E.; Velásquez Martínez, Emmanuel de J.; Zepeda Valles, Gustavo

DSpace Principal
→
Maestría en Ciencias del Procesamiento de la Información
→
*Documentos Académicos*-- M. en Ciencias del Proc. de la Info.
→
Ver ítem

Speaker Identification in Noisy Environments for Forensic Purposes

Rodarte Rodríguez, Armando; Becerra Sánchez, Aldonso; De La Rosa Vargas, José I.; Escalante García, Nivia I.; Olvera González, José E.; Velásquez Martínez, Emmanuel de J.; Zepeda Valles, Gustavo

URI: http://ricaxcan.uaz.edu.mx/jspui/handle/20.500.11845/3429
http://dx.doi.org/10.48779/ricaxcan-260

Fecha: 2022-10-30

Resumen:

The speech is a biological or physical feature unique to each person, and this is widely used in speaker identification tasks like access control, transaction authentication, home automation applications, among others. The aim of this research is to propose a connected-words speaker recognition scheme based on a closed-set speaker-independent voice corpus in noisy environments that can be applied in contexts such as forensic purposes. Using a KDD analysis, MFCCs were used as filtering technique to extract speech features from 158 speakers, to later carry out the speaker identification process. Paper presents a performance comparison of ANN, KNN and logistic regression models, which obtained a F1 score of 98%, 98.32% and 97.75%, respectively. The results show that schemes such as KNN and ANN can achieve a similar performance in full voice files when applying the proposed KDD framework, generating robust models applied in forensic environments.

Mostrar el registro completo del ítem