Proceedings of the International Conference on Advances and Applications in Artificial Intelligence (ICAAAI 2025)

Speech Recognition Using Machine Learning

Authors
Gautam Kumar1, *, Jai Kumar Vanjare1, Akriti Barthwal1, Yogesh Rajput1, Kashish1
1Chandigarh University, Chandigarh, India
*Corresponding author. Email: gautam.e16534@gmail.com
Corresponding Author
Gautam Kumar
Available Online 22 June 2025.
DOI
10.2991/978-94-6463-738-0_33How to use a DOI?
Keywords
ASR; Speech Recognition; Signal processing; Machine Learning; Neural Networking
Abstract

The research paper focuses on studying in-depth procedures that machine learning techniques combine with deep learning and Natural Language Processing (NLP) methods to develop speech recognition models. The remarkable progress made with speech recognition technology caused it to become an important element for virtual assistants and transcription services in addition to accessibility tools. This research evaluates several speech recognition models by assessing their operational methods and technical features and their efficiency performance. Model comparison during this research study presented diverse strengths and limitations of different language training approaches. Recognition enhancement relies on both extensive data collection along with optimization of model features and extraction methods and optimization techniques. The solution provides approaches to process varied accents and contains two sets of functions that reduce background noise and achieve real-time performance. The research assesses the performance capacity of HMMs and DNNs and Transformer architectures in multilingual speech processing applications.

The research conducts studies to enhance advanced speech recognition model development by analyzing the performance characteristics of different methods. Research data obtained by experts enables both the developer community and researcher community to develop improved speech recognition systems that work across diverse acoustic environments and linguistic settings.

Copyright
© 2025 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

Volume Title
Proceedings of the International Conference on Advances and Applications in Artificial Intelligence (ICAAAI 2025)
Series
Advances in Intelligent Systems Research
Publication Date
22 June 2025
ISBN
978-94-6463-738-0
ISSN
1951-6851
DOI
10.2991/978-94-6463-738-0_33How to use a DOI?
Copyright
© 2025 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

TY  - CONF
AU  - Gautam Kumar
AU  - Jai Kumar Vanjare
AU  - Akriti Barthwal
AU  - Yogesh Rajput
AU  - Kashish
PY  - 2025
DA  - 2025/06/22
TI  - Speech Recognition Using Machine Learning
BT  - Proceedings of the International Conference on Advances and Applications in Artificial Intelligence (ICAAAI 2025)
PB  - Atlantis Press
SP  - 409
EP  - 421
SN  - 1951-6851
UR  - https://doi.org/10.2991/978-94-6463-738-0_33
DO  - 10.2991/978-94-6463-738-0_33
ID  - Kumar2025
ER  -