Research Article
Speaker Recognition Assessment in a Continuous System for Speaker Identification
Author(s): Mahesh K. Singh, P. Mohana Satya, Vella Satyanarayana and Sridevi Gamini
Published In : International Journal of Electrical and Electronics Research (IJEER) Volume 10, Issue 4
Publisher : FOREX Publication
Published : 18 October 2022
e-ISSN : 2347-470X
Page(s) : 862-867
Abstract
This article focuses on recognizing speakers in multi-speaker speech. Every conference, talk or discussion involves several speakers, and speech of this kind raises distinct problems at each stage of processing. Challenges include noise and other impurities in the surroundings, the involvement of multiple speakers, speaker distance, and the microphone equipment used. Beyond handling these hurdles in real time, processing the multi-speaker speech itself poses problems. The usual sequence of operations in multi-speaker speech recognition is to identify speech segments, separate the speaking segments, cluster similar segments, and finally recognize the speaker from these segments. The article discusses each of these linked phases with the relevant methodologies, reviews the common metrics, methods and practices, and examines the algorithms used at the different stages of a speech recognition system. Such voice recognition systems are built in several phases: voice filtering, speaker segmentation, speaker diarization, and recognition of the speaker among 20 speakers.
Keywords: Speaker recognition, DSL, SVM, BPNN, Speaker identification.
Mahesh K. Singh*, Department of ECE, Aditya College of Engineering, Surampalem, India; Email: mahesh.singh@accendere.co.in
P. Mohana Satya, Department of ECE, Aditya College of Engineering, Surampalem, India; Email: perurimohanasatya999@gmail.com
Vella Satyanarayana, Department of ECE, Aditya College of Engineering, Surampalem, India; Email: vasece_vella@aec.edu.in
Sridevi Gamini, Department of ECE, Aditya College of Engineering, Surampalem, India; Email: sridevi_gamini@yahoo.com
Mahesh K. Singh, P. Mohana Satya, Vella Satyanarayana and Sridevi Gamini (2022), Speaker Recognition Assessment in a Continuous System for Speaker Identification. IJEER 10(4), 862-867. DOI: 10.37391/IJEER.100418.