Research Article
Speech Enhancement with Background Noise Suppression in Various Data Corpus Using Bi-LSTM Algorithm
Author(s): Vinothkumar G* and Manoj Kumar D
Published In : International Journal of Electrical and Electronics Research (IJEER) Volume 12, Issue 1
Publisher : FOREX Publication
Published : 28 March 2024
e-ISSN : 2347-470X
Page(s) : 322-328
Abstract
Noise reduction is a crucial step in today's teleconferencing scenarios. The signal-to-noise ratio (SNR) is a key factor in reducing the bit error rate (BER): a higher SNR lowers the BER, which improves the reliability and performance of the communication system. The microphone is the primary audio input device that captures the input signal; as the captured signal is transmitted, it is corrupted by white noise and phase noise, so the output signal is a combination of the input signal and reverberation noise. Our aim is to minimize this interfering noise and thereby improve the SNR. To achieve this, we develop a real-time speech enhancement method that uses an enhanced recurrent neural network with Bidirectional Long Short-Term Memory (Bi-LSTM). In this sequence-processing framework, one LSTM processes the input in the forward direction while a second LSTM processes it in the reverse direction, together forming the Bi-LSTM. The Bi-LSTM requires fewer tensor operations, making it faster and more efficient. The Bi-LSTM is trained in real time on a variety of noise signals, and the trained system suppresses the noise to recover an unaltered signal, making the proposed system comparable to other noise-suppression systems. The STOI and PESQ metrics show improvements of approximately 0.5% to 14.8% and 1.77% to 29.8%, respectively, compared with existing algorithms across various noise types and input SNR levels.
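As a concrete illustration of the bidirectional processing described above, the following is a minimal sketch of a Bi-LSTM noise-suppression network in PyTorch. It is not the authors' implementation: the use of STFT magnitude features, the mask-based output, and all layer sizes are assumptions made purely for illustration.

import torch
import torch.nn as nn

class BiLSTMEnhancer(nn.Module):
    """Estimates a time-frequency mask from a noisy magnitude spectrogram."""

    def __init__(self, n_freq_bins=257, hidden_size=256, num_layers=2):
        super().__init__()
        # bidirectional=True runs one LSTM over the frames forward in time
        # and a second LSTM over the same frames in reverse.
        self.bilstm = nn.LSTM(
            input_size=n_freq_bins,
            hidden_size=hidden_size,
            num_layers=num_layers,
            batch_first=True,
            bidirectional=True,
        )
        # Forward and backward hidden states are concatenated, so the
        # projection back to the frequency bins takes 2 * hidden_size inputs.
        self.mask = nn.Sequential(
            nn.Linear(2 * hidden_size, n_freq_bins),
            nn.Sigmoid(),  # mask values in [0, 1]
        )

    def forward(self, noisy_mag):
        # noisy_mag: (batch, time_frames, n_freq_bins)
        states, _ = self.bilstm(noisy_mag)
        return noisy_mag * self.mask(states)  # enhanced magnitude spectrogram

# Usage with random data standing in for an STFT magnitude spectrogram.
model = BiLSTMEnhancer()
noisy = torch.rand(4, 100, 257)   # 4 utterances, 100 frames, 257 bins
enhanced = model(noisy)
print(enhanced.shape)             # torch.Size([4, 100, 257])

For evaluation, STOI and PESQ scores of the enhanced output can be computed with standard tooling; for example, the open-source pystoi and pesq Python packages are commonly used for these metrics.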
Keywords: RNN, Bi-LSTM, SNR, Speech Enhancement, Background Noise, DNN.
Vinothkumar G*, Research Scholar, Department of Electronics and Communication Engineering, SRM Institute of Science and Technology, Ramapuram Campus, Chennai, Tamil Nadu, India; Email: vinothkg@srmist.edu.in
Manoj Kumar D, Assistant Professor, Department of Electronics and Communication Engineering, SRM Institute of Science and Technology, Ramapuram Campus, Chennai, Tamil Nadu, India; Email: manojkud1@srmist.edu.in