Detecting Lombard Speech Using Deep Learning Approach

Citation DataSensors, ISSN: 1424-8220, Vol: 23, Issue: 1

Publication Year2023

2
Citations
0
Usage
4
Captures
0
Mentions
0
Social Media

Metric Options: Counts1 Year3 Year

Metrics Details

Citations
2
- Citation Indexes
  2
Captures
4
- Readers
  4

Article Description

Robust Lombard speech-in-noise detecting is challenging. This study proposes a strategy to detect Lombard speech using a machine learning approach for applications such as public address systems that work in near real time. The paper starts with the background concerning the Lombard effect. Then, assumptions of the work performed for Lombard speech detection are outlined. The framework proposed combines convolutional neural networks (CNNs) and various two-dimensional (2D) speech signal representations. To reduce the computational cost and not resign from the 2D representation-based approach, a strategy for threshold-based averaging of the Lombard effect detection results is introduced. The pseudocode of the averaging process is also included. A series of experiments are performed to determine the most effective network structure and the 2D speech signal representation. Investigations are carried out on German and Polish recordings containing Lombard speech. All 2D signal speech representations are tested with and without augmentation. Augmentation means using the alpha channel to store additional data: gender of the speaker, F0 frequency, and first two MFCCs. The experimental results show that Lombard and neutral speech recordings can clearly be discerned, which is done with high detection accuracy. It is also demonstrated that the proposed speech detection process is capable of working in near real-time. These are the key contributions of this work.

Bibliographic Details

DOI10.3390/s23010315

PMID36616913

URL IDhttp://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85145874079&origin=inward; http://dx.doi.org/10.3390/s23010315; http://www.ncbi.nlm.nih.gov/pubmed/36616913; https://www.mdpi.com/1424-8220/23/1/315; https://dx.doi.org/10.3390/s23010315

AUTHOR(S)

Kąkol, Krzysztof; Korvel, Gražina; Tamulevičius, Gintautas; Kostek, Bożena

PUBLISHER(S)

MDPI AG

TAG(S)

Chemistry; Computer Science; Physics and Astronomy; Biochemistry, Genetics and Molecular Biology; Engineering

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know