Sound source localization based on three-element microphone array and RepMobileViT model
Applied Acoustics, ISSN: 0003-682X, Vol: 231, Page: 110480
2025
Metric Options: CountsSelecting the 1-year or 3-year option will change the metrics count to percentiles, illustrating how an article or review compares to other articles or reviews within the selected time period in the same journal. Selecting the 1-year option compares the metrics against other articles/reviews that were also published in the same calendar year. Selecting the 3-year option compares the metrics against other articles/reviews that were also published in the same calendar year plus the two years prior.
Example: if you select the 1-year option for an article published in 2019 and a metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019. If you select the 3-year option for the same article published in 2019 and the metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019, 2018 and 2017.
Citation Benchmarking is provided by Scopus and SciVal and is different from the metrics context provided by PlumX Metrics.
Example: if you select the 1-year option for an article published in 2019 and a metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019. If you select the 3-year option for the same article published in 2019 and the metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019, 2018 and 2017.
Citation Benchmarking is provided by Scopus and SciVal and is different from the metrics context provided by PlumX Metrics.
Article Description
Traditional microphone array-based sound source localization methods typically begin with signal processing, where the source position is regarded as an estimated continuous value in a certain space. However, in some practical scenarios, such as conference rooms, source localization only needs to focus on some predefined areas, thus can be transformed into a deep learning classification problem. To solve the requirements of small size microphone arrays while maintaining the accuracy of source localization in reverberant environments, a feature extraction method based on sound intensity and an efficient lightweight model are proposed for the direction of arrival (DOA) estimation using a three-element microphone array. Firstly, two types of sound intensity feature spectrograms are designed to effectively extract sound source azimuth information under reverberant conditions. Secondly, the RepViT architecture is introduced into the MobileViT network to construct a new lightweight network, which estimates the source position with smaller parameters and faster speed. Finally, the sound intensity spectrogram is fed into the proposed model to estimate the DOA. Compared to existing methods, the proposed method exhibits more efficient and accurate performance in localizing with small arrays under simulated and real environments.
Bibliographic Details
http://www.sciencedirect.com/science/article/pii/S0003682X24006315; http://dx.doi.org/10.1016/j.apacoust.2024.110480; http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85211643272&origin=inward; https://linkinghub.elsevier.com/retrieve/pii/S0003682X24006315; https://dx.doi.org/10.1016/j.apacoust.2024.110480
Elsevier BV
Provide Feedback
Have ideas for a new metric? Would you like to see something else here?Let us know