PlumX Metrics

Relational recurrent neural networks for polyphonic sound event detection

Multimedia Tools and Applications, ISSN: 1573-7721, Vol: 78, Issue: 20, Page: 29509-29527
2019
  • Citations: 10 (Citation Indexes: 10)
  • Usage: 0
  • Captures: 67
  • Mentions: 0
  • Social Media: 0


Article Description

A smart environment is one application scenario of the Internet of Things (IoT), and a variety of technologies have been developed to provide a ubiquitous smart environment for humans. In such a system, sound event detection is a fundamental technology: it automatically senses sound changes in the environment and detects the sound events that cause them. In this paper, we propose the use of a Relational Recurrent Neural Network (RRNN) for polyphonic sound event detection, called RRNN-SED, which exploits the strengths of RRNNs in long-term temporal context extraction and relational reasoning across a polyphonic sound signal. Unlike previous sound event detection methods, which rely heavily on convolutional or recurrent neural networks, the proposed RRNN-SED method can address the long-duration and overlap problems in polyphonic sound event detection. Specifically, since the pieces of historical information memorized inside the RRNN can interact with each other across a polyphonic sound signal, the proposed RRNN-SED method is effective and efficient at extracting temporal context information and reasoning about the relational characteristics of the target sound events. Experimental results on two public datasets show that the proposed method achieves better sound event detection results in terms of segment-based F-score and segment-based error rate.
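The abstract's key mechanism is that memory contents interact with each other via relational reasoning at every time step. The page gives no code, but the core of a relational memory update in this spirit can be sketched as single-head self-attention over a set of memory slots together with the current input frame. All names, shapes, and the random-weight demo below are illustrative assumptions, not the authors' implementation:

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def relational_memory_step(memory, frame, Wq, Wk, Wv):
    """One simplified relational memory update (single attention head).

    memory: (slots, d) matrix of memory slots
    frame:  (d,) input features for one time step (e.g. one audio frame)
    Wq/Wk/Wv: (d, d) projection matrices (hypothetical parameters)

    The input frame is appended to the slots before attention, so each
    slot attends over every other slot and the new input -- this is the
    "interaction across the signal" the abstract refers to.
    """
    m = np.vstack([memory, frame])            # (slots + 1, d)
    q = memory @ Wq                           # queries from current slots
    k = m @ Wk                                # keys over slots + input
    v = m @ Wv                                # values over slots + input
    attn = softmax(q @ k.T / np.sqrt(k.shape[-1]))
    update = attn @ v                         # relational update per slot
    return memory + update                    # residual update of the slots

# Illustrative demo with random weights and a random audio frame.
rng = np.random.default_rng(0)
slots, d = 4, 8
memory = rng.standard_normal((slots, d))
frame = rng.standard_normal(d)
Wq, Wk, Wv = (rng.standard_normal((d, d)) for _ in range(3))
new_memory = relational_memory_step(memory, frame, Wq, Wk, Wv)
```

In a full model along the lines sketched in the abstract, this step would be applied at every frame of the polyphonic signal, with per-frame event probabilities read out from the updated memory; the residual connection is what lets long-term temporal context persist across many updates.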
