Video Summarization Using Knowledge Distillation-Based Attentive Network

Citation DataCognitive Computation, ISSN: 1866-9964, Vol: 16, Issue: 3, Page: 1022-1031

Publication Year2024

2
Citations
0
Usage
5
Captures
0
Mentions
0
Social Media

Metric Options: Counts1 Year3 Year

Metrics Details

Citations
2
- Citation Indexes
  2
Captures
5
- Readers
  5

Article Description

The vast volumes of videos produced daily require highly efficient measures to ensure that key information is reported for effective review and storage, which leads to the popularity of video summarization techniques. Deep learning has shown its advantages in video summarization, especially convolutional neural network, which are effective in extracting features for video summarization. However, the deep network layers and the limited range of temporal dependence make it challenging to deploy the network and thus affect the accuracy of identifying important video frames. To tackle these issues, we present a knowledge distillation-based attentive network (KDAN) for supervised video summarization in this paper. The proposed method separates the full convolutional network from the attention mechanism based on the idea of education and learning processes in biology and uses a full convolutional network as a teacher network to guide the learning of the student network consisting of an attention mechanism. The obtained lightweight network considers the knowledge learned from both networks, thus solving the problems of explosion in the number of participants and slow training. We have conducted experiments on two widely used benchmarks SumMe and TVSum. DANtea achieves F-scores 53.09 and 60.30, and DAN achieves F-scores 51.26 and 61.55 in Canonical settings on the SumMe and TVSum datasets, respectively. Experiments on two public benchmarks SumMe and TVSum demonstrate the effectiveness and superiority of the proposed network over existing state-of-the-art methods.

Bibliographic Details

DOI10.1007/s12559-023-10243-3

URL IDhttp://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85182498233&origin=inward; http://dx.doi.org/10.1007/s12559-023-10243-3; https://link.springer.com/10.1007/s12559-023-10243-3; https://dx.doi.org/10.1007/s12559-023-10243-3; https://link.springer.com/article/10.1007/s12559-023-10243-3

AUTHOR(S)

Jialin Qin; Hui Yu; Wei Liang; Derui Ding

PUBLISHER(S)

Springer Science and Business Media LLC

TAG(S)

Computer Science; Neuroscience

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know