PlumX Metrics
Embed PlumX Metrics

Video sketch: A middle-level representation for action recognition

Applied Intelligence, ISSN: 1573-7497, Vol: 51, Issue: 4, Page: 2589-2608
2021
  • 19
    Citations
  • 0
    Usage
  • 8
    Captures
  • 0
    Mentions
  • 0
    Social Media
Metric Options:   Counts1 Year3 Year

Metrics Details

  • Citations
    19
    • Citation Indexes
      19
  • Captures
    8

Article Description

Different modalities extracted from videos, such as RGB and optical flows, may provide complementary cues for improving video action recognition. In this paper, we introduce a new modality named video sketch, which implies the human shape information, as a complementary modality for video action representation. We show that video action recognition can be enhanced by using the proposed video sketch. More specifically, we first generate video sketch with class distinctive action areas and then employ a two-stream network to combine the shape information extracted from image-based sketch and point-based sketch, followed by fusing the classification scores of two streams to generate shape representation for videos. Finally, we use the shape representation as the complementary one for the traditional appearance (RGB) and motion (optical flow) representations for the final video classification. We conduct extensive experiments on four human action recognition datasets – KTH, HMDB51, UCF101, Something-Something and UTI. The experimental results show that the proposed method outperforms the existing state-of-the-art action recognition methods.

Bibliographic Details

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know