Unsupervised Video Hashing with Multi-granularity Contextualization and Multi-structure Preservation

Citation Data: MM 2022 - Proceedings of the 30th ACM International Conference on Multimedia, Pages: 3754-3763

Publication Year: 2022

Metrics Details

Citations: 8
- Citation Indexes: 8
Usage: 14
- Downloads: 14
Captures: 3
- Readers: 3

Conference Paper Description

Unsupervised video hashing aims to learn a compact binary vector that represents complex video content without using manual annotations. Existing unsupervised hashing methods generally explore only part of the dependencies (e.g., long-range and short-range) and data structures present in visual content, which results in less discriminative hash codes. In this paper, we propose a Multi-granularity Contextualized and Multi-Structure preserved Hashing (MCMSH) method, which simultaneously explores multiple axial contexts for discriminative video representation generation and various kinds of structural information for unsupervised learning. Specifically, we design three self-gating modules to separately model three granularities of dependencies (i.e., long/middle/short-range dependencies) and densely integrate them into MLP-Mixer for feature contextualization, leading to a novel model, MC-MLP. To facilitate unsupervised learning, we investigate three kinds of data structures, namely clusters, local neighborhood similarity structure, and inter/intra-class variations, and design a multi-objective task to train MC-MLP. These data structures are highly complementary in hash code learning. We conduct extensive experiments on three video retrieval benchmark datasets, demonstrating that MCMSH not only significantly boosts the performance of the backbone MLP-Mixer but also notably outperforms the competing methods. Code is available at: https://github.com/haoyanbin918/MCMSH.
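
As a rough illustration of what a "compact binary vector" buys at retrieval time, the sketch below binarizes real-valued video features and ranks a small database by Hamming distance. It is not the MCMSH model: the zero-threshold binarization, the toy feature values, and the function names `binarize`, `hamming_distance`, and `retrieve` are all assumptions made for this example. The authors' actual implementation is in the repository linked above.

```python
from typing import List, Sequence


def binarize(features: Sequence[float]) -> List[int]:
    """Threshold real-valued features at zero to get a 0/1 hash code.

    Assumption: a plain sign threshold; the paper's learned hashing
    layer is not reproduced here.
    """
    return [1 if value > 0 else 0 for value in features]


def hamming_distance(code_a: Sequence[int], code_b: Sequence[int]) -> int:
    """Count the bit positions where two equal-length codes differ."""
    if len(code_a) != len(code_b):
        raise ValueError("hash codes must have the same length")
    return sum(bit_a != bit_b for bit_a, bit_b in zip(code_a, code_b))


def retrieve(query_code: Sequence[int],
             database_codes: Sequence[Sequence[int]],
             top_k: int) -> List[int]:
    """Return indices of the top_k database codes closest to the query."""
    order = sorted(
        range(len(database_codes)),
        key=lambda index: hamming_distance(query_code, database_codes[index]),
    )
    return order[:top_k]


# Toy, made-up feature vectors standing in for three encoded videos.
video_features = [
    [0.9, -0.2, 0.4, -0.7],
    [0.8, -0.1, 0.3, 0.6],
    [-0.5, 0.7, -0.3, 0.2],
]
database_codes = [binarize(features) for features in video_features]
query_code = binarize([1.0, -1.0, 1.0, -1.0])
ranking = retrieve(query_code, database_codes, top_k=2)
```

Because the codes are short bit vectors, comparing a query against the database reduces to counting differing bits, which is why more discriminative hash codes translate directly into better retrieval rankings.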

Bibliographic Details

DOI: 10.1145/3503161.3547836

REPOSITORY URL: https://ink.library.smu.edu.sg/sis_research/9014

URL ID: http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85143415046&origin=inward; http://dx.doi.org/10.1145/3503161.3547836; https://dl.acm.org/doi/10.1145/3503161.3547836; https://ink.library.smu.edu.sg/sis_research/9014; https://ink.library.smu.edu.sg/cgi/viewcontent.cgi?article=10017&context=sis_research; https://dx.doi.org/10.1145/3503161.3547836

AUTHOR(S)

Yanbin Hao; Jingru Duan; Pengyuan Zhou; Xiangnan He; Hao Zhang; Bin Zhu

PUBLISHER(S)

Association for Computing Machinery (ACM)

TAG(S)

Computer Science
