Two-stage fine-grained image classification model based on multi-granularity feature fusion

Citation DataPattern Recognition, ISSN: 0031-3203, Vol: 146, Page: 110042

Publication Year2024

8
Citations
0
Usage
2
Captures
0
Mentions
0
Social Media

Metric Options: Counts1 Year3 Year

Metrics Details

Citations
8
- Citation Indexes
  8
Captures
2
- Readers
  2

Article Description

Fine-grained visual classification (FGVC) is a difficult task due to the challenges of discriminative feature learning. Most existing methods directly use the final output of the network which always contains the global feature with high-level semantic information. However, the differences between fine-grained images are reflected in subtle local regions which often appear in the front of the network. When the texture of the background and object are similar or the proportion of the background is too large, the prediction will be greatly affected. In order to solve the above problems, this paper proposes multi-granularity feature fusion module (MGFF) and two-stage classification based on Vision-Transformer (ViT). The former comprehensively represents images by fusing features of different granularities, thus avoiding the limitations of single-scale features. The latter leverages the ViT model to separate the object from the background at a very small cost, thereby improving the accuracy of the prediction. We conduct comprehensive experiments and achieves the best performance in two fine-grained tasks on CUB-200-2011 and NA-Birds.

Bibliographic Details

DOI10.1016/j.patcog.2023.110042

URL IDhttp://www.sciencedirect.com/science/article/pii/S0031320323007392; http://dx.doi.org/10.1016/j.patcog.2023.110042; http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85174442880&origin=inward; https://linkinghub.elsevier.com/retrieve/pii/S0031320323007392; https://dx.doi.org/10.1016/j.patcog.2023.110042

AUTHOR(S)

Yang Xu; Shanshan Wu; Biqi Wang; Ming Yang; Zebin Wu; Yazhou Yao; Zhihui Wei

PUBLISHER(S)

Elsevier BV

TAG(S)

Computer Science

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know