PlumX Metrics

Vision Transformers for Breast Cancer Histology Image Classification

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), ISSN: 1611-3349, Vol: 14366, Pages: 15-26
2024
  • Citations: 2
  • Usage: 0
  • Captures: 8
  • Mentions: 0
  • Social Media: 0


Conference Paper Description

We propose a self-attention Vision Transformer (ViT) model tailored for breast cancer histology image classification. The proposed architecture uses a stack of transformer layers, with each layer consisting of a multi-head self-attention mechanism and a position-wise feed-forward network, and it is trained with different strategies and configurations, including pretraining, resize dimension, data augmentation, patch overlap, and patch size, to investigate their impact on performance on the histology image classification task. Experimental results show that pretraining on ImageNet and using geometric and color data augmentation techniques significantly improve the model’s accuracy on the task. Additionally, a patch size of 16 × 16 and no patch overlap were found to be optimal for this task. These findings provide valuable insights for the design of future ViT-based models for similar image classification tasks.
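For illustration only, the sketch below shows how a ViT of the kind described above could be assembled in PyTorch: a convolutional patch embedding producing 16 × 16 non-overlapping patches (stride equal to the patch size), a stack of transformer encoder layers that each combine multi-head self-attention with a position-wise feed-forward network, and a classification head on the class token. The embedding dimension, depth, number of heads, and number of classes are illustrative assumptions, not values taken from the paper.

import torch
import torch.nn as nn

class ViTClassifier(nn.Module):
    """Minimal ViT sketch: 16x16 non-overlapping patches -> transformer encoder -> class head."""

    def __init__(self, image_size=224, patch_size=16, in_chans=3,
                 embed_dim=768, depth=12, num_heads=12, num_classes=4):
        # embed_dim, depth, num_heads, num_classes are placeholder values.
        super().__init__()
        num_patches = (image_size // patch_size) ** 2
        # Non-overlapping patch embedding: stride equals patch size.
        self.patch_embed = nn.Conv2d(in_chans, embed_dim,
                                     kernel_size=patch_size, stride=patch_size)
        self.cls_token = nn.Parameter(torch.zeros(1, 1, embed_dim))
        self.pos_embed = nn.Parameter(torch.zeros(1, num_patches + 1, embed_dim))
        # Each layer: multi-head self-attention + position-wise feed-forward network.
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=embed_dim, nhead=num_heads, dim_feedforward=4 * embed_dim,
            activation="gelu", batch_first=True, norm_first=True)
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=depth)
        self.norm = nn.LayerNorm(embed_dim)
        self.head = nn.Linear(embed_dim, num_classes)

    def forward(self, x):
        # (B, C, H, W) -> (B, num_patches, embed_dim)
        x = self.patch_embed(x).flatten(2).transpose(1, 2)
        cls = self.cls_token.expand(x.shape[0], -1, -1)
        x = torch.cat([cls, x], dim=1) + self.pos_embed
        x = self.encoder(x)
        # Classify from the class token representation.
        return self.head(self.norm(x[:, 0]))

if __name__ == "__main__":
    model = ViTClassifier()
    logits = model(torch.randn(2, 3, 224, 224))
    print(logits.shape)  # torch.Size([2, 4])

Pretraining on ImageNet, as reported in the abstract, would correspond to loading pretrained encoder weights before fine-tuning this model on the histology images; that step is not shown here.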

