A unified cycle-consistent neural model for text and image retrieval

Citation DataMultimedia Tools and Applications, ISSN: 1573-7721, Vol: 79, Issue: 35-36, Page: 25697-25721

Publication Year2020

14
Citations
0
Usage
5
Captures
0
Mentions
0
Social Media

Metric Options: Counts1 Year3 Year

Metrics Details

Citations
14
- Citation Indexes
  14
Captures
5
- Readers
  5

Article Description

Text-image retrieval has been recently becoming a hot-spot research field, thanks to the development of deeply-learnable architectures which can retrieve visual items given textual queries and vice-versa. The key idea of many state-of-the-art approaches has been that of learning a joint multi-modal embedding space in which text and images could be projected and compared. Here we take a different approach and reformulate the problem of text-image retrieval as that of learning a translation between the textual and visual domain. Our proposal leverages an end-to-end trainable architecture that can translate text into image features and vice versa and regularizes this mapping with a cycle-consistency criterion. Experimental evaluations for text-to-image and image-to-text retrieval, conducted on small, medium and large-scale datasets show consistent improvements over the baselines, thus confirming the appropriateness of using a cycle-consistent constrain for the text-image matching task.

Bibliographic Details

DOI10.1007/s11042-020-09251-4

URL IDhttp://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85087652207&origin=inward; http://dx.doi.org/10.1007/s11042-020-09251-4; https://link.springer.com/10.1007/s11042-020-09251-4; https://dx.doi.org/10.1007/s11042-020-09251-4; https://link.springer.com/article/10.1007/s11042-020-09251-4

AUTHOR(S)

Marcella Cornia; Lorenzo Baraldi; Rita Cucchiara; Hamed R. Tavakoli

PUBLISHER(S)

Springer Science and Business Media LLC

TAG(S)

Computer Science; Engineering

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know