Frequency domain adaptive framework for visible-infrared person re-identification
International Journal of Machine Learning and Cybernetics, ISSN: 1868-808X, Vol: 16, Issue: 4, Page: 2553-2566
2024
Metric Options: CountsSelecting the 1-year or 3-year option will change the metrics count to percentiles, illustrating how an article or review compares to other articles or reviews within the selected time period in the same journal. Selecting the 1-year option compares the metrics against other articles/reviews that were also published in the same calendar year. Selecting the 3-year option compares the metrics against other articles/reviews that were also published in the same calendar year plus the two years prior.
Example: if you select the 1-year option for an article published in 2019 and a metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019. If you select the 3-year option for the same article published in 2019 and the metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019, 2018 and 2017.
Citation Benchmarking is provided by Scopus and SciVal and is different from the metrics context provided by PlumX Metrics.
Example: if you select the 1-year option for an article published in 2019 and a metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019. If you select the 3-year option for the same article published in 2019 and the metric category shows 90%, that means that the article or review is performing better than 90% of the other articles/reviews published in that journal in 2019, 2018 and 2017.
Citation Benchmarking is provided by Scopus and SciVal and is different from the metrics context provided by PlumX Metrics.
Article Description
The visible-infrared person re-identification task aims to achieve mutual retrieval between infrared images and visible images. The primary challenge is to learn the mapping of these two modalities into a common latent space. Prior works have mainly focused on network feature extraction, but have overlooked the local information of high-frequency channel features, the global information of low-frequency channel features, and the interaction effects between them, all of which are crucial for effectively aligning feature spaces and enhancing cross-modal recognition accuracy, robustness, and overall performance. To address this issue, we propose a frequency domain adaptive framework. Specifically, we designed the frequency domain adaptive encoder to achieve frequency domain adaptation. And the diverse wise embedding was designed to efficiently extract multi-scale features with fewer parameters. Additionally, we proposed the similarity distance clustering strategy, which reduces the large gaps between different modalities by minimizing the KL divergence between visible-infrared similarity distributions images and the normalized label clustering distributions. Our method has been proven superior on two public datasets and achieves state-of-the-art performance on the RegDB dataset.
Bibliographic Details
Springer Science and Business Media LLC
Provide Feedback
Have ideas for a new metric? Would you like to see something else here?Let us know