Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Adversarial Learning Based Semantic Correlation Representation for Cross-Modal Retrieval

Version 1 : Received: 23 January 2020 / Approved: 24 January 2020 / Online: 24 January 2020 (15:03:34 CET)

How to cite: Zhu, L.; Song, J.; Wei, X.; Jun, L. Adversarial Learning Based Semantic Correlation Representation for Cross-Modal Retrieval. Preprints 2020, 2020010288. https://doi.org/10.20944/preprints202001.0288.v1 Zhu, L.; Song, J.; Wei, X.; Jun, L. Adversarial Learning Based Semantic Correlation Representation for Cross-Modal Retrieval. Preprints 2020, 2020010288. https://doi.org/10.20944/preprints202001.0288.v1

Abstract

With the rapid development of Internet and the widely usage of smart devices, massive multimedia data are generated, collected, stored and shared on the Internet. This trend makes cross-modal retrieval problem become a hot issue in this years. Many existing works pay attentions on correlation learning to generate a common subspace for cross-modal correlation measurement, and others uses adversarial learning technique to abate the heterogeneity of multi-modal data. However, very few works combine correlation learning and adversarial learning to bridge the inter-modal semantic gap and diminish cross-modal heterogeneity. This paper propose a novel cross-modal retrieval method, named ALSCOR, which is an end-to-end framework to integrate cross-modal representation learning, correlation learning and adversarial. CCA model, accompanied by two representation model, VisNet and TxtNet is proposed to capture non-linear correlation. Beside, intra-modal classifier and modality classifier are used to learn intra-modal discrimination and minimize the inter-modal heterogeneity. Comprehensive experiments are conducted on three benchmark datasets. The results demonstrate that the proposed ALSCOR has better performance than the state-of-the-arts.

Keywords

Cross-modal retrieval; Adversarial learning; Semantic correlation; Deep learning

Subject

Computer Science and Mathematics, Information Systems

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.