Analysis of the Complementarity of Latent and Concept Spaces for Cross-Modal Video Search

Varsha Devi; Philippe Mulhem; Georges Quénot

doi:10.1145/3549555.3549600

Communication Dans Un Congrès Année : 2022

Analysis of the Complementarity of Latent and Concept Spaces for Cross-Modal Video Search

(1, 2, 3) , (2, 3) , (2, 3)

1
2
3

Varsha Devi

Fonction : Auteur

Université Grenoble Alpes

Laboratoire d'Informatique de Grenoble

Modélisation et Recherche d’Information Multimédia [Grenoble]

Philippe Mulhem

Fonction : Auteur

Laboratoire d'Informatique de Grenoble

Modélisation et Recherche d’Information Multimédia [Grenoble]

Georges Quénot

Fonction : Auteur

Laboratoire d'Informatique de Grenoble

Modélisation et Recherche d’Information Multimédia [Grenoble]

Résumé

This paper focuses on studying the complementarity between the spaces from hybrid crossmodal state-of-the-art systems for video retrieval like [5]. We aim at investigating if these spaces really convey different features, or if they are representing the same things. We use PCA (Principal Component Analysis) to study the optimal dimensions, CCA (Canonical Correlation Analysis) to assess the similarity of the spaces, and check if such approach is in fact similar to ensemble learning. We achieve experiments on the MST-VTT corpus, and show that in fact these two spaces are indeed very similar, paving the way for new models that could enforce more dissimilar spaces.

Mots clés

Latent Space Concept Space Correlation CCA Ensemble Learning Cross Modal Retrieval

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

Paper_CBMI_Hal_Version.pdf (256.47 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Varsha DEVI : Connectez-vous pour contacter le contributeur

https://hal.univ-grenoble-alpes.fr/hal-03813421

Soumis le : jeudi 13 octobre 2022-12:45:06

Dernière modification le : mercredi 18 décembre 2024-09:35:21

Archivage à long terme le : samedi 14 janvier 2023-18:56:24

Dates et versions

hal-03813421 , version 1 (13-10-2022)

Identifiants

HAL Id : hal-03813421 , version 1
DOI : 10.1145/3549555.3549600

Citer

Varsha Devi, Philippe Mulhem, Georges Quénot. Analysis of the Complementarity of Latent and Concept Spaces for Cross-Modal Video Search. CBMI 2022: International Conference on Content-based Multimedia Indexing, Sep 2022, Graz, Austria. pp.84-90, ⟨10.1145/3549555.3549600⟩. ⟨hal-03813421⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS LIG LIG_TDCGE_MRIM MIAI ANR LIG_SIDCH

45 Consultations

109 Téléchargements

Analysis of the Complementarity of Latent and Concept Spaces for Cross-Modal Video Search

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager