A Multimodal French Corpus of Aligned Speech, Text, and Pictogram Sequences for Speech-to-Pictogram Machine Translation

The automatic translation of spoken language into pictogram units can facilitate communication involving individuals with language impairments. However, there is no established translation formalism or publicly available datasets for training end-to-end speech translation systems. This paper introduces the first aligned speech, text, and pictogram translation dataset ever created in any language. We provide a French dataset that contains 230 hours of speech resources. We create a rule-based pictogram grammar with a restricted vocabulary and include a discussion of the strategic decisions involved. It takes advantage of an in-depth linguistic study of resources taken from the ARASAAC website. We validate these rules through multiple post-editing phases by expert annotators. The constructed dataset is then used to experiment with a Speech-to-Pictogram cascade model, which employs state-of-the-art Automatic Speech Recognition models. The dataset is freely available under a non-commercial licence. This marks a starting point to conduct research into the automatic translation of speech into pictogram units.

Mots clés

Pictograms Speech Machine Translation

Domaines

Informatique et langage [cs.CL] Intelligence artificielle [cs.AI]

Fichier principal

1210_Paper_LREC_Coling_Macaire.pdf (769.31 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Cécile Macaire : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04534234

Soumis le : vendredi 5 avril 2024-13:22:42

Dernière modification le : mercredi 18 décembre 2024-09:42:06

Archivage à long terme le : samedi 6 juillet 2024-19:21:12

Dates et versions

hal-04534234 , version 1 (05-04-2024)

Identifiants

HAL Id : hal-04534234 , version 1

Citer

Cécile Macaire, Chloé Dion, Jordan Arrigo, Claire Lemaire, Emmanuelle Esperança-Rodier, et al.. A Multimodal French Corpus of Aligned Speech, Text, and Pictogram Sequences for Speech-to-Pictogram Machine Translation. LREC-COLING 2024, May 2024, Turin, Italy. ⟨hal-04534234⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS LIG LIG_TDCGE_GETALP LAIRDIL ANR LIG_SIDCH UNIV-UT3 UT3-TOULOUSEINP

120 Consultations

97 Téléchargements