Addressing Code-Switching in French/Algerian Arabic Speech - Université Sorbonne Paris Cité Accéder directement au contenu
Communication Dans Un Congrès Année : 2017

Addressing Code-Switching in French/Algerian Arabic Speech

Résumé

This study focuses on code-switching (CS) in French/Algerian Arabic bilingual communities and investigates how speech technologies, such as automatic data partitioning, language identification and automatic speech recognition (ASR) can serve to analyze and classify this type of bilingual speech. A preliminary study carried out using a corpus of Maghrebian broadcast data revealed a relatively high presence of CS Alge-rian Arabic as compared to the neighboring countries Morocco and Tunisia. Therefore this study focuses on code switching produced by bilingual Algerian speakers who can be considered native speakers of both Algerian Arabic and French. A specific corpus of four hours of speech from 8 bilingual French Algerian speakers was collected. This corpus contains read speech and conversational speech in both languages and includes stretches of code-switching. We provide a linguistic description of the code-switching stretches in terms of intra-sentential and inter-sentential switches, the speech duration in each language. We report on some initial studies to locate French, Arabic and the code-switched stretches, using ASR system word posteriors for this pair of languages.
Fichier principal
Vignette du fichier
Addressing_Code-Switching_in_FrenchAlgerian_Arabic.pdf (1009.15 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

halshs-01969148 , version 1 (03-01-2019)

Identifiants

Citer

Djegdjiga Amazouz, Martine Adda-Decker, Lori Lamel. Addressing Code-Switching in French/Algerian Arabic Speech. Interspeech 2017, Aug 2017, Stockholm, Sweden. pp.62-66, ⟨10.21437/interspeech.2017-1373⟩. ⟨halshs-01969148⟩
964 Consultations
1281 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More