Reflecting Comprehension through French Textual Complexity Factors

Abstract : Research efforts in terms of automatic textual complexity analysis are mainly focused on English vocabulary and few adaptations exist for other languages. Starting from a solid base in terms of discourse analysis and existing textual complexity assessment model for English, we introduce a French model trained on 200 documents extracted from school manuals pre-classified into five complexity classes. The underlying textual complexity metrics include surface, syntactic, morphological, semantic and discourse specific factors that are afterwards combined through the use of Support Vector Machines. In the end, each factor is correlated to pupil comprehension metrics scores, spanning throughout multiple classes, therefore creating a clearer perspective in terms of measurements impacting the perceived difficulty of a given text. In addition to purely quantitative surface factors, specific parts of speech and cohesion have proven to be reliable predictors of learners' comprehension level, creating nevertheless a strong background for building dependable French textual complexity models.
Complete list of metadatas
Contributor : Philippe Dessus <>
Submitted on : Monday, November 17, 2014 - 3:21:41 PM
Last modification on : Tuesday, September 17, 2019 - 9:46:52 AM
Long-term archiving on : Friday, April 14, 2017 - 4:05:49 PM


Files produced by the author(s)





Mihai Dascalu, Lucia Larise Stavarache, Stefan Trausan-Matu, Philippe Dessus, Maryse Bianco. Reflecting Comprehension through French Textual Complexity Factors. 26th IEEE Int. Conf. on Tools with Artificial Intelligence (ICTAI 2014), Nov 2014, Limassol, Cyprus. pp.615-619, ⟨10.1109/ICTAI.2014.97⟩. ⟨hal-01083594⟩



Record views


Files downloads