Mining tortured acronyms from the scientific literature - Recherche d’Information et Synthèse d’Information
Preprint, Working Paper Year: 2023

Abstract

The 'Problematic Paper Screener' (PPS, WCRI'22, https://doi.org/10.48550/arXiv.2210.04895) supports the human reassessment of scientific articles flagged as suspicious. The 'tortured detector' tabulates 12k papers containing tortured phrases: established scientific concepts paraphrased with synonyms, such as 'butt-centric waterway' for 'anal canal.' Even some acronyms are tortured, such as 'Convolutional Brain Organisation (CNN)' for 'Convolutional Neural Network (CNN).' This abstract tackles the following task: discover all acronyms in any given article and classify each one as tortured or genuine.
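
As a rough illustration of the task stated above, and not the authors' method, the sketch below extracts 'Long Form (ACRONYM)' pairs with a regular expression and labels a pair as tortured when the initials of the long form do not spell the acronym. The function names `find_acronyms` and `classify` are hypothetical, and the initials test is an assumed, deliberately naive heuristic: genuine acronyms that do not follow word initials would be mislabelled.

```python
import re

# Matches a parenthesised acronym of 2 to 6 capital letters, e.g. "(CNN)".
ACRONYM_RE = re.compile(r"\(([A-Z]{2,6})\)")
# Matches a word, allowing internal hyphens.
WORD_RE = re.compile(r"[A-Za-z][A-Za-z-]*")


def find_acronyms(text):
    """Return (long_form, acronym) pairs for each 'Long Form (ACR)' pattern.

    Hypothetical helper: assumes the long form is the run of words just
    before the parenthesis, one word per acronym letter.
    """
    pairs = []
    for match in ACRONYM_RE.finditer(text):
        acronym = match.group(1)
        words = WORD_RE.findall(text[:match.start()])
        candidate = words[-len(acronym):]
        if len(candidate) == len(acronym):
            pairs.append((" ".join(candidate), acronym))
    return pairs


def classify(long_form, acronym):
    """Label a pair 'genuine' if the word initials spell the acronym,
    otherwise 'tortured' (naive heuristic, assumed for illustration)."""
    initials = "".join(word[0] for word in long_form.split()).upper()
    return "genuine" if initials == acronym else "tortured"


if __name__ == "__main__":
    sample = (
        "We train a Convolutional Brain Organisation (CNN) and a "
        "Convolutional Neural Network (CNN)."
    )
    for long_form, acronym in find_acronyms(sample):
        print(acronym, "|", long_form, "|", classify(long_form, acronym))
```

A realistic screener would likely need a curated reference of established expansions rather than an initials check alone, since a tortured long form can still share initials with the genuine one.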

Main file
WCRI2024-TorturedAcronyms.pdf (82.13 KB)
OP14.3_WCRI2024_presentation.pdf (1.27 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-04311600, version 1 (28-11-2023)
hal-04311600, version 2 (24-09-2024)

Identifiers

  • HAL Id: hal-04311600, version 1

Cite

Alexandre Clausse, Guillaume Cabanac, Pascal Cuxac, Cyril Labbé. Mining tortured acronyms from the scientific literature. 2023. ⟨hal-04311600v1⟩

Collections

IRIT-UT3