Conference paper

A semantic approach to analyze scientific paper abstracts

Abstract: Each domain and its underlying communities evolve over time, and each period is centered on specific topics that emerge from the textual sources that characterize the domain. Our analysis extends previous research on the same corpora, which focused on evaluating co-citations between articles in order to compute their importance scores (Grauwin and Jensen [1]). Our approach offers a general perspective of the domain by performing semantic comparisons between article abstracts using natural language processing techniques such as Latent Semantic Analysis, Latent Dirichlet Allocation, or semantic distances in lexicalized ontologies, namely WordNet. Moreover, graph-based visual representations are generated using Gephi in order to highlight the keywords of each paper and of the domain, together with a document similarity view and a table of keyword-abstract overlap scores. The purpose of these views is to shorten the learning curve of the domain and to facilitate the research process for someone interested in a particular subject. Finally, to further argue the benefits of our approach, we present potential refinements of the classification methods that can be carried out as future improvements.
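As a rough illustration of what a semantic comparison between abstracts and a document similarity view involve, the sketch below computes pairwise cosine similarities over TF-IDF vectors. This is a deliberately simpler stand-in, not the paper's method: the abstract names Latent Semantic Analysis, Latent Dirichlet Allocation and WordNet distances, none of which are implemented here. The function names (`tokenize`, `tfidf_vectors`, `cosine`, `similarity_matrix`), the smoothed IDF formula and the three toy abstracts are all assumptions made for the example.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only (hypothetical helper)."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(docs):
    """Build one sparse TF-IDF vector (a dict of term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term occurs.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        # Smoothed IDF (an assumption of this sketch) keeps every weight positive.
        vectors.append(
            {t: c * (math.log((1 + n_docs) / (1 + df[t])) + 1.0) for t, c in tf.items()}
        )
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def similarity_matrix(docs):
    """Pairwise similarity between all documents."""
    vecs = tfidf_vectors(docs)
    return [[cosine(a, b) for b in vecs] for a in vecs]


# Toy abstracts, invented for illustration only.
abstracts = [
    "semantic analysis of scientific abstracts with latent semantic analysis",
    "latent semantic analysis compares scientific abstracts",
    "graph visualization of keyword networks",
]
sim = similarity_matrix(abstracts)
```

Each off-diagonal entry of `sim` could then be treated as a weighted edge between two papers when building a similarity graph; how the authors actually exported their data to Gephi is not stated in the abstract.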

Cited literature: 12 references
Contributor: Philippe Dessus
Submitted on: Wednesday, June 10, 2015 - 20:28:02
Last modified on: Friday, November 6, 2020 - 04:48:23
Long-term archiving on: Tuesday, April 25, 2017 - 06:30:52


Files produced by the author(s)





Ionut Cristian Paraschiv, Mihai Dascalu, Stefan Trausan-Matu, Philippe Dessus. A semantic approach to analyze scientific paper abstracts. 11th Int. Conf. ELearning and Software for Education (eLSE 2015), Apr 2015, Bucharest, Romania. pp.393-399, ⟨10.12753/2066-026X-15-000⟩. ⟨hal-01162590⟩


