A semantic approach to analyze scientific paper abstracts

Abstract : Each domain and its underlying communities evolve in time and each period is centered on specific topics that emerge from textual sources that characterize the domain. Our analysis represents an extension of other researches performed on the same corpora that were focusing more on evaluating co-citations between the articles in order to compute their importance score (Grauwin and Jensen [1]). Our approach presents a general perspective of the domain by performing semantic comparisons between article abstracts using natural language processing techniques such as Latent Semantic Analysis, Latent Dirichlet Allocation or semantic distances in lexicalized ontologies, i.e. WordNet. Moreover, graph visual representations are generated using Gephi in order to highlight the keywords of each paper and of the domain, the document similarity view or the table of keyword-abstract overlap score. The purpose of the views is to minimize the learning curve of the domain and to facilitate the research process for someone interested in a particular subject. Also, in order to further argue the benefits of our approach, some potential refinements of the methods for classification that can be performed as future improvements are presented.
Complete list of metadatas

Cited literature [12 references]  Display  Hide  Download

Contributor : Philippe Dessus <>
Submitted on : Wednesday, June 10, 2015 - 8:28:02 PM
Last modification on : Wednesday, September 18, 2019 - 1:31:46 AM
Long-term archiving on : Tuesday, April 25, 2017 - 6:30:52 AM


Files produced by the author(s)





Ionut Cristian Paraschiv, Mihai Dascalu, Stefan Trausan-Matu, Philippe Dessus. A semantic approach to analyze scientific paper abstracts. 11th Int. Conf. ELearning and Software for Education (eLSE 2015), Apr 2015, Bucarest, Romania. pp.393-399, ⟨10.12753/2066-026X-15-000⟩. ⟨hal-01162590⟩



Record views


Files downloads