Open Journal of Applied Sciences

Volume 11, Issue 4 (April 2021)

ISSN Print: 2165-3917   ISSN Online: 2165-3925

Google-based Impact Factor: 0.92  Citations  h5-index & Ranking

Identification of Topics from Scientific Papers through Topic Modeling

HTML  XML Download Download as PDF (Size: 245KB)  PP. 541-548  
DOI: 10.4236/ojapps.2021.104038    798 Downloads   3,306 Views  Citations

ABSTRACT

Topic modeling is a probabilistic model that identifies topics covered in text(s). In this paper, topics were loaded from two implementations of topic modeling, namely, Latent Semantic Indexing (LSI) and Latent Dirichlet Allocation (LDA). This analysis was performed in a corpus of 1000 academic papers written in English, obtained from PLOS ONE website, in the areas of Biology, Medicine, Physics and Social Sciences. The objective is to verify if the four academic fields were represented in the four topics obtained by topic modeling. The four topics obtained from Latent Semantic Indexing (LSI) and Latent Dirichlet Allocation (LDA) did not represent the four academic fields.

Share and Cite:

Owa, D. (2021) Identification of Topics from Scientific Papers through Topic Modeling. Open Journal of Applied Sciences, 11, 541-548. doi: 10.4236/ojapps.2021.104038.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.