AUTHOR=Voskergian Daniel , Bakir-Gungor Burcu , Yousef Malik TITLE=TextNetTopics Pro, a topic model-based text classification for short text by integration of semantic and document-topic distribution information JOURNAL=Frontiers in Genetics VOLUME=14 YEAR=2023 URL=https://www.frontiersin.org/journals/genetics/articles/10.3389/fgene.2023.1243874 DOI=10.3389/fgene.2023.1243874 ISSN=1664-8021 ABSTRACT=
With the exponential growth in the daily publication of scientific articles, automatic classification and categorization can assist in assigning articles to a predefined category. Article titles are concise descriptions of the articles’ content with valuable information that can be useful in document classification and categorization. However, shortness, data sparseness, limited word occurrences, and the inadequate contextual information of scientific document titles hinder the direct application of conventional text mining and machine learning algorithms on these short texts, making their classification a challenging task. This study firstly explores the performance of our earlier study, TextNetTopics on the short text. Secondly, here we propose an advanced version called