LIVIVO - The Search Portal for Life Sciences

Article ; Online: An Efficient Topic Modeling Approach for Text Mining and Information Retrieval through K-means Clustering

Junaid Rashid / Syed Muhammad Adnan Shah / Syed Aun Irtaza

Mehran University Research Journal of Engineering and Technology, Vol 39, Iss 1, Pp 213-222 (2020)

Abstract: Topic modeling is an effective text mining and information retrieval approach for organizing knowledge with varied content under specific topics. Text documents in the form of news articles are growing rapidly on the web, and analyzing them is important in text mining and information retrieval. Extracting meaningful information from these documents is a challenging task. One approach to discovering the themes in text documents is topic modeling, but this approach still needs a new perspective to improve its performance. In topic modeling, documents are composed of topics, and topics are collections of words. In this paper, we propose a new k-means topic modeling (KTM) approach that uses the k-means clustering algorithm. KTM discovers better semantic topics from a collection of documents. Experiments on two real-world datasets, Reuters 21578 and BBC News, show that KTM performs better than state-of-the-art topic models such as LDA (Latent Dirichlet Allocation) and LSA (Latent Semantic Analysis). KTM is also applicable to classification and clustering tasks in text mining, where it achieves higher performance than its competitors LDA and LSA.
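
The abstract does not spell out how KTM works, so the following is only a minimal sketch of the general idea it names: cluster documents with k-means and read each cluster's highest-weighted words as a topic. The TF-IDF weighting, the deterministic farthest-point initialization, the toy documents, and every function name here are assumptions made for illustration and are not taken from the paper.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Return (vocabulary, L2-normalized TF-IDF vectors) for tokenized docs."""
    vocab = sorted({w for doc in docs for w in doc})
    index = {w: i for i, w in enumerate(vocab)}
    df = Counter(w for doc in docs for w in set(doc))  # document frequency
    n = len(docs)
    vectors = []
    for doc in docs:
        vec = [0.0] * len(vocab)
        for w, count in Counter(doc).items():
            # Smoothed IDF (an assumption; the paper may weight terms differently).
            vec[index[w]] = count * (math.log((1 + n) / (1 + df[w])) + 1.0)
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        vectors.append([x / norm for x in vec])
    return vocab, vectors

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(vectors, k, iters=50):
    """Plain Lloyd's k-means with farthest-point initialization."""
    # Assumption: deterministic init (first vector, then the farthest ones)
    # so the sketch is reproducible; real systems often use random restarts.
    centroids = [list(vectors[0])]
    while len(centroids) < k:
        farthest = max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        centroids.append(list(farthest))
    labels = [-1] * len(vectors)
    for _ in range(iters):
        # Assignment step: each document goes to its nearest centroid.
        new_labels = [min(range(k), key=lambda c: sq_dist(v, centroids[c]))
                      for v in vectors]
        if new_labels == labels:
            break  # converged
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids

def top_terms(vocab, centroid, n=3):
    """Words with the largest centroid weights, read as the cluster's topic."""
    ranked = sorted(range(len(vocab)), key=lambda i: -centroid[i])
    return [vocab[i] for i in ranked[:n]]

# Toy corpus (hypothetical): two finance headlines and two sports headlines.
docs = [
    "stocks fell as markets reacted to interest rates",
    "central bank raises interest rates markets slide",
    "team wins final match after late goal",
    "striker scores goal as team wins match",
]
tokens = [d.split() for d in docs]
vocab, vectors = tfidf_vectors(tokens)
labels, centroids = kmeans(vectors, k=2)
topics = [top_terms(vocab, c) for c in centroids]
for topic_id, words in enumerate(topics):
    print(topic_id, words)
```

In this sketch the cluster label doubles as a document's topic assignment, which is what makes a k-means-based model directly usable for the clustering and classification tasks the abstract mentions. No stop-word removal is done, so function words such as "as" and "to" stay in the vocabulary.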
Keywords: Technology ; T ; Engineering (General). Civil engineering (General) ; TA1-2040 ; Science ; Q
Subject code: 006
Language: English
Publishing date: 2020-01-01T00:00:00Z
Publisher: Mehran University of Engineering and Technology
Document type: Article ; Online
Database: BASE - Bielefeld Academic Search Engine (life sciences selection)
