Microblog Topic Detection Based on LDA Model and Single-Pass Clustering


Bibliographic Details
Published in Rough Sets and Current Trends in Computing, pp. 166-171
Main Authors Huang, Bo; Yang, Yan; Mahmood, Amjad; Wang, Hongjun
Format Book Chapter
Language English, Japanese
Published Berlin, Heidelberg: Springer Berlin Heidelberg, 2012
Series Lecture Notes in Computer Science
ISBN 9783642321146, 3642321143
ISSN 0302-9743, 1611-3349
DOI 10.1007/978-3-642-32115-3_19

Summary: Microblogging is a recent social phenomenon of Web 2.0 technology with applications in many domains. It is a form of social media, recognized as real-time web publishing, that has won impressive audience acceptance and surprisingly changed online expression and interaction for millions of users. Clustering by topic can be very helpful for quick retrieval of desired information. We propose a novel topic detection technique that retrieves, in real time, the most emergent topics expressed by the community. Traditional text mining techniques make no special provision for short and sparse microblog data. In view of these characteristics, we adopt single-pass clustering and use the Latent Dirichlet Allocation (LDA) model in place of the traditional vector space model (VSM) to extract hidden microblog topic information. Experiments on a real dataset showed that the proposed method decreased the probabilities of miss and false alarm and reduced the normalized detection cost.
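The summary names single-pass clustering over LDA topic representations but gives no detail, so the following is only a minimal sketch of the general idea and not the authors' implementation. It assumes each microblog post has already been mapped to a topic-probability vector by some LDA model. The function name `single_pass_cluster`, the use of cosine similarity, the running-mean centroid update, the `threshold` value, and the sample vectors are all illustrative assumptions.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def single_pass_cluster(doc_topic_vectors, threshold=0.8):
    """Assign each vector, in arrival order, to the most similar existing
    cluster, or open a new cluster if no similarity reaches `threshold`.

    `doc_topic_vectors` is assumed to hold per-post LDA topic distributions;
    `threshold` is a hypothetical tuning parameter.
    """
    centroids = []  # running mean vector of each cluster
    sizes = []      # number of posts in each cluster
    labels = []     # cluster index assigned to each post
    for vec in doc_topic_vectors:
        best, best_sim = -1, -1.0
        for i, centroid in enumerate(centroids):
            sim = cosine(vec, centroid)
            if sim > best_sim:
                best, best_sim = i, sim
        if best >= 0 and best_sim >= threshold:
            # Fold the post into the cluster by updating the running mean.
            n = sizes[best]
            centroids[best] = [
                (c * n + x) / (n + 1) for c, x in zip(centroids[best], vec)
            ]
            sizes[best] = n + 1
            labels.append(best)
        else:
            # No cluster is close enough: the post starts a new topic.
            centroids.append(list(vec))
            sizes.append(1)
            labels.append(len(centroids) - 1)
    return labels, centroids


# Made-up topic distributions over three topics, for illustration only.
docs = [
    [0.90, 0.05, 0.05],
    [0.85, 0.10, 0.05],
    [0.05, 0.90, 0.05],
    [0.10, 0.85, 0.05],
    [0.88, 0.07, 0.05],
]
labels, centroids = single_pass_cluster(docs, threshold=0.8)
print(labels)
```

Because each post is compared only against the current cluster centroids and is never revisited, a scheme of this kind processes a stream in one pass, which is what makes it a candidate for real-time use. The result depends on arrival order and on the chosen threshold.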
Bibliography: This work is partially supported by the National Science Foundation of China (Nos. 61170111, 61003142 and 61152001) and the Fundamental Research Funds for the Central Universities (No. SWJTU11ZT08).