Microblog Topic Detection Based on LDA Model and Single-Pass Clustering
Microblogging is a recent social phenomenon of Web2.0 technology, having applications in many domains. It is another form of social media, recognized as Real-Time Web Publishing, which has won an impressive audience acceptance and surprisingly changed online expression and interaction for millions o...
Saved in:
| Published in | Rough Sets and Current Trends in Computing pp. 166 - 171 |
|---|---|
| Main Authors | , , , |
| Format | Book Chapter |
| Language | English Japanese |
| Published |
Berlin, Heidelberg
Springer Berlin Heidelberg
2012
|
| Series | Lecture Notes in Computer Science |
| Subjects | |
| Online Access | Get full text |
| ISBN | 9783642321146 3642321143 |
| ISSN | 0302-9743 1611-3349 |
| DOI | 10.1007/978-3-642-32115-3_19 |
Cover
| Summary: | Microblogging is a recent social phenomenon of Web2.0 technology, having applications in many domains. It is another form of social media, recognized as Real-Time Web Publishing, which has won an impressive audience acceptance and surprisingly changed online expression and interaction for millions of users.It is observed that clustering by topic can be very helpful for the quick retrieval of desired information. We propose a novel topic detection technique that permits to retrieve in real-time the most emergent topics expressed by the community. Traditional text mining techniques have no special considerations for short and sparse microblog data. Keeping in view these special characteristics of data, we adopt Single-pass Clustering technique by using Latent Dirichlet Allocation (LDA) Model in place of traditional VSM model, to extract the hidden microblog topics information. Experiments on actual dataset results showed that the proposed method decreased the probabilities of miss and false alarm, as well as reduced the normalized detection cost. |
|---|---|
| Bibliography: | This work is partially supported by the National Science Foundation of China (Nos. 61170111 , 61003142 and 61152001) and the Fundamental Research Funds for the Central Universities (No. SWJTU11ZT08). |
| ISBN: | 9783642321146 3642321143 |
| ISSN: | 0302-9743 1611-3349 |
| DOI: | 10.1007/978-3-642-32115-3_19 |