Content Representation for Microblog Rumor Detection

In recent years, various social network applications have emerged to meet users demand of social activity. As the biggest Chinese Microblog platform, Sina Weibo not only provides users with a lot of information, but also promotes the diffusion spread of rumors which generated huge negative social im...

Full description

Saved in:
Bibliographic Details
Published inAdvances in Computational Intelligence Systems Vol. 513; pp. 245 - 251
Main Authors Ma, Ben, Lin, Dazhen, Cao, Donglin
Format Book Chapter
LanguageEnglish
Published Switzerland Springer International Publishing AG 2016
Springer International Publishing
SeriesAdvances in Intelligent Systems and Computing
Subjects
Online AccessGet full text
ISBN3319465619
9783319465616
ISSN2194-5357
2194-5365
DOI10.1007/978-3-319-46562-3_16

Cover

More Information
Summary:In recent years, various social network applications have emerged to meet users demand of social activity. As the biggest Chinese Microblog platform, Sina Weibo not only provides users with a lot of information, but also promotes the diffusion spread of rumors which generated huge negative social impacts. To quickly detect rumors from Sina Weibo, many research works focus on social attributes in social network. However, content play an important role in rumor diffusion, and it was ignored in many research works. In this paper, we use two different text representations, bag of words model and neural network language model, to generate text vectors from rumor contents. Furthermore, we compared performance of two text representations in rumor detection by using some state-of-the-art classification algorithms. From the experiments in 10,000 Sina Weibo posts, we found that the best classification accuracy of bag of words model is over 90 %, and the best classification accuracy of neural network language model is over 60 %. It indicates that words of posts are more useful than semantic context vectors representation in rumor detection.
ISBN:3319465619
9783319465616
ISSN:2194-5357
2194-5365
DOI:10.1007/978-3-319-46562-3_16