A Novel Feature Engineering Approach for Twitter-Based Text Sentiment Analysis

With the increasing availability of handheld devices and greater affordability of mobile data, social media has become an inseparable part of the daily life of most of the society. Free availability, diversity, and massiveness of this data have inspired many research work to use it for extracting va...

Full description

Saved in:
Bibliographic Details
Published inEvolving Technologies for Computing, Communication and Smart World Vol. 694; pp. 299 - 315
Main Authors Nandy, Hiran, Sridhar, Rajeswari
Format Book Chapter
LanguageEnglish
Published Singapore Springer 2020
Springer Singapore
SeriesLecture Notes in Electrical Engineering
Subjects
Online AccessGet full text
ISBN9789811578038
9811578036
ISSN1876-1100
1876-1119
DOI10.1007/978-981-15-7804-5_23

Cover

More Information
Summary:With the increasing availability of handheld devices and greater affordability of mobile data, social media has become an inseparable part of the daily life of most of the society. Free availability, diversity, and massiveness of this data have inspired many research work to use it for extracting various insights about a variety of subjects. Text sentiment analysis is the process of mining opinion polarity from a given document in natural language. Social media data is no exception to extract sentiments which could be used for a variety of tasks right from opinion mining to recommendations. In this work, Twitter has been chosen as the source of data corpus required due to the summarized content, ease of availability, and humongous reach among all classes of the society. This work uses traditional machine learning approaches for solving the problem of text sentiment analysis and proposes a novel feature engineering approach for merging the text-based and non-textual features of the dataset by including the predicted output lists also in the final feature set. The result analysis shows significant improvement in the performance of the machine learning model on the test dataset using the proposed approach.
ISBN:9789811578038
9811578036
ISSN:1876-1100
1876-1119
DOI:10.1007/978-981-15-7804-5_23