Not All Words Are Created Equal: Extracting Semantic Orientation as a Function of Adjective Relevance

Bibliographic Details
Published in: AI 2007: Advances in Artificial Intelligence, Vol. 4830, pp. 337-346
Main Authors: Voll, Kimberly; Taboada, Maite
Format: Book Chapter
Language: English
Published: Springer Berlin Heidelberg, Germany, 2007
Series: Lecture Notes in Computer Science
ISBN: 9783540769262, 3540769269
ISSN: 0302-9743, 1611-3349
DOI: 10.1007/978-3-540-76928-6_35

Summary: Semantic orientation (SO) for texts is often determined on the basis of the positive or negative polarity, or sentiment, found in the text. Polarity is typically extracted using the positive and negative words in the text, with a particular focus on adjectives, since they convey a high degree of opinion. Not all adjectives are created equal, however. Adjectives found in certain parts of the text, and adjectives that refer to particular aspects of what is being evaluated, have more significance for the overall sentiment of the text. To capitalize upon this, we weigh adjectives according to their relevance and create three measures of SO: a baseline SO using all adjectives (no restriction); SO using adjectives found in on-topic sentences as determined by a decision-tree classifier; and SO using adjectives in the nuclei of sentences extracted from a high-level discourse parse of the text. In both cases of restricting adjectives based on relevance, performance is comparable to current results in automated SO extraction. Improvements in the decision classifier and discourse parser will likely cause this result to surpass current benchmarks.
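
The three measures described in the summary can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the chapter: the toy lexicon and its scores, the `Adjective` record, the `on_topic` and `in_nucleus` flags (which stand in for the output of the decision-tree classifier and the discourse parser), and the choice of a simple mean as the aggregate. Relevance is modelled as a 0/1 weight, i.e. an adjective is either kept or dropped.

```python
from dataclasses import dataclass


@dataclass
class Adjective:
    word: str
    on_topic: bool    # hypothetical flag: sentence judged on-topic by a classifier
    in_nucleus: bool  # hypothetical flag: adjective lies in a discourse nucleus


# Toy polarity lexicon; the values are illustrative, not from the chapter.
LEXICON = {"great": 2.0, "dull": -1.5, "awful": -3.0, "nice": 1.0}


def semantic_orientation(adjectives, keep=lambda a: True):
    """Mean lexicon score of the adjectives that pass the `keep` filter."""
    scores = [LEXICON[a.word] for a in adjectives
              if a.word in LEXICON and keep(a)]
    return sum(scores) / len(scores) if scores else 0.0


review = [
    Adjective("great", on_topic=True, in_nucleus=True),
    Adjective("awful", on_topic=False, in_nucleus=False),
    Adjective("nice", on_topic=True, in_nucleus=False),
]

baseline = semantic_orientation(review)                             # all adjectives
topic_so = semantic_orientation(review, keep=lambda a: a.on_topic)  # on-topic only
nucleus_so = semantic_orientation(review, keep=lambda a: a.in_nucleus)  # nuclei only
```

In this toy review the off-topic adjective "awful" cancels out the positive ones in the baseline, while both restricted measures come out positive, which is the kind of effect the relevance restriction is meant to capture.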