An unsupervised sentiment classifier on summarized or full reviews

Maria Soledad Pera, Rani Qumsiyeh, Yiu Kai Ng

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

5 Citations (Scopus)

Abstract

These days web users searching for opinions expressed by others on a particular product or service PS can turn to review repositories, such as Epinions.com or Imdb.com. While these repositories often provide a high quantity of reviews on PS, browsing through archived reviews to locate different opinions expressed on PS is a time-consuming and tedious task, and in most cases, a very labor-intensive process. To simplify the task of identifying reviews expressing positive, negative, and neutral opinions on PS, we introduce a simple, yet effective sentiment classifier, denoted SentiClass, which categorizes reviews on PS using the semantic, syntactic, and sentiment content of the reviews. To speed up the classification process, SentiClass summarizes each review to be classified using eSummar, a single-document, extractive, sentiment summarizer proposed in this paper, based on various sentence scores and anaphora resolution. SentiClass (eSummar, respectively) is domain and structure independent and does not require any training for performing the classification (summarization, respectively) task. Empirical studies conducted on two widely-used datasets, Movie Reviews and Game Reviews, in addition to a collection of Epinions.com reviews, show that SentiClass (i) is highly accurate in classifying summarized or full reviews and (ii) outperforms well-known classifiers in categorizing reviews.

Original languageEnglish
Title of host publicationWeb Information Systems Engineering, WISE 2010 - 11th International Conference, Proceedings
Pages142-156
Number of pages15
DOIs
Publication statusPublished - 2010
Externally publishedYes
Event11th International Conference on Web Information Systems Engineering, WISE 2010 - Hong Kong, China
Duration: 12 Dec 201014 Dec 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6488 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference11th International Conference on Web Information Systems Engineering, WISE 2010
Country/TerritoryChina
CityHong Kong
Period12/12/1014/12/10

Fingerprint

Dive into the research topics of 'An unsupervised sentiment classifier on summarized or full reviews'. Together they form a unique fingerprint.

Cite this