RMIT University
Browse

Sentiment analysis on Twitter through topic-based lexicon expansion

Download (307.46 kB)
conference contribution
posted on 2024-11-23, 05:59 authored by Zhixin Zhou, Xiuzhen ZhangXiuzhen Zhang, Mark SandersonMark Sanderson
Supervised learning approaches are domain-dependent and it is costly to obtain labeled training data from different domains. Lexiconbased approaches enjoy stable performance across domains, but often cannot capture domain-dependent features. It is also hard for lexicon-based classifiers to identify the polarities of abbreviations and misspellings, which are common in short informal social text but usually not found in general sentiment lexicons. We propose to overcome this limitation by expanding a general lexicon with domain-dependent opinion words as well as abbreviations and informal opinion expressions. The expanded terms are automatically selected based on their mutual information with emoticons. As there is an abundant amount of emoticon-bearing tweets on Twitter, our approach provides a way to do domain-dependent sentiment analysis without the cost of data annotation. We show that our technique leads to statistically significant improvements in classification accuracies across 56 topics with a state-of-the-art lexicon-based classifier. We also present the expanded terms, and show the most representative opinion expressions obtained from co-occurrence with emoticons.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1007/978-3-319-08608-8_9
  2. 2.
    ISBN - Is published in 9783319086088 (urn:isbn:9783319086088)

Start page

98

End page

109

Total pages

12

Outlet

Proceedings of The 25th Australasian Database Conference (ADC 2014): Databases theory and applications

Editors

Hua Wang, Mohamed A. Sharaf

Name of conference

ADC 2014

Publisher

Springer

Place published

Cham, Germany

Start date

2014-07-14

End date

2014-07-16

Language

English

Copyright

© Springer International Publishing Switzerland 2014

Former Identifier

2006047475

Esploro creation date

2020-06-22

Fedora creation date

2015-01-15

Open access

  • Yes

Usage metrics

    Scholarly Works

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC