RMIT University
Browse

CDS: Collaborative distant supervision for Twitter account classification

journal contribution
posted on 2024-11-02, 04:06 authored by Lishan Cui, Xiuzhen ZhangXiuzhen Zhang, A. K. Qin, Timoleon Sellis, Lifang Wu
Individuals use Twitter for personal communication, whereas businesses, politicians and celebrities use Twitter for branding purposes. Distinguishing Personal from Branding Twitter accounts is important for Twitter analytics. Existing studies of Twitter account classification apply classical supervised learning, which requires intensive manual annotation for training. In this paper, we propose CDS (Collaborative Distant Supervision), a novel learning scheme for Twitter account classification that does not require intensive manual labelling. Twitter accounts are automatically labelled using heuristics for distant supervision learning. To achieve effective learning from heuristic labels, active learning is applied to identify and correct false positive labels, and semi-supervised learning is applied to further use false negatives missed by labelling heuristics for learning. Extensive experiments on Twitter data showed that CDS achieved high classification accuracy.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1016/j.eswa.2017.03.075
  2. 2.
    ISSN - Is published in 09574174

Journal

Expert Systems with Applications

Volume

83

Start page

94

End page

103

Total pages

10

Publisher

Elsevier

Place published

United Kingdom

Language

English

Copyright

© 2017 Elsevier Ltd

Former Identifier

2006076492

Esploro creation date

2020-06-22

Fedora creation date

2017-09-05

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC