RMIT University
Browse

Automatic labelling of topics via analysis of user summaries

conference contribution
posted on 2024-10-31, 20:06 authored by Lishan Cui, Xiuzhen ZhangXiuzhen Zhang, Amanda-Jane KimptonAmanda-Jane Kimpton, Daryl D'Souza
Topic models have been widely used to discover useful structures in large collections of documents. A challenge in applying topic models to any text analysis task is to meaningfully label the discovered topics so that users can interpret them. In existing studies, words and bigram phrases extracted internally from documents are used as candidate labels but are not always understandable to humans. In this paper, we propose a novel approach to extracting words and meaningful phrases from external user generated summaries as candidate labels and then rank them via the Kullback-Leibler semantic distance metric. We further apply our approach to analyse an Australian healthcare discussion forum. User study results show that our proposed approach produces meaningful labels for topics and outperforms state-of-the-art approaches to labelling topics.

History

Start page

295

End page

307

Total pages

13

Outlet

Proceedings of the 27th Australasian Database Conference (ADC 2016)

Editors

Muhammad Aamir Cheema, Wenjie Zhang, Lijun Chang

Name of conference

ADC 2016: Databases Theory and Applications

Publisher

Springer

Place published

Switzerland

Start date

2016-09-28

End date

2016-09-29

Language

English

Copyright

© Springer International Publishing AG 2016

Former Identifier

2006067087

Esploro creation date

2020-06-22

Fedora creation date

2016-10-18

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC