RMIT University
Data sets for spoken conversational search

conference contribution
posted on 2024-11-03, 14:02 authored by Johanne Trippas, Paul Thomas
There is increasing interest in spoken conversational search—multi-turn interactions with a search engine, spoken in natural language—but until recently there was little public data to support research. We describe our experiences building two data sets for spoken conversational search: the Microsoft Information-Seeking Conversation set (“MISC”) and the Spoken Conversational Search set (“SCSdata”). Each data set contains recordings of spoken interactions between two people collaborating on web search tasks, but relatively small differences in protocol have led to observably different data. We discuss some consequences of these differences, and describe attempts to reproduce analyses from one set to the other.

History

Volume

2337

Start page

1

End page

5

Total pages

5

Outlet

Proceedings of the Workshop on Barriers to Interactive IR Resources Re-use at the ACM SIGIR Conference on Human Information Interaction and Retrieval (BIIRRR 2019)

Editors

Toine Bogers, Samuel Dodson, Maria Gäde, Luanne Freund, Mark M. Hall, Marijn Koolen, Vivien Petras, Nils Pharo, Mette Skov

Name of conference

BIIRRR 2019: Volume 2337

Publisher

Rheinisch-Westfaelische Technische Hochschule Aachen * Lehrstuhl Informatik V

Place published

Germany

Start date

2019-03-14

End date

2019-03-14

Language

English

Copyright

Copyright © 2019 for the individual papers by the papers' authors. Copyright © 2019 for the volume as a collection by its editors. This volume and its papers are published under the Creative Commons License Attribution 4.0 International (CC BY 4.0).

Former Identifier

2006106554

Esploro creation date

2022-10-22
