RMIT University
Browse

Search of spoken documents retrieves well recognized transcripts

conference contribution
posted on 2024-10-31, 10:03 authored by Mark SandersonMark Sanderson, Xiao Shou
This paper presents a series of analyses and experiments on spoken document retrieval systems: search engines that retrieve transcripts produced by speech recognizers. Results show that transcripts that match queries well tend to be recognized more accurately than transcripts that match a query less well. This result was described in past literature, however, no study or explanation of the effect has been provided until now. This paper provides such an analysis showing a relationship between word error rate and query length. The paper expands on past research by increasing the number of recognitions systems that are tested as well as showing the effect in an operational speech retrieval system. Potential future lines of enquiry are also described.

History

Start page

505

End page

516

Total pages

12

Outlet

29th European Conference on IR Research, ECIR 2007

Editors

Giambattista Amati, Claudio Carpineto and Giovanni Romano

Name of conference

Advances in Information Retrieval

Publisher

Springer

Place published

Berlin, Germany

Start date

2007-04-02

End date

2007-04-05

Language

English

Copyright

© Springer-Verlag Berlin Heidelberg 2007

Former Identifier

2006021724

Esploro creation date

2020-06-22

Fedora creation date

2013-02-19

Usage metrics

    Scholarly Works

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC