RMIT University

Challenges in information extraction from tables in biomedical research publications: A dataset analysis

conference contribution
posted on 2024-10-31, 18:18 authored by Tatyana Shmanina, Lawrence Cavedon, Ingrid Zukerman
We present a study of a dataset of tables from biomedical research publications. Our aim is to identify characteristics of biomedical tables that pose challenges for the task of extracting information from tables, and to determine which parts of research papers typically contain information that is useful for this task. Our results indicate that biomedical tables are hard to interpret without their source papers due to the brevity of the entries in the tables. In many cases, unstructured text segments, such as table titles, footnotes and non-table prose discussing a table, are required to interpret the table's entries.

Start page

118

End page

122

Total pages

5

Outlet

Proceedings of the 12th Australasian Language Technology Association Workshop (ALTA 2014)

Editors

Gabriela Ferraro, Stephen Wan

Name of conference

ALTA 2014: The Twelfth Annual Workshop of the Australasian Language Technology Association

Publisher

NICTA (National ICT Australia)

Place published

Australia

Start date

2014-11-26

End date

2014-11-28

Language

English

Copyright

© NICTA 2014

Former Identifier

2006049729

Esploro creation date

2020-06-22

Fedora creation date

2015-01-21
