We present a study of a dataset of tables from biomedical research publications. Our aim is to identify characteristics of biomedical tables that pose challenges for the task of extracting information from tables, and to determine which parts of research papers typically contain information that is useful for this task. Our results indicate that biomedical tables are hard to interpret without their source papers due to the brevity of the entries in the tables. In many cases, unstructured text segments, such as table titles, footnotes and non-table prose discussing a table, are required to interpret the table's entries.
History
Start page
118
End page
122
Total pages
5
Outlet
Proceedings of the 12th Australasian Language Technology Association Workshop (ALTA 2014)
Editors
Gabriela Ferraro, Stephen Wan
Name of conference
ALTA 2014: The Twelfth Annual Workshop of the Australasia Language Technology Association