RMIT University
Browse

Automatic detection of patients with invasive fungal disease from free-text computed tomography (CT) scans

journal contribution
posted on 2024-11-01, 16:42 authored by David Martinez, Michelle Ananda-Rajah, Hanna Suominen, Monica Slavin, Karin Thursky, Lawrence CavedonLawrence Cavedon
Background: Invasive fungal diseases (IFDs) are associated with considerable health and economic costs. Surveillance of the more diagnostically challenging invasive fungal diseases, specifically of the sino-pulmonary system, is not feasible for many hospitals because case finding is a costly and labour intensive exercise. We developed text classifiers for detecting such IFDs from free-text radiology (CT) reports, using machine-learning techniques. Method: We obtained free-text reports of CT scans performed over a specific hospitalisation period (2003-2011), for 264 IFD and 289 control patients from three tertiary hospitals. We analysed IFD evidence at patient, report, and sentence levels. Three infectious disease experts annotated the reports of 73 IFD-positive patients for language suggestive of IFD at sentence level, and graded the sentences as to whether they suggested or excluded the presence of IFD. Reliable agreement between annotators was obtained and this was used as training data for our classifiers. We tested a variety of Machine Learning (ML), rule based, and hybrid systems, with feature types including bags of words, bags of phrases, and bags of concepts, as well as report-level structured features. Evaluation was carried out over a robust framework with separate Development and Held-Out datasets. Results: The best systems (using Support Vector Machines) achieved very high recall at report- and patient-levels over unseen data: 95% and 100% respectively. Precision at report-level over held-out data was 71%; however, most of the associated false-positive reports (53%) belonged to patients who had a previous positive report appropriately flagged by the classifier, reducing negative impact in practice. Conclusions: Our machine learning application holds the potential for developing systematic IFD surveillance systems for hospital populations.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1016/j.jbi.2014.11.009
  2. 2.
    ISSN - Is published in 15320464

Journal

Journal of Biomedical Informatics

Volume

53

Issue

1

Start page

251

End page

260

Total pages

10

Publisher

Academic Press

Place published

United States

Language

English

Copyright

© 2014 Elsevier Inc. All rights reserved.

Former Identifier

2006049730

Esploro creation date

2020-06-22

Fedora creation date

2015-01-21

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC