RMIT University
Browse

Use and validation of text mining and cluster algorithms to derive insights from Corona Virus Disease-2019 (COVID-19) medical literature

journal contribution
posted on 2024-11-02, 19:57 authored by Sandeep Reddy, Ravi Bhaskar, Sandosh Padmanabhan, Cornelia VerspoorCornelia Verspoor, Chaitanya Mamillapalli, Rani Lahoti, Ville-Petteri Makinen, Smitan Pradhan, Puru Kushwah, Saumya Sinha
The emergence of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) late last year has not only led to the world-wide coronavirus disease 2019 (COVID-19) pandemic but also a deluge of biomedical literature. Following the release of the COVID-19 open research dataset (CORD-19) comprising over 200,000 scholarly articles, we a multi-disciplinary team of data scientists, clinicians, medical researchers and software engineers developed an innovative natural language processing (NLP) platform that combines an advanced search engine with a biomedical named entity recognition extraction package. In particular, the platform was developed to extract information relating to clinical risk factors for COVID-19 by presenting the results in a cluster format to support knowledge discovery. Here we describe the principles behind the development, the model and the results we obtained.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1016/j.cmpbup.2021.100010
  2. 2.
    ISSN - Is published in 26669900

Journal

Computer Methods and Programs in Biomedicine Update

Volume

1

Number

100010

Start page

1

End page

6

Total pages

6

Publisher

Elsevier BV

Place published

Netherlands

Language

English

Copyright

© 2021 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Former Identifier

2006114654

Esploro creation date

2023-03-10

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC