
WikiUMLS: Aligning UMLS to Wikipedia via Cross-lingual Neural Ranking

conference contribution
posted on 2024-11-03, 14:37 authored by Afshin Rahimi, Timothy Baldwin, Cornelia Verspoor
We present our work on aligning the Unified Medical Language System (UMLS) to Wikipedia, to facilitate manual alignment of the two resources. We propose a cross-lingual neural reranking model to match a UMLS concept with a Wikipedia page, which achieves a recall@1 of 72%, a substantial improvement of 20% over word- and char-level BM25, enabling manual alignment with minimal effort. We release our resources, including ranked Wikipedia pages for 700k UMLS concepts, and WikiUMLS, a dataset for training and evaluation of alignment models between UMLS and Wikipedia collected from Wikidata. This will provide easier access to Wikipedia for health professionals, patients, and NLP systems, including in multilingual settings.
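As an illustration of the recall@1 figure quoted in the abstract, the following is a minimal sketch of how recall@k could be computed for a concept-to-page ranking task. It is not the authors' evaluation code: the function name `recall_at_k`, the concept identifiers, and the page titles are all hypothetical, and it assumes exactly one gold Wikipedia page per concept.

```python
from typing import Dict, List


def recall_at_k(ranked: Dict[str, List[str]], gold: Dict[str, str], k: int = 1) -> float:
    """Fraction of concepts whose gold page appears among the top-k ranked candidates.

    Assumption: each concept has exactly one gold page. Concepts with no
    ranked candidates count as misses.
    """
    if not gold:
        return 0.0
    hits = sum(
        1
        for concept_id, gold_page in gold.items()
        if gold_page in ranked.get(concept_id, [])[:k]
    )
    return hits / len(gold)


# Toy data: identifiers and page titles are made up for illustration only.
ranked = {
    "C0000001": ["Page A", "Page B"],
    "C0000002": ["Page C", "Page D"],
    "C0000003": ["Page E", "Page F"],
    "C0000004": ["Page G", "Page H"],
}
gold = {
    "C0000001": "Page A",  # gold page ranked first
    "C0000002": "Page D",  # gold page ranked second
    "C0000003": "Page E",  # gold page ranked first
    "C0000004": "Page X",  # gold page not retrieved at all
}

print(recall_at_k(ranked, gold, k=1))
print(recall_at_k(ranked, gold, k=2))
```

Under this definition, a recall@1 of 72% would mean that for 72% of evaluated concepts the top-ranked candidate is the correct page, which is why a high value reduces the effort of manual alignment.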

Related Materials

  1. DOI - Is published in 10.18653/v1/2020.coling-main.523
  2. ISBN - Is published in 9781952148279 (urn:isbn:9781952148279)

Start page

5957

End page

5962

Total pages

6

Outlet

Proceedings of the 28th International Conference on Computational Linguistics (2020)

Name of conference

2020 International Conference on Computational Linguistics (COLING)

Publisher

International Committee on Computational Linguistics

Place published

United States of America

Start date

2020-12-08

End date

2020-12-13

Language

English

Copyright

Creative Commons Attribution 4.0 International Licence

Former Identifier

2006114787

Esploro creation date

2022-11-26
