RMIT University
Browse

Soft Target-enhanced Matching Framework for Deep Entity Resolution

conference contribution
posted on 2024-11-03, 15:44 authored by Wenzhou Dou, Derong Shen, Xiangmin ZhouXiangmin Zhou, Tiezheng Nie, Yue Kou, Hang Cui, Ge Yu
Deep Entity Matching (EM) is one of the core research topics in data integration. Typical existing works construct EM models by training deep neural networks (DNNs) based on the training samples with onehot labels. However, these sharp supervision signals of onehot labels harm the generalization of EM models, causing them to overfit the training samples and perform badly in unseen datasets. To solve this problem, we first propose that the challenge of training a well-generalized EM model lies in achieving the compromise between fitting the training samples and imposing regularization, ie, the bias-variance tradeoff. Then, we propose a novel Soft Target-EnhAnced Matching (STEAM) framework, which exploits the automatically generated soft targets as label-wise regularizers to constrain the model training. Specifically, STEAM regards the EM model trained in previous iteration as a virtual teacher and takes its softened output as the extra regularizer to train the EM model in the current iteration. As such, STEAM effectively calibrates the obtained EM model, achieving the bias-variance tradeoff without any additional computational cost. We conduct extensive experiments over open datasets and the results show that our proposed STEAM outperforms the state-of-the-art EM approaches in terms of effectiveness and label efficiency.

Funding

Effective and Efficient Situation Awareness in Big Social Media Data

Australian Research Council

Find out more...

History

Related Materials

  1. 1.
    DOI - Is published in 10.1609/aaai.v37i4.25544
  2. 2.
    ISBN - Is published in 9781577358800 (urn:isbn:9781577358800)

Start page

4259

End page

4266

Total pages

8

Outlet

Proceedings of the AAAI Conference on Artificial Intelligence 2023

Name of conference

AAAI 2023

Publisher

Association for the Advancement of Artificial Intelligence

Place published

United States

Start date

2023-02-07

End date

2023-02-14

Language

English

Copyright

Copyright © 2023, Association for the Advancement of Artifcial Intelligence (www.aaai.org). All rights reserved.

Former Identifier

2006126436

Esploro creation date

2023-11-17

Usage metrics

    Scholarly Works

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC