RMIT University

Entity extraction from the web with WebKnox

conference contribution
posted on 2024-10-31, 09:21 authored by David Urbansky, Marius Feldmann, James Thom, A Schill
This paper describes a system for entity extraction from the web. The system uses three different extraction techniques that are tightly coupled with mechanisms for retrieving entity-rich web pages. The main contributions of this paper are a new entity retrieval approach, a comparison of different extraction techniques, and a more precise entity extraction algorithm. The presented approach allows domain-independent information to be extracted from the web with only minimal human effort.

History

Start page

209

End page

218

Total pages

10

Outlet

Proceedings of the AWIC'09 6th Atlantic Web Conference

Editors

Fatos Xhafa; Hana Rezankova

Name of conference

AWIC 2009 & NWeSP 2009

Publisher

Springer

Place published

Heidelberg, Germany

Start date

2009-09-09

End date

2009-09-11

Language

English

Copyright

© Springer-Verlag Berlin Heidelberg 2010

Former Identifier

2006015361

Esploro creation date

2020-06-22

Fedora creation date

2011-11-08
