RMIT University
Browse

Bi-level semantic representation analysis for multimedia event detection

journal contribution
posted on 2024-11-02, 17:46 authored by Xiaojun ChangXiaojun Chang, Zhigang Ma, Yi Yang, Zhiqiang Zeng, Alexander Hauptmann
Multimedia event detection has been one of the major endeavors in video event analysis. A variety of approaches have been proposed recently to tackle this problem. Among others, using semantic representation has been accredited for its promising performance and desirable ability for human-understandable reasoning. To generate semantic representation, we usually utilize several external image/video archives and apply the concept detectors trained on them to the event videos. Due to the intrinsic difference of these archives, the resulted representation is presumable to have different predicting capabilities for a certain event. Notwithstanding, not much work is available for assessing the efficacy of semantic representation from the source-level. On the other hand, it is plausible to perceive that some concepts are noisy for detecting a specific event. Motivated by these two shortcomings, we propose a bi-level semantic representation analyzing method. Regarding source-level, our method learns weights of semantic representation attained from different multimedia archives. Meanwhile, it restrains the negative influence of noisy or irrelevant concepts in the overall concept-level. In addition, we particularly focus on efficient multimedia event detection with few positive examples, which is highly appreciated in the real-world scenario. We perform extensive experiments on the challenging TRECVID MED 2013 and 2014 datasets with encouraging results that validate the efficacy of our proposed approach.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1109/TCYB.2016.2539546
  2. 2.
    ISSN - Is published in 21682267

Journal

IEEE Transactions on Cybernetics

Volume

47

Number

7442559

Issue

5

Start page

1180

End page

1197

Total pages

18

Publisher

IEEE

Place published

United States

Language

English

Copyright

© 2016 IEEE.

Former Identifier

2006109433

Esploro creation date

2021-08-29

Usage metrics

    Scholarly Works

    Categories

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC