RMIT University
Browse

A compressed-domain robust descriptor for near duplicate video copy detection

conference contribution
posted on 2024-10-31, 18:31 authored by Amir Hossein Rouhi, James Thom
This paper introduces a global descriptor from the compressed video domain (H.264) for near duplicate video copy detection tasks. The proposed descriptor uses a spatial-temporal feature structure in an ordinal pattern distribution format. The proposed descriptor is constructed from Intra-Prediction Modes (IPM) of key frames (IDR & I slices) and extracted from the compressed video files, using the MPEG4/AVC (H.264) codec. Intra-prediction is the compression technique used in the key frames of the H.264 codec. As the proposed feature describes pictures globally, this research compares the feature with the two other well-known global image descriptors, ordinal intensity/colour Histograms and ordinal Auto-correlograms, as baselines. Our experiments show how the proposed feature outperforms the baseline features in non-geometric transformations T3, T4 and T5 in effectiveness as well as efficiency. It is due to better representation of the image content and smaller feature vector size. The core competency of the proposed feature is in non-linear brightness and contrast changes (Gamma expansion and compression) in which the intensity/colour Histograms and Auto-correlograms are deficient.

History

Related Materials

  1. 1.
    DOI - Is published in 10.1145/2683405.2683417
  2. 2.
    ISBN - Is published in 9781450331845 (urn:isbn:9781450331845)

Start page

130

End page

135

Total pages

6

Outlet

Proceedings of the 29th International Conference on Image and Vision Computing New Zealand

Editors

M. J. Cree

Name of conference

IVCNZ 2014

Publisher

Association for Computing Machinery

Place published

New York, United States

Start date

2014-11-19

End date

2014-11-21

Language

English

Copyright

© ACM 2014

Former Identifier

2006050333

Esploro creation date

2020-06-22

Fedora creation date

2015-02-04