RMIT University
Browse

Content redundancy in YouTube and its application to video tagging

journal contribution
posted on 2024-11-01, 10:07 authored by Jose San Pedro, Stefan Siersdorfer, Mark SandersonMark Sanderson
The emergence of large-scale social Web communities has enabled users to share online vast amounts of multimedia content. An analysis of YouTube reveals a high amount of redundancy, in the form of videos with overlapping or duplicated content. We use robust content-based video analysis techniques to detect overlapping sequences between videos. Based on the output of these techniques, we present an in-depth study of duplication and content overlap in YouTube, and analyze various dependencies between content overlap and meta data such as video titles, views, video ratings, and tags. As an application, we show that content-based links provide useful information for generating new tag assignments. We propose different tag propagation methods for automatically obtaining richer video annotations. Experiments on video clustering and classi?cation as well as a user evaluation demonstrate the viability of our approach.

History

Journal

ACM Transactions on Information Systems (TOIS)

Volume

29

Number

13

Issue

3

Start page

1

End page

29

Total pages

29

Publisher

ACM Press

Place published

United States

Language

English

Copyright

© 2011 ACM

Former Identifier

2006031672

Esploro creation date

2020-06-22

Fedora creation date

2012-04-27

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC