The User-Pleasant Video Skimming by Multi-Modal Keywords Semantics

Yiqing Shen

doi:10.20944/preprints201812.0086.v1

Submitted:

05 December 2018

Posted:

06 December 2018

Read the latest preprint version here

Abstract

In this paper, we propose a novel approach of video skimming by exploiting the fusion of video temporal information and keyword information representation extracted from multi-model video information including audio, text and visual indices. In addition, we introduce the brand-safe filtering and sentiment analysis in order to only reserve the user-friendly content in the video skim. In the experiment by using the videos from YouTube-8M dataset, we have proved that the semantic conservation in the video skim from the proposed approach highly outperforms the approaches by only partial information of the video in conserving the semantic content of the video.

Keywords:

Multi-model information fusion

;

Video skimming

;

Audio and text classification

;

keyframe extraction

Subject:

Computer Science and Mathematics - Computer Science

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

The User-Pleasant Video Skimming by Multi-Modal Keywords Semantics

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe