Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

The User-Pleasant Video Skimming by Multi-Modal Keywords Semantics

Version 1 : Received: 5 December 2018 / Approved: 6 December 2018 / Online: 6 December 2018 (13:19:57 CET)
Version 4 : Received: 1 August 2019 / Approved: 5 August 2019 / Online: 5 August 2019 (03:48:49 CEST)

How to cite: Shen, Y. The User-Pleasant Video Skimming by Multi-Modal Keywords Semantics. Preprints 2018, 2018120086. https://doi.org/10.20944/preprints201812.0086.v1 Shen, Y. The User-Pleasant Video Skimming by Multi-Modal Keywords Semantics. Preprints 2018, 2018120086. https://doi.org/10.20944/preprints201812.0086.v1

Abstract

In this paper, we propose a novel approach of video skimming by exploiting the fusion of video temporal information and keyword information representation extracted from multi-model video information including audio, text and visual indices. In addition, we introduce the brand-safe filtering and sentiment analysis in order to only reserve the user-friendly content in the video skim. In the experiment by using the videos from YouTube-8M dataset, we have proved that the semantic conservation in the video skim from the proposed approach highly outperforms the approaches by only partial information of the video in conserving the semantic content of the video.

Keywords

Multi-model information fusion, Video skimming, Audio and text classification, keyframe extraction

Subject

Computer Science and Mathematics, Computer Science

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.