Preprint Data Descriptor Version 2 Preserved in Portico This version is not peer-reviewed

Korean Audio-Visual Dataset of Characters in 3D Animation: Construction and Validation

Version 1 : Received: 8 October 2023 / Approved: 9 October 2023 / Online: 9 October 2023 (11:04:10 CEST)
Version 2 : Received: 1 November 2023 / Approved: 2 November 2023 / Online: 2 November 2023 (10:59:40 CET)

How to cite: Hyun, S.; Son, Y.; Park, J.W. Korean Audio-Visual Dataset of Characters in 3D Animation: Construction and Validation. Preprints 2023, 2023100514. https://doi.org/10.20944/preprints202310.0514.v2 Hyun, S.; Son, Y.; Park, J.W. Korean Audio-Visual Dataset of Characters in 3D Animation: Construction and Validation. Preprints 2023, 2023100514. https://doi.org/10.20944/preprints202310.0514.v2

Abstract

Characters are one of the most important elements in composing digital animation. The appear-ance and voice of a character should be designed to express the personality and values of the character. However, it is not easy for animation producers to harmoniously match the appear-ance and voice of a character. Advances in deep learning technology have made it possible to overcome this limitation. To achieve this, firstly, an audio-visual dataset of characters is required. In this study, we construct and verify a Korean audio-visual dataset consisting of frontal face im-ages of various characters and short voice clips. We developed an application that can automati-cally extract the frontal face image and a short voice clip of a character by collecting videos up-loaded to YouTube. Through this, a dataset consisting of a total of 1,522 face images and a total of 7,999 seconds of voice clips was built based on 490 characters. Furthermore, we automatically la-bel characters by gender and age to validate the dataset. The dataset built in this study is expected to be used in various deep learning fields, such as classification, generative adversarial networks, and speech synthesis.

Keywords

anime character; 3D animation; audio-visual dataset

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (1)

Comment 1
Received: 2 November 2023
Commenter: Jae Wan Park
Commenter's Conflict of Interests: Author
Comment: Images that may pose a copyright issue have been deleted. However, readers will have no problem reading this manuscript.
+ Respond to this comment

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 1
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.