Submitted:
02 September 2024
Posted:
03 September 2024
You are already at the latest version
Abstract
Keywords:
I. Introduction
II. Advancement of Artificial Intelligence
A. Selecting a Template (Heading 2)
III. Generative AI in Video Creation
IV. Types of Generative Artificial Intelligence Ai-Models
A. Generative Adversarial Networks (GANS)
B. Variation Autoencoders
C. Diffusion Models in Artificial Intelligence
D. Transformers
E. Autoregressive Models in Artificial Intelligence
V. Video Generation
VI. Video Cloning
VII. Inputs to Video Generation
A. Text to Video Models
B. Text to Text Models
VIII. Video Editing Software
A. Animation and Motion Graphics
IX. Tools for Video Generation
A. Text to Video Tools
B. Audio to Video Tools
C. Image to Video Tools
X. Challenges Associated with AI Video Generation
XI. Conclusion
References
- A. Bozkurt, “Generative artificial intelligence (AI) powered conversational educational agents: The inevitable paradigm shift,” Asian Journal of Distance Education, vol. 18, no. 1, Art. no. 1, Mar. 2023, Accessed: Aug. 18, 2024. [Online]. Available: https://www.asianjde.com/ojs/index.php/AsianJDE/article/view/718.
- S. Feuerriegel, J. Hartmann, C. Janiesch, and P. Zschech, “Generative AI,” Bus Inf Syst Eng, vol. 66, no. 1, pp. 111–126, Feb. 2024. [CrossRef]
- B. L. Moorhouse, M. A. Yeo, and Y. Wan, “Generative AI tools and assessment: Guidelines of the world’s top-ranking universities,” Computers and Education Open, vol. 5, p. 100151, Dec. 2023. [CrossRef]
- S. Bengesi, H. El-Sayed, M. K. Sarker, Y. Houkpati, J. Irungu, and T. Oladunni, “Advancements in Generative AI: A Comprehensive Review of GANs, GPT, Autoencoders, Diffusion Model, and Transformers,” IEEE Access, vol. 12, pp. 69812–69837, 2024. [CrossRef]
- C. Longoni, A. Fradkin, L. Cian, and G. Pennycook, “News from Generative Artificial Intelligence Is Believed Less,” in Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency, in FAccT ’22. New York, NY, USA: Association for Computing Machinery, Jun. 2022, pp. 97–106. [CrossRef]
- N. Kshetri, Y. K. Dwivedi, T. H. Davenport, and N. Panteli, “Generative artificial intelligence in marketing: Applications, opportunities, challenges, and research agenda,” International Journal of Information Management, vol. 75, p. 102716, Apr. 2024. [CrossRef]
- S. K. J. Rizvi, M. A. Azad, and M. M. Fraz, “Spectrum of Advancements and Developments in Multidisciplinary Domains for Generative Adversarial Networks (GANs),” Arch Computat Methods Eng, vol. 28, no. 7, pp. 4503–4521, Dec. 2021. [CrossRef]
- S. Feuerriegel, J. Hartmann, C. Janiesch, and P. Zschech, “Generative AI,” Bus Inf Syst Eng, vol. 66, no. 1, pp. 111–126, Feb. 2024. [CrossRef]
- S. Karthika. and M. Durgadevi, “Generative Adversarial Network (GAN): a general review on different variants of GAN and applications,” in 2021 6th International Conference on Communication and Electronics Systems (ICCES), Jul. 2021, pp. 1–8. [CrossRef]
- Z. Epstein et al., “Art and the science of generative AI: A deeper dive,” Science, vol. 380, no. 6650, pp. 1110–1111, Jun. 2023. [CrossRef]
- L. Jiang, B. Dai, W. Wu, and C. C. Loy, “Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data,” in Advances in Neural Information Processing Systems, Curran Associates, Inc., 2021, pp. 21655–21667. Accessed: Aug. 18, 2024. [Online]. Available: https://proceedings.neurips.cc/paper/2021/hash/b534ba68236ba543ae44b22bd110a1d6-Abstract.html.
- A. S. Kumar, L. Tesfaye Jule, K. Ramaswamy, S. Sountharrajan, N. Yuuvaraj, and A. H. Gandomi, “Chapter 12 - Analysis of false data detection rate in generative adversarial networks using recurrent neural network,” in Generative Adversarial Networks for Image-to-Image Translation, A. Solanki, A. Nayyar, and M. Naved, Eds., Academic Press, 2021, pp. 289–312. [CrossRef]
- Y. Zhao and S. Linderman, “Revisiting Structured Variational Autoencoders,” in Proceedings of the 40th International Conference on Machine Learning, PMLR, Jul. 2023, pp. 42046–42057. Accessed: Aug. 18, 2024. [Online]. Available: https://proceedings.mlr.press/v202/zhao23c.html.
- Misino, G. Marra, and E. Sansone, “VAEL: Bridging Variational Autoencoders and Probabilistic Logic Programming,” Advances in Neural Information Processing Systems, vol. 35, pp. 4667–4679, Dec. 2022.
- Y. Liu et al., “Cloud-VAE: Variational autoencoder with concepts embedded,” Pattern Recognition, vol. 140, p. 109530, Aug. 2023. [CrossRef]
- C. Zhang, C. Zhang, M. Zhang, and I. S. Kweon, “Text-to-image Diffusion Models in Generative AI: A Survey,” Apr. 02, 2023, arXiv: arXiv:2303.07909. [CrossRef]
- H. Cao et al., “A Survey on Generative Diffusion Models,” IEEE Transactions on Knowledge and Data Engineering, vol. 36, no. 7, pp. 2814–2830, Jul. 2024. [CrossRef]
- D. Rothman, Transformers for Natural Language Processing: Build, train, and fine-tune deep neural network architectures for NLP with Python, Hugging Face, and OpenAI’s GPT-3, ChatGPT, and GPT-4. Packt Publishing Ltd, 2022.
- D. Soydaner, “Attention mechanism in neural networks: where it comes and where it goes,” Neural Comput & Applic, vol. 34, no. 16, pp. 13371–13385, Aug. 2022. [CrossRef]
- S. Khan, M. Naseer, M. Hayat, S. W. Zamir, F. S. Khan, and M. Shah, “Transformers in Vision: A Survey,” ACM Comput. Surv., vol. 54, no. 10s, p. 200:1-200:41, Sep. 2022. [CrossRef]
- D. Djeudeu, S. Moebus, and K. Ickstadt, “Multilevel Conditional Autoregressive models for longitudinal and spatially referenced epidemiological data,” Spatial and Spatio-temporal Epidemiology, vol. 41, p. 100477, Jun. 2022. [CrossRef]
- A. Zeng, M. Chen, L. Zhang, and Q. Xu, “Are Transformers Effective for Time Series Forecasting?,” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 37, no. 9, Art. no. 9, Jun. 2023. [CrossRef]
- S. Yazdani, N. Saxena, Z. Wang, Y. Wu, and W. Zhang, A Comprehensive Survey of Image and Video Generative AI: Recent Advances, Variants, and Applications. 2024. [CrossRef]
- D. Grba, “Deep Else: A Critical Framework for AI Art,” Digital, vol. 2, no. 1, Art. no. 1, Mar. 2022. [CrossRef]
- D. Leiker, A. R. Gyllen, I. Eldesouky, and M. Cukurova, “Generative AI for learning: Investigating the potential of synthetic learning videos,” May 03, 2023, arXiv: arXiv:2304.03784. [CrossRef]
- M. Patel, A. Gupta, S. Tanwar, and M. S. Obaidat, “Trans-DF: A Transfer Learning-based end-to-end Deepfake Detector,” in 2020 IEEE 5th International Conference on Computing Communication and Automation (ICCCA), Oct. 2020, pp. 796–801. [CrossRef]
- Y. Lu and T. Ebrahimi, “Impact of Video Processing Operations in Deepfake Detection,” Mar. 30, 2023, arXiv: arXiv:2303.17247. [CrossRef]
- T.-N. Le, H. H. Nguyen, J. Yamagishi, and I. Echizen, “Robust Deepfake On Unrestricted Media: Generation And Detection,” Feb. 13, 2022, arXiv: arXiv:2202.06228. [CrossRef]
- M. J. Israel and A. Amer, “Rethinking data infrastructure and its ethical implications in the face of automated digital content generation,” AI Ethics, vol. 3, no. 2, pp. 427–439, May 2023. [CrossRef]
- S. Lyu, “DeepFake Detection: Current Challenges and Next Steps,” Mar. 11, 2020, arXiv: arXiv:2003.09234. [CrossRef]
- T.-N. Le, H. H. Nguyen, J. Yamagishi, and I. Echizen, “Robust Deepfake On Unrestricted Media: Generation And Detection,” Feb. 13, 2022, arXiv: arXiv:2202.06228. [CrossRef]
- J. Akers et al., “Technology-Enabled Disinformation: Summary, Lessons, and Recommendations,” Jan. 03, 2019, arXiv: arXiv:1812.09383. [CrossRef]
- J. Akers et al., “Technology-Enabled Disinformation: Summary, Lessons, and Recommendations,” Jan. 03, 2019, arXiv: arXiv:1812.09383. [CrossRef]
- T. Kirchengast, “Deepfakes and image manipulation: criminalisation and control,” Information & Communications Technology Law, vol. 29, no. 3, pp. 308–323, Sep. 2020. [CrossRef]
- A. K. Tiwari, A. Sharma, P. Rayakar, M. K. Bhavriya, and Nisha, “AI-Generated Video Forgery Detection and Authentication,” in 2024 IEEE 9th International Conference for Convergence in Technology (I2CT), Apr. 2024, pp. 1–8. [CrossRef]
- M. Masood, M. Nawaz, K. M. Malik, A. Javed, A. Irtaza, and H. Malik, “Deepfakes generation and detection: state-of-the-art, open challenges, countermeasures, and way forward,” Appl Intell, vol. 53, no. 4, pp. 3974–4026, Feb. 2023. [CrossRef]
- A. Swenson, “Teaching digital identity: opportunities, challenges, and ethical considerations for avatar creation in educational settings,” Brazilian Creative Industries Journal, vol. 3, no. 2, Art. no. 2, Dec. 2023.
- T.-N. Le, H. H. Nguyen, J. Yamagishi, and I. Echizen, “Robust Deepfake On Unrestricted Media: Generation And Detection,” Feb. 13, 2022, arXiv: arXiv:2202.06228. [CrossRef]
- R. Gozalo-Brizuela and E. C. Garrido-Merchán, “A survey of Generative AI Applications,” Jun. 14, 2023, arXiv: arXiv:2306.02781. [CrossRef]
- U. Singer et al., “Make-A-Video: Text-to-Video Generation without Text-Video Data,” Sep. 29, 2022, arXiv: arXiv:2209.14792. [CrossRef]
- J. Wang et al., “AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production,” Mar. 11, 2024, arXiv: arXiv:2403.07952. [CrossRef]
- R. Gozalo-Brizuela and E. C. Garrido-Merchán, “A survey of Generative AI Applications,” Jun. 14, 2023, arXiv: arXiv:2306.02781. [CrossRef]
- R. Bhagwatkar, S. Bachu, K. Fitter, A. Kulkarni, and S. Chiddarwar, “A Review of Video Generation Approaches,” in 2020 International Conference on Power, Instrumentation, Control and Computing (PICC), Dec. 2020, pp. 1–5. [CrossRef]
- M. Kale and A. Rastogi, “Text-to-Text Pre-Training for Data-to-Text Tasks,” arXiv.org. Accessed: Aug. 19, 2024. [Online]. Available: https://arxiv.org/abs/2005.10433v3.
- R. Gozalo-Brizuela and E. C. Garrido-Merchán, “A survey of Generative AI Applications,” Jun. 14, 2023, arXiv: arXiv:2306.02781. [CrossRef]
- J. Yang, “Assessment of the strength and weakness of production design platforms in arts and entertainment management,” May 2023, Accessed: Aug. 20, 2024. [Online]. Available: https://hdl.handle.net/2346/96017.
- O. Karras and K. Schneider, “Software Professionals are Not Directors: What Constitutes a Good Video?,” Aug. 15, 2018, arXiv: arXiv:1808.04986. [CrossRef]
- M. G. Jones and L. Harris, “Audio and Video Production for Instructional Design Professionals,” 2021.
- J. Yang, “Assessment of the strength and weakness of production design platforms in arts and entertainment management,” May 2023, Accessed: Aug. 20, 2024. [Online]. Available: https://hdl.handle.net/2346/96017.
- D. Wei, “Construction of a Digital Color Grading Laboratory Based on DaVinci Resolve,” FSST, vol. 5, no. 14, 2023. [CrossRef]
- L. Fridsma and B. Gyncild, Adobe After Effects Classroom in a Book (2021 release). Adobe Press, 2020.
- D. A. Hussain, “The Effective Motion Graphics Production,” vol. 7, no. 8, 2022.
- V. Maselli and L. D. Cecca, “Collaborative production model and the animation industry: The role of the Blender community in the making of the Italian short film Arturo e il gabbiano,” Animation Practice, Process & Production, vol. 11, no. 1, pp. 7–29, Jun. 2022. [CrossRef]
- O. Villar, Learning Blender. Addison-Wesley Professional, 2021.
- B. Hasirci and D. Hasirci, “FOSTERING CREATIVITY IN EDUCATION WITH THE DESIGN OF ANIMATED SHOWS FOR CHILDREN,” EDULEARN22 Proceedings, pp. 4970–4974, 2022. [CrossRef]
- J. Chambless, “2D Animation of the 21st Century: The Digital Age,” Electronic Theses and Dissertations, 2020-2023, Jan. 2022, [Online]. Available: https://stars.library.ucf.edu/etd2020/986.
- E. Navarrete, A. Nehring, S. Schanze, R. Ewerth, and A. Hoppe, “A Closer Look into Recent Video-based Learning Research: A Comprehensive Review of Video Characteristics, Tools, Technologies, and Learning Effectiveness,” Aug. 11, 2023, arXiv: arXiv:2301.13617. [CrossRef]
- T. W. Tan, “Mastering Lumen Global Illumination in Unreal Engine 5,” in Game Development with Unreal Engine 5 Volume 1: Design Phase, T. W. Tan, Ed., Berkeley, CA: Apress, 2024, pp. 223–275. [CrossRef]
- T. Volarić, Z. Tomić, and H. Ljubić, “Artificial Intelligence Tools for Public Relations Practitioners: An Overview,” in 2024 IEEE 28th International Conference on Intelligent Engineering Systems (INES), Jul. 2024, pp. 000031–000036. [CrossRef]
- A. Sufian, “AI-Generated Videos and Deepfakes: A Technical Primer,” Aug. 12, 2024. [CrossRef]
- “The Headliner Blog,” The Headliner Blog. Accessed: Aug. 26, 2024. [Online]. Available: https://www.headliner.app/blog/.
- J. H. Park, “The Growth of OTT Platforms’ Investments in Korean Content and Opportunities for Global Expansion,” Dec. 31, 2023, Rochester, NY: 4677552. [CrossRef]
- “Wavve Blog,” Wavve. Accessed: Aug. 26, 2024. [Online]. Available: https://wavve.co/blog/.
- Y. Wu, X. Shen, T. Mei, X. Tian, N. Yu, and Y. Rui, “Monet: A System for Reliving Your Memories by Theme-Based Photo Storytelling,” IEEE Transactions on Multimedia, vol. 18, no. 11, pp. 2206–2216, Nov. 2016. [CrossRef]
- W. Kung, “Using the PESTEL Analysis to Determine the Effectiveness of New Digital Media Strategies,” Advances in Economics, Management and Political Sciences, vol. 5, pp. 19–25, Apr. 2023. [CrossRef]
- J. Amankwah-Amoah, S. Abdalla, E. Mogaji, A. Elbanna, and Y. K. Dwivedi, “The impending disruption of creative industries by generative AI: Opportunities, challenges, and research agenda,” International Journal of Information Management, vol. 79, p. 102759, Dec. 2024. [CrossRef]
- A. Bandi, P. V. S. R. Adapa, and Y. E. V. P. K. Kuchi, “The Power of Generative AI: A Review of Requirements, Models, Input–Output Formats, Evaluation Metrics, and Challenges,” Future Internet, vol. 15, no. 8, Art. no. 8, Aug. 2023. [CrossRef]
- T. C. Helmus, “Artificial Intelligence, Deepfakes, and Disinformation: A Primer,” RAND Corporation, 2022. Accessed: Aug. 19, 2024. [Online]. Available: https://www.jstor.org/stable/resrep42027.
- R. Abbott and E. Rothman, “Disrupting Creativity: Copyright Law in the Age of Generative Artificial Intelligence,” Fla. L. Rev., vol. 75, p. 1141, 2023.
- J. K. P. Seng, K. L. Ang, E. Peter, and A. Mmonyi, “Artificial Intelligence (AI) and Machine Learning for Multimedia and Edge Information Processing,” Electronics, vol. 11, no. 14, Art. no. 14, Jan. 2022. [CrossRef]
- M. Zink, R. Sitaraman, and K. Nahrstedt, “Scalable 360 Video Stream Delivery: Challenges, Solutions, and Opportunities,” Proceedings of the IEEE, vol. 107, no. 4, pp. 639–650, Apr. 2019. [CrossRef]
- R. Nishant, D. Schneckenberg, and M. Ravishankar, “The formal rationality of artificial intelligence-based algorithms and the problem of bias,” Journal of Information Technology, vol. 39, no. 1, pp. 19–40, Mar. 2024. [CrossRef]
- F. Magni, J. Park, and M. M. Chao, “Humans as Creativity Gatekeepers: Are We Biased Against AI Creativity?,” J Bus Psychol, vol. 39, no. 3, pp. 643–656, Jun. 2024. [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).