AI -POWERED WEB APPLICATION FOR AUTOMATED SHORT VIDEO GENERATION
: In the rapidly evolving digital landscape, video content has become a dominant medium for communication, education, entertainment, and marketing due to its high engagement and effectiveness. However, traditional video production is a complex, time-consuming, and resource-intensive process that requires significant technical expertise. To address these challenges, this project presents the design and development of an AI-powered one-minute text-to-video generation web application that automates the creation of short, high-quality videos from user-provided textual inputs. The proposed system integrates advanced Artificial Intelligence techniques, including Natural Language Processing (NLP), Text-to-Speech (TTS), and diffusion-based generative models, to convert textual descriptions into dynamic video content. The input text is processed into semantic representations and structured into meaningful scenes, ensuring logical flow and coherence. For each scene, relevant visual content is generated using AI-based image and video synthesis, while voice narration is produced through TTS systems. Additional multimedia elements such as subtitles, transitions, and background music are incorporated to enhance the quality and effectiveness of the generated video. The system is built upon a pretrained video diffusion pipeline that iteratively refines latent representations to produce temporally consistent video frames. These frames are then encoded into standard video formats using multimedia processing techniques. The application is developed with a user-friendly web interface and a robust backend powered by modern deep learning frameworks.
To ensure efficient performance, optimization techniques such as mixed-precision computation, memory-efficient processing, and GPU acceleration are employed. These enhancements enable faster inference and improved scalability, making the system suitable for deployment in GPU-enabled environments. The proposed solution significantly reduces manual effort, production time, and cost, thereby making video creation accessible to students, educators, content creators, and businesses. This project demonstrates the practical application of generative artificial intelligence in multimedia content creation and highlights the transition from static image synthesis to automated video generation. Despite its advantages, challenges such as computational complexity , hardware
Kumar, R. N. (2026). AI -Powered Web Application for Automated Short Video Generation. International Journal of Science, Strategic Management and Technology, 02(05). https://doi.org/10.55041/ijsmt.v2i4.618
Kumar, R.. "AI -Powered Web Application for Automated Short Video Generation." International Journal of Science, Strategic Management and Technology, vol. 02, no. 05, 2026, pp. . doi:https://doi.org/10.55041/ijsmt.v2i4.618.
Kumar, R.. "AI -Powered Web Application for Automated Short Video Generation." International Journal of Science, Strategic Management and Technology 02, no. 05 (2026). https://doi.org/https://doi.org/10.55041/ijsmt.v2i4.618.
2.Hugging Face, “Diffusers Library Documentation,” 2024. [Online]. Available: https://huggingface.co/docs/diffusers
3.Cerspense, “ZeroScope V2 Model,” Hugging Face, 2024.
4.Vaswani et al., “Attention Is All You Need,” Proc. NIPS, 2017.
5.PyTorch, “PyTorch Documentation,” 2024. [Online]. Available: https://pytorch.org
6.Gradio, “Gradio: Build Machine Learning Web Apps,” 2024. [Online]. Available: https://gradio.app
7.OpenAI, “ChatGPT: AI Language Model,” 2025.
8.Goodfellow et al., “Deep Learning,” MIT Press, 2016.
9.Arunachalam, A. S., and K. Rajeswari. "An Inclusive Survey of Student Performance With Various Data Mining Methods." International Journal of Engineering and Technology (IJET) vol7 (2018): 522-525.