IJSMT Journal

International Journal of Science, Strategic Management and Technology

An International, Peer-Reviewed, Open Access Scholarly Journal Indexed in recognized academic databases · DOI via Crossref The journal adheres to established scholarly publishing, peer-review, and research ethics guidelines set by the UGC

ISSN: 3108-1762 (Online)
webp (1)

Plagiarism Passed
Peer reviewed
Open Access

AI -POWERED WEB APPLICATION FOR AUTOMATED SHORT VIDEO GENERATION

AUTHORS:
R. Naveen Kumar
Mentor
Dr. AS. Arunachalam
Affiliation
CC BY 4.0 License:
This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Abstract

:  In the rapidly evolving digital landscape, video content has become a dominant medium for communication, education, entertainment, and marketing due to its high engagement and effectiveness. However, traditional video production is a complex, time-consuming, and resource-intensive process that requires significant technical expertise. To address these challenges, this project presents the design and development of an AI-powered one-minute text-to-video generation web application that automates the creation of short, high-quality videos from user-provided textual inputs. The proposed system integrates advanced Artificial Intelligence techniques, including Natural Language Processing (NLP), Text-to-Speech (TTS), and diffusion-based generative models, to convert textual descriptions into dynamic video content. The input text is processed into semantic representations and structured into meaningful scenes, ensuring logical flow and coherence. For each scene, relevant visual content is generated using AI-based image and video synthesis, while voice narration is produced through TTS systems. Additional multimedia elements such as subtitles, transitions, and background music are incorporated to enhance the quality and effectiveness of the generated video. The system is built upon a pretrained video diffusion pipeline that iteratively refines latent representations to produce temporally consistent video frames. These frames are then encoded into standard video formats using multimedia processing techniques. The application is developed with a user-friendly web interface and a robust backend powered by modern deep learning frameworks.


To ensure efficient performance, optimization techniques such as mixed-precision computation, memory-efficient processing, and GPU acceleration are employed. These enhancements enable faster inference and improved scalability, making the system suitable for deployment in GPU-enabled environments. The proposed solution significantly reduces manual effort, production time, and cost, thereby making video creation accessible to students, educators, content creators, and businesses. This project demonstrates the practical application of generative artificial intelligence in multimedia content creation and highlights the transition from static image synthesis to automated video generation. Despite its advantages, challenges such as computational complexity ,  hardware


Keywords
Article Metrics
Article Views
57
PDF Downloads
1
HOW TO CITE
APA

MLA

Chicago

Copy

Kumar, R. N. (2026). AI -Powered Web Application for Automated Short Video Generation. International Journal of Science, Strategic Management and Technology, 02(05). https://doi.org/10.55041/ijsmt.v2i4.618

Kumar, R.. "AI -Powered Web Application for Automated Short Video Generation." International Journal of Science, Strategic Management and Technology, vol. 02, no. 05, 2026, pp. . doi:https://doi.org/10.55041/ijsmt.v2i4.618.

Kumar, R.. "AI -Powered Web Application for Automated Short Video Generation." International Journal of Science, Strategic Management and Technology 02, no. 05 (2026). https://doi.org/https://doi.org/10.55041/ijsmt.v2i4.618.

References
1.Rombach et al., “High-Resolution Image Synthesis with Latent Diffusion Models,” Proc. IEEE CVPR, 2022.

2.Hugging Face, “Diffusers Library Documentation,” 2024. [Online]. Available: https://huggingface.co/docs/diffusers

3.Cerspense, “ZeroScope V2 Model,” Hugging Face, 2024.

4.Vaswani et al., “Attention Is All You Need,” Proc. NIPS, 2017.

5.PyTorch, “PyTorch Documentation,” 2024. [Online]. Available: https://pytorch.org

6.Gradio, “Gradio: Build Machine Learning Web Apps,” 2024. [Online]. Available: https://gradio.app

7.OpenAI, “ChatGPT: AI Language Model,” 2025.

8.Goodfellow et al., “Deep Learning,” MIT Press, 2016.

9.Arunachalam, A. S., and K. Rajeswari. "An Inclusive Survey of Student Performance With Various Data Mining Methods." International Journal of Engineering and Technology (IJET) vol7 (2018): 522-525.

 

 

 
Ethics and Compliance
✓ All ethical standards met
This article has undergone plagiarism screening and double-blind peer review. Editorial policies have been followed. Authors retain copyright under CC BY-NC 4.0 license. The research complies with ethical standards and institutional guidelines.
Indexed In
Similar Articles
Empowering Learning Through Language: NEP 2020’s Vision for Multilingual and Mother Tongue Education
string(15) "Biswajit Sarkar" Sarkar, B.
(2026)
DOI: 10.55041/ijsmt.v2i3.014
Market Feasibility Study for Launching Theatre-Quality Popcorn at Economy Pricing – A Consumer-Centric Approach
string(27) "M.Karthikeyan,Dr.K.Rajamani" M.Karthikeyan,Dr.K.Rajamani,
(2026)
DOI: 10.55041/ijsmt.v2i3.351
Leveraging –Rag for Social Media Sentiment Analysisand Trend Detection
string(14) "M. Siva Harsan" Harsan, M. S.et al.
(2026)
DOI: 10.55041/ijsmt.v2i3.353
Scroll to Top