Synthetic Image Detection using Deep Learning

Marimuthu, P.; Parameshwaran, B.; Anusurya, R. A.

doi:https://doi.org/10.55041/ijsmt.v2i5.015

Plagiarism Passed

Peer reviewed

Open Access

SYNTHETIC IMAGE DETECTION USING DEEP LEARNING

AUTHORS:

P. Marimuthu

B. Parameshwaran

R. A. Anusurya

Mentor

Dr. A. Akila

Affiliation

DOI: 10.55041/ijsmt.v2i5.015

CC BY 4.0 License:

This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

DOWNLOAD ARTICLE

REVIEW REPORT

CITE THIS ARTICLE

Abstract

The proliferation of AI-generated synthetic images produced by Generative Adversarial Networks (GANs) and diffusion-based models has created critical challenges for digital forensics, media authentication, and public trust. This paper presents a complete Synthetic Image Detection (SID) system capable of classifying any digital image as REAL or AI-generated without requiring access to any external dataset at inference time. The system combines two complementary deep learning architectures — a fine-tuned ResNet50 Convolutional Neural Network and a Vision Transformer (ViT-B/16) — both trained on the CIFAKE benchmark comprising 120,000 images. ResNet50 achieved 98.08% test accuracy with AUC-ROC of 0.9971, and ViT-B/16 achieved 98.05% accuracy with AUC-ROC of 0.9984, both substantially exceeding the published CIFAKE baseline of 92.98% (Bird and Lotfi, 2024). GradCAM++ (Gradient-weighted Class Activation Mapping++) was implemented from scratch to generate pixel-level visual explanations identifying the specific image regions influencing each classification decision. Beyond deep learning, a novel multi-signal forensic analysis pipeline was developed incorporating six independent pixel-level detectors: Discrete Fourier Transform checkerboard analysis, Error Level Analysis, SRM noise residual inspection, colour channel statistical analysis, Local Binary Pattern texture entropy, and Haar wavelet energy ratio computation. All six signals are combined with the two deep learning models in a weighted mega-ensemble producing a final verdict purely from the uploaded image pixels. The complete system is deployed as a RESTful API using FastAPI with a five-page Streamlit web interface providing real-time detection, GradCAM++ visualisation, and forensic signal breakdown

Keywords

Article Information

Article Metrics

Article Views

PDF Downloads

HOW TO CITE

References

Ethics and Compliance

✓ All ethical standards met

This article has undergone plagiarism screening and double-blind peer review. Editorial policies have been followed. Authors retain copyright under CC BY-NC 4.0 license. The research complies with ethical standards and institutional guidelines.

Indexed In

International Journal of Science, Strategic Management and Technology

ISSN: 3108-1762 (Online)

SYNTHETIC IMAGE DETECTION USING DEEP LEARNING

About Journal

Policies & Ethics

Indexing Platforms

Contact Us

SYNTHETIC IMAGE DETECTION USING DEEP LEARNING

About Journal

Contact Us

Share on