SYNTHETIC IMAGE DETECTION USING DEEP LEARNING
The proliferation of AI-generated synthetic images produced by Generative Adversarial Networks (GANs) and diffusion-based models has created critical challenges for digital forensics, media authentication, and public trust. This paper presents a complete Synthetic Image Detection (SID) system capable of classifying any digital image as REAL or AI-generated without requiring access to any external dataset at inference time. The system combines two complementary deep learning architectures — a fine-tuned ResNet50 Convolutional Neural Network and a Vision Transformer (ViT-B/16) — both trained on the CIFAKE benchmark comprising 120,000 images. ResNet50 achieved 98.08% test accuracy with AUC-ROC of 0.9971, and ViT-B/16 achieved 98.05% accuracy with AUC-ROC of 0.9984, both substantially exceeding the published CIFAKE baseline of 92.98% (Bird and Lotfi, 2024). GradCAM++ (Gradient-weighted Class Activation Mapping++) was implemented from scratch to generate pixel-level visual explanations identifying the specific image regions influencing each classification decision. Beyond deep learning, a novel multi-signal forensic analysis pipeline was developed incorporating six independent pixel-level detectors: Discrete Fourier Transform checkerboard analysis, Error Level Analysis, SRM noise residual inspection, colour channel statistical analysis, Local Binary Pattern texture entropy, and Haar wavelet energy ratio computation. All six signals are combined with the two deep learning models in a weighted mega-ensemble producing a final verdict purely from the uploaded image pixels. The complete system is deployed as a RESTful API using FastAPI with a five-page Streamlit web interface providing real-time detection, GradCAM++ visualisation, and forensic signal breakdown
Marimuthu, P., Parameshwaran, B. & Anusurya, R. A. (2026). Synthetic Image Detection using Deep Learning. International Journal of Science, Strategic Management and Technology, 02(05). https://doi.org/10.55041/ijsmt.v2i5.015
Marimuthu, P., et al.. "Synthetic Image Detection using Deep Learning." International Journal of Science, Strategic Management and Technology, vol. 02, no. 05, 2026, pp. . doi:https://doi.org/10.55041/ijsmt.v2i5.015.
Marimuthu, P.,B. Parameshwaran, and R. Anusurya. "Synthetic Image Detection using Deep Learning." International Journal of Science, Strategic Management and Technology 02, no. 05 (2026). https://doi.org/https://doi.org/10.55041/ijsmt.v2i5.015.
[2] R. Corvi, D. Cozzolino, G. Zingarini, G. Poggi, K. Nagano, and L. Verdoliva, "On the detection of synthetic images generated by diffusion models," in Proc. IEEE ICASSP, 2023.
[3] A. Chattopadhay, A. Sarkar, P. Howlader, and V. N. Balasubramanian, "Grad-CAM++: Generalised gradient-based visual explanations for deep convolutional networks," in Proc. IEEE WACV, 2018.
[4] A. Dosovitskiy et al., "An image is worth 16×16 words: Transformers for image recognition at scale," in Proc. ICLR, 2021.
[5] X. Zhang, S. Karaman, and S. F. Chang, "Detecting and simulating artifacts in GAN fake images," in Proc. IEEE WIFS, 2019.
[6] R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra, "Grad-CAM: Visual explanations from deep networks via gradient-based localization," in Proc. IEEE ICCV, 2017.
[7] K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in Proc. IEEE CVPR, pp. 770–778, 2016.
[8] I. Goodfellow et al., "Generative adversarial nets," in Advances in Neural Information Processing Systems (NeurIPS), vol. 27, 2014.
[9] R. Corvi, D. Cozzolino, G. Poggi, K. Nagano, and L. Verdoliva, "Intriguing properties of synthetic images: From GANs to diffusion models," in Proc. IEEE/CVF CVPRW, 2023.
[10] D. Cozzolino, G. Poggi, R. Corvi, M. Nießner, and L. Verdoliva, "Raising the bar of AI-generated image detection with CLIP," in Proc. IEEE/CVF CVPR, pp. 4356–4366, 2024.