BENCHMARKING GREEN AI METHODS FOR AUDIO DEEPFAKE DETECTION:A COMPARATIVE STUDY OF EFFICIENCY AND ACCURACY
Audio deepfake detection has emerged as a critical challenge in AI security, driven by the rapid proliferation of advanced voice synthesis and voice conversion technologies. State-of-the-art detectors deliver impressive accuracy but impose substantial computational and environmental costs. Green AI offers a compelling alternative by leveraging frozen, pre-trained self-supervised learning (SSL) models as feature extractors paired with lightweight classical machine learning classifiers — enabling CPU-only training and inference. This paper presents a systematic benchmarking study of existing Green AI approaches for audio deepfake detection, evaluating multiple SSL front-ends (wav2vec 2.0, WavLM, HuBERT) in conjunction with multiple classical ML back-ends (SVM-RBF, Logistic Regression, MLP) across two benchmark datasets — ASVspoof 2019 LA and ASVspoof 2021 DF. Beyond accuracy (measured by Equal Error Rate), we introduce a multi-dimensional efficiency analysis encompassing trainable parameter count, training time, inference time, estimated energy consumption, and approximate CO2 emissions. Our results demonstrate that SSL(wav2vec 2.0, Layer 9) + SVM-RBF achieves the best Green AI accuracy with an EER of 0.90% on ASVspoof 2019 LA using fewer than 1,000 trainable parameters, training in under 3 minutes on a standard CPU.
Fatima, B. & Pandey, R. (2026). Benchmarking Green AI Methods for Audio Deepfake Detection:A Comparative Study of Efficiency and Accuracy. International Journal of Science, Strategic Management and Technology, 02(05). https://doi.org/10.55041/ijsmt.v2i5.165
Fatima, Bushra, and Rohitashwa Pandey. "Benchmarking Green AI Methods for Audio Deepfake Detection:A Comparative Study of Efficiency and Accuracy." International Journal of Science, Strategic Management and Technology, vol. 02, no. 05, 2026, pp. . doi:https://doi.org/10.55041/ijsmt.v2i5.165.
Fatima, Bushra, and Rohitashwa Pandey. "Benchmarking Green AI Methods for Audio Deepfake Detection:A Comparative Study of Efficiency and Accuracy." International Journal of Science, Strategic Management and Technology 02, no. 05 (2026). https://doi.org/https://doi.org/10.55041/ijsmt.v2i5.165.
[2] J. Jung et al., "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks," Proc. IEEE ICASSP, 2022.
[3] H. Tak et al., "Automatic Speaker Verification Spoofing and Deepfake Detection Using Wav2vec 2.0," Proc. Odyssey, 2022.
[4] A. Baevski, Y. Zhou, A. Mohamed, and M. Auli, "Wav2vec 2.0: A Framework for Self-Supervised Learning," NeurIPS, 2020.
[5] S. Chen et al., "WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing," IEEE JSTSP, 2022.
[6] W.-N. Hsu et al., "HuBERT: Self-Supervised Speech Representation Learning," IEEE/ACM TASLP, vol. 29, 2021.
[7] X. Wang et al., "ASVspoof 2019: A Large-Scale Public Database," Comput. Speech Lang., vol. 64, 2020.
[8] J. Yamagishi et al., "ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild," IEEE/ACM TASLP, vol. 31, 2023.
[9] R. Schwartz et al., "Green AI," Commun. ACM, vol. 63, no. 12, pp. 54–63, 2020.
[10] E. Strubell, A. Ganesh, and A. McCallum, "Energy and Policy Considerations for Deep Learning in NLP," Proc. ACL, 2019.