MACHINE LEARNING APPROACH FOR PREDICTING ROAD ACCIDENT IMPACT LEVELS USING PYTHON
Road traffic accidents present a premier global public safety and economic crisis, responsible for millions of annual fatalities and severe infrastructure disruptions [1]. Conventional safety interventions heavily rely on reactive, post-incident reporting rather than proactive, predictive systems. This paper introduces an integrated machine learning framework engineered to forecast road accident impact levels and delineate high-risk geographic clusters, colloquially termed "black spots." Leveraging a highly detailed, pre-processed manual record dataset from India (2017–2022) featuring 32 attributes across 12,316 distinct records alongside regional data arrays [6], we evaluate the predictive efficacy of four major supervised classification algorithms: Logistic Regression, Decision Trees, Naïve Bayes, and Random Forest Classifiers. Distinct from clinical injury models, the severity metric optimized wherein primarily measures localized operational degradation and impact on regional traffic flow stability [9]. Empirical results demonstrate that the Random Forest Classifier achieves superior performance, outperforming competitive architectures in terms of global classification accuracy ($92.0\%$), robustness to feature collinearity, and multi-class F1-scores [7]. Furthermore, a spatial clustering protocol is applied to isolate accident black spots [5]. The outputs of this framework provide a foundational system capable of translating multi-modal data streams into real-time hazard warnings for intelligent transportation networks and target policy interventions for urban planners.
Kumar, K., Hembrom, J., Kumar, S. & Kumar, R. (2026). Machine Learning Approach for Predicting Road Accident Impact Levels using Python. International Journal of Science, Strategic Management and Technology, 02(05). https://doi.org/10.55041/ijsmt.v2i5.390
Kumar, Kishor, et al.. "Machine Learning Approach for Predicting Road Accident Impact Levels using Python." International Journal of Science, Strategic Management and Technology, vol. 02, no. 05, 2026, pp. . doi:https://doi.org/10.55041/ijsmt.v2i5.390.
Kumar, Kishor,Jyotish Hembrom,Suraj Kumar, and Ranvir Kumar. "Machine Learning Approach for Predicting Road Accident Impact Levels using Python." International Journal of Science, Strategic Management and Technology 02, no. 05 (2026). https://doi.org/https://doi.org/10.55041/ijsmt.v2i5.390.
2.Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. MIT Press.
3.Karamanlis, I., Kokkalis, A., Profillidis, V., Botzoris, G., Kiourt, C., Sevetlidis, V., & Pavlidis, G. (2023). Deep learning based black spot identification on Greek road networks. Data, 8(6), 110. https://doi.org/10.3390/data8060110
4.Mitchell, T. M. (1997). Machine Learning. McGraw-Hill Education.
5.Sarkar, A. (2024). Accident black spot identification based on classical and computational intelligence methods. AIP Conference Proceedings, 3181(1), 030004. https://doi.org/10.1063/5.0214707
6.Kaggle Dataset Repository. (2022). Road Accident Severity in India (2017-2022). Data Source: S3Programmer.
7.Géron, A. (2019). Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow (2nd ed.). O'Reilly Media.
8.Raschka, S., & Mirjalili, V. (2019). Python Machine Learning (3rd ed.). Packt Publishing.
9.Liu, H., & Shetty, R. R. (2021). Analytical Models for Traffic Congestion and Accident Analysis. Mineta Transportation Institute. https://doi.org/10.31979/mti.2021.2102
10.Alobidan, Y. A. (2026). Feature selection for accident severity modeling: A WCFR-based analysis on the U.S. Accidents dataset. MDPI Electronics, 15(6), 1308. https://doi.org/10.3390/electronics15061308