SOFTWARE EFFORT ESTIMATION USING MACHINE LEARNING
The forecast of effort needed to complete software development is essential to control costs, schedules, and allocation of resources in the current projects. However, initial estimates are often inaccurate because of the multifaceted nature of project data. This is the kind of data that is usually inconsistent, non-linear relationships, and the effect of both human and technical aspects, which classical models can hardly address entirely account for. The machine proposed in this paper is a novel, two-part machine method of learning that is used to approximate software development effort and spot projects likely to go over budget limits. The framework takes a blended strategy: a stacked ensemble regressor, a combination of Random Forest and Gradient Enhancing algorithms, to generate accurate effort predictions, and a high-risk project identification layer to secondary classification layer of budget overruns. Experiment on the Desharnais and NASA93 datasets showed interesting outcomes, as the model obtained a Mean Magnitude of Relative Error (MMRE) of 0.138 and a classification accuracy of 91.5%. Additional discussion of error rates means that the model provides reliable risk estimates and without producing too many false alarms.
Keywords- Software Effort Estimation, Machine Learning, Stacked Ensemble, Risk Classification, Random Forest, Gradient Boosting, MMRE.
Choubey, S. & Tiwari, V. (2026). Software Effort Estimation using Machine Learning. International Journal of Science, Strategic Management and Technology, 02(6). https://doi.org/10.55041/ijsmt.v2i6.079
Choubey, Satyam, and Vikas Tiwari. "Software Effort Estimation using Machine Learning." International Journal of Science, Strategic Management and Technology, vol. 02, no. 6, 2026, pp. . doi:https://doi.org/10.55041/ijsmt.v2i6.079.
Choubey, Satyam, and Vikas Tiwari. "Software Effort Estimation using Machine Learning." International Journal of Science, Strategic Management and Technology 02, no. 6 (2026). https://doi.org/https://doi.org/10.55041/ijsmt.v2i6.079.
2.P. Brooks Jr., The Mythical Man-Month: Essays on Software Engineering. Addison-Wesley, 1995.
3.W. Boehm, Software Engineering Economics. Englewood Cliffs, NJ: Prentice-Hall, 1981.
4.F. Kemerer, An empirical validation of software cost estimation models,” Commun. ACM, vol. 30, no. 5, pp. 416–429, 1987.
5.Shepperd and C. Schofield, Estimating software project effort using analogies,” IEEE Trans. Softw. Eng., vol. 23, no. 11, pp. 736–743, 1997.
6.Kocaguneli, T. Menzies, and J. W. Keung, On the value of ensemble effort estimation,” IEEE Trans. Softw. Eng., vol. 38, no. 6, pp. 1403–1416, 2012.
7.Breiman, Random Forests,” Machine Learning, vol. 45, no. 1, pp. 5–32, 2001.
8.H. Friedman, Greedy function approximation: A gradient boosting machine,” Annals of Statistics, vol. 29, pp. 1189–1232, 2001.
9.H. Wolpert, Stacked generalization,” Neural Networks, vol. 5, no. 2, pp. 241–259, 1992.
10.Corazza, et al., How effective is tabu search to configure support vector regression for effort estimation?” Proc. 6th Int. Conf. Predict. Models Softw. Eng., 2010.