Predicting Housing Sale Prices Using Machine Learning with Various Data Split Ratios
DOI:
https://doi.org/10.56294/dm2024.231
Keywords:
Data, Computational methods, House Prediction
Abstract
Introduction: Recent advances in technology and data analytics have driven the rapid growth of artificial intelligence (AI) and machine learning (ML). These technologies are now essential tools in many industries, especially in predictive modeling for asset pricing.
Objective: From stock markets and rental properties to real estate and second-hand goods, AI and ML algorithms are widely applied to estimate values, optimize pricing strategies, and forecast market trends.
Method: By analyzing vast amounts of data, these tools enable more accurate predictions and informed decision-making, revolutionizing traditional approaches to pricing and valuation. In this study, the primary goal is to achieve the most accurate price prediction for houses or apartments by experimenting with different data split ratios.
Result: The model reached an RMSE of 188965.28 on house price, which is considered acceptable for the best-performing configuration.
Conclusions: The RMSE of this model is relatively low, and its squared correlation is 64%, which is above the 50% threshold, so the predicted prices appear appropriate. I have therefore presented this model and its predicted house prices as the final accepted outcome of my research.
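The evaluation described in the abstract (trying several data split ratios and comparing RMSE and squared correlation) can be sketched as follows. This is a minimal illustration, not the authors' pipeline. The abstract does not name the model, the dataset, or the ratios tested, so the one-feature least-squares line, the synthetic (area, price) rows, the ratios 0.6 to 0.9, and every function name below are assumptions chosen only to make the example self-contained.

```python
import math
import random


def split(rows, train_ratio, seed=0):
    """Shuffle a copy of rows and split it into (train, test) by train_ratio."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    cut = int(round(len(shuffled) * train_ratio))
    return shuffled[:cut], shuffled[cut:]


def fit_line(train):
    """Ordinary least squares for price = a + b * area (stand-in model, assumed)."""
    n = len(train)
    mean_x = sum(x for x, _ in train) / n
    mean_y = sum(y for _, y in train) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in train)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in train)
    b = sxy / sxx
    return mean_y - b * mean_x, b


def rmse(actual, predicted):
    """Root mean squared error, in the same unit as the price."""
    return math.sqrt(
        sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
    )


def squared_correlation(actual, predicted):
    """Square of the Pearson correlation between actual and predicted values."""
    n = len(actual)
    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n
    cov = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    var_a = sum((a - mean_a) ** 2 for a in actual)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov * cov / (var_a * var_p)


def evaluate_split_ratios(rows, ratios=(0.6, 0.7, 0.8, 0.9)):
    """Return {train_ratio: (rmse, squared_correlation)} on each held-out part."""
    results = {}
    for ratio in ratios:
        train, test = split(rows, ratio)
        a, b = fit_line(train)
        actual = [y for _, y in test]
        predicted = [a + b * x for x, _ in test]
        results[ratio] = (
            rmse(actual, predicted),
            squared_correlation(actual, predicted),
        )
    return results


def make_synthetic_rows(n=200, seed=42):
    """Hypothetical (area, price) rows; NOT the data used in the study."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        area = rng.uniform(50, 300)
        rows.append((area, 2000 * area + rng.gauss(0, 50000)))
    return rows


if __name__ == "__main__":
    for ratio, (err, r2) in evaluate_split_ratios(make_synthetic_rows()).items():
        print(f"train ratio {ratio:.1f}: RMSE={err:.2f}, squared correlation={r2:.2%}")
```

Under these assumptions, the preferred split is the one with the lowest RMSE and a squared correlation above the chosen threshold (50% in the abstract). Because RMSE is expressed in price units, a figure such as 188965.28 can only be judged relative to the typical house price in the dataset.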
License
Copyright (c) 2025 Awais Azam, Alimul Haque, Sakshi Rai (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.
The article is distributed under the Creative Commons Attribution 4.0 License. Unless otherwise stated, associated published material is distributed under the same licence.