TY - JOUR
T1 - A performance comparison of machine learning models for stock market prediction with novel investment strategy
AU - Khan, Azaz Hassan
AU - Shah, Abdullah
AU - Ali, Abbas
AU - Shahid, Rabia
AU - Zahid, Zaka Ullah
AU - Sharif, Malik Umar
AU - Jan, Tariqullah
AU - Zafar, Mohammad Haseeb
N1 - Publisher Copyright:
Copyright: © 2023 Khan et al.
PY - 2023/9/21
Y1 - 2023/9/21
N2 - Stock market forecasting is one of the most challenging problems in today’s financial markets. According to the efficient market hypothesis, it is almost impossible to predict the stock market with 100% accuracy. However, Machine Learning (ML) methods can improve stock market predictions to some extent. In this paper, a novel strategy is proposed to improve the prediction efficiency of ML models for financial markets. Nine ML models are used to predict the direction of the stock market. First, these models are trained and validated using the traditional methodology on a historic data captured over a 1-day time frame. Then, the models are trained using the proposed methodology. Following the traditional methodology, Logistic Regression achieved the highest accuracy of 85.51% followed by XG Boost and Random Forest. With the proposed strategy, the Random Forest model achieved the highest accuracy of 91.27% followed by XG Boost, ADA Boost and ANN. In the later part of the paper, it is shown that only classification report is not sufficient to validate the performance of ML model for stock market prediction. A simulation model of the financial market is used in order to evaluate the risk, maximum draw down and returns associate with each ML model. The overall results demonstrated that the proposed strategy not only improves the stock market returns but also reduces the risks associated with each ML model.
AB - Stock market forecasting is one of the most challenging problems in today’s financial markets. According to the efficient market hypothesis, it is almost impossible to predict the stock market with 100% accuracy. However, Machine Learning (ML) methods can improve stock market predictions to some extent. In this paper, a novel strategy is proposed to improve the prediction efficiency of ML models for financial markets. Nine ML models are used to predict the direction of the stock market. First, these models are trained and validated using the traditional methodology on a historic data captured over a 1-day time frame. Then, the models are trained using the proposed methodology. Following the traditional methodology, Logistic Regression achieved the highest accuracy of 85.51% followed by XG Boost and Random Forest. With the proposed strategy, the Random Forest model achieved the highest accuracy of 91.27% followed by XG Boost, ADA Boost and ANN. In the later part of the paper, it is shown that only classification report is not sufficient to validate the performance of ML model for stock market prediction. A simulation model of the financial market is used in order to evaluate the risk, maximum draw down and returns associate with each ML model. The overall results demonstrated that the proposed strategy not only improves the stock market returns but also reduces the risks associated with each ML model.
UR - http://www.scopus.com/inward/record.url?scp=85171960116&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0286362
DO - 10.1371/journal.pone.0286362
M3 - Article
C2 - 37733720
AN - SCOPUS:85171960116
SN - 1932-6203
VL - 18
JO - PLoS ONE
JF - PLoS ONE
IS - 9 September
M1 - e0286362
ER -