1
|
Khan AH, Shah A, Ali A, Shahid R, Zahid ZU, Sharif MU, Jan T, Zafar MH. A performance comparison of machine learning models for stock market prediction with novel investment strategy. PLoS One 2023; 18:e0286362. [PMID: 37733720 PMCID: PMC10513304 DOI: 10.1371/journal.pone.0286362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Accepted: 05/15/2023] [Indexed: 09/23/2023] Open
Abstract
Stock market forecasting is one of the most challenging problems in today's financial markets. According to the efficient market hypothesis, it is almost impossible to predict the stock market with 100% accuracy. However, Machine Learning (ML) methods can improve stock market predictions to some extent. In this paper, a novel strategy is proposed to improve the prediction efficiency of ML models for financial markets. Nine ML models are used to predict the direction of the stock market. First, these models are trained and validated using the traditional methodology on a historic data captured over a 1-day time frame. Then, the models are trained using the proposed methodology. Following the traditional methodology, Logistic Regression achieved the highest accuracy of 85.51% followed by XG Boost and Random Forest. With the proposed strategy, the Random Forest model achieved the highest accuracy of 91.27% followed by XG Boost, ADA Boost and ANN. In the later part of the paper, it is shown that only classification report is not sufficient to validate the performance of ML model for stock market prediction. A simulation model of the financial market is used in order to evaluate the risk, maximum draw down and returns associate with each ML model. The overall results demonstrated that the proposed strategy not only improves the stock market returns but also reduces the risks associated with each ML model.
Collapse
Affiliation(s)
- Azaz Hassan Khan
- Department of Electrical Engineering and Computer Science, Jalozai Campus, University of Engineering and Technology, Peshawar, Pakistan
| | - Abdullah Shah
- Department of Electrical Engineering and Computer Science, Jalozai Campus, University of Engineering and Technology, Peshawar, Pakistan
| | - Abbas Ali
- Department of Electrical Engineering and Computer Science, Jalozai Campus, University of Engineering and Technology, Peshawar, Pakistan
| | - Rabia Shahid
- Department of Electrical Engineering and Computer Science, Jalozai Campus, University of Engineering and Technology, Peshawar, Pakistan
| | - Zaka Ullah Zahid
- Department of Electrical Engineering and Computer Science, Jalozai Campus, University of Engineering and Technology, Peshawar, Pakistan
| | - Malik Umar Sharif
- Department of Electrical Engineering and Computer Science, Jalozai Campus, University of Engineering and Technology, Peshawar, Pakistan
| | - Tariqullah Jan
- Department of Electrical Engineering, Main Campus, University of Engineering and Technology, Peshawar, Pakistan
| | - Mohammad Haseeb Zafar
- Cardiff School of Technologies, Cardiff Metropolitan University, Cardiff, United Kingdom
| |
Collapse
|
2
|
Yeganeh A, Shongwe SC. A novel application of statistical process control charts in financial market surveillance with the idea of profile monitoring. PLoS One 2023; 18:e0288627. [PMID: 37471396 PMCID: PMC10359006 DOI: 10.1371/journal.pone.0288627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Accepted: 07/02/2023] [Indexed: 07/22/2023] Open
Abstract
The implementation of statistical techniques in on-line surveillance of financial markets has been frequently studied more recently. As a novel approach, statistical control charts which are famous tools for monitoring industrial processes, have been applied in various financial applications in the last three decades. The aim of this study is to propose a novel application of control charts called profile monitoring in the surveillance of the cryptocurrency markets. In this way, a new control chart is proposed to monitor the price variation of a pair of two most famous cryptocurrencies i.e., Bitcoin (BTC) and Ethereum (ETH). Parameter estimation, tuning and sensitivity analysis are conducted assuming that the random explanatory variable follows a symmetric normal distribution. The triggered signals from the proposed method are interpreted to convert the BTC and ETH at proper times to increase their total value. Hence, the proposed method could be considered a financial indicator so that its signal can lead to a tangible increase of the pair of assets. The performance of the proposed method is investigated through different parameter adjustments and compared with some common technical indicators under a real data set. The results show the acceptable and superior performance of the proposed method.
Collapse
Affiliation(s)
- Ali Yeganeh
- Faculty of Natural and Agricultural Sciences, Department of Mathematical Statistics and Actuarial Science, University of the Free State, Bloemfontein, South Africa
| | - Sandile Charles Shongwe
- Faculty of Natural and Agricultural Sciences, Department of Mathematical Statistics and Actuarial Science, University of the Free State, Bloemfontein, South Africa
| |
Collapse
|
3
|
Navarro MM, Young MN, Prasetyo YT, Taylar JV. Stock market optimization amidst the COVID-19 pandemic: Technical analysis, K-means algorithm, and mean-variance model (TAKMV) approach. Heliyon 2023; 9:e17577. [PMID: 37366512 PMCID: PMC10287180 DOI: 10.1016/j.heliyon.2023.e17577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 06/17/2023] [Accepted: 06/21/2023] [Indexed: 06/28/2023] Open
Abstract
The Philippine stock market, just like most of its neighbors in the region, was seriously impacted by the global pandemic COVID-19. Investors remain hopeful while continuing to seek great ones in the damaged market. This paper developed a methodology for portfolio selection and optimization with the use of technical analysis, machine learning techniques, and portfolio optimization model. The combined methods of technical analysis, K-means clustering algorithm, and mean-variance portfolio optimization model will result in the development of the proposed TAKMV method. The study aims to integrate these three important analyses to identify portfolio investments. This paper uses the average annual risk and annual rate of return data for the years 2018 and 2020 to form the clusters and assessed the stocks that correspond to the investor's technical strategy such as Moving Average Convergence/Divergence (MACD) and Hybrid MACD with Arnaud Legoux Moving Average (ALMA). This paper solved the risk minimization problem on selected shares of the companies, based on the mean-variance portfolio optimization model. There are 230 and 239 companies for 2018 and 2020, respectively, listed in Philippine Stock Market, and all simulations were performed in MATLAB environment platform. Results showed that MACD strategy dominates the MACD-ALMA strategy in terms of the number of assets with a positive annual rate of return. The MACD works efficiently in the pre-COVID-19 condition while MACD-ALMA works efficiently during-COVID-19 condition, regardless of the number of assets with a positive annual rate of return. The results also show that the maximum expected portfolio return (RP) can be achieved using the MACD and MACD-ALMA in the pre-and during-COVID-19 conditions, respectively. The MACD-ALMA shows an advantage during high-risk market conditions and can also provide maximum RP. The performance of the TAKMV method was validated by applying its results and comparing it to the next year's historical price. The 2018 results were compared to 2019 data and the 2020 results were compared to 2021 data. For consistency, the comparison was applied to the same company per portfolio. Simulation results show that the MACD strategy is more effective compared to MACD-ALMA.
Collapse
Affiliation(s)
- Maricar M. Navarro
- School of Industrial Engineering and Engineering Management, Mapúa University, 658 Muralla St., Intramuros, Manila 1002, Philippines
- School of Graduate Studies, Mapúa University, 658 Muralla St., Intramuros, Manila 1002, Philippines
- Department of Industrial Engineering, Technological Institute of the Philippines Quezon City, 938 Aurora Blvd, Cubao, Quezon City 1109, Metro Manila, Philippines
| | - Michael Nayat Young
- School of Industrial Engineering and Engineering Management, Mapúa University, 658 Muralla St., Intramuros, Manila 1002, Philippines
| | - Yogi Tri Prasetyo
- International Bachelor Program in Engineering, Yuan Ze University, 135 Yuan-Tung Road, Chung-Li 32003, Taiwan
- Department of Industrial Engineering and Management, Yuan Ze University, 135 Yuan-Tung Road, Chung-Li, 32003, Taiwan
| | - Jonathan V. Taylar
- Department of Computer Engineering, Technological Institute of the Philippines Quezon City, 938 Aurora Blvd, Cubao, Quezon City 1109, Metro Manila, Philippines
| |
Collapse
|
4
|
Ngo NT, Pham AD, Truong TTH, Truong NS, Huynh NT. Developing a hybrid time-series artificial intelligence model to forecast energy use in buildings. Sci Rep 2022; 12:15775. [PMID: 36131108 PMCID: PMC9492719 DOI: 10.1038/s41598-022-19935-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2022] [Accepted: 09/06/2022] [Indexed: 11/22/2022] Open
Abstract
The development of a reliable energy use prediction model is still difficult due to the inherent complex pattern of energy use data. There are few studies developing a prediction model for the one-day-ahead energy use prediction in buildings and optimizing the hyperparameters of a prediction model is necessary. This study aimed to propose a hybrid artificial intelligence model for forecasting one-day ahead time-series energy consumption in buildings. The proposed model was developed based on the integration of the Seasonal Autoregressive integrated Moving average, the Firefly-inspired Optimization algorithm, and the support vector Regression (SAMFOR). A large dataset of energy consumption in 30-min intervals, temporal data, and weather data from six real-world buildings in Vietnam was used to train and test the model. Sensitivity analyses were performed to identify appropriate model inputs. Comparison results show that the SAMFOR model was more effective than the others such as the seasonal autoregressive integrated moving average (SARIMA) and support vector regression (SVR), SARIMA-SVR, and random forests (RF) models. Evaluation results on real-world building depicted that the proposed SAMFOR model achieved the highest accuracy with the root-mean-square error (RMSE) of 1.77 kWh in, mean absolute percentage error (MAPE) of 9.56%, and correlation coefficient (R) of 0.914. The comparison results confirmed that the SAMFOR model was effective for forecasting one-day-ahead energy consumption. The study contributes to (1) the knowledge domain by proposing the hybrid SAMFOR model for forecasting energy consumption in buildings; and (2) the state of practice by providing building managers or users with a powerful tool for analyzing and improving building energy performance.
Collapse
Affiliation(s)
- Ngoc-Tri Ngo
- Faculty of Project Management, The University of Danang - University of Science and Technology, 54 Nguyen Luong Bang, Da Nang, Vietnam.
| | - Anh-Duc Pham
- Faculty of Project Management, The University of Danang - University of Science and Technology, 54 Nguyen Luong Bang, Da Nang, Vietnam
| | - Thi Thu Ha Truong
- Department of Civil Engineering, The University of Danang-University of Technology and Education, 48 Cao Thang Street, Da Nang City, Vietnam
| | - Ngoc-Son Truong
- Faculty of Project Management, The University of Danang - University of Science and Technology, 54 Nguyen Luong Bang, Da Nang, Vietnam
| | - Nhat-To Huynh
- Faculty of Project Management, The University of Danang - University of Science and Technology, 54 Nguyen Luong Bang, Da Nang, Vietnam
| |
Collapse
|
5
|
Omar AB, Huang S, Salameh AA, Khurram H, Fareed M. Stock Market Forecasting Using the Random Forest and Deep Neural Network Models Before and During the COVID-19 Period. FRONTIERS IN ENVIRONMENTAL SCIENCE 2022; 10. [DOI: 10.3389/fenvs.2022.917047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
Abstract
Stock market forecasting is considered the most challenging problem to solve for analysts. In the past 2 years, Covid-19 has severely affected stock markets globally, which, in turn, created a great problem for investors. The prime objective of this study is to use a machine learning model to effectively forecast stock index prices in three time frames: the whole period, the pre-Covid-19 period, and the Covid-19 period. The model accuracy testing results of mean absolute error, root mean square error, mean absolute percentage error, and r2 suggest that the proposed machine learning models autoregressive deep neural network (AR-DNN(1, 3, 10)), autoregressive deep neural network (AR-DNN(3, 3, 10)), and autoregressive random forest (AR-RF(1)) are the best forecasting models for stock index price forecasting for the whole period, for the pre-Covid-19 period, and during the Covid-19 period, respectively, under high stock price fluctuations compared to traditional time-series forecasting models such as autoregressive moving average models. In particular, AR-DNN(1, 3, 10) is suggested when the number of observations is large, whereas AR-RF(1) is suggested for a series with a low number of observations. Our study has a practical implication as they can be used by investors and policy makers in their investment decisions and in formulating financial decisions and policies, respectively.
Collapse
|
6
|
Luo J, Fang SC, Deng Z, Tian Y. Robust kernel-free support vector regression based on optimal margin distribution. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
7
|
An Intelligent Fusion Model with Portfolio Selection and Machine Learning for Stock Market Prediction. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:7588303. [PMID: 35785077 PMCID: PMC9246624 DOI: 10.1155/2022/7588303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 05/17/2022] [Accepted: 05/26/2022] [Indexed: 11/18/2022]
Abstract
Developing reliable equity market models allows investors to make more informed decisions. A trading model can reduce the risks associated with investment and allow traders to choose the best-paying stocks. However, stock market analysis is complicated with batch processing techniques since stock prices are highly correlated. In recent years, advances in machine learning have given us a lot of chances to use forecasting theory and risk optimization together. The study postulates a unique two-stage framework. First, the mean-variance approach is utilized to select probable stocks (portfolio construction), thereby minimizing investment risk. Second, we present an online machine learning technique, a combination of “perceptron” and “passive-aggressive algorithm,” to predict future stock price movements for the upcoming period. We have calculated the classification reports, AUC score, accuracy, and Hamming loss for the proposed framework in the real-world datasets of 20 health sector indices for four different geographical reasons for the performance evaluation. Lastly, we conduct a numerical comparison of our method's outcomes to those generated via conventional solutions by previous studies. Our aftermath reveals that learning-based ensemble strategies with portfolio selection are effective in comparison.
Collapse
|
8
|
Introspecting predictability of market fear in Indian context during COVID-19 pandemic: An integrated approach of applied predictive modelling and explainable AI. INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT DATA INSIGHTS 2021. [PMCID: PMC8463332 DOI: 10.1016/j.jjimei.2021.100039] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Financial markets across the globe have seen rapid volatility and uncertainty owing to scary and disruptive impacts of COVID-19 pandemic. Mayhem wrecked by frequent lockdowns, curfews, emergencies, etc. has stoked the high quantum of chaotic movement in equity markets and resulted in perplexed investor behaviour. It, therefore, is of paramount practical relevance to measure predictability of market fear at such a crucial juncture of time. Market fear can effectively be measured in terms of implied and historic volatility of equity markets. The present study chooses India VIX and 20-day rolling standard deviation of NIFTY returns to account for implied and historic volatility respectively during the ongoing COVID-19 timeline. Pertinent macroeconomic constructs, technical indicators and Google search volume index on meaningful keywords have been chosen as raw explanatory features for inspecting predictability. Boruta feature selection methodology has been used in a supervised manner to select significant features. State-Of-The-Art machine and deep learning algorithms namely Gradient Boosting (GB), Extra Tree Regression (ERT), Deep Neural Network (DNN), Long Short Term Memory Network (LSTM) are then used on processed feature set to scrupulously evaluate the quantum of predictability of said assets. The integrated predictive frameworks have been subjected to a battery of numerical and statistical checks to draw inferences. Additionally, Explainable AI frameworks have been used to analyse the nature of influence of respective features. Findings indeed suggest that despite exhibiting high degree of volatile traits, both India VIX and historic volatility can be predicted utilizing the proposed architectures effectively and serve practical actionable insights.
Collapse
|