1
|
Abo El Nasr MM, Abdelmegaly AA, Abdo DA. Performance evaluation of different regression models: application in a breast cancer patient data. Sci Rep 2024; 14:12986. [PMID: 38839771 DOI: 10.1038/s41598-024-62627-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Accepted: 05/20/2024] [Indexed: 06/07/2024] Open
Abstract
This paper provides a comprehensive analysis of linear regression models, focusing on addressing multicollinearity challenges in breast cancer patient data. Linear regression methodologies, including GAM, Beta, GAM Beta, Ridge, and Beta Ridge, are compared using two statistical criteria. The study, conducted with R software, showcases the Beta regression model's exceptional performance, achieving a BIC of - 5520.416. Furthermore, the Ridge regression model demonstrates remarkable results with the best AIC at - 8002.647. The findings underscore the practical application of these models in real-world scenarios and emphasize the Beta regression model's superior ability to handle multicollinearity challenges. The preference for AIC over BIC in Generalized Additive Models (GAMs) is rooted in the AIC's calculation framework, highlighting its effectiveness in capturing the complexity and flexibility inherent in GAMs.
Collapse
Affiliation(s)
- Mona Mahmoud Abo El Nasr
- Department of Applied Statistics and Insurance, Faculty of Commerce, Mansoura University, Mansoura, 33516, Egypt.
| | - Alaa A Abdelmegaly
- Higher Institute of Advanced Management Sciences and Computers, Al-Buhayrah, Egypt
| | - Doaa A Abdo
- Department of Applied Statistics and Insurance, Faculty of Commerce, Mansoura University, Mansoura, 33516, Egypt
| |
Collapse
|
2
|
Sirohi A, Alsaedi BS, Ahelali MH, Jayaswal MK. Biased proportional hazard regression estimator in the existence of collinearity. Heliyon 2023; 9:e21394. [PMID: 38027716 PMCID: PMC10665663 DOI: 10.1016/j.heliyon.2023.e21394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Revised: 10/03/2023] [Accepted: 10/20/2023] [Indexed: 12/01/2023] Open
Abstract
This paper proposed a new biased proportional hazard regression (PHR) estimator which is the combination of elastic net proportional hazard regression (ENPHR) and principal components proportional hazard regression (PCPHR) estimator. Comparison of proposed estimator with ENPHR, PCPHR, ridge PHR, lasso PHR, r - k class PHR and maximum likelihood (ML) estimators is done in terms of scalar mean square error (MSE). Simulation study is conducted to examine the performance of each estimator. Furthermore, the developed estimator is utilized to analyze the infant mortality in Delhi, India.
Collapse
Affiliation(s)
- Anu Sirohi
- Department of Statistics, AIAS, Amity University, Noida, India
| | - Basim S.O. Alsaedi
- Department of Statistics, University of Tabuk, Tabuk 71491, Saudi Arabia
| | - Marwan H. Ahelali
- Department of Statistics, University of Tabuk, Tabuk 71491, Saudi Arabia
| | | |
Collapse
|
3
|
Pang X, Ge M. Effect of geographical factors on reference values of creatine kinase isoenzyme. INTERNATIONAL JOURNAL OF BIOMETEOROLOGY 2023; 67:553-563. [PMID: 36941512 PMCID: PMC10027583 DOI: 10.1007/s00484-023-02429-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/08/2022] [Revised: 10/16/2022] [Accepted: 01/10/2023] [Indexed: 05/25/2023]
Abstract
The aim of this study was to investigate the geographical spatial distribution of creatine kinase isoenzyme (CK-MB) in order to provide a scientific basis for clinical examination. The reference values of CK-MB of 8697 healthy adults in 137 cities in China were collected by reading a large number of literates. Moran index was used to determine the spatial relationship, and 24 factors were selected, which belonged to terrain, climate, and soil indexes. Correlation analysis was conducted between CK-MB and geographical factors to determine significance, and 9 significance factors were extracted. Based on R language to evaluate the degree of multicollinearity of the model, CK-MB Ridge model, Lasso model, and PCA model were established, through calculating the relative error to choose the best model PCA, testing the normality of the predicted values, and choosing the disjunctive kriging interpolation to make the geographical distribution. The results show that CK-MB reference values of healthy adults were generally correlated with latitude, annual sunshine duration, annual mean relative humidity, annual precipitation amount, and annual range of air temperature and significantly correlated with annual mean air temperature, topsoil gravel content, topsoil cation exchange capacity in clay, and topsoil cation exchange capacity in silt. The geospatial distribution map shows that on the whole, it is higher in the north and lower in the south, and gradually increases from the southeast coastal area to the northwest inland area. If the geographical factors are obtained in a location, the CK-MB model can be used to predict the CK-MB of healthy adults in the region, which provides a reference for us to consider regional differences in clinical diagnosis.
Collapse
Affiliation(s)
- Xinrui Pang
- College of Tourist and Environment Science, Shaanxi Normal University, Xi’an, Shaanxi 710119 China
| | - Miao Ge
- College of Tourist and Environment Science, Shaanxi Normal University, Xi’an, Shaanxi 710119 China
| |
Collapse
|
4
|
Arabi Belaghi R, Asar Y, Larsson R. Improved shrinkage estimators in the beta regression model with application in econometric and educational data. Stat Pap (Berl) 2022. [DOI: 10.1007/s00362-022-01355-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
AbstractAlthough beta regression is a very useful tool to model the continuous bounded outcome variable with some explanatory variables, however, in the presence of multicollinearity, the performance of the maximum likelihood estimates for the estimation of the parameters is poor. In this paper, we propose improved shrinkage estimators via Liu estimator to obtain more efficient estimates. Therefore, we defined linear shrinkage, pretest, shrinkage pretest, Stein and positive part Stein estimators to estimate of the parameters in the beta regression model, when some of them have not a significant effect to predict the outcome variable so that a sub-model may be sufficient. We derived the asymptotic distributional biases, variances, and then we conducted extensive Monte Carlo simulation study to obtain the performance of the proposed estimation strategy. Our results showed a great benefit of the new methodologies for practitioners specifically in the applied sciences. We concluded the paper with two real data analysis from economics and education.
Collapse
|
5
|
Afzal N, Amanullah M. Dawoud–Kibria Estimator for the Logistic Regression Model: Method, Simulation and Application. IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY, TRANSACTIONS A: SCIENCE 2022. [DOI: 10.1007/s40995-022-01354-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
|
6
|
Hadia M, Amin M, Akram MN. Comparison of link functions for the estimation of logistic ridge regression: an application to urine data. COMMUN STAT-SIMUL C 2022. [DOI: 10.1080/03610918.2022.2127769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Affiliation(s)
- Mehmoona Hadia
- Department of Statistics, University of Sargodha, Sargodha, Pakistan
| | - Muhammad Amin
- Department of Statistics, University of Sargodha, Sargodha, Pakistan
| | | |
Collapse
|
7
|
Zhao L, Yang J. Batch Process Monitoring Based on Quality-Related Time-Batch 2D Evolution Information. SENSORS (BASEL, SWITZERLAND) 2022; 22:2235. [PMID: 35336405 PMCID: PMC8954576 DOI: 10.3390/s22062235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 03/10/2022] [Accepted: 03/11/2022] [Indexed: 06/14/2023]
Abstract
This paper proposed a quality-related online monitoring strategy based on time and batch two-dimensional evolution information for batch processes. In the direction of time, considering the difference between each phase and the steady part and the transition part in the same phase, the change trend of the regression coefficient of the PLS model is used to divide each batch into phases, and each phase into parts. The phases, the steady parts, and the transition parts are finally distinguished and dealt with separately in the subsequent modeling process. In the batch direction, considering the slow time-varying characteristics of batch evolution, sliding windows are used to perform mode division by analyzing the evolution trend of the score matrix T in the PLS model on the base of phase division and within-phase part division. Finally, an online monitoring model that comprehensively considers the evolution information of time and batch is obtained. In a typical batch operation process, injection molding is used as an example for experimental analysis. The results show that the proposed algorithm takes advantage of mixing the time-batch two-dimensional evolution information. Compared with the traditional methods, the proposed method can overcome the shortcomings caused by the single dimension analysis and has better monitoring results.
Collapse
|
8
|
Zhao L, Huang X. Slow Time-Varying Batch Process Quality Prediction Based on Batch Augmentation Analysis. SENSORS 2022; 22:s22020512. [PMID: 35062472 PMCID: PMC8780299 DOI: 10.3390/s22020512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 12/19/2021] [Accepted: 01/09/2022] [Indexed: 11/25/2022]
Abstract
In this paper, focusing on the slow time-varying characteristics, a series of works have been conducted to implement an accurate quality prediction for batch processes. To deal with the time-varying characteristics along the batch direction, sliding windows can be constructed. Then, the start-up process is identified and the whole process is divided into two modes according to the steady-state identification. In the most important mode, the process data matrix, used to establish the regression model of the current batch, is expanded to involve the process data of previous batches, which is called batch augmentation. Thus, the process data of previous batches, which have an important influence on the quality of the current batch, will be identified and form a new batch augmentation matrix for modeling using the partial least squares (PLS) method. Moreover, considering the multiphase characteristic, batch augmentation analysis and modeling is conducted within each phase. Finally, the proposed method is applied to a typical batch process, the injection molding process. The quality prediction results are compared with those of the traditional quality prediction method based on PLS and the ridge regression method under the proposed batch augmentation analysis framework. The conclusion is obtained that the proposed method based on the batch augmentation analysis is superior.
Collapse
|
9
|
Qasim M, Akram MN, Amin M, Månsson K. A restricted gamma ridge regression estimator combining the gamma ridge regression and the restricted maximum likelihood methods of estimation. J STAT COMPUT SIM 2021. [DOI: 10.1080/00949655.2021.2005063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Muhammad Qasim
- Department of Economics, Finance and Statistics, Jönköping International Business School, Jönköping University, Jönköping, Sweden
| | | | - Muhammad Amin
- Department of Statistics, University of Sargodha, Sargodha, Pakistan
| | - Kristofer Månsson
- Department of Economics, Finance and Statistics, Jönköping International Business School, Jönköping University, Jönköping, Sweden
| |
Collapse
|
10
|
An Optimization Technique for Solving a Class of Ridge Fuzzy Regression Problems. Neural Process Lett 2021. [DOI: 10.1007/s11063-021-10538-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
11
|
Akram MN, Amin M, Qasim M. A new biased estimator for the gamma regression model: Some applications in medical sciences. COMMUN STAT-THEOR M 2021. [DOI: 10.1080/03610926.2021.1977958] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
| | - Muhammad Amin
- Department of Statistics, University of Sargodha, Sargodha, Pakistan
| | - Muhammad Qasim
- Department of Economics, Finance and Statistics, Jönköping International Business School, Jönköping University, Jönköping, Sweden
| |
Collapse
|
12
|
Qasim M, Månsson K, Sjölander P, Kibria BMG. A new class of efficient and debiased two-step shrinkage estimators: method and application. J Appl Stat 2021; 49:4181-4205. [DOI: 10.1080/02664763.2021.1973389] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
- Muhammad Qasim
- Department of Economics, Finance and Statistics, Jönköping University, Jönköping, Sweden
| | - Kristofer Månsson
- Department of Economics, Finance and Statistics, Jönköping University, Jönköping, Sweden
| | - Pär Sjölander
- Department of Economics, Finance and Statistics, Jönköping University, Jönköping, Sweden
| | - B. M. Golam Kibria
- Department of Mathematics and Statistics, Florida International University, Miami, FL, USA
| |
Collapse
|
13
|
Affiliation(s)
- Y. Murat Bulut
- Department of Statistics, Eskişehir Osmangazi University, Eskisehir, Turkey
| |
Collapse
|
14
|
Majid A, Amin M, Akram MN. On the Liu estimation of Bell regression model in the presence of multicollinearity. J STAT COMPUT SIM 2021. [DOI: 10.1080/00949655.2021.1955886] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
- Abdul Majid
- Pakistan Bureau of Statistics, Regional Office, Multan, Pakistan
| | - Muhammad Amin
- Department of Statistics, University of Sargodha, Sargodha, Pakistan
| | | |
Collapse
|
15
|
Farghali RA, Qasim M, Kibria BMG, Abonazel MR. Generalized two-parameter estimators in the multinomial logit regression model: methods, simulation and application. COMMUN STAT-SIMUL C 2021. [DOI: 10.1080/03610918.2021.1934023] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Rasha A. Farghali
- Department of Mathematics, Insurance and Applied Statistics, Faculty of Commerce and Business Administration, Helwan University, Cairo, Egypt
| | - Muhammad Qasim
- Department of Economics, Finance and Statistics, Jönköping International Business School, Jönköping University, Jonkoping, Sweden
| | - B. M. Golam Kibria
- Department of Mathematics and Statistics, Florida International University, Miami, Florida, USA
| | - Mohamed R. Abonazel
- Department of Applied Statistics and Econometrics, Faculty of Graduate Studies for Statistical Research, Cairo University, Giza, Egypt
| |
Collapse
|