1
|
Mehmood K, Anees SA, Muhammad S, Shahzad F, Liu Q, Khan WR, Shrahili M, Ansari MJ, Dube T. Machine Learning and Spatio Temporal Analysis for Assessing Ecological Impacts of the Billion Tree Afforestation Project. Ecol Evol 2025; 15:e70736. [PMID: 39975709 PMCID: PMC11839268 DOI: 10.1002/ece3.70736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2024] [Revised: 11/28/2024] [Accepted: 12/03/2024] [Indexed: 02/21/2025] Open
Abstract
This study evaluates the Billion Tree Afforestation Project (BTAP) in Pakistan's Khyber Pakhtunkhwa (KPK) province using remote sensing and machine learning. Applying Random Forest (RF) classification to Sentinel-2 imagery, we observed an increase in tree cover from 25.02% in 2015 to 29.99% in 2023 and a decrease in barren land from 20.64% to 16.81%, with an accuracy above 85%. Hotspot and spatial clustering analyses revealed significant vegetation recovery, with high-confidence hotspots rising from 36.76% to 42.56%. A predictive model for the Normalized Difference Vegetation Index (NDVI), supported by SHAP analysis, identified soil moisture and precipitation as primary drivers of vegetation growth, with the ANN model achieving an R 2 of 0.8556 and an RMSE of 0.0607 on the testing dataset. These results demonstrate the effectiveness of integrating machine learning with remote sensing as a framework to support data-driven afforestation efforts and inform sustainable environmental management practices.
Collapse
Affiliation(s)
- Kaleem Mehmood
- College of ForestryBeijing Forestry UniversityBeijingChina
- Key Laboratory for Silviculture and Conservation of Ministry of EducationBeijing Forestry UniversityBeijingChina
- Institute of Forest ScienceUniversity of SwatSwatPakistan
| | - Shoaib Ahmad Anees
- Department of ForestryThe University of AgricultureDera Ismail KhanPakistan
| | | | - Fahad Shahzad
- Precision Forestry Key Laboratory of BeijingBeijing Forestry UniversityBeijingChina
| | - Qijing Liu
- College of ForestryBeijing Forestry UniversityBeijingChina
- Key Laboratory for Silviculture and Conservation of Ministry of EducationBeijing Forestry UniversityBeijingChina
| | - Waseem Razzaq Khan
- Department of Forestry Science and Biodiversity, Faculty of Forestry and EnvironmentUniversiti Putra MalaysiaSerdangMalaysia
| | - Mansour Shrahili
- Department of Statistics and Operations Research, College of ScienceKing Saud UniversityRiyadhSaudi Arabia
| | - Mohammad Javed Ansari
- Department of BotanyHindu College Moradabad (Mahatma Jyotiba Phule Rohilkhand University Bareilly)MoradabadIndia
| | - Timothy Dube
- Institute for Water Studies, Faculty of ScienceUniversity of the Western CapeCape TownSouth Africa
| |
Collapse
|
2
|
Ghofrani A, Taherdoost H. Biomedical data analytics for better patient outcomes. Drug Discov Today 2025; 30:104280. [PMID: 39732322 DOI: 10.1016/j.drudis.2024.104280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Revised: 12/16/2024] [Accepted: 12/20/2024] [Indexed: 12/30/2024]
Abstract
Medical professionals today have access to immense amounts of data, which enables them to make decisions that enhance patient care and treatment efficacy. This innovative strategy can improve global health care by bridging the divide between clinical practice and medical research. This paper reviews biomedical developments aimed at improving patient outcomes by addressing three main questions regarding techniques, data sources and challenges. The review includes peer-reviewed articles from 2018 to 2023, found via systematic searches in PubMed, Scopus and Google Scholar. The results show diverse disease-specific applications. Challenges such as data quality and ethics are discussed, underscoring data analytics' potential for patient-focused health care. The review concludes that successful implementation requires addressing gaps, collaboration and innovation in biomedical science and data analytics.
Collapse
Affiliation(s)
| | - Hamed Taherdoost
- Hamta Business Corporation, Vancouver, Canada; University Canada West, Vancouver, Canada; Westcliff University, Irvine, USA; GUS Institute | Global University Systems, London, UK.
| |
Collapse
|
3
|
Huang C, Zhuo J, Liu C, Wu S, Zhu J, Chen T, Zhang B, Feng S, Zhou C, Wang Z, Huang S, Chen L, Xinli Zhan. Development and validation of a diagnostic model to differentiate spinal tuberculosis from pyogenic spondylitis by combining multiple machine learning algorithms. BIOMOLECULES & BIOMEDICINE 2024; 24:401-410. [PMID: 37897663 PMCID: PMC10950342 DOI: 10.17305/bb.2023.9663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 10/19/2023] [Accepted: 10/27/2023] [Indexed: 10/30/2023]
Abstract
This study focused on the development and validation of a diagnostic model to differentiate between spinal tuberculosis (STB) and pyogenic spondylitis (PS). We analyzed a total of 387 confirmed cases, out of which 241 were diagnosed with STB and 146 were diagnosed with PS. These cases were randomly divided into a training group (n = 271) and a validation group (n = 116). Within the training group, four machine learning (ML) algorithms (least absolute shrinkage and selection operator [LASSO], logistic regression analysis, random forest, and support vector machine recursive feature elimination [SVM-RFE]) were employed to identify distinctive variables. These specific variables were then utilized to construct a diagnostic model. The model's performance was subsequently assessed using the receiver operating characteristic (ROC) curves and the calibration curves. Finally, internal validation of the model was undertaken in the validation group. Our findings indicate that PS patients had an average platelet-to-neutrophil ratio (PNR) of 277.86, which was significantly higher than the STB patients' average of 69.88. The average age of PS patients was 54.71 years, older than the 48 years recorded for STB patients. Notably, the neutrophil-to-lymphocyte ratio (NLR) was higher in PS patients at 6.15, compared to the 3.46 NLR in STB patients. Additionally, the platelet volume distribution width (PDW) in PS patients was 0.2, compared to 0.15 in STB patients. Conversely, the mean platelet volume (MPV) was lower in PS patients at an average of 4.41, whereas STB patients averaged 8.31. Hemoglobin (HGB) levels were lower in PS patients at an average of 113.31 compared to STB patients' average of 121.64. Furthermore, the average red blood cell (RBC) count was 4.26 in PS patients, which was less than the 4.58 average observed in STB patients. After evaluation, seven key factors were identified using the four ML algorithms, forming the basis of our diagnostic model. The training and validation groups yielded area under the curve (AUC) values of 0.841 and 0.83, respectively. The calibration curves demonstrated a high alignment between the nomogram-predicted values and the actual measurements. The decision curve indicated optimal model performance with a threshold set between 2% and 88%. In conclusion, our model offers healthcare practitioners a reliable tool to efficiently and precisely differentiate between STB and PS, thereby facilitating swift and accurate diagnoses.
Collapse
Affiliation(s)
- Chengqian Huang
- Department of Spine and Osteopathy Ward, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Jing Zhuo
- Surgical Operation Department, Baise People’s Hospital, Affiliated Southwest Hospital of Youjiang Medical University for Nationalities, Baise, China
| | - Chong Liu
- Department of Spine and Osteopathy Ward, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Shaofeng Wu
- Department of Spine and Osteopathy Ward, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Jichong Zhu
- Department of Spine and Osteopathy Ward, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Tianyou Chen
- Department of Spine and Osteopathy Ward, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Bin Zhang
- Department of Spine and Osteopathy Ward, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Sitan Feng
- Department of Spine and Osteopathy Ward, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Chenxing Zhou
- Department of Spine and Osteopathy Ward, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Zequn Wang
- Department of Spine and Osteopathy Ward, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Shengsheng Huang
- Department of Spine and Osteopathy Ward, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Liyi Chen
- Department of Spine and Osteopathy Ward, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| | - Xinli Zhan
- Department of Spine and Osteopathy Ward, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
| |
Collapse
|
4
|
Swain SS, Khura TK, Sahoo PK, Chobhe KA, Al-Ansari N, Kushwaha HL, Kushwaha NL, Panda KC, Lande SD, Singh C. Proportional impact prediction model of coating material on nitrate leaching of slow-release Urea Super Granules (USG) using machine learning and RSM technique. Sci Rep 2024; 14:3053. [PMID: 38321086 PMCID: PMC10847469 DOI: 10.1038/s41598-024-53410-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Accepted: 01/31/2024] [Indexed: 02/08/2024] Open
Abstract
An accurate assessment of nitrate leaching is important for efficient fertiliser utilisation and groundwater pollution reduction. However, past studies could not efficiently model nitrate leaching due to utilisation of conventional algorithms. To address the issue, the current research employed advanced machine learning algorithms, viz., Support Vector Machine, Artificial Neural Network, Random Forest, M5 Tree (M5P), Reduced Error Pruning Tree (REPTree) and Response Surface Methodology (RSM) to predict and optimize nitrate leaching. In this study, Urea Super Granules (USG) with three different coatings were used for the experiment in the soil columns, containing 1 kg soil with fertiliser placed in between. Statistical parameters, namely correlation coefficient, Mean Absolute Error, Willmott index, Root Mean Square Error and Nash-Sutcliffe efficiency were used to evaluate the performance of the ML techniques. In addition, a comparison was made in the test set among the machine learning models in which, RSM outperformed the rest of the models irrespective of coating type. Neem oil/ Acacia oil(ml): clay/sulfer (g): age (days) for minimum nitrate leaching was found to be 2.61: 1.67: 2.4 for coating of USG with bentonite clay and neem oil without heating, 2.18: 2: 1 for bentonite clay and neem oil with heating and 1.69: 1.64: 2.18 for coating USG with sulfer and acacia oil. The research would provide guidelines to researchers and policymakers to select the appropriate tool for precise prediction of nitrate leaching, which would optimise the yield and the benefit-cost ratio.
Collapse
Affiliation(s)
- Sidhartha Sekhar Swain
- Division of Agricultural Engineering, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India
| | - Tapan Kumar Khura
- Division of Agricultural Engineering, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India
| | - Pramod Kumar Sahoo
- Division of Agricultural Engineering, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India
| | - Kapil Atmaram Chobhe
- Division of Soil Science and Agricultural Chemistry, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India
| | - Nadhir Al-Ansari
- Department of Civil, Environmental and Natural Resources Engineering, Lulea University of Technology, 97187, Lulea, Sweden.
| | - Hari Lal Kushwaha
- Division of Agricultural Engineering, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India
| | - Nand Lal Kushwaha
- Division of Agricultural Engineering, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India
| | - Kanhu Charan Panda
- Department of Soil Conservation, National PG College (Barhalganj), DDU Gorakhpur University, Gorakhpur, UP, 273402, India
| | - Satish Devram Lande
- Division of Agricultural Engineering, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India
| | - Chandu Singh
- Division of Genetics, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India
| |
Collapse
|
5
|
Cohen S, Lior E, Bocher M, Rokach L. Improving severity classification of Hebrew PET-CT pathology reports using test-time augmentation. J Biomed Inform 2024; 149:104577. [PMID: 38101689 DOI: 10.1016/j.jbi.2023.104577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Revised: 12/08/2023] [Accepted: 12/10/2023] [Indexed: 12/17/2023]
Abstract
Classifying medical reports written in Hebrew is challenging due to the ambiguity and complexity of the language. This study proposes Text Test Time Augmentation (TTTA), a novel method to improve the classification accuracy of cancer severity levels from PET-CT diagnostic reports in Hebrew. Hebrew, being a morphologically rich language, often leads to each word having multiple ambiguous interpretations. TTTA leverages test-time augmentation to enhance text information retrieval and model robustness. During training and testing phases, this method generates and evaluates sets of augmentations to enhance the semantics extracted from each report. Experiments utilize a large institutional report repository from Ziv hospital, Israel, where physicians manually labeled the reports. The results demonstrate that the proposed TTTA approach achieves superior performance over baseline models without TTA, improving PR-AUC by 15.18% on classifying cancer severity levels. The study highlights the efficacy of TTTA in extracting essential medical concepts from free text reports and accurately classifying the severity of cancer. The approach addresses the limitations of prior methods and contributes towards improved automated analysis of Hebrew medical reports. TTTA has the potential to assist physicians in cancer diagnosis and treatment planning.
Collapse
Affiliation(s)
- Seffi Cohen
- Department of Software and Information Systems Engineering, Ben Gurion University, Beer-Sheva, 8410501, Israel.
| | - Edo Lior
- Department of Software and Information Systems Engineering, Ben Gurion University, Beer-Sheva, 8410501, Israel
| | - Moshe Bocher
- Ziv Hospital, Zfat, 13200, Israel; Faculty of medicine, Bar-Ilan University, Ramat Gan, 5290002, Israel
| | - Lior Rokach
- Department of Software and Information Systems Engineering, Ben Gurion University, Beer-Sheva, 8410501, Israel
| |
Collapse
|
6
|
Castel-Feced S, Malo S, Aguilar-Palacio I, Feja-Solana C, Casasnovas JA, Maldonado L, Rabanaque-Hernández MJ. Influence of cardiovascular risk factors and treatment exposure on cardiovascular event incidence: Assessment using machine learning algorithms. PLoS One 2023; 18:e0293759. [PMID: 37971977 PMCID: PMC10653526 DOI: 10.1371/journal.pone.0293759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 10/19/2023] [Indexed: 11/19/2023] Open
Abstract
Assessment of the influence of cardiovascular risk factors (CVRF) on cardiovascular event (CVE) using machine learning algorithms offers some advantages over preexisting scoring systems, and better enables personalized medicine approaches to cardiovascular prevention. Using data from four different sources, we evaluated the outcomes of three machine learning algorithms for CVE prediction using different combinations of predictive variables and analysed the influence of different CVRF-related variables on CVE prediction when included in these algorithms. A cohort study based on a male cohort of workers applying populational data was conducted. The population of the study consisted of 3746 males. For descriptive analyses, mean and standard deviation were used for quantitative variables, and percentages for categorical ones. Machine learning algorithms used were XGBoost, Random Forest and Naïve Bayes (NB). They were applied to two groups of variables: i) age, physical status, Hypercholesterolemia (HC), Hypertension, and Diabetes Mellitus (DM) and ii) these variables plus treatment exposure, based on the adherence to the treatment for DM, hypertension and HC. All methods point out to the age as the most influential variable in the incidence of a CVE. When considering treatment exposure, it was more influential than any other CVRF, which changed its influence depending on the model and algorithm applied. According to the performance of the algorithms, the most accurate was Random Forest when treatment exposure was considered (F1 score 0.84), followed by XGBoost. Adherence to treatment showed to be an important variable in the risk of having a CVE. These algorithms could be applied to create models for every population, and they can be used in primary care to manage interventions personalized for every subject.
Collapse
Affiliation(s)
- Sara Castel-Feced
- Microbiology, Pediatrics, Radiology, and Public Health, University of Zaragoza, Zaragoza, Spain
- Fundación Instituto de Investigación Sanitaria de Aragón (IIS Aragón), Zaragoza, Spain
- GRISSA Research Group, Zaragoza, Spain
| | - Sara Malo
- Microbiology, Pediatrics, Radiology, and Public Health, University of Zaragoza, Zaragoza, Spain
- Fundación Instituto de Investigación Sanitaria de Aragón (IIS Aragón), Zaragoza, Spain
- GRISSA Research Group, Zaragoza, Spain
| | - Isabel Aguilar-Palacio
- Microbiology, Pediatrics, Radiology, and Public Health, University of Zaragoza, Zaragoza, Spain
- Fundación Instituto de Investigación Sanitaria de Aragón (IIS Aragón), Zaragoza, Spain
- GRISSA Research Group, Zaragoza, Spain
| | - Cristina Feja-Solana
- Fundación Instituto de Investigación Sanitaria de Aragón (IIS Aragón), Zaragoza, Spain
- GRISSA Research Group, Zaragoza, Spain
- Directorate of Public Health, Government of Aragon, Zaragoza, Spain
| | - José Antonio Casasnovas
- Hospital Universitario Miguel Servet, Instituto de Investigación Sanitaria Aragón (IIS Aragón), CIBERCV, Zaragoza, Spain
- Department of Medicine, Psychiatry and Dermatology, University of Zaragoza, Zaragoza, Spain
| | - Lina Maldonado
- Fundación Instituto de Investigación Sanitaria de Aragón (IIS Aragón), Zaragoza, Spain
- GRISSA Research Group, Zaragoza, Spain
- Department of Applied Economic, University of Zaragoza, Zaragoza, Spain
| | - María José Rabanaque-Hernández
- Microbiology, Pediatrics, Radiology, and Public Health, University of Zaragoza, Zaragoza, Spain
- Fundación Instituto de Investigación Sanitaria de Aragón (IIS Aragón), Zaragoza, Spain
- GRISSA Research Group, Zaragoza, Spain
| |
Collapse
|
7
|
Srinivasan S, Gunasekaran S, Mathivanan SK, M B BAM, Jayagopal P, Dalu GT. An active learning machine technique based prediction of cardiovascular heart disease from UCI-repository database. Sci Rep 2023; 13:13588. [PMID: 37604952 PMCID: PMC10442398 DOI: 10.1038/s41598-023-40717-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2022] [Accepted: 08/16/2023] [Indexed: 08/23/2023] Open
Abstract
Heart disease is a significant global cause of mortality, and predicting it through clinical data analysis poses challenges. Machine learning (ML) has emerged as a valuable tool for diagnosing and predicting heart disease by analyzing healthcare data. Previous studies have extensively employed ML techniques in medical research for heart disease prediction. In this study, eight ML classifiers were utilized to identify crucial features that enhance the accuracy of heart disease prediction. Various combinations of features and well-known classification algorithms were employed to develop the prediction model. Neural network models, such as Naïve Bayes and Radial Basis Functions, were implemented, achieving accuracies of 94.78% and 90.78% respectively in heart disease prediction. Among the state-of-the-art methods for cardiovascular problem prediction, Learning Vector Quantization exhibited the highest accuracy rate of 98.7%. The motivation behind predicting Cardiovascular Heart Disease lies in its potential to save lives, improves health outcomes, and allocates healthcare resources efficiently. The key contributions encompass early intervention, personalized medicine, technological advancements, the impact on public health, and ongoing research, all of which collectively work toward reducing the burden of CHD on both individual patients and society as a whole.
Collapse
Affiliation(s)
- Saravanan Srinivasan
- Department of Computer Science and Engineering, Vel Tech Rangarajan Dr.Sagunthala R&D Institute of Science and Technology, Avadi, Chennai, India
| | - Subathra Gunasekaran
- Department of Computer Science and Engineering, Sathyabama Institute of Science and Technology, Chennai, India
| | | | - Benjula Anbu Malar M B
- School of Computer Science Engineering and Information Systems, Vellore Institute of Technology, Vellore, 632014, Tamil Nadu, India
| | - Prabhu Jayagopal
- School of Computer Science Engineering and Information Systems, Vellore Institute of Technology, Vellore, 632014, Tamil Nadu, India
| | - Gemmachis Teshite Dalu
- Department of Software Engineering, College of Computing and Informatics, Haramaya University, POB 138, Dire Dawa, Ethiopia.
| |
Collapse
|
8
|
Intelligence and Neuroscience C. Retracted: Clinical Data Analysis for Prediction of Cardiovascular Disease Using Machine Learning Techniques. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2023; 2023:9815067. [PMID: 37622062 PMCID: PMC10447058 DOI: 10.1155/2023/9815067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Accepted: 08/15/2023] [Indexed: 08/26/2023]
Abstract
[This retracts the article DOI: 10.1155/2022/2973324.].
Collapse
|
9
|
Saputra J, Lawrencya C, Saini JM, Suharjito S. Hyperparameter optimization for cardiovascular disease data-driven prognostic system. Vis Comput Ind Biomed Art 2023; 6:16. [PMID: 37524951 PMCID: PMC10390457 DOI: 10.1186/s42492-023-00143-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Accepted: 07/04/2023] [Indexed: 08/02/2023] Open
Abstract
Prediction and diagnosis of cardiovascular diseases (CVDs) based, among other things, on medical examinations and patient symptoms are the biggest challenges in medicine. About 17.9 million people die from CVDs annually, accounting for 31% of all deaths worldwide. With a timely prognosis and thorough consideration of the patient's medical history and lifestyle, it is possible to predict CVDs and take preventive measures to eliminate or control this life-threatening disease. In this study, we used various patient datasets from a major hospital in the United States as prognostic factors for CVD. The data was obtained by monitoring a total of 918 patients whose criteria for adults were 28-77 years old. In this study, we present a data mining modeling approach to analyze the performance, classification accuracy and number of clusters on Cardiovascular Disease Prognostic datasets in unsupervised machine learning (ML) using the Orange data mining software. Various techniques are then used to classify the model parameters, such as k-nearest neighbors, support vector machine, random forest, artificial neural network (ANN), naïve bayes, logistic regression, stochastic gradient descent (SGD), and AdaBoost. To determine the number of clusters, various unsupervised ML clustering methods were used, such as k-means, hierarchical, and density-based spatial clustering of applications with noise clustering. The results showed that the best model performance analysis and classification accuracy were SGD and ANN, both of which had a high score of 0.900 on Cardiovascular Disease Prognostic datasets. Based on the results of most clustering methods, such as k-means and hierarchical clustering, Cardiovascular Disease Prognostic datasets can be divided into two clusters. The prognostic accuracy of CVD depends on the accuracy of the proposed model in determining the diagnostic model. The more accurate the model, the better it can predict which patients are at risk for CVD.
Collapse
Affiliation(s)
- Jayson Saputra
- Industrial Engineering Department, BINUS Graduate Program - Master of Industrial Engineering, Bina Nusantara University, Jakarta 11480, Indonesia
| | - Cindy Lawrencya
- Industrial Engineering Department, BINUS Graduate Program - Master of Industrial Engineering, Bina Nusantara University, Jakarta 11480, Indonesia
| | - Jecky Mitra Saini
- Industrial Engineering Department, BINUS Graduate Program - Master of Industrial Engineering, Bina Nusantara University, Jakarta 11480, Indonesia
| | - Suharjito Suharjito
- Industrial Engineering Department, BINUS Graduate Program - Master of Industrial Engineering, Bina Nusantara University, Jakarta 11480, Indonesia
| |
Collapse
|
10
|
Nancy AA, Ravindran D, Vincent DR, Srinivasan K, Chang CY. Fog-Based Smart Cardiovascular Disease Prediction System Powered by Modified Gated Recurrent Unit. Diagnostics (Basel) 2023; 13:2071. [PMID: 37370966 DOI: 10.3390/diagnostics13122071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 04/25/2023] [Accepted: 06/09/2023] [Indexed: 06/29/2023] Open
Abstract
The ongoing fast-paced technology trend has brought forth ceaseless transformation. In this regard, cloud computing has long proven to be the paramount deliverer of services such as computing power, software, networking, storage, and databases on a pay-per-use basis. The cloud is a big proponent of the internet of things (IoT), furnishing the computation and storage requisite to address internet-of-things applications. With the proliferating IoT devices triggering a continual data upsurge, the cloud-IoT interaction encounters latency, bandwidth, and connectivity restraints. The inclusion of the decentralized and distributed fog computing layer amidst the cloud and IoT layer extends the cloud's processing, storage, and networking services close to end users. This hierarchical edge-fog-cloud model distributes computation and intelligence, yielding optimal solutions while tackling constraints like massive data volume, latency, delay, and security vulnerability. The healthcare domain, warranting time-critical functionalities, can reap benefits from the cloud-fog-IoT interplay. This research paper propounded a fog-assisted smart healthcare system to diagnose heart or cardiovascular disease. It combined a fuzzy inference system (FIS) with the recurrent neural network model's variant of the gated recurrent unit (GRU) for pre-processing and predictive analytics tasks. The proposed system showcases substantially improved performance results, with classification accuracy at 99.125%. With major processing of healthcare data analytics happening at the fog layer, it is observed that the proposed work reveals optimized results concerning delays in terms of latency, response time, and jitter, compared to the cloud. Deep learning models are adept at handling sophisticated tasks, particularly predictive analytics. Time-critical healthcare applications reap benefits from deep learning's exclusive potential to furnish near-perfect results, coupled with the merits of the decentralized fog model, as revealed by the experimental results.
Collapse
Affiliation(s)
- A Angel Nancy
- Department of Computer Science, St. Joseph's College (Autonomous), Bharathidasan University, Tiruchirappalli 620002, India
| | - Dakshanamoorthy Ravindran
- Department of Computer Science, St. Joseph's College (Autonomous), Bharathidasan University, Tiruchirappalli 620002, India
| | - Durai Raj Vincent
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore 632014, India
| | - Kathiravan Srinivasan
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632014, India
| | - Chuan-Yu Chang
- Department of Computer Science and Information Engineering, National Yunlin University of Science and Technology, Douliu City 64002, Taiwan
- Service Systems Technology Center, Industrial Technology Research Institute, Hsinchu 310401, Taiwan
| |
Collapse
|
11
|
Influence of Optimal Hyperparameters on the Performance of Machine Learning Algorithms for Predicting Heart Disease. Processes (Basel) 2023. [DOI: 10.3390/pr11030734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023] Open
Abstract
One of the most difficult challenges in medicine is predicting heart disease at an early stage. In this study, six machine learning (ML) algorithms, viz., logistic regression, K-nearest neighbor, support vector machine, decision tree, random forest classifier, and extreme gradient boosting, were used to analyze two heart disease datasets. One dataset was UCI Kaggle Cleveland and the other was the comprehensive UCI Kaggle Cleveland, Hungary, Switzerland, and Long Beach V. The performance results of the machine learning techniques were obtained. The support vector machine with tuned hyperparameters achieved the highest testing accuracy of 87.91% for dataset-I and the extreme gradient boosting classifier with tuned hyperparameters achieved the highest testing accuracy of 99.03% for the comprehensive dataset-II. The novelty of this work was the use of grid search cross-validation to enhance the performance in the form of training and testing. The ideal parameters for predicting heart disease were identified through experimental results. Comparative studies were also carried out with the existing studies focusing on the prediction of heart disease, where the approach used in this work significantly outperformed their results.
Collapse
|
12
|
Rana M, Bhushan M. Machine learning and deep learning approach for medical image analysis: diagnosis to detection. MULTIMEDIA TOOLS AND APPLICATIONS 2022; 82:1-39. [PMID: 36588765 PMCID: PMC9788870 DOI: 10.1007/s11042-022-14305-w] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 11/01/2022] [Accepted: 12/10/2022] [Indexed: 06/17/2023]
Abstract
Computer-aided detection using Deep Learning (DL) and Machine Learning (ML) shows tremendous growth in the medical field. Medical images are considered as the actual origin of appropriate information required for diagnosis of disease. Detection of disease at the initial stage, using various modalities, is one of the most important factors to decrease mortality rate occurring due to cancer and tumors. Modalities help radiologists and doctors to study the internal structure of the detected disease for retrieving the required features. ML has limitations with the present modalities due to large amounts of data, whereas DL works efficiently with any amount of data. Hence, DL is considered as the enhanced technique of ML where ML uses the learning techniques and DL acquires details on how machines should react around people. DL uses a multilayered neural network to get more information about the used datasets. This study aims to present a systematic literature review related to applications of ML and DL for the detection along with classification of multiple diseases. A detailed analysis of 40 primary studies acquired from the well-known journals and conferences between Jan 2014-2022 was done. It provides an overview of different approaches based on ML and DL for the detection along with the classification of multiple diseases, modalities for medical imaging, tools and techniques used for the evaluation, description of datasets. Further, experiments are performed using MRI dataset to provide a comparative analysis of ML classifiers and DL models. This study will assist the healthcare community by enabling medical practitioners and researchers to choose an appropriate diagnosis technique for a given disease with reduced time and high accuracy.
Collapse
Affiliation(s)
- Meghavi Rana
- School of Computing, DIT University, Dehradun, India
| | - Megha Bhushan
- School of Computing, DIT University, Dehradun, India
| |
Collapse
|
13
|
Hassan CAU, Iqbal J, Irfan R, Hussain S, Algarni AD, Bukhari SSH, Alturki N, Ullah SS. Effectively Predicting the Presence of Coronary Heart Disease Using Machine Learning Classifiers. SENSORS (BASEL, SWITZERLAND) 2022; 22:7227. [PMID: 36236325 PMCID: PMC9573101 DOI: 10.3390/s22197227] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Revised: 06/03/2022] [Accepted: 07/27/2022] [Indexed: 06/16/2023]
Abstract
Coronary heart disease is one of the major causes of deaths around the globe. Predicating a heart disease is one of the most challenging tasks in the field of clinical data analysis. Machine learning (ML) is useful in diagnostic assistance in terms of decision making and prediction on the basis of the data produced by healthcare sector globally. We have also perceived ML techniques employed in the medical field of disease prediction. In this regard, numerous research studies have been shown on heart disease prediction using an ML classifier. In this paper, we used eleven ML classifiers to identify key features, which improved the predictability of heart disease. To introduce the prediction model, various feature combinations and well-known classification algorithms were used. We achieved 95% accuracy with gradient boosted trees and multilayer perceptron in the heart disease prediction model. The Random Forest gives a better performance level in heart disease prediction, with an accuracy level of 96%.
Collapse
Affiliation(s)
- Ch. Anwar ul Hassan
- Department of Creative Technologies, Air University Islamabad, Islamabad 44000, Pakistan
| | - Jawaid Iqbal
- Department of Computer Science, Capital University of Science and Technology, Islamabad 44000, Pakistan
| | - Rizwana Irfan
- Department of Computer Science, University of Jeddah, P.O. Box 123456, Jeddah 21959, Saudi Arabia
| | - Saddam Hussain
- School of Digital Science, Universiti Brunei Darussalam, Jalan Tungku Link, Gadong BE1410, Brunei
| | - Abeer D. Algarni
- Department of Information Technology, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
| | | | - Nazik Alturki
- Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
| | - Syed Sajid Ullah
- Department of Information and Communication Technology, University of Agder (UiA), N-4898 Grimstad, Norway
| |
Collapse
|