1
|
Nair AR, Rajaguru H, Karthika MS, Keerthivasan C. Metaheuristic integrated machine learning classification of colon cancer using STFT LASSO and EHO feature extraction from microarray gene expressions. Sci Rep 2024; 14:16485. [PMID: 39019906 PMCID: PMC11255302 DOI: 10.1038/s41598-024-67135-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Accepted: 07/08/2024] [Indexed: 07/19/2024] Open
Abstract
The microarray gene expression data poses a tremendous challenge due to their curse of dimensionality problem. The sheer volume of features far surpasses available samples, leading to overfitting and reduced classification accuracy. Thus the dimensionality of microarray gene expression data must be reduced with efficient feature extraction methods to reduce the volume of data and extract meaningful information to enhance the classification accuracy and interpretability. In this research, we discover the uniqueness of applying STFT (Short Term Fourier Transform), LASSO (Least Absolute Shrinkage and Selection Operator), and EHO (Elephant Herding Optimisation) for extracting significant features from lung cancer and reducing the dimensionality of the microarray gene expression database. The classification of lung cancer is performed using the following classifiers: Gaussian Mixture Model (GMM), Particle Swarm Optimization (PSO) with GMM, Detrended Fluctuation Analysis (DFA), Naive Bayes classifier (NBC), Firefly with GMM, Support Vector Machine with Radial Basis Kernel (SVM-RBF) and Flower Pollination Optimization (FPO) with GMM. The EHO feature extraction with the FPO-GMM classifier attained the highest accuracy in the range of 96.77, with an F1 score of 97.5, MCC of 0.92 and Kappa of 0.92. The reported results underline the significance of utilizing STFT, LASSO, and EHO for feature extraction in reducing the dimensionality of microarray gene expression data. These methodologies also help in improved and early diagnosis of lung cancer with enhanced classification accuracy and interpretability.
Collapse
Affiliation(s)
- Ajin R Nair
- Department of Electronics and Communication Engineering, Bannari Amman Institute of Technology, Sathyamangalam, India.
- Bannari Amman Institute of Technology, Sathyamangalam, India.
| | - Harikumar Rajaguru
- Department of Electronics and Communication Engineering, Bannari Amman Institute of Technology, Sathyamangalam, India
- Bannari Amman Institute of Technology, Sathyamangalam, India
| | - M S Karthika
- Department of Information Technology, Bannari Amman Institute of Technology, Sathyamangalam, India
- Bannari Amman Institute of Technology, Sathyamangalam, India
| | | |
Collapse
|
2
|
Khan A, Hussain S, Iyer JK, Kaul A, Bonnewitz M, Kaul R. Human papillomavirus-mediated expression of complement regulatory proteins in human cervical cancer cells. Eur J Obstet Gynecol Reprod Biol 2023; 288:222-228. [PMID: 37572452 DOI: 10.1016/j.ejogrb.2023.07.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 07/18/2023] [Accepted: 07/24/2023] [Indexed: 08/14/2023]
Abstract
OBJECTIVES This study aimed to evaluate the expression pattern of complement regulatory proteins (CRPs) CD46, CD59, and CD55 in HPV-positive (HPV+) & negative (HPV-) cervical cancer cell lines in search of a reliable differential biomarker. STUDY DESIGN We analysed the expression of CRPs in HPV 16-positive SiHa cell line, HPV 18-positive HeLa cell line, and HPV-negative cell line C33a using RT-qPCR, Western blotting, flow cytometry, and confocal microscopy. RESULTS We observed a differential expression profile of CRPs in HPV+ and HPV- cervical cancer cell lines. The mRNA level of CD59 & CD55 showed a higher expression pattern in HPV+ cells when compared to HPV- cancer cells. However, flow cytometry-based experiments revealed that CD46 was preferentially expressed more in HPV 16-positive SiHa cells followed by HPV 18-positive HeLa cells when compared to HPV- C33a cells. Interestingly, confocal microscopy revealed a high level of CD59 expression in Hela cells and SiHa cells but low expression in HPV- C33a cells. In addition, HPV 18-positive HeLa cells expressed more CD55, which was lower in SiHa cells and very weak in C33a cells. CONCLUSION The study demonstrates the differential expression of CRPs in both HPV+ and HPV- cervical cancer cells for the first time, and their potential to serve as an early diagnostic marker for cervical carcinogenesis.
Collapse
Affiliation(s)
- Asiya Khan
- Dr. Babasaheb R. Ambedkar Institute Rotary Cancer Hospital, All India Institute of Medical Sciences, New Delhi 110029, India; Amity Institute of Biotechnology, Amity University, Noida 201303, India
| | - Showket Hussain
- Division of Molecular Oncology & Molecular Diagnostics, Indian Council of Medical Research-National Institute of Cancer Prevention and Research, Noida 201301, India
| | - Janaki K Iyer
- Department of Biochemistry and Microbiology, Oklahoma State University Centre for Health Sciences, 1111 West 17(th) Street, Tulsa, OK 74107, USA; Department of Natural Sciences, Northeastern State University, Broken Arrow, OK 74014, USA
| | - Anil Kaul
- Health Care Administration, Oklahoma State University Centre for Health Sciences, Tulsa, OK 74107, USA
| | - Mackenzie Bonnewitz
- Department of Natural Sciences, Northeastern State University, Broken Arrow, OK 74014, USA
| | - Rashmi Kaul
- Department of Biochemistry and Microbiology, Oklahoma State University Centre for Health Sciences, 1111 West 17(th) Street, Tulsa, OK 74107, USA.
| |
Collapse
|
3
|
Chen SM, Zhang JH. Genetic Algorithm in Data Mining of Colorectal Images. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2021; 2021:3854518. [PMID: 34691237 PMCID: PMC8536457 DOI: 10.1155/2021/3854518] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/29/2021] [Accepted: 09/25/2021] [Indexed: 11/17/2022]
Abstract
There is currently no effective analytical method in colorectal image analysis, which leads to certain errors in colorectal image analysis. In order to improve the accuracy of colorectal imaging detection, this study used a genetic algorithm as the data mining algorithm and combined it with image processing technology to perform image analysis. At the same time, combined with the actual requirements of image detection, the gray theory model is used as the basic theory of image processing, and the image detection prediction model is constructed to predict the data. In addition, in order to study the effectiveness of the algorithm, the experiment is carried out to analyze the validity of the data of the study, and the predicted value is compared with the actual value. The research shows that the proposed algorithm has certain accuracy and can provide theoretical reference for subsequent related research.
Collapse
Affiliation(s)
- Shou-Ming Chen
- Department of Radiology, The Affiliated Hospital of Panzhihua University, Panzhihua, Sichuan 617000, China
| | - Jun-Hui Zhang
- Department of Medical Imaging, Baoji People's Hospital, Baoji, Shaanxi 721000, China
| |
Collapse
|
4
|
Malik A, Thanekar U, Amarachintha S, Mourya R, Nalluri S, Bondoc A, Shivakumar P. "Complimenting the Complement": Mechanistic Insights and Opportunities for Therapeutics in Hepatocellular Carcinoma. Front Oncol 2021; 10:627701. [PMID: 33718121 PMCID: PMC7943925 DOI: 10.3389/fonc.2020.627701] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Accepted: 12/22/2020] [Indexed: 12/15/2022] Open
Abstract
Hepatocellular carcinoma (HCC) is the most common primary malignancy of the liver and a leading cause of death in the US and worldwide. HCC remains a global health problem and is highly aggressive with unfavorable prognosis. Even with surgical interventions and newer medical treatment regimens, patients with HCC have poor survival rates. These limited therapeutic strategies and mechanistic understandings of HCC immunopathogenesis urgently warrant non-palliative treatment measures. Irrespective of the multitude etiologies, the liver microenvironment in HCC is intricately associated with chronic necroinflammation, progressive fibrosis, and cirrhosis as precedent events along with dysregulated innate and adaptive immune responses. Central to these immunological networks is the complement cascade (CC), a fundamental defense system inherent to the liver which tightly regulates humoral and cellular responses to noxious stimuli. Importantly, the liver is the primary source for biosynthesis of >80% of complement components and expresses a variety of complement receptors. Recent studies implicate the complement system in liver inflammation, abnormal regenerative responses, fibrosis, carcinogenesis, and development of HCC. Although complement activation differentially promotes immunosuppressive, stimulant, and angiogenic microenvironments conducive to HCC development, it remains under-investigated. Here, we review derangement of specific complement proteins in HCC in the context of altered complement regulatory factors, immune-activating components, and their implications in disease pathogenesis. We also summarize how complement molecules regulate cancer stem cells (CSCs), interact with complement-coagulation cascades, and provide therapeutic opportunities for targeted intervention in HCC.
Collapse
Affiliation(s)
- Astha Malik
- Division of Gastroenterology, Hepatology and Nutrition, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, United States
| | - Unmesha Thanekar
- Division of Gastroenterology, Hepatology and Nutrition, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, United States
| | - Surya Amarachintha
- Division of Gastroenterology, Hepatology and Nutrition, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, United States
| | - Reena Mourya
- Division of Gastroenterology, Hepatology and Nutrition, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, United States
| | - Shreya Nalluri
- Division of Gastroenterology, Hepatology and Nutrition, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, United States
| | - Alexander Bondoc
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, United States
- Division of Pediatric General and Thoracic Surgery, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, United States
| | - Pranavkumar Shivakumar
- Division of Gastroenterology, Hepatology and Nutrition, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, United States
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, United States
| |
Collapse
|
5
|
Geller A, Yan J. The Role of Membrane Bound Complement Regulatory Proteins in Tumor Development and Cancer Immunotherapy. Front Immunol 2019; 10:1074. [PMID: 31164885 PMCID: PMC6536589 DOI: 10.3389/fimmu.2019.01074] [Citation(s) in RCA: 99] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2018] [Accepted: 04/26/2019] [Indexed: 12/17/2022] Open
Abstract
It has long been understood that the control and surveillance of tumors within the body involves an intricate dance between the adaptive and innate immune systems. At the center of the interplay between the adaptive and innate immune response sits the complement system—an evolutionarily ancient response that aids in the destruction of microorganisms and damaged cells, including cancer cells. Membrane-bound complement regulatory proteins (mCRPs), such as CD46, CD55, and CD59, are expressed throughout the body in order to prevent over-activation of the complement system. These mCRPs act as a double-edged sword however, as they can also over-regulate the complement system to the extent that it is no longer effective at eliminating cancerous cells. Recent studies are now indicating that mCRPs may function as a biomarker of a malignant transformation in numerous cancer types, and further, are being shown to interfere with anti-tumor treatments. This highlights the critical roles that therapeutic blockade of mCRPs can play in cancer treatment. Furthermore, with the complement system having the ability to both directly and indirectly control adaptive T-cell responses, the use of a combinatorial approach of complement-related therapy along with other T-cell activating therapies becomes a logical approach to treatment. This review will highlight the biomarker-related role that mCRP expression may have in the classification of tumor phenotype and predicted response to different anti-cancer treatments in the context of an emerging understanding that complement activation within the Tumor Microenvironment (TME) is actually harmful for tumor control. We will discuss what is known about complement activation and mCRPs relating to cancer and immunotherapy, and will examine the potential for combinatorial approaches of anti-mCRP therapy with other anti-tumor therapies, especially checkpoint inhibitors such as anti PD-1 and PD-L1 monoclonal antibodies (mAbs). Overall, mCRPs play an essential role in the immune response to tumors, and understanding their role in the immune response, particularly in modulating currently used cancer therapeutics may lead to better clinical outcomes in patients with diverse cancer types.
Collapse
Affiliation(s)
- Anne Geller
- Department of Microbiology and Immunology, University of Louisville School of Medicine, Louisville, KY, United States
| | - Jun Yan
- Immuno-Oncology Program, Department of Medicine, The James Graham Brown Cancer Center, University of Louisville School of Medicine, Louisville, KY, United States
| |
Collapse
|
6
|
Wang GP, Yang JX. SKICA: A feature extraction algorithm based on supervised ICA with kernel for anomaly detection. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2019. [DOI: 10.3233/jifs-17749] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Gui Ping Wang
- College of Information Science and Engineering, Chongqing Jiaotong University, Chongqing, China
| | - Jian Xi Yang
- College of Information Science and Engineering, Chongqing Jiaotong University, Chongqing, China
| |
Collapse
|
7
|
Meng Y, Liang J, Cao F, He Y. A new distance with derivative information for functional k-means clustering algorithm. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2018.06.035] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
|
8
|
|
9
|
Yazdani S, Shanbehzadeh J, Hadavandi E. MBCGP-FE: A modified balanced cartesian genetic programming feature extractor. Knowl Based Syst 2017. [DOI: 10.1016/j.knosys.2017.08.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
10
|
Attallah O, Karthikesalingam A, Holt PJ, Thompson MM, Sayers R, Bown MJ, Choke EC, Ma X. Using multiple classifiers for predicting the risk of endovascular aortic aneurysm repair re-intervention through hybrid feature selection. Proc Inst Mech Eng H 2017; 231:1048-1063. [PMID: 28925817 DOI: 10.1177/0954411917731592] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Feature selection is essential in medical area; however, its process becomes complicated with the presence of censoring which is the unique character of survival analysis. Most survival feature selection methods are based on Cox's proportional hazard model, though machine learning classifiers are preferred. They are less employed in survival analysis due to censoring which prevents them from directly being used to survival data. Among the few work that employed machine learning classifiers, partial logistic artificial neural network with auto-relevance determination is a well-known method that deals with censoring and perform feature selection for survival data. However, it depends on data replication to handle censoring which leads to unbalanced and biased prediction results especially in highly censored data. Other methods cannot deal with high censoring. Therefore, in this article, a new hybrid feature selection method is proposed which presents a solution to high level censoring. It combines support vector machine, neural network, and K-nearest neighbor classifiers using simple majority voting and a new weighted majority voting method based on survival metric to construct a multiple classifier system. The new hybrid feature selection process uses multiple classifier system as a wrapper method and merges it with iterated feature ranking filter method to further reduce features. Two endovascular aortic repair datasets containing 91% censored patients collected from two centers were used to construct a multicenter study to evaluate the performance of the proposed approach. The results showed the proposed technique outperformed individual classifiers and variable selection methods based on Cox's model such as Akaike and Bayesian information criterions and least absolute shrinkage and selector operator in p values of the log-rank test, sensitivity, and concordance index. This indicates that the proposed classifier is more powerful in correctly predicting the risk of re-intervention enabling doctor in selecting patients' future follow-up plan.
Collapse
Affiliation(s)
- Omneya Attallah
- 1 Department of Electronics and Communications, College of Engineering and Technology, Arab Academy for Science and Technology, Alexandria, Egypt.,2 School of Engineering and Applied Science, Aston University, Birmingham, UK
| | - Alan Karthikesalingam
- 3 St George's Vascular Institute, St George's University Hospitals NHS Foundation Trust, London, UK
| | - Peter Je Holt
- 3 St George's Vascular Institute, St George's University Hospitals NHS Foundation Trust, London, UK
| | - Matthew M Thompson
- 3 St George's Vascular Institute, St George's University Hospitals NHS Foundation Trust, London, UK
| | - Rob Sayers
- 4 NIHR Leicester Cardiovascular Biomedical Research Unit and Department of Cardiovascular Sciences, University of Leicester, Leicester, UK
| | - Matthew J Bown
- 4 NIHR Leicester Cardiovascular Biomedical Research Unit and Department of Cardiovascular Sciences, University of Leicester, Leicester, UK
| | - Eddie C Choke
- 4 NIHR Leicester Cardiovascular Biomedical Research Unit and Department of Cardiovascular Sciences, University of Leicester, Leicester, UK
| | - Xianghong Ma
- 2 School of Engineering and Applied Science, Aston University, Birmingham, UK
| |
Collapse
|
11
|
Attallah O, Karthikesalingam A, Holt PJE, Thompson MM, Sayers R, Bown MJ, Choke EC, Ma X. Feature selection through validation and un-censoring of endovascular repair survival data for predicting the risk of re-intervention. BMC Med Inform Decis Mak 2017; 17:115. [PMID: 28774329 PMCID: PMC5543447 DOI: 10.1186/s12911-017-0508-3] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2016] [Accepted: 07/24/2017] [Indexed: 12/25/2022] Open
Abstract
Background Feature selection (FS) process is essential in the medical area as it reduces the effort and time needed for physicians to measure unnecessary features. Choosing useful variables is a difficult task with the presence of censoring which is the unique characteristic in survival analysis. Most survival FS methods depend on Cox’s proportional hazard model; however, machine learning techniques (MLT) are preferred but not commonly used due to censoring. Techniques that have been proposed to adopt MLT to perform FS with survival data cannot be used with the high level of censoring. The researcher’s previous publications proposed a technique to deal with the high level of censoring. It also used existing FS techniques to reduce dataset dimension. However, in this paper a new FS technique was proposed and combined with feature transformation and the proposed uncensoring approaches to select a reduced set of features and produce a stable predictive model. Methods In this paper, a FS technique based on artificial neural network (ANN) MLT is proposed to deal with highly censored Endovascular Aortic Repair (EVAR). Survival data EVAR datasets were collected during 2004 to 2010 from two vascular centers in order to produce a final stable model. They contain almost 91% of censored patients. The proposed approach used a wrapper FS method with ANN to select a reduced subset of features that predict the risk of EVAR re-intervention after 5 years to patients from two different centers located in the United Kingdom, to allow it to be potentially applied to cross-centers predictions. The proposed model is compared with the two popular FS techniques; Akaike and Bayesian information criteria (AIC, BIC) that are used with Cox’s model. Results The final model outperforms other methods in distinguishing the high and low risk groups; as they both have concordance index and estimated AUC better than the Cox’s model based on AIC, BIC, Lasso, and SCAD approaches. These models have p-values lower than 0.05, meaning that patients with different risk groups can be separated significantly and those who would need re-intervention can be correctly predicted. Conclusion The proposed approach will save time and effort made by physicians to collect unnecessary variables. The final reduced model was able to predict the long-term risk of aortic complications after EVAR. This predictive model can help clinicians decide patients’ future observation plan. Electronic supplementary material The online version of this article (doi:10.1186/s12911-017-0508-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Omneya Attallah
- School of Engineering and Applied Science, Aston University, B4 7ET, Birmingham, UK.,Department of Electronics and Communications, College of Engineering and Technology, Arab Academy for Science and Technology, Alexandria, Egypt
| | | | | | | | - Rob Sayers
- St George's Vascular Institute, St George's University Hospitals NHS Foundation Trust, Blackshaw Road, London, SW17 0QT, UK
| | - Matthew J Bown
- Vascular Surgery Group, University of Leicester, Leicester, UK
| | - Eddie C Choke
- Vascular Surgery Group, Robert Kilpatrick Clinical Sciences Building, Leicester Royal Infirmary, University of Leicester, Leicester, LE2 7LX, UK
| | - Xianghong Ma
- School of Engineering and Applied Science, Aston University, B4 7ET, Birmingham, UK.
| |
Collapse
|
12
|
Digital Image Analysis of Cells and Computational Tools for the Study of Mechanism of RSV Entry to Human Bronchial Epithelium. SISTEMAS E TECNOLOGIAS DE INFORMACAO : ATAS DE 12A CONFERENCIA IBERICA DE SISTEMAS E TECNOLOGIAS DE INFORMACAO (CISTI'2017) : 21 A 24 DE JUNHO DE 2017, LISBOA, PORTUGAL = INFORMATION SYSTEMS AND TECHNOLOGIES : PROCEEDINGS OF THE 12TH IB... 2017; 2017. [PMID: 34337619 DOI: 10.23919/cisti.2017.7975726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
this paper presents a research proposal which has been developed as a doctoral thesis in the PhD program in Computer Systems Engineering at the Universidad del Norte since August 2015. This research focuses on the analysis of cell images of the human bronchial epithelium infected with the Respiratory Syncytial Virus in order to understand the mechanisms of entry of the virus into the human body. Due to the large amount of information that is processed, it is necessary to use computational tools to finally differentiate between infected and uninfected cells.
Collapse
|
13
|
Development of a two-stage gene selection method that incorporates a novel hybrid approach using the cuckoo optimization algorithm and harmony search for cancer classification. J Biomed Inform 2017; 67:11-20. [DOI: 10.1016/j.jbi.2017.01.016] [Citation(s) in RCA: 56] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2016] [Revised: 01/24/2017] [Accepted: 01/31/2017] [Indexed: 12/24/2022]
|
14
|
|
15
|
Supervised wavelet method to predict patient survival from gene expression data. ScientificWorldJournal 2014; 2014:618412. [PMID: 25538955 PMCID: PMC4235600 DOI: 10.1155/2014/618412] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2014] [Accepted: 10/03/2014] [Indexed: 11/18/2022] Open
Abstract
In microarray studies, the number of samples is relatively small compared to the number of genes per sample. An important aspect of microarray studies is the prediction of patient survival based on their gene expression profile. This naturally calls for the use of a dimension reduction procedure together with the survival prediction model. In this study, a new method based on combining wavelet approximation coefficients and Cox regression was presented. The proposed method was compared with supervised principal component and supervised partial least squares methods. The different fitted Cox models based on supervised wavelet approximation coefficients, the top number of supervised principal components, and partial least squares components were applied to the data. The results showed that the prediction performance of the Cox model based on supervised wavelet feature extraction was superior to the supervised principal components and partial least squares components. The results suggested the possibility of developing new tools based on wavelets for the dimensionally reduction of microarray data sets in the context of survival analysis.
Collapse
|
16
|
Doostparast Torshizi A, Fazel Zarandi MH. A new cluster validity measure based on general type-2 fuzzy sets: Application in gene expression data clustering. Knowl Based Syst 2014. [DOI: 10.1016/j.knosys.2014.03.023] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
17
|
Meng T, Soliman AT, Shyu ML, Yang Y, Chen SC, Iyengar SS, Yordy JS, Iyengar P. Wavelet analysis in current cancer genome research: a survey. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2013; 10:1442-1459. [PMID: 24407303 DOI: 10.1109/tcbb.2013.134] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
With the rapid development of next generation sequencing technology, the amount of biological sequence data of the cancer genome increases exponentially, which calls for efficient and effective algorithms that may identify patterns hidden underneath the raw data that may distinguish cancer Achilles' heels. From a signal processing point of view, biological units of information, including DNA and protein sequences, have been viewed as one-dimensional signals. Therefore, researchers have been applying signal processing techniques to mine the potentially significant patterns within these sequences. More specifically, in recent years, wavelet transforms have become an important mathematical analysis tool, with a wide and ever increasing range of applications. The versatility of wavelet analytic techniques has forged new interdisciplinary bounds by offering common solutions to apparently diverse problems and providing a new unifying perspective on problems of cancer genome research. In this paper, we provide a survey of how wavelet analysis has been applied to cancer bioinformatics questions. Specifically, we discuss several approaches of representing the biological sequence data numerically and methods of using wavelet analysis on the numerical sequences.
Collapse
Affiliation(s)
- Tao Meng
- University of Miami, Coral Gables
| | | | | | | | | | | | - John S Yordy
- University of Texas Southwestern Medical Center, Dallas
| | | |
Collapse
|