1
|
Lu K, Lin S, Xue K, Huang D, Ji Y. Optimized multiple instance learning for brain tumor classification using weakly supervised contrastive learning. Comput Biol Med 2025; 191:110075. [PMID: 40220594 DOI: 10.1016/j.compbiomed.2025.110075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2024] [Revised: 02/28/2025] [Accepted: 03/21/2025] [Indexed: 04/14/2025]
Abstract
Brain tumors have a great impact on patients' quality of life and accurate histopathological classification of brain tumors is crucial for patients' prognosis. Multi-instance learning (MIL) has become the mainstream method for analyzing whole-slide images (WSIs). However, current MIL-based methods face several issues, including significant redundancy in the input and feature space, insufficient modeling of spatial relations between patches and inadequate representation capability of the feature extractor. To solve these limitations, we propose a new multi-instance learning with weakly supervised contrastive learning for brain tumor classification. Our framework consists of two parts: a cross-detection MIL aggregator (CDMIL) for brain tumor classification and a contrastive learning model based on pseudo-labels (PSCL) for optimizing feature encoder. The CDMIL consists of three modules: an internal patch anchoring module (IPAM), a local structural learning module (LSLM) and a cross-detection module (CDM). Specifically, IPAM utilizes probability distribution to generate representations of anchor samples, while LSLM extracts representations of local structural information between anchor samples. These two representations are effectively fused in CDM. Additionally, we propose a bag-level contrastive loss to interact with different subtypes in the feature space. PSCL uses the samples and pseudo-labels anchored by IPAM to optimize the performance of the feature encoder, which extracts a better feature representation to train CDMIL. We performed benchmark tests on a self-collected dataset and a publicly available dataset. The experiments show that our method has better performance than several existing state-of-the-art methods.
Collapse
Affiliation(s)
- Kaoyan Lu
- Key Laboratory of Atomic and Subatomic Structure and Quantum Control (Ministry of Education), Guangdong Basic Research Center of Excellence for Structure and Fundamental Interactions of Matter, School of Physics, South China Normal University, 378 Waihuan West Road, Panyu District, Guangzhou, 510006, Guangdong Province, China; Guangdong Provincial Key Laboratory of Quantum Engineering and Quantum Materials, Guangdong-Hong Kong Joint Laboratory of Quantum Matter, South China Normal University, 378 Waihuan West Road, Panyu District, 510006, Guangdong Province, Guangzhou, China
| | - Shiyu Lin
- Key Laboratory of Atomic and Subatomic Structure and Quantum Control (Ministry of Education), Guangdong Basic Research Center of Excellence for Structure and Fundamental Interactions of Matter, School of Physics, South China Normal University, 378 Waihuan West Road, Panyu District, Guangzhou, 510006, Guangdong Province, China; Guangdong Provincial Key Laboratory of Quantum Engineering and Quantum Materials, Guangdong-Hong Kong Joint Laboratory of Quantum Matter, South China Normal University, 378 Waihuan West Road, Panyu District, 510006, Guangdong Province, Guangzhou, China
| | - Kaiwen Xue
- School of Cyberspace Security, Beijing University of Posts and Telecommunications, 10 Xitucheng Road, Haidian District, Beijing, 100876, China
| | - Duoxi Huang
- The Third Affiliated Hospital of Southern Medical University, 183 Zhongshan Avenue West, Tianhe District, Guangzhou, 510630, Guangdong Province, China.
| | - Yanghong Ji
- Key Laboratory of Atomic and Subatomic Structure and Quantum Control (Ministry of Education), Guangdong Basic Research Center of Excellence for Structure and Fundamental Interactions of Matter, School of Physics, South China Normal University, 378 Waihuan West Road, Panyu District, Guangzhou, 510006, Guangdong Province, China; Guangdong Provincial Key Laboratory of Quantum Engineering and Quantum Materials, Guangdong-Hong Kong Joint Laboratory of Quantum Matter, South China Normal University, 378 Waihuan West Road, Panyu District, 510006, Guangdong Province, Guangzhou, China.
| |
Collapse
|
2
|
Zeng G, Zhao C, Li G, Huang Z, Zhuang J, Liang X, Yu X, Fang S. Identifying somatic driver mutations in cancer with a language model of the human genome. Comput Struct Biotechnol J 2025; 27:531-540. [PMID: 39968174 PMCID: PMC11833646 DOI: 10.1016/j.csbj.2025.01.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2024] [Revised: 01/12/2025] [Accepted: 01/14/2025] [Indexed: 02/20/2025] Open
Abstract
Somatic driver mutations play important roles in cancer and must be precisely identified to advance our understanding of tumorigenesis and its promotion and progression. However, identifying somatic driver mutations remains challenging in Homo sapiens genomics due to the random nature of mutations and the high cost of qualitative experiments. Building on the powerful sequence interpretation capabilities of language models, we propose a self-attention-based contextualized pretrained language model for somatic driver mutation identification. We pretrained the model with the Homo sapiens reference genome to equip it with the ability to understand genome sequences and then fine-tuned it for oncogene and tumor suppressor gene prediction tasks, enabling it to extract features related to driver genes from the original genome sequence. The fine-tuned model was used to obtain the mutations' carcinogenic effect characteristics to further identify whether the mutation is a driver or a passenger. Compared with other computational algorithms, our method achieved excellent somatic driver mutation identification performance on the test set, with an absolute improvement of 4.31% in AUROC over the best comparison method. The strong performance of our method indicates that it can provide new insights into the discovery of cancer drivers.
Collapse
Affiliation(s)
- Guangjian Zeng
- School of Biomedical Engineering, Shenzhen University, Shenzhen, China
- School of Public Health and Emergency Management, Southern University of Science and Technology, Shenzhen, China
| | - Chengzhi Zhao
- School of Public Health and Emergency Management, Southern University of Science and Technology, Shenzhen, China
| | - Guanpeng Li
- School of Public Health and Emergency Management, Southern University of Science and Technology, Shenzhen, China
| | - Zhengyang Huang
- School of Biomedical Engineering, Shenzhen University, Shenzhen, China
| | - Jinhu Zhuang
- Shenzhen Health Development Research and Data Management Center, Guangdong, China
| | - Xiaohua Liang
- Department of Clinical Epidemiology and Biostatistics, Children's Hospital of Chongqing Medical University, National Clinical Research Center for Child Health and Disorders, Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing Key Laboratory of Pediatrics, Chongqing, China
| | - Xiaxia Yu
- School of Biomedical Engineering, Shenzhen University, Shenzhen, China
| | - Shenying Fang
- School of Public Health and Emergency Management, Southern University of Science and Technology, Shenzhen, China
| |
Collapse
|
3
|
Hassan J, Saeed SM, Deka L, Uddin MJ, Das DB. Applications of Machine Learning (ML) and Mathematical Modeling (MM) in Healthcare with Special Focus on Cancer Prognosis and Anticancer Therapy: Current Status and Challenges. Pharmaceutics 2024; 16:260. [PMID: 38399314 PMCID: PMC10892549 DOI: 10.3390/pharmaceutics16020260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 01/29/2024] [Accepted: 02/07/2024] [Indexed: 02/25/2024] Open
Abstract
The use of data-driven high-throughput analytical techniques, which has given rise to computational oncology, is undisputed. The widespread use of machine learning (ML) and mathematical modeling (MM)-based techniques is widely acknowledged. These two approaches have fueled the advancement in cancer research and eventually led to the uptake of telemedicine in cancer care. For diagnostic, prognostic, and treatment purposes concerning different types of cancer research, vast databases of varied information with manifold dimensions are required, and indeed, all this information can only be managed by an automated system developed utilizing ML and MM. In addition, MM is being used to probe the relationship between the pharmacokinetics and pharmacodynamics (PK/PD interactions) of anti-cancer substances to improve cancer treatment, and also to refine the quality of existing treatment models by being incorporated at all steps of research and development related to cancer and in routine patient care. This review will serve as a consolidation of the advancement and benefits of ML and MM techniques with a special focus on the area of cancer prognosis and anticancer therapy, leading to the identification of challenges (data quantity, ethical consideration, and data privacy) which are yet to be fully addressed in current studies.
Collapse
Affiliation(s)
- Jasmin Hassan
- Drug Delivery & Therapeutics Lab, Dhaka 1212, Bangladesh; (J.H.); (S.M.S.)
| | | | - Lipika Deka
- Faculty of Computing, Engineering and Media, De Montfort University, Leicester LE1 9BH, UK;
| | - Md Jasim Uddin
- Department of Pharmaceutical Technology, Faculty of Pharmacy, Universiti Malaya, Kuala Lumpur 50603, Malaysia
| | - Diganta B. Das
- Department of Chemical Engineering, Loughborough University, Loughborough LE11 3TU, UK
| |
Collapse
|
4
|
Nourbakhsh M, Degn K, Saksager A, Tiberti M, Papaleo E. Prediction of cancer driver genes and mutations: the potential of integrative computational frameworks. Brief Bioinform 2024; 25:bbad519. [PMID: 38261338 PMCID: PMC10805075 DOI: 10.1093/bib/bbad519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 11/27/2023] [Accepted: 12/11/2023] [Indexed: 01/24/2024] Open
Abstract
The vast amount of available sequencing data allows the scientific community to explore different genetic alterations that may drive cancer or favor cancer progression. Software developers have proposed a myriad of predictive tools, allowing researchers and clinicians to compare and prioritize driver genes and mutations and their relative pathogenicity. However, there is little consensus on the computational approach or a golden standard for comparison. Hence, benchmarking the different tools depends highly on the input data, indicating that overfitting is still a massive problem. One of the solutions is to limit the scope and usage of specific tools. However, such limitations force researchers to walk on a tightrope between creating and using high-quality tools for a specific purpose and describing the complex alterations driving cancer. While the knowledge of cancer development increases daily, many bioinformatic pipelines rely on single nucleotide variants or alterations in a vacuum without accounting for cellular compartments, mutational burden or disease progression. Even within bioinformatics and computational cancer biology, the research fields work in silos, risking overlooking potential synergies or breakthroughs. Here, we provide an overview of databases and datasets for building or testing predictive cancer driver tools. Furthermore, we introduce predictive tools for driver genes, driver mutations, and the impact of these based on structural analysis. Additionally, we suggest and recommend directions in the field to avoid silo-research, moving towards integrative frameworks.
Collapse
Affiliation(s)
- Mona Nourbakhsh
- Cancer Systems Biology, Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, 2800 Lyngby, Denmark
| | - Kristine Degn
- Cancer Systems Biology, Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, 2800 Lyngby, Denmark
| | - Astrid Saksager
- Cancer Systems Biology, Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, 2800 Lyngby, Denmark
| | - Matteo Tiberti
- Cancer Structural Biology, Danish Cancer Institute, 2100 Copenhagen, Denmark
| | - Elena Papaleo
- Cancer Systems Biology, Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, 2800 Lyngby, Denmark
- Cancer Structural Biology, Danish Cancer Institute, 2100 Copenhagen, Denmark
| |
Collapse
|
5
|
Sharma D, Xu W. ReGeNNe: genetic pathway-based deep neural network using canonical correlation regularizer for disease prediction. Bioinformatics 2023; 39:btad679. [PMID: 37963055 PMCID: PMC10666205 DOI: 10.1093/bioinformatics/btad679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 10/06/2023] [Accepted: 11/13/2023] [Indexed: 11/16/2023] Open
Abstract
MOTIVATION Common human diseases result from the interplay of genes and their biologically associated pathways. Genetic pathway analyses provide more biological insight as compared to conventional gene-based analysis. In this article, we propose a framework combining genetic data into pathway structure and using an ensemble of convolutional neural networks (CNNs) along with a Canonical Correlation Regularizer layer for comprehensive prediction of disease risk. The novelty of our approach lies in our two-step framework: (i) utilizing the CNN's effectiveness to extract the complex gene associations within individual genetic pathways and (ii) fusing features from ensemble of CNNs through Canonical Correlation Regularization layer to incorporate the interactions between pathways which share common genes. During prediction, we also address the important issues of interpretability of neural network models, and identifying the pathways and genes playing an important role in prediction. RESULTS Implementation of our methodology into three real cancer genetic datasets for different prediction tasks validates our model's generalizability and robustness. Comparing with conventional models, our methodology provides consistently better performance with AUC improvement of 11% on predicting early/late-stage kidney cancer, 10% on predicting kidney versus liver cancer type and 7% on predicting survival status in ovarian cancer as compared to the next best conventional machine learning model. The robust performance of our deep learning algorithm indicates that disease prediction using neural networks in multiple functionally related genes across different pathways improves genetic data-based prediction and understanding molecular mechanisms of diseases. AVAILABILITY AND IMPLEMENTATION https://github.com/divya031090/ReGeNNe.
Collapse
Affiliation(s)
- Divya Sharma
- Biostatistics Department, Princess Margaret Cancer Center, University Health Network, Toronto, ON M5G2C4, Canada
- Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON M5T 3M7, Canada
| | - Wei Xu
- Biostatistics Department, Princess Margaret Cancer Center, University Health Network, Toronto, ON M5G2C4, Canada
- Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON M5T 3M7, Canada
| |
Collapse
|
6
|
Rösler W, Altenbuchinger M, Baeßler B, Beissbarth T, Beutel G, Bock R, von Bubnoff N, Eckardt JN, Foersch S, Loeffler CML, Middeke JM, Mueller ML, Oellerich T, Risse B, Scherag A, Schliemann C, Scholz M, Spang R, Thielscher C, Tsoukakis I, Kather JN. An overview and a roadmap for artificial intelligence in hematology and oncology. J Cancer Res Clin Oncol 2023; 149:7997-8006. [PMID: 36920563 PMCID: PMC10374829 DOI: 10.1007/s00432-023-04667-5] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Accepted: 02/23/2023] [Indexed: 03/16/2023]
Abstract
BACKGROUND Artificial intelligence (AI) is influencing our society on many levels and has broad implications for the future practice of hematology and oncology. However, for many medical professionals and researchers, it often remains unclear what AI can and cannot do, and what are promising areas for a sensible application of AI in hematology and oncology. Finally, the limits and perils of using AI in oncology are not obvious to many healthcare professionals. METHODS In this article, we provide an expert-based consensus statement by the joint Working Group on "Artificial Intelligence in Hematology and Oncology" by the German Society of Hematology and Oncology (DGHO), the German Association for Medical Informatics, Biometry and Epidemiology (GMDS), and the Special Interest Group Digital Health of the German Informatics Society (GI). We provide a conceptual framework for AI in hematology and oncology. RESULTS First, we propose a technological definition, which we deliberately set in a narrow frame to mainly include the technical developments of the last ten years. Second, we present a taxonomy of clinically relevant AI systems, structured according to the type of clinical data they are used to analyze. Third, we show an overview of potential applications, including clinical, research, and educational environments with a focus on hematology and oncology. CONCLUSION Thus, this article provides a point of reference for hematologists and oncologists, and at the same time sets forth a framework for the further development and clinical deployment of AI in hematology and oncology in the future.
Collapse
Affiliation(s)
- Wiebke Rösler
- Department for Medical Oncology and Hematology, University Hospital Zurich, Zurich, Switzerland
| | - Michael Altenbuchinger
- Department of Medical Bioinformatics, University Medical Center Göttingen, Göttingen, Germany
| | - Bettina Baeßler
- Department of Diagnostic and Interventional Radiology, University Hospital Würzburg, Würzburg, Germany
| | - Tim Beissbarth
- Department of Medical Bioinformatics, University Medical Center Göttingen, Göttingen, Germany
| | - Gernot Beutel
- Department for Hematology, Hemostasis, Oncology and Stem Cell Transplantation, Hannover Medical School, Hannover, Germany
| | - Robert Bock
- IMMS Institute for Microelectronics and Mechatronics Systems GmbH (NPO), Ilmenau, Germany
| | - Nikolas von Bubnoff
- Department of Hematology and Oncology, Medical Center, University of Schleswig Holstein, Campus Lübeck, Lübeck, Germany
| | - Jan-Niklas Eckardt
- Department of Medicine 1, University Hospital Carl Gustav Carus, Technical University Dresden, Dresden, Germany
- Else Kroener Fresenius Center for Digital Health (EFFZ), Technical University Dresden, Dresden, Germany
| | - Sebastian Foersch
- Institute of Pathology, University Medical Center Mainz, Mainz, Germany
| | - Chiara M L Loeffler
- Department of Medicine 1, University Hospital Carl Gustav Carus, Technical University Dresden, Dresden, Germany
- Else Kroener Fresenius Center for Digital Health (EFFZ), Technical University Dresden, Dresden, Germany
| | - Jan Moritz Middeke
- Department of Medicine 1, University Hospital Carl Gustav Carus, Technical University Dresden, Dresden, Germany
- Else Kroener Fresenius Center for Digital Health (EFFZ), Technical University Dresden, Dresden, Germany
| | | | - Thomas Oellerich
- Medizinische Klinik 2-Haematology/Oncology, University Hospital, Frankfurt am Main, Germany
| | - Benjamin Risse
- Computer Vision and Machine Learning Systems Group, Institute for Geoinformatics, University of Münster, Münster, Germany
| | - André Scherag
- Institute of Medical Statistics, Computer and Data Sciences, Jena University Hospital - Friedrich Schiller University, Jena, Germany
| | | | - Markus Scholz
- Institute for Medical Informatics, Statistics and Epidemiology, University of Leipzig, Leipzig, Germany
| | - Rainer Spang
- Department of Statistical Bioinformatics, University of Regensburg, Regensburg, Germany
| | | | - Ioannis Tsoukakis
- Department of Hematology and Oncology, Sana Klinikum Offenbach, Offenbach, Germany
| | - Jakob Nikolas Kather
- Department of Medicine 1, University Hospital Carl Gustav Carus, Technical University Dresden, Dresden, Germany.
- Else Kroener Fresenius Center for Digital Health (EFFZ), Technical University Dresden, Dresden, Germany.
- Medical Oncology, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, Heidelberg, Germany.
| |
Collapse
|
7
|
Ruan Y, Lv W, Li S, Cheng Y, Wang D, Zhang C, Shimizu K. Identification of telomere-related genes associated with aging-related molecular clusters and the construction of a diagnostic model in Alzheimer's disease based on a bioinformatic analysis. Comput Biol Med 2023; 159:106922. [PMID: 37094463 DOI: 10.1016/j.compbiomed.2023.106922] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 03/07/2023] [Accepted: 04/13/2023] [Indexed: 04/26/2023]
Abstract
BACKGROUND Alzheimer's disease (AD) is a neurodegenerative disease that is strongly associated with aging. Telomeres are DNA sequences that protect chromosomes from damage and shorten with age. Telomere-related genes (TRGs) may play a role in AD's pathogenesis. OBJECTIVES To identify TRGs related to aging clusters in AD patients, explore their immunological characteristics, and build a TRG-based prediction model for AD and AD subtypes. METHODS We analyzed the gene expression profiles of 97 AD samples from the GSE132903 dataset, using aging-related genes (ARGs) as clustering variables. We also assessed immune-cell infiltration in each cluster. We performed a weighted gene co-expression network analysis to identify cluster-specific differentially expressed TRGs. We compared four machine-learning models (random forest, generalized linear model [GLM], gradient boosting model, and support vector machine) for predicting AD and AD subtypes based on TRGs and validated TRGs by conducting an artificial neural network (ANN) analysis and a nomogram model. RESULTS We identified two aging clusters in AD patients with distinct immunological features: Cluster A had higher immune scores than Cluster B. Cluster A and the immune system are intimately associated, and this association could affect immunological function and result in AD via the digestive system. The GLM predicted AD and AD subtypes most accurately and was validated by the ANN analysis and nomogram model. CONCLUSION Our analyses revealed novel TRGs associated with aging clusters in AD patients and their immunological characteristics. We also developed a promising prediction model based on TRGs for assessing AD risk.
Collapse
Affiliation(s)
- Yang Ruan
- Laboratory of Systematic Forest and Forest Products Sciences, Graduate School of Bioresource and Bioenvironmental Sciences, Kyushu University, Fukuoka, 819-0395, Japan
| | - Weichao Lv
- Sino-Jan Joint Lab of Natural Health Products Research, School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing, 210009, China
| | - Shuaiyu Li
- Saigo Laboratory, School of Information Science, Kyushu University, Fukuoka, 819-0395, Japan
| | - Yuzhong Cheng
- Joint Graduate School of Mathematics for Innovation, Kyushu University, Fukuoka, 819-0395, Japan
| | - Duanyang Wang
- Laboratory of Systematic Forest and Forest Products Sciences, Graduate School of Bioresource and Bioenvironmental Sciences, Kyushu University, Fukuoka, 819-0395, Japan
| | - Chaofeng Zhang
- Sino-Jan Joint Lab of Natural Health Products Research, School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing, 210009, China
| | - Kuniyoshi Shimizu
- Laboratory of Systematic Forest and Forest Products Sciences, Graduate School of Bioresource and Bioenvironmental Sciences, Kyushu University, Fukuoka, 819-0395, Japan.
| |
Collapse
|
8
|
Ostroverkhova D, Przytycka TM, Panchenko AR. Cancer driver mutations: predictions and reality. Trends Mol Med 2023:S1471-4914(23)00067-9. [PMID: 37076339 DOI: 10.1016/j.molmed.2023.03.007] [Citation(s) in RCA: 60] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 03/17/2023] [Accepted: 03/23/2023] [Indexed: 04/21/2023]
Abstract
Cancer cells accumulate many genetic alterations throughout their lifetime, but only a few of them drive cancer progression, termed driver mutations. Driver mutations may vary between cancer types and patients, can remain latent for a long time and become drivers at particular cancer stages, or may drive oncogenesis only in conjunction with other mutations. The high mutational, biochemical, and histological tumor heterogeneity makes driver mutation identification very challenging. In this review we summarize recent efforts to identify driver mutations in cancer and annotate their effects. We underline the success of computational methods to predict driver mutations in finding novel cancer biomarkers, including in circulating tumor DNA (ctDNA). We also report on the boundaries of their applicability in clinical research.
Collapse
Affiliation(s)
- Daria Ostroverkhova
- Department of Pathology and Molecular Medicine, Queen's University, Kingston, ON, Canada
| | - Teresa M Przytycka
- National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD, USA.
| | - Anna R Panchenko
- Department of Pathology and Molecular Medicine, Queen's University, Kingston, ON, Canada; Department of Biology and Molecular Sciences, Queen's University, Kingston, ON, Canada; School of Computing, Queen's University, Kingston, ON, Canada; Ontario Institute of Cancer Research, Toronto, ON, Canada.
| |
Collapse
|
9
|
Sun Q, Cheng L, Meng A, Ge S, Chen J, Zhang L, Gong P. SADLN: Self-attention based deep learning network of integrating multi-omics data for cancer subtype recognition. Front Genet 2023; 13:1032768. [PMID: 36685873 PMCID: PMC9846505 DOI: 10.3389/fgene.2022.1032768] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 12/15/2022] [Indexed: 01/05/2023] Open
Abstract
Integrating multi-omics data for cancer subtype recognition is an important task in bioinformatics. Recently, deep learning has been applied to recognize the subtype of cancers. However, existing studies almost integrate the multi-omics data simply by concatenation as the single data and then learn a latent low-dimensional representation through a deep learning model, which did not consider the distribution differently of omics data. Moreover, these methods ignore the relationship of samples. To tackle these problems, we proposed SADLN: A self-attention based deep learning network of integrating multi-omics data for cancer subtype recognition. SADLN combined encoder, self-attention, decoder, and discriminator into a unified framework, which can not only integrate multi-omics data but also adaptively model the sample's relationship for learning an accurately latent low-dimensional representation. With the integrated representation learned from the network, SADLN used Gaussian Mixture Model to identify cancer subtypes. Experiments on ten cancer datasets of TCGA demonstrated the advantages of SADLN compared to ten methods. The Self-Attention Based Deep Learning Network (SADLN) is an effective method of integrating multi-omics data for cancer subtype recognition.
Collapse
Affiliation(s)
- Qiuwen Sun
- School of Medical Imaging, Xuzhou Medical University, Xuzhou, China
| | - Lei Cheng
- School of Medical Imaging, Xuzhou Medical University, Xuzhou, China
| | - Ao Meng
- School of Medical Imaging, Xuzhou Medical University, Xuzhou, China
| | - Shuguang Ge
- School of Information and Control Engineering, University of Mining and Technology, Xuzhou, China
| | - Jie Chen
- Department of Radiation Oncology, Affiliated Hospital of Xuzhou Medical University, Xuzhou, China
| | - Longzhen Zhang
- Department of Radiation Oncology, Affiliated Hospital of Xuzhou Medical University, Xuzhou, China
| | - Ping Gong
- School of Medical Imaging, Xuzhou Medical University, Xuzhou, China
| |
Collapse
|
10
|
Li Y, Wu X, Yang P, Jiang G, Luo Y. Machine Learning for Lung Cancer Diagnosis, Treatment, and Prognosis. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022; 20:850-866. [PMID: 36462630 PMCID: PMC10025752 DOI: 10.1016/j.gpb.2022.11.003] [Citation(s) in RCA: 60] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 10/03/2022] [Accepted: 11/17/2022] [Indexed: 12/03/2022]
Abstract
The recent development of imaging and sequencing technologies enables systematic advances in the clinical study of lung cancer. Meanwhile, the human mind is limited in effectively handling and fully utilizing the accumulation of such enormous amounts of data. Machine learning-based approaches play a critical role in integrating and analyzing these large and complex datasets, which have extensively characterized lung cancer through the use of different perspectives from these accrued data. In this review, we provide an overview of machine learning-based approaches that strengthen the varying aspects of lung cancer diagnosis and therapy, including early detection, auxiliary diagnosis, prognosis prediction, and immunotherapy practice. Moreover, we highlight the challenges and opportunities for future applications of machine learning in lung cancer.
Collapse
Affiliation(s)
- Yawei Li
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, USA
| | - Xin Wu
- Department of Medicine, University of Illinois at Chicago, Chicago, IL 60612, USA
| | - Ping Yang
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN 55905 / Scottsdale, AZ 85259, USA
| | - Guoqian Jiang
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN 55905, USA
| | - Yuan Luo
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, USA.
| |
Collapse
|
11
|
Zhou X, Wang H, Feng C, Xu R, He Y, Li L, Tu C. Emerging Applications of Deep Learning in Bone Tumors: Current Advances and Challenges. Front Oncol 2022; 12:908873. [PMID: 35928860 PMCID: PMC9345628 DOI: 10.3389/fonc.2022.908873] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 06/15/2022] [Indexed: 12/12/2022] Open
Abstract
Deep learning is a subfield of state-of-the-art artificial intelligence (AI) technology, and multiple deep learning-based AI models have been applied to musculoskeletal diseases. Deep learning has shown the capability to assist clinical diagnosis and prognosis prediction in a spectrum of musculoskeletal disorders, including fracture detection, cartilage and spinal lesions identification, and osteoarthritis severity assessment. Meanwhile, deep learning has also been extensively explored in diverse tumors such as prostate, breast, and lung cancers. Recently, the application of deep learning emerges in bone tumors. A growing number of deep learning models have demonstrated good performance in detection, segmentation, classification, volume calculation, grading, and assessment of tumor necrosis rate in primary and metastatic bone tumors based on both radiological (such as X-ray, CT, MRI, SPECT) and pathological images, implicating a potential for diagnosis assistance and prognosis prediction of deep learning in bone tumors. In this review, we first summarized the workflows of deep learning methods in medical images and the current applications of deep learning-based AI for diagnosis and prognosis prediction in bone tumors. Moreover, the current challenges in the implementation of the deep learning method and future perspectives in this field were extensively discussed.
Collapse
Affiliation(s)
- Xiaowen Zhou
- Department of Orthopaedics, The Second Xiangya Hospital, Central South University, Changsha, China
- Xiangya School of Medicine, Central South University, Changsha, China
| | - Hua Wang
- Xiangya School of Medicine, Central South University, Changsha, China
| | - Chengyao Feng
- Department of Orthopaedics, The Second Xiangya Hospital, Central South University, Changsha, China
- Hunan Key Laboratory of Tumor Models and Individualized Medicine, The Second Xiangya Hospital, Central South University, Changsha, China
| | - Ruilin Xu
- Department of Orthopaedics, The Second Xiangya Hospital, Central South University, Changsha, China
- Hunan Key Laboratory of Tumor Models and Individualized Medicine, The Second Xiangya Hospital, Central South University, Changsha, China
| | - Yu He
- Department of Radiology, The Second Xiangya Hospital, Central South University, Changsha, China
| | - Lan Li
- Department of Pathology, The Second Xiangya Hospital, Central South University, Changsha, China
| | - Chao Tu
- Department of Orthopaedics, The Second Xiangya Hospital, Central South University, Changsha, China
- Hunan Key Laboratory of Tumor Models and Individualized Medicine, The Second Xiangya Hospital, Central South University, Changsha, China
- *Correspondence: Chao Tu,
| |
Collapse
|
12
|
Machine learning approaches to explore digenic inheritance. Trends Genet 2022; 38:1013-1018. [DOI: 10.1016/j.tig.2022.04.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 04/16/2022] [Accepted: 04/25/2022] [Indexed: 11/22/2022]
|