1
|
Sengupta P, Dutta S, Liew F, Samrot A, Dasgupta S, Rajput MA, Slama P, Kolesarova A, Roychoudhury S. Reproductomics: Exploring the Applications and Advancements of Computational Tools. Physiol Res 2024; 73:687-702. [PMID: 39530905 PMCID: PMC11629954 DOI: 10.33549/physiolres.935389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Accepted: 06/25/2024] [Indexed: 12/13/2024] Open
Abstract
Over recent decades, advancements in omics technologies, such as proteomics, genomics, epigenomics, metabolomics, transcriptomics, and microbiomics, have significantly enhanced our understanding of the molecular mechanisms underlying various physiological and pathological processes. Nonetheless, the analysis and interpretation of vast omics data concerning reproductive diseases are complicated by the cyclic regulation of hormones and multiple other factors, which, in conjunction with a genetic makeup of an individual, lead to diverse biological responses. Reproductomics investigates the interplay between a hormonal regulation of an individual, environmental factors, genetic predisposition (DNA composition and epigenome), health effects, and resulting biological outcomes. It is a rapidly emerging field that utilizes computational tools to analyze and interpret reproductive data, with the aim of improving reproductive health outcomes. It is time to explore the applications of reproductomics in understanding the molecular mechanisms underlying infertility, identification of potential biomarkers for diagnosis and treatment, and in improving assisted reproductive technologies (ARTs). Reproductomics tools include machine learning algorithms for predicting fertility outcomes, gene editing technologies for correcting genetic abnormalities, and single cell sequencing techniques for analyzing gene expression patterns at the individual cell level. However, there are several challenges, limitations and ethical issues involved with the use of reproductomics, such as the applications of gene editing technologies and their potential impact on future generations are discussed. The review comprehensively covers the applications and advancements of reproductomics, highlighting its potential to improve reproductive health outcomes and deepen our understanding of reproductive molecular mechanisms.
Collapse
Affiliation(s)
- P Sengupta
- Department of Biomedical Sciences, College of Medicine, Gulf Medical University, Ajman, UAE; Department of Life Science and Bioinformatics, Assam University, Silchar, India.
| | | | | | | | | | | | | | | | | |
Collapse
|
2
|
Xu Q, Kaur J, Wylie D, Mittal K, Li H, Kolachina R, Aleskandarany M, Toss MS, Green AR, Yang J, Yankeelov TE, Bhattarai S, Janssen EAM, Kong J, Rakha EA, Kowalski J, Aneja R. A Case Series Exploration of Multi-Regional Expression Heterogeneity in Triple-Negative Breast Cancer Patients. Int J Mol Sci 2022; 23:13322. [PMID: 36362107 PMCID: PMC9655720 DOI: 10.3390/ijms232113322] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 10/12/2022] [Accepted: 10/13/2022] [Indexed: 08/13/2023] Open
Abstract
Extensive intratumoral heterogeneity (ITH) is believed to contribute to therapeutic failure and tumor recurrence, as treatment-resistant cell clones can survive and expand. However, little is known about ITH in triple-negative breast cancer (TNBC) because of the limited number of single-cell sequencing studies on TNBC. In this study, we explored ITH in TNBC by evaluating gene expression-derived and imaging-derived multi-region differences within the same tumor. We obtained tissue specimens from 10 TNBC patients and conducted RNA sequencing analysis of 2-4 regions per tumor. We developed a novel analysis framework to dissect and characterize different types of variability: between-patients (inter-tumoral heterogeneity), between-patients across regions (inter-tumoral and region heterogeneity), and within-patient, between-regions (regional intratumoral heterogeneity). We performed a Bayesian changepoint analysis to assess and classify regional variability as low (convergent) versus high (divergent) within each patient feature (TNBC and PAM50 subtypes, immune, stroma, tumor counts and tumor infiltrating lymphocytes). Gene expression signatures were categorized into three types of variability: between-patients (108 genes), between-patients across regions (183 genes), and within-patients, between-regions (778 genes). Based on the between-patient gene signature, we identified two distinct patient clusters that differed in menopausal status. Significant intratumoral divergence was observed for PAM50 classification, tumor cell counts, and tumor-infiltrating T cell abundance. Other features examined showed a representation of both divergent and convergent results. Lymph node stage was significantly associated with divergent tumors. Our results show extensive intertumoral heterogeneity and regional ITH in gene expression and image-derived features in TNBC. Our findings also raise concerns regarding gene expression based TNBC subtyping. Future studies are warranted to elucidate the role of regional heterogeneity in TNBC as a driver of treatment resistance.
Collapse
Affiliation(s)
- Qi Xu
- Department of Oncology, Livestrong Cancer Institutes, The University of Texas at Austin, Austin, TX 78712, USA
| | - Jaspreet Kaur
- Department of Biology, Georgia State University, Atlanta, GA 30303, USA
| | - Dennis Wylie
- Center for Biomedical Research Support, The University of Texas at Austin, Austin, TX 78705, USA
| | - Karuna Mittal
- Department of Biology, Georgia State University, Atlanta, GA 30303, USA
| | - Hongxiao Li
- Department of Mathematics and Statistics, Georgia State University, Atlanta, GA 30303, USA
| | - Rishab Kolachina
- Department of Biology, Georgia State University, Atlanta, GA 30303, USA
| | | | - Michael S. Toss
- University of Nottingham and Nottingham University Hospitals, Nottingham NG7 2UH, UK
| | - Andrew R. Green
- University of Nottingham and Nottingham University Hospitals, Nottingham NG7 2UH, UK
| | - Jianchen Yang
- Oden Institute for Computational Engineering and Sciences, The University of Texas at Austin, Austin, TX 78705, USA
- Departments of Diagnostic Medicine, Biomedical Engineering, and Oncology, The University of Texas at Austin, Austin, TX 78705, USA
| | - Thomas E. Yankeelov
- Department of Oncology, Livestrong Cancer Institutes, The University of Texas at Austin, Austin, TX 78712, USA
- Oden Institute for Computational Engineering and Sciences, The University of Texas at Austin, Austin, TX 78705, USA
- Departments of Diagnostic Medicine, Biomedical Engineering, and Oncology, The University of Texas at Austin, Austin, TX 78705, USA
| | - Shristi Bhattarai
- Department of Biology, Georgia State University, Atlanta, GA 30303, USA
| | - Emiel A. M. Janssen
- Department of Pathology, Stavanger University Hospital, 4011 Stavanger, Norway
| | - Jun Kong
- Department of Mathematics and Statistics, Georgia State University, Atlanta, GA 30303, USA
| | - Emad A. Rakha
- University of Nottingham and Nottingham University Hospitals, Nottingham NG7 2UH, UK
| | - Jeanne Kowalski
- Department of Oncology, Livestrong Cancer Institutes, The University of Texas at Austin, Austin, TX 78712, USA
| | - Ritu Aneja
- Department of Biology, Georgia State University, Atlanta, GA 30303, USA
- Department of Clinical and Diagnostic Sciences, School of Health Professions, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| |
Collapse
|
3
|
Li X, Xiang J, Wu FX, Li M. A Dual Ranking Algorithm Based on the Multiplex Network for Heterogeneous Complex Disease Analysis. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:1993-2002. [PMID: 33577455 DOI: 10.1109/tcbb.2021.3059046] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Identifying biomarkers of heterogeneous complex diseases has always been one of the focuses in medical research. In previous studies, the powerful network propagation methods have been applied to finding marker genes related to specific diseases, but existing methods are mostly based on a single network, which may be greatly affected by the incompleteness of the network and the ignorance of a large amount of information about physical and functional interactions between biological components. Other methods that directly integrate multiple types of interactions into an aggregate network have the risks that different types of data may conflict with each other and the characteristics and topologies of each individual network are lost. Meanwhile, biomarkers used in clinical trials should have the characteristics of small quantity and strong discriminate ability. In this study, we developed a multiplex network-based dual ranking framework (DualRank) for heterogeneous complex disease analysis. We applied the proposed method to heterogeneous complex diseases for diagnosis, prognosis, and classification. The results showed that DualRank outperformed competing methods and could identify biomarkers with the small quantity, great prediction performance (average AUC = 0.818) and biological interpretability.
Collapse
|
4
|
Li B, Wang Y, Gu H, Yu Y, Wang P, Liu J, Zhang Y, Chen Y, Niu Q, Wang B, Liu Q, Guan S, Li Y, Zhang H, Wang Z. Modular Screening Reveals Driver Induced Additive Mechanisms of Baicalin and Jasminoidin on Cerebral Ischemia Therapy. Front Cardiovasc Med 2022; 9:813983. [PMID: 35265682 PMCID: PMC8899124 DOI: 10.3389/fcvm.2022.813983] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Accepted: 01/17/2022] [Indexed: 11/13/2022] Open
Abstract
Combination therapy with increased efficacy and reduced toxicity plays a crucial role in treating complex diseases, such as stroke, but it remains an insurmountable barrier to elucidate the mechanisms of synergistic effects. Here, we present a Driver-induced Modular Screening (DiMS) strategy integrated synergistic module and driver gene identification to elucidate the additive mechanisms of Baicalin (BA) and Jasminoidin (JA) on cerebral ischemia (CI) therapy. Based on anti-ischemia genomic networks BA, JA, and their combination (BJ), we obtained 4, 3, and 9 On-modules of BA, JA, and BJ by modular similarity analysis. Compared with the monotherapy groups, four additive modules (Add-module, BJ_Mod-4, 7, 9, and 13), 15 driver genes of BJ were identified by modular similarity and network control methods, and seven driver proteins (PAQR8, RhoA, EMC10, GGA2, VIPR1, FAM120A, and SEMA3F) were validated by animal experiments. The functional analysis found neuroprotective roles of the Add-modules and driver genes, such as the Neurotrophin signaling pathway and FoxO signaling pathway, which may reflect the additive mechanisms of BJ. Moreover, such a DiMS paradigm provides a new angle to explore the synergistic mechanisms of combination therapy and screen multi-targeted drugs for complex diseases.
Collapse
Affiliation(s)
- Bing Li
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Ying Wang
- College of Nursing, Chengde Medical University, Chengde, China
| | - Hao Gu
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Yanan Yu
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Pengqian Wang
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
| | - Jun Liu
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Yingying Zhang
- Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Yinying Chen
- Guang'anmen Hospital, China Academy of Chinese Medical Sciences, Beijing, China
| | - Qikai Niu
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
| | - Bo Wang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Qiong Liu
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Shuang Guan
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Yanda Li
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Huamin Zhang
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
- *Correspondence: Huamin Zhang
| | - Zhong Wang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
- Zhong Wang
| |
Collapse
|
5
|
Demenkov PS, Oshchepkova ЕА, Demenkov PS, Ivanisenko TV, Ivanisenko VA. Prioritization of biological processes based on the reconstruction and analysis of associative gene networks describing the response of plants to adverse environmental factors. Vavilovskii Zhurnal Genet Selektsii 2021; 25:580-592. [PMID: 34723066 PMCID: PMC8543060 DOI: 10.18699/vj21.065] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Revised: 06/21/2021] [Accepted: 06/21/2021] [Indexed: 11/23/2022] Open
Abstract
Methods for prioritizing or ranking candidate genes according to their importance based on specif ic criteria
via the analysis of gene networks are widely used in biomedicine to search for genes associated with diseases and to
predict biomarkers, pharmacological targets and other clinically relevant molecules. These methods have also been
used in other f ields, particularly in crop production. This is largely due to the development of technologies to solve
problems in marker-oriented and genomic selection, which requires knowledge of the molecular genetic mechanisms
underlying the formation of agriculturally valuable traits. A new direction for the study of molecular genetic mechanisms
is the prioritization of biological processes based on the analysis of associative gene networks. Associative gene
networks are heterogeneous networks whose vertices can depict both molecular genetic objects (genes, proteins, metabolites,
etc.) and the higher-level factors (biological processes, diseases, external environmental factors, etc.) related
to regulatory, physicochemical or associative interactions. Using a previously developed method, biological processes
involved in plant responses to increased cadmium content, saline stress and drought conditions were prioritized according
to their degree of connection with the gene networks in the SOLANUM TUBEROSUM knowledge base. The
prioritization results indicate that fundamental processes, such as gene expression, post-translational modif ications,
protein degradation, programmed cell death, photosynthesis, signal transmission and stress response play important
roles in the common molecular genetic mechanisms for plant response to various adverse factors. On the other hand, a
group of processes related to the development of seeds (“seeding development”) was revealed to be drought specif ic,
while processes associated with ion transport (“ion transport”) were included in the list of responses specif ic to salt
stress and processes associated with the metabolism of lipids were found to be involved specif ically in the response to
cadmium.
Collapse
Affiliation(s)
- P S Demenkov
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia Novosibirsk State University, Novosibirsk, Russia
| | - Е А Oshchepkova
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - P S Demenkov
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia Novosibirsk State University, Novosibirsk, Russia
| | - T V Ivanisenko
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - V A Ivanisenko
- Novosibirsk State University, Novosibirsk, Russiavosibirsk, Russia Kurchatov Genomic Center of ICG SB RAS, Novosibirsk, Russia
| |
Collapse
|
6
|
Dotolo S, Marabotti A, Rachiglio AM, Esposito Abate R, Benedetto M, Ciardiello F, De Luca A, Normanno N, Facchiano A, Tagliaferri R. A multiple network-based bioinformatics pipeline for the study of molecular mechanisms in oncological diseases for personalized medicine. Brief Bioinform 2021; 22:6287337. [PMID: 34050359 PMCID: PMC8574709 DOI: 10.1093/bib/bbab180] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2020] [Revised: 03/17/2021] [Accepted: 04/20/2021] [Indexed: 01/03/2023] Open
Abstract
Motivation Assessment of genetic mutations is an essential element in the modern era of personalized cancer treatment. Our strategy is focused on ‘multiple network analysis’ in which we try to improve cancer diagnostics by using biological networks. Genetic alterations in some important hubs or in driver genes such as BRAF and TP53 play a critical role in regulating many important molecular processes. Most of the studies are focused on the analysis of the effects of single mutations, while tumors often carry mutations of multiple driver genes. The aim of this work is to define an innovative bioinformatics pipeline focused on the design and analysis of networks (such as biomedical and molecular networks), in order to: (1) improve the disease diagnosis; (2) identify the patients that could better respond to a given drug treatment; and (3) predict what are the primary and secondary effects of gene mutations involved in human diseases. Results By using our pipeline based on a multiple network approach, it has been possible to demonstrate and validate what are the joint effects and changes of the molecular profile that occur in patients with metastatic colorectal carcinoma (mCRC) carrying mutations in multiple genes. In this way, we can identify the most suitable drugs for the therapy for the individual patient. This information is useful to improve precision medicine in cancer patients. As an application of our pipeline, the clinically significant case studies of a cohort of mCRC patients with the BRAF V600E-TP53 I195N missense combined mutation were considered. Availability The procedures used in this paper are part of the Cytoscape Core, available at (www.cytoscape.org). Data used here on mCRC patients have been published in [55]. Supplementary Information A supplementary file containing a more detailed discussion of this case study and other cases is available at the journal site as Supplementary Data.
Collapse
Affiliation(s)
- Serena Dotolo
- Dipartimento di Scienze Aziendali, Management & Innovation Systems, Università degli Studi di Salerno, Fisciano (SA), Italy
| | - Anna Marabotti
- Dipartimento di Chimica e Biologia "A. Zambelli", Università degli Studi di Salerno, Fisciano (SA), Italy
| | - Anna Maria Rachiglio
- Cell Biology and Biotherapy Unit, Istituto Nazionale Tumori - IRCCS - Fondazione G. Pascale, Naples, Italy
| | - Riziero Esposito Abate
- Cell Biology and Biotherapy Unit, Istituto Nazionale Tumori -IRCCS - Fondazione G. Pascale, Naples, Italy
| | | | - Fortunato Ciardiello
- Dipartimento di Medicina di Precisione, Università degli Studi della Campania "Luigi Vanvitelli", Naples, Italy
| | - Antonella De Luca
- Cell Biology and Biotherapy Unit, Istituto Nazionale Tumori - IRCCS - Fondazione G. Pascale, Naples, Italy
| | - Nicola Normanno
- Cell Biology and Biotherapy Unit, Istituto Nazionale Tumori - IRCCS - Fondazione G. Pascale, Naples, Italy
| | - Angelo Facchiano
- Institute of Food Sciences, Italian National Research Council (CNR), Avellino, Italy
| | - Roberto Tagliaferri
- Dipartimento di Scienze Aziendali, Management & Innovation Systems, Università degli Studi di Salerno, Fisciano (SA), Italy
| |
Collapse
|
7
|
Rashid MM, Shatabda S, Hasan MM, Kurata H. Recent Development of Machine Learning Methods in Microbial Phosphorylation Sites. Curr Genomics 2020; 21:194-203. [PMID: 33071613 PMCID: PMC7521030 DOI: 10.2174/1389202921666200427210833] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Revised: 04/12/2020] [Accepted: 04/13/2020] [Indexed: 01/10/2023] Open
Abstract
A variety of protein post-translational modifications has been identified that control many cellular functions. Phosphorylation studies in mycobacterial organisms have shown critical importance in diverse biological processes, such as intercellular communication and cell division. Recent technical advances in high-precision mass spectrometry have determined a large number of microbial phosphorylated proteins and phosphorylation sites throughout the proteome analysis. Identification of phosphorylated proteins with specific modified residues through experimentation is often labor-intensive, costly and time-consuming. All these limitations could be overcome through the application of machine learning (ML) approaches. However, only a limited number of computational phosphorylation site prediction tools have been developed so far. This work aims to present a complete survey of the existing ML-predictors for microbial phosphorylation. We cover a variety of important aspects for developing a successful predictor, including operating ML algorithms, feature selection methods, window size, and software utility. Initially, we review the currently available phosphorylation site databases of the microbiome, the state-of-the-art ML approaches, working principles, and their performances. Lastly, we discuss the limitations and future directions of the computational ML methods for the prediction of phosphorylation.
Collapse
Affiliation(s)
| | | | - Md. Mehedi Hasan
- Address correspondence to these authors at the Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan; Tel: +81-948-297-828;, E-mail: and Biomedical Informatics R&D Center, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan; Tel: +81-948-297-828; E-mail:
| | - Hiroyuki Kurata
- Address correspondence to these authors at the Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan; Tel: +81-948-297-828;, E-mail: and Biomedical Informatics R&D Center, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan; Tel: +81-948-297-828; E-mail:
| |
Collapse
|
8
|
Shi M, Wang J, Zhang C. Integration of Cancer Genomics Data for Tree-based Dimensionality Reduction and Cancer Outcome Prediction. Mol Inform 2019; 39:e1900028. [PMID: 31490641 DOI: 10.1002/minf.201900028] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2019] [Accepted: 08/22/2019] [Indexed: 11/10/2022]
Abstract
Accurate outcome prediction is crucial for precision medicine and personalized treatment of cancer. Researchers have found that multi-dimensional cancer omics studies outperform each data type (mRNA, microRNA, methylation or somatic copy number alteration) study in human disease research. Existing methods leveraging multiple level of molecular data often suffer from various limitations, e. g., heterogeneity, poor robustness or loss of generality. To overcome these limitations, we presented the tree-based dimensionality reduction approach for the identification of smooth tree graph and developed accurate predictive model for clinical outcome prediction. We demonstrated that 1) Discriminative Dimensionality Reduction via learning a Tree (DDRTree) achieved reduced dimension space tree with statistical significance; 2) Tree based support vector machine (SVM) classifier improved prediction performance of cancer recurrence as compared to t-test based SVM classifier; 3) Tree based SVM classifier was robust with regard to the different number of multi-markers; 4) Combining multiple omics data improved prediction performance of cancer recurrence as compared to a single-omics data; and 5) Tree based SVM classifier achieved similar or better prediction performance when compared to the features from state-of-the-art feature selection methods. Our results demonstrated great potential of the tree-based dimensionality reduction approach based clinical outcome prediction.
Collapse
Affiliation(s)
- Mingguang Shi
- School of Electric Engineering and Automation, Hefei University of Technology, Hefei, Anhui, 230009, China
| | - Junwen Wang
- School of Electric Engineering and Automation, Hefei University of Technology, Hefei, Anhui, 230009, China
| | - Chenyu Zhang
- School of Electric Engineering and Automation, Hefei University of Technology, Hefei, Anhui, 230009, China
| |
Collapse
|
9
|
Mi X, Zou F, Zhu R. Bagging and deep learning in optimal individualized treatment rules. Biometrics 2019; 75:674-684. [PMID: 30365175 DOI: 10.1111/biom.12990] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2017] [Accepted: 10/09/2018] [Indexed: 11/30/2022]
Abstract
An ENsemble Deep Learning Optimal Treatment (EndLot) approach is proposed for personalized medicine problems. The statistical framework of the proposed method is based on the outcome weighted learning (OWL) framework which transforms the optimal decision rule problem into a weighted classification problem. We further employ an ensemble of deep neural networks (DNNs) to learn the optimal decision rule. Utilizing the flexibility of DNNs and the stability of bootstrap aggregation, the proposed method achieves a considerable improvement over existing methods. An R package "ITRlearn" is developed to implement the proposed method. Numerical performance is demonstrated via simulation studies and a real data analysis of the Cancer Cell Line Encyclopedia data.
Collapse
Affiliation(s)
- Xinlei Mi
- Department of Biostatistics, University of Florida, Gainesville, Florida
| | - Fei Zou
- Department of Biostatistics, University of Florida, Gainesville, Florida
| | - Ruoqing Zhu
- Department of Statistics, University of Illinois at Urbana-Champaign, Champaign, Illinois
| |
Collapse
|
10
|
Hu K, Hu JB, Tang L, Xiang J, Ma JL, Gao YY, Li HJ, Zhang Y. Predicting disease-related genes by path structure and community structure in protein–protein networks. JOURNAL OF STATISTICAL MECHANICS: THEORY AND EXPERIMENT 2018; 2018:100001. [DOI: 10.1088/1742-5468/aae02b] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
|
11
|
Eguchi R, Karim MB, Hu P, Sato T, Ono N, Kanaya S, Altaf-Ul-Amin M. An integrative network-based approach to identify novel disease genes and pathways: a case study in the context of inflammatory bowel disease. BMC Bioinformatics 2018; 19:264. [PMID: 30005591 PMCID: PMC6043997 DOI: 10.1186/s12859-018-2251-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Accepted: 06/18/2018] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND There are different and complicated associations between genes and diseases. Finding the causal associations between genes and specific diseases is still challenging. In this work we present a method to predict novel associations of genes and pathways with inflammatory bowel disease (IBD) by integrating information of differential gene expression, protein-protein interaction and known disease genes related to IBD. RESULTS We downloaded IBD gene expression data from NCBI's Gene Expression Omnibus, performed statistical analysis to determine differentially expressed genes, collected known IBD genes from DisGeNet database, which were used to construct a IBD related PPI network with HIPPIE database. We adapted our graph-based clustering algorithm DPClusO to cluster the disease PPI network. We evaluated the statistical significance of the identified clusters in the context of determining the richness of IBD genes using Fisher's exact test and predicted novel genes related to IBD. We showed 93.8% of our predictions are correct in the context of other databases and published literatures related to IBD. CONCLUSIONS Finding disease-causing genes is necessary for developing drugs with synergistic effect targeting many genes simultaneously. Here we present an approach to identify novel disease genes and pathways and discuss our approach in the context of IBD. The approach can be generalized to find disease-associated genes for other diseases.
Collapse
Affiliation(s)
- Ryohei Eguchi
- Graduate School of Science and Technology & NAIST Data Science Center, Nara Institute of Science and Technology, Nara, Japan
| | - Mohammand Bozlul Karim
- Graduate School of Science and Technology & NAIST Data Science Center, Nara Institute of Science and Technology, Nara, Japan
| | - Pingzhao Hu
- Department of Biochemistry and Medical Genetics, University of Manitoba, Winnipeg, Canada.,George and Fay Yee Centre for Healthcare Innovation, University of Manitoba, Winnipeg, Canada.,Department of Electrical and Computer Engineering, University of Manitoba, Winnipeg, Canada
| | - Tetsuo Sato
- Graduate School of Science and Technology & NAIST Data Science Center, Nara Institute of Science and Technology, Nara, Japan.,Department of Radiological Technology, Gunma Prefectural College of Health Sciences, Gunma, Japan
| | - Naoaki Ono
- Graduate School of Science and Technology & NAIST Data Science Center, Nara Institute of Science and Technology, Nara, Japan
| | - Shigehiko Kanaya
- Graduate School of Science and Technology & NAIST Data Science Center, Nara Institute of Science and Technology, Nara, Japan
| | - Md Altaf-Ul-Amin
- Graduate School of Science and Technology & NAIST Data Science Center, Nara Institute of Science and Technology, Nara, Japan.
| |
Collapse
|
12
|
Vandin F. Computational Methods for Characterizing Cancer Mutational Heterogeneity. Front Genet 2017; 8:83. [PMID: 28659971 PMCID: PMC5469877 DOI: 10.3389/fgene.2017.00083] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2017] [Accepted: 05/30/2017] [Indexed: 12/21/2022] Open
Abstract
Advances in DNA sequencing technologies have allowed the characterization of somatic mutations in a large number of cancer genomes at an unprecedented level of detail, revealing the extreme genetic heterogeneity of cancer at two different levels: inter-tumor, with different patients of the same cancer type presenting different collections of somatic mutations, and intra-tumor, with different clones coexisting within the same tumor. Both inter-tumor and intra-tumor heterogeneity have crucial implications for clinical practices. Here, we review computational methods that use somatic alterations measured through next-generation DNA sequencing technologies for characterizing tumor heterogeneity and its association with clinical variables. We first review computational methods for studying inter-tumor heterogeneity, focusing on methods that attempt to summarize cancer heterogeneity by discovering pathways that are commonly mutated across different patients of the same cancer type. We then review computational methods for characterizing intra-tumor heterogeneity using information from bulk sequencing data or from single cell sequencing data. Finally, we present some of the recent computational methodologies that have been proposed to identify and assess the association between inter- or intra-tumor heterogeneity with clinical variables.
Collapse
Affiliation(s)
- Fabio Vandin
- Department of Information Engineering, University of PadovaPadova, Italy
| |
Collapse
|
13
|
Shim JE, Lee T, Lee I. From sequencing data to gene functions: co-functional network approaches. Anim Cells Syst (Seoul) 2017; 21:77-83. [PMID: 30460054 PMCID: PMC6138336 DOI: 10.1080/19768354.2017.1284156] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2017] [Accepted: 01/15/2017] [Indexed: 01/04/2023] Open
Abstract
Advanced high-throughput sequencing technology accumulated massive amount of genomics and transcriptomics data in the public databases. Due to the high technical accessibility, DNA and RNA sequencing have huge potential for the study of gene functions in most species including animals and crops. A proven analytic platform to convert sequencing data to gene functional information is co-functional network. Because all genes exert their functions through interactions with others, network analysis is a legitimate way to study gene functions. The workflow of network-based functional study is composed of three steps: (i) inferencing co-functional links, (ii) evaluating and integrating the links into genome-scale networks, and (iii) generating functional hypotheses from the networks. Co-functional links can be inferred from DNA sequencing data by using phylogenetic profiling, gene neighborhood, domain profiling, associalogs, and co-expression analysis from RNA sequencing data. The inferred links are then evaluated and integrated into a genome-scale network with aid from gold-standard co-functional links. Functional hypotheses can be generated from the network based on (i) network connectivity, (ii) network propagation, and (iii) subnetwork analysis. The functional analysis pipeline described here requires only sequencing data which can be readily available for most species by next-generation sequencing technology. Therefore, co-functional networks will greatly potentiate the use of the sequencing data for the study of genetics in any cellular organism.
Collapse
Affiliation(s)
- Jung Eun Shim
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul, Korea
| | - Tak Lee
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul, Korea
| | - Insuk Lee
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul, Korea
| |
Collapse
|
14
|
Zhu G, Zhao XM, Wu J. A survey on biomarker identification based on molecular networks. QUANTITATIVE BIOLOGY 2016. [DOI: 10.1007/s40484-016-0084-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
15
|
Yan W, Xue W, Chen J, Hu G. Biological Networks for Cancer Candidate Biomarkers Discovery. Cancer Inform 2016; 15:1-7. [PMID: 27625573 PMCID: PMC5012434 DOI: 10.4137/cin.s39458] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2016] [Revised: 06/06/2016] [Accepted: 06/16/2016] [Indexed: 12/16/2022] Open
Abstract
Due to its extraordinary heterogeneity and complexity, cancer is often proposed as a model case of a systems biology disease or network disease. There is a critical need of effective biomarkers for cancer diagnosis and/or outcome prediction from system level analyses. Methods based on integrating omics data into networks have the potential to revolutionize the identification of cancer biomarkers. Deciphering the biological networks underlying cancer is undoubtedly important for understanding the molecular mechanisms of the disease and identifying effective biomarkers. In this review, the networks constructed for cancer biomarker discovery based on different omics level data are described and illustrated from recent advances in the field.
Collapse
Affiliation(s)
- Wenying Yan
- Center for Systems Biology, Soochow University, Suzhou, Jiangsu, China
| | - Wenjin Xue
- Department of Electrical Engineering, Technician College of Taizhou, Taizhou, Jiangsu, China
| | - Jiajia Chen
- School of Chemistry, Biology and Material Engineering, Suzhou University of Science and Technology, Suzhou, China
| | - Guang Hu
- Center for Systems Biology, Soochow University, Suzhou, Jiangsu, China
| |
Collapse
|
16
|
Lee JH, Zhao XM, Yoon I, Lee JY, Kwon NH, Wang YY, Lee KM, Lee MJ, Kim J, Moon HG, In Y, Hao JK, Park KM, Noh DY, Han W, Kim S. Integrative analysis of mutational and transcriptional profiles reveals driver mutations of metastatic breast cancers. Cell Discov 2016; 2:16025. [PMID: 27625789 PMCID: PMC5004232 DOI: 10.1038/celldisc.2016.25] [Citation(s) in RCA: 65] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2016] [Accepted: 06/21/2016] [Indexed: 12/11/2022] Open
Abstract
Despite the explosion in the numbers of cancer genomic studies, metastasis is still the major cause of cancer mortality. In breast cancer, approximately one-fifth of metastatic patients survive 5 years. Therefore, detecting the patients at a high risk of developing distant metastasis at first diagnosis is critical for effective treatment strategy. We hereby present a novel systems biology approach to identify driver mutations escalating the risk of metastasis based on both exome and RNA sequencing of our collected 78 normal-paired breast cancers. Unlike driver mutations occurring commonly in cancers as reported in the literature, the mutations detected here are relatively rare mutations occurring in less than half metastatic samples. By supposing that the driver mutations should affect the metastasis gene signatures, we develop a novel computational pipeline to identify the driver mutations that affect transcription factors regulating metastasis gene signatures. We identify driver mutations in ADPGK, NUP93, PCGF6, PKP2 and SLC22A5, which are verified to enhance cancer cell migration and prompt metastasis with in vitro experiments. The discovered somatic mutations may be helpful for identifying patients who are likely to develop distant metastasis.
Collapse
Affiliation(s)
- Ji-Hyun Lee
- Medicinal Bioconvergence Research Center, College of Pharmacy, Seoul National University, Seoul, Republic of Korea; Research Institute of Pharmaceutical Sciences, College of Pharmacy, Seoul National University, Seoul, Republic of Korea
| | - Xing-Ming Zhao
- Department of Computer Science and Technology, Tongji University , Shanghai, China
| | - Ina Yoon
- Medicinal Bioconvergence Research Center, College of Pharmacy, Seoul National University , Seoul, Republic of Korea
| | - Jin Young Lee
- Medicinal Bioconvergence Research Center, College of Pharmacy, Seoul National University , Seoul, Republic of Korea
| | - Nam Hoon Kwon
- Medicinal Bioconvergence Research Center, College of Pharmacy, Seoul National University , Seoul, Republic of Korea
| | - Yin-Ying Wang
- Department of Computer Science and Technology, Tongji University , Shanghai, China
| | - Kyung-Min Lee
- Department of Surgery, Seoul National University College of Medicine , Seoul, Republic of Korea
| | - Min-Joo Lee
- Department of Surgery, Seoul National University College of Medicine , Seoul, Republic of Korea
| | - Jisun Kim
- Department of Surgery, Seoul National University College of Medicine , Seoul, Republic of Korea
| | - Hyeong-Gon Moon
- Department of Surgery, Seoul National University College of Medicine , Seoul, Republic of Korea
| | - Yongho In
- Medicinal Bioconvergence Research Center, College of Pharmacy, Seoul National University , Seoul, Republic of Korea
| | - Jin-Kao Hao
- LERIA, University of Angers , Angers, France
| | - Kyung-Mii Park
- Research Institute of Pharmaceutical Sciences, College of Pharmacy, Seoul National University , Seoul, Republic of Korea
| | - Dong-Young Noh
- Department of Surgery, Seoul National University College of Medicine , Seoul, Republic of Korea
| | - Wonshik Han
- Department of Surgery, Seoul National University College of Medicine, Seoul, Republic of Korea; Cancer Research Institute, Seoul National University, Seoul, Republic of Korea
| | - Sunghoon Kim
- Medicinal Bioconvergence Research Center, College of Pharmacy, Seoul National University, Seoul, Republic of Korea; Department of Molecular Medicine and Biopharmaceutical Sciences, Seoul National University, Seoul, Republic of Korea
| |
Collapse
|
17
|
Frequent mutations in acetylation and ubiquitination sites suggest novel driver mechanisms of cancer. Genome Med 2016; 8:55. [PMID: 27175787 PMCID: PMC4864925 DOI: 10.1186/s13073-016-0311-2] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2015] [Accepted: 04/19/2016] [Indexed: 12/14/2022] Open
Abstract
Background Discovery of cancer drivers is a major goal of cancer research. Driver genes and pathways are often predicted using mutation frequency, assuming that statistically significant recurrence of specific somatic mutations across independent samples indicates their importance in cancer. However, many mutations, including known cancer drivers, are not observed at high frequency. Fortunately, abundant information is available about functional “active sites” in proteins that can be integrated with mutations to predict cancer driver genes, even based on low frequency mutations. Further, considering active site information predicts detailed biochemical mechanisms impacted by the mutations. Post-translational modifications (PTMs) are active sites that are regulatory switches in proteins and pathways. We analyzed acetylation and ubiquitination, two important PTM types often involved in chromatin organization and protein degradation, to find proteins that are significantly affected by tumor somatic mutations. Methods We performed computational analyses of acetylation and ubiquitination sites in a pan-cancer dataset of 3200 tumor samples from The Cancer Genome Atlas (TCGA). These analyses were targeted at different levels of biological organization including individual genes, pathway annotated gene sets, and protein-protein interaction networks. Results Acetylation and ubiquitination site mutations are enriched in cancer with significantly stronger evolutionary conservation and accumulation in protein domains. Gene-focused analysis with the ActiveDriver method reveals significant co-occurrences of acetylation and ubiquitination PTMs and mutation hotspots in known oncoproteins (TP53, AKT1, IDH1) and highlights candidate cancer driver genes with PTM-related mechanisms (e.g. several histone proteins and the splicing factor SF3B1). Pathway analysis shows that PTM mutations in acetylation and ubiquitination sites accumulate in cancer-related processes such as cell cycle, apoptosis, chromatin regulation, and metabolism. Integrated mutation analysis of clinical information and protein interaction networks suggests that many PTM-specific mutations associate with decreased patient survival. Conclusions Mutation analysis of acetylation and ubiquitination PTM sites reveals their importance in cancer. As PTM networks are increasingly mapped and related enzymes are often druggable, deeper investigation of specific associated mutations may lead to the discovery of treatment-relevant cellular mechanisms. Electronic supplementary material The online version of this article (doi:10.1186/s13073-016-0311-2) contains supplementary material, which is available to authorized users.
Collapse
|
18
|
Kowalski J, Dwivedi B, Newman S, Switchenko JM, Pauly R, Gutman DA, Arora J, Gandhi K, Ainslie K, Doho G, Qin Z, Moreno CS, Rossi MR, Vertino PM, Lonial S, Bernal-Mizrachi L, Boise LH. Gene integrated set profile analysis: a context-based approach for inferring biological endpoints. Nucleic Acids Res 2016; 44:e69. [PMID: 26826710 PMCID: PMC4838358 DOI: 10.1093/nar/gkv1503] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2015] [Accepted: 12/10/2015] [Indexed: 11/13/2022] Open
Abstract
The identification of genes with specific patterns of change (e.g. down-regulated and methylated) as phenotype drivers or samples with similar profiles for a given gene set as drivers of clinical outcome, requires the integration of several genomic data types for which an 'integrate by intersection' (IBI) approach is often applied. In this approach, results from separate analyses of each data type are intersected, which has the limitation of a smaller intersection with more data types. We introduce a new method, GISPA (Gene Integrated Set Profile Analysis) for integrated genomic analysis and its variation, SISPA (Sample Integrated Set Profile Analysis) for defining respective genes and samples with the context of similar, a priori specified molecular profiles. With GISPA, the user defines a molecular profile that is compared among several classes and obtains ranked gene sets that satisfy the profile as drivers of each class. With SISPA, the user defines a gene set that satisfies a profile and obtains sample groups of profile activity. Our results from applying GISPA to human multiple myeloma (MM) cell lines contained genes of known profiles and importance, along with several novel targets, and their further SISPA application to MM coMMpass trial data showed clinical relevance.
Collapse
Affiliation(s)
- Jeanne Kowalski
- Winship Cancer Institute, Emory University, Atlanta, GA 30333, USA Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA 30333, USA
| | - Bhakti Dwivedi
- Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA 30333, USA
| | - Scott Newman
- Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA 30333, USA
| | - Jeffery M Switchenko
- Winship Cancer Institute, Emory University, Atlanta, GA 30333, USA Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA 30333, USA
| | - Rini Pauly
- Winship Cancer Institute, Emory University, Atlanta, GA 30333, USA
| | - David A Gutman
- Department of Biomedical Informatics and Neurology, School of Medicine, Emory University, Atlanta, GA 30322, USA
| | - Jyoti Arora
- Winship Cancer Institute, Emory University, Atlanta, GA 30333, USA
| | - Khanjan Gandhi
- Department of Human Genetics, School of Medicine, Emory University, Atlanta, GA 30322, USA
| | - Kylie Ainslie
- Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA 30333, USA
| | - Gregory Doho
- Centers for Disease Control, Atlanta, GA 30322, USA
| | - Zhaohui Qin
- Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA 30333, USA Department of Biomedical Informatics and Neurology, School of Medicine, Emory University, Atlanta, GA 30322, USA
| | - Carlos S Moreno
- Winship Cancer Institute, Emory University, Atlanta, GA 30333, USA Department of Pathology and Laboratory Medicine, School of Medicine, Emory University, Atlanta, GA 30322, USA
| | - Michael R Rossi
- Winship Cancer Institute, Emory University, Atlanta, GA 30333, USA Department of Radiation Oncology, School of Medicine, Emory University, Atlanta, GA 30322, USA
| | - Paula M Vertino
- Winship Cancer Institute, Emory University, Atlanta, GA 30333, USA Department of Radiation Oncology, School of Medicine, Emory University, Atlanta, GA 30322, USA
| | - Sagar Lonial
- Winship Cancer Institute, Emory University, Atlanta, GA 30333, USA Department of Hematology and Medical Oncology, School of Medicine, Emory University, Atlanta, GA 30322, USA
| | - Leon Bernal-Mizrachi
- Winship Cancer Institute, Emory University, Atlanta, GA 30333, USA Department of Hematology and Medical Oncology, School of Medicine, Emory University, Atlanta, GA 30322, USA
| | - Lawrence H Boise
- Winship Cancer Institute, Emory University, Atlanta, GA 30333, USA Department of Hematology and Medical Oncology, School of Medicine, Emory University, Atlanta, GA 30322, USA
| |
Collapse
|
19
|
Jeanquartier F, Jean-Quartier C, Kotlyar M, Tokar T, Hauschild AC, Jurisica I, Holzinger A. Machine Learning for In Silico Modeling of Tumor Growth. LECTURE NOTES IN COMPUTER SCIENCE 2016. [DOI: 10.1007/978-3-319-50478-0_21] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
20
|
Creixell P, Reimand J, Haider S, Wu G, Shibata T, Vazquez M, Mustonen V, Gonzalez-Perez A, Pearson J, Sander C, Raphael BJ, Marks DS, Ouellette BFF, Valencia A, Bader GD, Boutros PC, Stuart JM, Linding R, Lopez-Bigas N, Stein LD. Pathway and network analysis of cancer genomes. Nat Methods 2015; 12:615-621. [PMID: 26125594 DOI: 10.1038/nmeth.3440] [Citation(s) in RCA: 230] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2015] [Accepted: 04/27/2015] [Indexed: 12/26/2022]
Abstract
Genomic information on tumors from 50 cancer types cataloged by the International Cancer Genome Consortium (ICGC) shows that only a few well-studied driver genes are frequently mutated, in contrast to many infrequently mutated genes that may also contribute to tumor biology. Hence there has been large interest in developing pathway and network analysis methods that group genes and illuminate the processes involved. We provide an overview of these analysis techniques and show where they guide mechanistic and translational investigations.
Collapse
Affiliation(s)
- Pau Creixell
- Cellular Signal Integration Group (C-SIG), Technical University of Denmark, Lyngby, Denmark
| | - Jüri Reimand
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
| | - Syed Haider
- Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada
| | - Guanming Wu
- Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada.,Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, USA
| | - Tatsuhiro Shibata
- Division of Cancer Genomics, National Cancer Center, Chuo-ku, Tokyo, Japan
| | - Miguel Vazquez
- Structural Biology and Biocomputing Programme, Spanish National Cancer Research Centre, Madrid, Spain
| | - Ville Mustonen
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | - Abel Gonzalez-Perez
- Research Unit on Biomedical Informatics, University Pompeu Fabra, Barcelona, Spain
| | - John Pearson
- Queensland Centre for Medical Genomics, Institute for Molecular Bioscience, University of Queensland, St. Lucia, Brisbane, Queensland, Australia
| | - Chris Sander
- Computational Biology Center, Memorial Sloan-Kettering Cancer Center, New York, NY, USA
| | - Benjamin J Raphael
- Department of Computer Science and Center for Computational Molecular Biology, Brown University, Providence, RI, USA
| | - Debora S Marks
- Department of Systems Biology, Harvard Medical School, Boston, MA USA
| | - B F Francis Ouellette
- Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada.,Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, Canada
| | - Alfonso Valencia
- Structural Biology and Biocomputing Programme, Spanish National Cancer Research Centre, Madrid, Spain
| | - Gary D Bader
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
| | - Paul C Boutros
- Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.,Department of Pharmacology and Toxicology, University of Toronto, Toronto, Ontario, Canada
| | - Joshua M Stuart
- Department of Biomolecular Engineering, University of California, Santa Cruz, California, USA.,Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California, USA
| | - Rune Linding
- Cellular Signal Integration Group (C-SIG), Technical University of Denmark, Lyngby, Denmark.,Biotech Research & Innovation Centre (BRIC), University of Copenhagen (UCPH), DK-2200 Copenhagen, Denmark
| | - Nuria Lopez-Bigas
- Research Unit on Biomedical Informatics, University Pompeu Fabra, Barcelona, Spain.,Institució Catalana de Recerca i Estudis Avançats, Barcelona, Spain
| | - Lincoln D Stein
- Informatics and Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Ontario, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | | |
Collapse
|
21
|
|
22
|
Laderas T, Wu G, Mcweeney S. Between pathways and networks lies context: implications for precision medicine. Sci Prog 2015; 98:253-63. [PMID: 26601340 PMCID: PMC10365530 DOI: 10.3184/003685015x14368898634462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Precision medicine, broadly defined as considering individual variability in genes, environment, and lifestyle for each person in disease prevention and selection of suitable medical intervention, shows strong promise in the treatment of cancer Selecting therapies is complicated by multiple routes to gene dysregulation, which manifest in the individual patient within the many different types of genomic measurements. Additionally, multiple mutations exist in patients, aphenomenon known as oncogenic collaboration, which further complicates the selection of therapy. In this article, we discuss current approaches using biological pathways and networks to unify the many types of OMICs data. We argue that a contextual approach combining cancer pathways and networks could lead to a proper understanding of the biology of this significant disease.
Collapse
Affiliation(s)
- Ted Laderas
- OHSU Knight Cancer Institute Oregon Health & Science University, Portland, Oregon, USA
| | | | - Shannon Mcweeney
- Biostatistics and genetics to develop approaches to solve research bottlenecks, US National Academy of Sciences for her contributions
| |
Collapse
|
23
|
Kelder T, Verschuren L, van Ommen B, van Gool AJ, Radonjic M. Network signatures link hepatic effects of anti-diabetic interventions with systemic disease parameters. BMC SYSTEMS BIOLOGY 2014; 8:108. [PMID: 25204982 PMCID: PMC4363943 DOI: 10.1186/s12918-014-0108-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/29/2014] [Accepted: 08/29/2014] [Indexed: 11/10/2022]
Abstract
Background Multifactorial diseases such as type 2 diabetes mellitus (T2DM), are driven by a complex network of interconnected mechanisms that translate to a diverse range of complications at the physiological level. To optimally treat T2DM, pharmacological interventions should, ideally, target key nodes in this network that act as determinants of disease progression. Results We set out to discover key nodes in molecular networks based on the hepatic transcriptome dataset from a preclinical study in obese LDLR-/- mice recently published by Radonjic et al. Here, we focus on comparing efficacy of anti-diabetic dietary (DLI) and two drug treatments, namely PPARA agonist fenofibrate and LXR agonist T0901317. By combining knowledge-based and data-driven networks with a random walks based algorithm, we extracted network signatures that link the DLI and two drug interventions to dyslipidemia-related disease parameters. Conclusions This study identified specific and prioritized sets of key nodes in hepatic molecular networks underlying T2DM, uncovering pathways that are to be modulated by targeted T2DM drug interventions in order to modulate the complex disease phenotype.
Collapse
Affiliation(s)
- Thomas Kelder
- TNO, Research Group Microbiology & Systems Biology, Zeist, The Netherlands. .,Current address: EdgeLeap B.V, Utrecht, The Netherlands.
| | - Lars Verschuren
- TNO, Research Group Microbiology & Systems Biology, Zeist, The Netherlands.
| | - Ben van Ommen
- TNO, Research Group Microbiology & Systems Biology, Zeist, The Netherlands.
| | - Alain J van Gool
- TNO, Research Group Microbiology & Systems Biology, Zeist, The Netherlands. .,Department of Laboratory Medicine, Radboud University Nijmegen Medical Center, Nijmegen, The Netherlands. .,Faculty of Physics, Mathematics and Informatics, Radboud University Nijmegen, Nijmegen, The Netherlands.
| | - Marijana Radonjic
- TNO, Research Group Microbiology & Systems Biology, Zeist, The Netherlands. .,Current address: EdgeLeap B.V, Utrecht, The Netherlands.
| |
Collapse
|
24
|
Fang H, Gough J. The 'dnet' approach promotes emerging research on cancer patient survival. Genome Med 2014; 6:64. [PMID: 25246945 PMCID: PMC4160547 DOI: 10.1186/s13073-014-0064-8] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2014] [Accepted: 08/15/2014] [Indexed: 12/20/2022] Open
Abstract
We present the 'dnet' package and apply it to the 'TCGA' mutation and clinical data of >3,000 patients. We uncover the existence of an underlying gene network that at least partially controls cancer 'survivalness', with mutations that are significantly correlated with patient survival, yet independent of tumour origin and type. The survivalness network has natural community structure corresponding to tumour hallmarks, and contains genes that are potentially druggable in the clinic. This network has evolutionary roots in Deuterostomia identifying PTK2 and VAV1 as under-valued relative to more studied genes from that era. The 'dnet' R package is available at http://cran.r-project.org/package=dnet.
Collapse
Affiliation(s)
- Hai Fang
- Computational Genomics Group, Department of Computer Science, University of Bristol, The Merchant Venturers Building, Bristol, BS8 1UB UK
| | - Julian Gough
- Computational Genomics Group, Department of Computer Science, University of Bristol, The Merchant Venturers Building, Bristol, BS8 1UB UK
| |
Collapse
|
25
|
Chen YA, Eschrich SA. Computational methods and opportunities for phosphorylation network medicine. Transl Cancer Res 2014; 3:266-278. [PMID: 25530950 PMCID: PMC4271781] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Protein phosphorylation, one of the most ubiquitous post-translational modifications (PTM) of proteins, is known to play an essential role in cell signaling and regulation. With the increasing understanding of the complexity and redundancy of cell signaling, there is a growing recognition that targeting the entire network or system could be a necessary and advantageous strategy for treating cancer. Protein kinases, the proteins that add a phosphate group to the substrate proteins during phosphorylation events, have become one of the largest groups of 'druggable' targets in cancer therapeutics in recent years. Kinase inhibitors are being regularly used in clinics for cancer treatment. This therapeutic paradigm shift in cancer research is partly due to the generation and availability of high-dimensional proteomics data. Generation of this data, in turn, is enabled by increased use of mass-spectrometry (MS)-based or other high-throughput proteomics platforms as well as companion public databases and computational tools. This review briefly summarizes the current state and progress on phosphoproteomics identification, quantification, and platform related characteristics. We review existing database resources, computational tools, methods for phosphorylation network inference, and ultimately demonstrate the connection to therapeutics. Finally, many research opportunities exist for bioinformaticians or biostatisticians based on developments and limitations of the current and emerging technologies.
Collapse
Affiliation(s)
- Yian Ann Chen
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, 12902 Magnolia Drive Tampa, FL 33612, USA
| | - Steven A Eschrich
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, 12902 Magnolia Drive Tampa, FL 33612, USA
| |
Collapse
|