1
|
Perik-Zavodskii R, Perik-Zavodskaia O, Shevchenko J, Volynets M, Alrhmoun S, Nazarov K, Denisova V, Sennikov S. A subpopulation of human bone marrow erythroid cells displays a myeloid gene expression signature similar to that of classic monocytes. PLoS One 2024; 19:e0305816. [PMID: 39038020 PMCID: PMC11262679 DOI: 10.1371/journal.pone.0305816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Accepted: 06/05/2024] [Indexed: 07/24/2024] Open
Abstract
Erythroid cells, serving as progenitors and precursors to erythrocytes responsible for oxygen transport, were shown to exhibit an immunosuppressive and immunoregulatory phenotype. Previous investigations from our research group have revealed an antimicrobial gene expression profile within murine bone marrow erythroid cells which suggested a role for erythroid cells in innate immunity. In the present study, we focused on elucidating the characteristics of human bone marrow erythroid cells through comprehensive analyses, including NanoString gene expression profiling utilizing the Immune Response V2 panel, a BioPlex examination of chemokine and TGF-beta family proteins secretion, and analysis of publicly available single-cell RNA-seq data. Our findings demonstrate that an erythroid cell subpopulation manifests a myeloid-like gene expression signature comprised of antibacterial immunity and neutrophil chemotaxis genes which suggests an involvement of human erythroid cells in the innate immunity. Furthermore, we found that human erythroid cells secreted CCL22, CCL24, CXCL5, CXCL8, and MIF chemokines. The ability of human erythroid cells to express these chemokines might facilitate the restriction of immune cells in the bone marrow under normal conditions or contribute to the ability of erythroid cells to induce local immunosuppression by recruiting immune cells in their immediate vicinity in case of extramedullary hematopoiesis.
Collapse
Affiliation(s)
- Roman Perik-Zavodskii
- Laboratory of Molecular Immunology, Federal State Budgetary Scientific Institution Research Institute of Fundamental and Clinical Immunology, Novosibirsk, Russia
| | - Olga Perik-Zavodskaia
- Laboratory of Molecular Immunology, Federal State Budgetary Scientific Institution Research Institute of Fundamental and Clinical Immunology, Novosibirsk, Russia
| | - Julia Shevchenko
- Laboratory of Molecular Immunology, Federal State Budgetary Scientific Institution Research Institute of Fundamental and Clinical Immunology, Novosibirsk, Russia
| | - Marina Volynets
- Laboratory of Molecular Immunology, Federal State Budgetary Scientific Institution Research Institute of Fundamental and Clinical Immunology, Novosibirsk, Russia
- Department of Natural Sciences, Novosibirsk State University, Novosibirsk, Russia
| | - Saleh Alrhmoun
- Laboratory of Molecular Immunology, Federal State Budgetary Scientific Institution Research Institute of Fundamental and Clinical Immunology, Novosibirsk, Russia
- Department of Natural Sciences, Novosibirsk State University, Novosibirsk, Russia
| | - Kirill Nazarov
- Laboratory of Molecular Immunology, Federal State Budgetary Scientific Institution Research Institute of Fundamental and Clinical Immunology, Novosibirsk, Russia
| | - Vera Denisova
- Clinic of Immunopathology, Federal State Budgetary Scientific Institution Research Institute of Fundamental and Clinical Immunology, Novosibirsk, Russia
| | - Sergey Sennikov
- Laboratory of Molecular Immunology, Federal State Budgetary Scientific Institution Research Institute of Fundamental and Clinical Immunology, Novosibirsk, Russia
| |
Collapse
|
2
|
Daniels DE, Ferrer-Vicens I, Hawksworth J, Andrienko TN, Finnie EM, Bretherton NS, Ferguson DCJ, Oliveira ASF, Szeto JYA, Wilson MC, Brewin JN, Frayne J. Human cellular model systems of β-thalassemia enable in-depth analysis of disease phenotype. Nat Commun 2023; 14:6260. [PMID: 37803026 PMCID: PMC10558456 DOI: 10.1038/s41467-023-41961-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2022] [Accepted: 09/26/2023] [Indexed: 10/08/2023] Open
Abstract
β-thalassemia is a prevalent genetic disorder causing severe anemia due to defective erythropoiesis, with few treatment options. Studying the underlying molecular defects is impeded by paucity of suitable patient material. In this study we create human disease cellular model systems for β-thalassemia by gene editing the erythroid line BEL-A, which accurately recapitulate the phenotype of patient erythroid cells. We also develop a high throughput compatible fluorometric-based assay for evaluating severity of disease phenotype and utilize the assay to demonstrate that the lines respond appropriately to verified reagents. We next use the lines to perform extensive analysis of the altered molecular mechanisms in β-thalassemia erythroid cells, revealing upregulation of a wide range of biological pathways and processes along with potential novel targets for therapeutic investigation. Overall, the lines provide a sustainable supply of disease cells as research tools for identifying therapeutic targets and as screening platforms for new drugs and reagents.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Jenn-Yeu A Szeto
- School of Biochemistry, University of Bristol, Bristol, BS8 1TD, UK
| | | | - John N Brewin
- Haematology Department, King's college Hospital NHS Foundation, London, SE5 9RS, UK
- Red Cell Biology Group, Kings College London, London, SE5 9NU, UK
| | - Jan Frayne
- School of Biochemistry, University of Bristol, Bristol, BS8 1TD, UK.
| |
Collapse
|
3
|
Oh S, Song S. Differential Gene Expression (DEX) and Alternative Splicing Events (ASE) for Temporal Dynamic Processes Using HMMs and Hierarchical Bayesian Modeling Approaches. Methods Mol Biol 2017; 1552:165-176. [PMID: 28224498 DOI: 10.1007/978-1-4939-6753-7_12] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
In gene expression profile, data analysis pipeline is categorized into four levels, major downstream tasks, i.e., (1) identification of differential expression; (2) clustering co-expression patterns; (3) classification of subtypes of samples; and (4) detection of genetic regulatory networks, are performed posterior to preprocessing procedure such as normalization techniques. To be more specific, temporal dynamic gene expression data has its inherent feature, namely, two neighboring time points (previous and current state) are highly correlated with each other, compared to static expression data which samples are assumed as independent individuals. In this chapter, we demonstrate how HMMs and hierarchical Bayesian modeling methods capture the horizontal time dependency structures in time series expression profiles by focusing on the identification of differential expression. In addition, those differential expression genes and transcript variant isoforms over time detected in core prerequisite steps can be generally further applied in detection of genetic regulatory networks to comprehensively uncover dynamic repertoires in the aspects of system biology as the coupled framework.
Collapse
Affiliation(s)
- Sunghee Oh
- Department of Computer Science and Statistics, Jeju National University, Jeju City, 690-756, South Korea.
| | - Seongho Song
- Department of Mathematical Science, University of Cincinnati, Cincinnati, OH, 45221-0025, USA
| |
Collapse
|
4
|
Zhu F, Panwar B, Dodge HH, Li H, Hampstead BM, Albin RL, Paulson HL, Guan Y. COMPASS: A computational model to predict changes in MMSE scores 24-months after initial assessment of Alzheimer's disease. Sci Rep 2016; 6:34567. [PMID: 27703197 PMCID: PMC5050516 DOI: 10.1038/srep34567] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2016] [Accepted: 09/15/2016] [Indexed: 12/26/2022] Open
Abstract
We present COMPASS, a COmputational Model to Predict the development of Alzheimer’s diSease Spectrum, to model Alzheimer’s disease (AD) progression. This was the best-performing method in recent crowdsourcing benchmark study, DREAM Alzheimer’s Disease Big Data challenge to predict changes in Mini-Mental State Examination (MMSE) scores over 24-months using standardized data. In the present study, we conducted three additional analyses beyond the DREAM challenge question to improve the clinical contribution of our approach, including: (1) adding pre-validated baseline cognitive composite scores of ADNI-MEM and ADNI-EF, (2) identifying subjects with significant declines in MMSE scores, and (3) incorporating SNPs of top 10 genes connected to APOE identified from functional-relationship network. For (1) above, we significantly improved predictive accuracy, especially for the Mild Cognitive Impairment (MCI) group. For (2), we achieved an area under ROC of 0.814 in predicting significant MMSE decline: our model has 100% precision at 5% recall, and 91% accuracy at 10% recall. For (3), “genetic only” model has Pearson’s correlation of 0.15 to predict progression in the MCI group. Even though addition of this limited genetic model to COMPASS did not improve prediction of progression of MCI group, the predictive ability of SNP information extended beyond well-known APOE allele.
Collapse
Affiliation(s)
- Fan Zhu
- Department of Computational Medicine and Bioinformatics, University of Michigan, USA
| | - Bharat Panwar
- Department of Computational Medicine and Bioinformatics, University of Michigan, USA
| | - Hiroko H Dodge
- Department of Neurology and Michigan Alzheimer's Disease Center, University of Michigan, USA.,Department of Neurology and Layton Aging and Alzheimer's Disease Center, Oregon Health &Science University, USA
| | - Hongdong Li
- Department of Computational Medicine and Bioinformatics, University of Michigan, USA
| | - Benjamin M Hampstead
- Department of Psychiatry, University of Michigan, USA.,Mental Health Service, VA Ann Arbor Healthcare System, USA
| | - Roger L Albin
- Department of Neurology and Michigan Alzheimer's Disease Center, University of Michigan, USA.,Neurology Service &Geriatric Research Education and Clinical Centers, VA Ann Arbor Healthcare System, USA.,University of Michigan Udall Center, USA
| | - Henry L Paulson
- Department of Neurology and Michigan Alzheimer's Disease Center, University of Michigan, USA
| | - Yuanfang Guan
- Department of Computational Medicine and Bioinformatics, University of Michigan, USA.,Departments of Internal Medicine and Human Genetics, and School of Public Health, University of Michigan, USA.,Departments of Internal Medicine and of Electrical Engineering and Computer Science, University of Michigan, USA
| |
Collapse
|
5
|
Guan Y, Martini S, Mariani LH. Genes Caught In Flagranti: Integrating Renal Transcriptional Profiles With Genotypes and Phenotypes. Semin Nephrol 2016. [PMID: 26215861 DOI: 10.1016/j.semnephrol.2015.04.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
In the past decade, population genetics has gained tremendous success in identifying genetic variations that are statistically relevant to renal diseases and kidney function. However, it is challenging to interpret the functional relevance of the genetic variations found by population genetics studies. In this review, we discuss studies that integrate multiple levels of data, especially transcriptome profiles and phenotype data, to assign functional roles of genetic variations involved in kidney function. Furthermore, we introduce state-of-the-art machine learning algorithms, Bayesian networks, support vector machines, and Gaussian process regression, which have been applied successfully to integrating genetic, regulatory, and clinical information to predict clinical outcomes. These methods are likely to be deployed successfully in the nephrology field in the near future.
Collapse
Affiliation(s)
- Yuanfang Guan
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI; Department of Internal Medicine, University of Michigan, Ann Arbor, MI; Department of Computer Science and Engineering, University of Michigan, Ann Arbor, MI
| | - Sebastian Martini
- Department of Internal Medicine, University of Michigan, Ann Arbor, MI; Nephrologisches Zentrum, Medizinische Klinik und Poliklinik IV, Klinikum der Universität München, Ludwig-Maximilians-University Munich, Munich, Germany
| | - Laura H Mariani
- Department of Internal Medicine, University of Michigan, Ann Arbor, MI
| |
Collapse
|
6
|
Li HD, Omenn GS, Guan Y. A proteogenomic approach to understand splice isoform functions through sequence and expression-based computational modeling. Brief Bioinform 2016; 17:1024-1031. [PMID: 26740460 DOI: 10.1093/bib/bbv109] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Revised: 11/03/2015] [Indexed: 01/23/2023] Open
Abstract
The products of multi-exon genes are a mixture of alternatively spliced isoforms, from which the translated proteins can have similar, different or even opposing functions. It is therefore essential to differentiate and annotate functions for individual isoforms. Computational approaches provide an efficient complement to expensive and time-consuming experimental studies. The input data of these methods range from DNA sequence, to RNA selection pressure, to expressed sequence tags, to full-length complementary DNA, to exon array, to RNA-seq expression, to proteomic data. Notably, RNA-seq technology generates quantitative profiling of transcript expression at the genome scale, with an unprecedented amount of expression data available for developing isoform function prediction methods. Integrative analysis of these data at different molecular levels enables a proteogenomic approach to systematically interrogate isoform functions. Here, we briefly review the state-of-the-art methods according to their input data sources, discuss their advantages and limitations and point out potential ways to improve prediction accuracies.
Collapse
|
7
|
Predicting Anticancer Drug Responses Using a Dual-Layer Integrated Cell Line-Drug Network Model. PLoS Comput Biol 2015; 11:e1004498. [PMID: 26418249 PMCID: PMC4587957 DOI: 10.1371/journal.pcbi.1004498] [Citation(s) in RCA: 123] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2014] [Accepted: 08/10/2015] [Indexed: 01/22/2023] Open
Abstract
The ability to predict the response of a cancer patient to a therapeutic agent is a major goal in modern oncology that should ultimately lead to personalized treatment. Existing approaches to predicting drug sensitivity rely primarily on profiling of cancer cell line panels that have been treated with different drugs and selecting genomic or functional genomic features to regress or classify the drug response. Here, we propose a dual-layer integrated cell line-drug network model, which uses both cell line similarity network (CSN) data and drug similarity network (DSN) data to predict the drug response of a given cell line using a weighted model. Using the Cancer Cell Line Encyclopedia (CCLE) and Cancer Genome Project (CGP) studies as benchmark datasets, our single-layer model with CSN or DSN and only a single parameter achieved a prediction performance comparable to the previously generated elastic net model. When using the dual-layer model integrating both CSN and DSN, our predicted response reached a 0.6 Pearson correlation coefficient with observed responses for most drugs, which is significantly better than the previous results using the elastic net model. We have also applied the dual-layer cell line-drug integrated network model to fill in the missing drug response values in the CGP dataset. Even though the dual-layer integrated cell line-drug network model does not specifically model mutation information, it correctly predicted that BRAF mutant cell lines would be more sensitive than BRAF wild-type cell lines to three MEK1/2 inhibitors tested. In this study, using the Cancer Cell Line Encyclopedia (CCLE) and Cancer Genome Project (CGP) studies as benchmark datasets, we explored the application of similarity information between cell lines and drugs in drug response prediction. We found that similar cell lines by gene expression profiles exhibit similar response to the same drug. Meanwhile, drugs with similar chemical structures also show similar inhibitory effects across different cell lines. Based on the above observations, we proposed a dual-layer network and local weighted model to predict drug response of a cell line using proximal information of the drug-cell line network. The only three parameters of our model are optimized by leave-one-out cross-validation for each drug. Two case studies of MAPK and ERK signal pathways on CCLE dataset proved that the predicted-to-observed correlations of our dual-layer network model is significantly better than the previous predictor using elastic net model. Interestingly, predictions based on drug similarity network (DSN) alone were much better than those based on cell line similarity network (CSN) alone for most drugs, implying that drug similarities are more informative for drug response prediction than cell line similarities. Our network model can be applied to predict the response of a new cell line to existing already tested drugs or to predict the response of an existing cell line to new drugs, thus potentially saving the cost in a drug-cell line screening.
Collapse
|
8
|
Li HD, Menon R, Govindarajoo B, Panwar B, Zhang Y, Omenn GS, Guan Y. Functional Networks of Highest-Connected Splice Isoforms: From The Chromosome 17 Human Proteome Project. J Proteome Res 2015. [PMID: 26216192 DOI: 10.1021/acs.jproteome.5b00494] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Alternative splicing allows a single gene to produce multiple transcript-level splice isoforms from which the translated proteins may show differences in their expression and function. Identifying the major functional or canonical isoform is important for understanding gene and protein functions. Identification and characterization of splice isoforms is a stated goal of the HUPO Human Proteome Project and of neXtProt. Multiple efforts have catalogued splice isoforms as "dominant", "principal", or "major" isoforms based on expression or evolutionary traits. In contrast, we recently proposed highest connected isoforms (HCIs) as a new class of canonical isoforms that have the strongest interactions in a functional network and revealed their significantly higher (differential) transcript-level expression compared to nonhighest connected isoforms (NCIs) regardless of tissues/cell lines in the mouse. HCIs and their expression behavior in the human remain unexplored. Here we identified HCIs for 6157 multi-isoform genes using a human isoform network that we constructed by integrating a large compendium of heterogeneous genomic data. We present examples for pairs of transcript isoforms of ABCC3, RBM34, ERBB2, and ANXA7. We found that functional networks of isoforms of the same gene can show large differences. Interestingly, differential expression between HCIs and NCIs was also observed in the human on an independent set of 940 RNA-seq samples across multiple tissues, including heart, kidney, and liver. Using proteomic data from normal human retina and placenta, we showed that HCIs are a promising indicator of expressed protein isoforms exemplified by NUDFB6 and M6PR. Furthermore, we found that a significant percentage (20%, p = 0.0003) of human and mouse HCIs are homologues, suggesting their conservation between species. Our identified HCIs expand the repertoire of canonical isoforms and are expected to facilitate studying main protein products, understanding gene regulation, and possibly evolution. The network is available through our web server as a rich resource for investigating isoform functional relationships (http://guanlab.ccmb.med.umich.edu/hisonet). All MS/MS data were available at ProteomeXchange Web site (http://www.proteomexchange.org) through their identifiers (retina: PXD001242, placenta: PXD000754).
Collapse
Affiliation(s)
- Hong-Dong Li
- Department of Computational Medicine and Bioinformatics, ‡Department of Internal Medicine, §Department of Human Genetics and School of Public Health, ∥Department of Electrical Engineering and Computer Science University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Rajasree Menon
- Department of Computational Medicine and Bioinformatics, ‡Department of Internal Medicine, §Department of Human Genetics and School of Public Health, ∥Department of Electrical Engineering and Computer Science University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Brandon Govindarajoo
- Department of Computational Medicine and Bioinformatics, ‡Department of Internal Medicine, §Department of Human Genetics and School of Public Health, ∥Department of Electrical Engineering and Computer Science University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Bharat Panwar
- Department of Computational Medicine and Bioinformatics, ‡Department of Internal Medicine, §Department of Human Genetics and School of Public Health, ∥Department of Electrical Engineering and Computer Science University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, ‡Department of Internal Medicine, §Department of Human Genetics and School of Public Health, ∥Department of Electrical Engineering and Computer Science University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Gilbert S Omenn
- Department of Computational Medicine and Bioinformatics, ‡Department of Internal Medicine, §Department of Human Genetics and School of Public Health, ∥Department of Electrical Engineering and Computer Science University of Michigan , Ann Arbor, Michigan 48109, United States
| | - Yuanfang Guan
- Department of Computational Medicine and Bioinformatics, ‡Department of Internal Medicine, §Department of Human Genetics and School of Public Health, ∥Department of Electrical Engineering and Computer Science University of Michigan , Ann Arbor, Michigan 48109, United States
| |
Collapse
|
9
|
Zhu F, Panwar B, Guan Y. Algorithms for modeling global and context-specific functional relationship networks. Brief Bioinform 2015; 17:686-95. [PMID: 26254431 DOI: 10.1093/bib/bbv065] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2015] [Indexed: 02/07/2023] Open
Abstract
Functional genomics has enormous potential to facilitate our understanding of normal and disease-specific physiology. In the past decade, intensive research efforts have been focused on modeling functional relationship networks, which summarize the probability of gene co-functionality relationships. Such modeling can be based on either expression data only or heterogeneous data integration. Numerous methods have been deployed to infer the functional relationship networks, while most of them target the global (non-context-specific) functional relationship networks. However, it is expected that functional relationships consistently reprogram under different tissues or biological processes. Thus, advanced methods have been developed targeting tissue-specific or developmental stage-specific networks. This article brings together the state-of-the-art functional relationship network modeling methods, emphasizes the need for heterogeneous genomic data integration and context-specific network modeling and outlines future directions for functional relationship networks.
Collapse
|
10
|
Li HD, Omenn GS, Guan Y. MIsoMine: a genome-scale high-resolution data portal of expression, function and networks at the splice isoform level in the mouse. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2015; 2015:bav045. [PMID: 25953081 PMCID: PMC4423410 DOI: 10.1093/database/bav045] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/19/2015] [Accepted: 04/15/2015] [Indexed: 12/22/2022]
Abstract
Products of multiexon genes, especially in higher organisms, are a mixture of isoforms with different or even opposing functions, and therefore need to be treated separately. However, most studies and available resources such as Gene Ontology provide only gene-level function annotations, and therefore lose the differential information at the isoform level. Here we report MIsoMine, a high-resolution portal to multiple levels of functional information of alternatively spliced isoforms in the mouse. This data portal provides tissue-specific expression patterns and co-expression networks, along with such previously published functional genomic data as protein domains, predicted isoform-level functions and functional relationships. The core utility of MIsoMine is allowing users to explore a preprocessed, quality-controlled set of RNA-seq data encompassing diverse tissues and cell lineages. Tissue-specific co-expression networks were established, allowing a 2D ranking of isoforms and tissues by co-expression patterns. The results of the multiple isoforms of the same gene are presented in parallel to facilitate direct comparison, with cross-talking to prioritized functions at the isoform level. MIsoMine provides the first isoform-level resolution effort at genome-scale. We envision that this data portal will be a valuable resource for exploring functional genomic data, and will complement the existing functionalities of the mouse genome informatics database and the gene expression database for the laboratory mouse. Database URL: http://guanlab.ccmb.med.umich.edu/misomine/
Collapse
Affiliation(s)
- Hong-Dong Li
- Department of Computational Medicine and Bioinformatics, Department of Internal Medicine and Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109, USA
| | - Gilbert S Omenn
- Department of Computational Medicine and Bioinformatics, Department of Internal Medicine and Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109, USA Department of Computational Medicine and Bioinformatics, Department of Internal Medicine and Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109, USA
| | - Yuanfang Guan
- Department of Computational Medicine and Bioinformatics, Department of Internal Medicine and Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109, USA Department of Computational Medicine and Bioinformatics, Department of Internal Medicine and Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109, USA Department of Computational Medicine and Bioinformatics, Department of Internal Medicine and Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
11
|
Zhu F, Shi L, Engel JD, Guan Y. Regulatory network inferred using expression data of small sample size: application and validation in erythroid system. Bioinformatics 2015; 31:2537-44. [PMID: 25840044 DOI: 10.1093/bioinformatics/btv186] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2014] [Accepted: 03/27/2015] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Modeling regulatory networks using expression data observed in a differentiation process may help identify context-specific interactions. The outcome of the current algorithms highly depends on the quality and quantity of a single time-course dataset, and the performance may be compromised for datasets with a limited number of samples. RESULTS In this work, we report a multi-layer graphical model that is capable of leveraging many publicly available time-course datasets, as well as a cell lineage-specific data with small sample size, to model regulatory networks specific to a differentiation process. First, a collection of network inference methods are used to predict the regulatory relationships in individual public datasets. Then, the inferred directional relationships are weighted and integrated together by evaluating against the cell lineage-specific dataset. To test the accuracy of this algorithm, we collected a time-course RNA-Seq dataset during human erythropoiesis to infer regulatory relationships specific to this differentiation process. The resulting erythroid-specific regulatory network reveals novel regulatory relationships activated in erythropoiesis, which were further validated by genome-wide TR4 binding studies using ChIP-seq. These erythropoiesis-specific regulatory relationships were not identifiable by single dataset-based methods or context-independent integrations. Analysis of the predicted targets reveals that they are all closely associated with hematopoietic lineage differentiation.
Collapse
Affiliation(s)
- Fan Zhu
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Lihong Shi
- State Key Laboratory of Experimental Hematology, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China
| | | | - Yuanfang Guan
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA, Department of Internal Medicine, and Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|