1
|
Su C, Lee D, Jin P, Zhang J. scMultiMap: Cell-type-specific mapping of enhancers and target genes from single-cell multimodal data. Nat Commun 2025; 16:3941. [PMID: 40287418 PMCID: PMC12033308 DOI: 10.1038/s41467-025-59306-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2024] [Accepted: 04/15/2025] [Indexed: 04/29/2025] Open
Abstract
Mapping enhancers and target genes in disease-related cell types provides critical insights into the functional mechanisms of genome-wide association studies (GWAS) variants. Single-cell multimodal data, which measure gene expression and chromatin accessibility in the same cells, enable the cell-type-specific inference of enhancer-gene pairs. However, this task is challenged by high data sparsity, sequencing depth variation, and the computational burden of analyzing a large number of pairs. We introduce scMultiMap, a statistical method that infers enhancer-gene association from sparse multimodal counts using a joint latent-variable model. It adjusts for technical confounding, permits fast moment-based estimation and provides analytically derived p-values. In blood and brain data, scMultiMap shows appropriate type I error control, high statistical power, and computational efficiency (1% of existing methods). When applied to Alzheimer's disease (AD) data, scMultiMap gives the highest heritability enrichment in microglia and reveals insights into the regulatory mechanisms of AD GWAS variants.
Collapse
Affiliation(s)
- Chang Su
- Department of Biostatistics and Bioinformatics, Emory University, Atlanta, GA, USA.
- Department of Human Genetics, School of Medicine, Emory University, Atlanta, GA, USA.
| | - Dongsoo Lee
- Department of Biostatistics and Bioinformatics, Emory University, Atlanta, GA, USA
| | - Peng Jin
- Department of Human Genetics, School of Medicine, Emory University, Atlanta, GA, USA
| | - Jingfei Zhang
- Information Systems and Operations Management, Emory University, Atlanta, GA, USA.
| |
Collapse
|
2
|
Chevalley M, Roohani YH, Mehrjou A, Leskovec J, Schwab P. A large-scale benchmark for network inference from single-cell perturbation data. Commun Biol 2025; 8:412. [PMID: 40069299 PMCID: PMC11897147 DOI: 10.1038/s42003-025-07764-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2024] [Accepted: 02/18/2025] [Indexed: 03/15/2025] Open
Abstract
Mapping biological mechanisms in cellular systems is a fundamental step in early-stage drug discovery that serves to generate hypotheses on what disease-relevant molecular targets may effectively be modulated by pharmacological interventions. With the advent of high-throughput methods for measuring single-cell gene expression under genetic perturbations, we now have effective means for generating evidence for causal gene-gene interactions at scale. However, evaluating the performance of network inference methods in real-world environments is challenging due to the lack of ground-truth knowledge. Moreover, traditional evaluations conducted on synthetic datasets do not reflect the performance in real-world systems. We thus introduce CausalBench, a benchmark suite revolutionizing network inference evaluation with real-world, large-scale single-cell perturbation data. CausalBench, distinct from existing benchmarks, offers biologically-motivated metrics and distribution-based interventional measures, providing a more realistic evaluation of network inference methods. An initial systematic evaluation of state-of-the-art causal inference methods using our CausalBench suite highlights how poor scalability of existing methods limits performance. Moreover, methods that use interventional information do not outperform those that only use observational data, contrary to what is observed on synthetic benchmarks. CausalBench subsequently enables the development of numerous promising methods through a community challenge, thus demonstrating its potential as a transformative tool in the field of computational biology, bridging the gap between theoretical innovation and practical application in drug discovery and disease understanding. Thus, CausalBench opens new avenues for method developers in causal network inference research, and provides to practitioners a principled and reliable way to track progress in network methods for real-world interventional data.
Collapse
Affiliation(s)
| | - Yusuf H Roohani
- GSK.ai, Zug, Switzerland
- Stanford University, Stanford, CA, USA
| | | | | | | |
Collapse
|
3
|
Chen L, Dautle M, Gao R, Zhang S, Chen Y. Inferring gene regulatory networks from time-series scRNA-seq data via GRANGER causal recurrent autoencoders. Brief Bioinform 2025; 26:bbaf089. [PMID: 40062616 PMCID: PMC11891664 DOI: 10.1093/bib/bbaf089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2024] [Revised: 01/26/2025] [Accepted: 02/18/2025] [Indexed: 05/13/2025] Open
Abstract
The development of single-cell RNA sequencing (scRNA-seq) technology provides valuable data resources for inferring gene regulatory networks (GRNs), enabling deeper insights into cellular mechanisms and diseases. While many methods exist for inferring GRNs from static scRNA-seq data, current approaches face challenges in accurately handling time-series scRNA-seq data due to high noise levels and data sparsity. The temporal dimension introduces additional complexity by requiring models to capture dynamic changes, increasing sensitivity to noise, and exacerbating data sparsity across time points. In this study, we introduce GRANGER, an unsupervised deep learning-based method that integrates multiple advanced techniques, including a recurrent variational autoencoder, GRANGER causality, sparsity-inducing penalties, and negative binomial (NB)-based loss functions, to infer GRNs. GRANGER was evaluated using multiple popular benchmarking datasets, where it demonstrated superior performance compared to eight well-known GRN inference methods. The integration of a NB-based loss function and sparsity-inducing penalties in GRANGER significantly enhanced its capacity to address dropout noise and sparsity in scRNA-seq data. Additionally, GRANGER exhibited robustness against high levels of dropout noise. We applied GRANGER to scRNA-seq data from the whole mouse brain obtained through the BRAIN Initiative project and identified GRNs for five transcription regulators: E2f7, Gbx1, Sox10, Prox1, and Onecut2, which play crucial roles in diverse brain cell types. The inferred GRNs not only recalled many known regulatory relationships but also revealed sets of novel regulatory interactions with functional potential. These findings demonstrate that GRANGER is a highly effective tool for real-world applications in discovering novel gene regulatory relationships.
Collapse
Affiliation(s)
- Liang Chen
- College of Computer and Information Engineering, Tianjin Normal University, 393 Binshui W Ave, Tianjin, Tianjin 300387, China
| | - Madison Dautle
- Department of Biological and Biomedical Sciences, Rowan University, 201 Mullica Hill Road, Glassboro, NJ 08028, United States
| | - Ruoying Gao
- College of Computer and Information Engineering, Tianjin Normal University, 393 Binshui W Ave, Tianjin, Tianjin 300387, China
| | - Shaoqiang Zhang
- College of Computer and Information Engineering, Tianjin Normal University, 393 Binshui W Ave, Tianjin, Tianjin 300387, China
| | - Yong Chen
- Department of Biological and Biomedical Sciences, Rowan University, 201 Mullica Hill Road, Glassboro, NJ 08028, United States
| |
Collapse
|
4
|
Stock M, Losert C, Zambon M, Popp N, Lubatti G, Hörmanseder E, Heinig M, Scialdone A. Leveraging prior knowledge to infer gene regulatory networks from single-cell RNA-sequencing data. Mol Syst Biol 2025; 21:214-230. [PMID: 39939367 PMCID: PMC11876610 DOI: 10.1038/s44320-025-00088-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2024] [Revised: 01/29/2025] [Accepted: 01/30/2025] [Indexed: 02/14/2025] Open
Abstract
Many studies have used single-cell RNA sequencing (scRNA-seq) to infer gene regulatory networks (GRNs), which are crucial for understanding complex cellular regulation. However, the inherent noise and sparsity of scRNA-seq data present significant challenges to accurate GRN inference. This review explores one promising approach that has been proposed to address these challenges: integrating prior knowledge into the inference process to enhance the reliability of the inferred networks. We categorize common types of prior knowledge, such as experimental data and curated databases, and discuss methods for representing priors, particularly through graph structures. In addition, we classify recent GRN inference algorithms based on their ability to incorporate these priors and assess their performance in different contexts. Finally, we propose a standardized benchmarking framework to evaluate algorithms more fairly, ensuring biologically meaningful comparisons. This review provides guidance for researchers selecting GRN inference methods and offers insights for developers looking to improve current approaches and foster innovation in the field.
Collapse
Affiliation(s)
- Marco Stock
- Helmholtz Center Munich Institute of Epigenetics und Stem Cells, Munich, Germany
- Helmholtz Center Munich Institute of Computational Biology, Munich, Germany
- Helmholtz Center Munich Institute of Functional Epigenetics, Munich, Germany
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Corinna Losert
- Helmholtz Center Munich Institute of Computational Biology, Munich, Germany
- Department of Computer Science, TUM School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Matteo Zambon
- Helmholtz Center Munich Institute of Epigenetics und Stem Cells, Munich, Germany
- Helmholtz Center Munich Institute of Computational Biology, Munich, Germany
- Helmholtz Center Munich Institute of Functional Epigenetics, Munich, Germany
| | - Niclas Popp
- Helmholtz Center Munich Institute of Epigenetics und Stem Cells, Munich, Germany
- Helmholtz Center Munich Institute of Computational Biology, Munich, Germany
- Helmholtz Center Munich Institute of Functional Epigenetics, Munich, Germany
| | - Gabriele Lubatti
- Helmholtz Center Munich Institute of Epigenetics und Stem Cells, Munich, Germany
- Helmholtz Center Munich Institute of Computational Biology, Munich, Germany
- Helmholtz Center Munich Institute of Functional Epigenetics, Munich, Germany
| | - Eva Hörmanseder
- Helmholtz Center Munich Institute of Epigenetics und Stem Cells, Munich, Germany
| | - Matthias Heinig
- Helmholtz Center Munich Institute of Computational Biology, Munich, Germany
- Department of Computer Science, TUM School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
- German Centre for Cardiovascular Research (DZHK), Munich Heart Association, Partner Site Munich, Berlin, Germany
| | - Antonio Scialdone
- Helmholtz Center Munich Institute of Epigenetics und Stem Cells, Munich, Germany.
- Helmholtz Center Munich Institute of Computational Biology, Munich, Germany.
- Helmholtz Center Munich Institute of Functional Epigenetics, Munich, Germany.
| |
Collapse
|
5
|
Ferreira RM, Gisch DL, Phillips CL, Cheng YH, Asthana M, Lake BB, Bowen WS, Fang F, Asghari M, Sabo A, Barwinska D, Ferkowicz MJ, Toto RD, Sedor JR, Rosas SE, Bjornstad P, Hodgin JB, Alpers CE, Sarder P, Himmelfarb J, Schaub JA, Nair V, Winfree S, Sutton TA, Kelly KJ, Kretzler M, Jain S, El-Achkar TM, Dagher PC, Eadon MT. A MEF2C transcription factor network regulates proliferation of glomerular endothelial cells in diabetic kidney disease. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.27.615250. [PMID: 39803522 PMCID: PMC11722318 DOI: 10.1101/2024.09.27.615250] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/05/2025]
Abstract
The maintenance of a healthy epithelial-endothelial juxtaposition requires cross-talk within glomerular cellular niches. We sought to understand the spatially-anchored regulation and transition of endothelial and mesangial cells from health to injury in DKD. From 74 human kidney samples, an integrated multi-omics approach was leveraged to identify cellular niches, cell-cell communication, cell injury trajectories, and regulatory transcription factor (TF) networks in glomerular capillary endothelial (EC-GC) and mesangial cells. Data were culled from single nucleus RNA and ATAC sequencing and three orthogonal spatial transcriptomic technologies for correlation with histopathological and clinical trial data. We identified a cellular niche in diabetic glomeruli enriched in a proliferative endothelial cell subtype (prEC) and altered vascular smooth muscle cells (VSMCs). Cellular communication within this niche maintained pro-angiogenic signaling with loss of anti-angiogenic factors. We identified a TF network of MEF2C, MEF2A, and TRPS1 which regulated SEMA6A and PLXNA2, a receptor-ligand pair opposing angiogenesis. In silico knockout of the TF network accelerated the transition from healthy EC-GCs toward a degenerative (injury) endothelial phenotype, with concomitant disruption of EC-GC and prEC expression patterns. Glomeruli enriched in the prEC niche had histologic evidence of neovascularization. MEF2C activity was increased in diabetic glomeruli with nodular mesangial sclerosis. The gene regulatory network (GRN) of MEF2C was dysregulated in EC-GCs of patients with DKD, but sodium glucose transporter-2 inhibitor (SGLT2i) treatment reversed the MEF2C GRN effects of DKD. The MEF2C, MEF2A, and TRPS1 TF network carefully balances the fate of the EC-GC in DKD. When the TF network is "on" or over-expressed in DKD, EC-GCs may progress to a prEC state, while TF suppression leads to cell death. SGLT2i therapy may restore the balance of MEF2C activity.
Collapse
|
6
|
Loers JU, Vermeirssen V. A single-cell multimodal view on gene regulatory network inference from transcriptomics and chromatin accessibility data. Brief Bioinform 2024; 25:bbae382. [PMID: 39207727 PMCID: PMC11359808 DOI: 10.1093/bib/bbae382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Revised: 06/27/2024] [Accepted: 07/23/2024] [Indexed: 09/04/2024] Open
Abstract
Eukaryotic gene regulation is a combinatorial, dynamic, and quantitative process that plays a vital role in development and disease and can be modeled at a systems level in gene regulatory networks (GRNs). The wealth of multi-omics data measured on the same samples and even on the same cells has lifted the field of GRN inference to the next stage. Combinations of (single-cell) transcriptomics and chromatin accessibility allow the prediction of fine-grained regulatory programs that go beyond mere correlation of transcription factor and target gene expression, with enhancer GRNs (eGRNs) modeling molecular interactions between transcription factors, regulatory elements, and target genes. In this review, we highlight the key components for successful (e)GRN inference from (sc)RNA-seq and (sc)ATAC-seq data exemplified by state-of-the-art methods as well as open challenges and future developments. Moreover, we address preprocessing strategies, metacell generation and computational omics pairing, transcription factor binding site detection, and linear and three-dimensional approaches to identify chromatin interactions as well as dynamic and causal eGRN inference. We believe that the integration of transcriptomics together with epigenomics data at a single-cell level is the new standard for mechanistic network inference, and that it can be further advanced with integrating additional omics layers and spatiotemporal data, as well as with shifting the focus towards more quantitative and causal modeling strategies.
Collapse
Affiliation(s)
- Jens Uwe Loers
- Lab for Computational Biology, Integromics and Gene Regulation (CBIGR), Cancer Research Institute Ghent (CRIG), Corneel Heymanslaan 10, 9000 Ghent, Belgium
- Department of Biomedical Molecular Biology, Ghent University, Zwijnaarde-Technologiepark 71, 9052 Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, 9000 Ghent, Belgium
| | - Vanessa Vermeirssen
- Lab for Computational Biology, Integromics and Gene Regulation (CBIGR), Cancer Research Institute Ghent (CRIG), Corneel Heymanslaan 10, 9000 Ghent, Belgium
- Department of Biomedical Molecular Biology, Ghent University, Zwijnaarde-Technologiepark 71, 9052 Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, 9000 Ghent, Belgium
| |
Collapse
|
7
|
Li Y, Ma A, Wang Y, Guo Q, Wang C, Fu H, Liu B, Ma Q. Enhancer-driven gene regulatory networks inference from single-cell RNA-seq and ATAC-seq data. Brief Bioinform 2024; 25:bbae369. [PMID: 39082647 PMCID: PMC11289686 DOI: 10.1093/bib/bbae369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Revised: 06/19/2024] [Accepted: 07/15/2024] [Indexed: 08/03/2024] Open
Abstract
Deciphering the intricate relationships between transcription factors (TFs), enhancers, and genes through the inference of enhancer-driven gene regulatory networks (eGRNs) is crucial in understanding gene regulatory programs in a complex biological system. This study introduces STREAM, a novel method that leverages a Steiner forest problem model, a hybrid biclustering pipeline, and submodular optimization to infer eGRNs from jointly profiled single-cell transcriptome and chromatin accessibility data. Compared to existing methods, STREAM demonstrates enhanced performance in terms of TF recovery, TF-enhancer linkage prediction, and enhancer-gene relation discovery. Application of STREAM to an Alzheimer's disease dataset and a diffuse small lymphocytic lymphoma dataset reveals its ability to identify TF-enhancer-gene relations associated with pseudotime, as well as key TF-enhancer-gene relations and TF cooperation underlying tumor cells.
Collapse
Affiliation(s)
- Yang Li
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, United States
| | - Anjun Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, United States
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210, United States
| | - Yizhong Wang
- School of Mathematics, Shandong University, Jinan, Shandong 250100, China
| | - Qi Guo
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, United States
| | - Cankun Wang
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, United States
| | - Hongjun Fu
- Department of Neuroscience, College of Medicine, The Ohio State University, Columbus, OH 43210, United States
| | - Bingqiang Liu
- School of Mathematics, Shandong University, Jinan, Shandong 250100, China
| | - Qin Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, United States
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210, United States
| |
Collapse
|
8
|
Yu J, Leng J, Yuan F, Sun D, Wu LY. Reverse network diffusion to remove indirect noise for better inference of gene regulatory networks. Bioinformatics 2024; 40:btae435. [PMID: 38963312 PMCID: PMC11236096 DOI: 10.1093/bioinformatics/btae435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2024] [Revised: 06/24/2024] [Accepted: 07/03/2024] [Indexed: 07/05/2024] Open
Abstract
MOTIVATION Gene regulatory networks (GRNs) are vital tools for delineating regulatory relationships between transcription factors and their target genes. The boom in computational biology and various biotechnologies has made inferring GRNs from multi-omics data a hot topic. However, when networks are constructed from gene expression data, they often suffer from false-positive problem due to the transitive effects of correlation. The presence of spurious noise edges obscures the real gene interactions, which makes downstream analyses, such as detecting gene function modules and predicting disease-related genes, difficult and inefficient. Therefore, there is an urgent and compelling need to develop network denoising methods to improve the accuracy of GRN inference. RESULTS In this study, we proposed a novel network denoising method named REverse Network Diffusion On Random walks (RENDOR). RENDOR is designed to enhance the accuracy of GRNs afflicted by indirect effects. RENDOR takes noisy networks as input, models higher-order indirect interactions between genes by transitive closure, eliminates false-positive effects using the inverse network diffusion method, and produces refined networks as output. We conducted a comparative assessment of GRN inference accuracy before and after denoising on simulated networks and real GRNs. Our results emphasized that the network derived from RENDOR more accurately and effectively captures gene interactions. This study demonstrates the significance of removing network indirect noise and highlights the effectiveness of the proposed method in enhancing the signal-to-noise ratio of noisy networks. AVAILABILITY AND IMPLEMENTATION The R package RENDOR is provided at https://github.com/Wu-Lab/RENDOR and other source code and data are available at https://github.com/Wu-Lab/RENDOR-reproduce.
Collapse
Affiliation(s)
- Jiating Yu
- School of Mathematics and Statistics, Nanjing University of Information Science & Technology, Nanjing 210044, China
- IAM, MADIS, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jiacheng Leng
- IAM, MADIS, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- Zhejiang Lab, Hangzhou 311121, China
| | - Fan Yuan
- IAM, MADIS, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Duanchen Sun
- School of Mathematics, Shandong University, Jinan 250100, China
| | - Ling-Yun Wu
- IAM, MADIS, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
9
|
Moeckel C, Mouratidis I, Chantzi N, Uzun Y, Georgakopoulos-Soares I. Advances in computational and experimental approaches for deciphering transcriptional regulatory networks: Understanding the roles of cis-regulatory elements is essential, and recent research utilizing MPRAs, STARR-seq, CRISPR-Cas9, and machine learning has yielded valuable insights. Bioessays 2024; 46:e2300210. [PMID: 38715516 PMCID: PMC11444527 DOI: 10.1002/bies.202300210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 04/22/2024] [Accepted: 04/23/2024] [Indexed: 05/16/2024]
Abstract
Understanding the influence of cis-regulatory elements on gene regulation poses numerous challenges given complexities stemming from variations in transcription factor (TF) binding, chromatin accessibility, structural constraints, and cell-type differences. This review discusses the role of gene regulatory networks in enhancing understanding of transcriptional regulation and covers construction methods ranging from expression-based approaches to supervised machine learning. Additionally, key experimental methods, including MPRAs and CRISPR-Cas9-based screening, which have significantly contributed to understanding TF binding preferences and cis-regulatory element functions, are explored. Lastly, the potential of machine learning and artificial intelligence to unravel cis-regulatory logic is analyzed. These computational advances have far-reaching implications for precision medicine, therapeutic target discovery, and the study of genetic variations in health and disease.
Collapse
Affiliation(s)
- Camille Moeckel
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Ioannis Mouratidis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, USA
| | - Nikol Chantzi
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Yasin Uzun
- Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, USA
- Department of Pediatrics, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Ilias Georgakopoulos-Soares
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, USA
| |
Collapse
|
10
|
Ledru N, Wilson PC, Muto Y, Yoshimura Y, Wu H, Li D, Asthana A, Tullius SG, Waikar SS, Orlando G, Humphreys BD. Predicting proximal tubule failed repair drivers through regularized regression analysis of single cell multiomic sequencing. Nat Commun 2024; 15:1291. [PMID: 38347009 PMCID: PMC10861555 DOI: 10.1038/s41467-024-45706-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Accepted: 01/31/2024] [Indexed: 02/15/2024] Open
Abstract
Renal proximal tubule epithelial cells have considerable intrinsic repair capacity following injury. However, a fraction of injured proximal tubule cells fails to undergo normal repair and assumes a proinflammatory and profibrotic phenotype that may promote fibrosis and chronic kidney disease. The healthy to failed repair change is marked by cell state-specific transcriptomic and epigenomic changes. Single nucleus joint RNA- and ATAC-seq sequencing offers an opportunity to study the gene regulatory networks underpinning these changes in order to identify key regulatory drivers. We develop a regularized regression approach to construct genome-wide parametric gene regulatory networks using multiomic datasets. We generate a single nucleus multiomic dataset from seven adult human kidney samples and apply our method to study drivers of a failed injury response associated with kidney disease. We demonstrate that our approach is a highly effective tool for predicting key cis- and trans-regulatory elements underpinning the healthy to failed repair transition and use it to identify NFAT5 as a driver of the maladaptive proximal tubule state.
Collapse
Affiliation(s)
- Nicolas Ledru
- Division of Nephrology, Department of Medicine, Washington University in St. Louis School of Medicine, St. Louis, MO, USA
| | - Parker C Wilson
- Division of Anatomic and Molecular Pathology, Department of Pathology and Immunology, Washington University in St. Louis, St. Louis, MO, USA
| | - Yoshiharu Muto
- Division of Nephrology, Department of Medicine, Washington University in St. Louis School of Medicine, St. Louis, MO, USA
| | - Yasuhiro Yoshimura
- Division of Nephrology, Department of Medicine, Washington University in St. Louis School of Medicine, St. Louis, MO, USA
| | - Haojia Wu
- Division of Nephrology, Department of Medicine, Washington University in St. Louis School of Medicine, St. Louis, MO, USA
| | - Dian Li
- Division of Nephrology, Department of Medicine, Washington University in St. Louis School of Medicine, St. Louis, MO, USA
| | - Amish Asthana
- Department of Surgery, Wake Forest Baptist Medical Center; Wake Forest Institute for Regenerative Medicine, Wake Forest School of Medicine, Winston Salem, NC, USA
| | - Stefan G Tullius
- Division of Transplant Surgery and Transplant Surgery Research Laboratory, Department of Surgery, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Sushrut S Waikar
- Section of Nephrology, Department of Medicine, Boston University Chobanian and Avedisian School of Medicine, Boston Medical Center, Boston, MA, USA
| | - Giuseppe Orlando
- Department of Surgery, Wake Forest Baptist Medical Center; Wake Forest Institute for Regenerative Medicine, Wake Forest School of Medicine, Winston Salem, NC, USA
| | - Benjamin D Humphreys
- Division of Nephrology, Department of Medicine, Washington University in St. Louis School of Medicine, St. Louis, MO, USA.
- Department of Developmental Biology, Washington University in St. Louis School of Medicine, St. Louis, MO, USA.
| |
Collapse
|
11
|
Pan X, Zhang X. Studying temporal dynamics of single cells: expression, lineage and regulatory networks. Biophys Rev 2024; 16:57-67. [PMID: 38495440 PMCID: PMC10937865 DOI: 10.1007/s12551-023-01090-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 06/27/2023] [Indexed: 03/19/2024] Open
Abstract
Learning how multicellular organs are developed from single cells to different cell types is a fundamental problem in biology. With the high-throughput scRNA-seq technology, computational methods have been developed to reveal the temporal dynamics of single cells from transcriptomic data, from phenomena on cell trajectories to the underlying mechanism that formed the trajectory. There are several distinct families of computational methods including Trajectory Inference (TI), Lineage Tracing (LT), and Gene Regulatory Network (GRN) Inference which are involved in such studies. This review summarizes these computational approaches which use scRNA-seq data to study cell differentiation and cell fate specification as well as the advantages and limitations of different methods. We further discuss how GRNs can potentially affect cell fate decisions and trajectory structures. Supplementary Information The online version contains supplementary material available at 10.1007/s12551-023-01090-5.
Collapse
Affiliation(s)
- Xinhai Pan
- School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA 30332 USA
| | - Xiuwei Zhang
- School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA 30332 USA
| |
Collapse
|
12
|
Gisch DL, Brennan M, Lake BB, Basta J, Keller MS, Melo Ferreira R, Akilesh S, Ghag R, Lu C, Cheng YH, Collins KS, Parikh SV, Rovin BH, Robbins L, Stout L, Conklin KY, Diep D, Zhang B, Knoten A, Barwinska D, Asghari M, Sabo AR, Ferkowicz MJ, Sutton TA, Kelly KJ, De Boer IH, Rosas SE, Kiryluk K, Hodgin JB, Alakwaa F, Winfree S, Jefferson N, Türkmen A, Gaut JP, Gehlenborg N, Phillips CL, El-Achkar TM, Dagher PC, Hato T, Zhang K, Himmelfarb J, Kretzler M, Mollah S, Jain S, Rauchman M, Eadon MT. The chromatin landscape of healthy and injured cell types in the human kidney. Nat Commun 2024; 15:433. [PMID: 38199997 PMCID: PMC10781985 DOI: 10.1038/s41467-023-44467-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 12/14/2023] [Indexed: 01/12/2024] Open
Abstract
There is a need to define regions of gene activation or repression that control human kidney cells in states of health, injury, and repair to understand the molecular pathogenesis of kidney disease and design therapeutic strategies. Comprehensive integration of gene expression with epigenetic features that define regulatory elements remains a significant challenge. We measure dual single nucleus RNA expression and chromatin accessibility, DNA methylation, and H3K27ac, H3K4me1, H3K4me3, and H3K27me3 histone modifications to decipher the chromatin landscape and gene regulation of the kidney in reference and adaptive injury states. We establish a spatially-anchored epigenomic atlas to define the kidney's active, silent, and regulatory accessible chromatin regions across the genome. Using this atlas, we note distinct control of adaptive injury in different epithelial cell types. A proximal tubule cell transcription factor network of ELF3, KLF6, and KLF10 regulates the transition between health and injury, while in thick ascending limb cells this transition is regulated by NR2F1. Further, combined perturbation of ELF3, KLF6, and KLF10 distinguishes two adaptive proximal tubular cell subtypes, one of which manifested a repair trajectory after knockout. This atlas will serve as a foundation to facilitate targeted cell-specific therapeutics by reprogramming gene regulatory networks.
Collapse
Affiliation(s)
- Debora L Gisch
- Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | | | - Blue B Lake
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
- San Diego Institute of Science, Altos Labs, San Diego, CA, USA
| | - Jeannine Basta
- Washington University in Saint Louis, St. Louis, MO, 63103, USA
| | | | | | | | - Reetika Ghag
- Washington University in Saint Louis, St. Louis, MO, 63103, USA
| | - Charles Lu
- Washington University in Saint Louis, St. Louis, MO, 63103, USA
| | - Ying-Hua Cheng
- Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | | | - Samir V Parikh
- Ohio State University Wexner Medical Center, Columbus, OH, 43210, USA
| | - Brad H Rovin
- Ohio State University Wexner Medical Center, Columbus, OH, 43210, USA
| | - Lynn Robbins
- St. Louis Veteran Affairs Medical Center, St. Louis, MO, 63106, USA
| | - Lisa Stout
- Washington University in Saint Louis, St. Louis, MO, 63103, USA
| | - Kimberly Y Conklin
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Dinh Diep
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Bo Zhang
- Washington University in Saint Louis, St. Louis, MO, 63103, USA
| | - Amanda Knoten
- Washington University in Saint Louis, St. Louis, MO, 63103, USA
| | - Daria Barwinska
- Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Mahla Asghari
- Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Angela R Sabo
- Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | | | - Timothy A Sutton
- Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | | | | | - Sylvia E Rosas
- Joslin Diabetes Center, Harvard Medical School, Boston, MA, 02215, USA
| | | | | | | | - Seth Winfree
- University of Nebraska Medical Center, Omaha, NE, 68198, USA
| | - Nichole Jefferson
- Kidney Precision Medicine Project Community Engagement Committee, Dallas, TX, USA
| | - Aydın Türkmen
- Istanbul School of Medicine, Division of Nephrology, Istanbul, Turkey
| | - Joseph P Gaut
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | | | | | | | - Pierre C Dagher
- Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Takashi Hato
- Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Kun Zhang
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | | | | | - Shamim Mollah
- Washington University in Saint Louis, St. Louis, MO, 63103, USA
| | - Sanjay Jain
- Washington University in Saint Louis, St. Louis, MO, 63103, USA.
| | - Michael Rauchman
- Washington University in Saint Louis, St. Louis, MO, 63103, USA.
| | - Michael T Eadon
- Indiana University School of Medicine, Indianapolis, IN, 46202, USA.
| |
Collapse
|
13
|
Badia-I-Mompel P, Wessels L, Müller-Dott S, Trimbour R, Ramirez Flores RO, Argelaguet R, Saez-Rodriguez J. Gene regulatory network inference in the era of single-cell multi-omics. Nat Rev Genet 2023; 24:739-754. [PMID: 37365273 DOI: 10.1038/s41576-023-00618-5] [Citation(s) in RCA: 126] [Impact Index Per Article: 63.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/12/2023] [Indexed: 06/28/2023]
Abstract
The interplay between chromatin, transcription factors and genes generates complex regulatory circuits that can be represented as gene regulatory networks (GRNs). The study of GRNs is useful to understand how cellular identity is established, maintained and disrupted in disease. GRNs can be inferred from experimental data - historically, bulk omics data - and/or from the literature. The advent of single-cell multi-omics technologies has led to the development of novel computational methods that leverage genomic, transcriptomic and chromatin accessibility information to infer GRNs at an unprecedented resolution. Here, we review the key principles of inferring GRNs that encompass transcription factor-gene interactions from transcriptomics and chromatin accessibility data. We focus on the comparison and classification of methods that use single-cell multimodal data. We highlight challenges in GRN inference, in particular with respect to benchmarking, and potential further developments using additional data modalities.
Collapse
Affiliation(s)
- Pau Badia-I-Mompel
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | - Lorna Wessels
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
- Department of Vascular Biology and Tumor Angiogenesis, European Center for Angioscience, Medical Faculty, MannHeim Heidelberg University, Mannheim, Germany
| | - Sophia Müller-Dott
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | - Rémi Trimbour
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
- Institut Pasteur, Université Paris Cité, CNRS UMR 3738, Machine Learning for Integrative Genomics Group, Paris, France
| | - Ricardo O Ramirez Flores
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | | | - Julio Saez-Rodriguez
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany.
| |
Collapse
|
14
|
Gisch DL, Brennan M, Lake BB, Basta J, Keller M, Ferreira RM, Akilesh S, Ghag R, Lu C, Cheng YH, Collins KS, Parikh SV, Rovin BH, Robbins L, Conklin KY, Diep D, Zhang B, Knoten A, Barwinska D, Asghari M, Sabo AR, Ferkowicz MJ, Sutton TA, Kelly KJ, Boer IHD, Rosas SE, Kiryluk K, Hodgin JB, Alakwaa F, Jefferson N, Gaut JP, Gehlenborg N, Phillips CL, El-Achkar TM, Dagher PC, Hato T, Zhang K, Himmelfarb J, Kretzler M, Mollah S, Jain S, Rauchman M, Eadon MT. The chromatin landscape of healthy and injured cell types in the human kidney. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.07.543965. [PMID: 37333123 PMCID: PMC10274789 DOI: 10.1101/2023.06.07.543965] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/20/2023]
Abstract
There is a need to define regions of gene activation or repression that control human kidney cells in states of health, injury, and repair to understand the molecular pathogenesis of kidney disease and design therapeutic strategies. However, comprehensive integration of gene expression with epigenetic features that define regulatory elements remains a significant challenge. We measured dual single nucleus RNA expression and chromatin accessibility, DNA methylation, and H3K27ac, H3K4me1, H3K4me3, and H3K27me3 histone modifications to decipher the chromatin landscape and gene regulation of the kidney in reference and adaptive injury states. We established a comprehensive and spatially-anchored epigenomic atlas to define the kidney's active, silent, and regulatory accessible chromatin regions across the genome. Using this atlas, we noted distinct control of adaptive injury in different epithelial cell types. A proximal tubule cell transcription factor network of ELF3 , KLF6 , and KLF10 regulated the transition between health and injury, while in thick ascending limb cells this transition was regulated by NR2F1 . Further, combined perturbation of ELF3 , KLF6 , and KLF10 distinguished two adaptive proximal tubular cell subtypes, one of which manifested a repair trajectory after knockout. This atlas will serve as a foundation to facilitate targeted cell-specific therapeutics by reprogramming gene regulatory networks.
Collapse
|