1
|
Morabito A, De Simone G, Pastorelli R, Brunelli L, Ferrario M. Algorithms and tools for data-driven omics integration to achieve multilayer biological insights: a narrative review. J Transl Med 2025; 23:425. [PMID: 40211300 PMCID: PMC11987215 DOI: 10.1186/s12967-025-06446-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2025] [Accepted: 03/30/2025] [Indexed: 04/13/2025] Open
Abstract
Systems biology is a holistic approach to biological sciences that combines experimental and computational strategies, aimed at integrating information from different scales of biological processes to unravel pathophysiological mechanisms and behaviours. In this scenario, high-throughput technologies have been playing a major role in providing huge amounts of omics data, whose integration would offer unprecedented possibilities in gaining insights on diseases and identifying potential biomarkers. In the present review, we focus on strategies that have been applied in literature to integrate genomics, transcriptomics, proteomics, and metabolomics in the year range 2018-2024. Integration approaches were divided into three main categories: statistical-based approaches, multivariate methods, and machine learning/artificial intelligence techniques. Among them, statistical approaches (mainly based on correlation) were the ones with a slightly higher prevalence, followed by multivariate approaches, and machine learning techniques. Integrating multiple biological layers has shown great potential in uncovering molecular mechanisms, identifying putative biomarkers, and aid classification, most of the time resulting in better performances when compared to single omics analyses. However, significant challenges remain. The high-throughput nature of omics platforms introduces issues such as variable data quality, missing values, collinearity, and dimensionality. These challenges further increase when combining multiple omics datasets, as the complexity and heterogeneity of the data increase with integration. We report different strategies that have been found in literature to cope with these challenges, but some open issues still remain and should be addressed to disclose the full potential of omics integration.
Collapse
Affiliation(s)
- Aurelia Morabito
- Laboratory of Metabolites and Proteins in Translational Research, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, 20156, Milan, Italy.
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, 20133, Milan, Italy.
| | - Giulia De Simone
- Laboratory of Metabolites and Proteins in Translational Research, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, 20156, Milan, Italy
- Department of Biotechnologies and Biosciences, Università degli Studi Milano Bicocca, 20126, Milan, Italy
| | - Roberta Pastorelli
- Laboratory of Metabolites and Proteins in Translational Research, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, 20156, Milan, Italy
| | - Laura Brunelli
- Laboratory of Metabolites and Proteins in Translational Research, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, 20156, Milan, Italy
| | - Manuela Ferrario
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, 20133, Milan, Italy
| |
Collapse
|
2
|
Sabikunnahar B, Snyder JP, Rodriguez PD, Sessions KJ, Amiel E, Frietze SE, Krementsov DN. Natural genetic variation in wild-derived mice controls host survival and transcriptional responses during endotoxic shock. Immunohorizons 2025; 9:vlaf007. [PMID: 40139977 PMCID: PMC11945298 DOI: 10.1093/immhor/vlaf007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Accepted: 01/22/2025] [Indexed: 03/29/2025] Open
Abstract
Innate immune cells sense microbial danger signals, resulting in dynamic transcriptional reprogramming and rapid inflammatory responses. If not properly regulated, such responses can be detrimental to the host, as is seen in septic shock. A better understanding of the genetic regulation of responses during endotoxemia could provide potential therapeutic insights. However, the majority of animal model studies have been performed using classic inbred laboratory strains of mice, capturing limited genetic diversity. Here, we compared classic inbred C57BL/6 (B6) mice with wild-derived and genetically divergent PWD/PhJ (PWD) mice using in vivo and in vitro models of endotoxemia. Compared with B6 mice, PWD mice were markedly resistant to bacterial lipopolysaccharide (LPS)-induced endotoxic shock. Using LPS stimulation of bone marrow derived dendritic cells (BMDC) and RNA sequencing, we demonstrate that B6 and PWD BMDCs exhibit partially overlapping yet highly divergent transcriptional responses, with B6 skewed toward stereotypical proinflammatory pathway activation, and PWD engaging regulatory or developmental pathways. To dissect genetic regulation of inflammatory responses by allelic variants, we used BMDCs from a sub-consomic strain carrying a ∼50 Mb PWD-derived portion of chromosome 11 on the B6 background. This identified a subset of cis-regulated and a large number of trans-regulated genes. Bioinformatic analyses identified candidate trans regulators encoded in the chromosome 11 locus as transcription factors Irf1, Ncor1, and Srebf1. Our results demonstrate that natural genetic variation controls host survival and transcriptional reprogramming during endotoxemia, suggesting possibilities for prediction of sepsis risk and/or personalized therapeutic interventions.
Collapse
Affiliation(s)
- Bristy Sabikunnahar
- Department of Biomedical and Health Sciences, University of Vermont, Burlington, VT, United States
| | - Julia P Snyder
- Department of Biomedical and Health Sciences, University of Vermont, Burlington, VT, United States
| | - Princess D Rodriguez
- Department of Microbiology and Molecular Genetics, University of Vermont, Burlington, VT, United States
| | - Katherine J Sessions
- Department of Biomedical and Health Sciences, University of Vermont, Burlington, VT, United States
| | - Eyal Amiel
- Department of Biomedical and Health Sciences, University of Vermont, Burlington, VT, United States
| | - Seth E Frietze
- Department of Biomedical and Health Sciences, University of Vermont, Burlington, VT, United States
| | - Dimitry N Krementsov
- Department of Biomedical and Health Sciences, University of Vermont, Burlington, VT, United States
| |
Collapse
|
3
|
Li Y, Chen S, Rao H, Cui S, Chen G. MicroRNA Gets a Mighty Award. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025; 12:e2414625. [PMID: 39836690 PMCID: PMC11831481 DOI: 10.1002/advs.202414625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/09/2024] [Revised: 12/29/2024] [Indexed: 01/23/2025]
Abstract
Recent advancements in microRNAs (miRNAs) research have revealed their key roles in both normal physiological processes and pathological conditions, leading to potential applications in diagnostics and therapeutics. However, the path forward is fraught with several scientific and technical challenges. This review article briefly explores the milestones of the discovery, biogenesis, functions, and application for clinical diagnostic and therapeutic strategies of miRNAs. The potential challenges and future directions are also discussed to fully harness their capabilities.
Collapse
Affiliation(s)
- Yu Li
- Department of Human Cell Biology and GeneticsJoint Laboratory of Guangdong‐Hong Kong Universities for Vascular Homeostasis and DiseasesSchool of MedicineSouthern University of Science and TechnologyShenzhenGuangdong518055China
| | - Sijie Chen
- Department of Human Cell Biology and GeneticsJoint Laboratory of Guangdong‐Hong Kong Universities for Vascular Homeostasis and DiseasesSchool of MedicineSouthern University of Science and TechnologyShenzhenGuangdong518055China
| | - Hai Rao
- Department of BiochemistryKey University Laboratory of Metabolism and Health of GuangdongSchool of MedicineSouthern University of Science and TechnologyShenzhenGuangdong518055China
| | - Shengjin Cui
- Clinical LaboratoryThe University of Hong Kong‐Shenzhen HospitalShenzhenGuangdong518053China
| | - Guoan Chen
- Department of Human Cell Biology and GeneticsJoint Laboratory of Guangdong‐Hong Kong Universities for Vascular Homeostasis and DiseasesSchool of MedicineSouthern University of Science and TechnologyShenzhenGuangdong518055China
| |
Collapse
|
4
|
Aydin S, Skelly DA, Dewey H, Mahoney JM, Choi T, Reinholdt LG, Baker CL, Munger SC. Cross cell-type systems genetics reveals the influence of eQTL at multiple points in the developmental trajectory of mouse neural progenitor cells. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.24.634514. [PMID: 39896448 PMCID: PMC11785210 DOI: 10.1101/2025.01.24.634514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/04/2025]
Abstract
Genetic variation leads to phenotypic variability in pluripotent stem cells that presents challenges for regenerative medicine. Although recent studies have investigated the impact of genetic variation on pluripotency maintenance and differentiation capacity, less is known about how genetic variants affecting the pluripotent state influence gene regulation in later stages of development. Here, we characterized expression of more than 12,000 genes in 127 donor-matched Diversity Outbred (DO) mouse embryonic stem cell (mESC) and neural progenitor cell (mNPC) lines. Quantitative trait locus (QTL) mapping identified 2,947 expression QTL (eQTL) unique to DO mNPCs and 1,113 eQTL observed in both mNPCs and mESCs with highly concordant allele effects. We mapped three eQTL hotspots on Chromosomes (Chrs) 1, 10, and 11 that were unique to mNPCs. Target genes of the Chr 1 hotspot were overrepresented for those involved in mRNA processing, DNA repair, chromatin organization, protein degradation, and cell cycle. Mediation analysis of the Chr 1 hotspot identified Rnf152 as the best candidate mediator expressed in mNPCs, while cross-cell type mediation using mESC gene expression along with partial correlation analysis strongly implicated genetic variant(s) affecting Pign expression in the mESC state as regulating the mNPC Chr 1 eQTL hotspot. Together these findings highlight that many local eQTL confer similar effects on gene expression in multiple cell states; distant eQTL in DO mNPCs are numerous and largely unique to that cell state, with many co-localizing to mNPC-specific hotspots; and mediation analysis across cell types suggests that expression of Pign early in development (mESCs) shapes the transcriptome of the more specialized mNPC state.
Collapse
Affiliation(s)
- Selcan Aydin
- The Jackson Laboratory, Bar Harbor, ME 04609 USA
| | | | - Hannah Dewey
- The Jackson Laboratory, Bar Harbor, ME 04609 USA
- Graduate School of Biomedical Sciences, Tufts University, Boston, MA 02111 USA
| | | | - Ted Choi
- Predictive Biology, Inc., Carlsbad, CA 92010 USA
| | - Laura G. Reinholdt
- The Jackson Laboratory, Bar Harbor, ME 04609 USA
- Graduate School of Biomedical Sciences, Tufts University, Boston, MA 02111 USA
| | - Christopher L. Baker
- The Jackson Laboratory, Bar Harbor, ME 04609 USA
- Graduate School of Biomedical Sciences, Tufts University, Boston, MA 02111 USA
| | - Steven C. Munger
- The Jackson Laboratory, Bar Harbor, ME 04609 USA
- Graduate School of Biomedical Sciences, Tufts University, Boston, MA 02111 USA
| |
Collapse
|
5
|
Teyssonnière EM, Trébulle P, Muenzner J, Loegler V, Ludwig D, Amari F, Mülleder M, Friedrich A, Hou J, Ralser M, Schacherer J. Species-wide quantitative transcriptomes and proteomes reveal distinct genetic control of gene expression variation in yeast. Proc Natl Acad Sci U S A 2024; 121:e2319211121. [PMID: 38696467 PMCID: PMC11087752 DOI: 10.1073/pnas.2319211121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 03/25/2024] [Indexed: 05/04/2024] Open
Abstract
Gene expression varies between individuals and corresponds to a key step linking genotypes to phenotypes. However, our knowledge regarding the species-wide genetic control of protein abundance, including its dependency on transcript levels, is very limited. Here, we have determined quantitative proteomes of a large population of 942 diverse natural Saccharomyces cerevisiae yeast isolates. We found that mRNA and protein abundances are weakly correlated at the population gene level. While the protein coexpression network recapitulates major biological functions, differential expression patterns reveal proteomic signatures related to specific populations. Comprehensive genetic association analyses highlight that genetic variants associated with variation in protein (pQTL) and transcript (eQTL) levels poorly overlap (3%). Our results demonstrate that transcriptome and proteome are governed by distinct genetic bases, likely explained by protein turnover. It also highlights the importance of integrating these different levels of gene expression to better understand the genotype-phenotype relationship.
Collapse
Affiliation(s)
- Elie Marcel Teyssonnière
- UMR 7156 Génétique Moléculaire, Génomique et Microbiologie, Université de Strasbourg, CNRS, Strasbourg67000, France
| | - Pauline Trébulle
- The Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, OxfordOX3 7BN, United Kingdom
| | - Julia Muenzner
- Department of Biochemistry, Charitéplatz 1, Charité – Universitätsmedizin Berlin, Berlin10117, Germany
| | - Victor Loegler
- UMR 7156 Génétique Moléculaire, Génomique et Microbiologie, Université de Strasbourg, CNRS, Strasbourg67000, France
| | - Daniela Ludwig
- Department of Biochemistry, Charitéplatz 1, Charité – Universitätsmedizin Berlin, Berlin10117, Germany
- Core Facility High-Throughput Mass Spectrometry, Charitéplatz 1, Charité – Universitätsmedizin Berlin, Berlin10117, Germany
| | - Fatma Amari
- Department of Biochemistry, Charitéplatz 1, Charité – Universitätsmedizin Berlin, Berlin10117, Germany
- Core Facility High-Throughput Mass Spectrometry, Charitéplatz 1, Charité – Universitätsmedizin Berlin, Berlin10117, Germany
| | - Michael Mülleder
- Core Facility High-Throughput Mass Spectrometry, Charitéplatz 1, Charité – Universitätsmedizin Berlin, Berlin10117, Germany
| | - Anne Friedrich
- UMR 7156 Génétique Moléculaire, Génomique et Microbiologie, Université de Strasbourg, CNRS, Strasbourg67000, France
| | - Jing Hou
- UMR 7156 Génétique Moléculaire, Génomique et Microbiologie, Université de Strasbourg, CNRS, Strasbourg67000, France
| | - Markus Ralser
- The Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, OxfordOX3 7BN, United Kingdom
- Department of Biochemistry, Charitéplatz 1, Charité – Universitätsmedizin Berlin, Berlin10117, Germany
- Max Planck Institute for Molecular Genetics, Berlin14195, Germany
| | - Joseph Schacherer
- UMR 7156 Génétique Moléculaire, Génomique et Microbiologie, Université de Strasbourg, CNRS, Strasbourg67000, France
- Institut Universitaire de France, Paris75000, France
| |
Collapse
|
6
|
Cortes DE, Escudero M, Korgan AC, Mitra A, Edwards A, Aydin SC, Munger SC, Charland K, Zhang ZW, O'Connell KMS, Reinholdt LG, Pera MF. An in vitro neurogenetics platform for precision disease modeling in the mouse. SCIENCE ADVANCES 2024; 10:eadj9305. [PMID: 38569042 PMCID: PMC10990289 DOI: 10.1126/sciadv.adj9305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Accepted: 02/27/2024] [Indexed: 04/05/2024]
Abstract
The power and scope of disease modeling can be markedly enhanced through the incorporation of broad genetic diversity. The introduction of pathogenic mutations into a single inbred mouse strain sometimes fails to mimic human disease. We describe a cross-species precision disease modeling platform that exploits mouse genetic diversity to bridge cell-based modeling with whole organism analysis. We developed a universal protocol that permitted robust and reproducible neural differentiation of genetically diverse human and mouse pluripotent stem cell lines and then carried out a proof-of-concept study of the neurodevelopmental gene DYRK1A. Results in vitro reliably predicted the effects of genetic background on Dyrk1a loss-of-function phenotypes in vivo. Transcriptomic comparison of responsive and unresponsive strains identified molecular pathways conferring sensitivity or resilience to Dyrk1a1A loss and highlighted differential messenger RNA isoform usage as an important determinant of response. This cross-species strategy provides a powerful tool in the functional analysis of candidate disease variants identified through human genetic studies.
Collapse
Affiliation(s)
| | | | | | - Arojit Mitra
- The Jackson Laboratory, Bar Harbor, ME 04660, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
7
|
O'Connor C, Keele GR, Martin W, Stodola T, Gatti D, Hoffman BR, Korstanje R, Churchill GA, Reinholdt LG. Cell morphology QTL reveal gene by environment interactions in a genetically diverse cell population. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.18.567597. [PMID: 38014303 PMCID: PMC10680806 DOI: 10.1101/2023.11.18.567597] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Genetically heterogenous cell lines from laboratory mice are promising tools for population-based screening as they offer power for genetic mapping, and potentially, predictive value for in vivo experimentation in genetically matched individuals. To explore this further, we derived a panel of fibroblast lines from a genetic reference population of laboratory mice (the Diversity Outbred, DO). We then used high-content imaging to capture hundreds of cell morphology traits in cells exposed to the oxidative stress-inducing arsenic metabolite monomethylarsonous acid (MMAIII). We employed dose-response modeling to capture latent parameters of response and we then used these parameters to identify several hundred cell morphology quantitative trait loci (cmQTL). Response cmQTL encompass genes with established associations with cellular responses to arsenic exposure, including Abcc4 and Txnrd1, as well as novel gene candidates like Xrcc2. Moreover, baseline trait cmQTL highlight the influence of natural variation on fundamental aspects of nuclear morphology. We show that the natural variants influencing response include both coding and non-coding variation, and that cmQTL haplotypes can be used to predict response in orthogonal cell lines. Our study sheds light on the major molecular initiating events of oxidative stress that are under genetic regulation, including the NRF2-mediated antioxidant response, cellular detoxification pathways, DNA damage repair response, and cell death trajectories.
Collapse
Affiliation(s)
- Callan O'Connor
- The Jackson Laboratory, Bar Harbor, ME 04609, USA
- Graduate School of Biomedical Sciences, Tufts University, Boston, MA 02111, USA
| | - Gregory R Keele
- The Jackson Laboratory, Bar Harbor, ME 04609, USA
- RTI International, RTP, NC 27709, USA
| | | | | | - Daniel Gatti
- The Jackson Laboratory, Bar Harbor, ME 04609, USA
| | | | | | | | - Laura G Reinholdt
- The Jackson Laboratory, Bar Harbor, ME 04609, USA
- Graduate School of Biomedical Sciences, Tufts University, Boston, MA 02111, USA
| |
Collapse
|
8
|
Allayee H, Farber CR, Seldin MM, Williams EG, James DE, Lusis AJ. Systems genetics approaches for understanding complex traits with relevance for human disease. eLife 2023; 12:e91004. [PMID: 37962168 PMCID: PMC10645424 DOI: 10.7554/elife.91004] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 10/16/2023] [Indexed: 11/15/2023] Open
Abstract
Quantitative traits are often complex because of the contribution of many loci, with further complexity added by environmental factors. In medical research, systems genetics is a powerful approach for the study of complex traits, as it integrates intermediate phenotypes, such as RNA, protein, and metabolite levels, to understand molecular and physiological phenotypes linking discrete DNA sequence variation to complex clinical and physiological traits. The primary purpose of this review is to describe some of the resources and tools of systems genetics in humans and rodent models, so that researchers in many areas of biology and medicine can make use of the data.
Collapse
Affiliation(s)
- Hooman Allayee
- Departments of Population & Public Health Sciences, University of Southern CaliforniaLos AngelesUnited States
- Biochemistry & Molecular Medicine, Keck School of Medicine, University of Southern CaliforniaLos AngelesUnited States
| | - Charles R Farber
- Center for Public Health Genomics, University of Virginia School of MedicineCharlottesvilleUnited States
- Departments of Biochemistry & Molecular Genetics, University of Virginia School of MedicineCharlottesvilleUnited States
- Public Health Sciences, University of Virginia School of MedicineCharlottesvilleUnited States
| | - Marcus M Seldin
- Department of Biological Chemistry, University of California, IrvineIrvineUnited States
| | - Evan Graehl Williams
- Luxembourg Centre for Systems Biomedicine, University of LuxembourgLuxembourgLuxembourg
| | - David E James
- School of Life and Environmental Sciences, University of SydneyCamperdownAustralia
- Faculty of Medicine and Health, University of SydneyCamperdownAustralia
- Charles Perkins Centre, University of SydneyCamperdownAustralia
| | - Aldons J Lusis
- Departments of Human Genetics, University of California, Los AngelesLos AngelesUnited States
- Medicine, University of California, Los AngelesLos AngelesUnited States
- Microbiology, Immunology, & Molecular Genetics, David Geffen School of Medicine of UCLALos AngelesUnited States
| |
Collapse
|
9
|
Teyssonnière E, Trébulle P, Muenzner J, Loegler V, Ludwig D, Amari F, Mülleder M, Friedrich A, Hou J, Ralser M, Schacherer J. Species-wide quantitative transcriptomes and proteomes reveal distinct genetic control of gene expression variation in yeast. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.18.558197. [PMID: 37781592 PMCID: PMC10541136 DOI: 10.1101/2023.09.18.558197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/03/2023]
Abstract
Gene expression varies between individuals and corresponds to a key step linking genotypes to phenotypes. However, our knowledge regarding the species-wide genetic control of protein abundance, including its dependency on transcript levels, is very limited. Here, we have determined quantitative proteomes of a large population of 942 diverse natural Saccharomyces cerevisiae yeast isolates. We found that mRNA and protein abundances are weakly correlated at the population gene level. While the protein co-expression network recapitulates major biological functions, differential expression patterns reveal proteomic signatures related to specific populations. Comprehensive genetic association analyses highlight that genetic variants associated with variation in protein (pQTL) and transcript (eQTL) levels poorly overlap (3.6%). Our results demonstrate that transcriptome and proteome are governed by distinct genetic bases, likely explained by protein turnover. It also highlights the importance of integrating these different levels of gene expression to better understand the genotype-phenotype relationship. Highlights At the level of individual genes, the abundance of transcripts and proteins is weakly correlated within a species ( ρ = 0.165). While the proteome is not imprinted by population structure, co-expression patterns recapitulate the cellular functional landscapeWild populations exhibit a higher abundance of respiration-related proteins compared to domesticated populationsLoci that influence protein abundance differ from those that impact transcript levels, likely because of protein turnover.
Collapse
|
10
|
Keele GR. Which mouse multiparental population is right for your study? The Collaborative Cross inbred strains, their F1 hybrids, or the Diversity Outbred population. G3 (BETHESDA, MD.) 2023; 13:jkad027. [PMID: 36735601 PMCID: PMC10085760 DOI: 10.1093/g3journal/jkad027] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 12/30/2022] [Accepted: 01/23/2023] [Indexed: 02/04/2023]
Abstract
Multiparental populations (MPPs) encompass greater genetic diversity than traditional experimental crosses of two inbred strains, enabling broader surveys of genetic variation underlying complex traits. Two such mouse MPPs are the Collaborative Cross (CC) inbred panel and the Diversity Outbred (DO) population, which are descended from the same eight inbred strains. Additionally, the F1 intercrosses of CC strains (CC-RIX) have been used and enable study designs with replicate outbred mice. Genetic analyses commonly used by researchers to investigate complex traits in these populations include characterizing how heritable a trait is, i.e. its heritability, and mapping its underlying genetic loci, i.e. its quantitative trait loci (QTLs). Here we evaluate the relative merits of these populations for these tasks through simulation, as well as provide recommendations for performing the quantitative genetic analyses. We find that sample populations that include replicate animals, as possible with the CC and CC-RIX, provide more efficient and precise estimates of heritability. We report QTL mapping power curves for the CC, CC-RIX, and DO across a range of QTL effect sizes and polygenic backgrounds for samples of 174 and 500 mice. The utility of replicate animals in the CC and CC-RIX for mapping QTLs rapidly decreased as traits became more polygenic. Only large sample populations of 500 DO mice were well-powered to detect smaller effect loci (7.5-10%) for highly complex traits (80% polygenic background). All results were generated with our R package musppr, which we developed to simulate data from these MPPs and evaluate genetic analyses from user-provided genotypes.
Collapse
Affiliation(s)
- Gregory R Keele
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA
| |
Collapse
|