1
|
Ricker B, Castellanos Franco EA, de los Campos G, Pelled G, Gilad AA. A conserved phenylalanine motif among Teleost fish provides insight for improving electromagnetic perception. bioRxiv 2024:2024.04.04.588096. [PMID: 38617371 PMCID: PMC11014636 DOI: 10.1101/2024.04.04.588096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]
Abstract
Magnetoreceptive biology as a field remains relatively obscure; compared to the breadth of species believed to sense magnetic fields, it remains under-studied. Here, we present grounds for the expansion of magnetoreception studies among Teleosts. We begin with the electromagnetic perceptive gene (EPG) from Kryptopterus vitreolus and expand to identify 72 Teleosts with homologous proteins containing a conserved three-phenylalanine (3F) motif. Phylogenetic analysis provides insight as to how EPG may have evolved over time, and indicates that certain clades may have experienced a loss of function driven by different fitness pressures. One potential factor is water type with freshwater fish significantly more likely to possess the functional motif version (FFF), and saltwater fish to have the non-functional variant (FXF). It was also revealed that when the 3F motif from the homolog of Brachyhypopomus gauderio (B.g.) is inserted into EPG - EPG(B.g.) - the response (as indicated by increased intracellular calcium) is faster. This indicates that EPG has the potential to be engineered to improve upon its response and increase its utility to be used as a controller for specific outcomes.
Collapse
Affiliation(s)
- Brianna Ricker
- Department of Chemical Engineering and Materials Sciences, Michigan State University, East Lansing MI, USA
| | | | - Gustavo de los Campos
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing MI, USA
- Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing MI, USA
- Department of Statistics and Probability, Michigan State University, East Lansing MI, USA
| | - Galit Pelled
- Department of Mechanical Engineering, Michigan State University, East Lansing MI, USA
- Department of Radiology, Michigan State University, East Lansing, MI, USA
| | - Assaf A. Gilad
- Department of Chemical Engineering and Materials Sciences, Michigan State University, East Lansing MI, USA
- Department of Radiology, Michigan State University, East Lansing, MI, USA
| |
Collapse
|
2
|
Valente BD, de los Campos G, Grueneberg A, Chen CY, Ros-Freixedes R, Herring WO. Using residual regressions to quantify and map signal leakage in genomic prediction. Genet Sel Evol 2023; 55:57. [PMID: 37550618 PMCID: PMC10405418 DOI: 10.1186/s12711-023-00830-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 07/12/2023] [Indexed: 08/09/2023] Open
Abstract
BACKGROUND Most genomic prediction applications in animal breeding use genotypes with tens of thousands of single nucleotide polymorphisms (SNPs). However, modern sequencing technologies and imputation algorithms can generate ultra-high-density genotypes (including millions of SNPs) at an affordable cost. Empirical studies have not produced clear evidence that using ultra-high-density genotypes can significantly improve prediction accuracy. However, (whole-genome) prediction accuracy is not very informative about the ability of a model to capture the genetic signals from specific genomic regions. To address this problem, we propose a simple methodology that detects chromosome regions for which a specific model (e.g., single-step genomic best linear unbiased prediction (ssGBLUP)) may fail to fully capture the genetic signal present in such segments-a phenomenon that we refer to as signal leakage. We propose to detect regions with evidence of signal leakage by testing the association of residuals from a pedigree or a genomic model with SNP genotypes. We discuss how this approach can be used to map regions with signals that are poorly captured by a model and to identify strategies to fix those problems (e.g., using a different prior or increasing marker density). Finally, we explored the proposed approach to scan for signal leakage of different models (pedigree-based, ssGBLUP, and various Bayesian models) applied to growth-related phenotypes (average daily gain and backfat thickness) in pigs. RESULTS We report widespread evidence of signal leakage for pedigree-based models. Including a percentage of animals with SNP data in ssGBLUP reduced the extent of signal leakage. However, local peaks of missed signals remained in some regions, even when all animals were genotyped. Using variable selection priors solves leakage points that are caused by excessive shrinkage of marker effects. Nevertheless, these models still miss signals in some regions due to low linkage disequilibrium between the SNPs on the array used and causal variants. Thus, we discuss how such problems could be addressed by adding sequence SNPs from those regions to the prediction model. CONCLUSIONS Residual single-marker regression analysis is a simple approach that can be used to detect regional genomic signals that are poorly captured by a model and to indicate ways to fix such problems.
Collapse
Affiliation(s)
| | - Gustavo de los Campos
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, MI USA
- Department of Statistics and Probability, Michigan State University, East Lansing, MI USA
- Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI USA
| | - Alexander Grueneberg
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, MI USA
| | - Ching-Yi Chen
- The Pig Improvement Company, Genus Plc, Hendersonville, TN USA
| | - Roger Ros-Freixedes
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
- Departament de Ciència Animal, Universitat de Lleida-Agrotecnio-CERCA Center, Lleida, Spain
| | | |
Collapse
|
3
|
Ejima K, Liu N, Mestre LM, de los Campos G, Allison DB. Conditioning on parental mating types can reduce necessary assumptions for Mendelian randomization. Front Genet 2023; 14:1014014. [PMID: 36950138 PMCID: PMC10025466 DOI: 10.3389/fgene.2023.1014014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 02/17/2023] [Indexed: 03/08/2023] Open
Abstract
Mendelian randomization (MR) has become a common tool used in epidemiological studies. However, when confounding variables are correlated with the instrumental variable (in this case, a genetic/variant/marker), the estimation can remain biased even with MR. We propose conditioning on parental mating types (a function of parental genotypes) in MR to eliminate the need for one set of assumptions, thereby plausibly reducing such bias. We illustrate a situation in which the instrumental variable and confounding variables are correlated using two unlinked diallelic genetic loci: one, an instrumental variable and the other, a confounding variable. Assortative mating or population admixture can create an association between the two unlinked loci, which can violate one of the necessary assumptions for MR. We simulated datasets involving assortative mating and population admixture and analyzed them using three different methods: 1) conventional MR, 2) MR conditioning on parental genotypes, and 3) MR conditioning on parental mating types. We demonstrated that conventional MR leads to type I error rate inflation and biased estimates for cases with assortative mating or population admixtures. In the presence of non-additive effects, MR with an adjustment for parental genotypes only partially reduced the type I error rate inflation and bias. In contrast, conditioning on parental mating types in MR eliminated the type I error inflation and bias under these circumstances. Conditioning on parental mating types is a useful strategy to reduce the burden of assumptions and the potential bias in MR when the correlation between the instrument variable and confounders is due to assortative mating or population stratification but not linkage.
Collapse
Affiliation(s)
- Keisuke Ejima
- Department of Epidemiology and Biostatistics, Indiana University School of Public Health-Bloomington, Bloomington, IN, United States
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
- Department of Global Health Policy, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| | - Nianjun Liu
- Department of Epidemiology and Biostatistics, Indiana University School of Public Health-Bloomington, Bloomington, IN, United States
| | - Luis Miguel Mestre
- Department of Epidemiology and Biostatistics, Indiana University School of Public Health-Bloomington, Bloomington, IN, United States
| | - Gustavo de los Campos
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, MI, United States
- Department of Statistics and Probability, Michigan State University, East Lansing, MI, United States
- Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI, United States
| | - David B. Allison
- Department of Epidemiology and Biostatistics, Indiana University School of Public Health-Bloomington, Bloomington, IN, United States
- *Correspondence: David B. Allison,
| |
Collapse
|
4
|
Hansen PB, Ruud AK, de los Campos G, Malinowska M, Nagy I, Svane SF, Thorup-Kristensen K, Jensen JD, Krusell L, Asp T. Integration of DNA Methylation and Transcriptome Data Improves Complex Trait Prediction in Hordeum vulgare. Plants 2022; 11:plants11172190. [PMID: 36079572 PMCID: PMC9459846 DOI: 10.3390/plants11172190] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Revised: 08/19/2022] [Accepted: 08/21/2022] [Indexed: 11/30/2022]
Abstract
Whole-genome multi-omics profiles contain valuable information for the characterization and prediction of complex traits in plants. In this study, we evaluate multi-omics models to predict four complex traits in barley (Hordeum vulgare); grain yield, thousand kernel weight, protein content, and nitrogen uptake. Genomic, transcriptomic, and DNA methylation data were obtained from 75 spring barley lines tested in the RadiMax semi-field phenomics facility under control and water-scarce treatment. By integrating multi-omics data at genomic, transcriptomic, and DNA methylation regulatory levels, a higher proportion of phenotypic variance was explained (0.72–0.91) than with genomic models alone (0.55–0.86). The correlation between predictions and phenotypes varied from 0.17–0.28 for control plants and 0.23–0.37 for water-scarce plants, and the increase in accuracy was significant for nitrogen uptake and protein content compared to models using genomic information alone. Adding transcriptomic and DNA methylation information to the prediction models explained more of the phenotypic variance attributed to the environment in grain yield and nitrogen uptake. It furthermore explained more of the non-additive genetic effects for thousand kernel weight and protein content. Our results show the feasibility of multi-omics prediction for complex traits in barley.
Collapse
Affiliation(s)
- Pernille Bjarup Hansen
- Center for Quantitative Genetics and Genomics, Aarhus University, 4200 Slagelse, Denmark
- Correspondence: (P.B.H.); (T.A.); Tel.: +45-87158243 (T.A.)
| | - Anja Karine Ruud
- Center for Quantitative Genetics and Genomics, Aarhus University, 4200 Slagelse, Denmark
| | - Gustavo de los Campos
- Departments of Epidemiology & Biostatistics and Statistics & Probability, Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Marta Malinowska
- Center for Quantitative Genetics and Genomics, Aarhus University, 4200 Slagelse, Denmark
| | - Istvan Nagy
- Center for Quantitative Genetics and Genomics, Aarhus University, 4200 Slagelse, Denmark
| | - Simon Fiil Svane
- Section for Crop Sciences, Department of Plant and Environmental Sciences, Copenhagen University, 2630 Taastrup, Denmark
| | - Kristian Thorup-Kristensen
- Section for Crop Sciences, Department of Plant and Environmental Sciences, Copenhagen University, 2630 Taastrup, Denmark
| | | | - Lene Krusell
- Sejet Plant Breeding, Nørremarksvej 67, 8700 Horsens, Denmark
| | - Torben Asp
- Center for Quantitative Genetics and Genomics, Aarhus University, 4200 Slagelse, Denmark
- Correspondence: (P.B.H.); (T.A.); Tel.: +45-87158243 (T.A.)
| |
Collapse
|
5
|
Schrauf MF, de los Campos G, Munilla S. Comparing Genomic Prediction Models by Means of Cross Validation. Front Plant Sci 2021; 12:734512. [PMID: 34868117 PMCID: PMC8639521 DOI: 10.3389/fpls.2021.734512] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Accepted: 10/26/2021] [Indexed: 06/13/2023]
Abstract
In the two decades of continuous development of genomic selection, a great variety of models have been proposed to make predictions from the information available in dense marker panels. Besides deciding which particular model to use, practitioners also need to make many minor choices for those parameters in the model which are not typically estimated by the data (so called "hyper-parameters"). When the focus is placed on predictions, most of these decisions are made in a direction sought to optimize predictive accuracy. Here we discuss and illustrate using publicly available crop datasets the use of cross validation to make many such decisions. In particular, we emphasize the importance of paired comparisons to achieve high power in the comparison between candidate models, as well as the need to define notions of relevance in the difference between their performances. Regarding the latter, we borrow the idea of equivalence margins from clinical research and introduce new statistical tests. We conclude that most hyper-parameters can be learnt from the data by either minimizing REML or by using weakly-informative priors, with good predictive results. In particular, the default options in a popular software are generally competitive with the optimal values. With regard to the performance assessments themselves, we conclude that the paired k-fold cross validation is a generally applicable and statistically powerful methodology to assess differences in model accuracies. Coupled with the definition of equivalence margins based on expected genetic gain, it becomes a useful tool for breeders.
Collapse
Affiliation(s)
- Matías F. Schrauf
- Facultad de Agronomía, Universidad de Buenos Aires, Buenos Aires, Argentina
- Animal Breeding & Genomics, Wageningen Livestock Research, Wageningen University & Research, Wageningen, Netherlands
| | - Gustavo de los Campos
- Departments of Epidemiology, Biostatistics, Statistics, and Probabilty, Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI, United States
| | - Sebastián Munilla
- Facultad de Agronomía, Universidad de Buenos Aires, Buenos Aires, Argentina
- Instituto de Investigaciones en Producción Animal (INPA), CONICET-Universidad de Buenos Aires, Buenos Aires, Argentina
| |
Collapse
|
6
|
Lopez-Cruz M, Beyene Y, Gowda M, Crossa J, Pérez-Rodríguez P, de los Campos G. Multi-generation genomic prediction of maize yield using parametric and non-parametric sparse selection indices. Heredity (Edinb) 2021; 127:423-432. [PMID: 34564692 PMCID: PMC8551287 DOI: 10.1038/s41437-021-00474-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Revised: 09/10/2021] [Accepted: 09/11/2021] [Indexed: 02/07/2023] Open
Abstract
Genomic prediction models are often calibrated using multi-generation data. Over time, as data accumulates, training data sets become increasingly heterogeneous. Differences in allele frequency and linkage disequilibrium patterns between the training and prediction genotypes may limit prediction accuracy. This leads to the question of whether all available data or a subset of it should be used to calibrate genomic prediction models. Previous research on training set optimization has focused on identifying a subset of the available data that is optimal for a given prediction set. However, this approach does not contemplate the possibility that different training sets may be optimal for different prediction genotypes. To address this problem, we recently introduced a sparse selection index (SSI) that identifies an optimal training set for each individual in a prediction set. Using additive genomic relationships, the SSI can provide increased accuracy relative to genomic-BLUP (GBLUP). Non-parametric genomic models using Gaussian kernels (KBLUP) have, in some cases, yielded higher prediction accuracies than standard additive models. Therefore, here we studied whether combining SSIs and kernel methods could further improve prediction accuracy when training genomic models using multi-generation data. Using four years of doubled haploid maize data from the International Maize and Wheat Improvement Center (CIMMYT), we found that when predicting grain yield the KBLUP outperformed the GBLUP, and that using SSI with additive relationships (GSSI) lead to 5-17% increases in accuracy, relative to the GBLUP. However, differences in prediction accuracy between the KBLUP and the kernel-based SSI were smaller and not always significant.
Collapse
Affiliation(s)
- Marco Lopez-Cruz
- grid.17088.360000 0001 2150 1785Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI USA ,grid.17088.360000 0001 2150 1785Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, MI USA
| | - Yoseph Beyene
- grid.512317.30000 0004 7645 1801Global Maize Program, International Maize and Wheat Improvement Center (CIMMYT), Nairobi, Kenya
| | - Manje Gowda
- grid.512317.30000 0004 7645 1801Global Maize Program, International Maize and Wheat Improvement Center (CIMMYT), Nairobi, Kenya
| | - Jose Crossa
- grid.433436.50000 0001 2289 885XBiometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Mexico, Mexico ,grid.418752.d0000 0004 1795 9752Colegio de Postgraduados, Montecillos, Edo. de México, Mexico
| | - Paulino Pérez-Rodríguez
- grid.418752.d0000 0004 1795 9752Colegio de Postgraduados, Montecillos, Edo. de México, Mexico
| | - Gustavo de los Campos
- grid.17088.360000 0001 2150 1785Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, MI USA ,grid.17088.360000 0001 2150 1785Department of Statistics and Probability, Michigan State University, East Lansing, MI USA ,grid.17088.360000 0001 2150 1785Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI USA
| |
Collapse
|
7
|
Strong BW, Pudar J, Thrift AG, de los Campos G, Howard VJ, Hussain M, Reeves MJ. Abstract 28: The Representation of Women in Randomized Clinical Trials of Acute Stroke (2010-2020). Stroke 2021. [DOI: 10.1161/str.52.suppl_1.28] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Introduction:
The inadequate enrollment of women in RCTs represents a threat to trial generalizability and potential inequities in access to novel treatments. We sought to determine whether women were under-enrolled in contemporary acute stroke trials.
Methods:
We searched MEDLINE for completed RCTs published in one of nine major journals between 2010 and 2020. Eligible studies were phase 2 or 3 trials undertaken to test therapeutic interventions within one month of stroke onset. For each trial we calculated the proportion of trial participants that were women (PPW). We used Global Burden of Disease (GBD) data to estimate the expected proportion of strokes occurring in women in the underlying stroke populations (PSW). We matched individual estimates from the GBD data to each trial based on geographic location, year, and stroke type. To quantify disparities, we calculated the enrollment disparity difference (EDD), defined as EDD = PSW - PPW. A positive EDD indicates that women were under-represented in the trial. We used random effects meta-analysis to pool individual EDDs and conducted subgroup analyses.
Results:
We identified 115 trials that met eligibility criteria. The random effects summary EDD was 0.053 (95% CI = 0.040, 0.053), indicating that women were under-enrolled in acute stroke trials by 5% relative to their representation in the underlying stroke population. However, there was substantial between-trial variability in the EDD (I
2
=84.4%). In subgroup analyses, the EDD was similar across subgroups except for stroke type (figure); trials that only included subarachnoid hemorrhages enrolled women in excess of their representation in the underlying population (summary EDD = -0.117 [95% CI = -0.150, -0.084]).
Conclusions:
Overall, women were modestly under-represented in contemporary acute stroke trials compared to their representation among all strokes. Further study is needed to elucidate factors driving sex differences in enrollment between RCTs.
Collapse
Affiliation(s)
- Brent W Strong
- Dept of Epidemiology and Biostatistics, Michigan State Univ, East Lansing, MI
| | | | | | | | | | | | | |
Collapse
|
8
|
de los Campos G, Pook T, Gonzalez-Reymundez A, Simianer H, Mias G, Vazquez AI. ANOVA-HD: Analysis of variance when both input and output layers are high-dimensional. PLoS One 2020; 15:e0243251. [PMID: 33315963 PMCID: PMC7735570 DOI: 10.1371/journal.pone.0243251] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Accepted: 11/17/2020] [Indexed: 11/21/2022] Open
Abstract
Modern genomic data sets often involve multiple data-layers (e.g., DNA-sequence, gene expression), each of which itself can be high-dimensional. The biological processes underlying these data-layers can lead to intricate multivariate association patterns. We propose and evaluate two methods to determine the proportion of variance of an output data set that can be explained by an input data set when both data panels are high dimensional. Our approach uses random-effects models to estimate the proportion of variance of vectors in the linear span of the output set that can be explained by regression on the input set. We consider a method based on an orthogonal basis (Eigen-ANOVA) and one that uses random vectors (Monte Carlo ANOVA, MC-ANOVA) in the linear span of the output set. Using simulations, we show that the MC-ANOVA method gave nearly unbiased estimates. Estimates produced by Eigen-ANOVA were also nearly unbiased, except when the shared variance was very high (e.g., >0.9). We demonstrate the potential insight that can be obtained from the use of MC-ANOVA and Eigen-ANOVA by applying these two methods to the study of multi-locus linkage disequilibrium in chicken (Gallus gallus) genomes and to the assessment of inter-dependencies between gene expression, methylation, and copy-number-variants in data from breast cancer tumors from humans (Homo sapiens). Our analyses reveal that in chicken breeding populations ~50,000 evenly-spaced SNPs are enough to fully capture the span of whole-genome-sequencing genomes. In the study of multi-omic breast cancer data, we found that the span of copy-number-variants can be fully explained using either methylation or gene expression data and that roughly 74% of the variance in gene expression can be predicted from methylation data.
Collapse
Affiliation(s)
- Gustavo de los Campos
- Epidemiology & Biostatistics, Michigan State University, East Lansing, MI, United States of America
- Statistics & Probability, Michigan State University, East Lansing, MI, United States of America
- Institute for Quantitative Health Science and Engineering, East Lansing, MI, United States of America
| | - Torsten Pook
- Department of Animal Sciences, Center for Integrated Breeding Research, University of Goettingen, Goettingen, Germany
| | - Agustin Gonzalez-Reymundez
- Genetics and Genome Sciences Graduate Program, Michigan State University, East Lansing, MI, United States of America
| | - Henner Simianer
- Department of Animal Sciences, Center for Integrated Breeding Research, University of Goettingen, Goettingen, Germany
| | - George Mias
- Institute for Quantitative Health Science and Engineering, East Lansing, MI, United States of America
- Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, United States of America
| | - Ana I. Vazquez
- Epidemiology & Biostatistics, Michigan State University, East Lansing, MI, United States of America
- Institute for Quantitative Health Science and Engineering, East Lansing, MI, United States of America
| |
Collapse
|
9
|
Noble JD, Balmant KM, Dervinis C, de los Campos G, Resende MFR, Kirst M, Barbazuk WB. The Genetic Regulation of Alternative Splicing in Populus deltoides. Front Plant Sci 2020; 11:590. [PMID: 32582229 PMCID: PMC7291814 DOI: 10.3389/fpls.2020.00590] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Accepted: 04/20/2020] [Indexed: 06/11/2023]
Abstract
Alternative splicing (AS) is a mechanism of regulation of the proteome via enabling the production of multiple mRNAs from a single gene. To date, the dynamics of AS and its effects on the protein sequences of individuals in a large and genetically unrelated population of trees have not been investigated. Here we describe the diversity of AS events within a previously genotyped population of 268 individuals of Populus deltoides and their putative downstream functional effects. Using a robust bioinformatics pipeline, the AS events and resulting transcript isoforms were discovered and quantified for each individual in the population. Analysis of the AS revealed that, as expected, most AS isoforms are conserved. However, we also identified a substantial collection of new, unannotated splice junctions and transcript isoforms. Heritability estimates for the expression of transcript isoforms showed that approximately half of the isoforms are heritable. The genetic regulators of these AS isoforms and splice junction usage were then identified using a genome-wide association analysis. The expression of AS isoforms was predominately cis regulated while splice junction usage was generally regulated in trans. Additionally, we identified 696 genes encoding alternatively spliced isoforms that changed putative protein domains relative to the longest protein coding isoform of the gene, and 859 genes exhibiting this same phenomenon relative to the most highly expressed isoform. Finally, we found that 748 genes gained or lost micro-RNA binding sites relative to the longest protein coding isoform of a given gene, while 940 gained or lost micro-RNA binding sites relative to the most highly expressed isoform. These results indicate that a significant fraction of AS events are genetically regulated and that this isoform usage can result in protein domain architecture changes.
Collapse
Affiliation(s)
- Jerald D. Noble
- Plant Molecular and Cellular Biology Graduate Program, University of Florida, Gainesville, FL, United States
| | - Kelly M. Balmant
- School of Forest Resources and Conservation, University of Florida, Gainesville, FL, United States
| | - Christopher Dervinis
- School of Forest Resources and Conservation, University of Florida, Gainesville, FL, United States
| | - Gustavo de los Campos
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, MI, United States
- Department of Statistics and Probability, Michigan State University, East Lansing, MI, United States
| | - Márcio F. R. Resende
- Plant Molecular and Cellular Biology Graduate Program, University of Florida, Gainesville, FL, United States
- Department of Horticultural Science, University of Florida, Gainesville, FL, United States
| | - Matias Kirst
- Plant Molecular and Cellular Biology Graduate Program, University of Florida, Gainesville, FL, United States
- School of Forest Resources and Conservation, University of Florida, Gainesville, FL, United States
- Genetics Institute, University of Florida, Gainesville, FL, United States
| | - William Brad Barbazuk
- Plant Molecular and Cellular Biology Graduate Program, University of Florida, Gainesville, FL, United States
- Genetics Institute, University of Florida, Gainesville, FL, United States
- Department of Biology, University of Florida, Gainesville, FL, United States
- Interdisciplinary Center for Biotechnology Research, University of Florida, Gainesville, FL, United States
| |
Collapse
|
10
|
Cheng HG, Gonzalez-Reymundez A, Li I, Pathak A, Pathak DR, de los Campos G, Vazquez AI. Breast cancer survival and the expression of genes related to alcohol drinking. PLoS One 2020; 15:e0228957. [PMID: 32078659 PMCID: PMC7032692 DOI: 10.1371/journal.pone.0228957] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2019] [Accepted: 01/27/2020] [Indexed: 12/12/2022] Open
Abstract
Breast cancer is the leading cause of cancer-related disease in women. Cumulative evidence supports a causal role of alcohol intake and breast cancer incidence. In this study, we explore the change on expression of genes involved in the biological pathways through which alcohol has been hypothesized to impact breast cancer risk, to shed new insights on possible mechanisms affecting the survival of breast cancer patients. Here, we performed differential expression analysis at individual genes and gene set levels, respectively, across survival and breast cancer subtype data. Information about postdiagnosis breast cancer survival was obtained from 1977 Caucasian female participants in the Molecular Taxonomy of Breast Cancer International Consortium. Expression of 16 genes that have been linked in the literature to the hypothesized alcohol-breast cancer pathways, were examined. We found that the expression of 9 out of 16 genes under study were associated with cancer survival within the first 4 years of diagnosis. Results from gene set analysis confirmed a significant differential expression of these genes as a whole too. Although alcohol consumption is not analyzed, nor available for this dataset, we believe that further study on these genes could provide important information for clinical recommendations about potential impact of alcohol drinking on breast cancer survival.
Collapse
Affiliation(s)
- Hui G. Cheng
- Department of Epidemiology & Biostatistics, Michigan State University, MI, United States of America
| | - Agustin Gonzalez-Reymundez
- Department of Epidemiology & Biostatistics, Michigan State University, MI, United States of America
- The Institute for Quantitative Health Science and Engineering, Michigan State University, MI, United States of America
| | - Irene Li
- Department of Epidemiology & Biostatistics, Michigan State University, MI, United States of America
| | - Ania Pathak
- Department of Epidemiology & Biostatistics, Michigan State University, MI, United States of America
| | - Dorothy R. Pathak
- Department of Epidemiology & Biostatistics, Michigan State University, MI, United States of America
| | - Gustavo de los Campos
- Department of Epidemiology & Biostatistics, Michigan State University, MI, United States of America
- The Institute for Quantitative Health Science and Engineering, Michigan State University, MI, United States of America
| | - Ana Ines Vazquez
- Department of Epidemiology & Biostatistics, Michigan State University, MI, United States of America
- The Institute for Quantitative Health Science and Engineering, Michigan State University, MI, United States of America
| |
Collapse
|
11
|
Toledo-Alvarado H, Vazquez AI, de los Campos G, Tempelman RJ, Gabai G, Cecchinato A, Bittante G. Changes in milk characteristics and fatty acid profile during the estrous cycle in dairy cows. J Dairy Sci 2018; 101:9135-9153. [DOI: 10.3168/jds.2018-14480] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2018] [Accepted: 05/31/2018] [Indexed: 11/19/2022]
|
12
|
Montesinos-López A, Montesinos-López OA, de los Campos G, Crossa J, Burgueño J, Luna-Vazquez FJ. Correction to: Bayesian functional regression as an alternative statistical analysis of high-throughput phenotyping data of modern agriculture. Plant Methods 2018; 14:57. [PMID: 30002724 PMCID: PMC6036691 DOI: 10.1186/s13007-018-0321-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
[This corrects the article DOI: 10.1186/s13007-018-0314-7.].
Collapse
Affiliation(s)
- Abelardo Montesinos-López
- Departamento de Matemáticas, Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara, 44430 Guadalajara, Jalisco Mexico
| | | | - Gustavo de los Campos
- Epidemiology and Biostatistics and Statistics and Probability Departments, Michigan State University, 909 Fee Road, East Lansing, MI 48824 USA
| | - José Crossa
- Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600 Mexico City, Mexico
| | - Juan Burgueño
- Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600 Mexico City, Mexico
| | | |
Collapse
|
13
|
Montesinos-López A, Montesinos-López OA, de los Campos G, Crossa J, Burgueño J, Luna-Vazquez FJ. Bayesian functional regression as an alternative statistical analysis of high-throughput phenotyping data of modern agriculture. Plant Methods 2018; 14:46. [PMID: 29991959 PMCID: PMC5994840 DOI: 10.1186/s13007-018-0314-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2017] [Accepted: 06/01/2018] [Indexed: 05/04/2023]
Abstract
BACKGROUND Modern agriculture uses hyperspectral cameras with hundreds of reflectance data at discrete narrow bands measured in several environments. Recently, Montesinos-López et al. (Plant Methods 13(4):1-23, 2017a. 10.1186/s13007-016-0154-2; Plant Methods 13(62):1-29, 2017b. 10.1186/s13007-017-0212-4) proposed using functional regression analysis (as functional data analyses) to help reduce the dimensionality of the bands and thus decrease the computational cost. The purpose of this paper is to discuss the advantages and disadvantages that functional regression analysis offers when analyzing hyperspectral image data. We provide a brief review of functional regression analysis and examples that illustrate the methodology. We highlight critical elements of model specification: (i) type and number of basis functions, (ii) the degree of the polynomial, and (iii) the methods used to estimate regression coefficients. We also show how functional data analyses can be integrated into Bayesian models. Finally, we include an in-depth discussion of the challenges and opportunities presented by functional regression analysis. RESULTS We used seven model-methods, one with the conventional model (M1), three methods using the B-splines model (M2, M4, and M6) and three methods using the Fourier basis model (M3, M5, and M7). The data set we used comprises 976 wheat lines under irrigated environments with 250 wavelengths. Under a Bayesian Ridge Regression (BRR), we compared the prediction accuracy of the model-methods proposed under different numbers of basis functions, and compared the implementation time (in seconds) of the seven proposed model-methods for different numbers of basis. Our results as well as previously analyzed data (Montesinos-López et al. 2017a, 2017b) support that around 23 basis functions are enough. Concerning the degree of the polynomial in the context of B-splines, degree 3 approximates most of the curves very well. Two satisfactory types of basis are the Fourier basis for period curves and the B-splines model for non-periodic curves. Under nine different basis, the seven method-models showed similar prediction accuracy. Regarding implementation time, results show that the lower the number of basis, the lower the implementation time required. Methods M2, M3, M6 and M7 were around 3.4 times faster than methods M1, M4 and M5. CONCLUSIONS In this study, we promote the use of functional regression modeling for analyzing high-throughput phenotypic data and indicate the advantages and disadvantages of its implementation. In addition, many key elements that are needed to understand and implement this statistical technique appropriately are provided using a real data set. We provide details for implementing Bayesian functional regression using the developed genomic functional regression (GFR) package. In summary, we believe this paper is a good guide for breeders and scientists interested in using functional regression models for implementing prediction models when their data are curves.
Collapse
Affiliation(s)
- Abelardo Montesinos-López
- Departamento de Matemáticas, Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara, 44430 Guadalajara, Jalisco Mexico
| | | | - Gustavo de los Campos
- Epidemiology and Biostatistics and Statistics and Probability Departments, Michigan State University, 909 Fee Road, East Lansing, MI 48824 USA
| | - José Crossa
- Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600 Mexico City, Mexico
| | - Juan Burgueño
- Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600 Mexico City, Mexico
| | | |
Collapse
|
14
|
Toledo-Alvarado H, Vazquez AI, de los Campos G, Tempelman RJ, Bittante G, Cecchinato A. Diagnosing pregnancy status using infrared spectra and milk composition in dairy cows. J Dairy Sci 2018; 101:2496-2505. [DOI: 10.3168/jds.2017-13647] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2017] [Accepted: 11/08/2017] [Indexed: 01/01/2023]
|
15
|
Montesinos-López OA, Montesinos-López A, Crossa J, de los Campos G, Alvarado G, Suchismita M, Rutkoski J, González-Pérez L, Burgueño J. Predicting grain yield using canopy hyperspectral reflectance in wheat breeding data. Plant Methods 2017; 13:4. [PMID: 28053649 PMCID: PMC5209864 DOI: 10.1186/s13007-016-0154-2] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2016] [Accepted: 12/01/2016] [Indexed: 05/21/2023]
Abstract
BACKGROUND Modern agriculture uses hyperspectral cameras to obtain hundreds of reflectance data measured at discrete narrow bands to cover the whole visible light spectrum and part of the infrared and ultraviolet light spectra, depending on the camera. This information is used to construct vegetation indices (VI) (e.g., green normalized difference vegetation index or GNDVI, simple ratio or SRa, etc.) which are used for the prediction of primary traits (e.g., biomass). However, these indices only use some bands and are cultivar-specific; therefore they lose considerable information and are not robust for all cultivars. RESULTS This study proposes models that use all available bands as predictors to increase prediction accuracy; we compared these approaches with eight conventional vegetation indexes (VIs) constructed using only some bands. The data set we used comes from CIMMYT's global wheat program and comprises 1170 genotypes evaluated for grain yield (ton/ha) in five environments (Drought, Irrigated, EarlyHeat, Melgas and Reduced Irrigated); the reflectance data were measured in 250 discrete narrow bands ranging between 392 and 851 nm. The proposed models for the simultaneous analysis of all the bands were ordinal least square (OLS), Bayes B, principal components with Bayes B, functional B-spline, functional Fourier and functional partial least square. The results of these models were compared with the OLS performed using as predictors each of the eight VIs individually and combined. CONCLUSIONS We found that using all bands simultaneously increased prediction accuracy more than using VI alone. The Splines and Fourier models had the best prediction accuracy for each of the nine time-points under study. Combining image data collected at different time-points led to a small increase in prediction accuracy relative to models that use data from a single time-point. Also, using bands with heritabilities larger than 0.5 only in Drought as predictor variables showed improvements in prediction accuracy.
Collapse
Affiliation(s)
- Osval A. Montesinos-López
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600 Mexico City, Mexico
- Facultad de Telemática, Universidad de Colima, 28040 Colima, Colima Mexico
| | - Abelardo Montesinos-López
- Departamento de Estadística, Centro de Investigación en Matemáticas (CIMAT), 36240 Guanajuato, Guanajuato Mexico
| | - José Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600 Mexico City, Mexico
| | - Gustavo de los Campos
- Epidemiology and Biostatistics Department, Michigan State University, 909 Fee Road, East Lansing, MI 48824 USA
| | - Gregorio Alvarado
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600 Mexico City, Mexico
| | - Mondal Suchismita
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600 Mexico City, Mexico
| | - Jessica Rutkoski
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600 Mexico City, Mexico
- International Rice Research Institute, Los Baños Research Center, Los Baños, Laguna Philippines
| | - Lorena González-Pérez
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600 Mexico City, Mexico
| | - Juan Burgueño
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600 Mexico City, Mexico
| |
Collapse
|
16
|
Abstract
Summary
Statistical approaches for testing hypotheses of heterogeneity in fitness functions are needed to accommodate studies of phenotypic selection with repeated sampling across study units, populations or years. In this study, we tested directly for among‐population variation in complex fitness functions and demonstrate a new approach for locating the region of the trait distribution where variation in fitness and traits is greatest.
We modelled heterogeneity in fitness functions among populations by treating regression coefficients of fitness on traits as random variates. We applied random regression using two model specifications, (i) spline‐based curve and (ii) stepwise, to a 2‐year study of selection among 16 populations of the gall wasp, Belonocnema treatae. Log‐likelihood ratio tests of variance components and 10‐fold cross‐validation were used to assess the evidence that selection varied among populations.
Ten‐fold cross‐validation prediction error sums of squares (PSS) indicated that spline‐based fitness functions were population specific and that the strength of evidence for heterogeneity in selection differed between years. Hypothesis testing of variance components from both models was consistent with the PSS results. Both the stepwise model and the local prediction error estimates of spline‐based fitness functions identified the region(s) of the phenotype distribution harbouring the greatest heterogeneity among populations.
The adopted framework advances our understanding of phenotypic selection in natural populations by extending the analysis of spline‐based fitness functions to testing for heterogeneity among study units and isolating the regions of the phenotypic distribution where this variation is most pronounced.
Collapse
Affiliation(s)
- Richard J. Reynolds
- Department of Medicine, Division of Clinical Immunology and
Rheumatology, Department of Biostatistics, University of Alabama at Birmingham,
Birmingham, Alabama 35294; (205-975-9300)
| | - Gustavo de los Campos
- Departments of Epidemiology & Biostatistics, and Statistics,
Michigan State University, East Lansing, MI, 48824, (517-353-8623)
| | - Scott P. Egan
- Department of BioSciences, Rice University, Houston, Texas 77005;
(615-618-6601)
| | - James R. Ott
- Population and Conservation Biology Program, Department of Biology,
Texas State University, San Marcos, Texas 78666, (512-245-2321)
| |
Collapse
|
17
|
Lopez-Cruz M, Crossa J, Bonnett D, Dreisigacker S, Poland J, Jannink JL, Singh RP, Autrique E, de los Campos G. Increased prediction accuracy in wheat breeding trials using a marker × environment interaction genomic selection model. G3 (Bethesda) 2015. [PMID: 25660166 DOI: 10.1534/g3.114.01609] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 04/25/2023]
Abstract
Genomic selection (GS) models use genome-wide genetic information to predict genetic values of candidates of selection. Originally, these models were developed without considering genotype × environment interaction(G×E). Several authors have proposed extensions of the single-environment GS model that accommodate G×E using either covariance functions or environmental covariates. In this study, we model G×E using a marker × environment interaction (M×E) GS model; the approach is conceptually simple and can be implemented with existing GS software. We discuss how the model can be implemented by using an explicit regression of phenotypes on markers or using co-variance structures (a genomic best linear unbiased prediction-type model). We used the M×E model to analyze three CIMMYT wheat data sets (W1, W2, and W3), where more than 1000 lines were genotyped using genotyping-by-sequencing and evaluated at CIMMYT's research station in Ciudad Obregon, Mexico, under simulated environmental conditions that covered different irrigation levels, sowing dates and planting systems. We compared the M×E model with a stratified (i.e., within-environment) analysis and with a standard (across-environment) GS model that assumes that effects are constant across environments (i.e., ignoring G×E). The prediction accuracy of the M×E model was substantially greater of that of an across-environment analysis that ignores G×E. Depending on the prediction problem, the M×E model had either similar or greater levels of prediction accuracy than the stratified analyses. The M×E model decomposes marker effects and genomic values into components that are stable across environments (main effects) and others that are environment-specific (interactions). Therefore, in principle, the interaction model could shed light over which variants have effects that are stable across environments and which ones are responsible for G×E. The data set and the scripts required to reproduce the analysis are publicly available as Supporting Information.
Collapse
Affiliation(s)
- Marco Lopez-Cruz
- Department of Plant, Soil and Microbial Sciences, Michigan State University (MSU), East Lansing, Michigan 4882
| | - Jose Crossa
- International Maize and Wheat Improvement Center(CIMMYT), Mexico D.F., Mexico
| | - David Bonnett
- International Maize and Wheat Improvement Center(CIMMYT), Mexico D.F., Mexico
| | | | - Jesse Poland
- Wheat Genetics Resource Center, Department of Plant Pathology and Department of Agronomy, Kansas State University (KSU), 4011 Throckmorton Hall, Manhattan, Kansas 66506
| | - Jean-Luc Jannink
- USDA-ARS and Department of Plant Breeding and Genetics, Cornell University, Ithaca, New York 14853, and
| | - Ravi P Singh
- International Maize and Wheat Improvement Center(CIMMYT), Mexico D.F., Mexico
| | - Enrique Autrique
- International Maize and Wheat Improvement Center(CIMMYT), Mexico D.F., Mexico
| | - Gustavo de los Campos
- Epidemiology & Biostatistics and Statistics departments, Michigan State University 909 Fee Road, East Lansing, Michigan 48824
| |
Collapse
|
18
|
Berger S, Pérez-Rodríguez P, Veturi Y, Simianer H, de los Campos G. Effectiveness of shrinkage and variable selection methods for the prediction of complex human traits using data from distantly related individuals. Ann Hum Genet 2015; 79:122-35. [PMID: 25600682 PMCID: PMC4428155 DOI: 10.1111/ahg.12099] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2014] [Accepted: 12/03/2014] [Indexed: 02/02/2023]
Abstract
Genome‐wide association studies (GWAS) have detected large numbers of variants associated with complex human traits and diseases. However, the proportion of variance explained by GWAS‐significant single nucleotide polymorphisms has been usually small. This brought interest in the use of whole‐genome regression (WGR) methods. However, there has been limited research on the factors that affect prediction accuracy (PA) of WGRs when applied to human data of distantly related individuals. Here, we examine, using real human genotypes and simulated phenotypes, how trait complexity, marker‐quantitative trait loci (QTL) linkage disequilibrium (LD), and the model used affect the performance of WGRs. Our results indicated that the estimated rate of missing heritability is dependent on the extent of marker‐QTL LD. However, this parameter was not greatly affected by trait complexity. Regarding PA our results indicated that: (a) under perfect marker‐QTL LD WGR can achieve moderately high prediction accuracy, and with simple genetic architectures variable selection methods outperform shrinkage procedures and (b) under imperfect marker‐QTL LD, variable selection methods can achieved reasonably good PA with simple or moderately complex genetic architectures; however, the PA of these methods deteriorated as trait complexity increases and with highly complex traits variable selection and shrinkage methods both performed poorly. This was confirmed with an analysis of human height.
Collapse
Affiliation(s)
- Swetlana Berger
- Animal Breeding and Genetics Group, Department of Animal Sciences, Georg-August-University Goettingen, Albrecht-Thaer-Weg 3, Goettingen, Germany
| | | | | | | | | |
Collapse
|
19
|
Mehta T, Fontaine KR, Keith SW, Bangalore SS, de los Campos G, Bartolucci A, Pajewski NM, Allison DB. Obesity and mortality: are the risks declining? Evidence from multiple prospective studies in the United States. Obes Rev 2014; 15:619-29. [PMID: 24913899 PMCID: PMC4121970 DOI: 10.1111/obr.12191] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/19/2013] [Revised: 03/25/2014] [Accepted: 04/22/2014] [Indexed: 12/18/2022]
Abstract
We evaluated whether the obesity-associated years of life lost (YLL) have decreased over calendar time. We implemented a meta-analysis including only studies with two or more serial body mass index (BMI) assessments at different calendar years. For each BMI category (normal weight: BMI 18.5 to <25 [reference]; overweight: BMI 25 to <30; grade 1 obesity: BMI 30 to <35; and grade 2-3 obesity: BMI ≥ 35), we estimated the YLL change between 1970 and 1990. Because of low sample sizes for African-American, results are reported on Caucasian. Among men aged ≤60 years YLL for grade 1 obesity increased by 0.72 years (P < 0.001) and by 1.02 years (P = 0.01) for grade 2-3 obesity. For men aged >60, YLL for grade 1 obesity decreased by 1.02 years (P < 0.001) and increased by 0.63 years for grade 2-3 obesity (P = 0.63). Among women aged ≤60, YLL for grade 1 obesity decreased by 4.21 years (P < 0.001) and by 4.97 years (P < 0.001) for grade 2-3 obesity. In women aged >60, YLL for grade 1 obesity decreased by 3.98 years (P < 0.001) and by 2.64 years (P = 0.001) for grade 2-3 obesity. Grade 1 obesity's association with decreased longevity has reduced for older Caucasian men. For Caucasian women, there is evidence of a decline in the obesity YLL association across all ages.
Collapse
Affiliation(s)
- Tapan Mehta
- Department of Biostatistics, Office of Energetics &
Nutrition Obesity Research Center, University of Alabama at Birmingham, Birmingham,
AL, USA
| | - Kevin R. Fontaine
- Department of Health Behavior, University of Alabama at
Birmingham, AL, USA
| | - Scott W. Keith
- Division of Biostatistics, Department of Pharmacology and
Experimental Therapeutics, Thomas Jefferson University, Philadelphia, PA, USA
| | - Sai Santosh Bangalore
- Department of Biostatistics, Office of Energetics &
Nutrition Obesity Research Center, University of Alabama at Birmingham, Birmingham,
AL, USA
| | - Gustavo de los Campos
- Department of Biostatistics, Office of Energetics &
Nutrition Obesity Research Center, University of Alabama at Birmingham, Birmingham,
AL, USA
| | - Alfred Bartolucci
- Department of Biostatistics, Office of Energetics &
Nutrition Obesity Research Center, University of Alabama at Birmingham, Birmingham,
AL, USA
| | - Nicholas M. Pajewski
- Department of Biostatistics, Office of Energetics &
Nutrition Obesity Research Center, University of Alabama at Birmingham, Birmingham,
AL, USA
- Department of Biostatistical Sciences, Wake Forest University Health
Sciences, Winston-Salem, NC, USA
| | - David B. Allison
- Department of Biostatistics, Office of Energetics &
Nutrition Obesity Research Center, University of Alabama at Birmingham, Birmingham,
AL, USA
| |
Collapse
|
20
|
Jarquín D, Crossa J, Lacaze X, Du Cheyron P, Daucourt J, Lorgeou J, Piraux F, Guerreiro L, Pérez P, Calus M, Burgueño J, de los Campos G. A reaction norm model for genomic selection using high-dimensional genomic and environmental data. Theor Appl Genet 2014; 127:595-607. [PMID: 24337101 PMCID: PMC3931944 DOI: 10.1007/s00122-013-2243-1] [Citation(s) in RCA: 252] [Impact Index Per Article: 25.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2013] [Accepted: 11/20/2013] [Indexed: 05/18/2023]
Abstract
New methods that incorporate the main and interaction effects of high-dimensional markers and of high-dimensional environmental covariates gave increased prediction accuracy of grain yield in wheat across and within environments. In most agricultural crops the effects of genes on traits are modulated by environmental conditions, leading to genetic by environmental interaction (G × E). Modern genotyping technologies allow characterizing genomes in great detail and modern information systems can generate large volumes of environmental data. In principle, G × E can be accounted for using interactions between markers and environmental covariates (ECs). However, when genotypic and environmental information is high dimensional, modeling all possible interactions explicitly becomes infeasible. In this article we show how to model interactions between high-dimensional sets of markers and ECs using covariance functions. The model presented here consists of (random) reaction norm where the genetic and environmental gradients are described as linear functions of markers and of ECs, respectively. We assessed the proposed method using data from Arvalis, consisting of 139 wheat lines genotyped with 2,395 SNPs and evaluated for grain yield over 8 years and various locations within northern France. A total of 68 ECs, defined based on five phases of the phenology of the crop, were used in the analysis. Interaction terms accounted for a sizable proportion (16 %) of the within-environment yield variance, and the prediction accuracy of models including interaction terms was substantially higher (17-34 %) than that of models based on main effects only. Breeding for target environmental conditions has become a central priority of most breeding programs. Methods, like the one presented here, that can capitalize upon the wealth of genomic and environmental information available, will become increasingly important.
Collapse
Affiliation(s)
- Diego Jarquín
- Department of Biostatistics, University of Alabama at Birmingham, 1665 University Boulevard, 327L Ryals Public Health Building, Birmingham, AL 35216 USA
- Present Address: Agronomy and Horticulture Department, University of Nebraska, 321 Keim Hall, Lincoln, NE USA 68583-0915
| | - José Crossa
- Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600 Mexico, D.F., México
| | - Xavier Lacaze
- Arvalis Institut du végétal, Station Inter-institut, 6 chemin de la côte vieille, 31450 Baziège, France
| | - Philippe Du Cheyron
- Arvalis Institut du végétal, IBP Université Paris Sud, Rue de Noetzlin, Bât. 630, 91405 Orsay, France
| | - Joëlle Daucourt
- Arvalis Institut du végétal, IBP Université Paris Sud, Rue de Noetzlin, Bât. 630, 91405 Orsay, France
| | - Josiane Lorgeou
- Arvalis Institut du vegetal, Station expérimentale, 91720 Boigneville, France
| | - François Piraux
- Arvalis Institut du vegetal, Station expérimentale, 91720 Boigneville, France
| | - Laurent Guerreiro
- Arvalis Institut du végétal, 3 rue Joseph et Marie Hackin, 75116 Paris, France
| | - Paulino Pérez
- Department of Biostatistics, University of Alabama at Birmingham, 1665 University Boulevard, 327L Ryals Public Health Building, Birmingham, AL 35216 USA
- Colegio de Postgraduados, Montecillo, Edo. de México, Mexico, México
| | - Mario Calus
- Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, P.O. Box 135, 6700 AC Wageningen, The Netherlands
| | - Juan Burgueño
- Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600 Mexico, D.F., México
| | - Gustavo de los Campos
- Department of Biostatistics, University of Alabama at Birmingham, 1665 University Boulevard, 327L Ryals Public Health Building, Birmingham, AL 35216 USA
| |
Collapse
|
21
|
Klimentidis YC, Vazquez AI, de los Campos G, Allison DB, Dransfield MT, Thannickal VJ. Heritability of pulmonary function estimated from pedigree and whole-genome markers. Front Genet 2013; 4:174. [PMID: 24058366 PMCID: PMC3766834 DOI: 10.3389/fgene.2013.00174] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2013] [Accepted: 08/22/2013] [Indexed: 11/13/2022] Open
Abstract
Asthma and chronic obstructive pulmonary disease (COPD) are major worldwide health problems. Pulmonary function testing is a useful diagnostic tool for these diseases, and is known to be influenced by genetic and environmental factors. Previous studies have demonstrated that a substantial proportion of the variation in pulmonary function phenotypes can be explained by familial relationships. The availability of whole-genome single nucleotide polymorphism (SNP) data enables us to further evaluate the extent to which genetic factors account for variation in pulmonary function and to compare pedigree- to SNP-based estimates of heritability. Here, we employ methods developed in the animal breeding field to estimate the heritability of forced expiratory volume in one second (FEV1), forced vital capacity (FVC), and the ratio of these two measures (FEV1/FVC) among subjects in the Framingham Heart Study dataset. We compare heritability estimates based on pedigree-based relationships to those based on genome-wide SNPs. We find that, in a family-based study, estimates of heritability using SNP data are nearly identical to estimates based on pedigree information, and range from 0.50 for FEV1 to 0.66 for FEV1/FVC. Therefore, we conclude that genetic factors account for a sizable proportion of inter-individual differences in pulmonary function, and that estimates of heritability based on SNP data are nearly identical to estimates based on pedigree data. Finally, our findings suggest a higher heritability for FEV1/FVC compared to either FEV1 or FVC.
Collapse
Affiliation(s)
- Yann C. Klimentidis
- Division of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health, University of ArizonaTucson, AZ, USA
| | - Ana I. Vazquez
- Department of Biostatistics, Section on Statistical Genetics, University of Alabama at BirminghamBirmingham, AL, USA
| | - Gustavo de los Campos
- Department of Biostatistics, Section on Statistical Genetics, University of Alabama at BirminghamBirmingham, AL, USA
| | - David B. Allison
- Office of Energetics, School of Public Health, University of Alabama at BirminghamBirmingham, AL, USA
| | - Mark T. Dransfield
- Allergy and Critical Care Medicine, Department of Medicine, Division of Pulmonary, University of Alabama at BirminghamBirmingham, AL, USA
| | - Victor J. Thannickal
- Allergy and Critical Care Medicine, Department of Medicine, Division of Pulmonary, University of Alabama at BirminghamBirmingham, AL, USA
| |
Collapse
|
22
|
Robertson HT, de los Campos G, Allison DB. Turning the analysis of obesity-mortality associations upside down: modeling years of life lost through conditional distributions. Obesity (Silver Spring) 2013; 21:398-404. [PMID: 23404823 PMCID: PMC3610864 DOI: 10.1002/oby.20019] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/06/2011] [Accepted: 06/24/2012] [Indexed: 11/10/2022]
Abstract
OBJECTIVE We demonstrate the utility of parametric survival analysis. The analysis of longevity as a function of risk factors such as body mass index (BMI; kg/m(2) ), activity levels, and dietary factors is a mainstay of obesity research. Modeling survival through hazard functions, relative risks, or odds of dying with methods such as Cox proportional hazards or logistic regression are the most common approaches and have many advantages. However, they also have disadvantages in terms of the ease of interpretability, especially for non-statisticians; the need for additional data to convert parameter estimates to estimates of years of life lost (YLL); debates about the appropriate time scale in the model; and an inability to estimate median survival time when the censoring rate is too high. DESIGN AND METHODS We will conduct parametric survival analyses with multiple distributions, including distributions that are known to be poor fits (Gaussian), as well as a newly discovered "Compressed Gaussian"'' distribution. RESULTS Parametric survival analysis models were able to accurately estimate median survival times in a population-based data set of 15,703 individuals, even for distributions that were not good fits and the censoring rate was high, due to the central limit theorem. CONCLUSIONS Parametric survival models are able to provide more direct answers, and in our analysis of an obesity-related data set, gave consistent YLL estimates regardless of the distribution used. We recommend increased consideration of parametric survival models in chronic disease and risk factor epidemiology.
Collapse
Affiliation(s)
- Henry T Robertson
- Biostatistics Department, University of Alabama at Birmingham, Birmingham, USA.
| | | | | |
Collapse
|
23
|
Vazquez AI, de los Campos G, Klimentidis YC, Rosa GJM, Gianola D, Yi N, Allison DB. A comprehensive genetic approach for improving prediction of skin cancer risk in humans. Genetics 2012; 192:1493-502. [PMID: 23051645 PMCID: PMC3512154 DOI: 10.1534/genetics.112.141705] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2012] [Accepted: 09/07/2012] [Indexed: 01/09/2023] Open
Abstract
Prediction of genetic risk for disease is needed for preventive and personalized medicine. Genome-wide association studies have found unprecedented numbers of variants associated with complex human traits and diseases. However, these variants explain only a small proportion of genetic risk. Mounting evidence suggests that many traits, relevant to public health, are affected by large numbers of small-effect genes and that prediction of genetic risk to those traits and diseases could be improved by incorporating large numbers of markers into whole-genome prediction (WGP) models. We developed a WGP model incorporating thousands of markers for prediction of skin cancer risk in humans. We also considered other ways of incorporating genetic information into prediction models, such as family history or ancestry (using principal components, PCs, of informative markers). Prediction accuracy was evaluated using the area under the receiver operating characteristic curve (AUC) estimated in a cross-validation. Incorporation of genetic information (i.e., familial relationships, PCs, or WGP) yielded a significant increase in prediction accuracy: from an AUC of 0.53 for a baseline model that accounted for nongenetic covariates to AUCs of 0.58 (pedigree), 0.62 (PCs), and 0.64 (WGP). In summary, prediction of skin cancer risk could be improved by considering genetic information and using a large number of single-nucleotide polymorphisms (SNPs) in a WGP model, which allows for the detection of patterns of genetic risk that are above and beyond those that can be captured using family history. We discuss avenues for improving prediction accuracy and speculate on the possible use of WGP to prospectively identify individuals at high risk.
Collapse
Affiliation(s)
- Ana I Vazquez
- Section on Statistical Genetics, Department of Biostatistics, University of Alabama, Birmingham, AL 35294, USA.
| | | | | | | | | | | | | |
Collapse
|
24
|
Abstract
Unaccounted population stratification can lead to spurious associations in genome-wide association studies (GWAS) and in this context several methods have been proposed to deal with this problem. An alternative line of research uses whole-genome random regression (WGRR) models that fit all markers simultaneously. Important objectives in WGRR studies are to estimate the proportion of variance accounted for by the markers, the effect of individual markers, prediction of genetic values for complex traits, and prediction of genetic risk of diseases. Proposals to account for stratification in this context are unsatisfactory. Here we address this problem and describe a reparameterization of a WGRR model, based on an eigenvalue decomposition, for simultaneous inference of parameters and unobserved population structure. This allows estimation of genomic parameters with and without inclusion of marker-derived eigenvectors that account for stratification. The method is illustrated with grain yield in wheat typed for 1279 genetic markers, and with height, HDL cholesterol and systolic blood pressure from the British 1958 cohort study typed for 1 million SNP genotypes. Both sets of data show signs of population structure but with different consequences on inferences. The method is compared to an advocated approach consisting of including eigenvectors as fixed-effect covariates in a WGRR model. We show that this approach, used in the context of WGRR models, is ill posed and illustrate the advantages of the proposed model. In summary, our method permits a unified approach to the study of population structure and inference of parameters, is computationally efficient, and is easy to implement.
Collapse
Affiliation(s)
- Luc Janss
- Department of Molecular Biology and Genetics, Aarhus University, DK-8830 Tjele, Denmark
| | - Gustavo de los Campos
- Section on Statistical Genetics, Biostatistics, University of Alabama, Birmingham, Alabama 35294, and
| | - Nuala Sheehan
- Department of Health Sciences and Department of Genetics, University of Leicester, Leicester LE1 7RH, United Kingdom
| | - Daniel Sorensen
- Department of Molecular Biology and Genetics, Aarhus University, DK-8830 Tjele, Denmark
| |
Collapse
|
25
|
Abstract
Genetic factors are believed to account for 25% of the interindividual differences in Years of Life (YL) among humans. However, the genetic loci that have thus far been found to be associated with YL explain a very small proportion of the expected genetic variation in this trait, perhaps reflecting the complexity of the trait and the limitations of traditional association studies when applied to traits affected by a large number of small-effect genes. Using data from the Framingham Heart Study and statistical methods borrowed largely from the field of animal genetics (whole-genome prediction, WGP), we developed a WGP model for the study of YL and evaluated the extent to which thousands of genetic variants across the genome examined simultaneously can be used to predict interindividual differences in YL. We find that a sizable proportion of differences in YL--which were unexplained by age at entry, sex, smoking and BMI--can be accounted for and predicted using WGP methods. The contribution of genomic information to prediction accuracy was even higher than that of smoking and body mass index (BMI) combined; two predictors that are considered among the most important life-shortening factors. We evaluated the impacts of familial relationships and population structure (as described by the first two marker-derived principal components) and concluded that in our dataset population structure explained partially, but not fully the gains in prediction accuracy obtained with WGP. Further inspection of prediction accuracies by age at death indicated that most of the gains in predictive ability achieved with WGP were due to the increased accuracy of prediction of early mortality, perhaps reflecting the ability of WGP to capture differences in genetic risk to deadly diseases such as cancer, which are most often responsible for early mortality in our sample.
Collapse
Affiliation(s)
- Gustavo de los Campos
- Section on Statistical Genetics, Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Alabama, United States of America.
| | | | | | | |
Collapse
|
26
|
Makowsky R, Pajewski NM, Klimentidis YC, Vazquez AI, Duarte CW, Allison DB, de los Campos G. Beyond missing heritability: prediction of complex traits. PLoS Genet 2011; 7:e1002051. [PMID: 21552331 PMCID: PMC3084207 DOI: 10.1371/journal.pgen.1002051] [Citation(s) in RCA: 210] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2010] [Accepted: 03/02/2011] [Indexed: 01/25/2023] Open
Abstract
Despite rapid advances in genomic technology, our ability to account for phenotypic variation using genetic information remains limited for many traits. This has unfortunately resulted in limited application of genetic data towards preventive and personalized medicine, one of the primary impetuses of genome-wide association studies. Recently, a large proportion of the "missing heritability" for human height was statistically explained by modeling thousands of single nucleotide polymorphisms concurrently. However, it is currently unclear how gains in explained genetic variance will translate to the prediction of yet-to-be observed phenotypes. Using data from the Framingham Heart Study, we explore the genomic prediction of human height in training and validation samples while varying the statistical approach used, the number of SNPs included in the model, the validation scheme, and the number of subjects used to train the model. In our training datasets, we are able to explain a large proportion of the variation in height (h(2) up to 0.83, R(2) up to 0.96). However, the proportion of variance accounted for in validation samples is much smaller (ranging from 0.15 to 0.36 depending on the degree of familial information used in the training dataset). While such R(2) values vastly exceed what has been previously reported using a reduced number of pre-selected markers (<0.10), given the heritability of the trait (∼ 0.80), substantial room for improvement remains.
Collapse
Affiliation(s)
- Robert Makowsky
- Department of Biostatistics, University of Alabama at Birmingham, Alabama, United States of America.
| | | | | | | | | | | | | |
Collapse
|
27
|
Rosa GJM, Valente BD, de los Campos G, Wu XL, Gianola D, Silva MA. Inferring causal phenotype networks using structural equation models. Genet Sel Evol 2011; 43:6. [PMID: 21310061 PMCID: PMC3056759 DOI: 10.1186/1297-9686-43-6] [Citation(s) in RCA: 80] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2010] [Accepted: 02/10/2011] [Indexed: 01/14/2023] Open
Abstract
Phenotypic traits may exert causal effects between them. For example, on the one hand, high yield in dairy cows may increase the liability to certain diseases and, on the other hand, the incidence of a disease may affect yield negatively. Likewise, the transcriptome may be a function of the reproductive status in mammals and the latter may depend on other physiological variables. Knowledge of phenotype networks describing such interrelationships can be used to predict the behavior of complex systems, e.g. biological pathways underlying complex traits such as diseases, growth and reproduction. Structural Equation Models (SEM) can be used to study recursive and simultaneous relationships among phenotypes in multivariate systems such as genetical genomics, system biology, and multiple trait models in quantitative genetics. Hence, SEM can produce an interpretation of relationships among traits which differs from that obtained with traditional multiple trait models, in which all relationships are represented by symmetric linear associations among random variables, such as covariances and correlations. In this review, we discuss the application of SEM and related techniques for the study of multiple phenotypes. Two basic scenarios are considered, one pertaining to genetical genomics studies, in which QTL or molecular marker information is used to facilitate causal inference, and another related to quantitative genetic analysis in livestock, in which only phenotypic and pedigree information is available. Advantages and limitations of SEM compared to traditional approaches commonly used for the analysis of multiple traits, as well as some indication of future research in this area are presented in a concluding section.
Collapse
Affiliation(s)
- Guilherme J M Rosa
- Department of Animal Sciences, University of Wisconsin-Madison, Madison, WI 53706, USA.
| | | | | | | | | | | |
Collapse
|
28
|
de los Campos G, Gianola D, Allison DB. Predicting genetic predisposition in humans: the promise of whole-genome markers. Nat Rev Genet 2010; 11:880-6. [DOI: 10.1038/nrg2898] [Citation(s) in RCA: 211] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
|
29
|
de Maturana EL, de los Campos G, Wu XL, Gianola D, Weigel KA, Rosa GJM. Modeling relationships between calving traits: a comparison between standard and recursive mixed models. Genet Sel Evol 2010; 42:1. [PMID: 20100345 PMCID: PMC2830933 DOI: 10.1186/1297-9686-42-1] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2009] [Accepted: 01/25/2010] [Indexed: 11/26/2022] Open
Abstract
Background The use of structural equation models for the analysis of recursive and simultaneous relationships between phenotypes has become more popular recently. The aim of this paper is to illustrate how these models can be applied in animal breeding to achieve parameterizations of different levels of complexity and, more specifically, to model phenotypic recursion between three calving traits: gestation length (GL), calving difficulty (CD) and stillbirth (SB). All recursive models considered here postulate heterogeneous recursive relationships between GL and liabilities to CD and SB, and between liability to CD and liability to SB, depending on categories of GL phenotype. Methods Four models were compared in terms of goodness of fit and predictive ability: 1) standard mixed model (SMM), a model with unstructured (co)variance matrices; 2) recursive mixed model 1 (RMM1), assuming that residual correlations are due to the recursive relationships between phenotypes; 3) RMM2, assuming that correlations between residuals and contemporary groups are due to recursive relationships between phenotypes; and 4) RMM3, postulating that the correlations between genetic effects, contemporary groups and residuals are due to recursive relationships between phenotypes. Results For all the RMM considered, the estimates of the structural coefficients were similar. Results revealed a nonlinear relationship between GL and the liabilities both to CD and to SB, and a linear relationship between the liabilities to CD and SB. Differences in terms of goodness of fit and predictive ability of the models considered were negligible, suggesting that RMM3 is plausible. Conclusions The applications examined in this study suggest the plausibility of a nonlinear recursive effect from GL onto CD and SB. Also, the fact that the most restrictive model RMM3, which assumes that the only cause of correlation is phenotypic recursion, performs as well as the others indicates that the phenotypic recursion may be an important cause of the observed patterns of genetic and environmental correlations.
Collapse
|
30
|
de los Campos G, Gianola D. Factor analysis models for structuring covariance matrices of additive genetic effects: a Bayesian implementation. Genet Sel Evol 2007. [DOI: 10.1051/gse:20070016] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|