Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Total Articles: 30 (from Reference Citation Analysis)
Article PDFs: 12
Cited by > 0: 24
Searched Name: Gustavo de los Campos

Ricker B, Castellanos Franco EA, de los Campos G, Pelled G, Gilad AA. A conserved phenylalanine motif among Teleost fish provides insight for improving electromagnetic perception. bioRxiv 2024:2024.04.04.588096. [PMID: 38617371 PMCID: PMC11014636 DOI: 10.1101/2024.04.04.588096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/16/2024]

Valente BD, de los Campos G, Grueneberg A, Chen CY, Ros-Freixedes R, Herring WO. Using residual regressions to quantify and map signal leakage in genomic prediction. Genet Sel Evol 2023;55:57. [PMID: 37550618 PMCID: PMC10405418 DOI: 10.1186/s12711-023-00830-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/28/2022] [Accepted: 07/12/2023] [Indexed: 08/09/2023]

Abstract

BACKGROUND

Most genomic prediction applications in animal breeding use genotypes with tens of thousands of single nucleotide polymorphisms (SNPs). However, modern sequencing technologies and imputation algorithms can generate ultra-high-density genotypes (including millions of SNPs) at an affordable cost. Empirical studies have not produced clear evidence that using ultra-high-density genotypes can significantly improve prediction accuracy. However, (whole-genome) prediction accuracy is not very informative about the ability of a model to capture the genetic signals from specific genomic regions. To address this problem, we propose a simple methodology that detects chromosome regions for which a specific model (e.g., single-step genomic best linear unbiased prediction (ssGBLUP)) may fail to fully capture the genetic signal present in such segments, a phenomenon that we refer to as signal leakage. We propose to detect regions with evidence of signal leakage by testing the association of residuals from a pedigree or a genomic model with SNP genotypes. We discuss how this approach can be used to map regions with signals that are poorly captured by a model and to identify strategies to fix those problems (e.g., using a different prior or increasing marker density). Finally, we used the proposed approach to scan for signal leakage in different models (pedigree-based, ssGBLUP, and various Bayesian models) applied to growth-related phenotypes (average daily gain and backfat thickness) in pigs.

RESULTS

We report widespread evidence of signal leakage for pedigree-based models. Including a percentage of animals with SNP data in ssGBLUP reduced the extent of signal leakage. However, local peaks of missed signals remained in some regions, even when all animals were genotyped. Using variable selection priors solves leakage points that are caused by excessive shrinkage of marker effects. Nevertheless, these models still miss signals in some regions due to low linkage disequilibrium between the SNPs on the array used and causal variants. Thus, we discuss how such problems could be addressed by adding sequence SNPs from those regions to the prediction model.

CONCLUSIONS

Residual single-marker regression analysis is a simple approach that can be used to detect regional genomic signals that are poorly captured by a model and to indicate ways to fix such problems.
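
The residual single-marker regression described in this abstract can be illustrated with a minimal sketch. It assumes the residuals have already been obtained from some fitted pedigree or genomic model, and it uses a plain ordinary least squares fit per SNP; the function name `single_marker_scan` and the toy data are hypothetical, and the published analysis may use a different test statistic.

```python
import math

def single_marker_scan(residuals, genotypes):
    """Regress model residuals on each SNP separately (simple OLS).

    residuals: one float per animal (assumed to come from a fitted model)
    genotypes: one row per animal, each row holding allele counts (0/1/2)
    Requires more than two animals. Returns (slope, t_statistic) per SNP.
    """
    n = len(residuals)
    mean_r = sum(residuals) / n
    results = []
    for j in range(len(genotypes[0])):
        x = [row[j] for row in genotypes]
        mean_x = sum(x) / n
        sxx = sum((xi - mean_x) ** 2 for xi in x)
        if sxx == 0:
            # Monomorphic SNP: no association test is possible.
            results.append((0.0, 0.0))
            continue
        sxy = sum((xi - mean_x) * (ri - mean_r) for xi, ri in zip(x, residuals))
        slope = sxy / sxx
        intercept = mean_r - slope * mean_x
        sse = sum((ri - intercept - slope * xi) ** 2 for xi, ri in zip(x, residuals))
        se = math.sqrt(sse / (n - 2) / sxx)
        if se > 0:
            t_stat = slope / se
        else:
            t_stat = 0.0 if slope == 0 else math.copysign(math.inf, slope)
        results.append((slope, t_stat))
    return results

# Hypothetical toy data: 6 animals, 2 SNPs.
toy_residuals = [0.5, -0.2, 1.1, -0.9, 0.1, 1.4]
toy_genotypes = [[1, 0], [0, 1], [2, 1], [0, 2], [1, 0], [2, 1]]
scan = single_marker_scan(toy_residuals, toy_genotypes)
```

A large absolute t-statistic for a SNP suggests that the residuals still carry signal associated with that marker, which is the pattern the abstract calls signal leakage.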

Ejima K, Liu N, Mestre LM, de los Campos G, Allison DB. Conditioning on parental mating types can reduce necessary assumptions for Mendelian randomization. Front Genet 2023;14:1014014. [PMID: 36950138 PMCID: PMC10025466 DOI: 10.3389/fgene.2023.1014014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/08/2022] [Accepted: 02/17/2023] [Indexed: 03/08/2023]

Hansen PB, Ruud AK, de los Campos G, Malinowska M, Nagy I, Svane SF, Thorup-Kristensen K, Jensen JD, Krusell L, Asp T. Integration of DNA Methylation and Transcriptome Data Improves Complex Trait Prediction in Hordeum vulgare. Plants 2022;11:plants11172190. [PMID: 36079572 PMCID: PMC9459846 DOI: 10.3390/plants11172190] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 05/23/2022] [Revised: 08/19/2022] [Accepted: 08/21/2022] [Indexed: 11/30/2022]

Schrauf MF, de los Campos G, Munilla S. Comparing Genomic Prediction Models by Means of Cross Validation. Front Plant Sci 2021;12:734512. [PMID: 34868117 PMCID: PMC8639521 DOI: 10.3389/fpls.2021.734512] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 07/01/2021] [Accepted: 10/26/2021] [Indexed: 06/13/2023]

Lopez-Cruz M, Beyene Y, Gowda M, Crossa J, Pérez-Rodríguez P, de los Campos G. Multi-generation genomic prediction of maize yield using parametric and non-parametric sparse selection indices. Heredity (Edinb) 2021;127:423-432. [PMID: 34564692 PMCID: PMC8551287 DOI: 10.1038/s41437-021-00474-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 04/19/2021] [Revised: 09/10/2021] [Accepted: 09/11/2021] [Indexed: 02/07/2023]

Strong BW, Pudar J, Thrift AG, de los Campos G, Howard VJ, Hussain M, Reeves MJ. Abstract 28: The Representation of Women in Randomized Clinical Trials of Acute Stroke (2010-2020). Stroke 2021. [DOI: 10.1161/str.52.suppl_1.28] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/16/2022]

Abstract

Introduction: The inadequate enrollment of women in RCTs represents a threat to trial generalizability and potential inequities in access to novel treatments. We sought to determine whether women were under-enrolled in contemporary acute stroke trials. Methods: We searched MEDLINE for completed RCTs published in one of nine major journals between 2010 and 2020. Eligible studies were phase 2 or 3 trials undertaken to test therapeutic interventions within one month of stroke onset. For each trial we calculated the proportion of trial participants that were women (PPW). We used Global Burden of Disease (GBD) data to estimate the expected proportion of strokes occurring in women in the underlying stroke populations (PSW). We matched individual estimates from the GBD data to each trial based on geographic location, year, and stroke type. To quantify disparities, we calculated the enrollment disparity difference (EDD), defined as EDD = PSW - PPW. A positive EDD indicates that women were under-represented in the trial. We used random effects meta-analysis to pool individual EDDs and conducted subgroup analyses. Results: We identified 115 trials that met eligibility criteria. The random effects summary EDD was 0.053 (95% CI = 0.040, 0.053), indicating that women were under-enrolled in acute stroke trials by 5% relative to their representation in the underlying stroke population. However, there was substantial between-trial variability in the EDD (I² = 84.4%). In subgroup analyses, the EDD was similar across subgroups except for stroke type (figure); trials that only included subarachnoid hemorrhages enrolled women in excess of their representation in the underlying population (summary EDD = -0.117 [95% CI = -0.150, -0.084]). Conclusions: Overall, women were modestly under-represented in contemporary acute stroke trials compared to their representation among all strokes. Further study is needed to elucidate factors driving sex differences in enrollment between RCTs.
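
The EDD definition and the pooling step in this abstract can be sketched as follows. The abstract does not say which random-effects estimator was used, so the DerSimonian-Laird estimator here is an assumption, as are the function names and the per-trial variances, which would have to be supplied by the analyst.

```python
def edd(psw, ppw):
    """Enrollment disparity difference as defined in the abstract: PSW - PPW."""
    return psw - ppw

def pool_random_effects(effects, variances):
    """Pool per-trial effects with a DerSimonian-Laird random-effects model.

    The choice of DerSimonian-Laird is an assumption; the abstract only says
    "random effects meta-analysis". Returns (pooled, se, tau2, i2).
    """
    k = len(effects)
    w = [1.0 / v for v in variances]          # inverse-variance weights
    sw = sum(w)
    fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sw
    # Cochran's Q: weighted squared deviations from the fixed-effect mean.
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, effects))
    c = sw - sum(wi ** 2 for wi in w) / sw
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0
    w_re = [1.0 / (v + tau2) for v in variances]
    pooled = sum(wi * yi for wi, yi in zip(w_re, effects)) / sum(w_re)
    se = (1.0 / sum(w_re)) ** 0.5
    # I-squared: share of total variability attributed to heterogeneity.
    i2 = max(0.0, (q - (k - 1)) / q) if q > 0 else 0.0
    return pooled, se, tau2, i2

# Hypothetical example: three trials with identical EDDs.
example = pool_random_effects([0.05, 0.05, 0.05], [0.001, 0.002, 0.004])
```

With a positive `edd` value meaning women were under-represented, the pooled estimate keeps the same sign convention as the per-trial values.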

de los Campos G, Pook T, Gonzalez-Reymundez A, Simianer H, Mias G, Vazquez AI. ANOVA-HD: Analysis of variance when both input and output layers are high-dimensional. PLoS One 2020;15:e0243251. [PMID: 33315963 PMCID: PMC7735570 DOI: 10.1371/journal.pone.0243251] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 06/22/2020] [Accepted: 11/17/2020] [Indexed: 11/21/2022]

Noble JD, Balmant KM, Dervinis C, de los Campos G, Resende MFR, Kirst M, Barbazuk WB. The Genetic Regulation of Alternative Splicing in Populus deltoides. Front Plant Sci 2020;11:590. [PMID: 32582229 PMCID: PMC7291814 DOI: 10.3389/fpls.2020.00590] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 01/08/2020] [Accepted: 04/20/2020] [Indexed: 06/11/2023]

Abstract

Alternative splicing (AS) is a mechanism of regulation of the proteome via enabling the production of multiple mRNAs from a single gene. To date, the dynamics of AS and its effects on the protein sequences of individuals in a large and genetically unrelated population of trees have not been investigated. Here we describe the diversity of AS events within a previously genotyped population of 268 individuals of Populus deltoides and their putative downstream functional effects. Using a robust bioinformatics pipeline, the AS events and resulting transcript isoforms were discovered and quantified for each individual in the population. Analysis of the AS revealed that, as expected, most AS isoforms are conserved. However, we also identified a substantial collection of new, unannotated splice junctions and transcript isoforms. Heritability estimates for the expression of transcript isoforms showed that approximately half of the isoforms are heritable. The genetic regulators of these AS isoforms and splice junction usage were then identified using a genome-wide association analysis. The expression of AS isoforms was predominately cis regulated while splice junction usage was generally regulated in trans. Additionally, we identified 696 genes encoding alternatively spliced isoforms that changed putative protein domains relative to the longest protein coding isoform of the gene, and 859 genes exhibiting this same phenomenon relative to the most highly expressed isoform. Finally, we found that 748 genes gained or lost micro-RNA binding sites relative to the longest protein coding isoform of a given gene, while 940 gained or lost micro-RNA binding sites relative to the most highly expressed isoform. These results indicate that a significant fraction of AS events are genetically regulated and that this isoform usage can result in protein domain architecture changes.

Cheng HG, Gonzalez-Reymundez A, Li I, Pathak A, Pathak DR, de los Campos G, Vazquez AI. Breast cancer survival and the expression of genes related to alcohol drinking. PLoS One 2020;15:e0228957. [PMID: 32078659 PMCID: PMC7032692 DOI: 10.1371/journal.pone.0228957] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 04/25/2019] [Accepted: 01/27/2020] [Indexed: 12/12/2022]

Toledo-Alvarado H, Vazquez AI, de los Campos G, Tempelman RJ, Gabai G, Cecchinato A, Bittante G. Changes in milk characteristics and fatty acid profile during the estrous cycle in dairy cows. J Dairy Sci 2018;101:9135-9153. [DOI: 10.3168/jds.2018-14480] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Received: 01/23/2018] [Accepted: 05/31/2018] [Indexed: 11/19/2022]

Montesinos-López A, Montesinos-López OA, de los Campos G, Crossa J, Burgueño J, Luna-Vazquez FJ. Correction to: Bayesian functional regression as an alternative statistical analysis of high-throughput phenotyping data of modern agriculture. Plant Methods 2018;14:57. [PMID: 30002724 PMCID: PMC6036691 DOI: 10.1186/s13007-018-0321-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/08/2023]

Montesinos-López A, Montesinos-López OA, de los Campos G, Crossa J, Burgueño J, Luna-Vazquez FJ. Bayesian functional regression as an alternative statistical analysis of high-throughput phenotyping data of modern agriculture. Plant Methods 2018;14:46. [PMID: 29991959 PMCID: PMC5994840 DOI: 10.1186/s13007-018-0314-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 11/20/2017] [Accepted: 06/01/2018] [Indexed: 05/04/2023]

Abstract

BACKGROUND

Modern agriculture uses hyperspectral cameras with hundreds of reflectance data at discrete narrow bands measured in several environments. Recently, Montesinos-López et al. (Plant Methods 13(4):1-23, 2017a. 10.1186/s13007-016-0154-2; Plant Methods 13(62):1-29, 2017b. 10.1186/s13007-017-0212-4) proposed using functional regression analysis (as functional data analyses) to help reduce the dimensionality of the bands and thus decrease the computational cost. The purpose of this paper is to discuss the advantages and disadvantages that functional regression analysis offers when analyzing hyperspectral image data. We provide a brief review of functional regression analysis and examples that illustrate the methodology. We highlight critical elements of model specification: (i) type and number of basis functions, (ii) the degree of the polynomial, and (iii) the methods used to estimate regression coefficients. We also show how functional data analyses can be integrated into Bayesian models. Finally, we include an in-depth discussion of the challenges and opportunities presented by functional regression analysis.

RESULTS

We used seven model-methods: one with the conventional model (M1), three methods using the B-splines model (M2, M4, and M6) and three methods using the Fourier basis model (M3, M5, and M7). The data set we used comprises 976 wheat lines under irrigated environments with 250 wavelengths. Under a Bayesian Ridge Regression (BRR), we compared the prediction accuracy of the proposed model-methods under different numbers of basis functions, and compared the implementation time (in seconds) of the seven proposed model-methods for different numbers of basis functions. Our results as well as previously analyzed data (Montesinos-López et al. 2017a, 2017b) support that around 23 basis functions are enough. Concerning the degree of the polynomial in the context of B-splines, degree 3 approximates most of the curves very well. Two satisfactory types of basis are the Fourier basis for periodic curves and the B-splines model for non-periodic curves. Under nine different numbers of basis functions, the seven model-methods showed similar prediction accuracy. Regarding implementation time, results show that the fewer the basis functions, the shorter the implementation time required. Methods M2, M3, M6 and M7 were around 3.4 times faster than methods M1, M4 and M5.

CONCLUSIONS

In this study, we promote the use of functional regression modeling for analyzing high-throughput phenotypic data and indicate the advantages and disadvantages of its implementation. In addition, many key elements that are needed to understand and implement this statistical technique appropriately are provided using a real data set. We provide details for implementing Bayesian functional regression using the developed genomic functional regression (GFR) package. In summary, we believe this paper is a good guide for breeders and scientists interested in using functional regression models for implementing prediction models when their data are curves.
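
The dimension reduction behind functional regression can be sketched for the Fourier case. This is a minimal illustration and not the GFR package: the function names are hypothetical, the projection is an unscaled sum over the measured wavelengths rather than a proper numerical integral, and the period is an input the analyst must choose.

```python
import math

def fourier_basis(points, n_basis, period):
    """Evaluate a Fourier basis (1, sin, cos, sin, cos, ...) at each point.

    Returns one row per point with n_basis columns.
    """
    basis = []
    for t in points:
        row = [1.0]
        k = 1
        while len(row) < n_basis:
            angle = 2.0 * math.pi * k * t / period
            row.append(math.sin(angle))
            if len(row) < n_basis:
                row.append(math.cos(angle))
            k += 1
        basis.append(row)
    return basis

def project(curve, basis):
    """Reduce one reflectance curve to n_basis covariates (unscaled sum).

    Each covariate approximates the integral of curve(t) * basis_b(t),
    up to a constant grid-spacing factor.
    """
    n_basis = len(basis[0])
    return [sum(x * row[b] for x, row in zip(curve, basis)) for b in range(n_basis)]

# Hypothetical example: 4 measurement points reduced to 3 covariates.
phi = fourier_basis([0.0, 0.25, 0.5, 0.75], 3, 1.0)
covariates = project([1.0, 1.0, 1.0, 1.0], phi)
```

In a setting like the one in the abstract, each line's 250 reflectance values would be replaced by a much shorter vector of such covariates (around 23 according to the results), which then enter the regression as predictors.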

Toledo-Alvarado H, Vazquez AI, de los Campos G, Tempelman RJ, Bittante G, Cecchinato A. Diagnosing pregnancy status using infrared spectra and milk composition in dairy cows. J Dairy Sci 2018;101:2496-2505. [DOI: 10.3168/jds.2017-13647] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Received: 08/08/2017] [Accepted: 11/08/2017] [Indexed: 01/01/2023]

Montesinos-López OA, Montesinos-López A, Crossa J, de los Campos G, Alvarado G, Suchismita M, Rutkoski J, González-Pérez L, Burgueño J. Predicting grain yield using canopy hyperspectral reflectance in wheat breeding data. Plant Methods 2017;13:4. [PMID: 28053649 PMCID: PMC5209864 DOI: 10.1186/s13007-016-0154-2] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Received: 07/22/2016] [Accepted: 12/01/2016] [Indexed: 05/21/2023]

Abstract

BACKGROUND

Modern agriculture uses hyperspectral cameras to obtain hundreds of reflectance data measured at discrete narrow bands to cover the whole visible light spectrum and part of the infrared and ultraviolet light spectra, depending on the camera. This information is used to construct vegetation indices (VI) (e.g., green normalized difference vegetation index or GNDVI, simple ratio or SRa, etc.) which are used for the prediction of primary traits (e.g., biomass). However, these indices only use some bands and are cultivar-specific; therefore they lose considerable information and are not robust for all cultivars.

RESULTS

This study proposes models that use all available bands as predictors to increase prediction accuracy; we compared these approaches with eight conventional vegetation indexes (VIs) constructed using only some bands. The data set we used comes from CIMMYT's global wheat program and comprises 1170 genotypes evaluated for grain yield (ton/ha) in five environments (Drought, Irrigated, EarlyHeat, Melgas and Reduced Irrigated); the reflectance data were measured in 250 discrete narrow bands ranging between 392 and 851 nm. The proposed models for the simultaneous analysis of all the bands were ordinary least squares (OLS), Bayes B, principal components with Bayes B, functional B-spline, functional Fourier and functional partial least squares. The results of these models were compared with the OLS performed using as predictors each of the eight VIs individually and combined.

CONCLUSIONS

We found that using all bands simultaneously increased prediction accuracy more than using VI alone. The Splines and Fourier models had the best prediction accuracy for each of the nine time-points under study. Combining image data collected at different time-points led to a small increase in prediction accuracy relative to models that use data from a single time-point. Also, using bands with heritabilities larger than 0.5 only in Drought as predictor variables showed improvements in prediction accuracy.

Reynolds RJ, de los Campos G, Egan SP, Ott JR. Modelling heterogeneity among fitness functions using random regression. Methods Ecol Evol 2016;7:70-79. [PMID: 26949509 PMCID: PMC4776641 DOI: 10.1111/2041-210x.12440] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Indexed: 11/28/2022]

Lopez-Cruz M, Crossa J, Bonnett D, Dreisigacker S, Poland J, Jannink JL, Singh RP, Autrique E, de los Campos G. Increased prediction accuracy in wheat breeding trials using a marker × environment interaction genomic selection model. G3 (Bethesda) 2015. [PMID: 25660166 DOI: 10.1534/g3.114.01609] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 04/25/2023]

Abstract

Genomic selection (GS) models use genome-wide genetic information to predict genetic values of candidates of selection. Originally, these models were developed without considering genotype × environment interaction (G×E). Several authors have proposed extensions of the single-environment GS model that accommodate G×E using either covariance functions or environmental covariates. In this study, we model G×E using a marker × environment interaction (M×E) GS model; the approach is conceptually simple and can be implemented with existing GS software. We discuss how the model can be implemented by using an explicit regression of phenotypes on markers or using covariance structures (a genomic best linear unbiased prediction-type model). We used the M×E model to analyze three CIMMYT wheat data sets (W1, W2, and W3), where more than 1000 lines were genotyped using genotyping-by-sequencing and evaluated at CIMMYT's research station in Ciudad Obregon, Mexico, under simulated environmental conditions that covered different irrigation levels, sowing dates and planting systems. We compared the M×E model with a stratified (i.e., within-environment) analysis and with a standard (across-environment) GS model that assumes that effects are constant across environments (i.e., ignoring G×E). The prediction accuracy of the M×E model was substantially greater than that of an across-environment analysis that ignores G×E. Depending on the prediction problem, the M×E model had either similar or greater levels of prediction accuracy than the stratified analyses. The M×E model decomposes marker effects and genomic values into components that are stable across environments (main effects) and others that are environment-specific (interactions). Therefore, in principle, the interaction model could shed light on which variants have effects that are stable across environments and which ones are responsible for G×E. The data set and the scripts required to reproduce the analysis are publicly available as Supporting Information.
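
The decomposition into main and environment-specific marker effects can be pictured as a block-structured design matrix. The sketch below is an assumed construction for the explicit regression on markers mentioned in the abstract, not the authors' published scripts; the function name `mxe_design` and the toy inputs are hypothetical.

```python
def mxe_design(markers, env_index, n_env):
    """Build an augmented design matrix for a marker x environment model.

    markers:   one marker vector per record
    env_index: environment (0-based) in which each record was observed
    n_env:     number of environments

    Each output row is [main-effect block | env 0 block | ... | env n-1 block].
    The main block always holds the markers; only the block of the record's
    own environment repeats them, and the other blocks are zero.
    """
    rows = []
    for x, j in zip(markers, env_index):
        p = len(x)
        row = list(x)
        for e in range(n_env):
            row.extend(x if e == j else [0] * p)
        rows.append(row)
    return rows

# Hypothetical example: 2 records, 2 markers, 2 environments.
design = mxe_design([[1, 0], [2, 1]], [0, 1], 2)
```

Coefficients on the first block then play the role of effects that are stable across environments, and coefficients on the remaining blocks play the role of environment-specific deviations.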

Berger S, Pérez-Rodríguez P, Veturi Y, Simianer H, de los Campos G. Effectiveness of shrinkage and variable selection methods for the prediction of complex human traits using data from distantly related individuals. Ann Hum Genet 2015;79:122-35. [PMID: 25600682 PMCID: PMC4428155 DOI: 10.1111/ahg.12099] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Received: 06/30/2014] [Accepted: 12/03/2014] [Indexed: 02/02/2023]

Mehta T, Fontaine KR, Keith SW, Bangalore SS, de los Campos G, Bartolucci A, Pajewski NM, Allison DB. Obesity and mortality: are the risks declining? Evidence from multiple prospective studies in the United States. Obes Rev 2014;15:619-29. [PMID: 24913899 PMCID: PMC4121970 DOI: 10.1111/obr.12191] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Received: 12/19/2013] [Revised: 03/25/2014] [Accepted: 04/22/2014] [Indexed: 12/18/2022]

Jarquín D, Crossa J, Lacaze X, Du Cheyron P, Daucourt J, Lorgeou J, Piraux F, Guerreiro L, Pérez P, Calus M, Burgueño J, de los Campos G. A reaction norm model for genomic selection using high-dimensional genomic and environmental data. Theor Appl Genet 2014;127:595-607. [PMID: 24337101 PMCID: PMC3931944 DOI: 10.1007/s00122-013-2243-1] [Citation(s) in RCA: 252] [Impact Index Per Article: 25.2] [Received: 04/11/2013] [Accepted: 11/20/2013] [Indexed: 05/18/2023]

Abstract

New methods that incorporate the main and interaction effects of high-dimensional markers and of high-dimensional environmental covariates gave increased prediction accuracy of grain yield in wheat across and within environments. In most agricultural crops the effects of genes on traits are modulated by environmental conditions, leading to genetic by environmental interaction (G × E). Modern genotyping technologies allow characterizing genomes in great detail and modern information systems can generate large volumes of environmental data. In principle, G × E can be accounted for using interactions between markers and environmental covariates (ECs). However, when genotypic and environmental information is high dimensional, modeling all possible interactions explicitly becomes infeasible. In this article we show how to model interactions between high-dimensional sets of markers and ECs using covariance functions. The model presented here consists of a (random) reaction norm where the genetic and environmental gradients are described as linear functions of markers and of ECs, respectively. We assessed the proposed method using data from Arvalis, consisting of 139 wheat lines genotyped with 2,395 SNPs and evaluated for grain yield over 8 years and various locations within northern France. A total of 68 ECs, defined based on five phases of the phenology of the crop, were used in the analysis. Interaction terms accounted for a sizable proportion (16%) of the within-environment yield variance, and the prediction accuracy of models including interaction terms was substantially higher (17-34%) than that of models based on main effects only. Breeding for target environmental conditions has become a central priority of most breeding programs. Methods like the one presented here, which can capitalize upon the wealth of genomic and environmental information available, will become increasingly important.
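
One way to model marker-by-EC interactions through covariance functions, without enumerating every interaction term, is to take the element-wise product of a marker-derived and an EC-derived covariance matrix. The sketch below assumes that construction with simple linear kernels and record-level inputs; the abstract itself does not spell out the formula, and the function names and toy data are hypothetical.

```python
def linear_kernel(rows):
    """Linear covariance between records: inner products divided by the
    number of columns (markers or environmental covariates)."""
    p = len(rows[0])
    return [[sum(a * b for a, b in zip(ri, rj)) / p for rj in rows] for ri in rows]

def hadamard(a, b):
    """Element-wise product of two equally sized square matrices."""
    return [[x * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

# Hypothetical toy data: 2 records, each with centered markers and ECs.
record_markers = [[1, -1], [1, 1]]
record_ecs = [[2, 0], [0, 2]]

genomic_cov = linear_kernel(record_markers)       # marker-based similarity
environment_cov = linear_kernel(record_ecs)       # EC-based similarity
interaction_cov = hadamard(genomic_cov, environment_cov)
```

Two records end up with a high interaction covariance only when they are similar both genetically and environmentally, which is the intuition behind a reaction norm with linear gradients in markers and ECs.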

Klimentidis YC, Vazquez AI, de los Campos G, Allison DB, Dransfield MT, Thannickal VJ. Heritability of pulmonary function estimated from pedigree and whole-genome markers. Front Genet 2013;4:174. [PMID: 24058366 PMCID: PMC3766834 DOI: 10.3389/fgene.2013.00174] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Received: 05/14/2013] [Accepted: 08/22/2013] [Indexed: 11/13/2022]

Robertson HT, de los Campos G, Allison DB. Turning the analysis of obesity-mortality associations upside down: modeling years of life lost through conditional distributions. Obesity (Silver Spring) 2013;21:398-404. [PMID: 23404823 PMCID: PMC3610864 DOI: 10.1002/oby.20019] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Received: 10/06/2011] [Accepted: 06/24/2012] [Indexed: 11/10/2022]

Vazquez AI, de los Campos G, Klimentidis YC, Rosa GJM, Gianola D, Yi N, Allison DB. A comprehensive genetic approach for improving prediction of skin cancer risk in humans. Genetics 2012;192:1493-502. [PMID: 23051645 PMCID: PMC3512154 DOI: 10.1534/genetics.112.141705] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Received: 05/03/2012] [Accepted: 09/07/2012] [Indexed: 01/09/2023]

Janss L, de los Campos G, Sheehan N, Sorensen D. Inferences from genomic models in stratified populations. Genetics 2012;192:693-704. [PMID: 22813891 PMCID: PMC3454890 DOI: 10.1534/genetics.112.141143] [Citation(s) in RCA: 63] [Impact Index Per Article: 5.3] [Received: 04/10/2012] [Accepted: 06/26/2012] [Indexed: 12/21/2022]

de los Campos G, Klimentidis YC, Vazquez AI, Allison DB. Prediction of expected years of life using whole-genome markers. PLoS One 2012;7:e40964. [PMID: 22848416 PMCID: PMC3405107 DOI: 10.1371/journal.pone.0040964] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Received: 09/08/2011] [Accepted: 06/15/2012] [Indexed: 01/27/2023]

Abstract

Genetic factors are believed to account for 25% of the interindividual differences in Years of Life (YL) among humans. However, the genetic loci that have thus far been found to be associated with YL explain a very small proportion of the expected genetic variation in this trait, perhaps reflecting the complexity of the trait and the limitations of traditional association studies when applied to traits affected by a large number of small-effect genes. Using data from the Framingham Heart Study and statistical methods borrowed largely from the field of animal genetics (whole-genome prediction, WGP), we developed a WGP model for the study of YL and evaluated the extent to which thousands of genetic variants across the genome examined simultaneously can be used to predict interindividual differences in YL. We find that a sizable proportion of differences in YL, which were unexplained by age at entry, sex, smoking and body mass index (BMI), can be accounted for and predicted using WGP methods. The contribution of genomic information to prediction accuracy was even higher than that of smoking and BMI combined, two predictors that are considered among the most important life-shortening factors. We evaluated the impacts of familial relationships and population structure (as described by the first two marker-derived principal components) and concluded that in our dataset population structure explained partially, but not fully, the gains in prediction accuracy obtained with WGP. Further inspection of prediction accuracies by age at death indicated that most of the gains in predictive ability achieved with WGP were due to the increased accuracy of prediction of early mortality, perhaps reflecting the ability of WGP to capture differences in genetic risk for deadly diseases such as cancer, which are most often responsible for early mortality in our sample.

Makowsky R, Pajewski NM, Klimentidis YC, Vazquez AI, Duarte CW, Allison DB, de los Campos G. Beyond missing heritability: prediction of complex traits. PLoS Genet 2011;7:e1002051. [PMID: 21552331 PMCID: PMC3084207 DOI: 10.1371/journal.pgen.1002051] [Citation(s) in RCA: 210] [Impact Index Per Article: 16.2] [Received: 10/21/2010] [Accepted: 03/02/2011] [Indexed: 01/25/2023] Open

Rosa GJM, Valente BD, de los Campos G, Wu XL, Gianola D, Silva MA. Inferring causal phenotype networks using structural equation models. Genet Sel Evol 2011;43:6. [PMID: 21310061 PMCID: PMC3056759 DOI: 10.1186/1297-9686-43-6] [Citation(s) in RCA: 80] [Impact Index Per Article: 6.2] [Received: 10/18/2010] [Accepted: 02/10/2011] [Indexed: 01/14/2023] Open

de los Campos G, Gianola D, Allison DB. Predicting genetic predisposition in humans: the promise of whole-genome markers. Nat Rev Genet 2010;11:880-6. [DOI: 10.1038/nrg2898] [Citation(s) in RCA: 211] [Impact Index Per Article: 15.1] [Indexed: 12/15/2022]

de Maturana EL, de los Campos G, Wu XL, Gianola D, Weigel KA, Rosa GJM. Modeling relationships between calving traits: a comparison between standard and recursive mixed models. Genet Sel Evol 2010;42:1. [PMID: 20100345 PMCID: PMC2830933 DOI: 10.1186/1297-9686-42-1] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.6] [Received: 05/20/2009] [Accepted: 01/25/2010] [Indexed: 11/26/2022] Open

Abstract

Background

The use of structural equation models for the analysis of recursive and simultaneous relationships between phenotypes has become more popular recently. The aim of this paper is to illustrate how these models can be applied in animal breeding to achieve parameterizations of different levels of complexity and, more specifically, to model phenotypic recursion between three calving traits: gestation length (GL), calving difficulty (CD) and stillbirth (SB). All recursive models considered here postulate heterogeneous recursive relationships between GL and liabilities to CD and SB, and between liability to CD and liability to SB, depending on categories of GL phenotype.

Methods

Four models were compared in terms of goodness of fit and predictive ability: 1) standard mixed model (SMM), a model with unstructured (co)variance matrices; 2) recursive mixed model 1 (RMM1), assuming that residual correlations are due to the recursive relationships between phenotypes; 3) RMM2, assuming that correlations between residuals and contemporary groups are due to recursive relationships between phenotypes; and 4) RMM3, postulating that the correlations between genetic effects, contemporary groups and residuals are due to recursive relationships between phenotypes.

Results

For all the RMM considered, the estimates of the structural coefficients were similar. Results revealed a nonlinear relationship between GL and the liabilities both to CD and to SB, and a linear relationship between the liabilities to CD and SB.

Differences in terms of goodness of fit and predictive ability of the models considered were negligible, suggesting that RMM3 is plausible.

Conclusions

The applications examined in this study suggest the plausibility of a nonlinear recursive effect from GL onto CD and SB. Also, the fact that the most restrictive model RMM3, which assumes that the only cause of correlation is phenotypic recursion, performs as well as the others indicates that the phenotypic recursion may be an important cause of the observed patterns of genetic and environmental correlations.

de los Campos G, Gianola D. Factor analysis models for structuring covariance matrices of additive genetic effects: a Bayesian implementation. Genet Sel Evol 2007. [DOI: 10.1051/gse:20070016] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Indexed: 11/14/2022] Open