1
|
MixEHR-SurG: A joint proportional hazard and guided topic model for inferring mortality-associated topics from electronic health records. J Biomed Inform 2024; 153:104638. [PMID: 38631461 DOI: 10.1016/j.jbi.2024.104638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 03/07/2024] [Accepted: 04/03/2024] [Indexed: 04/19/2024]
Abstract
Survival models can help medical practitioners to evaluate the prognostic importance of clinical variables to patient outcomes such as mortality or hospital readmission and subsequently design personalized treatment regimes. Electronic Health Records (EHRs) hold the promise for large-scale survival analysis based on systematically recorded clinical features for each patient. However, existing survival models either do not scale to high dimensional and multi-modal EHR data or are difficult to interpret. In this study, we present a supervised topic model called MixEHR-SurG to simultaneously integrate heterogeneous EHR data and model survival hazard. Our contributions are three-folds: (1) integrating EHR topic inference with Cox proportional hazards likelihood; (2) integrating patient-specific topic hyperparameters using the PheCode concepts such that each topic can be identified with exactly one PheCode-associated phenotype; (3) multi-modal survival topic inference. This leads to a highly interpretable survival topic model that can infer PheCode-specific phenotype topics associated with patient mortality. We evaluated MixEHR-SurG using a simulated dataset and two real-world EHR datasets: the Quebec Congenital Heart Disease (CHD) data consisting of 8211 subjects with 75,187 outpatient claim records of 1767 unique ICD codes; the MIMIC-III consisting of 1458 subjects with multi-modal EHR records. Compared to the baselines, MixEHR-SurG achieved a superior dynamic AUROC for mortality prediction, with a mean AUROC score of 0.89 in the simulation dataset and a mean AUROC of 0.645 on the CHD dataset. Qualitatively, MixEHR-SurG associates severe cardiac conditions with high mortality risk among the CHD patients after the first heart failure hospitalization and critical brain injuries with increased mortality among the MIMIC-III patients after their ICU discharge. Together, the integration of the Cox proportional hazards model and EHR topic inference in MixEHR-SurG not only leads to competitive mortality prediction but also meaningful phenotype topics for in-depth survival analysis. The software is available at GitHub: https://github.com/li-lab-mcgill/MixEHR-SurG.
Collapse
|
2
|
Metabolic Connectome and Its Role in the Prediction, Diagnosis, and Treatment of Complex Diseases. Metabolites 2024; 14:93. [PMID: 38392985 PMCID: PMC10890086 DOI: 10.3390/metabo14020093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 01/17/2024] [Accepted: 01/25/2024] [Indexed: 02/25/2024] Open
Abstract
The interconnectivity of advanced biological systems is essential for their proper functioning. In modern connectomics, biological entities such as proteins, genes, RNA, DNA, and metabolites are often represented as nodes, while the physical, biochemical, or functional interactions between them are represented as edges. Among these entities, metabolites are particularly significant as they exhibit a closer relationship to an organism's phenotype compared to genes or proteins. Moreover, the metabolome has the ability to amplify small proteomic and transcriptomic changes, even those from minor genomic changes. Metabolic networks, which consist of complex systems comprising hundreds of metabolites and their interactions, play a critical role in biological research by mediating energy conversion and chemical reactions within cells. This review provides an introduction to common metabolic network models and their construction methods. It also explores the diverse applications of metabolic networks in elucidating disease mechanisms, predicting and diagnosing diseases, and facilitating drug development. Additionally, it discusses potential future directions for research in metabolic networks. Ultimately, this review serves as a valuable reference for researchers interested in metabolic network modeling, analysis, and their applications.
Collapse
|
3
|
High-intensity ultrasound improves color and oxidative stability of beef from grain-fed and pasture-fed Nellore cattle. Meat Sci 2023; 206:109324. [PMID: 37683507 DOI: 10.1016/j.meatsci.2023.109324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 08/17/2023] [Accepted: 08/30/2023] [Indexed: 09/10/2023]
Abstract
This research aimed to evaluate the influence of high-intensity ultrasound (HIU) levels (control: 0; high: 747.79; ultra-high: 1344.17 Wcm-2) on pH, instrumental color (redness, R630/580, hue angle and chroma) and oxidative stability (lipid and protein oxidation) of Psoas major (PM) muscle from Nellore cattle raised in two feeding systems: grain and pasture. Using a structural equation modeling (SEM) approach, the relations (P > 0.05) between exogenous (HIU levels) and endogenous (pH, color, lipid and protein oxidation) variables were observed. In beef from grain-fed animals the pH was directly and negatively related to lipid oxidation (γ = -0.321), hue angle (γ = -0.847) and chroma (γ = -0.442) and protein oxidation (γ = -0.752). In PM from pasture-fed HIU exhibited a negative relation with lipid (γ = -0.144) and protein (γ = -0.743) oxidation, suggesting a possible positive influence on the oxidative stability of meat and a positive relation with redness (γ = 0.197) and R630/580 (γ = 0.379). The HIU positively influenced the color and oxidative stability of beef from Bos indicus cattle, and a synergistic effect of HIU and feeding system on beef from pasture-fed animals.
Collapse
|
4
|
Multiple-trait and structural equation modelling approaches to infer genetic relationships between tail length and weight traits in Merinoland sheep. J Anim Breed Genet 2023; 140:132-143. [PMID: 36583443 DOI: 10.1111/jbg.12752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 12/08/2022] [Indexed: 12/31/2022]
Abstract
Tail docking is routinely conducted in long-tailed sheep breeds to prevent flystrike infections, but it is not in agreement with legal guidelines and animal welfare issues. Selection on short tails is a sustainable alternative in this regard, but side effects on other breeding goal traits are unclear. In consequence, the present study aimed to estimate genetic parameters for tail length (TL) at birth, birth weight (BW), weaning weight (WW) and postweaning weight (PWW) at the slaughtering date considering single-trait (STM), multiple-trait (MTM) and structural equation models (SEM) with different random effects, and accordingly, different covariance structures. The SEM considered time-lagged recursive relationships among response variables in three different pathways. The first path pertained to the effect of TL on WW and of WW on PWW. The second path reflected the effect of BW on WW and of WW on PWW. The third path was the recursive effect of TL on PWW. The phenotypic data consisted of 2803 records for TL, 13,042 records for BW, 1556 records for WW and 3986 records for PWW from Merinoland lambs. Lambs were born in the period from 1995 to 2021 and kept at the university Gießen research station, Germany, with their naturally long tails. Genetic statistical model evaluation based on Bayesian and Akaike's information criteria suggested models simultaneously considering direct genetic, maternal genetic and maternal permanent environmental effects and respective covariances. For statistical models including the same random effects and covariance structures, SEM were superior over MTM. The direct heritability for TL from the best-fitting STM was 0.60 ± 0.08, indicating the potential for genetic reduction of tail length within a few generations. For growth traits, the direct heritabilities ranged from 0.16 ± 0.03 for BW to 0.31 ± 0.09 for PWW. The maternal heritabilities were 0.03 ± 0.03 for TL, 0.12 ± 0.02 for BW, 0.04 ± 0.03 for WW and 0.07 ± 0.03 for PWW, reflecting small, but the non-significant influence of uterine characteristics on the tail development. The direct genetic correlations between TL and all weight traits were positive and very similar to MTM and SEM but reflected antagonistic genetic relationships from a breeding perspective. Oppositely, the structural equation coefficients reflecting trait associations phenotypically were negative (favourable) for the time-lagged effects of TL on WW and on PWW. As an explanation, lambs with long and woolly tails have an increased risk for contamination with dirt and dust causing infections, which in turn impairs the body weight development. In conclusion, breeding on short tails should consider trait-associated environmental risk factors, for example, disease susceptibility, which can be mimicked via SEM approaches.
Collapse
|
5
|
Genetical analysis of mastitis and reproductive traits in first-parity Holstein cows using standard and structural equation modelling. Animal 2023; 17:100777. [PMID: 37043934 DOI: 10.1016/j.animal.2023.100777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Revised: 03/03/2023] [Accepted: 03/07/2023] [Indexed: 03/17/2023] Open
Abstract
The present study aimed to investigate the causal relationships between clinical mastitis and some reproductive traits, including success at first insemination (SFI), the number of inseminations to pregnancy (INS), the interval from calving to first service (CTFS), first and last service interval (IFL), and open days (OD) in first-parity Holstein cows. For this purpose, the records of 58 281 first parity Holstein cows were analysed. These data sets were collected from 17 large dairy herds from 2008 to 2017. Recursive Mixed Models (RMMs) were applied and compared with the estimations under Standard Mixed Models. Then, one trivariate and three bivariate Gaussian-threshold models were used for the analyses. Recursive models were applied, considering that clinical mastitis can influence fertility traits. Mastitis is considered a covariate for the reproductive traits to determine their causal relationship. The results of this study indicated that causal effects of mastitis on SFI (on the observed scale, %), CTFS, IFL, OD, and INS were -5.7%, 3.3 days, 12.27 days, seven days, and 0.26 services, respectively. The estimated structural coefficients of the recursive models in the first parity imply that mastitis significantly lengthened the fertility interval and decreased the conception rate. In addition, genetic, residual, and phenotypic correlations between mastitis and the reproductive traits under both models were statistically significant. Results of genetic correlations between mastitis and fertility traits suggest that more incidence of mastitis during lactation is related to the delays in the heat show and pregnancy rate after insemination. In summary, considering the causal effects under RMMs may be advantageous to comprehend complicated relationships between complex traits better.
Collapse
|
6
|
Inferring causal structures of gut microbiota diversity and feed efficiency traits in poultry using Bayesian learning and genomic structural equation models. J Anim Sci 2023; 101:skad044. [PMID: 36734360 PMCID: PMC10032182 DOI: 10.1093/jas/skad044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 02/02/2023] [Indexed: 02/04/2023] Open
Abstract
Feed and phosphorus (P) efficiency are of increasing importance in poultry breeding. It has been shown recently that these efficiency traits are influenced by the gut microbiota composition of the birds. The efficiency traits and the gut microbiota composition are partly under control of the host genome. Thus, the gut microbiota composition can be seen as a mediator trait between the host genome and the efficiency traits. The present study used data from 749 individuals of a Japanese quail F2 cross. The birds were genotyped for 4k single-nucleotide polymorphism (SNP) and trait recorded for P utilization (PU) and P retention (PR), body weight gain (BWG), and feed per gain ratio (F:G). The gut microbiota composition was characterized by targeted amplicon sequencing. The alpha diversity was calculated as the Pielou's evenness index (J'). A stable Bayesian network was established using a Hill-Climbing learning algorithm. Pielou's evenness index was placed as the most upstream trait and BWG as the most downstream trait, with direct and indirect links via PR, PU, and F:G. The direct and indirect effects between J', PU, and PR were quantified with structural equation models (SEM), which revealed a causal link from J' to PU and from PU to PR. Quantitative trait loci (QTL) linkage mapping revealed three genome-wide significant QTL regions for these traits with in total 49 trait-associated SNP within the QTL regions. SEM association mapping separated the total SNP effect for a trait into a direct effect and indirect effects mediated by upstream traits. Although the indirect effects were in general small, they contributed to the total SNP effect in some cases. This enabled us to detect some shared genetic effects. The method applied allows for the detection of shared genetic architecture of quantitative traits and microbiota compositions.
Collapse
|
7
|
Investigating functional relationships among health and fertility traits in dairy cows. Livest Sci 2022. [DOI: 10.1016/j.livsci.2022.105122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
8
|
Genetic correlations and causal effects of fighting ability on fitness traits in cattle reveal antagonistic trade-offs. Front Ecol Evol 2022. [DOI: 10.3389/fevo.2022.972093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Complex genetic and phenotypic relationships are theorized to link different fitness components but revealing the correlations occurring among disparate traits requires large datasets of pedigreed populations. In particular, the association between traits beneficial to social dominance with health and fitness could be antagonistic, because of trade-offs, or positive, because of greater resource acquisition by dominant individuals. Studies investigating these relationships found some empirical evidence in support of both theories, mainly using multiple trait models (MTM). However, if a trait giving a social advantage is suspected to affect the expression of other traits, MTM could provide some bias, that structural equation models (SEM) could highlight. We used Aosta Chestnut-Black Pied cattle to investigate whether the fighting ability of cows (the capability of winning social dominance interactions) is genetically correlated with health and fitness traits. We ran both MTM and SEM using a Gibbs sampling algorithm to disentangle the possible causal effects of fighting ability from the genetic correlations that this trait shares with other traits: individual milk yield, somatic cells (representing mammary health), fertility, and longevity. We found antagonistic genetic correlations, similar under both approaches, for fighting ability vs. milk, somatic cells, and fertility, Accordingly, we found only a slight causal effects of fighting ability on these traits (–0.012 to 0.059 in standardized value). However, we found genetic correlations opposite in sign between fighting ability and longevity under MTM (0.237) and SEM (–0.183), suggesting a strong causal effect (0.386 standardized) of fighting ability on longevity. In other words, MTM found a positive correlation between longevity and fighting ability, while SEM found a negative correlation. The explanation could be that for economic reasons dominant cows are kept in this population for longer, thus attaining greater longevity: using MTM, the economic importance of competitions probably covers the true genetic correlation among traits. This artificially simulates a natural situation where an antagonistic genetic correlation between longevity and fighting ability appears positive under MTM due to a non-genetic advantage obtained by the best fighters. The use of SEM to properly assess the relationships among traits is suggested in both evolutionary studies and animal breeding.
Collapse
|
9
|
Application of structural equation models for genetic evaluation of lifetime reproductive traits and age at first lambing in Moghani sheep. Small Rumin Res 2022. [DOI: 10.1016/j.smallrumres.2022.106761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
10
|
Composition of the ileum microbiota is a mediator between the host genome and phosphorus utilization and other efficiency traits in Japanese quail (Coturnix japonica). Genet Sel Evol 2022; 54:20. [PMID: 35260076 PMCID: PMC8903610 DOI: 10.1186/s12711-022-00697-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Accepted: 01/13/2022] [Indexed: 11/24/2022] Open
Abstract
Background Phosphorus is an essential nutrient in all living organisms and, currently, it is the focus of much attention due to its global scarcity, the environmental impact of phosphorus from excreta, and its low digestibility due to its storage in the form of phytates in plants. In poultry, phosphorus utilization is influenced by composition of the ileum microbiota and host genetics. In our study, we analyzed the impact of host genetics on composition of the ileum microbiota and the relationship of the relative abundance of ileal bacterial genera with phosphorus utilization and related quantitative traits in Japanese quail. An F2 cross of 758 quails was genotyped with 4k genome-wide single nucleotide polymorphisms (SNPs) and composition of the ileum microbiota was characterized using target amplicon sequencing. Heritabilities of the relative abundance of bacterial genera were estimated and quantitative trait locus (QTL) linkage mapping for the host was conducted for the heritable genera. Phenotypic and genetic correlations and recursive relationships between bacterial genera and quantitative traits were estimated using structural equation models. A genomic best linear unbiased prediction (GBLUP) and microbial (M)BLUP hologenomic selection approach was applied to assess the feasibility of breeding for improved phosphorus utilization based on the host genome and the heritable part of composition of the ileum microbiota. Results Among the 59 bacterial genera examined, 24 showed a significant heritability (nominal p ≤ 0.05), ranging from 0.04 to 0.17. For these genera, six genome-wide significant QTL were mapped. Significant recursive effects were found, which support the indirect host genetic effects on the host’s quantitative traits via microbiota composition in the ileum of quail. Cross-validated microbial and genomic prediction accuracies confirmed the strong impact of microbial composition and host genetics on the host’s quantitative traits, as the GBLUP accuracies based on the heritable microbiota-mediated components of the traits were similar to the accuracies of conventional GBLUP based on genome-wide SNPs. Conclusions Our results revealed a significant effect of host genetics on composition of the ileal microbiota and confirmed that host genetics and composition of the ileum microbiota have an impact on the host’s quantitative traits. This offers the possibility to breed for improved phosphorus utilization based on the host genome and the heritable part of composition of the ileum microbiota. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-022-00697-8.
Collapse
|
11
|
Investigating potential causal relationships among carcass and meat quality traits using structural equation model in Nellore cattle. Meat Sci 2022; 187:108771. [DOI: 10.1016/j.meatsci.2022.108771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Revised: 02/03/2022] [Accepted: 02/10/2022] [Indexed: 11/23/2022]
|
12
|
Searching for causal relationships among latent variables concerning performance, carcass, and meat quality traits in broilers. J Anim Breed Genet 2021; 139:181-192. [PMID: 34750908 DOI: 10.1111/jbg.12653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2021] [Revised: 10/19/2021] [Accepted: 10/22/2021] [Indexed: 11/30/2022]
Abstract
In causal relationship studies, the latent variables may summarize the phenotypes in theoretical traits according to their phenotypic correlations, improving the understanding of causal relationships between broilers phenotypes. In this study, we aimed to investigate potential causal relationships among latent variables in broilers using a structural equation model in the context of genetic analysis. The data used in this study comprised 14 traits in broilers with 2,017 records each, and 104,154 animals in pedigree. Four latent variables (WEIGHT, LOSSES, COLOUR, and VISCERA) were defined and validated using Bayesian Confirmatory Factor Analysis. Subsequently, a search for causal linkage structures was performed, obtaining a single causal link structure between the latent variables. Then, this information was used to fit the structural equation model (SEM). The results from the SEM indicated positive causal effects of the variables WEIGHT and LOSSES on the variables VISCERA and COLOUR, respectively, with structural coefficient estimates of 1.006 and 0.040, respectively. On the other hand, an antagonist causal effect of the variable WEIGHT on the variable LOSSES was verified, with a structural coefficient estimate of -4.333. These results highlight the causal relationship between performance and meat quality traits, which may be associated with the natural processes involved in the conversion of muscle into meat and the structural changes in muscle tissues due to intense selection for high growth rates in broilers.
Collapse
|
13
|
Structural equation modeling for unraveling the multivariate genomic architecture of milk proteins in dairy cattle. J Dairy Sci 2021; 104:5705-5718. [PMID: 33663837 DOI: 10.3168/jds.2020-18321] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2020] [Accepted: 12/31/2020] [Indexed: 01/28/2023]
Abstract
The aims of this study were to investigate potential functional relationships among milk protein fractions in dairy cattle and to carry out a structural equation model (SEM) GWAS to provide a decomposition of total SNP effects into direct effects and effects mediated by traits that are upstream in a phenotypic network. To achieve these aims, we first fitted a mixed Bayesian multitrait genomic model to infer the genomic correlations among 6 milk nitrogen fractions [4 caseins (CN), namely κ-, β-, αS1-, and αS2-CN, and 2 whey proteins, namely β-lactoglobulin (β-LG) and α-lactalbumin (α-LA)], in a population of 989 Italian Brown Swiss cows. Animals were genotyped with the Illumina BovineSNP50 Bead Chip v.2 (Illumina Inc.). A Bayesian network approach using the max-min hill-climbing (MMHC) algorithm was implemented to model the dependencies or independence among traits. Strong and negative genomic correlations were found between β-CN and αS1-CN (-0.706) and between β-CN and κ-CN (-0.735). The application of the MMHC algorithm revealed that κ-CN and β-CN seemed to directly or indirectly influence all other milk protein fractions. By integrating multitrait model GWAS and SEM-GWAS, we identified a total of 127 significant SNP for κ-CN, 89 SNP for β-CN, 30 SNP for αS1-CN, and 14 SNP for αS2-CN (mostly shared among CN and located on Bos taurus autosome 6) and 15 SNP for β-LG (mostly located on Bos taurus autosome 11), whereas no SNP passed the significance threshold for α-LA. For the significant SNP, we assessed and quantified the contribution of direct and indirect paths to total marker effect. Pathway analyses confirmed that common regulatory mechanisms (e.g., energy metabolism and hormonal and neural signals) are involved in the control of milk protein synthesis and metabolism. The information acquired might be leveraged for setting up optimal management and selection strategies aimed at improving milk quality and technological characteristics in dairy cattle.
Collapse
|
14
|
Accuracy of breeding values for production traits in turkeys (Meleagris gallopavo) using recursive models with or without genomics. Genet Sel Evol 2021; 53:16. [PMID: 33593272 PMCID: PMC7885440 DOI: 10.1186/s12711-021-00611-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2020] [Accepted: 02/01/2021] [Indexed: 11/10/2022] Open
Abstract
Background Knowledge about potential functional relationships among traits of interest offers a unique opportunity to understand causal mechanisms and to optimize breeding goals, management practices, and prediction accuracy. In this study, we inferred the phenotypic causal networks among five traits in a turkey population and assessed the effect of the use of such causal structures on the accuracy of predictions of breeding values. Methods Phenotypic data on feed conversion ratio, residual feed intake, body weight, breast meat yield, and walking score in addition to genotype data from a commercial breeding population were used. Causal links between the traits were detected using the inductive causation algorithm based on the joint distribution of genetic effects obtained from a standard Bayesian multiple trait model. Then, a structural equation model was implemented to infer the magnitude of causal structure coefficients among the phenotypes. Accuracies of predictions of breeding values derived using pedigree- and blending-based multiple trait models were compared to those obtained with the pedigree- and blending-based structural equation models. Results In contrast to the two unconditioned traits (i.e., feed conversion ratio and breast meat yield) in the causal structures, the three conditioned traits (i.e., residual feed intake, body weight, and walking score) showed noticeable changes in estimates of genetic and residual variances between the structural equation model and the multiple trait model. The analysis revealed interesting functional associations and indirect genetic effects. For example, the structural coefficient for the path from body weight to walking score indicated that a 1-unit genetic improvement in body weight is expected to result in a 0.27-unit decline in walking score. Both structural equation models outperformed their counterpart multiple trait models for the conditioned traits. Applying the causal structures led to an increase in accuracy of estimated breeding values of approximately 7, 6, and 20% for residual feed intake, body weight, and walking score, respectively, and different rankings of selection candidates for the conditioned traits. Conclusions Our results suggest that structural equation models can improve genetic selection decisions and increase the prediction accuracy of breeding values of selection candidates. The identified causal relationships between the studied traits should be carefully considered in future turkey breeding programs.
Collapse
|
15
|
Genetic consistency between gait analysis by accelerometry and evaluation scores at breeding shows for the selection of jumping competition horses. PLoS One 2020; 15:e0244064. [PMID: 33326505 PMCID: PMC7743953 DOI: 10.1371/journal.pone.0244064] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Accepted: 12/02/2020] [Indexed: 01/16/2023] Open
Abstract
The aim was to assess the efficiency of gaits characteristics in improving jumping performance of sport horses and confront accelerometers and judge scores for this purpose. A sample of 1,477 young jumping horses were measured using accelerometers for walk, trot, and canter. Of these, 702 were genotyped with 541,175 SNPs after quality control. Dataset of 26,914 horses scored by judges in breeding shows for gaits and dataset of 142,682 horses that performed in jumping competitions were used. Analysis of accelerometric data defined three principal components from 64% to 89% of variability explained for each gait. Animal mixed models were used to estimate genetic parameters with the inclusion to up 308,105 ancestors for the relationship matrix. Fixed effects for the accelerometric variables included velocity, gender, age, and event. A GWAS was performed on residuals with the fixed effect of each SNP. The GWAS did not reveal other QTLs for gait traits than the one related to the height at withers. The accelerometric principal components were highly heritable for the one linked to stride frequency and dorsoventral displacement at trot (0.53) and canter (0.41) and moderately for the one linked to longitudinal activities (0.33 for trot, 0.19 for canter). Low heritabilities were found for the walk traits. The genetic correlations of the accelerometric principal components with the jumping competition were essentially nil, except for a negative correlation with longitudinal activity at canter (-0.19). The genetic correlation between the judges’ scores and the jumping competition reached 0.45 for canter (0.31 for trot and 0.17 for walk). But these correlations turned negative when the scores were corrected for the known parental breeding value for competition at the time of the judging. In conclusion, gait traits were not helpful to select for jumping performances. Different gaits may be suitable for a good jumping horse.
Collapse
|
16
|
Inferring phenotypic causal structure among farrowing and weaning traits in pigs. Anim Sci J 2020; 91:e13369. [PMID: 32323457 PMCID: PMC7217067 DOI: 10.1111/asj.13369] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Revised: 03/01/2020] [Accepted: 03/05/2020] [Indexed: 11/30/2022]
Abstract
Direct selection for litter size or weight at weaning in pigs is often hindered by external interventions such as cross-fostering. The objective of this study was to infer the causal structure among phenotypes of reproductive traits in pigs to enable subsequent direct selection for these traits. Examined traits included: number born alive (NBA), litter size on day 21 (LS21), and litter weight on day 21 (LW21). The study included 6,240 litters from 1,673 Landrace dams and 5,393 litters from 1,484 Large White dams. The inductive causation (IC) algorithm was used to infer the causal structure, which was then fitted to a structural equation model (SEM) to estimate causal coefficients and genetic parameters. Based on the IC algorithm and temporal and biological information, the causal structure among traits was identified as: NBA → LS21 → LW21 and NBA → LW21. Owing to the causal effect of NBA on LS21 and LW21, the genetic, permanent environmental, and residual variances of LS21 and LW21were much lower in the SEM than in the multiple-trait model for both breeds. Given the strong effect of NBA on LS21 and LW21, the SEM and causal information might assist with selective breeding for LS21 and LW21 when cross-fostering occurs.
Collapse
|
17
|
Inferring phenotypic causal networks for tick infestation, Babesia bovis infection, and weight gain in Hereford and Braford cattle using structural equation models. Livest Sci 2020. [DOI: 10.1016/j.livsci.2020.104032] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
18
|
Structural equation modelling analysis determining causal role among methyltransferases, methylation, and apoptosis during human pregnancy and abortion. Sci Rep 2020; 10:12408. [PMID: 32709893 PMCID: PMC7381664 DOI: 10.1038/s41598-020-68270-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2019] [Accepted: 06/01/2020] [Indexed: 01/06/2023] Open
Abstract
The human implantation failure during first trimester leads to spontaneous abortions. Spontaneous abortions are consecutive and occur twice or thrice (with or without prior live births) due to factors which are either maternal or fetal. However, it also constitutes of unknown etiology; known as unexplained recurrent spontaneous abortions (URSA). In this study, the medical terminated human normal early pregnancies (NEP) of the first trimester were taken as control samples, the normal decidual sample whose molecular and epigenetic changes were compared with that of decidua of human URSA subjects. Apoptosis-related genes reported in consecutive recurrent pregnancy loss became the basis for this study. So, in this study, we evaluated the hypothesis that "p53 methylation level through methyltransferases (G9aMT and DNMT1) implicates the fate of embryo towards sustenance or cessation of pregnancy". Further, the interaction between P53, BAX, BCL-2, CASPASE-6, G9aMT, DNMT-1, and methylated p53 expression level(s) during the first trimester of both URSA and NEP are included in this study. The degree of p53 methylation during the first trimester is found to be significant and positively correlated with that of G9aMT (p < 0.05), BCL-2 (p < 0.001), and DNMT1 (p < 0.001) at both transcript and protein level. A significant and negative correlation (with p-value < 0.001) between the degree of p53 methylation during the first trimester and that of the expression level of TUNEL assay (Apoptosis), P53, BAX, and CASPASE-6 are also observed in the present study. A positive correlation between apoptosis and a higher level of p53 expression (which is possibly due to low degree of p53 methylation) is observed both at the transcript and protein level in URSA which is in line with our findings. The analysis performed using structural equation modelling (SEM) further throws light on the causal relationship between sustenance of pregnancy or URSA during the first trimester of a human pregnancy and degree of methylation of p53 which is closely correlated with the interaction between G9aMT, DNMT1, BCL-2, BAX, P53, CASPASE-6, and apoptosis.
Collapse
|
19
|
Genetic parameters of resistance to pasteurellosis using novel response traits in rabbits. Genet Sel Evol 2020; 52:34. [PMID: 32590928 PMCID: PMC7320576 DOI: 10.1186/s12711-020-00552-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Accepted: 06/17/2020] [Indexed: 11/13/2022] Open
Abstract
Background Pasteurellosis (Pasteurella infection) is one of the most common bacterial infections in rabbits on commercial farms and in laboratory facilities. Curative treatments using antibiotics are only partly efficient, with frequent relapses. Breeding rabbits for improved genetic resistance to pasteurellosis is a sustainable alternative approach. In this study, we infected 964 crossbred rabbits from six sire lines experimentally with Pasteurella multocida. After post-mortem examination and bacteriological analyses, abscess, bacteria, and resistance scores were derived for each rabbit based on the extent of lesions and bacterial dissemination in the body. This is the first study to use such an experimental design and response traits to measure resistance to pasteurellosis in a rabbit population. We investigated the genetic variation of these traits in order to identify potential selection criteria. We also estimated genetic correlations of resistance to pasteurellosis in the experimental population with traits that are under selection in the breeding populations (number of kits born alive and weaning weight). Results Heritability estimates for the novel response traits, abscess, bacteria, and resistance scores, ranged from 0.08 (± 0.05) to 0.16 (± 0.06). The resistance score showed very strong negative genetic correlation estimates with abscess (− 0.99 ± 0.05) and bacteria scores (− 0.98 ± 0.07). A very high positive genetic correlation of 0.99 ± 0.16 was estimated between abscess and bacteria scores. Estimates of genetic correlations of the resistance score with average daily gain traits for the first and second week after inoculation were 0.98 (± 0.06) and 0.70 (± 0.14), respectively. Estimates of genetic correlations of the disease-related traits with average daily gain pre-inoculation were favorable but with high standard errors. Estimates of genetic and phenotypic correlations of the disease-related traits with commercial selection traits were not significantly different from zero. Conclusions Disease response traits are heritable and are highly correlated with each other, but do not show any significant genetic correlations with commercial selection traits. Thus, the prevalence of pasteurellosis could be decreased by selecting more resistant rabbits on any one of the disease response traits with a limited impact on the selection traits, which would allow implementation of a breeding program to improve resistance to pasteurellosis in rabbits.
Collapse
|
20
|
Inferring phenotypic causal structures among body weight traits via structural equation modeling in Kurdi sheep. ACTA SCIENTIARUM: ANIMAL SCIENCES 2020. [DOI: 10.4025/actascianimsci.v42i1.48823] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Data collected on 2550 Kurdi lambs originated from 1505 dams and 149 sires during 1991 to 2015 in Hossein Abad Kurdi Sheep Breeding Station, located in Shirvan city, North Khorasan province, North-eastern area of Iran, were used for inferring causal relationship among the body weights at birth (BW), at weaning (WW), at six-month age (6MW), at nine-month age (9MW) and yearling age (YW). The inductive causation (IC) algorithm was employed to search for causal structure among these traits. This algorithm was applied to the posterior distribution of the residual (co)variance matrix of a standard multivariate model (SMM). The causal structure detected by the IC algorithm coupling with biological prior knowledge provides a temporal recursive causal network among the studied traits. The studied traits were analyzed under three multivariate models including SMM, fully recursive multivariate model (FRM) and IC-based multivariate model (ICM) via a Bayesian approach by 100,000 iterations, thinning interval of 10 and the first 10,000 iterations as burn-in. The three considered multivariate models (SMM, FRM and ICM) were compared using deviance information criterion (DIC) and predictive ability measures including mean square of error (MSE) and Pearson's correlation coefficient between the observed and predicted values (r(y, )) of records. In general, structural equation based models (FRM and ICM) performed better than SMM in terms of lower DIC and MSE and also higher r(y, ). Among the tested models ICM had the lowest (36678.551) and SMM had the highest (36744.107)DIC values. In each case of the traits studied, the lowest MSE and the highest r(y, ) were obtained under ICM. The causal effects of BW on WW, WW on 6MW, 6MW on 9MW and 9MW on YW were statistically significant values of 1.478, 0.737, 0.776 and 0.929 kg, respectively (99% highest posterior density intervals did not include zero).
Collapse
|
21
|
Structural equation modeling for investigating multi-trait genetic architecture of udder health in dairy cattle. Sci Rep 2020; 10:7751. [PMID: 32385377 PMCID: PMC7210309 DOI: 10.1038/s41598-020-64575-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Accepted: 04/15/2020] [Indexed: 02/04/2023] Open
Abstract
Mastitis is one of the most prevalent and costly diseases in dairy cattle. It results in changes in milk composition and quality which are indicators of udder inflammation in absence of clinical signs. We applied structural equation modeling (SEM) - GWAS aiming to explore interrelated dependency relationships among phenotypes related to udder health, including milk yield (MY), somatic cell score (SCS), lactose (%, LACT), pH and non-casein N (NCN, % of total milk N), in a cohort of 1,158 Brown Swiss cows. The phenotypic network inferred via the Hill-Climbing algorithm was used to estimate SEM parameters. Integration of multi-trait models-GWAS and SEM-GWAS identified six significant SNPs for SCS, and quantified the contribution of MY and LACT acting as mediator traits to total SNP effects. Functional analyses revealed that overrepresented pathways were often shared among traits and were consistent with biological knowledge (e.g., membrane transport activity for pH and MY or Wnt signaling for SCS and NCN). In summary, SEM-GWAS offered new insights on the relationships among udder health phenotypes and on the path of SNP effects, providing useful information for genetic improvement and management strategies in dairy cattle.
Collapse
|
22
|
|
23
|
Reconstruction of Networks with Direct and Indirect Genetic Effects. Genetics 2020; 214:781-807. [PMID: 32015018 PMCID: PMC7153926 DOI: 10.1534/genetics.119.302949] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2019] [Accepted: 01/02/2020] [Indexed: 12/29/2022] Open
Abstract
Genetic variance of a phenotypic trait can originate from direct genetic effects, or from indirect effects, i.e., through genetic effects on other traits, affecting the trait of interest. This distinction is often of great importance, for example, when trying to improve crop yield and simultaneously control plant height. As suggested by Sewall Wright, assessing contributions of direct and indirect effects requires knowledge of (1) the presence or absence of direct genetic effects on each trait, and (2) the functional relationships between the traits. Because experimental validation of such relationships is often unfeasible, it is increasingly common to reconstruct them using causal inference methods. However, most current methods require all genetic variance to be explained by a small number of quantitative trait loci (QTL) with fixed effects. Only a few authors have considered the "missing heritability" case, where contributions of many undetectable QTL are modeled with random effects. Usually, these are treated as nuisance terms that need to be eliminated by taking residuals from a multi-trait mixed model (MTM). But fitting such an MTM is challenging, and it is impossible to infer the presence of direct genetic effects. Here, we propose an alternative strategy, where genetic effects are formally included in the graph. This has important advantages: (1) genetic effects can be directly incorporated in causal inference, implemented via our PCgen algorithm, which can analyze many more traits; and (2) we can test the existence of direct genetic effects, and improve the orientation of edges between traits. Finally, we show that reconstruction is much more accurate if individual plant or plot data are used, instead of genotypic means. We have implemented the PCgen-algorithm in the R-package pcgen.
Collapse
|
24
|
Application of Bayesian causal inference and structural equation model to animal breeding. Anim Sci J 2020; 91:e13359. [PMID: 32219948 PMCID: PMC7187322 DOI: 10.1111/asj.13359] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2020] [Accepted: 02/27/2020] [Indexed: 01/20/2023]
Abstract
Optimized breeding goals and management practices for the improvement of target traits requires knowledge regarding any potential functional relationships between them. Fitting a structural equation model (SEM) allows for inferences about the magnitude of causal effects between traits to be made. In recent years, an adaptation of SEM was proposed in the context of quantitative genetics and mixed models. Several studies have since applied the SEM in the context of animal breeding. However, fitting the SEM requires choosing a causal structure with prior biological or temporal knowledge. The inductive causation (IC) algorithm can be used to recover an underlying causal structure from observed associations between traits. The results of the papers, which are introduced in this review, showed that using the IC algorithm to infer a causal structure is a helpful tool for detecting a causal structure without proper prior knowledge or with uncertain relationships between traits. The reports also presented that fitting the SEM could infer the effects of interventions, which are not given by correlations. Hence, information from the SEM provides more insights into and suggestions on breeding strategy than that from a multiple-trait model, which is the conventional model used for multitrait analysis.
Collapse
|
25
|
Single-Step Methodology for Genomic Evaluation in Turkeys ( Meleagris gallopavo). Front Genet 2019; 10:1248. [PMID: 31921294 PMCID: PMC6934134 DOI: 10.3389/fgene.2019.01248] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2019] [Accepted: 11/13/2019] [Indexed: 11/13/2022] Open
Abstract
Genomic information can contribute significantly to the increase in accuracy of genetic predictions compared to using pedigree relationships alone. The main objective of this study was to compare the prediction ability of pedigree-based best linear unbiased prediction (PBLUP) and single-step genomic BLUP (ssGBLUP) models. Turkey records of feed conversion ratio, residual feed intake, body weight, breast meat yield, and walking ability were provided by Hybrid Turkeys, Kitchener, Canada. This data was analyzed using pedigree-based and single-step genomic models. The genomic relationship matrix was calculated either using observed allele frequencies, all allele frequencies equal to 0.5 or with a different scaling. To avoid potential problems with inversion, three different weighting factors were applied to combine the genomic and pedigree matrices. Across the studied traits, ssGBLUP had higher heritability estimates and significantly outperformed PBLUP in terms of accuracy. Walking ability was genetically negatively correlated to body weight and breast meat yield; however, it was not correlated to feed conversion ratio (FCR) or residual feed intake (RFI). Body weight showed a moderate positive genetic correlation to feed conversion ratio, residual feed intake and breast meat yield. Feed conversion ratio was strongly correlated to residual feed intake (0.68 ± 0.06). There was almost no genetic correlation between breast meat yield and feed efficiency traits. Larger differences in accuracy between PBLUP and ssGBLUP were observed for traits with lower heritability. Results of the three weighting factors showed only slight differences and an increase in accuracy of prediction compared to PBLUP. Slightly different levels of bias were observed across the models, but were higher among the traits; BMY was the only trait that had a regression coefficient higher than 1 (1.38 to 1.41). We show that incorporating genomic information increases the prediction accuracy for preselection of young candidate turkeys for the five traits investigated. Single-step genomic prediction showed substantially higher accuracy estimates than the pedigree-based model, and only slight differences in bias were observed across the alternate models.
Collapse
|
26
|
Estimation of indirect social genetic effects for skin lesion count in group-housed pigs by quantifying behavioral interactions1. J Anim Sci 2019; 97:3658-3668. [PMID: 31373628 DOI: 10.1093/jas/skz244] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 07/13/2019] [Indexed: 11/13/2022] Open
Abstract
Mixing of pigs into new social groups commonly induces aggressive interactions that result in skin lesions on the body of the animals. The relationship between skin lesions and aggressive behavioral interactions in group-housed pigs can be analyzed within the framework of social genetic effects (SGE). This study incorporates the quantification of aggressive interactions between pairs of animals in the modeling of SGE for skin lesions in different regions of the body in growing pigs. The dataset included 792 pigs housed in 59 pens. Skin lesions in the anterior, central, and caudal regions of the body were counted 24 h after pig mixing. Animals were video-recorded for 9 h postmixing and trained observers recorded the type and duration of aggressive interactions between pairs of animals. The number of seconds that pairs of pigs spent engaged in reciprocal fights and unilateral attack behaviors were used to parametrize the intensity of social interactions (ISI). Three types of models were fitted: direct genetic additive model (DGE), traditional social genetic effect model (TSGE) assuming uniform interactions between dyads, and an intensity-based social genetic effect model (ISGE) that used ISI to parameterize SGE. All models included fixed effects of sex, replicate, lesion scorer, weight at mixing, premixing lesion count, and the total time that the animal spent engaged in aggressive interactions (reciprocal fights and unilateral attack behaviors) as a covariate; a random effect of pen; and a random direct genetic effect. The ISGE models recovered more direct genetic variance than DGE and TSGE, and the estimated heritabilities (h^D2) were highest for all traits (P < 0.01) for the ISGE with ISI parametrized with unilateral attack behavior. The TSGE produced estimates that did not differ significantly from DGE (P > 0.5). Incorporating the ISI into ISGE, even in a small dataset, allowed separate estimation of the genetic parameters for direct and SGE, as well as the genetic correlation between direct and SGE (r^ds), which was positive for all lesion traits. The estimates from ISGE suggest that if behavioral observations are available, selection incorporating SGE may reduce the consequences of aggressive behaviors after mixing pigs.
Collapse
|
27
|
Amazonian trees show increased edge effects due to Atlantic Ocean warming and northward displacement of the Intertropical Convergence Zone since 1980. THE SCIENCE OF THE TOTAL ENVIRONMENT 2019; 693:133515. [PMID: 31377364 DOI: 10.1016/j.scitotenv.2019.07.321] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Revised: 07/05/2019] [Accepted: 07/19/2019] [Indexed: 06/10/2023]
Abstract
Recent investigations indicate a warming of Atlantic Ocean surface waters since 1980, probably influenced by anthropic actions, inducing rainfall intensification mainly during the rainy season and slight reductions during the dry season in the Amazon. Under these climate changes, trees in upland forests (terra firme) could benefit from the intensification of the hydrological cycle and could also be affected by the reduction of precipitation during the dry season. Results of dendrochronological analyses, spatial correlations and structural equation models, showed that Scleronema micranthum (Ducke) Ducke (Malvaceae) trees exposed in fragmented areas and to edge effects in Central Amazonian terra firme forest were more sensitive to the increase in the Atlantic Ocean surface temperature and consequent northward displacement of the Intertropical Convergence Zone, mainly during the dry season. Therefore, we proved that in altered and potentially more stressful environments such as edges of fragmented forests, recent anthropogenic climatic changes are exerting pressure on tree growth dynamics, inducing alterations in their performance and, consequently, in essential processes related to ecosystem services. Changes that could affect human well-being, highlighting the need for strategies that reduce edge areas expansion in Amazon forests and anthropic climate changes of the Anthropocene.
Collapse
|
28
|
Structural equation models to disentangle the biological relationship between microbiota and complex traits: Methane production in dairy cattle as a case of study. J Anim Breed Genet 2019; 137:36-48. [PMID: 31617268 DOI: 10.1111/jbg.12444] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2019] [Revised: 09/17/2019] [Accepted: 09/18/2019] [Indexed: 01/21/2023]
Abstract
The advent of metagenomics in animal breeding poses the challenge of statistically modelling the relationship between the microbiome, the host genetics and relevant complex traits. A set of structural equation models (SEMs) of a recursive type within a Markov chain Monte Carlo (MCMC) framework was proposed here to jointly analyse the host-metagenome-phenotype relationship. A non-recursive bivariate model was set as benchmark to compare the recursive model. The relative abundance of rumen microbes (RA), methane concentration (CH4 ) and the host genetics was used as a case of study. Data were from 337 Holstein cows from 12 herds in the north and north-west of Spain. Microbial composition from each cow was obtained from whole metagenome sequencing of ruminal content samples using a MinION device from Oxford Nanopore Technologies. Methane concentration was measured with Guardian® NG infrared gas monitor from Edinburgh Sensors during cow's visits to the milking automated system. A quarterly average from the methane eructation peaks for each cow was computed and used as phenotype for CH4 . Heritability of CH4 was estimated at 0.12 ± 0.01 in both the recursive and bivariate models. Likewise, heritability estimates for the relative abundance of the taxa overlapped between models and ranged between 0.08 and 0.48. Genetic correlations between the microbial composition and CH4 ranged from -0.76 to 0.65 in the non-recursive bivariate model and from -0.68 to 0.69 in the recursive model. Regardless of the statistical model used, positive genetic correlations with methane were estimated consistently for the seven genera pertaining to the Ciliophora phylum, as well as for those genera belonging to the Euryarchaeota (Methanobrevibacter sp.), Chytridiomycota (Neocallimastix sp.) and Fibrobacteres (Fibrobacter sp.) phyla. These results suggest that rumen's whole metagenome recursively regulates methane emissions in dairy cows and that both CH4 and the microbiota compositions are partially controlled by the host genotype.
Collapse
|
29
|
Structural equation modeling for hypertension and type 2 diabetes based on multiple SNPs and multiple phenotypes. PLoS One 2019; 14:e0217189. [PMID: 31513605 PMCID: PMC6742377 DOI: 10.1371/journal.pone.0217189] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2019] [Accepted: 08/28/2019] [Indexed: 01/04/2023] Open
Abstract
Genome-wide association studies (GWAS) have been successful in identifying genetic variants associated with complex diseases. However, association analyses between genotypes and phenotypes are not straightforward due to the complex relationships between genetic and environmental factors. Moreover, multiple correlated phenotypes further complicate such analyses. To resolve this complexity, we present an analysis using structural equation modeling (SEM). Unlike current methods that focus only on identifying direct associations between diseases and genetic variants such as single-nucleotide polymorphisms (SNPs), our method introduces the effects of intermediate phenotypes, which are related phenotypes distinct from the target, into the systematic genetic study of diseases. Moreover, we consider multiple diseases simultaneously in a single model. The procedure can be summarized in four steps: 1) selection of informative SNPs, 2) extraction of latent variables from the selected SNPs, 3) investigation of the relationships among intermediate phenotypes and diseases, and 4) construction of an SEM. As a result, a quantitative map can be drawn that simultaneously shows the relationship among multiple SNPs, phenotypes, and diseases. In this study, we considered two correlated diseases, hypertension and type 2 diabetes (T2D), which are known to have a substantial overlap in their disease mechanism and have significant public health implications. As intermediate phenotypes for these diseases, we considered three obesity-related phenotypes—subscapular skin fold thickness, body mass index, and waist circumference—as traits representing subcutaneous adiposity, overall adiposity, and abdominal adiposity, respectively. Using GWAS data collected from the Korea Association Resource (KARE) project, we applied the proposed SEM process. Among 327,872 SNPs, 24 informative SNPs were selected in the first step (p<1.0E-05). Ten latent variables were generated in step 2. After an exploratory analysis, we established a path diagram among phenotypes and diseases in step 3. Finally, in step 4, we produced a quantitative map with paths moving from specific SNPs to hypertension through intermediate phenotypes and T2D. The resulting model had high goodness-of-fit measures (χ2 = 536.52, NFI = 0.997, CFI = 0.998, GFI = 0.995, AGFI = 0.993, RMSEA = 0.012).
Collapse
|
30
|
Causal phenotypic networks for egg traits in an F 2 chicken population. Mol Genet Genomics 2019; 294:1455-1462. [PMID: 31240383 DOI: 10.1007/s00438-019-01588-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2019] [Accepted: 06/17/2019] [Indexed: 12/24/2022]
Abstract
Traditional single-trait genetic analyses, such as quantitative trait locus (QTL) and genome-wide association studies (GWAS), have been used to understand genotype-phenotype relationships for egg traits in chickens. Even though these techniques can detect potential genes of major effect, they cannot reveal cryptic causal relationships among QTLs and phenotypes. Thus, to better understand the relationships involving multiple genes and phenotypes of interest, other data analysis techniques must be used. Here, we utilized a QTL-directed dependency graph (QDG) mapping approach for a joint analysis of chicken egg traits, so that functional relationships and potential causal effects between them could be investigated. The QDG mapping identified a total of 17 QTLs affecting 24 egg traits that formed three independent networks of phenotypic trait groups (eggshell color, egg production, and size and weight of egg components), clearly distinguishing direct and indirect effects of QTLs towards correlated traits. For example, the network of size and weight of egg components contained 13 QTLs and 18 traits that are densely connected to each other. This indicates complex relationships between genotype and phenotype involving both direct and indirect effects of QTLs on the studied traits. Most of the QTLs were commonly identified by both the traditional (single-trait) mapping and the QDG approach. The network analysis, however, offers additional insight regarding the source and characterization of pleiotropy affecting egg traits. As such, the QDG analysis provides a substantial step forward, revealing cryptic relationships among QTLs and phenotypes, especially regarding direct and indirect QTL effects as well as potential causal relationships between traits, which can be used, for example, to optimize management practices and breeding strategies for the improvement of the traits.
Collapse
|
31
|
Genomic Bayesian Confirmatory Factor Analysis and Bayesian Network To Characterize a Wide Spectrum of Rice Phenotypes. G3-GENES GENOMES GENETICS 2019; 9:1975-1986. [PMID: 30992319 PMCID: PMC6553530 DOI: 10.1534/g3.119.400154] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
With the advent of high-throughput phenotyping platforms, plant breeders have a means to assess many traits for large breeding populations. However, understanding the genetic interdependencies among high-dimensional traits in a statistically robust manner remains a major challenge. Since multiple phenotypes likely share mutual relationships, elucidating the interdependencies among economically important traits can better inform breeding decisions and accelerate the genetic improvement of plants. The objective of this study was to leverage confirmatory factor analysis and graphical modeling to elucidate the genetic interdependencies among a diverse agronomic traits in rice. We used a Bayesian network to depict conditional dependencies among phenotypes, which can not be obtained by standard multi-trait analysis. We utilized Bayesian confirmatory factor analysis which hypothesized that 48 observed phenotypes resulted from six latent variables including grain morphology, morphology, flowering time, physiology, yield, and morphological salt response. This was followed by studying the genetics of each latent variable, which is also known as factor, using single nucleotide polymorphisms. Bayesian network structures involving the genomic component of six latent variables were established by fitting four algorithms (i.e., Hill Climbing, Tabu, Max-Min Hill Climbing, and General 2-Phase Restricted Maximization algorithms). Physiological components influenced the flowering time and grain morphology, and morphology and grain morphology influenced yield. In summary, we show the Bayesian network coupled with factor analysis can provide an effective approach to understand the interdependence patterns among phenotypes and to predict the potential influence of external interventions or selection related to target traits in the interrelated complex traits systems.
Collapse
|
32
|
The comparison of alternative models for genetic evaluation of growth traits in Lori-Bakhtiari sheep: Implications on predictive ability and ranking of animals. Small Rumin Res 2019. [DOI: 10.1016/j.smallrumres.2019.02.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
33
|
Conceptual framework for investigating causal effects from observational data in livestock. J Anim Sci 2018; 96:4045-4062. [PMID: 30107524 DOI: 10.1093/jas/sky277] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2018] [Accepted: 07/03/2018] [Indexed: 01/07/2023] Open
Abstract
Understanding causal mechanisms among variables is critical to efficient management of complex biological systems such as animal agriculture production. The increasing availability of data from commercial livestock operations offers unique opportunities for attaining causal insight, despite the inherently observational nature of these data. Causal claims based on observational data are substantiated by recent theoretical and methodological developments in the rapidly evolving field of causal inference. Thus, the objectives of this review are as follows: 1) to introduce a unifying conceptual framework for investigating causal effects from observational data in livestock, 2) to illustrate its implementation in the context of the animal sciences, and 3) to discuss opportunities and challenges associated with this framework. Foundational to the proposed conceptual framework are graphical objects known as directed acyclic graphs (DAGs). As mathematical constructs and practical tools, DAGs encode putative structural mechanisms underlying causal models together with their probabilistic implications. The process of DAG elicitation and causal identification is central to any causal claims based on observational data. We further discuss necessary causal assumptions and associated limitations to causal inference. Last, we provide practical recommendations to facilitate implementation of causal inference from observational data in the context of the animal sciences.
Collapse
|
34
|
Structural Equation Modeling and Whole-Genome Scans Uncover Chromosome Regions and Enriched Pathways for Carcass and Meat Quality in Beef. Front Genet 2018; 9:532. [PMID: 30555508 PMCID: PMC6282042 DOI: 10.3389/fgene.2018.00532] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2018] [Accepted: 10/22/2018] [Indexed: 12/11/2022] Open
Abstract
Structural equation models involving latent variables are useful tools for formulating hypothesized models defined by theoretical variables and causal links between these variables. The objectives of this study were: (1) to identify latent variables underlying carcass and meat quality traits and (2) to perform whole-genome scans for these latent variables in order to identify genomic regions and individual genes with both direct and indirect effects. A total of 726 steers from an Angus-Brahman multibreed population with records for 22 phenotypes were used. A total of 480 animals were genotyped with the GGP Bovine F-250. The single-step genomic best linear unbiased prediction method was used to estimate the amount of genetic variance explained for each latent variable by chromosome regions of 20 adjacent SNP-windows across the genome. Three types of genetic effects were considered: (1) direct effects on a single latent phenotype; (2) direct effects on two latent phenotypes simultaneously; and (3) indirect effects. The final structural model included carcass quality as an independent latent variable and meat quality as a dependent latent variable. Carcass quality was defined by quality grade, fat over the ribeye and marbling, while the meat quality was described by juiciness, tenderness and connective tissue, all of them measured through a taste panel. From 571 associated genomic regions (643 genes), each one explaining at least 0.05% of the additive variance, 159 regions (179 genes) were associated with carcass quality, 106 regions (114 genes) were associated with both carcass and meat quality, 242 regions (266 genes) were associated with meat quality, and 64 regions (84 genes) were associated with carcass quality, having an indirect effect on meat quality. Three biological mechanisms emerged from these findings: postmortem proteolysis of structural proteins and cellular compartmentalization, cellular proliferation and differentiation of adipocytes, and fat deposition.
Collapse
|
35
|
Harnessing longitudinal information to identify genetic variation in tolerance of pigs to Porcine Reproductive and Respiratory Syndrome virus infection. Genet Sel Evol 2018; 50:50. [PMID: 30355341 PMCID: PMC6201485 DOI: 10.1186/s12711-018-0420-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2018] [Accepted: 10/05/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND High resistance (the ability of the host to reduce pathogen load) and tolerance (the ability to maintain high performance at a given pathogen load) are two desirable host traits for producing animals that are resilient to infections. For Porcine Reproductive and Respiratory Syndrome (PRRS), one of the most devastating swine diseases worldwide, studies have identified substantial genetic variation in resistance of pigs, but evidence for genetic variation in tolerance has so far been inconclusive. Resistance and tolerance are usually considered as static traits. In this study, we used longitudinal viremia measurements of PRRS virus infected pigs to define discrete stages of infection based on viremia profile characteristics. These were used to investigate host genetic effects on viral load (VL) and growth at different stages of infection, to quantify genetic variation in tolerance at these stages and throughout the entire 42-day observation period, and to assess whether the single nucleotide polymorphism (SNP) WUR10000125 (WUR) with known large effects on resistance confers significant differences in tolerance. RESULTS Genetic correlations between resistance and growth changed considerably over time. Individuals that expressed high genetic resistance early in infection tended to grow slower during that time-period, but were more likely to experience lower VL and recovery in growth by the later stage. The WUR genotype was most strongly associated with VL at early- to mid-stages of infection, and with growth at mid- to late-stages of infection. Both, single-stage and repeated measurements random regression models identified significant genetic variation in tolerance. The WUR SNP was significantly associated only with the overall tolerance slope fitted through all stages of infection, with the genetically more resistant AB pigs for the WUR SNP being also more tolerant to PRRS. CONCLUSIONS The results suggest that genetic selection for improved tolerance of pigs to PRRS is possible in principle, but may be feasible only with genomic selection, requiring intense recording schemes that involve repeated measurements to reliably estimate genetic effects. In the absence of such records, consideration of the WUR genotype in current selection schemes appears to be a promising strategy to improve simultaneously resistance and tolerance of growing pigs to PRRS.
Collapse
|
36
|
Including Phenotypic Causal Networks in Genome-Wide Association Studies Using Mixed Effects Structural Equation Models. Front Genet 2018; 9:455. [PMID: 30356716 PMCID: PMC6189326 DOI: 10.3389/fgene.2018.00455] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2018] [Accepted: 09/18/2018] [Indexed: 12/21/2022] Open
Abstract
Network based statistical models accounting for putative causal relationships among multiple phenotypes can be used to infer single-nucleotide polymorphism (SNP) effect which transmitting through a given causal path in genome-wide association studies (GWAS). In GWAS with multiple phenotypes, reconstructing underlying causal structures among traits and SNPs using a single statistical framework is essential for understanding the entirety of genotype-phenotype maps. A structural equation model (SEM) can be used for such purposes. We applied SEM to GWAS (SEM-GWAS) in chickens, taking into account putative causal relationships among breast meat (BM), body weight (BW), hen-house production (HHP), and SNPs. We assessed the performance of SEM-GWAS by comparing the model results with those obtained from traditional multi-trait association analyses (MTM-GWAS). Three different putative causal path diagrams were inferred from highest posterior density (HPD) intervals of 0.75, 0.85, and 0.95 using the inductive causation algorithm. A positive path coefficient was estimated for BM → BW, and negative values were obtained for BM → HHP and BW → HHP in all implemented scenarios. Further, the application of SEM-GWAS enabled the decomposition of SNP effects into direct, indirect, and total effects, identifying whether a SNP effect is acting directly or indirectly on a given trait. In contrast, MTM-GWAS only captured overall genetic effects on traits, which is equivalent to combining the direct and indirect SNP effects from SEM-GWAS. Although MTM-GWAS and SEM-GWAS use the similar probabilistic models, we provide evidence that SEM-GWAS captures complex relationships in terms of causal meaning and mediation and delivers a more comprehensive understanding of SNP effects compared to MTM-GWAS. Our results showed that SEM-GWAS provides important insight regarding the mechanism by which identified SNPs control traits by partitioning them into direct, indirect, and total SNP effects.
Collapse
|
37
|
What Is the Influence of Morphological Knowledge in the Early Stages of Reading Acquisition Among Low SES Children? A Graphical Modeling Approach. Front Psychol 2018; 9:547. [PMID: 29725313 PMCID: PMC5917267 DOI: 10.3389/fpsyg.2018.00547] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2017] [Accepted: 03/30/2018] [Indexed: 11/13/2022] Open
Abstract
Children from low-SES families are known to show delays in aspects of language development which underpin reading acquisition such as vocabulary and listening comprehension. Research on the development of morphological skills in this group is scarce, and no studies exist in French. The present study investigated the involvement of morphological knowledge in the very early stages of reading acquisition (decoding), before reading comprehension can be reliably assessed. We assessed listening comprehension, receptive vocabulary, phoneme awareness, morphological awareness as well as decoding, word reading and non-verbal IQ in 703 French first-graders from low-SES families after 3 months of formal schooling (November). Awareness of derivational morphology was assessed using three oral tasks: Relationship Judgment (e.g., do these words belong to the same family or not? heat-heater … ham-hammer); Lexical Sentence Completion [e.g., Someone who runs is a …? (runner)]; and Non-lexical Sentence Completion [e.g., Someone who lums is a…? (lummer)]. The tasks differ on implicit/explicit demands and also tap different kinds of morphological knowledge. The Judgement task measures the phonological and semantic properties of the morphological relationship and the Sentence Completion tasks measure knowledge of morphological production rules. Data were processed using a graphical modeling approach which offers key information about how skills known to be involved in learning to read are organized in memory. This modeling approach was therefore useful in revealing a potential network which expresses the conditional dependence structure between skills, after which recursive structural equation modeling was applied to test specific hypotheses. Six main conclusions can be drawn from these analyses about low SES reading acquisition: (1) listening comprehension is at the heart of the reading acquisition process; (2) word reading depends directly on phonemic awareness and indirectly on listening comprehension; (3) decoding depends on word reading; (4) Morphological awareness and vocabulary have an indirect influence on word reading via both listening comprehension and phoneme awareness; (5) the components of morphological awareness assessed by our tasks have independent relationships with listening comprehension; and (6) neither phonemic nor morphological awareness influence vocabulary directly. The implications of these results with regard to early reading acquisition among low SES groups are discussed.
Collapse
|
38
|
Family history and obesity in youth, their effect on acylcarnitine/aminoacids metabolomics and non-alcoholic fatty liver disease (NAFLD). Structural equation modeling approach. PLoS One 2018; 13:e0193138. [PMID: 29466466 PMCID: PMC5821462 DOI: 10.1371/journal.pone.0193138] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2017] [Accepted: 02/05/2018] [Indexed: 01/21/2023] Open
Abstract
BACKGROUND Structural equation modeling (SEM) can help understanding complex functional relationships among obesity, non-alcoholic fatty liver disease (NAFLD), family history of obesity, targeted metabolomics and pro-inflammatory markers. We tested two hypotheses: 1) If obesity precedes an excess of free fatty acids that increase oxidative stress and mitochondrial dysfunction, there would be an increase of serum acylcarnitines, amino acids and cytokines in obese subjects. Acylcarnitines would be related to non-alcoholic fatty disease that will induce insulin resistance. 2) If a positive family history of obesity and type 2 diabetes are the major determinants of the metabolomic profile, there would be higher concentration of amino acids and acylcarnitines in patients with this background that will induce obesity and NAFLD which in turn will induce insulin resistance. METHODS/RESULTS 137 normoglycemic subjects, mean age (SD) of 30.61 (8.6) years divided in three groups: BMI<25 with absence of NAFLD (G1), n = 82; BMI>30 with absence of NAFLD (G2), n = 24; and BMI>30 with NAFLD (G3), n = 31. Family history of obesity (any) was present in 53%. Both models were adjusted in SEM. Family history of obesity predicted obesity but could not predict acylcarnitines and amino acid concentrations (effect size <0.2), but did predict obesity phenotype. CONCLUSION Family history of obesity is the major predictor of obesity, and the metabolic abnormalities on amino acids, acylcarnitines, inflammation, insulin resistance, and NAFLD.
Collapse
|
39
|
Abstract
In this paper we use Bayesian networks to determine and visualise the interactions among various Class III malocclusion maxillofacial features during growth and treatment. We start from a sample of 143 patients characterised through a series of a maximum of 21 different craniofacial features. We estimate a network model from these data and we test its consistency by verifying some commonly accepted hypotheses on the evolution of these disharmonies by means of Bayesian statistics. We show that untreated subjects develop different Class III craniofacial growth patterns as compared to patients submitted to orthodontic treatment with rapid maxillary expansion and facemask therapy. Among treated patients the CoA segment (the maxillary length) and the ANB angle (the antero-posterior relation of the maxilla to the mandible) seem to be the skeletal subspaces that receive the main effect of the treatment.
Collapse
|
40
|
A novel network analysis approach reveals DNA damage, oxidative stress and calcium/cAMP homeostasis-associated biomarkers in frontotemporal dementia. PLoS One 2017; 12:e0185797. [PMID: 29020091 PMCID: PMC5636111 DOI: 10.1371/journal.pone.0185797] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2017] [Accepted: 09/19/2017] [Indexed: 01/04/2023] Open
Abstract
Frontotemporal Dementia (FTD) is the form of neurodegenerative dementia with the highest prevalence after Alzheimer’s disease, equally distributed in men and women. It includes several variants, generally characterized by behavioural instability and language impairments. Although few mendelian genes (MAPT, GRN, and C9orf72) have been associated to the FTD phenotype, in most cases there is only evidence of multiple risk loci with relatively small effect size. To date, there are no comprehensive studies describing FTD at molecular level, highlighting possible genetic interactions and signalling pathways at the origin FTD-associated neurodegeneration. In this study, we designed a broad FTD genetic interaction map of the Italian population, through a novel network-based approach modelled on the concepts of disease-relevance and interaction perturbation, combining Steiner tree search and Structural Equation Model (SEM) analysis. Our results show a strong connection between Calcium/cAMP metabolism, oxidative stress-induced Serine/Threonine kinases activation, and postsynaptic membrane potentiation, suggesting a possible combination of neuronal damage and loss of neuroprotection, leading to cell death. In our model, Calcium/cAMP homeostasis and energetic metabolism impairments are primary causes of loss of neuroprotection and neural cell damage, respectively. Secondly, the altered postsynaptic membrane potentiation, due to the activation of stress-induced Serine/Threonine kinases, leads to neurodegeneration. Our study investigates the molecular underpinnings of these processes, evidencing key genes and gene interactions that may account for a significant fraction of unexplained FTD aetiology. We emphasized the key molecular actors in these processes, proposing them as novel FTD biomarkers that could be crucial for further epidemiological and molecular studies.
Collapse
|
41
|
Implementing structural equation models to observational data from feedlot production systems. Prev Vet Med 2017; 147:163-171. [PMID: 29254715 DOI: 10.1016/j.prevetmed.2017.09.002] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2017] [Revised: 08/09/2017] [Accepted: 09/03/2017] [Indexed: 12/28/2022]
Abstract
The objective of this study was to illustrate the implementation of a mixed-model-based structural equation modeling (SEM) approach to observational data in the context of feedlot production systems. Different from traditional multiple-trait models, SEMs allow assessment of potential causal interrelationships between outcomes and can effectively discriminate between direct and indirect effects. For illustration, we focused on feedlot performance and its relationship to health outcomes related to Bovine Respiratory Disease (BRD), which accounts for approximately 75% of morbidity and 50-80% of deaths in feedlots. Our data consisted of 1430 lots representing 178,983 cattle from 9 feedlot operations located across the US Great Plains. We explored functional links between arrival weight (AW; i = 1), BRD-related treatment costs (Trt$; as a proxy for health; i = 2) and average daily weight gain (ADG; as an indicator of productive performance i = 3), accounting for the fixed effect of sex and correlation patterns due to the clustering of lots within feedlots. We proposed competing plausible causal models based on expert knowledge. The best fitting model selected for inference supported direct effects of AW on ADG as well as indirect effects of AW on ADG mediated by Trt$. Direct effects from outcome i' to outcome i are quantified by the structural coefficient λii', such that every unit increase in kg/head of AW had a direct effect of increasing ADG by approximately (estimate ± standard error) λˆ31=0.002±0.0001 kg/head/day and also a direct effect of reducing Trt$ by an estimated λˆ21=$0.08±0.006 USD per head. In addition, every $1 USD spent on Trt$ directly decreased ADG by an estimated λˆ32=0.004±0.0006 kg/head/day. From these estimates, we show how to compute the indirect, Trt$-mediated, effect of AW on ADG, as well as the overall effect of AW on ADG, including both direct and indirect effects. We further compared estimates of SEM-based effects with those obtained from standard linear regression mixed models and demonstrated the additional advantage of explicitly distinguishing direct and indirect components of an overall regression effect using SEMs. Understanding the direct and indirect mechanisms of interplay between health and performance outcomes may provide valuable insight into production systems.
Collapse
|
42
|
Bayesian Networks Illustrate Genomic and Residual Trait Connections in Maize ( Zea mays L.). G3-GENES GENOMES GENETICS 2017. [PMID: 28637811 PMCID: PMC5555481 DOI: 10.1534/g3.117.044263] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Relationships among traits were investigated on the genomic and residual levels using novel methodology. This included inference on these relationships via Bayesian networks and an assessment of the networks with structural equation models. The methodology employed three steps. First, a Bayesian multiple-trait Gaussian model was fitted to the data to decompose phenotypic values into their genomic and residual components. Second, genomic and residual network structures among traits were learned from estimates of these two components. Network learning was performed using six different algorithmic settings for comparison, of which two were score-based and four were constraint-based approaches. Third, structural equation model analyses ranked the networks in terms of goodness of fit and predictive ability, and compared them with the standard multiple-trait fully recursive network. The methodology was applied to experimental data representing the European heterotic maize pools Dent and Flint (Zea mays L.). Inferences on genomic and residual trait connections were depicted separately as directed acyclic graphs. These graphs provide information beyond mere pairwise genetic or residual associations between traits, illustrating for example conditional independencies and hinting at potential causal links among traits. Network analysis suggested some genetic correlations as potentially spurious. Genomic and residual networks were compared between Dent and Flint.
Collapse
|
43
|
Inferring phenotypic causal structures among meat quality traits and the application of a structural equation model in Japanese Black cattle. J Anim Sci 2017; 94:4133-4142. [PMID: 27898842 DOI: 10.2527/jas.2016-0554] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Meat quality is one of the most important traits determining carcass price in the Japanese beef market. Optimized breeding goals and management practices for the improvement of meat quality traits requires knowledge regarding any potential functional relationships between them. In this context, the objective of this research was to infer phenotypic causal networks involving beef marbling score (BMS), beef color score (BCL), firmness of beef (FIR), texture of beef (TEX), beef fat color score (BFS), and the ratio of MUFA to SFA (MUS) from 11,855 Japanese Black cattle. The inductive causation (IC) algorithm was implemented to search for causal links among these traits and was conditionally applied to their joint distribution on genetic effects. This information was obtained from the posterior distribution of the residual (co)variance matrix of a standard Bayesian multiple trait model (MTM). Apart from BFS, the IC algorithm implemented with 95% highest posterior density (HPD) intervals detected only undirected links among the traits. However, as a result of the application of 80% HPD intervals, more links were recovered and the undirected links were changed into directed ones, except between FIR and TEX. Therefore, 2 competing causal networks resulting from the IC algorithm, with either the arrow FIR → TEX or the arrow FIR ← TEX, were fitted using a structural equation model () to infer causal structure coefficients between the selected traits. Results indicated similar genetic and residual variances as well as genetic correlation estimates from both structural equation models. The genetic variances in BMS, FIR, and TEX from the structural equation models were smaller than those obtained from the MTM. In contrast, the variances in BCL, BFS, and MUS, which were not conditioned on any of the other traits in the causal structures, had no significant differences between the structural equation model and MTM. The structural coefficient for the path from MUS (BCL) to BMS showed that a 1-unit improvement in MUS (BCL) resulted in an increase of 0.85 or 1.45 (an decrease of 0.52 or 0.54) in BMS in the causal structures. The analysis revealed some interesting functional relationships, direct genetic effects, and the magnitude of the causal effects between these traits, for example, indicating that BMS would be affected by interventions on MUS and BCL. In addition, if interventions existed in this scenario, a breeding strategy based only on the MTM would lead to a mistaken selection for BMS.
Collapse
|
44
|
Multiple-trait structured antedependence model to study the relationship between litter size and birth weight in pigs and rabbits. Genet Sel Evol 2017; 49:11. [PMID: 28107818 PMCID: PMC5439150 DOI: 10.1186/s12711-017-0288-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2016] [Accepted: 01/10/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Some genetic studies need to take into account correlations between traits that are repeatedly measured over time. Multiple-trait random regression models are commonly used to analyze repeated traits but suffer from several major drawbacks. In the present study, we developed a multiple-trait extension of the structured antedependence model (SAD) to overcome this issue and validated its usefulness by modeling the association between litter size (LS) and average birth weight (ABW) over parities in pigs and rabbits. METHODS The single-trait SAD model assumes that a random effect at time [Formula: see text] can be explained by the previous values of the random effect (i.e. at previous times). The proposed multiple-trait extension of the SAD model consists in adding a cross-antedependence parameter to the single-trait SAD model. This model can be easily fitted using ASReml and the OWN Fortran program that we have developed. In comparison with the random regression model, we used our multiple-trait SAD model to analyze the LS and ABW of 4345 litters from 1817 Large White sows and 8706 litters from 2286 L-1777 does over a maximum of five successive parities. RESULTS For both species, the multiple-trait SAD fitted the data better than the random regression model. The difference between AIC of the two models (AIC_random regression-AIC_SAD) were equal to 7 and 227 for pigs and rabbits, respectively. A similar pattern of heritability and correlation estimates was obtained for both species. Heritabilities were lower for LS (ranging from 0.09 to 0.29) than for ABW (ranging from 0.23 to 0.39). The general trend was a decrease of the genetic correlation for a given trait between more distant parities. Estimates of genetic correlations between LS and ABW were negative and ranged from -0.03 to -0.52 across parities. No correlation was observed between the permanent environmental effects, except between the permanent environmental effects of LS and ABW of the same parity, for which the estimate of the correlation was strongly negative (ranging from -0.57 to -0.67). CONCLUSIONS We demonstrated that application of our multiple-trait SAD model is feasible for studying several traits with repeated measurements and showed that it provided a better fit to the data than the random regression model.
Collapse
|
45
|
Inferring causal structures and comparing the causal effects among calving difficulty, gestation length and calf size in Japanese Black cattle. Animal 2017; 11:2120-2128. [DOI: 10.1017/s1751731117000957] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
46
|
A new statistical framework for genetic pleiotropic analysis of high dimensional phenotype data. BMC Genomics 2016; 17:881. [PMID: 27821073 PMCID: PMC5100198 DOI: 10.1186/s12864-016-3169-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2015] [Accepted: 10/18/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The widely used genetic pleiotropic analyses of multiple phenotypes are often designed for examining the relationship between common variants and a few phenotypes. They are not suited for both high dimensional phenotypes and high dimensional genotype (next-generation sequencing) data. To overcome limitations of the traditional genetic pleiotropic analysis of multiple phenotypes, we develop sparse structural equation models (SEMs) as a general framework for a new paradigm of genetic analysis of multiple phenotypes. To incorporate both common and rare variants into the analysis, we extend the traditional multivariate SEMs to sparse functional SEMs. To deal with high dimensional phenotype and genotype data, we employ functional data analysis and the alternative direction methods of multiplier (ADMM) techniques to reduce data dimension and improve computational efficiency. RESULTS Using large scale simulations we showed that the proposed methods have higher power to detect true causal genetic pleiotropic structure than other existing methods. Simulations also demonstrate that the gene-based pleiotropic analysis has higher power than the single variant-based pleiotropic analysis. The proposed method is applied to exome sequence data from the NHLBI's Exome Sequencing Project (ESP) with 11 phenotypes, which identifies a network with 137 genes connected to 11 phenotypes and 341 edges. Among them, 114 genes showed pleiotropic genetic effects and 45 genes were reported to be associated with phenotypes in the analysis or other cardiovascular disease (CVD) related phenotypes in the literature. CONCLUSIONS Our proposed sparse functional SEMs can incorporate both common and rare variants into the analysis and the ADMM algorithm can efficiently solve the penalized SEMs. Using this model we can jointly infer genetic architecture and casual phenotype network structure, and decompose the genetic effect into direct, indirect and total effect. Using large scale simulations we showed that the proposed methods have higher power to detect true causal genetic pleiotropic structure than other existing methods.
Collapse
|
47
|
Searching for causal networks involving latent variables in complex traits: Application to growth, carcass, and meat quality traits in pigs. J Anim Sci 2016; 93:4617-23. [PMID: 26523553 DOI: 10.2527/jas.2015-9213] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Structural equation models (SEQM) can be used to model causal relationships between multiple variables in multivariate systems. Among the strengths of SEQM is its ability to consider causal links between latent variables. The use of latent variables allows modeling complex phenomena while reducing at the same time the dimensionality of the data. One relevant aspect in the quantitative genetics context is the possibility of correlated genetic effects influencing sets of variables under study. Under this scenario, if one aims at inferring causality among latent variables, genetic covariances act as confounders if ignored. Here we describe a methodology for assessing causal networks involving latent variables underlying complex phenotypic traits. The first step of the method consists of the construction of latent variables defined on the basis of prior knowledge and biological interest. These latent variables are jointly evaluated using confirmatory factor analysis. The estimated factor scores are then used as phenotypes for fitting a multivariate mixed model to obtain the covariance matrix of latent variables conditional on the genetic effects. Finally, causal relationships between the adjusted latent variables are evaluated using different SEQM with alternative causal specifications. We have applied this method to a data set with pigs for which several phenotypes were recorded over time. Five different latent variables were evaluated to explore causal links between growth, carcass, and meat quality traits. The measurement model, which included 5 latent variables capturing the information conveyed by 19 different phenotypic traits, showed an acceptable fit to data (e.g., χ/df = 1.3, root-mean-square error of approximation = 0.028, standardized root-mean-square residual = 0.041). Causal links between latent variables were explored after removing genetic confounders. Interestingly, we found that both growth (-0.160) and carcass traits (-0.500) have a significant negative causal effect on quality traits (-value ≤ 0.001). This result may have important implications for strategies for pig production improvement. More generally, the proposed method allows further learning regarding phenotypic causal structures underlying complex traits in farm species.
Collapse
|
48
|
Assessing Predictive Properties of Genome-Wide Selection in Soybeans. G3 (BETHESDA, MD.) 2016; 6:2611-6. [PMID: 27317786 PMCID: PMC4978914 DOI: 10.1534/g3.116.032268] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Accepted: 06/16/2016] [Indexed: 11/30/2022]
Abstract
Many economically important traits in plant breeding have low heritability or are difficult to measure. For these traits, genomic selection has attractive features and may boost genetic gains. Our goal was to evaluate alternative scenarios to implement genomic selection for yield components in soybean (Glycine max L. merr). We used a nested association panel with cross validation to evaluate the impacts of training population size, genotyping density, and prediction model on the accuracy of genomic prediction. Our results indicate that training population size was the factor most relevant to improvement in genome-wide prediction, with greatest improvement observed in training sets up to 2000 individuals. We discuss assumptions that influence the choice of the prediction model. Although alternative models had minor impacts on prediction accuracy, the most robust prediction model was the combination of reproducing kernel Hilbert space regression and BayesB. Higher genotyping density marginally improved accuracy. Our study finds that breeding programs seeking efficient genomic selection in soybeans would best allocate resources by investing in a representative training set.
Collapse
|
49
|
Quantitative genetic analysis of causal relationships among feather pecking, feather eating, and general locomotor activity in laying hens using structural equation models. Poult Sci 2016; 95:1757-63. [PMID: 27252366 DOI: 10.3382/ps/pew146] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/29/2016] [Indexed: 11/20/2022] Open
Abstract
The objective of this research was to analyze the relationship between feather pecking (FP) and feather eating (FE) as well as general locomotor activity (GLA) using structural equation models, which allow that one trait can be treated as an explanatory variable of another trait. This provides an opportunity to infer putative causal links among the traits. For the analysis, 897 F2-hens set up from 2 lines divergently selected for high and low FP were available. The FP observations were Box-Cox transformed, and FE and GLA observations were log and square root transformed, respectively. The estimated heritabilities of FE, GLA, and FP were 0.36, 0.29, and 0.20, respectively. The genetic correlation between FP and FE (GLA) was 0.17 (0.04). A high genetic correlation of 0.47 was estimated between FE and GLA. The recursive effect from FE to FP was [Formula: see text], and from GLA to FP [Formula: see text] These results imply that an increase of FE leads to an increased FP behavior and that an increase in GLA results in a higher FP value. Furthermore, the study showed that the genetic correlation among the traits is mainly caused by indirect effects.
Collapse
|
50
|
Discovering phenotypic causal structure from nonexperimental data. J Evol Biol 2016; 29:1268-77. [PMID: 27007864 DOI: 10.1111/jeb.12869] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2014] [Revised: 03/09/2016] [Accepted: 03/14/2016] [Indexed: 11/27/2022]
Abstract
The evolutionary potential of organisms depends on how their parts are structured into a cohesive whole. A major obstacle for empirical studies of phenotypic organization is that observed associations among characters usually confound different causal pathways such as pleiotropic modules, interphenotypic causal relationships and environmental effects. The present article proposes causal search algorithms as a new tool to distinguish these different modes of phenotypic integration. Without assuming an a priori structure, the algorithms seek a class of causal hypotheses consistent with independence relationships holding in observational data. The technique can be applied to discover causal relationships among a set of measured traits and to distinguish genuine selection from spurious correlations. The former application is illustrated with a biological data set of rat morphological measurements previously analysed by Cheverud et al. (Evolution 1983, 37, 895).
Collapse
|