1
|
Klein N, Entwistle A, Rosenberger A, Kneib T, Bickeböller H. Candidate-gene association analysis for a continuous phenotype with a spike at zero using parent-offspring trios. J Appl Stat 2019; 47:2066-2080. [DOI: 10.1080/02664763.2019.1704226] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Affiliation(s)
- Nadja Klein
- Humboldt University of Berlin, Berlin, Germany
| | | | | | - Thomas Kneib
- Georg-August-Universität Göttingen, Göttingen, Germany
| | | |
Collapse
|
2
|
Zhang Z, Wang JC, Howells W, Lin P, Agrawal A, Edenberg HJ, Tischfield JA, Schuckit MA, Bierut LJ, Goate A, Rice JP. Dosage transmission disequilibrium test (dTDT) for linkage and association detection. PLoS One 2013; 8:e63526. [PMID: 23691058 PMCID: PMC3653954 DOI: 10.1371/journal.pone.0063526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2012] [Accepted: 04/06/2013] [Indexed: 11/26/2022] Open
Abstract
Both linkage and association studies have been successfully applied to identify disease susceptibility genes with genetic markers such as microsatellites and Single Nucleotide Polymorphisms (SNPs). As one of the traditional family-based studies, the Transmission/Disequilibrium Test (TDT) measures the over-transmission of an allele in a trio from its heterozygous parents to the affected offspring and can be potentially useful to identify genetic determinants for complex disorders. However, there is reduced information when complete trio information is unavailable. In this study, we developed a novel approach to "infer" the transmission of SNPs by combining both the linkage and association data, which uses microsatellite markers from families informative for linkage together with SNP markers from the offspring who are genotyped for both linkage and a Genome-Wide Association Study (GWAS). We generalized the traditional TDT to process these inferred dosage probabilities, which we name as the dosage-TDT (dTDT). For evaluation purpose, we developed a simulation procedure to assess its operating characteristics. We applied the dTDT to the simulated data and documented the power of the dTDT under a number of different realistic scenarios. Finally, we applied our methods to a family study of alcohol dependence (COGA) and performed individual genotyping on complete families for the top signals. One SNP (rs4903712 on chromosome 14) remained significant after correcting for multiple testing Methods developed in this study can be adapted to other platforms and will have widespread applicability in genomic research when case-control GWAS data are collected in families with existing linkage data.
Collapse
Affiliation(s)
- Zhehao Zhang
- Washington University School of Medicine, Department of Psychiatry, St. Louis, Missouri, United States of America
| | - Jen-Chyong Wang
- Washington University School of Medicine, Department of Psychiatry, St. Louis, Missouri, United States of America
| | - William Howells
- Washington University School of Medicine, Department of Psychiatry, St. Louis, Missouri, United States of America
| | - Peng Lin
- Washington University School of Medicine, Department of Psychiatry, St. Louis, Missouri, United States of America
| | - Arpana Agrawal
- Washington University School of Medicine, Department of Psychiatry, St. Louis, Missouri, United States of America
| | - Howard J. Edenberg
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, Indiana, United States of America
| | - Jay A. Tischfield
- LSB 136, Rutgers University, Piscataway, New Jersey, United States of America
| | - Marc A. Schuckit
- Department of Psychiatry, University of California San Diego, La Jolla, California, United States of America
| | - Laura J. Bierut
- Washington University School of Medicine, Department of Psychiatry, St. Louis, Missouri, United States of America
| | - Alison Goate
- Washington University School of Medicine, Department of Psychiatry, St. Louis, Missouri, United States of America
| | - John P. Rice
- Washington University School of Medicine, Department of Psychiatry, St. Louis, Missouri, United States of America
| |
Collapse
|
3
|
Wang Y, Huang C, Fang Y, Yang Q, Li R. Flexible semiparametric analysis of longitudinal genetic studies by reduced rank smoothing. J R Stat Soc Ser C Appl Stat 2012; 61:1-24. [PMID: 22581986 PMCID: PMC3348702 DOI: 10.1111/j.1467-9876.2011.01016.x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
In family-based longitudinal genetic studies, investigators collect repeated measurements on a trait that changes with time along with genetic markers. Since repeated measurements are nested within subjects and subjects are nested within families, both the subject-level and measurement-level correlations must be taken into account in the statistical analysis to achieve more accurate estimation. In such studies, the primary interests include to test for quantitative trait locus (QTL) effect, and to estimate age-specific QTL effect and residual polygenic heritability function. We propose flexible semiparametric models along with their statistical estimation and hypothesis testing procedures for longitudinal genetic designs. We employ penalized splines to estimate nonparametric functions in the models. We find that misspecifying the baseline function or the genetic effect function in a parametric analysis may lead to substantially inflated or highly conservative type I error rate on testing and large mean squared error on estimation. We apply the proposed approaches to examine age-specific effects of genetic variants reported in a recent genome-wide association study of blood pressure collected in the Framingham Heart Study.
Collapse
Affiliation(s)
| | | | | | | | - Runze Li
- The Pennsylvania State University at University Park, University Park, USA
| |
Collapse
|
4
|
Fardo DW, Liu J, Demeo DL, Silverman EK, Vansteelandt S. Gene-environment interaction testing in family-based association studies with phenotypically ascertained samples: a causal inference approach. Biostatistics 2011; 13:468-81. [PMID: 22084302 DOI: 10.1093/biostatistics/kxr035] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We propose a method for testing gene-environment (G × E) interactions on a complex trait in family-based studies in which a phenotypic ascertainment criterion has been imposed. This novel approach employs G-estimation, a semiparametric estimation technique from the causal inference literature, to avoid modeling of the association between the environmental exposure and the phenotype, to gain robustness against unmeasured confounding due to population substructure, and to acknowledge the ascertainment conditions. The proposed test allows for incomplete parental genotypes. It is compared by simulation studies to an analogous conditional likelihood-based approach and to the QBAT-I test, which also invokes the G-estimation principle but ignores ascertainment. We apply our approach to a study of chronic obstructive pulmonary disorder.
Collapse
Affiliation(s)
- David W Fardo
- Department of Biostatistics, Division of Biomedical Informatics, Center for Clinical and Translational Science, University of Kentucky, Lexington, KY 40536, USA.
| | | | | | | | | |
Collapse
|
5
|
Wang Y, Yang Q, Rabinowitz D. Unbiased and locally efficient estimation of genetic effect on quantitative trait in the presence of population admixture. Biometrics 2010; 67:331-43. [PMID: 20560930 DOI: 10.1111/j.1541-0420.2010.01454.x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Population admixture can be a confounding factor in genetic association studies. Family-based methods (Rabinowitz and Larid, 2000, Human Heredity 50, 211-223) have been proposed in both testing and estimation settings to adjust for this confounding, especially in case-only association studies. The family-based methods rely on conditioning on the observed parental genotypes or on the minimal sufficient statistic for the genetic model under the null hypothesis. In some cases, these methods do not capture all the available information due to the conditioning strategy being too stringent. General efficient methods to adjust for population admixture that use all the available information have been proposed (Rabinowitz, 2002, Journal of the American Statistical Association 92, 742-758). However these approaches may not be easy to implement in some situations. A previously developed easy-to-compute approach adjusts for admixture by adding supplemental covariates to linear models (Yang et al., 2000, Human Heredity 50, 227-233). Here is shown that this augmenting linear model with appropriate covariates strategy can be combined with the general efficient methods in Rabinowitz (2002) to provide computationally tractable and locally efficient adjustment. After deriving the optimal covariates, the adjusted analysis can be carried out using standard statistical software packages such as SAS or R. The proposed methods enjoy a local efficiency in a neighborhood of the true model. The simulation studies show that nontrivial efficiency gains can be obtained by using information not accessible to the methods that rely on conditioning on the minimal sufficient statistics. The approaches are illustrated through an analysis of the influence of apolipoprotein E (APOE) genotype on plasma low-density lipoprotein (LDL) concentration in children.
Collapse
Affiliation(s)
- Yuanjia Wang
- Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, New York 10032, USA.
| | | | | |
Collapse
|
6
|
Welsby IJ, Podgoreanu MV, Phillips-Bute B, Morris R, Mathew JP, Smith PK, Newman MF, Schwinn DA, Stafford-Smith M. Association of the 98T ELAM-1 polymorphism with increased bleeding after cardiac surgery. J Cardiothorac Vasc Anesth 2010; 24:427-33. [PMID: 20056442 DOI: 10.1053/j.jvca.2009.10.030] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/14/2009] [Indexed: 01/04/2023]
Abstract
OBJECTIVE Hemorrhage continues to be a major problem after cardiac surgery despite the routine use of antifibrinolytic drugs, with striking inter-patient variability poorly explained by already known risk factors. The authors tested the hypothesis that genetic polymorphisms of inflammatory mediators and cellular adhesion molecules are associated with bleeding after cardiac surgery. DESIGN Prospective, observational study. SETTING Single, tertiary referral university heart center. PARTICIPANTS Adult patients undergoing aortocoronary surgery with cardiopulmonary bypass. INTERVENTIONS Patients (n = 759) had 10 mL of blood drawn preoperatively and genomic DNA isolated then genotyped for 17 polymorphisms in 7 candidate genes: tumor necrosis factor, interleukins 1beta and 6, interleukin 1 receptor antagonist, intercellular adhesion molecule-1 (ICAM-1), P-selectin and endothelial leucocyte adhesion molecule-1 (E-selectin). Multivariate analyses were used to relate clinical and genetic factors to bleeding and transfusion. MEASUREMENTS AND MAIN RESULTS The 98G/T polymorphism of the E-selectin gene was independently associated with bleeding after cardiac surgery (p = 0.002), after adjusting for significant clinical predictors (patient size and baseline hemoglobin concentration). There was a gene dose effect according to the number of minor alleles in the genotype; carriers of the minor allele bled 17% (GT) and 54% (TT) more than wild type (GG) genotypes, respectively (p = 0.01). Carriers of the minor allele also had longer activated partial thromboplastin times (p = 0.0023) and increased fresh frozen plasma transfusion (p = 0.03) compared with wild type. CONCLUSIONS The authors found a dose-related association between the 98T E-selectin polymorphism and bleeding after cardiac surgery, independent of and additive to standard clinical risk factors.
Collapse
Affiliation(s)
- Ian J Welsby
- Department of Anesthesiology, Duke University Medical Center, Durham, NC 27710, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Subramanian S. Methods and approaches in using secondary data sources to study race and ethnicity factors. Methods Mol Biol 2009; 471:227-237. [PMID: 19109783 DOI: 10.1007/978-1-59745-416-2_12] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Race and ethnicity are increasing used in cancer research to assess differences in cancer incidence and response to therapy. In this chapter, we discuss the measurement and methodologic issues that should be addressed to minimize bias and derive valid estimates when performing such assessments. These issues include 1) lack of national standards for race and ethnicity categories; 2) difficulty in comparing race and ethnic categories in longitudinal assessments; 3) broad categorization of race and ethnicity groups that do not provide adequate details for meaningful assessments; 4) inaccuracies in race and ethnicity data collection, and 5) confounding by socioeconomic and other factors. Recommendations for improving race and ethnicity data collection also are discussed.
Collapse
|
8
|
Wheeler E, Cordell HJ. Quantitative trait association in parent offspring trios: Extension of case/pseudocontrol method and comparison of prospective and retrospective approaches. Genet Epidemiol 2008; 31:813-33. [PMID: 17549757 PMCID: PMC2707979 DOI: 10.1002/gepi.20243] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
The case/pseudocontrol method provides a convenient framework for family-based association analysis of case-parent trios, incorporating several previously proposed methods such as the transmission/disequilibrium test and log-linear modelling of parent-of-origin effects. The method allows genotype and haplotype analysis at an arbitrary number of linked and unlinked multiallelic loci, as well as modelling of more complex effects such as epistasis, parent-of-origin effects, maternal genotype and mother-child interaction effects, and gene-environment interactions. Here we extend the method for analysis of quantitative as opposed to dichotomous (e.g. disease) traits. The resulting method can be thought of as a retrospective approach, modelling genotype given trait value, in contrast to prospective approaches that model trait given genotype. Through simulations and analytical derivations, we examine the power and properties of our proposed approach, and compare it to several previously proposed single-locus methods for quantitative trait association analysis. We investigate the performance of the different methods when extended to allow analysis of haplotype, maternal genotype and parent-of-origin effects. With randomly ascertained families, with or without population stratification, the prospective approach (modeling trait value given genotype) is found to be generally most effective, although the retrospective approach has some advantages with regard to estimation and interpretability of parameter estimates when applied to selected samples.
Collapse
Affiliation(s)
| | - Heather J Cordell
- Institute of Human Genetics, Newcastle UniversityUK
- * Correspondence to: Heather Cordell, Institute of Human Genetics, Newcastle University, International Centre for Life, Central Parkway, Newcastle upon Tyne NE1 3BZ, UK. E-mail:
| |
Collapse
|
9
|
Vansteelandt S, Demeo DL, Lasky-Su J, Smoller JW, Murphy AJ, McQueen M, Schneiter K, Celedon JC, Weiss ST, Silverman EK, Lange C. Testing and estimating gene-environment interactions in family-based association studies. Biometrics 2007; 64:458-67. [PMID: 17970814 DOI: 10.1111/j.1541-0420.2007.00925.x] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
We propose robust and efficient tests and estimators for gene-environment/gene-drug interactions in family-based association studies in which haplotypes, dichotomous/quantitative phenotypes, and complex exposure/treatment variables are analyzed. Using causal inference methodology, we show that the tests and estimators are robust against unmeasured confounding due to population admixture and stratification, provided that Mendel's law of segregation holds and that the considered exposure/treatment variable is not affected by the candidate gene under study. We illustrate the practical relevance of our approach by an application to a chronic obstructive pulmonary disease study. The data analysis suggests a gene-environment interaction between a single nucleotide polymorphism in the Serpine2 gene and smoking status/pack-years of smoking. Simulation studies show that the proposed methodology is sufficiently powered for realistic sample sizes and that it provides valid tests and effect size estimators in the presence of admixture and stratification.
Collapse
Affiliation(s)
- Stijn Vansteelandt
- Department of Applied Mathematics and Computer Science, Ghent University, Krijgslaan 281, S9, B-9000 Gent, Belgium.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Abstract
An initial linkage analysis of the alcoholism phenotype as defined by DSM-III-R criteria and alcoholism defined by DSM-IV criteria showed many, sometimes striking, inconsistencies. These inconsistencies are greatly reduced by making the definition of alcoholism more specific. We defined new phenotypes combining the alcoholism definitions and the latent variables, defining an individual as affected if that individual is alcoholic under one of the definitions (either DSM-III-R or DSM-IV), and indicated having a symptom defined by one of the latent variables. This was done for each of the two alcoholism definitions and five latent variables, selected from a canonical discriminant analyses indicating they formed significant groupings using the electrophysiological variables. We found that linkage analyses utilizing these latent variables were much more robust and consistent than the linkage results based on DSM-III-R or DSM-IV criteria for definition of alcoholism. We also performed linkage analyses on two first principal components derived phenotypes, one derived from the electrophysiological variables, and the other derived from the latent variables. A region on chromosome 2 at 250 cM was found to be linked to both of these derived phenotypes. Further examination of the SNPs in this region identified several haplotypes strongly associated with these derived phenotypes.
Collapse
Affiliation(s)
- Howard W Wiener
- Department of Epidemiology, School of Public Health, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Rodney CP Go
- Department of Epidemiology, School of Public Health, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Hemant Tiwari
- Department of Biostatistics, School of Public Health, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Varghese George
- Department of Biostatistics, Medical College of Georgia, Augusta, GA, USA
| | - Grier P Page
- Department of Biostatistics, School of Public Health, University of Alabama at Birmingham, Birmingham, AL, USA
| |
Collapse
|
11
|
Persico AM, D'Agruma L, Zelante L, Militerni R, Bravaccio C, Schneider C, Melmed R, Trillo S, Montecchi F, Elia M, Palermo M, Rabinowitz D, Pascucci T, Puglisi-Allegra S, Reichelt KL, Muscarella L, Guarnieri V, Melgari JM, Conciatori M, Keller F. Enhanced APOE2 transmission rates in families with autistic probands. Psychiatr Genet 2005; 14:73-82. [PMID: 15167692 DOI: 10.1097/01.ypg.0000128768.37838.17] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
We have previously described linkage/association between reelin gene polymorphisms and autistic disorder. APOE also participates in the Reelin signaling pathway, by competitively antagonizing Reelin binding to APOE receptor 2 and to very-low-density lipoprotein receptors. The APOE2 protein variant displays the lowest receptor binding affinity compared with APOE3 and APOE4. In this study, we assess linkage/association between primary autism and APOE alleles in 223 complete trios, from 119 simplex Italian families and 44 simplex and 29 multiplex Caucasian-American families. Statistically significant disequilibrium favors the transmission of epsilon2 alleles to autistic offspring, over epsilon3 and epsilon4 (allele-wise transmission/disequilibrium test [TDT], chi2 = 6.16, 2 degrees of freedom [d.f.], P<0.05; genotype-wise TDT, chi2 = 10.68, 3 d.f., P<0.05). A novel epsilon3r allele was also discovered in an autistic child and his mother. Autistic patients do not differ significantly from unaffected siblings (allele-wise TDT comparing autistic patients versus unaffected sibs, chi2 = 1.83, 2 d.f., P<0.40, not significant). The major limitation of this study consists of our small sample size of trios including one unaffected sibling, currently not possessing the statistical power necessary to conclusively discriminate a specific association of epsilon2 with autism, from a distorted segregation pattern characterized by enhanced epsilon2 transmission rates both to affected and unaffected offspring. Our findings are thus compatible with either (a) pathogenetic contributions by epsilon2 alleles to autism spectrum vulnerability, requiring additional environmental and/or genetic factors to yield an autistic syndrome, and/or (b) a protective effect of epsilon2 alleles against the enhanced risk of miscarriage and infertility previously described among parents of autistic children.
Collapse
Affiliation(s)
- A M Persico
- Laboratory of Molecular Psychiatry and Neurogenetics, University 'Campus Bio-Medico', Rome, Italy.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Van Steen K, Lange C. PBAT: a comprehensive software package for genome-wide association analysis of complex family-based studies. Hum Genomics 2005; 2:67-9. [PMID: 15814068 PMCID: PMC3525120 DOI: 10.1186/1479-7364-2-1-67] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2004] [Accepted: 12/07/2004] [Indexed: 11/10/2022] Open
Abstract
The PBAT software package (v2.5) provides a unique set of tools for complex family-based association analysis at a genome-wide level. PBAT can handle nuclear families with missing parental genotypes, extended pedigrees with missing genotypic information, analysis of single nucleotide polymorphisms (SNPs), haplotype analysis, quantitative traits, multivariate/longitudinal data and time to onset phenotypes. The data analysis can be adjusted for covariates and gene/environment interactions. Haplotype-based features include sliding windows and the reconstruction of the haplotypes of the probands. PBAT's screening tools allow the user successfully to handle the multiple comparisons problem at a genome-wide level, even for 100,000 SNPs and more. Moreover, PBAT is computationally fast. A genome scan of 300,000 SNPs in 2,000 trios takes 4 central processing unit (CPU)-days. PBAT is available for Linux, Sun Solaris and Windows XP.
Collapse
Affiliation(s)
- Kristel Van Steen
- Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA
| | - Christoph Lange
- Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA
- Harvard Medical School, Channing Laboratory, Boston, MA 02115, USA
| |
Collapse
|
13
|
Chapman JM, Cooper JD, Todd JA, Clayton DG. Detecting disease associations due to linkage disequilibrium using haplotype tags: a class of tests and the determinants of statistical power. Hum Hered 2004; 56:18-31. [PMID: 14614235 DOI: 10.1159/000073729] [Citation(s) in RCA: 350] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2003] [Accepted: 08/04/2003] [Indexed: 11/19/2022] Open
Abstract
In the 'indirect' method of detecting genetic associations between a trait and a DNA variant, we type several markers in a gene or chromosome region of linkage disequilibrium. If there is association between markers and the trait, we presume the existence of one or more causal polymorphisms in the region. In order to obtain a sufficiently dense set of markers it will almost always be necessary to use single nucleotide polymorphisms (SNPs). Although there is an emerging literature on methods for choosing an optimal set of 'haplotype tag SNPs' (htSNPs) to detect association between a genetic region and a trait, less attention has been given to the problem of how such studies should be analysed when completed, and how the initial data which was used to select the htSNPs should be incorporated into the analysis. This paper discusses this problem for both population- and family-based association studies. The role of the R2 measure of association between a causal locus and various methods of scoring of marker haplotypes is highlighted. In most cases, the simplest method of scoring (locus coding), which does not require phase resolution, is shown generally to be more powerful than scoring methods that include haplotype information. A new 'multi-locus TDT' is also proposed.
Collapse
Affiliation(s)
- Juliet M Chapman
- JDRF/WT Diabetes and Inflammation Laboratory, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, UK
| | | | | | | |
Collapse
|
14
|
Gauderman WJ. Candidate gene association analysis for a quantitative trait, using parent-offspring trios. Genet Epidemiol 2004; 25:327-38. [PMID: 14639702 DOI: 10.1002/gepi.10262] [Citation(s) in RCA: 112] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
With the increasing availability of genetic data, many studies of quantitative traits focus on hypotheses related to candidate genes, and also gene-environment (G x E) and gene-gene (G x G) interactions. In a population-based sample, estimates and tests of candidate gene effects can be biased by ethnic confounding, also known as population stratification bias. This paper demonstrates that even a modest degree of ethnic confounding can lead to unacceptably high type I error rates for tests of genetic effects. The parent-offspring trio design is reviewed, and several forms of the quantitative transmission disequilibrium test (QTDT) are summarized. A variation of the QTDT (QTDTM) is described that is based on a linear regression model with multiple intercepts, one per parental mating type. This and other models are expanded to allow testing of G x E and G x G interactions. A method for computing required sample sizes using direct computations is described. Sample size requirements for tests of genetic main effects and G x E and G x G interactions are compared across various QTDT approaches to infer their efficiencies relative to one another. The QTDTM is found to meet or exceed the efficiency of other QTDT approaches. For example, the QTDTM is approximately 3% more efficient than the QTDT of Rabinowitz ([1997] Hum. Hered. 47:342-350) for testing a genetic main effect, but can be as much as twice as efficient for testing G x E interaction, and three times more efficient for testing G x G interaction.
Collapse
Affiliation(s)
- W James Gauderman
- Department of Preventive Medicine, University of Southern California, Los Angeles, CA 90089, USA.
| |
Collapse
|
15
|
Abstract
There is currently a broad effort to produce genome-wide high-density linkage disequilibrium (LD) maps with single nucleotide polymorphisms. The hope is that the resulting maps can be exploited to find genes that affect the onset and severity of at least some common human diseases. These maps may also be useful for identifying genes that affect drug response or the likelihood of drug toxicities. The goal of this review is to provide a broad overview of some of the key concerns motivating the design of a major international project called the International Haplotype Map Project. The process of map production requires the identification of very large numbers of polymorphic sites, implementation of facile, highly accurate and inexpensive genotyping production pipelines, and provision for public access to the genotype data. Great progress has been made recently in genotyping methods and these advances are allowing very large-scale data collection. A major goal of these efforts is to enable the selection of subsets of markers that capture useful genetic information in short genomic intervals, while optimally reducing the number of markers that must be genotyped. Standard measures of LD provide a starting point but may not fully capture the complexity of the information inherent in the data. Extremely dense genotype data in several broadly representative populations (European, Chinese, Japanese, and Yoruba) should yield important insights into the genetic structure of most genes. Further study is required to determine how broadly applicable the data will be to other population groups. Significant challenges lie ahead in determining the best methods for the selection of markers in disease/phenotype studies, large-scale genotyping, and analysis of the resulting genetic data.
Collapse
Affiliation(s)
- John W Belmont
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | | |
Collapse
|
16
|
Persico AM, Pascucci T, Puglisi-Allegra S, Militerni R, Bravaccio C, Schneider C, Melmed R, Trillo S, Montecchi F, Palermo M, Rabinowitz D, Reichelt KL, Conciatori M, Marino R, Keller F. Serotonin transporter gene promoter variants do not explain the hyperserotoninemia in autistic children. Mol Psychiatry 2003; 7:795-800. [PMID: 12192626 DOI: 10.1038/sj.mp.4001069] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/04/2001] [Accepted: 11/20/2001] [Indexed: 11/09/2022]
Abstract
Autism is a biologically-heterogeneous disease. Distinct subgroups of autistic patients may be marked by intermediate phenotypes, such as elevated serotonin (5-HT) blood levels, potentially associated with different underlying disease mechanisms. This could lead to inconsistent genetic association results, such as those of prior studies on serotonin transporter (5-HTT) gene promoter variants and autistic disorder. Contributions of 5-HTT gene promoter alleles to 5-HT blood levels were thus investigated in 134 autistic patients and 291 first-degree relatives. Mean 5-HT blood levels are 11% higher in autistic patients carrying the L/L genotype, compared to patients with the S/S or S/L genotype; this trend is not observed in first-degree relatives. The probability of inheriting L or S alleles is significantly enhanced in patients with 5-HT blood levels above or below the mean, respectively (P < 0.05), but quantitative TDT analyses yield a non-significant trend (P = 0.10), as this polymorphism explains only 2.5% of the variance in 5-HT blood levels of autistic patients. In conclusion, 5-HTT gene promoter variants seemingly exert a small effect on 5-HT blood levels in autistic children, which largely does not account for hyperserotoninemia. Nonetheless, the inconsistent outcome of prior association studies could partly stem from a selection bias of hyper- or hypo-serotoninemic probands.
Collapse
Affiliation(s)
- A M Persico
- Laboratory of Neuroscience, Interdisciplinary Center for Biomedical Research (CIR), Università Campus Bio-Medico, Via Longoni 83, I-00155 Rome, Italy
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Abstract
Over the past decade, attention has turned from positional cloning of Mendelian disease genes to the dissection of complex diseases. Both theoretical and empirical studies have shown that traditional linkage studies may be inferior in power compared to studies that directly utilize allele status. Case-control association studies, as an alternative, are subject to bias due to population stratification. As a compromise between linkage studies and case-control studies, family-based association designs have received great attention recently due to their potentially higher power to identify complex disease genes and their robustness in the presence of population substructure. In this review, we first describe the basic family-based association design involving one affected offspring with its two parents, all genotyped for a biallelic genetic marker. Extensions of the original transmission disequilibrium tests to multiallelic markers, families with multiple siblings, families with incomplete parental genotypes, and general pedigree structures are discussed. Further developments of statistical methods to study quantitative traits, to analyse genes on the X chromosome, to incorporate multiple tightly linked markers, to identify imprinting genes, and to detect gene-environment interactions are also reviewed. Finally, we discuss the implications of the completion of the Human Genome Project and the identification of hundreds of thousands of genetic polymorphisms on employing family-based association designs to search for complex disease genes.
Collapse
Affiliation(s)
- H Zhao
- Yale University School of Medicine, New Haven, Connecticut 06520, USA.
| |
Collapse
|
18
|
Isasi CR, Shea S, Deckelbaum RJ, Couch SC, Starc TJ, Otvos JD, Berglund L. Apolipoprotein epsilon2 allele is associated with an anti-atherogenic lipoprotein profile in children: The Columbia University BioMarkers Study. Pediatrics 2000; 106:568-75. [PMID: 10969104 DOI: 10.1542/peds.106.3.568] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
OBJECTIVE We examined associations between allelic variation in the apo epsilon gene, which codes for apolipoprotein E, and plasma lipid levels in children. MATERIALS AND METHODS We analyzed genotype and fasting lipid levels, including lipid particle size by nuclear magnetic resonance spectroscopy, in 515 children from 297 families. RESULTS Children carrying the apo epsilon2 allele (1 or 2 epsilon2 alleles; n = 45) had higher mean high-density lipoprotein (HDL) cholesterol level (49.5 +/- 13.0 vs 42.4 +/- 8.9 mg/dL) and lower mean low-density lipoprotein (LDL) cholesterol level (82.2 +/- 48.6 vs 105.9 +/- 45.0 mg/dL) compared with apo epsilon3/epsilon3 children (n = 322). Mean HDL size was larger and mean level of the atheroprotective large HDL subpopulation was higher among apo epsilon2 carriers compared with epsilon3/epsilon3 children (9.5 +/- 0.4 vs 9.3 +/-.4 nm, and 32.8 +/- 9.9 vs 27.6 +/- 8.2 mg/dL). In multivariate models adjusting for age, sex, ethnicity, family history, body mass index, and fasting triglyceride level, the apo epsilon2 allele was independently predictive of higher levels of HDL cholesterol and the large HDL subpopulation and of lower level of LDL cholesterol. CONCLUSION The apo epsilon2 allele is associated with an anti-atherogenic lipid pattern in children.apolipoprotein epsilon, children, cholesterol.
Collapse
Affiliation(s)
- C R Isasi
- Department of Medicine, Columbia University, New York, New York 10032, USA
| | | | | | | | | | | | | |
Collapse
|