1
|
Herzig AF, Nutile T, Babron MC, Ciullo M, Bellenguez C, Leutenegger AL. Strategies for phasing and imputation in a population isolate. Genet Epidemiol 2018; 42:201-213. [PMID: 29319195 DOI: 10.1002/gepi.22109] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2017] [Revised: 11/16/2017] [Accepted: 11/16/2017] [Indexed: 11/05/2022]
Abstract
In the search for genetic associations with complex traits, population isolates offer the advantage of reduced genetic and environmental heterogeneity. In addition, cost-efficient next-generation association approaches have been proposed in these populations where only a subsample of representative individuals is sequenced and then genotypes are imputed into the rest of the population. Gene mapping in such populations thus requires high-quality genetic imputation and preliminary phasing. To identify an effective study design, we compare by simulation a range of phasing and imputation software and strategies. We simulated 1,115,604 variants on chromosome 10 for 477 members of the large complex pedigree of Campora, a village within the established isolate of Cilento in southern Italy. We assessed the phasing performance of identical by descent based software ALPHAPHASE and SLRP, LD-based software SHAPEIT2, SHAPEIT3, and BEAGLE, and new software EAGLE that combines both methodologies. For imputation we compared IMPUTE2, IMPUTE4, MINIMAC3, BEAGLE, and new software PBWT. Genotyping errors and missing genotypes were simulated to observe their effects on the performance of each software. Highly accurate phased data were achieved by all software with SHAPEIT2, SHAPEIT3, and EAGLE2 providing the most accurate results. MINIMAC3, IMPUTE4, and IMPUTE2 all performed strongly as imputation software and our study highlights the considerable gain in imputation accuracy provided by a genome sequenced reference panel specific to the population isolate.
Collapse
Affiliation(s)
- Anthony Francis Herzig
- Université Paris-Diderot, Sorbonne Paris Cité, U946, Paris, France.,Inserm, U946, Genetic Variation and Human Diseases, Paris, France
| | - Teresa Nutile
- Institute of Genetics and Biophysics A. Buzzati-Traverso-CNR, Naples, Italy
| | - Marie-Claude Babron
- Université Paris-Diderot, Sorbonne Paris Cité, U946, Paris, France.,Inserm, U946, Genetic Variation and Human Diseases, Paris, France
| | - Marina Ciullo
- Institute of Genetics and Biophysics A. Buzzati-Traverso-CNR, Naples, Italy.,IRCCS Neuromed, Pozzilli, Isernia, Italy
| | - Céline Bellenguez
- Inserm, U1167, RID-AGE-Risk Factors and Molecular Determinants of Aging-Related Diseases, Lille, France.,Institut Pasteur de Lille, Lille, France.,Université de Lille, U1167-Excellence Laboratory LabEx DISTALZ, Lille, France
| | - Anne-Louise Leutenegger
- Université Paris-Diderot, Sorbonne Paris Cité, U946, Paris, France.,Inserm, U946, Genetic Variation and Human Diseases, Paris, France
| |
Collapse
|
2
|
Consangunity and psychosis in Algeria. A family study. Eur Psychiatry 2015. [DOI: 10.1016/j.eurpsy.2015.09.373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Several studies have affirmed the existence of a strong and complex genetic component in the determination of psychotic disorders. However, the genetic architecture of these disorders remains poorly understood. GWAS studies conducted over the past decade have identified some associations to low effect, and the major part of this heritability remains unexplained, thus calling into question the hypothesis of “common disease – common variant” for model involving a large number of rare variants. Family studies of extended pedigrees selected from geographical isolate can be a powerful approach in identifying rare genetic variants of complex diseases such as psychotic disorders. Here, we studied four multigenerational families in which co-exist psychotic and mood disorders and a high rate of consanguinity, identified in the northwest of Algeria. This case-control study aimed to characterize new rare genetic variants responsible for psychosis. These families have received complete clinical and genealogical investigations, genome wide analysis that were performed in the laboratory of medical genetics in the university hospital of Geneva. A genome wide research CNVs using Agilent Human Genome CGH Microarray Kit 44 K, covering 45 subjects including 20 patients and in a control population of 55 individuals. Three CNVs that had never been reported to date have been identified in one of four families and validated by two techniques. It is the dup 4q26, and 16q23.1 del del21q21. These CNVs are transmitted by either parent line, suggesting a cumulative effect on the risk of psychotic disorders. Further analyzes using pan-genomic linkage analysis using GWAS chip (Illumina Human 660 W-Quad v1.0 Breadchip) and complete WES (by GAIIx Illumina/HiSeq 2000) were performed in some related individuals to search other mutations may explain the appearance of the phenotype in this population.
Collapse
|
3
|
Gauvin H, Lefebvre JF, Moreau C, Lavoie EM, Labuda D, Vézina H, Roy-Gagnon MH. GENLIB: an R package for the analysis of genealogical data. BMC Bioinformatics 2015; 16:160. [PMID: 25971991 PMCID: PMC4431039 DOI: 10.1186/s12859-015-0581-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2014] [Accepted: 04/22/2015] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Founder populations have an important role in the study of genetic diseases. Access to detailed genealogical records is often one of their advantages. These genealogical data provide unique information for researchers in evolutionary and population genetics, demography and genetic epidemiology. However, analyzing large genealogical datasets requires specialized methods and software. The GENLIB software was developed to study the large genealogies of the French Canadian population of Quebec, Canada. These genealogies are accessible through the BALSAC database, which contains over 3 million records covering the whole province of Quebec over four centuries. Using this resource, extended pedigrees of up to 17 generations can be constructed from a sample of present-day individuals. RESULTS We have extended and implemented GENLIB as a package in the R environment for statistical computing and graphics, thus allowing optimal flexibility for users. The GENLIB package includes basic functions to manage genealogical data allowing, for example, extraction of a part of a genealogy or selection of specific individuals. There are also many functions providing information to describe the size and complexity of genealogies as well as functions to compute standard measures such as kinship, inbreeding and genetic contribution. GENLIB also includes functions for gene-dropping simulations. The goal of this paper is to present the full functionalities of GENLIB. We used a sample of 140 individuals from the province of Quebec (Canada) to demonstrate GENLIB's functions. Ascending genealogies for these individuals were reconstructed using BALSAC, yielding a large pedigree of 41,523 individuals. Using GENLIB's functions, we provide a detailed description of these genealogical data in terms of completeness, genetic contribution of founders, relatedness, inbreeding and the overall complexity of the genealogical tree. We also present gene-dropping simulations based on the whole genealogy to investigate identical-by-descent sharing of alleles and chromosomal segments of different lengths and estimate probabilities of identical-by-descent sharing. CONCLUSIONS The R package GENLIB provides a user friendly and flexible environment to analyze extensive genealogical data, allowing an efficient and easy integration of different types of data, analytical methods and additional developments and making this tool ideal for genealogical analysis.
Collapse
Affiliation(s)
- Héloïse Gauvin
- Département de médecine sociale et préventive, Université de Montréal, Montréal, Québec, Canada.
- Centre de recherche, Centre hospitalier universitaire Sainte-Justine, Montréal, Québec, Canada.
| | - Jean-François Lefebvre
- Centre de recherche, Centre hospitalier universitaire Sainte-Justine, Montréal, Québec, Canada.
| | - Claudia Moreau
- Centre de recherche, Centre hospitalier universitaire Sainte-Justine, Montréal, Québec, Canada.
| | - Eve-Marie Lavoie
- BALSAC Project, Université du Québec à Chicoutimi, Chicoutimi, Québec, Canada.
| | - Damian Labuda
- Centre de recherche, Centre hospitalier universitaire Sainte-Justine, Montréal, Québec, Canada.
- Département de pédiatrie, Université de Montréal, Montréal, Québec, Canada.
| | - Hélène Vézina
- BALSAC Project, Université du Québec à Chicoutimi, Chicoutimi, Québec, Canada.
| | - Marie-Hélène Roy-Gagnon
- Centre de recherche, Centre hospitalier universitaire Sainte-Justine, Montréal, Québec, Canada.
- School of Epidemiology, Public Health and Preventive Medicine, Faculty of Medicine, University of Ottawa, 600 Peter Morand Cres, Room 101E, Ottawa, ON, K1G 5Z3, Canada.
| |
Collapse
|
4
|
Abney M. Permutation testing in the presence of polygenic variation. Genet Epidemiol 2015; 39:249-58. [PMID: 25758362 DOI: 10.1002/gepi.21893] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2014] [Revised: 01/09/2015] [Accepted: 01/26/2015] [Indexed: 01/08/2023]
Abstract
This article discusses problems with and solutions to performing valid permutation tests for quantitative trait loci in the presence of polygenic effects. Although permutation testing is a popular approach for determining statistical significance of a test statistic with an unknown distribution--for instance, the maximum of multiple correlated statistics or some omnibus test statistic for a gene, gene-set, or pathway--naive application of permutations may result in an invalid test. The risk of performing an invalid permutation test is particularly acute in complex trait mapping where polygenicity may combine with a structured population resulting from the presence of families, cryptic relatedness, admixture, or population stratification. I give both analytical derivations and a conceptual understanding of why typical permutation procedures fail and suggest an alternative permutation-based algorithm, MVNpermute, that succeeds. In particular, I examine the case where a linear mixed model is used to analyze a quantitative trait and show that both phenotype and genotype permutations may result in an invalid permutation test. I provide a formula that predicts the amount of inflation of the type 1 error rate depending on the degree of misspecification of the covariance structure of the polygenic effect and the heritability of the trait. I validate this formula by doing simulations, showing that the permutation distribution matches the theoretical expectation, and that my suggested permutation-based test obtains the correct null distribution. Finally, I discuss situations where naive permutations of the phenotype or genotype are valid and the applicability of the results to other test statistics.
Collapse
Affiliation(s)
- Mark Abney
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
| |
Collapse
|
5
|
Kazma R, Bailey JN. Population-based and family-based designs to analyze rare variants in complex diseases. Genet Epidemiol 2012; 35 Suppl 1:S41-7. [PMID: 22128057 DOI: 10.1002/gepi.20648] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Genotyping of rare variants on a large scale is now possible using next-generation sequencing. Sample selection is a crucial step in designing the genetic study of a complex disease, and knowledge of the efficiency and limitations of population-based and family-based designs can help researchers make the appropriate choice. The nine contributions to Group 5 of Genetic Analysis Workshop 17 evaluate population-based and family-based designs by comparing the results obtained with various methods applied to the mini-exome simulations. These simulations consisted of 200 replicates composed of unrelated individuals and eight extended pedigrees with genotypes and various phenotypes. The methods tested for association with a population-based and/or a family-based design, tested for linkage with a family-based design, or estimated heritability. We summarize the strengths and weaknesses of both designs. Although population-based designs seem more suitable for detecting the effect of multiple rare variants, family-based designs can potentially enrich the sample in rare variants, for which the effect would be concealed at the population level. However, as of today, the main limitation is still the high cost of next-generation sequencing.
Collapse
Affiliation(s)
- Rémi Kazma
- Department of Epidemiology and Biostatistics and Institute for Human Genetics, University of California, San Francisco, CA 94143-3110, USA.
| | | |
Collapse
|
6
|
Kenny EE, Kim M, Gusev A, Lowe JK, Salit J, Smith JG, Kovvali S, Kang HM, Newton-Cheh C, Daly MJ, Stoffel M, Altshuler DM, Friedman JM, Eskin E, Breslow JL, Pe'er I. Increased power of mixed models facilitates association mapping of 10 loci for metabolic traits in an isolated population. Hum Mol Genet 2010; 20:827-39. [PMID: 21118897 DOI: 10.1093/hmg/ddq510] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The potential benefits of using population isolates in genetic mapping, such as reduced genetic, phenotypic and environmental heterogeneity, are offset by the challenges posed by the large amounts of direct and cryptic relatedness in these populations confounding basic assumptions of independence. We have evaluated four representative specialized methods for association testing in the presence of relatedness; (i) within-family (ii) within- and between-family and (iii) mixed-models methods, using simulated traits for 2906 subjects with known genome-wide genotype data from an extremely isolated population, the Island of Kosrae, Federated States of Micronesia. We report that mixed models optimally extract association information from such samples, demonstrating 88% power to rank the true variant as among the top 10 genome-wide with 56% achieving genome-wide significance, a >80% improvement over the other methods, and demonstrate that population isolates have similar power to non-isolate populations for observing variants of known effects. We then used the mixed-model method to reanalyze data for 17 published phenotypes relating to metabolic traits and electrocardiographic measures, along with another 8 previously unreported. We replicate nine genome-wide significant associations with known loci of plasma cholesterol, high-density lipoprotein, low-density lipoprotein, triglycerides, thyroid stimulating hormone, homocysteine, C-reactive protein and uric acid, with only one detected in the previous analysis of the same traits. Further, we leveraged shared identity-by-descent genetic segments in the region of the uric acid locus to fine-map the signal, refining the known locus by a factor of 4. Finally, we report a novel associations for height (rs17629022, P< 2.1 × 10⁻⁸).
Collapse
Affiliation(s)
- Eimear E Kenny
- Department of Computer Science, Columbia University, 505 Computer Science Building, 1214 Amsterdam Ave.: Mailcode 0401, New York, NY 10027-7003, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Colonna V, Nutile T, Ferrucci RR, Fardella G, Aversano M, Barbujani G, Ciullo M. Comparing population structure as inferred from genealogical versus genetic information. Eur J Hum Genet 2009; 17:1635-41. [PMID: 19550436 PMCID: PMC2987018 DOI: 10.1038/ejhg.2009.97] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2009] [Revised: 05/06/2009] [Accepted: 05/07/2009] [Indexed: 11/09/2022] Open
Abstract
Algorithms for inferring population structure from genetic data (ie, population assignment methods) have shown to effectively recognize genetic clusters in human populations. However, their performance in identifying groups of genealogically related individuals, especially in scanty-differentiated populations, has not been tested empirically thus far. For this study, we had access to both genealogical and genetic data from two closely related, isolated villages in southern Italy. We found that nearly all living individuals were included in a single pedigree, with multiple inbreeding loops. Despite F(st) between villages being a low 0.008, genetic clustering analysis identified two clusters roughly corresponding to the two villages. Average kinship between individuals (estimated from genealogies) increased at increasing values of group membership (estimated from the genetic data), showing that the observed genetic clusters represent individuals who are more closely related to each other than to random members of the population. Further, average kinship within clusters and F(st) between clusters increases with increasingly stringent membership threshold requirements. We conclude that a limited number of genetic markers is sufficient to detect structuring, and that the results of genetic analyses faithfully mirror the structuring inferred from detailed analyses of population genealogies, even when F(st) values are low, as in the case of the two villages. We then estimate the impact of observed levels of population structure on association studies using simulated data.
Collapse
Affiliation(s)
- Vincenza Colonna
- Dipartimento di Biologia ed Evoluzione, Università di Ferrara, Ferrara, Italy
- Istituto di Genetica e Biofisica ‘A. Buzzati-Traverso', National Research Council (CNR), Napoli, Italy
| | - Teresa Nutile
- Istituto di Genetica e Biofisica ‘A. Buzzati-Traverso', National Research Council (CNR), Napoli, Italy
| | - Ronald R Ferrucci
- Dipartimento di Biologia ed Evoluzione, Università di Ferrara, Ferrara, Italy
| | - Giulio Fardella
- Istituto di Genetica e Biofisica ‘A. Buzzati-Traverso', National Research Council (CNR), Napoli, Italy
| | - Mario Aversano
- Istituto di Genetica e Biofisica ‘A. Buzzati-Traverso', National Research Council (CNR), Napoli, Italy
| | - Guido Barbujani
- Dipartimento di Biologia ed Evoluzione, Università di Ferrara, Ferrara, Italy
| | - Marina Ciullo
- Istituto di Genetica e Biofisica ‘A. Buzzati-Traverso', National Research Council (CNR), Napoli, Italy
| |
Collapse
|
8
|
Abstract
Genome-wide association studies result in inflated false-positive results when unrecognized cryptic relatedness exists. A number of methods have been proposed for testing association between markers and disease with a correction for known pedigree-based relationships. However, in most case-control studies, relationships are generally unknown, yet the design is predicated on the assumption of at least ancestral relatedness among cases. Here, we focus on adjusting cryptic relatedness when the genealogy of the sample is unknown, particularly in the context of samples from isolated populations where cryptic relatedness may be problematic. We estimate cryptic relatedness using maximum-likelihood methods and use a corrected chi(2) test with estimated kinship coefficients for testing in the context of unknown cryptic relatedness. Estimated kinship coefficients characterize precisely the relatedness between truly related people, but are biased for unrelated pairs. The proposed test substantially reduces spurious positive results, producing a uniform null distribution of P-values. Especially with missing pedigree information, estimated kinship coefficients can still be used to correct non-independence among individuals. The corrected test was applied to real data sets from genetic isolates and created a distribution of P-value that was close to uniform. Thus, the proposed test corrects the non-uniform distribution of P-values obtained with the uncorrected test and illustrates the advantage of the approach on real data.
Collapse
Affiliation(s)
- Yoonha Choi
- Department of Biostatistics, University of Washington, Seattle, WA, 98195, USA
| | - Ellen M. Wijsman
- Department of Biostatistics, University of Washington, Seattle, WA, 98195, USA
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA, 98195, USA
| | - Bruce S. Weir
- Department of Biostatistics, University of Washington, Seattle, WA, 98195, USA
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
| |
Collapse
|
9
|
Sieh W, Choi Y, Chapman NH, Craig UK, Steinbart EJ, Rothstein JH, Oyanagi K, Garruto RM, Bird TD, Galasko DR, Schellenberg GD, Wijsman EM. Identification of novel susceptibility loci for Guam neurodegenerative disease: challenges of genome scans in genetic isolates. Hum Mol Genet 2009; 18:3725-38. [PMID: 19567404 PMCID: PMC2742398 DOI: 10.1093/hmg/ddp300] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2009] [Revised: 06/17/2009] [Accepted: 06/25/2009] [Indexed: 12/17/2022] Open
Abstract
Amyotrophic lateral sclerosis/parkinsonism-dementia complex (ALS/PDC) is a fatal neurodegenerative disease found in the Chamorro people of Guam and other Pacific Island populations. The etiology is unknown, although both genetic and environmental factors appear important. To identify loci for ALS/PDC, we conducted both genome-wide linkage and association analyses, using approximately 400 microsatellite markers, in the largest sample assembled to date, comprising a nearly complete sample of all living and previously sampled deceased cases. A single, large, complex pedigree was ascertained from a village on Guam, with smaller families and a case-control sample ascertained from the rest of Guam by population-based neurological screening and archival review. We found significant evidence for two regions with novel ALS/PDC loci on chromosome 12 and supportive evidence for the involvement of the MAPT region on chromosome 17. D12S1617 on 12p gave the strongest evidence of linkage (maximum LOD score, Z(max) = 4.03) in our initial scan, with additional support in the complete case-control sample in the form of evidence of allelic association at this marker and another nearby marker. D12S79 on 12q also provided significant evidence of linkage (Z(max) = 3.14) with support from flanking markers. Our results suggest that ALS/PDC may be influenced by as many as three loci, while illustrating challenges that are intrinsic in genetic analyses of isolated populations, as well as analytical strategies that are useful in this context. Elucidation of the genetic basis of ALS/PDC should improve our understanding of related neurodegenerative disorders including Alzheimer disease, Parkinson disease, frontotemporal dementia and ALS.
Collapse
Affiliation(s)
- Weiva Sieh
- Division of Medical Genetics, Department of Medicine
- Division of Epidemiology, Department of Health Research and Policy, Stanford University, Stanford, CA 94305, USA
| | | | | | - Ulla-Katrina Craig
- Micronesian Health and Aging Study, University of Guam, Mangilao, Guam 96923, USA
| | - Ellen J. Steinbart
- Department of Neurology
- Geriatric Research Education and Clinical Center, Veterans Affairs Puget Sound Health Care System, Seattle, WA 98108, USA
| | | | - Kiyomitsu Oyanagi
- Department of Neuropathology, Tokyo Metropolitan Institute for Neuroscience, Tokyo, Japan
| | - Ralph M. Garruto
- Laboratory of Biomedical Anthropology and Neurosciences, Department of Anthropology, Binghamton University, Binghamton, NY 13902, USA
| | - Thomas D. Bird
- Division of Medical Genetics, Department of Medicine
- Department of Neurology
- Geriatric Research Education and Clinical Center, Veterans Affairs Puget Sound Health Care System, Seattle, WA 98108, USA
| | - Douglas R. Galasko
- Department of Neurosciences, University of California, San Diego, La Jolla, CA 92093, USA and
| | - Gerard D. Schellenberg
- Department of Neurology
- Division of Gerontology and Geriatric Medicine, Department of Medicine
- Department of Pharmacology and
- Geriatric Research Education and Clinical Center, Veterans Affairs Puget Sound Health Care System, Seattle, WA 98108, USA
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Ellen M. Wijsman
- Division of Medical Genetics, Department of Medicine
- Department of Biostatistics
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
10
|
Genome-wide association studies in an isolated founder population from the Pacific Island of Kosrae. PLoS Genet 2009; 5:e1000365. [PMID: 19197348 PMCID: PMC2628735 DOI: 10.1371/journal.pgen.1000365] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2008] [Accepted: 01/05/2009] [Indexed: 12/15/2022] Open
Abstract
It has been argued that the limited genetic diversity and reduced allelic heterogeneity observed in isolated founder populations facilitates discovery of loci contributing to both Mendelian and complex disease. A strong founder effect, severe isolation, and substantial inbreeding have dramatically reduced genetic diversity in natives from the island of Kosrae, Federated States of Micronesia, who exhibit a high prevalence of obesity and other metabolic disorders. We hypothesized that genetic drift and possibly natural selection on Kosrae might have increased the frequency of previously rare genetic variants with relatively large effects, making these alleles readily detectable in genome-wide association analysis. However, mapping in large, inbred cohorts introduces analytic challenges, as extensive relatedness between subjects violates the assumptions of independence upon which traditional association test statistics are based. We performed genome-wide association analysis for 15 quantitative traits in 2,906 members of the Kosrae population, using novel approaches to manage the extreme relatedness in the sample. As positive controls, we observe association to known loci for plasma cholesterol, triglycerides, and C-reactive protein and to a compelling candidate loci for thyroid stimulating hormone and fasting plasma glucose. We show that our study is well powered to detect common alleles explaining >/=5% phenotypic variance. However, no such large effects were observed with genome-wide significance, arguing that even in such a severely inbred population, common alleles typically have modest effects. Finally, we show that a majority of common variants discovered in Caucasians have indistinguishable effect sizes on Kosrae, despite the major differences in population genetics and environment.
Collapse
|
11
|
Kristiansson K, Naukkarinen J, Peltonen L. Isolated populations and complex disease gene identification. Genome Biol 2008; 9:109. [PMID: 18771588 PMCID: PMC2575505 DOI: 10.1186/gb-2008-9-8-109] [Citation(s) in RCA: 110] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Isolated populations can be useful for the identification of genes underlying common complex diseases. The utility of genetically isolated populations (population isolates) in the mapping and identification of genes is not only limited to the study of rare diseases; isolated populations also provide a useful resource for studies aimed at improved understanding of the biology underlying common diseases and their component traits. Well characterized human populations provide excellent study samples for many different genetic investigations, ranging from genome-wide association studies to the characterization of interactions between genes and the environment.
Collapse
Affiliation(s)
- Kati Kristiansson
- National Public Health Institute and FIMM, Institute for Molecular Medicine Finland, Helsinki 00300, Finland
| | | | | |
Collapse
|
12
|
Identity-by-descent estimation and mapping of qualitative traits in large, complex pedigrees. Genetics 2008; 179:1577-90. [PMID: 18622032 DOI: 10.1534/genetics.108.089912] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
Computing identity-by-descent sharing between individuals connected through a large, complex pedigree is a computationally demanding task that often cannot be done using exact methods. What I present here is a rapid computational method for estimating, in large complex pedigrees, the probability that pairs of alleles are IBD given the single-point genotype data at that marker for all individuals. The method can be used on pedigrees of essentially arbitrary size and complexity without the need to divide the individuals into separate subpedigrees. I apply the method to do qualitative trait linkage mapping using the nonparametric sharing statistic S(pairs). The validity of the method is demonstrated via simulation studies on a 13-generation 3028-person pedigree with 700 genotyped individuals. An analysis of an asthma data set of individuals in this pedigree finds four loci with P-values <10(-3) that were not detected in prior analyses. The mapping method is fast and can complete analyses of approximately 150 affected individuals within this pedigree for thousands of markers in a matter of hours.
Collapse
|
13
|
Xing J, Witherspoon DJ, Watkins WS, Zhang Y, Tolpinrud W, Jorde LB. HapMap tagSNP transferability in multiple populations: general guidelines. Genomics 2008; 92:41-51. [PMID: 18482828 PMCID: PMC2471876 DOI: 10.1016/j.ygeno.2008.03.011] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2008] [Revised: 03/26/2008] [Accepted: 03/28/2008] [Indexed: 11/30/2022]
Abstract
Linkage disequilibrium (LD) has received much attention recently because of its value in localizing disease-causing genes. Due to the extensive LD between neighboring loci in the human genome, it is believed that a subset of the single nucleotide polymorphisms in a region (tagSNPs) can be selected to capture most of the remaining SNP variants. In this study, we examined LD patterns and HapMap tagSNP transferability in more than 300 individuals. A South Indian sample and an African Mbuti Pygmy population sample were included to evaluate the performance of HapMap tagSNPs in geographically distinct and genetically isolated populations. Our results show that HapMap tagSNPs selected with r(2) >= 0.8 can capture more than 85% of the SNPs in populations that are from the same continental group. Combined tagSNPs from HapMap CEU and CHB+JPT serve as the best reference for the Indian sample. The HapMap YRI are a sufficient reference for tagSNP selection in the Pygmy sample. In addition to our findings, we reviewed over 25 recent studies of tagSNP transferability and propose a general guideline for selecting tagSNPs from HapMap populations.
Collapse
Affiliation(s)
- Jinchuan Xing
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, UT,84112
| | - David J. Witherspoon
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, UT,84112
| | - W. Scott Watkins
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, UT,84112
| | - Yuhua Zhang
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, UT,84112
| | - Whitney Tolpinrud
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, UT,84112
| | - Lynn B. Jorde
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, UT,84112
| |
Collapse
|
14
|
Nolan DK, Chen P, Das S, Ober C, Waggoner D. Fine mapping of a locus for nonsyndromic mental retardation on chromosome 19p13. Am J Med Genet A 2008; 146A:1414-22. [PMID: 18446860 DOI: 10.1002/ajmg.a.32307] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Mental retardation (MR) occurs in approximately 3% of the population and therefore significantly impacts public health. Despite this relatively high prevalence, the specific causes of MR remain unknown in most cases, although both genetic and environmental factors are known to contribute. We describe a consanguineous family with autosomal recessive (AR) nonsyndromic MR (NSMR). Because the consanguinity of this family is complex, we explore alternative approaches for generating accurate estimates of the evidence for linkage in this family, and demonstrate evidence for linkage to chromosome 19p13 (lod score ranging from 1.2 to 3.5, depending on assumptions of allele frequencies). Fine mapping of the linked region defined a critical region of 3.6 Mb, which overlaps with a previously reported gene (CC2D1A) for MR. However, no mutations in the coding region of this gene are present in the family we describe. These results suggest that another gene causing autosomal recessive nonsyndromic MR (ARNSMR) is located within this genomic region.
Collapse
Affiliation(s)
- D K Nolan
- Committee on Genetics, University of Chicago, Chicago, Illinois 60637, USA
| | | | | | | | | |
Collapse
|
15
|
Aulchenko YS, de Koning DJ, Haley C. Genomewide rapid association using mixed model and regression: a fast and simple method for genomewide pedigree-based quantitative trait loci association analysis. Genetics 2007; 177:577-85. [PMID: 17660554 PMCID: PMC2013682 DOI: 10.1534/genetics.107.075614] [Citation(s) in RCA: 325] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2007] [Accepted: 07/02/2007] [Indexed: 11/18/2022] Open
Abstract
For pedigree-based quantitative trait loci (QTL) association analysis, a range of methods utilizing within-family variation such as transmission-disequilibrium test (TDT)-based methods have been developed. In scenarios where stratification is not a concern, methods exploiting between-family variation in addition to within-family variation, such as the measured genotype (MG) approach, have greater power. Application of MG methods can be computationally demanding (especially for large pedigrees), making genomewide scans practically infeasible. Here we suggest a novel approach for genomewide pedigree-based quantitative trait loci (QTL) association analysis: genomewide rapid association using mixed model and regression (GRAMMAR). The method first obtains residuals adjusted for family effects and subsequently analyzes the association between these residuals and genetic polymorphisms using rapid least-squares methods. At the final step, the selected polymorphisms may be followed up with the full measured genotype (MG) analysis. In a simulation study, we compared type 1 error, power, and operational characteristics of the proposed method with those of MG and TDT-based approaches. For moderately heritable (30%) traits in human pedigrees the power of the GRAMMAR and the MG approaches is similar and is much higher than that of TDT-based approaches. When using tabulated thresholds, the proposed method is less powerful than MG for very high heritabilities and pedigrees including large sibships like those observed in livestock pedigrees. However, there is little or no difference in empirical power of MG and the proposed method. In any scenario, GRAMMAR is much faster than MG and enables rapid analysis of hundreds of thousands of markers.
Collapse
Affiliation(s)
- Yurii S Aulchenko
- Department of Epidemiology and Biostatistics, Erasmus MC, 3000 CA Rotterdam, The Netherlands.
| | | | | |
Collapse
|
16
|
Angius A, Hyland FCL, Persico I, Pirastu N, Woodage T, Pirastu M, De la Vega FM. Patterns of linkage disequilibrium between SNPs in a Sardinian population isolate and the selection of markers for association studies. Hum Hered 2007; 65:9-22. [PMID: 17652959 DOI: 10.1159/000106058] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2006] [Accepted: 04/30/2007] [Indexed: 11/19/2022] Open
Abstract
OBJECTIVE In isolated populations, 'background' linkage disequilibrium (LD) has been shown to extend over large genetic distances. This and their reduced environmental and genetic heterogeneity has stimulated interest in their potential for association mapping. We compared LD unit map distances with pair-wise measurements of LD in a dense single nucleotide polymorphism (SNP) set. METHODS We genotyped 771 SNPs in an 8 Mb segment of chromosome 22 on 101 individuals from the isolated village of Talana, Sardinia, and compared with outbred European populations. RESULTS Heterozygosity was remarkably similar in both populations. In contrast, the extent of LD observed was quite different. The decay of LD with distance is slower in the isolate. The differences in LD map lengths suggest that useful LD extends up to three times farther in the Sardinian population; smaller differences are seen with pairwise LD metrics. While LD map length slightly decreases with average relatedness, cryptic relatedness does not explain the decrease in LD map length. Haplotypes, block boundaries, and patterns of LD are similar in both populations, suggesting a shared distribution of recombination hotspots. CONCLUSIONS About 15% fewer haplotype tagging SNPs need to be genotyped in the isolate, and possibly 70% fewer if selecting SNPs evenly spaced on the metric LD map.
Collapse
|
17
|
Barrett JH, Sheehan NA, Cox A, Worthington J, Cannings C, Teare MD. Family based studies and genetic epidemiology: theory and practice. Hum Hered 2007; 64:146-8. [PMID: 17476114 DOI: 10.1159/000101993] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2007] [Accepted: 02/19/2007] [Indexed: 11/19/2022] Open
Abstract
Family based studies have underpinned many successes in uncovering the causes of monogenic and oligogenic diseases. Now research is focussing on the identification and characterisation of genes underlying common diseases and it is widely accepted that these studies will require large population based samples. Population based family study designs have the potential to facilitate the analysis of the effects of both genes and environment. These types of studies integrate the population based approaches of classic epidemiology and the methods enabling the analysis of correlations between relatives sharing both genes and environment. The extent to which such studies are feasible will depend upon population- and disease-specific factors. To review this topic, a symposium was held to present and discuss the costs, requirements and advantages of population based family study designs. This article summarises the features of the meeting held at The University of Sheffield, August 2006.
Collapse
Affiliation(s)
- J H Barrett
- Genetic Epidemiology Division, Leeds Institute of Molecular Medicine, University of Leeds, Leeds, UK
| | | | | | | | | | | |
Collapse
|
18
|
Colonna V, Nutile T, Astore M, Guardiola O, Antoniol G, Ciullo M, Persico MG. Campora: a young genetic isolate in South Italy. Hum Hered 2007; 64:123-35. [PMID: 17476112 PMCID: PMC2787182 DOI: 10.1159/000101964] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2006] [Accepted: 02/14/2007] [Indexed: 12/21/2022] Open
Abstract
Genetic isolates have been successfully used in the study of complex traits, mainly because due to their features, they allow a reduction in the complexity of the genetic models underlying the trait. The aim of the present study is to describe the population of Campora, a village in the South of Italy, highlighting its properties of a genetic isolate. Both historical evidence and multi-locus genetic data (genomic and mitochondrial DNA polymorphisms) have been taken into account in the analyses. The extension of linkage disequilibrium (LD) regions has been evaluated on autosomes and on a region of the X chromosome. We defined a study sample population on the basis of the genealogy and exogamy data. We found in this population a few different mitochondrial and Y chromosome haplotypes and we ascertained that, similarly to other isolated populations, in Campora LD extends over wider region compared to large and genetically heterogeneous populations. These findings indicate a conspicuous genetic homogeneity in the genome. Finally, we found evidence for a recent population bottleneck that we propose to interpret as a demographic crisis determined by the plague of the 17th century. Overall our findings demonstrate that Campora displays the genetic characteristics of a young isolate.
Collapse
Affiliation(s)
- Vincenza Colonna
- Institute of Genetics and Biophysics A. Buzzati-Traverso, CNR Naples, Naples, Italy.
| | | | | | | | | | | | | |
Collapse
|
19
|
Venken T, Del-Favero J. Chasing genes for mood disorders and schizophrenia in genetically isolated populations. Hum Mutat 2007; 28:1156-70. [PMID: 17659644 DOI: 10.1002/humu.20582] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Major affective disorders and schizophrenia are among the most common brain diseases worldwide and their predisposition is influenced by a complex interaction of genetic and environmental factors. So far, traditional linkage mapping studies for these complex disorders have not achieved the same success as the positional cloning of genes for Mendelian diseases. The struggle to identify susceptibility genes for complex disorders has stimulated the development of alternative approaches, including studies in genetically isolated populations. Since isolated populations are likely to have both a reduced number of genetic vulnerability factors and environmental background and are therefore considered to be more homogeneous compared to outbred populations, the use of isolated populations in genetic studies is expected to improve the chance of finding susceptibility loci and genes. Here we review the role of isolated populations, based on linkage and association studies, in the identification of susceptibility genes for bipolar disorder and schizophrenia.
Collapse
Affiliation(s)
- Tine Venken
- Applied Molecular Genomics Group, Department of Molecular Genetics, VIB, Antwerpen, Belgium
| | | |
Collapse
|
20
|
Ciullo M, Bellenguez C, Colonna V, Nutile T, Calabria A, Pacente R, Iovino G, Trimarco B, Bourgain C, Persico MG. New susceptibility locus for hypertension on chromosome 8q by efficient pedigree-breaking in an Italian isolate. Hum Mol Genet 2006; 15:1735-43. [PMID: 16611673 DOI: 10.1093/hmg/ddl097] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Essential hypertension (EH) affects a large proportion of the adult population in Western countries and is a major risk factor for cardiovascular diseases. EH is a multifactorial disease with a complex genetic component. To tackle the complexity of this genetic component, we have initiated a study of Campora, an isolated village in South Italy. A random sample of 389 adults was genotyped for a very dense microsatellite genome scan and phenotyped for EH. Of this sample, 173 affected individuals were all related through a 2,180-member pedigree and could be integrated within a linkage analysis. The complexity of the pedigree prevented its direct use for a non-parametric linkage (NPL) analysis. Therefore, the method proposed by Falchi et al. [2004, Am. J. Hum. Genet., 75, 1015-1031] was used for automatic pedigree-breaking. We identified a new locus for EH on chromosome 8q22-23 and detected linkage with two known loci for EH: 1q42-43 and 4p16. Simulations showed that the linkage with 8q22-23 is highly genome-wide significant, even when accounting for the breaking of the pedigree. An extension to qualitative traits of another pedigree-breaking approach [Pankratz et al., 2001, Genet. Epidemiol., 21 (Suppl. 1), S258-S263] also detected a significant linkage on 8q22-23 using a remarkably different set of sub-pedigrees and helped to refine the location of the linkage signal. This work both identifies a new locus strongly linked to hypertension and shows that the power of linkage analysis can be improved by the appropriate use of efficient pedigree-breaking strategies.
Collapse
Affiliation(s)
- Marina Ciullo
- Institute of Genetics and Biophysics, A. Buzzati-Traverso, CNR Naples, Italy.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
McCauley JL, Hahs DW, Jiang L, Scott WK, Welsh-Bohmer KA, Jackson CE, Vance JM, Pericak-Vance MA, Haines JL. Combinatorial Mismatch Scan (CMS) for loci associated with dementia in the Amish. BMC MEDICAL GENETICS 2006; 7:19. [PMID: 16515697 PMCID: PMC1448207 DOI: 10.1186/1471-2350-7-19] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/24/2005] [Accepted: 03/03/2006] [Indexed: 11/28/2022]
Abstract
BACKGROUND Population heterogeneity may be a significant confounding factor hampering detection and verification of late onset Alzheimer's disease (LOAD) susceptibility genes. The Amish communities located in Indiana and Ohio are relatively isolated populations that may have increased power to detect disease susceptibility genes. METHODS We recently performed a genome scan of dementia in this population that detected several potential loci. However, analyses of these data are complicated by the highly consanguineous nature of these Amish pedigrees. Therefore we applied the Combinatorial Mismatch Scanning (CMS) method that compares identity by state (IBS) (under the presumption of identity by descent (IBD)) sharing in distantly related individuals from such populations where standard linkage and association analyses are difficult to implement. CMS compares allele sharing between individuals in affected and unaffected groups from founder populations. Comparisons between cases and controls were done using two Fisher's exact tests, one testing for excess in IBS allele frequency and the other testing for excess in IBS genotype frequency for 407 microsatellite markers. RESULTS In all, 13 dementia cases and 14 normal controls were identified who were not related at least through the grandparental generation. The examination of allele frequencies identified 24 markers (6%) nominally (p < or = 0.05) associated with dementia; the most interesting (empiric p < or = 0.005) markers were D3S1262, D5S211, and D19S1165. The examination of genotype frequencies identified 21 markers (5%) nominally (p < or = 0.05) associated with dementia; the most significant markers were both located on chromosome 5 (D5S1480 and D5S211). Notably, one of these markers (D5S211) demonstrated differences (empiric p < or = 0.005) under both tests. CONCLUSION Our results provide the initial groundwork for identifying genes involved in late-onset Alzheimer's disease within the Amish community. Genes identified within this isolated population will likely play a role in a subset of late-onset AD cases across more general populations. Regions highlighted by markers demonstrating suggestive allelic and/or genotypic differences will be the focus of more detailed examination to characterize their involvement in dementia.
Collapse
Affiliation(s)
- Jacob L McCauley
- Center for Human Genetics Research and Department of Molecular Physiology and Biophysics, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Daniel W Hahs
- Center for Human Genetics Research and Department of Molecular Physiology and Biophysics, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Lan Jiang
- Center for Human Genetics Research and Department of Molecular Physiology and Biophysics, Vanderbilt University Medical Center, Nashville, TN, USA
| | - William K Scott
- Center for Human Genetics and Department of Medicine, Duke University Medical Center, Durham, NC, USA
| | - Kathleen A Welsh-Bohmer
- Joseph & Kathleen Bryan ADRC/Division of Neurology, Duke University Medical Center, Durham, NC, USA
| | | | - Jeffery M Vance
- Center for Human Genetics and Department of Medicine, Duke University Medical Center, Durham, NC, USA
| | - Margaret A Pericak-Vance
- Center for Human Genetics and Department of Medicine, Duke University Medical Center, Durham, NC, USA
| | - Jonathan L Haines
- Center for Human Genetics Research and Department of Molecular Physiology and Biophysics, Vanderbilt University Medical Center, Nashville, TN, USA
| |
Collapse
|
22
|
Als TD, Jorgensen TH, Børglum AD, Petersen PA, Mors O, Wang AG. Highly discrepant proportions of female and male Scandinavian and British Isles ancestry within the isolated population of the Faroe Islands. Eur J Hum Genet 2006; 14:497-504. [PMID: 16434998 DOI: 10.1038/sj.ejhg.5201578] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
The Faroe Islands in the North Atlantic Ocean are inhabited by a small population, whose origin is thought to date back to the Viking Age. Historical, archaeological and linguistic evidence indicates that the present population of the Faroe Islands may have a mixture of Scandinavian and British Isles ancestry. In the present study we used 122 new and 19 previously published hypervariable region I sequences of the mitochondrial control region to analyse the genetic diversity of the Faroese population and compare it with other populations in the North Atlantic region. The analyses suggested that the Faroese mtDNA pool has been affected by genetic drift, and is among the most homogenous and isolated in the North Atlantic region. This will have implications for attempts to locate genes for complex disorders. To obtain estimates of Scandinavian vs British Isles ancestry proportions, we applied a frequency-based admixture approach taking private haplotypes into account by the use of phylogenetic information. While previous studies have suggested an excess of Scandinavian ancestry among the male settlers of the Faroe Islands, the current study indicates an excess of British Isles ancestry among the female settlers of the Faroe Islands. Compared to other admixed populations of the North Atlantic region, the population of the Faroe Islands appears to have the highest level of asymmetry in Scandinavian vs British Isles ancestry proportions among female and male settlers of the archipelago.
Collapse
Affiliation(s)
- Thomas D Als
- Centre for Basic Psychiatric Research, Aarhus University Hospital, Skovagervej 2, Risskov DK-8240, Denmark.
| | | | | | | | | | | |
Collapse
|
23
|
Lauritsen MB, Als TD, Dahl HA, Flint TJ, Wang AG, Vang M, Kruse TA, Ewald H, Mors O. A genome-wide search for alleles and haplotypes associated with autism and related pervasive developmental disorders on the Faroe Islands. Mol Psychiatry 2006; 11:37-46. [PMID: 16205737 DOI: 10.1038/sj.mp.4001754] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The involvement of genetic factors in the etiology of autism has been clearly established. We undertook a genome-wide search for regions containing susceptibility genes for autism in 12 subjects with childhood autism and related pervasive developmental disorders (PDDs) and 44 controls from the relatively isolated population of the Faroe Islands. In total, 601 microsatellite markers distributed throughout the human genome with an average distance of 5.80 cM were genotyped, including 502 markers in the initial scan. The Faroese population structure and genetic relatedness of cases and controls were also evaluated. Based on a combined approach, including an assumption-free test as implemented in CLUMP, Fisher's exact test for specific alleles and haplotypes, and IBD(0) probability calculations, we found association between autism and microsatellite markers in regions on 2q, 3p, 6q, 15q, 16p, and 18q. The most significant finding was on 3p25.3 (P(T1)=0.00003 and P(T4)=0.00007), which was also supported by other genetic studies. Furthermore, no evidence of population substructure was found, and a higher degree of relatedness among cases could not be detected, decreasing the risk of inflated P-values. Our data suggest that markers in these regions are in linkage disequilibrium with genes involved in the etiology of autism, and we hypothesize susceptibility genes for autism and related PDDs to be localized within these regions.
Collapse
Affiliation(s)
- M B Lauritsen
- Centre for Basic Psychiatric Research, Psychiatric Hospital in Aarhus, Aarhus University Hospital, Shovagervej 2, DK-8240 Risskov, Denmark.
| | | | | | | | | | | | | | | | | |
Collapse
|