1
|
Elhawary NA, AlJahdali IA, Abumansour IS, Elhawary EN, Gaboon N, Dandini M, Madkhali A, Alosaimi W, Alzahrani A, Aljohani F, Melibary EM, Kensara OA. Genetic etiology and clinical challenges of phenylketonuria. Hum Genomics 2022; 16:22. [PMID: 35854334 PMCID: PMC9295449 DOI: 10.1186/s40246-022-00398-9] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Accepted: 07/08/2022] [Indexed: 02/08/2023] Open
Abstract
This review discusses the epidemiology, pathophysiology, genetic etiology, and management of phenylketonuria (PKU). PKU, an autosomal recessive disease, is an inborn error of phenylalanine (Phe) metabolism caused by pathogenic variants in the phenylalanine hydroxylase (PAH) gene. The prevalence of PKU varies widely among ethnicities and geographic regions, affecting approximately 1 in 24,000 individuals worldwide. Deficiency in the PAH enzyme or, in rare cases, the cofactor tetrahydrobiopterin results in high blood Phe concentrations, causing brain dysfunction. Untreated PKU, also known as PAH deficiency, results in severe and irreversible intellectual disability, epilepsy, behavioral disorders, and clinical features such as acquired microcephaly, seizures, psychological signs, and generalized hypopigmentation of skin (including hair and eyes). Severe phenotypes are classic PKU, and less severe forms of PAH deficiency are moderate PKU, mild PKU, mild hyperphenylalaninaemia (HPA), or benign HPA. Early diagnosis and intervention must start shortly after birth to prevent major cognitive and neurological effects. Dietary treatment, including natural protein restriction and Phe-free supplements, must be used to maintain blood Phe concentrations of 120-360 μmol/L throughout the life span. Additional treatments include the casein glycomacropeptide (GMP), which contains very limited aromatic amino acids and may improve immunological function, and large neutral amino acid (LNAA) supplementation to prevent plasma Phe transport into the brain. The synthetic BH4 analog, sapropterin hydrochloride (i.e., Kuvan®, BioMarin), is another potential treatment that activates residual PAH, thus decreasing Phe concentrations in the blood of PKU patients. Moreover, daily subcutaneous injection of pegylated Phe ammonia-lyase (i.e., pegvaliase; PALYNZIQ®, BioMarin) has promised gene therapy in recent clinical trials, and mRNA approaches are also being studied.
Collapse
Affiliation(s)
- Nasser A. Elhawary
- grid.412832.e0000 0000 9137 6644Department of Medical Genetics, College of Medicine, Umm Al-Qura University, P.O. Box 57543, Mecca, 21955 Saudi Arabia
| | - Imad A. AlJahdali
- grid.412832.e0000 0000 9137 6644Department of Community Medicine, College of Medicine, Umm Al-Qura University, P.O. Box 57543, Mecca, 21955 Saudi Arabia
| | - Iman S. Abumansour
- grid.412832.e0000 0000 9137 6644Department of Medical Genetics, College of Medicine, Umm Al-Qura University, P.O. Box 57543, Mecca, 21955 Saudi Arabia
| | - Ezzeldin N. Elhawary
- grid.123047.30000000103590315Faculty of Medicine, MS Genomic Medicine Program, University of Southampton, Southampton General Hospital, Southampton, UK
| | - Nagwa Gaboon
- grid.7269.a0000 0004 0621 1570Department of Clinical Genetics, Faculty of Medicine, Ain Shams University, Cairo, Egypt
| | - Mohammed Dandini
- Department of Laboratory and Blood Bank, Maternity and Children Hospital, Mecca, Saudi Arabia
| | - Abdulelah Madkhali
- grid.415254.30000 0004 1790 7311Department of Pathology and Laboratory Medicine, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh, Saudi Arabia
| | - Wafaa Alosaimi
- Department of Hematology, Maternity and Children Hospital, Mecca, Saudi Arabia
| | - Abdulmajeed Alzahrani
- Department of Laboratory and Blood Bank at Maternity and Children Hospital, Mecca, Saudi Arabia
| | - Fawzia Aljohani
- Department of Pediatric Clinics, Maternity and Children Hospital, King Salman Medical City, Madinah, Saudi Arabia
| | - Ehab M. Melibary
- grid.412832.e0000 0000 9137 6644Department of Medical Genetics, College of Medicine, Umm Al-Qura University, P.O. Box 57543, Mecca, 21955 Saudi Arabia
| | - Osama A. Kensara
- grid.412832.e0000 0000 9137 6644Department of Clinical Nutrition, Faculty of Applied Medical Sciences, Umm Al-Qura University, Jeddah, Saudi Arabia
- Department of Biochemistry, Batterjee Medical College, Jeddah, Saudi Arabia
| |
Collapse
|
2
|
Alibakhshi R, Moradi K, Ghadiri K. The status of PAH gene-VNTR alleles and mini-haplotypes associations with PAH gene mutations in Iranian Kurdish PKU patients. Med J Islam Repub Iran 2019; 33:88. [PMID: 31696082 PMCID: PMC6825400 DOI: 10.34171/mjiri.33.88] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2017] [Indexed: 11/17/2022] Open
Abstract
Background: The analysis of haplotypes/mini-haplotypes in the PAH gene has been used as an informative tool in several genetic anthropology studies. Considering the notion that Iranian population is one of the most heterogeneous i the world, this study was conducted to evaluate the association of VNTR-STR mini-haplotypes with the PAH gene mutations in PKU patients in Kermanshah province. Methods: A total of 24 unrelated Kurdish PKU patients with the known PAH gene causing mutations and 72 healthy controls were selected. The DNA fragments containing VNTR and STR systems were amplified by polymerase chain reaction (PCR). For VNTR system, PCR products were separated using electrophoresis on 2.5% agarose gel. For STR system, the samples were analyzed using DNA sequencing analysis version 5.2 software. Results: Overall, 5 PAH-VNTR-alleles, including VNTR3, 7, 8, 9, 12, and 3 PAH-STR-alleles, including STR238, 242, and 250, were detected in this study. VNTR3 and 8 alleles had the most frequency among healthy controls. Also, 6 different mini-haplotype alleles were found to be associated with PKU chromosomes. The 2 most prevalent mutations in Kermanshah province, IVS2+5G>C and IVS9+5G>A, were strongly linked to mini-haplotypes 9/242 and 8/238, respectively. Conclusion: The distributions and frequencies of VNTR alleles in Kurdish population have the most similarity to alleles previously described in European Caucasian families. Moreover, since the most common mutations in Kermanshah PKU chromosomes are rare and this was the first study on mini-haplotypes VNTR/STR among Iranian Kurdish PKU patients, given that this study was the first of its kind, it was not possible to compare its results with that of other studies on Iranian and non-Iranian populations.
Collapse
Affiliation(s)
- Reza Alibakhshi
- Department of Biochemistry, School of Medicine, Kermanshah University of Medical Sciences, Kermanshah, Iran
| | - Keivan Moradi
- Medical Biology Research Center, Kermanshah University of Medical Sciences, Kermanshah, Iran
| | - Keyghobad Ghadiri
- Infectious Disease Research Center, Kermanshah University of Medical Sciences, Kermanshah, Iran
| |
Collapse
|
3
|
Talebi S, Entezam M, Mohajer N, Kazemi-Sefat GE, Razipour M, Ahmadloo S, Setoodeh A, Keramatipour M. An Efficient Trio-Based Mini-Haplotyping Method for Genetic Diagnosis of Phenylketonuria. CELL JOURNAL 2016; 18:229-36. [PMID: 27540528 PMCID: PMC4988422 DOI: 10.22074/cellj.2016.4318] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/18/2015] [Accepted: 09/15/2015] [Indexed: 11/18/2022]
Abstract
Objective The phenylalanine hydroxylase (PAH) locus has high linkage disequilibrium. Haplotypes related to this locus may thus be considered sufficiently informative for genetic diagnosis and carrier screening using multi-allelic markers. In this study, we present an efficient
method for haplotype analysis of PAH locus using multiplexing dyes. In addition, we explain
how to resolve the dye shift challenge in multiplex short tandem repeat (STR) genotyping.
Materials and Methods One hundred family trios were included in this descriptive
study. The forward primer of a tetra-nucleotide STR and the reverse primer of a variable
number tandem repeat (VNTR) were labeled with three different non-overlapping dyes
5-carboxyfluorescein (FAM), 6-carboxy-N,N,N’,N’-tetramethylrhodamine (HEX) and 6-carboxy-N,N,N’,N’-tetramethylrhodamine (TAMRA). The polymerase chain reaction (PCR)
products from each family trio were multiplexed for capillary electrophoresis and results
were analyzed using Peak Scanner software.
Results Multiplexing trio products decreased the cost significantly. The TAMRA labeled products had a significant predictable shift (migrated at a slower electrophoretic rate) relative to the
HEX and FAM labeled products. Through our methodology we achieve, the less inter-dye shift
than intra-dye shift variance. Correcting the dye shift in the labeled products, according to the
reference allele size, significantly decreased the inter-dye variability (P<0.001).
Conclusion Multiplexing trio products helps to detect and resolve the dye shift accurately
in each family, which otherwise would result in diagnostic error. The dye system of FAM,
HEX and TAMRA is more feasible and cheaper than other dye systems.
Collapse
Affiliation(s)
- Saeed Talebi
- Department of Medical Genetics, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - Mona Entezam
- Department of Medical Genetics, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - Neda Mohajer
- Department of Medical Genetics, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - Golnaz-Ensieh Kazemi-Sefat
- Department of Medical Genetics, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran; Department of Molecular Medicine, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran, Iran
| | - Masoumeh Razipour
- Department of Medical Genetics, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - Somayeh Ahmadloo
- Department of Medical Genetics, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - Aria Setoodeh
- Department of Endocrinology, Children's Hospital Medical Center, Tehran University of Medical Sciences, Tehran, Iran
| | - Mohammad Keramatipour
- Department of Medical Genetics, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
4
|
Bagheri M, Rad IA, Jazani NH, Zarrin R, Ghazavi A. Frequency of the VNTR-Polymorphisms at the PAH Gene in the Iranian Azeri Turkish Patients with Phenylketonuria. MAEDICA 2015; 10:310-314. [PMID: 28465730 PMCID: PMC5394437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
INTRODUCTION This study was carried out to determine the frequency of the VNTR-polymorphisms at the PAH gene in the Iranian Azeri Turkish patients with phenylketonuria (PKU) and normal controls. MATERIAL AND METHODS The VNTR-polymorphisms were determined by PCR in 43 PKU patients as well as 43 controls. OUTCOMES The frequencies of VNTR-alleles were 13(15.1%), 3(3.49%), 64(74.4%), 5(5.81%), and 1(1.16%) in the patients and 43(50%), 0(0%), 42(48.8%), 0(0%), and 1(1.16%) in the controls regarding 3, 7, 8, 9, and 11 repeat copies, respectively. The VNTR alleles with 12 and 13 repeats were not found in our samples. The frequencies of VNTR-genotypes were 25(58.1%), 1(2.33%), 1(2.33%), 10(23.3%), 2(4.65%), 2(4.65%), 1(2.33%), 1(2.33%), and 0(0%) in the patients and 13(30.2%), 13(30.2%), 0(0%), 16(37.2%), 0(0%), 0(0%), 0(0%), 0(0%) and 1(2.33%) in the controls regarding VNTR8/VNTR8, VNTR3/VNTR3, VNTR3/VNTR9, VNTR8/VNTR3, VNTR8/VNTR9, VNTR7/VNTR9, VNTR7/ VNTR8, VNTR8/VNTR11, and VNTR3/VNTR11 genotypes, respectively. The comparisons of VNTRpolymorphisms imply that there are statistically significant differences between the patients and controls regarding VNTR3, VNTR8, and VNTR9 alleles as well as VNTR8/VNTR8 and VNTR3/VNTR3 genotypes (all P-Value <0.05). The frequency of "risk-associated genotype of VNTR8/VNTR8" was significantly higher in the cases. CONCLUSION It is concluded that this position is heterozygous and there were statistically significant differences between patients and controls concerning the VNTR8/VNTR8 genotype. We found higher frequencies of disease-associated genotype in our samples than controls. This report is the first in its own type in the west Azerbaijani population. Further studies require assessing how this genotype predicts adverse outcomes in tested population.
Collapse
Affiliation(s)
- Morteza Bagheri
- Food and Beverages Safety Research Center, Urmia University of Medical Sciences, Urmia, Iran
| | - Isa Abdi Rad
- Food and Beverages Safety Research Center, Urmia University of Medical Sciences, Urmia, Iran
| | - Nima Hosseini Jazani
- Food and Beverages Safety Research Center, Urmia University of Medical Sciences, Urmia, Iran
| | - Rasoul Zarrin
- Food and Beverages Safety Research Center, Urmia University of Medical Sciences, Urmia, Iran
| | - Ahad Ghazavi
- Neurophysiology Research Center, Urmia University of Medical Sciences, Urmia, Iran
| |
Collapse
|
5
|
Bagheri M, Abdi Rad I, Hosseini Jazani N, Zarrin R, Ghazavi A. Molecular Genetic Analysis of the Variable Number of Tandem-Repeat Alleles at the Phenylalanine Hydroxylase Gene in Iranian Azeri Turkish Population. IRANIAN BIOMEDICAL JOURNAL 2015; 19:183-7. [PMID: 26025954 PMCID: PMC4571015 DOI: 10.7508/ibj.2015.03.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Background: The variable numbers of tandem-repeat (VNTR) alleles at the phenylalanine hydroxylase (PAH) gene have been used in carrier detection and prenatal diagnosis in phenylketonuria families. This study was carried out to analyze VNTR alleles at the PAH gene in Iranian Azeri Turkish population. Methods: In this study, 200 alleles from general population were studied by PCR. Results: The frequencies of VNTR alleles were 45%, 46%, 2%, 3%, 1%, and 3% in studied group regarding 3, 8, 9, 11, 12, and 13 repeat copies, respectively. Statistically significant differences were not found between expected and observed frequencies of VNTR genotypes (P > 0.05). Conclusions: VNTR alleles with three and eight repeats were frequent, and the VNTR alleles with 13 repeats showed 3% frequency in the tested group. This study is the first report on tested population genetic structure using VNTR alleles at the PAH gene.
Collapse
Affiliation(s)
- Morteza Bagheri
- Food and Beverages Safety Research Center, Urmia University of Medical Sciences, Urmia, Iran.,Cellular and Molecular Research Center, Urmia University of Medical Sciences, Urmia, Iran
| | - Isa Abdi Rad
- Food and Beverages Safety Research Center, Urmia University of Medical Sciences, Urmia, Iran.,Cellular and Molecular Research Center, Urmia University of Medical Sciences, Urmia, Iran
| | - Nima Hosseini Jazani
- Food and Beverages Safety Research Center, Urmia University of Medical Sciences, Urmia, Iran
| | - Rasoul Zarrin
- Food and Beverages Safety Research Center, Urmia University of Medical Sciences, Urmia, Iran
| | - Ahad Ghazavi
- Neurophysiology Research Center, Urmia University of Medical Sciences, Urmia, Iran
| |
Collapse
|
6
|
Hozyasz KK, Mostowska A, Wójcicki P, Lasota A, Wołkowicz A, Dunin-Wilczyńska I, Jagodziński PP. Association of common variants in PAH and LAT1 with non-syndromic cleft lip with or without cleft palate (NSCL/P) in the Polish population. Arch Oral Biol 2014; 59:363-9. [PMID: 24606907 DOI: 10.1016/j.archoralbio.2014.01.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2013] [Revised: 12/23/2013] [Accepted: 01/06/2014] [Indexed: 11/30/2022]
Abstract
BACKGROUND Non-syndromic cleft lip with or without cleft palate (NSCL/P) is a common structural malformation with a complex and multifactorial aetiology. Associations of abnormalities in phenylalanine metabolism and orofacial clefts have been suggested. METHODS Eight single nucleotide polymorphisms (SNPs) of genes encoding phenylalanine hydroxylase (PAH) and large neutral l-amino acid transporter type 1 (LAT1), as well as the PAH mutation that is most common in the Polish population (rs5030858; R408W), were investigated in 263 patients with NSCL/P and 270 matched controls using high resolution melting curve analysis (HRM). RESULTS We found that two polymorphic variants of PAH appear to be risk factors for NSCL/P. The odds ratio (OR) for individuals with the rs7485331 A allele (AC or AA) compared to CC homozygotes was 0.616 (95% confidence interval [CI]=0.437-0.868; p=0.005) and this association remains statistically significant after multiple testing correction. The PAH rs12425434, previously associated with schizophrenia, was borderline associated with orofacial clefts. Moreover, haplotype analysis of polymorphisms in the PAH gene revealed a 4-marker combination that was significantly associated with NSCL/P. The global p-value for a haplotype comprised of SNPs rs74385331, rs12425434, rs1722392, and the mutation rs5030858 was 0.032, but this association did not survive multiple testing correction. CONCLUSION This study suggests the involvement of the PAH gene in the aetiology of NSCL/P in the tested population. Further replication will be required in separate cohorts to confirm the consistency of the observed association.
Collapse
Affiliation(s)
- Kamil K Hozyasz
- Department of Paediatrics, Institute of Mother and Child, Warsaw, Poland.
| | - Adrianna Mostowska
- Department of Biochemistry and Molecular Biology, Poznan University of Medical Sciences, Poznan, Poland
| | - Piotr Wójcicki
- University Clinic of Medical Academy, Wroclaw, Poland; Department of Plastic Surgery, Specialist Medical Center, Polanica Zdroj, Poland
| | - Agnieszka Lasota
- Department of Jaw Orthopaedics, Medical University of Lublin, Lublin, Poland
| | - Anna Wołkowicz
- Department of Paediatrics, Institute of Mother and Child, Warsaw, Poland
| | | | - Paweł P Jagodziński
- Department of Biochemistry and Molecular Biology, Poznan University of Medical Sciences, Poznan, Poland
| |
Collapse
|
7
|
The mutation spectrum of the phenylalanine hydroxylase (PAH) gene and associated haplotypes reveal ethnic heterogeneity in the Taiwanese population. J Hum Genet 2014; 59:145-52. [PMID: 24401910 DOI: 10.1038/jhg.2013.136] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Revised: 11/29/2013] [Accepted: 12/05/2013] [Indexed: 02/01/2023]
Abstract
Phenylalanine hydroxylase (PAH) deficiency is responsible for most cases of phenylketonuria (PKU). In this study of the PAH mutation spectrum in the Taiwanese population, 139 alleles were identified including 34 different mutations. The V190G, Q267R and F392I mutations are first reported in this study. The most common mutations, R241C, R408Q and Ex6-96A>G, account for 23.2%, 12.0% and 9.2%, of the mutant alleles, respectively. Haplotype analysis shows that R241C and Ex6-96A>G are exclusively associated with haplotype 4.3 to suggest founder effects. On the other hand, R408Q is found on two distinct haplotypes suggesting recurrent mutations. The spectrum of PAH mutations in Taiwan shows various links to those of other Asian regions, yet remarkable differences exist. Notably, R408Q, E286K and -4173_-407del, accounting for 21% of all mutant alleles in Taiwan, are very rare or are undetected among PKU cohorts of other Asian regions to suggest local founder effects. Moreover, the low homozygosity value of 0.092 hints at a high degree of ethnic heterogeneity within the Taiwanese population. Our study of PAH mutation spectrum and the associated haplotypes is useful for subsequent study on the origin and migration pattern via Taiwan, an island at the historical crossroad of migration of ancient populations.
Collapse
|
8
|
Khitrinskaya IY, Khar’kov VN, Stepanov VA. Genetic diversity of the chromosome X in aboriginal Siberian populations: The structure of linkage disequilibrium and haplotype phylogeography of the ZFX locus. Mol Biol 2010. [DOI: 10.1134/s0026893310050055] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
9
|
Zoossmann-Diskin A. The origin of Eastern European Jews revealed by autosomal, sex chromosomal and mtDNA polymorphisms. Biol Direct 2010; 5:57. [PMID: 20925954 PMCID: PMC2964539 DOI: 10.1186/1745-6150-5-57] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2010] [Accepted: 10/06/2010] [Indexed: 12/24/2022] Open
Abstract
Background This study aims to establish the likely origin of EEJ (Eastern European Jews) by genetic distance analysis of autosomal markers and haplogroups on the X and Y chromosomes and mtDNA. Results According to the autosomal polymorphisms the investigated Jewish populations do not share a common origin, and EEJ are closer to Italians in particular and to Europeans in general than to the other Jewish populations. The similarity of EEJ to Italians and Europeans is also supported by the X chromosomal haplogroups. In contrast according to the Y-chromosomal haplogroups EEJ are closest to the non-Jewish populations of the Eastern Mediterranean. MtDNA shows a mixed pattern, but overall EEJ are more distant from most populations and hold a marginal rather than a central position. The autosomal genetic distance matrix has a very high correlation (0.789) with geography, whereas the X-chromosomal, Y-chromosomal and mtDNA matrices have a lower correlation (0.540, 0.395 and 0.641 respectively). Conclusions The close genetic resemblance to Italians accords with the historical presumption that Ashkenazi Jews started their migrations across Europe in Italy and with historical evidence that conversion to Judaism was common in ancient Rome. The reasons for the discrepancy between the biparental markers and the uniparental markers are discussed. Reviewers This article was reviewed by Damian Labuda (nominated by Jerzy Jurka), Kateryna Makova and Qasim Ayub (nominated by Dan Graur).
Collapse
Affiliation(s)
- Avshalom Zoossmann-Diskin
- Department of Haematology and Genetic Pathology, School of Medicine, Flinders University, Adelaide, Australia.
| |
Collapse
|
10
|
Mukherjee N, Kidd KK, Pakstis AJ, Speed WC, Li H, Tarnok Z, Barta C, Kajuna SLB, Kidd JR. The complex global pattern of genetic variation and linkage disequilibrium at catechol-O-methyltransferase. Mol Psychiatry 2010; 15:216-25. [PMID: 18574484 PMCID: PMC2811226 DOI: 10.1038/mp.2008.64] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Genetic variation at the catechol-O-methyltransferase (COMT) gene has been significantly associated with risk for various neuropsychiatric conditions such as schizophrenia, panic disorder, bipolar disorders, anorexia nervosa and others. It has also been associated with nicotine dependence, sensitivity to pain and cognitive dysfunctions especially in schizophrenia. The non-synonymous single nucleotide polymorphism (SNP) in exon 4--Val108/158Met--is the most studied SNP at COMT and is the basis for most associations. It is not, however, the only variation in the gene; several haplotypes exist across the gene. Some studies indicate that the haplotypic combinations of alleles at the Val108/158Met SNP with those in the promoter region and in the 3'-untranslated region are responsible for the associations with disorders and not the non-synonymous SNP by itself. We have now studied DNA samples from 45 populations for 63 SNPs in a region of 172 kb across the region of 22q11.2 encompassing the COMT gene. We focused on 28 SNPs spanning the COMT-coding region and immediately flanking DNA, and found that the haplotypes are from diverse evolutionary lineages that could harbor as yet undetected variants with functional consequences. Future association studies should be based on SNPs that define the common haplotypes in the population(s) being studied.
Collapse
Affiliation(s)
- N Mukherjee
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA - 06511
| | - KK Kidd
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA - 06511
| | - AJ Pakstis
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA - 06511
| | - WC Speed
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA - 06511
| | - H Li
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA - 06511
| | - Z Tarnok
- Vadaskert Child and Adolescent Psychiatric clinic, Budapest, Hungary
| | - C Barta
- Institute of Medical Chemistry, Molecular Biology and Pathobiochemistry, Semmelweis University, Budapest,Hungary
| | - SLB Kajuna
- Hubert Kairuki Memorial University, Dar es Salaam, Tanzania
| | - JR Kidd
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA - 06511,Correspondence to: Judith Kidd. Ph.D Department of Genetics, Yale University School of Medicine, P.O. Box 208005, 333 Cedar Street, New Haven, CT-06520-8005. E-mail:
| |
Collapse
|
11
|
Liu N, Bucala R, Zhao H. Modeling Informatively Missing Genotypes in Haplotype Analysis. COMMUN STAT-THEOR M 2009; 38:3445-3460. [PMID: 20052310 DOI: 10.1080/03610920802696588] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
It is common to have missing genotypes in practical genetic studies. The majority of the existing statistical methods, including those on haplotype analysis, assume that genotypes are missing at random-that is, at a given marker, different genotypes and different alleles are missing with the same probability. In our previous work, we have demonstrated that the violation of this assumption may lead to serious bias in haplotype frequency estimates and haplotype association analysis. We have proposed a general missing data model to simultaneously characterize missing data patterns across a set of two or more biallelic markers. We have proved that haplotype frequencies and missing data probabilities are identifiable if and only if there is linkage disequilibrium between these markers under the general missing data model. In this study, we extend our work to multi-allelic markers and observe a similar finding. Simulation studies on the analysis of haplotypes consisting of two markers illustrate that our proposed model can reduce the bias for haplotype frequency estimates due to incorrect assumptions on the missing data mechanism. Finally, we illustrate the utilities of our method through its application to a real data set from a study of scleroderma.
Collapse
Affiliation(s)
- Nianjun Liu
- Department of Biostatistics, University of Alabama at Birmingham, Birmingham, AL
| | | | | |
Collapse
|
12
|
Global patterns of variation in allele and haplotype frequencies and linkage disequilibrium across the CYP2E1 gene. THE PHARMACOGENOMICS JOURNAL 2008; 8:349-56. [PMID: 18663376 DOI: 10.1038/tpj.2008.9] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Cytochrome P450 2E1, gene symbol CYP2E1, is one of a family of enzymes with a central role in activating and detoxifying xenobiotics and endogenous compounds. Genetic variation at this gene has been reported in different human populations, and some association studies have reported increased risk for cancers and other diseases. To the best of our knowledge, multi-single-nucleotide polymorphism haplotypes and linkage disequilibrium (LD) have not been systematically studied for CYP2E1 in multiple populations. Haplotypes can greatly increase the power both to identify patterns of genetic variation relevant for gene expression as well as to detect disease-related susceptibility mutations. We present frequency and LD data and analyses for 11 polymorphisms and their haplotypes that we have studied on over 2600 individuals from 50 human population samples representing the major geographical regions of the world. The diverse patterns of haplotype variation found in the different populations we have studied show that ethnicity may be an important variable helping to explain inconsistencies that have been reported by association studies. More studies clearly are needed of the variants we have studied, especially those in the 5' region, such as the variable number of tandem repeats, as well as studies of additional polymorphisms known for this gene to establish evidence relating any systematic differences in gene expression that exist to the haplotypes at this gene.
Collapse
|
13
|
Abstract
"Inborn errors of metabolism," first recognized 100 years ago by Garrod, were seen as transforming evidence for chemical and biological individuality. Phenylketonuria (PKU), a Mendelian autosomal recessive phenotype, was identified in 1934 by Asbjörn Fölling. It is a disease with impaired postnatal cognitive development resulting from a neurotoxic effect of hyperphenylalaninemia (HPA). Its metabolic phenotype is accountable to multifactorial origins both in nurture, where the normal nutritional experience introduces L-phenylalanine, and in nature, where mutations (>500 alleles) occur in the phenylalanine hydroxylase gene (PAH) on chromosome 12q23.2 encoding the L-phenylalanine hydroxylase enzyme (EC 1.14.16.1). The PAH enzyme converts phenylalanine to tyrosine in the presence of molecular oxygen and catalytic amounts of tetrahydrobiopterin (BH4), its nonprotein cofactor. PKU is among the first of the human genetic diseases to enter, through newborn screening, the domain of public health, and to show a treatment effect. This effect caused a paradigm shift in attitudes about genetic disease. The PKU story contains many messages, including: a framework on which to appreciate the complexity of PKU in which phenotype reflects both locus-specific and genomic components; what the human PAH gene tells us about human population genetics and evolution of modern humans; and how our interest in PKU is served by a locus-specific mutation database (http://www.pahdb.mcgill.ca; last accessed 20 March 2007). The individual Mendelian PKU phenotype has no "simple" or single explanation; every patient has her/his own complex PKU phenotype and will be treated accordingly. Knowledge about PKU reveals genomic components of both disease and health.
Collapse
Affiliation(s)
- Charles R Scriver
- Department of Human Genetics, Faculty of Medicine, McGill University, Montreal, Quebec, Canada.
| |
Collapse
|
14
|
Wollstein A, Herrmann A, Wittig M, Nothnagel M, Franke A, Nürnberg P, Schreiber S, Krawczak M, Hampe J. Efficacy assessment of SNP sets for genome-wide disease association studies. Nucleic Acids Res 2007; 35:e113. [PMID: 17726055 PMCID: PMC2034459 DOI: 10.1093/nar/gkm621] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
The power of a genome-wide disease association study depends critically upon the properties of the marker set used, particularly the number and physical spacing of markers, and the level of inter-marker association due to linkage disequilibrium. Extending our previously devised theoretical framework for the entropy-based selection of genetic markers, we have developed a local measure of the efficacy of a marker set, relative to including a maximally polymorphic single nucleotide polymorphism (SNP) at the map position of interest. Using this quantitative criterion, we evaluated five currently available SNP sets, namely Affymetrix 100K and 500K, and Illumina 100K, 300K and 550K in the CEU, YRI and JPT + CHB HapMap populations. At 50% relative efficacy, the commercial marker sets cover between 19 and 68% of the human genome, depending upon the population under study. An optimal technology-independent 500K marker set constructed from HapMap for Caucasians, in contrast, would achieve 73% coverage at the same relative efficacy.
Collapse
Affiliation(s)
- Andreas Wollstein
- Cologne Center for Genomics, Cologne, Institute of Clinical Molecular Biology, Christian-Albrechts University, Ist Department of Medicine and Institute of Medical Informatics and Statistics, Christian-Albrechts University, University Hospital Schleswig-Holstein Campus Kiel, Kiel, Germany
| | - Alexander Herrmann
- Cologne Center for Genomics, Cologne, Institute of Clinical Molecular Biology, Christian-Albrechts University, Ist Department of Medicine and Institute of Medical Informatics and Statistics, Christian-Albrechts University, University Hospital Schleswig-Holstein Campus Kiel, Kiel, Germany
| | - Michael Wittig
- Cologne Center for Genomics, Cologne, Institute of Clinical Molecular Biology, Christian-Albrechts University, Ist Department of Medicine and Institute of Medical Informatics and Statistics, Christian-Albrechts University, University Hospital Schleswig-Holstein Campus Kiel, Kiel, Germany
| | - Michael Nothnagel
- Cologne Center for Genomics, Cologne, Institute of Clinical Molecular Biology, Christian-Albrechts University, Ist Department of Medicine and Institute of Medical Informatics and Statistics, Christian-Albrechts University, University Hospital Schleswig-Holstein Campus Kiel, Kiel, Germany
| | - Andre Franke
- Cologne Center for Genomics, Cologne, Institute of Clinical Molecular Biology, Christian-Albrechts University, Ist Department of Medicine and Institute of Medical Informatics and Statistics, Christian-Albrechts University, University Hospital Schleswig-Holstein Campus Kiel, Kiel, Germany
| | - Peter Nürnberg
- Cologne Center for Genomics, Cologne, Institute of Clinical Molecular Biology, Christian-Albrechts University, Ist Department of Medicine and Institute of Medical Informatics and Statistics, Christian-Albrechts University, University Hospital Schleswig-Holstein Campus Kiel, Kiel, Germany
| | - Stefan Schreiber
- Cologne Center for Genomics, Cologne, Institute of Clinical Molecular Biology, Christian-Albrechts University, Ist Department of Medicine and Institute of Medical Informatics and Statistics, Christian-Albrechts University, University Hospital Schleswig-Holstein Campus Kiel, Kiel, Germany
| | - Michael Krawczak
- Cologne Center for Genomics, Cologne, Institute of Clinical Molecular Biology, Christian-Albrechts University, Ist Department of Medicine and Institute of Medical Informatics and Statistics, Christian-Albrechts University, University Hospital Schleswig-Holstein Campus Kiel, Kiel, Germany
| | - Jochen Hampe
- Cologne Center for Genomics, Cologne, Institute of Clinical Molecular Biology, Christian-Albrechts University, Ist Department of Medicine and Institute of Medical Informatics and Statistics, Christian-Albrechts University, University Hospital Schleswig-Holstein Campus Kiel, Kiel, Germany
- *To whom correspondence should be addressed. +49 431 597 1246+49 431 597 1842
| |
Collapse
|
15
|
Hardelid P, Cortina-Borja M, Munro A, Jones H, Cleary M, Champion MP, Foo Y, Scriver CR, Dezateux C. The Birth Prevalence of PKU in Populations of European, South Asian and Sub-Saharan African Ancestry Living in South East England. Ann Hum Genet 2007; 72:65-71. [DOI: 10.1111/j.1469-1809.2007.00389.x] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
|
16
|
Hu Z, Wang H, Shao M, Jin G, Sun W, Wang Y, Liu H, Wang Y, Ma H, Qian J, Jin L, Wei Q, Lu D, Huang W, Shen H. Genetic variants inMGMTand risk of lung cancer in Southeastern Chinese: a haplotype-based analysis. Hum Mutat 2007; 28:431-40. [PMID: 17285603 DOI: 10.1002/humu.20462] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
O6-alkylguanine-DNA alkyltransferase (MGMT) is a universal DNA repair protein involved in the DNA direct reversal repair pathway that copes with alkylating carcinogens. Reduced MGMT expression as well as enzyme activity may result in an increased susceptibility to cancers. To elucidate the role of sequence variation in MGMT in the etiology of lung cancer, we conducted a comprehensive association study focusing on linkage disequilibrium (LD) structure of common variations across the MGMT sequence and its modification effect on smoking-related lung cancer risk. We rebuilt the LD block of MGMT by genotyping 39 SNPs and selected a subset of 10 haplotype-tagging SNPs (htSNP) and three pre- and interblock SNPs to capture variation across MGMT. By using a haplotype-based multifactor dimensionality reduction (MDR) analysis, we found that there were significant more-than-multiplicative interactions between diplotypes in block 5 and cumulative smoking and additive interaction between genotypes of preblock SNP rs1625649:C>A and smoking status in relation on lung cancer risk. Diplotypes in block 3 and block 5, genotypes of rs1625649:C>A, and trichotomized cumulative smoking are the four factors included in the MDR-defined best model on lung cancer risk. When these variables were combined and dichotomized, we found that subjects carrying the combined risk stratum had a significantly increased risk for lung cancer of 4.10-fold (odds ratio [OR]=4.10, 95% confidence interval [CI]=3.12-5.37, P=2.09 x 10(-24)). These findings suggest that genetic variants in MGMT may modulate the risk of smoking-related lung cancer. This haplotype-based interaction analysis might provide a "proof-of-principle" approach for studying candidate genes in cancer susceptibility.
Collapse
Affiliation(s)
- Zhibin Hu
- Department of Epidemiology and Biostatistics, Cancer Research Center of Nanjing Medical University, Nanjing, China
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Nothnagel M, Wollstein A, Krawczak M. Comparative Assessment of the Association Information Captured by SNP Tagging. Hum Hered 2007; 64:27-34. [PMID: 17483594 DOI: 10.1159/000101420] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Exploiting the association between single nucleotide polymorphisms (SNP) can potentially reduce the costs of association mapping of common disease genes. Different methods have been proposed for defining subsets of SNPs as proxies (or tagSNPs) for other SNPs, some of which rely upon a model of haplotype blocks. Other approaches only consider the pair-wise correlation between markers as a basis for selecting tagSNPs. Yet another, recently proposed model-based method takes marker heterozygosity and genetic distance into account in order to maximize the expected utility of a marker set to map frequent, but unobserved genetic variants. We compared these tagging approaches with regard to their ability to correlate tagSNPs and bi-allelic, potentially disease-causing genetic variants. We used the CEU sample of chromosome 19 from the HapMap project for an initial comparison, and demonstrated a comparable performance of both approaches but a difference in terms of tagSNPs selected and variants captured. In any case, we conclude that a considerable loss of information appears to be inherent to any type of SNP tagging, even when dense marker sets are available for SNP selection.
Collapse
Affiliation(s)
- Michael Nothnagel
- Institute of Medical Informatics and Statistics, University Hospital Schleswig-Holstein, Christian-Albrechts-University, Kiel, Germany.
| | | | | |
Collapse
|
18
|
Jungerius BJ, Gu J, Crooijmans RPMA, van der Poel JJ, Groenen MAM, van Oost BA, Te Pas MFW. ESTIMATION OF THE EXTENT OF LINKAGE DISEQUILIBRIUM IN SEVEN REGIONS OF THE PORCINE GENOME. Anim Biotechnol 2007; 16:41-54. [PMID: 28881173 DOI: 10.1081/abio-200053402] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
Linkage disequilibrium (LD) refers to the correlation among neighboring alleles, reflecting non-random patterns of association between alleles at (nearby) loci. A better understanding of LD in the porcine genome is of direct relevance for identification of genes and mutations with a certain effect on the traits of interest. Here, 215 SNPs in seven genomic regions were genotyped in individuals of three breeds. Pairwise linkage disequilibrium was calculated for all marker pairs. To estimate the extent of LD, all pairwise LD values were plotted against the distance between the markers. Based on SNP markers in four genomic regions analyzed in three panels from populations of Large White, Dutch Landrace, and Meishan origin, useful LD is estimated to extend for approximately 40 to 60 kb in the porcine genome.
Collapse
Affiliation(s)
- Bart J Jungerius
- a Department of Animal Breeding and Genetics , Wageningen University , Wageningen , Netherlands
| | - Jingjing Gu
- a Department of Animal Breeding and Genetics , Wageningen University , Wageningen , Netherlands
| | | | - Jan J van der Poel
- a Department of Animal Breeding and Genetics , Wageningen University , Wageningen , Netherlands
| | - Martien A M Groenen
- a Department of Animal Breeding and Genetics , Wageningen University , Wageningen , Netherlands
| | - Bernard A van Oost
- b Department of Laboratory Animal Sciences , Utrecht University , Utrecht , Netherlands
| | - Marinus F W Te Pas
- c Animal Sciences Group , Wageningen University and Research Centre , Lelystad , Netherlands
| |
Collapse
|
19
|
Gu S, Pakstis AJ, Li H, Speed WC, Kidd JR, Kidd KK. Significant variation in haplotype block structure but conservation in tagSNP patterns among global populations. Eur J Hum Genet 2007; 15:302-12. [PMID: 17202997 DOI: 10.1038/sj.ejhg.5201751] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
The initial belief that haplotype block boundaries and haplotypes were largely shared across populations was a foundation for constructing a haplotype map of the human genome using common SNP markers. The HapMap data document the generality of a block-like pattern of linkage disequilibrium (LD) with regions of low and high haplotype diversity but differences among the populations. Studies of many additional populations demonstrate that LD patterns can be highly variable among populations both across and within geographic regions. Because of this variation, emphasis has shifted to the generalizability of tagSNPs, those SNPs that capture the bulk of variation in a region. We have examined the LD and tagSNP patterns based upon over 2000 individual samples in 38 populations and 134 SNPs in 10 genetically independent loci for a total of 517 kb with an average density of 1 SNP/5 kb. Four different 'block' definitions and the pairwise LD tagSNP selection algorithm have been applied. Our results not only confirm large variation in block partition among populations from different regions (agreeing with previous studies including the HapMap) but also show that significant variation can occur among populations within geographic regions. None of the block-defining algorithms produces a consistent pattern within or across all geographic groups. In contrast, tagSNP transferability is much greater than the similarity of LD patterns and, although not perfect, some generalizations of transferability are possible. The analyses show an asymmetric pattern of tagSNP transferability coinciding with the subsetting of variation attributed to the spread of modern humans around the world.
Collapse
Affiliation(s)
- Sheng Gu
- Department of Genetics, Yale University School of Medicine, New Haven, CT 06520-8005, USA
| | | | | | | | | | | |
Collapse
|
20
|
Abstract
Phenylketonuria (PKU), an inborn error of metabolism, is caused by mutations in the phenylalanine hydroxylase (PAH) gene. In the Serbian population, 19 different PAH mutations have been identified. We used PAH mutations as molecular markers for population genetics study. The low homozygosity value of the PAH gene (0.10) indicates that PKU in Serbia is heterogeneous, reflecting numerous migrations throughout Southeast Europe. The strategy for molecular diagnostics of PKU was designed accordingly. To elucidate the origin of the most common (L48S) PKU mutation in Serbia, we performed haplotype analysis by PCR-RFLP. Our results suggest that the L48S mutation was imported into Serbia from populations with different genetic backgrounds.
Collapse
|
21
|
Liu N, Beerman I, Lifton R, Zhao H. Haplotype analysis in the presence of informatively missing genotype data. Genet Epidemiol 2006; 30:290-300. [PMID: 16528706 DOI: 10.1002/gepi.20144] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
It is common to have missing genotypes in practical genetic studies, but the exact underlying missing data mechanism is generally unknown to the investigators. Although some statistical methods can handle missing data, they usually assume that genotypes are missing at random, that is, at a given marker, different genotypes and different alleles are missing with the same probability. These include those methods on haplotype frequency estimation and haplotype association analysis. However, it is likely that this simple assumption does not hold in practice, yet few studies to date have examined the magnitude of the effects when this simplifying assumption is violated. In this study, we demonstrate that the violation of this assumption may lead to serious bias in haplotype frequency estimates, and haplotype association analysis based on this assumption can induce both false-positive and false-negative evidence of association. To address this limitation in the current methods, we propose a general missing data model to characterize missing data patterns across a set of two or more markers simultaneously. We prove that haplotype frequencies and missing data probabilities are identifiable if and only if there is linkage disequilibrium between these markers under our general missing data model. Simulation studies on the analysis of haplotypes consisting of two single nucleotide polymorphisms illustrate that our proposed model can reduce the bias both for haplotype frequency estimates and association analysis due to incorrect assumption on the missing data mechanism. Finally, we illustrate the utilities of our method through its application to a real data set.
Collapse
Affiliation(s)
- Nianjun Liu
- Department of Biostatistics, University of Alabama at Birmingham, Birmingham, USA
| | | | | | | |
Collapse
|
22
|
Goldstein O, Zangerl B, Pearce-Kelling S, Sidjanin DJ, Kijas JW, Felix J, Acland GM, Aguirre GD. Linkage disequilibrium mapping in domestic dog breeds narrows the progressive rod-cone degeneration interval and identifies ancestral disease-transmitting chromosome. Genomics 2006; 88:541-50. [PMID: 16859891 PMCID: PMC4006154 DOI: 10.1016/j.ygeno.2006.05.013] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2006] [Revised: 05/30/2006] [Accepted: 05/31/2006] [Indexed: 11/16/2022]
Abstract
Canine progressive rod-cone degeneration (prcd) is a retinal disease previously mapped to a broad, gene-rich centromeric region of canine chromosome 9. As allelic disorders are present in multiple breeds, we used linkage disequilibrium (LD) to narrow the approximately 6.4-Mb interval candidate region. Multiple dog breeds, each representing genetically isolated populations, were typed for SNPs and other polymorphisms identified from BACs. The candidate region was initially localized to a 1.5-Mb zero recombination interval between growth factor receptor-bound protein 2 (GRB2) and SEC14-like 1 (SEC14L). A fine-scale haplotype of the region was developed, which reduced the LD interval to 106 kb and identified a conserved haplotype of 98 polymorphisms present in all prcd-affected chromosomes from 14 different dog breeds. The findings strongly suggest that a common ancestor transmitted the prcd disease allele to many of the modern dog breeds and demonstrate the power of the LD approach in the canine model.
Collapse
Affiliation(s)
- Orly Goldstein
- James A. Baker Institute, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
| | - Barbara Zangerl
- Section of Medical Genetics, School of Veterinary Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Sue Pearce-Kelling
- James A. Baker Institute, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
| | - Duska J. Sidjanin
- Department of Ophthalmology, Medical College of Wisconsin, Milwaukee, WI, USA
| | - James W. Kijas
- CSIRO Livestock Industries, Brisbane, Queensland, Australia
| | - Jeanette Felix
- OptiGen, LLC, Cornell Business & Technology Park; Ithaca, NY, USA
| | - Gregory M Acland
- James A. Baker Institute, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
| | - Gustavo D. Aguirre
- Department of Ophthalmology, Medical College of Wisconsin, Milwaukee, WI, USA
- corresponding author: Gustavo D. Aguirre, Section of Medical Genetics, School of Veterinary Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA; phone: 215-898-4667; fax: 215-573-2162;
| |
Collapse
|
23
|
Fredman D, Sawyer SL, Strömqvist L, Mottagui-Tabar S, Kidd KK, Wahlestedt C, Chanock SJ, Brookes AJ. Nonsynonymous SNPs: validation characteristics, derived allele frequency patterns, and suggestive evidence for natural selection. Hum Mutat 2006; 27:173-86. [PMID: 16429399 DOI: 10.1002/humu.20289] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
We experimentally investigated more than 1,200 entries in dbSNP that would change amino-acids (nsSNPs), using various subsets of DNA samples drawn from 18 global populations (approximately 1,000 subjects in total). First, we mined the data for any SNP features that correlated with a high validation rate. Useful predictors of valid SNPs included multiple submissions to dbSNP, having a dbSNP validation statement, and being present in a low number of ESTs. Together, these features improved validation rates by almost 10-fold. Higher-abundance SNPs (e.g., T/C variants) also validated more frequently. Second, we considered derived alleles and noted a considerably (approximately 10%) increased average derived allele frequency (DAF) in Europeans vs. Africans, plus a further increase in some other populations. This was not primarily due to an SNP ascertainment bias, nor to the effects of natural selection. Instead, it can be explained as a drift-based, progressive increase in DAF that occurs over many generations and becomes exaggerated during population bottlenecks. This observation could be used as the basis for novel DAF-based tests for comparing demographic histories. Finally, we considered individual marker patterns and identified 37 SNPs with allele frequency variance or FST values consistent with the effects of population-specific natural selection. Four particularly striking clusters of these markers were apparent, and three of these coincide with genes/regions from among only several dozen such domains previously suggested by others to carry signatures of selection.
Collapse
Affiliation(s)
- David Fredman
- Center for Genomics and Bioinformatics, Karolinska Institute, Stockholm, Sweden
| | | | | | | | | | | | | | | |
Collapse
|
24
|
Meyer D, Single RM, Mack SJ, Erlich HA, Thomson G. Signatures of demographic history and natural selection in the human major histocompatibility complex Loci. Genetics 2006; 173:2121-42. [PMID: 16702436 PMCID: PMC1569707 DOI: 10.1534/genetics.105.052837] [Citation(s) in RCA: 93] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Many lines of evidence show that several HLA loci have experienced balancing selection. However, distinguishing among demographic and selective explanations for patterns of variation observed with HLA genes remains a challenge. In this study we address this issue using data from a diverse set of human populations at six classical HLA loci and, employing a comparative genomics approach, contrast results for HLA loci to those for non-HLA markers. Using a variety of analytic methods, we confirm and extend evidence for selection acting on several HLA loci. We find that allele frequency distributions for four of the six HLA loci deviate from neutral expectations and show that this is unlikely to be explained solely by demographic factors. Other features of HLA variation are explained in part by demographic history, including decreased heterozygosity and increased LD for populations at greater distances from Africa and a similar apportionment of genetic variation for HLA loci compared to putatively neutral non-HLA loci. On the basis of contrasts among different HLA loci and between HLA and non-HLA loci, we conclude that HLA loci bear detectable signatures of both natural selection and demographic history.
Collapse
Affiliation(s)
- Diogo Meyer
- Departmento de Genética e Evolução, Universidade de São Paulo, Brazil, and Children's Hospital Oakland Research Institute, Oakland, CA 94609, USA.
| | | | | | | | | |
Collapse
|
25
|
Howie BN, Carlson CS, Rieder MJ, Nickerson DA. Efficient selection of tagging single-nucleotide polymorphisms in multiple populations. Hum Genet 2006; 120:58-68. [PMID: 16680432 DOI: 10.1007/s00439-006-0182-5] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2006] [Accepted: 03/30/2006] [Indexed: 10/24/2022]
Abstract
Common genetic polymorphism may explain a portion of the heritable risk for common diseases, so considerable effort has been devoted to finding and typing common single-nucleotide polymorphisms (SNPs) in the human genome. Many SNPs show correlated genotypes, or linkage disequilibrium (LD), suggesting that only a subset of all SNPs (known as tagging SNPs, or tagSNPs) need to be genotyped for disease association studies. Based on the genetic differences that exist among human populations, most tagSNP sets are defined in a single population and applied only in populations that are closely related. To improve the efficiency of multi-population analyses, we have developed an algorithm called MultiPop-TagSelect that finds a near-minimal union of population-specific tagSNP sets across an arbitrary number of populations. We present this approach as an extension of LD-select, a tagSNP selection method that uses a greedy algorithm to group SNPs into bins based on their pairwise association patterns, although the MultiPop-TagSelect algorithm could be used with any SNP tagging approach that allows choices between nearly equivalent SNPs. We evaluate the algorithm by considering tagSNP selection in candidate-gene resequencing data and lower density whole-chromosome data. Our analysis reveals that an exhaustive search is often intractable, while the developed algorithm can quickly and reliably find near-optimal solutions even for difficult tagSNP selection problems. Using populations of African, Asian, and European ancestry, we also show that an optimal multi-population set of tagSNPs can be substantially smaller (up to 44%) than a typical set obtained through independent or sequential selection.
Collapse
Affiliation(s)
- Bryan N Howie
- Department of Genome Sciences, University of Washington, Box 357730, Seattle, WA 98195, USA
| | | | | | | |
Collapse
|
26
|
Han Y, Oota H, Osier MV, Pakstis AJ, Speed WC, Odunsi A, Okonofua F, Kajuna SLB, Karoma NJ, Kungulilo S, Grigorenko E, Zhukova OV, Bonne-Tamir B, Lu RB, Parnas J, Schulz LO, Kidd JR, Kidd KK. Considerable haplotype diversity within the 23kb encompassing the ADH7 gene. Alcohol Clin Exp Res 2006; 29:2091-100. [PMID: 16385178 DOI: 10.1097/01.alc.0000191769.92667.04] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
BACKGROUND Of the seven known human alcohol dehydrogenase (ADH) genes, the non-liver expressed ADH7 gene codes for the enzyme with the highest maximal activity for ethanol. Previous study from our laboratory has suggested that ADH7 has an epistatic role for protection against alcoholism based on a single ADH7 SNP. METHODS We have now studied seven SNPs, additional populations for the SNP previously examined, and six more new SNPs, across 23 kb of ADH7 in 38 population samples originating from different geographical regions of the world. RESULTS The overall linkage disequilibrium is moderate to strong across this region even though considerable 7-SNP haplotype diversity is observed. This uncommonly high haplotype diversity is explained by high LD within each "half," the three upstream SNPs and the four downstream SNPs, but near randomization between the "halves." This division significantly simplified the haplotype pattern: only four major haplotypes account for almost all chromosomes in all populations in each "half." CONCLUSIONS The low linkage disequilibrium between these two "halves" suggests multiple recombination(s) have occurred in this region, specifically, within intron 7. The absence of strong LD between the functional variation in ADH1B that is strongly associated with alcoholism and any of the variation in ADH7 supports the genetic independence of ADH7 in association studies. Thus, the previously observed epistatic effect of ADH7 cannot be explained by its linkage disequilibrium with a causative factor in ADH1B.
Collapse
Affiliation(s)
- Yi Han
- Department of Genetics, Yale University, School of Medicine, New Haven, CT 06520-8005, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Sabbagh A, Darlu P. SNP selection at the NAT2 locus for an accurate prediction of the acetylation phenotype. Genet Med 2006; 8:76-85. [PMID: 16481889 DOI: 10.1097/01.gim.0000200951.54346.d6] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
PURPOSE Genetic polymorphisms in the N-acetyltransferase 2 gene determine the individual acetylator status, which influences both the toxicity and efficacy profile of acetylated drugs. Determination of an individual's acetylation phenotype prior to initiation of therapy, through DNA-based tests, should permit to improve therapy response and reduce adverse events. However, due to extensive linkage disequilibrium between markers within NAT2, the genotyping of closely spaced markers yields highly redundant data: testing them all is expensive and often unnecessary. The objective of this study is to establish the optimal strategy to define, in the genetic context of a given ethnic group, the most informative set of single-nucleotide polymorphisms that best enables accurate prediction of acetylation phenotype. METHODS Three classification methods have been investigated (classification trees, artificial neural networks and multifactor dimensionality reduction method) in order to find the optimal set of single-nucleotide polymorphisms enabling the most efficient classification of individuals in rapid and slow acetylators. RESULTS Our results show that, in almost all population samples, only one or two single-nucleotide polymorphisms would be enough to obtain a good predictive capacity with no or only a modest reduction in power relative to direct assays of all common markers. In contrast, in Black African populations, where lower levels of linkage disequilibrium are observed at NAT2, a larger number of single-nucleotide polymorphisms are required to predict acetylation phenotype. CONCLUSION The results of this study will be helpful for the design of time- and cost-effective pharmacogenetic tests (adapted to specific populations) that could be used as routine tools in clinical practice.
Collapse
Affiliation(s)
- Audrey Sabbagh
- Unité de Recherche en Génétique Epidémiologique et Structure des Populations Humaines, INSERM U535, Villejuif, France
| | | |
Collapse
|
28
|
Gardner M, González-Neira A, Lao O, Calafell F, Bertranpetit J, Comas D. Extreme population differences across Neuregulin 1 gene, with implications for association studies. Mol Psychiatry 2006; 11:66-75. [PMID: 16189508 DOI: 10.1038/sj.mp.4001749] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Neuregulin 1 (NRG1) is one of the most exciting candidate genes for schizophrenia in recent years since its first association with the disease in an Icelandic population. Since then, many association studies have analysed allele and haplotype frequencies in distinct populations yielding varying results: some have replicated the association, although with different alleles or haplotypes being associated, whereas others have failed to replicate the association. These contradictory results might be attributed to population differences in allele and haplotype frequencies. In order to approach this issue, we have typed 13 SNPs across this large 1.4 Mb gene, including two of the SNPs originally found associated with schizophrenia in the Icelandic population, the objective being to discover if the underlying cause of the association discrepancies to date may be due to population-specific genetic variation. The analyses have been performed in a total of 1088 individuals from 39 populations, covering most of the genetic diversity in the human species. Most of the SNPs analysed displayed differing frequencies according to geographical region. These allele differences are especially relevant in two SNPs located in a large intron of the gene, as shown by the extreme F(ST) values, which reveal genetic stratification correlated to broad continental areas. This finding may be indicative of the influence of some local selective forces on this gene. Furthermore, haplotype analysis reveals a clear clustering according to geographical areas. In summary, our findings suggest that NRG1 presents extreme population differences in allele and haplotype frequencies. We have given recommendations for taking this into account in future association studies since this diversity could give rise to erroneous results.
Collapse
Affiliation(s)
- M Gardner
- Unitat de Biologia Evolutiva, Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, Doctor Aiguader 80, 08003 Barcelona, Catalonia, Spain
| | | | | | | | | | | |
Collapse
|
29
|
Abstract
Currently, more than 10 million DNA sequence variations have been uncovered in the human genome. The most detailed variation discovery efforts have focused on candidate genes involved in cardiovascular disease or in susceptibilities associated with exposure to environmental agents. Here we provide an overview of natural genetic variation from the literature and in 510 human candidate genes resequenced for variation discovery. The average human gene contains 126 biallelic polymorphisms, 46 of which are common (> or =5% minor allele frequency) and 5 of which are found in coding regions. Using this complete picture of genetic diversity, we explore conservation, signatures of selection, and historical recombination to mine information useful for candidate gene association studies. In general, we find that the patterns of human gene variation suggest that no one approach will be appropriate for genetic association studies across all genes. Therefore, many different approaches may be required to identify the elusive genotypes associated with common human phenotypes.
Collapse
Affiliation(s)
- Dana C Crawford
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA.
| | | | | |
Collapse
|
30
|
Nothnagel M, Rohde K. The effect of single-nucleotide polymorphism marker selection on patterns of haplotype blocks and haplotype frequency estimates. Am J Hum Genet 2005; 77:988-98. [PMID: 16380910 PMCID: PMC1285181 DOI: 10.1086/498175] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2004] [Accepted: 09/16/2004] [Indexed: 11/03/2022] Open
Abstract
The definition of haplotype blocks of single-nucleotide polymorphisms (SNPs) has been proposed so that the haplotypes can be used as markers in association studies and to efficiently describe human genetic variation. The International Haplotype Map (HapMap) project to construct a comprehensive catalog of haplotypic variation in humans is underway. However, a number of factors have already been shown to influence the definition of blocks, including the population studied and the sample SNP density. Here, we examine the effect that marker selection has on the definition of blocks and the pattern of haplotypes by using comparable but complementary SNP sets and a number of block definition methods in various genomic regions and populations that were provided by the Encyclopedia of DNA Elements (ENCODE) project. We find that the chosen SNP set has a profound effect on the block-covered sequence and block borders, even at high marker densities. Our results question the very concept of discrete haplotype blocks and the possibility of generalizing block findings from the HapMap project. We comparatively apply the block-free tagging-SNP approach and discuss both the haplotype approach and the tagging-SNP approach as means to efficiently catalog genetic variation.
Collapse
Affiliation(s)
- Michael Nothnagel
- Department of Bioinformatics, Max Delbrück Center for Molecular Medicine, Berlin, Germany.
| | | |
Collapse
|
31
|
Sawyer SL, Mukherjee N, Pakstis AJ, Feuk L, Kidd JR, Brookes AJ, Kidd KK. Linkage disequilibrium patterns vary substantially among populations. Eur J Hum Genet 2005; 13:677-86. [PMID: 15657612 DOI: 10.1038/sj.ejhg.5201368] [Citation(s) in RCA: 89] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
A major initiative to create a global human haplotype map has recently been launched as a tool to improve the efficiency of disease gene mapping. The 'HapMap' project will study common variants in depth in four (and to a lesser degree in up to 12) populations to catalogue haplotypes that are expected to be common to all populations. A hope of the 'HapMap' project is that much of the genome occurs in regions of limited diversity such that only a few of the SNPs in each region will capture the diversity and be relevant around the world. In order to explore the implications of studying only a limited number of populations, we have analyzed linkage disequilibrium (LD) patterns of three 175-320 kb genomic regions in 16 diverse populations with an emphasis on African and European populations. Analyses of these three genomic regions provide empiric demonstration of marked differences in frequencies of the same few haplotypes, resulting in differences in the amount of LD and very different sets of haplotype frequencies. These results highlight the distinction between the statistical concept of LD and the biological reality of haplotypes and their frequencies. The significant quantitative and qualitative variation in LD among populations, even for populations within a geographic region, emphasizes the importance of studying diverse populations in the HapMap project to assure broad applicability of the results.
Collapse
Affiliation(s)
- Sarah L Sawyer
- Center for Genomics and Bioinformatics, Karolinska Institute, Berzelius väg 35, Stockholm 17177, Sweden
| | | | | | | | | | | | | |
Collapse
|
32
|
Ramírez-Soriano A, Lao O, Soldevila M, Calafell F, Bertranpetit J, Comas D. Haplotype tagging efficiency in worldwide populations in CTLA4 gene. Genes Immun 2005; 6:646-57. [PMID: 16034471 DOI: 10.1038/sj.gene.6364251] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
The cytotoxic T lymphocyte antigen 4 (CTLA4) acts as a potent negative regulator of T-cell response, and has been suggested as a pivotal candidate gene for autoimmune disorders such as Graves' disease, type 1 diabetes and autoimmune hypothyroidism, among others. Several single-nucleotide polymorphisms (SNPs) have been proposed as the susceptibility variants, or to be in strong linkage disequilibrium (LD) with the variant. Nevertheless, contradictory results have been found, which may be due to lack of knowledge of the genetic structure of CTLA4 and its geographic variation. We have typed 17 SNPs throughout the CTLA4 gene region in order to analyze the haplotype diversity and LD structure in a worldwide population set (1262 individuals from 44 populations) to understand the variation pattern of the region. Allele and haplotype frequency differentiation between populations is consistent with genomewide averages and points to a lack of strong population-specific selection pressures. LD is high and its pattern is not significantly different within or between continents. However, haplotype composition is significantly different between geographical groups. A continent-specific set of haplotype tagging SNPs has been designed to be used for future association studies. These are portable among populations, although their efficiency might vary depending on the population haplotype spectrum.
Collapse
Affiliation(s)
- A Ramírez-Soriano
- Unitat de Biologia Evolutiva, Departament de Ciències Experimentals i de la Salut, Facultat de Ciències de la Salut i de la Vida, Universitat Pompeu Fabra, Barcelona, Catalonia 08003, Spain
| | | | | | | | | | | |
Collapse
|
33
|
Kim JJ, Verdu P, Pakstis AJ, Speed WC, Kidd JR, Kidd KK. Use of autosomal loci for clustering individuals and populations of East Asian origin. Hum Genet 2005; 117:511-9. [PMID: 16028061 DOI: 10.1007/s00439-005-1334-8] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2004] [Accepted: 04/14/2005] [Indexed: 10/25/2022]
Abstract
We studied the genetic relationships among East Asian populations based on allele frequency differences to clarify the relative similarities of East Asian populations with a specific focus on the relationships among the Koreans, the Japanese, and the Chinese populations known to be genetically similar. The goal is to find markers appropriate for differentiating among the specific populations. In this study, no prior data existed for Koreans and the markers were selected to differentiate Chinese and Japanese. We typed, using AB TaqMan assays, single-nucleotide polymorphisms (SNPs) at 43 highly selected mostly independent diallelic sites, on 386 individuals from eight East Asian populations (Han Chinese from San Francisco, Han Chinese from Taiwan, Hakka, Koreans, Japanese, Ami, Atayal, and Cambodians) and one Siberian population (Yakut). We inferred group membership of individuals using a model-based clustering method implemented by the STRUCTURE program and population clustering by using computer programs DISTANCE, NEIGHBOR, LSSEARCH, and DRAWTREE, respectively, calculating genetic distances among populations, calculating neighbor-joining and least-squares trees, and drawing the calculated trees. On average 52% of individuals in the three Chinese groups were assigned into one cluster, and, respectively, 78 and 69% of Koreans and Japanese into a different cluster. Koreans differentiated from the Chinese groups and clustered with the Japanese in the principal component analysis (PCA) and in the best least-squares tree. The majority of Koreans were difficult to distinguish from the Japanese. This study shows that a relatively few highly selected markers can, within limits, differentiate between closely related populations.
Collapse
Affiliation(s)
- Jong-Jin Kim
- National Institute of Scientific Investigation, DNA Analysis Division, Seoul, Korea
| | | | | | | | | | | |
Collapse
|
34
|
Adeyemo AA, Chen G, Chen Y, Rotimi C. Genetic structure in four West African population groups. BMC Genet 2005; 6:38. [PMID: 15978124 PMCID: PMC1180433 DOI: 10.1186/1471-2156-6-38] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2004] [Accepted: 06/24/2005] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Africa contains the most genetically divergent group of continental populations and several studies have reported that African populations show a high degree of population stratification. In this regard, it is important to investigate the potential for population genetic structure or stratification in genetic epidemiology studies involving multiple African populations. The presences of genetic sub-structure, if not properly accounted for, have been reported to lead to spurious association between a putative risk allele and a disease. Within the context of the Africa America Diabetes Mellitus (AADM) Study (a genetic epidemiologic study of type 2 diabetes mellitus in West Africa), we have investigated population structure or stratification in four ethnic groups in two countries (Akan and Gaa-Adangbe from Ghana, Yoruba and Igbo from Nigeria) using data from 372 autosomal microsatellite loci typed in 493 unrelated persons (986 chromosomes). RESULTS There was no significant population genetic structure in the overall sample. The smallest probability is associated with an inferred cluster of 1 and little of the posterior probability is associated with a higher number of inferred clusters. The distribution of members of the sample to inferred clusters is consistent with this finding; roughly the same proportion of individuals from each group is assigned to each cluster with little variation between the ethnic groups. Analysis of molecular variance (AMOVA) showed that the between-population component of genetic variance is less than 0.1% in contrast to 99.91% for the within population component. Pair-wise genetic distances between the four ethnic groups were also very similar. Nonetheless, the small between-population genetic variance was sufficient to distinguish the two Ghanaian groups from the two Nigerian groups. CONCLUSION There was little evidence for significant population substructure in the four major West African ethnic groups represented in the AADM study sample. Ethnicity apparently did not introduce differential allele frequencies that may affect analysis and interpretation of linkage and association studies. These findings, although not entirely surprising given the geographical proximity of these groups, provide important insights into the genetic relationships between the ethnic groups studied and confirm previous results that showed close genetic relationship between most studied West African groups.
Collapse
Affiliation(s)
- Adebowale A Adeyemo
- College of Medicine, University of Ibadan, Ibadan. Nigeria
- National Human Genome Center, Howard University, Washington DC, USA
| | - Guanjie Chen
- National Human Genome Center, Howard University, Washington DC, USA
| | - Yuanxiu Chen
- National Human Genome Center, Howard University, Washington DC, USA
| | - Charles Rotimi
- National Human Genome Center, Howard University, Washington DC, USA
| |
Collapse
|
35
|
Evans DM, Cardon LR. A comparison of linkage disequilibrium patterns and estimated population recombination rates across multiple populations. Am J Hum Genet 2005; 76:681-7. [PMID: 15719321 PMCID: PMC1199305 DOI: 10.1086/429274] [Citation(s) in RCA: 122] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2004] [Accepted: 01/28/2005] [Indexed: 11/04/2022] Open
Abstract
Large-scale studies of linkage disequilibrium (LD) have shown considerable variation in the extent and distribution of pairwise LD within and between populations. Taken at face value, these results suggest that genomewide LD maps for one population may not be generalizable to other populations. However, at least part of this diversity is due to some undesirable features of pairwise LD measures, which are well documented for the D' and r2 measures. In this report, we compare patterns of LD derived from pairwise measures with statistical estimates of population recombination rates ( rho ) along a 10-Mb stretch of chromosome 20 in four population samples, comprising East Asians, African Americans, and U.K. and U.S. individuals of western European descent. The results reveal the expected variability of D' within and between populations but show better concordance in estimates of r2 for the same markers across the population samples. Estimates of rho correlate well across populations, but there is still evidence of population-specific spikes and troughs in rho values. We conclude that it is unlikely that a single haplotype map will provide a definitive guide for association studies of many populations; rather, multiple maps will need to be constructed to provide the best-possible guides for gene mapping.
Collapse
Affiliation(s)
- David M Evans
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom.
| | | |
Collapse
|
36
|
Sivakumaran TA, Shen P, Wall DP, Do BH, Kucheria K, Oefner PJ. Conservation of theRB1gene in human and primates. Hum Mutat 2005; 25:396-409. [PMID: 15776430 DOI: 10.1002/humu.20154] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Mutations in the RB1 gene are associated with retinoblastoma, which has served as an important model for understanding hereditary predisposition to cancer. Despite the great scrutiny that RB1 has enjoyed as the prototypical tumor suppressor gene, it has never been the object of a comprehensive survey of sequence variation in diverse human populations and primates. Therefore, we analyzed the coding (2,787 bp) and adjacent intronic and untranslated (7,313 bp) sequences of RB1 in 137 individuals from a wide range of ethnicities, including 19 Asian Indian hereditary retinoblastoma cases, and five primate species. Aside from nine apparently disease-associated mutations, 52 variants were identified. They included six singleton, coding variants that comprised five amino acid replacements and one silent site. Nucleotide diversity of the coding region (pi=0.0763+/-1.35 x 10(-4)) was 52 times lower than that of the noncoding regions (pi=3.93+/-5.26 x 10(-4)), indicative of significant sequence conservation. The occurrence of purifying selection was corroborated by phylogeny-based maximum likelihood analysis of the RB1 sequences of human and five primates, which yielded an estimated ratio of replacement to silent substitutions (omega) of 0.095 across all lineages. RB1 displayed extensive linkage disequilibrium over 174 kb, and only four unique recombination events, two in Africa and one each in Europe and Southwest Asia, were observed. Using a parsimony approach, 15 haplotypes could be inferred. Ten were found in Africa, though only 12.4% of the 274 chromosomes screened were of African origin. In non-Africans, a single haplotype accounted for from 63 to 84% of all chromosomes, most likely the consequence of natural selection and a significant bottleneck in effective population size during the colonization of the non-African continents.
Collapse
Affiliation(s)
- Theru A Sivakumaran
- Division of Genetics, Department of Anatomy, All India Institute of Medical Sciences, New Delhi, India
| | | | | | | | | | | |
Collapse
|
37
|
Abstract
In this article, the authors argue that the overwhelming portion of the literature on intelligence, race, and genetics is based on folk taxonomies rather than scientific analysis. They suggest that because theorists of intelligence disagree as to what it is, any consideration of its relationships to other constructs must be tentative at best. They further argue that race is a social construction with no scientific definition. Thus, studies of the relationship between race and other constructs may serve social ends but cannot serve scientific ends. No gene has yet been conclusively linked to intelligence, so attempts to provide a compelling genetic link of race to intelligence are not feasible at this time. The authors also show that heritability, a behavior-genetic concept, is inadequate in regard to providing such a link.
Collapse
|
38
|
Khrunin AV, Tarskaia LA, Spitsyn VA, Lylova OI, Bebyakova NA, Mikulich AI, Limborska SA. p53 polymorphisms in Russia and Belarus: correlation of the 2-1-1 haplotype frequency with longitude. Mol Genet Genomics 2005; 272:666-72. [PMID: 15654624 DOI: 10.1007/s00438-004-1091-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2004] [Accepted: 11/16/2004] [Indexed: 12/01/2022]
Abstract
Four different polymorphisms in the human p53 gene (a 16-bp duplication in intron 3, and three RFLPs: for Bsh1236I at codon 72, for MspI in intron 6 and for BamHI in the 3' flanking region) and extended haplotypes were studied in nine geographically diverse populations from Russia and Belarus. The Yakuts differed from all other populations, as they had a significantly higher frequency of the BamHI A1 allele. Most populations did not differ significantly from each other in the frequency of the Bsh1236I polymorphism. The 16-bp duplication A1 allele and MspI A2 allele frequencies were significantly higher in the Yakut and Khant populations. Linkage disequilibrium values (D') between BamHI and other polymorphic sites were not significant in many cases; for this reason we have used the 16 bp-Bsh1236I-MspI haplotype frequencies only. Of eight possible haplotypes, five were observed in the populations investigated. Haplotype 1-2-2 was the most frequent in all populations. The next most common haplotype, 1-1-2, was present at very similar frequencies among the Byelorussians and Russians from Smolensk, but was more frequent in other populations. The frequency of haplotype 2-1-1 showed a nearly continuous decrease from West to East (from 17.857% among the Byelorussians to 0.685% in the Yakuts from the Verkhoyansk) and correlated with longitude (Spearman's r = -0.8667, P = 0.0025), which may be due to natural selection and adaptation. The relationships among populations were evaluated by means of Nei's D(A) distances for the 16 bp-Bsh1236I-MspI haplotype frequencies. Based on the multidimensional scaling analysis a correlation between p53 haplotype frequencies and ethnicity is supposed.
Collapse
Affiliation(s)
- A V Khrunin
- Institute of Molecular Genetics of Russian Academy of Sciences, Kurchatov Square 2, Moscow, 123182, Russia.
| | | | | | | | | | | | | |
Collapse
|
39
|
Bhangale TR, Rieder MJ, Livingston RJ, Nickerson DA. Comprehensive identification and characterization of diallelic insertion-deletion polymorphisms in 330 human candidate genes. Hum Mol Genet 2004; 14:59-69. [PMID: 15525656 DOI: 10.1093/hmg/ddi006] [Citation(s) in RCA: 105] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Despite being the second most frequent type of polymorphism in the genome, diallelic insertion-deletion polymorphisms (indels) have received far less attention in the study of sequence variation. In this report, we describe an approach that can detect indels in the heterozygous state and can comprehensively identify indels in the target sequence. Using this approach, we identified 2393 indels in a set of 330 candidate genes, i.e. an average of seven indels per gene with about two indels per gene being common (minor allele frequency >or=0.1). We compared the population genetic characteristics of indels with substitutions in this data. Our data supported the findings that deletions occur more frequently in the human genome. 5'-UTR and coding regions of the genes showed a significantly lower diversity for indels compared with other regions, suggesting differences in effects of selection on indels and substitutions. Sequence diversity and pairwise linkage disequilibrium (LD) findings of the different populations were similar to earlier results and included a greater skew towards low-frequency variants and a faster rate of LD decay in the African-descent population compared with the non-African populations. Within populations, the allele frequency spectra and LD-decay profiles for indels were similar to substitutions. Overall, the findings suggest that, although the mechanisms giving rise to indels may be different from those causing substitutions, the evolutionary histories of indels and substitutions are similar, and that indels can play a valuable role in association studies and marker selection strategies.
Collapse
Affiliation(s)
- Tushar R Bhangale
- Department of Bioengineering, University of Washington, Seattle, WA 98195-7730, USA
| | | | | | | |
Collapse
|
40
|
Implications of biogeography of human populations for 'race' and medicine. Nat Genet 2004; 36:S21-7. [DOI: 10.1038/ng1438] [Citation(s) in RCA: 332] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2004] [Accepted: 09/09/2004] [Indexed: 11/08/2022]
|
41
|
Bertranpetit J, Calafell F, Comas D, González-Neira A, Navarro A. Structure of linkage disequilibrium in humans: genome factors and population stratification. COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY 2004; 68:79-88. [PMID: 15338606 DOI: 10.1101/sqb.2003.68.79] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Affiliation(s)
- J Bertranpetit
- Unitat de Biologia Evolutiva, Universitat Pompeu Fabra, 08003 Barcelona, Catalonia, Spain
| | | | | | | | | |
Collapse
|
42
|
Paschou P, Feng Y, Pakstis AJ, Speed WC, DeMille MM, Kidd JR, Jaghori B, Kurlan R, Pauls DL, Sandor P, Barr CL, Kidd KK. Indications of linkage and association of Gilles de la Tourette syndrome in two independent family samples: 17q25 is a putative susceptibility region. Am J Hum Genet 2004; 75:545-60. [PMID: 15303240 PMCID: PMC1182043 DOI: 10.1086/424389] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2004] [Accepted: 07/09/2004] [Indexed: 02/04/2023] Open
Abstract
Gilles de la Tourette syndrome (GTS) is characterized by multiple motor and phonic tics and high comorbidity rates with other neurobehavioral disorders. It is hypothesized that frontal-subcortical pathways and a complex genetic background are involved in the etiopathogenesis of the disorder. The genetic basis of GTS remains elusive. However, several genomic regions have been implicated. Among them, 17q25 appears to be of special interest, as suggested by various independent investigators. In the present study, we explored the possibility that 17q25 contributes to the genetic component of GTS. The initial scan of chromosome 17 performed on two large pedigrees provided a nonparametric LOD score of 2.41 near D17S928. Fine mapping with 17 additional microsatellite markers increased the peak to 2.61 (P=.002). The original families, as well as two additional pedigrees, were genotyped for 25 single-nucleotide polymorphisms (SNPs), with a focus on three genes in the indicated region that could play a role in the development of GTS, on the basis of their function and expression profile. Multiple three-marker haplotypes spanning all three genes studied provided highly significant association results (P<.001). An independent sample of 96 small families with one or two children affected with GTS was also studied. Of the 25 SNPs, 3 were associated with GTS at a statistically significant level. The transmission/disequilibrium test for a three-site haplotype moving window again provided multiple positive results. The background linkage disequilibrium (LD) of the region was studied in eight populations of European origin. A complicated pattern was revealed, with the pairwise tests producing unexpectedly high LD values at the telomeric TBCD gene. In conclusion, our findings warrant the further investigation of 17q25 as a candidate susceptibility region for GTS.
Collapse
Affiliation(s)
- P Paschou
- Department of Genetics, Yale University School of Medicine, New Haven, CT 06520-8005, USA.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Abstract
Over the past century researchers have identified normal genetic variation and studied that variation in diverse human populations to determine the amounts and distributions of that variation. That information is being used to develop an understanding of the demographic histories of the different populations and the species as a whole, among other studies. With the advent of DNA-based markers in the last quarter century, these studies have accelerated. One of the challenges for the next century is to understand that variation. One component of that understanding will be population genetics. We present here examples of many of the ways these new data can be analyzed from a population perspective using results from our laboratory on multiple individual DNA-based polymorphisms, many clustered in haplotypes, studied in multiple populations representing all major geographic regions of the world. These data support an "out of Africa" hypothesis for human dispersal around the world and begin to refine the understanding of population structures and genetic relationships. We are also developing baseline information against which we can compare findings at different loci to aid in the identification of loci subject, now and in the past, to selection (directional or balancing). We do not yet have a comprehensive understanding of the extensive variation in the human genome, but some of that understanding is coming from population genetics.
Collapse
Affiliation(s)
- K K Kidd
- Department of Genetics, Yale University School of Medicine, 333 Cedar St., New Haven, CT 06520-8005, USA.
| | | | | | | |
Collapse
|
44
|
Oota H, Pakstis AJ, Bonne-Tamir B, Goldman D, Grigorenko E, Kajuna SLB, Karoma NJ, Kungulilo S, Lu RB, Odunsi K, Okonofua F, Zhukova OV, Kidd JR, Kidd KK. The evolution and population genetics of the ALDH2 locus: random genetic drift, selection, and low levels of recombination. Ann Hum Genet 2004; 68:93-109. [PMID: 15008789 DOI: 10.1046/j.1529-8817.2003.00060.x] [Citation(s) in RCA: 101] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The catalytic deficiency of human aldehyde dehydrogenase 2 (ALDH2) is caused by a nucleotide substitution (G1510A; Glu487Lys) in exon 12 of the ALDH2 locus. This SNP, and four non-coding SNPs, including one in the promoter, span 40 kb of ALDH2; these and one downstream STRP have been tested in 37 worldwide populations. Only four major SNP-defined haplotypes account for almost all chromosomes in all populations. A fifth haplotype harbours the functional variant and is only found in East Asians. Though the SNPs showed virtually no historic recombination, LD values are quite variable because of varying haplotype frequencies, demonstrating that LD is a statistical abstraction and not a fundamental aspect of the genome, and is not a function solely of recombination. Among populations, different sets of tagging SNPs, sometimes not overlapping, can be required to identify the common haplotypes. Thus, solely because haplotype frequencies vary, there is no common minimum set of tagging SNPs globally applicable. The Fst values of the promoter region SNP and the functional SNP were about two S.D. above the mean for a reference distribution of 117 autosomal biallelic markers. These high Fst values may indicate selection has operated at these or very tightly linked sites.
Collapse
Affiliation(s)
- Hiroki Oota
- Department of Genetics, Yale University School of Medicine, 333 Cedar Street, PO Box 208005, New Haven, CT 06520-8005, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
45
|
Abstract
The introduction of molecular markers in genetic analysis has revolutionized medicine. These molecular markers are genetic variations associated with a predisposition to common diseases and individual variations in drug responses. Identification and genotyping a vast number of genetic polymorphisms in large populations are increasingly important for disease gene identification, pharmacogenetics and population-based studies. Among variations being analyzed, single nucleotide polymorphisms seem to be most useful in large-scale genetic analysis. This review discusses approaches for genetic analysis, use of different markers, and emerging technologies for large-scale genetic analysis where millions of genotyping need to be performed.
Collapse
Affiliation(s)
- Elahe Elahi
- Faculty of Science, Tehran University, Tehran, Iran
| | | | | |
Collapse
|
46
|
Abstract
Genomics resources that use samples from identified populations raise scientific, social and ethical issues that are, in many ways, inextricably linked. Scientific decisions about which populations to sample to produce the HapMap, an international genetic variation resource, have raised questions about the relationships between the social identities used to recruit participants and the biological findings of studies that will use the HapMap. The sometimes problematic implications of those complex relationships have led to questions about how to conduct genetic variation research that uses identified populations in an ethical way, including how to involve members of a population in evaluating the risks and benefits posed for everyone who shares that identity. The ways in which these issues are linked is increasingly drawing the scientific and ethical spheres of genomics research closer together.
Collapse
|
47
|
Osier MV, Lu RB, Pakstis AJ, Kidd JR, Huang SY, Kidd KK. Possible epistatic role of ADH7 in the protection against alcoholism. Am J Med Genet B Neuropsychiatr Genet 2004; 126B:19-22. [PMID: 15048643 DOI: 10.1002/ajmg.b.20136] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
In recent studies of the role of the alcohol dehydrogenase genes (ADH) in alcoholism the ADH1B Arg47His polymorphism appears to affect risk via a protective effect associated with the ADH1B*47His. Here we present evidence for an additional effect from outside the Class I ADH genes, presumably from functional variation at the ADH7 gene. The protective effect is restricted to one of two haplotypes identical at ADH1B but differing at an intronic SNP at ADH7 suggesting epistasis or strong linkage disequilibrium (LD).
Collapse
Affiliation(s)
- M V Osier
- Department of Genetics, SHM I-353, Yale University School of Medicine, 3334 Cedar Street, New Haven, CT 06520, USA
| | | | | | | | | | | |
Collapse
|
48
|
Hendrie HC, Hall KS, Ogunniyi A, Gao S. Alzheimer's disease, genes, and environment: the value of international studies. CANADIAN JOURNAL OF PSYCHIATRY. REVUE CANADIENNE DE PSYCHIATRIE 2004; 49:92-9. [PMID: 15065742 DOI: 10.1177/070674370404900203] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
OBJECTIVE To describe the construction of a disease model incorporating both genetic an environmental factors in the etiology of Alzheimer's disease (AD), using data generated from the Indianapolis-Ibadan dementia project (I-IDP). METHOD The I-IDP is a longitudinal comparative study of the prevalence and incidence o dementia in 2 communities: elderly African Americans living in Indianapolis, Indiana, an Yoruba living in Ibadan, Nigeria. RESULTS African Americans are more than twice as likely as Yoruba to develop AD. Possible explanations for this finding include genetic factors: the possession of the apolipoprotein E epsilon4 allele does not increase risk for AD among Yoruba but confers a sligh increase in AD risk for African Americans. As well, environmental factors may play a role: African Americans have a higher risk of vascular risk factors than do Yoruba. CONCLUSIONS International comparative studies, particularly those involving population from developing and developed countries, offer a unique opportunity for applying new in formation regarding population genetics to traditional AD risk factor research.
Collapse
Affiliation(s)
- Hugh C Hendrie
- Department of Psychiatry, Indiana University School of Medicine, Indiana University Center for Aging Research, Indianapolis 46202-2872, USA.
| | | | | | | |
Collapse
|
49
|
Abstract
DNA and RNA quantifications are widely used in biological and biomedical research. In the last ten years, many technologies have been developed to enable automated and high-throughput analyses. In this review, we first give a brief overview of how DNA and RNA quantifications are carried out. Then, five technologies (microarrays, SAGE, differential display, real time PCR and real competitive PCR) are introduced, with an emphasis on how these technologies can be applied and what their limitations are. The technologies are also evaluated in terms of a few key aspects of nucleic acids quantification such as accuracy, sensitivity, specificity, cost and throughput.
Collapse
Affiliation(s)
- Chunming Ding
- Bioinformatics Program and Center for Advanced Biotechnology, Boston University, Boston, MA 02215, USA.
| | | |
Collapse
|
50
|
Tishkoff SA, Verrelli BC. Patterns of human genetic diversity: implications for human evolutionary history and disease. Annu Rev Genomics Hum Genet 2003; 4:293-340. [PMID: 14527305 DOI: 10.1146/annurev.genom.4.070802.110226] [Citation(s) in RCA: 234] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Since the completion of the human genome sequencing project, the discovery and characterization of human genetic variation is a principal focus for future research. Comparative studies across ethnically diverse human populations and across human and nonhuman primate species is important for reconstructing human evolutionary history and for understanding the genetic basis of human disease. In this review, we summarize data on patterns of human genetic diversity and the evolutionary forces (mutation, genetic drift, migration, and selection) that have shaped these patterns of variation across both human populations and the genome. African population samples typically have higher levels of genetic diversity, a complex population substructure, and low levels of linkage disequilibrium (LD) relative to non-African populations. We discuss these differences and their implications for mapping disease genes and for understanding how population and genomic diversity have been important in the evolution, differentiation, and adaptation of humans.
Collapse
Affiliation(s)
- Sarah A Tishkoff
- Department of Biology, University of Maryland, College Park, Maryland 20742, USA.
| | | |
Collapse
|