1
|
Yilmaz F, Gurusamy U, Mosley TJ, Hallast P, Kim K, Mostovoy Y, Purcell RH, Shaikh TH, Zwick ME, Kwok PY, Lee C, Mulle JG. High level of complexity and global diversity of the 3q29 locus revealed by optical mapping and long-read sequencing. Genome Med 2023; 15:35. [PMID: 37165454 PMCID: PMC10170684 DOI: 10.1186/s13073-023-01184-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Accepted: 04/20/2023] [Indexed: 05/12/2023] Open
Abstract
BACKGROUND High sequence identity between segmental duplications (SDs) can facilitate copy number variants (CNVs) via non-allelic homologous recombination (NAHR). These CNVs are one of the fundamental causes of genomic disorders such as the 3q29 deletion syndrome (del3q29S). There are 21 protein-coding genes lost or gained as a result of such recurrent 1.6-Mbp deletions or duplications, respectively, in the 3q29 locus. While NAHR plays a role in CNV occurrence, the factors that increase the risk of NAHR at this particular locus are not well understood. METHODS We employed an optical genome mapping technique to characterize the 3q29 locus in 161 unaffected individuals, 16 probands with del3q29S and their parents, and 2 probands with the 3q29 duplication syndrome (dup3q29S). Long-read sequencing-based haplotype resolved de novo assemblies from 44 unaffected individuals, and 1 trio was used for orthogonal validation of haplotypes and deletion breakpoints. RESULTS In total, we discovered 34 haplotypes, of which 19 were novel haplotypes. Among these 19 novel haplotypes, 18 were detected in unaffected individuals, while 1 novel haplotype was detected on the parent-of-origin chromosome of a proband with the del3q29S. Phased assemblies from 44 unaffected individuals enabled the orthogonal validation of 20 haplotypes. In 89% (16/18) of the probands, breakpoints were confined to paralogous copies of a 20-kbp segment within the 3q29 SDs. In one del3q29S proband, the breakpoint was confined to a 374-bp region using long-read sequencing. Furthermore, we categorized del3q29S cases into three classes and dup3q29S cases into two classes based on breakpoints. Finally, we found no evidence of inversions in parent-of-origin chromosomes. CONCLUSIONS We have generated the most comprehensive haplotype map for the 3q29 locus using unaffected individuals, probands with del3q29S or dup3q29S, and available parents, and also determined the deletion breakpoint to be within a 374-bp region in one proband with del3q29S. These results should provide a better understanding of the underlying genetic architecture that contributes to the etiology of del3q29S and dup3q29S.
Collapse
Affiliation(s)
- Feyza Yilmaz
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, 06032, USA
| | - Umamaheswaran Gurusamy
- Cardiovascular Research Institute and Institute for Human Genetics, UCSF School of Medicine, 513 Parnassus Ave, San Francisco, CA, 94143, USA
| | - Trenell J Mosley
- Graduate Program in Genetics and Molecular Biology, Laney Graduate School, Emory University, 201 Dowman Drive, Atlanta, GA, 30322, USA
| | - Pille Hallast
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, 06032, USA
| | - Kwondo Kim
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, 06032, USA
| | - Yulia Mostovoy
- Cardiovascular Research Institute and Institute for Human Genetics, UCSF School of Medicine, 513 Parnassus Ave, San Francisco, CA, 94143, USA
| | - Ryan H Purcell
- Laboratory of Translational Cell Biology, Department of Cell Biology, Emory University School of Medicine, 100 Woodruff Circle, Atlanta, GA, 30322, USA
| | - Tamim H Shaikh
- Department of Pediatrics, Section of Genetics and Metabolism, University of Colorado School of Medicine, 13123 E 16Th Ave, Aurora, CO, 80045, USA
| | - Michael E Zwick
- Department of Genetics, Rutgers University-New Brunswick, Rutgers University, Piscataway, New Brunswick, NJ, 08901, USA
| | - Pui-Yan Kwok
- Cardiovascular Research Institute and Institute for Human Genetics, UCSF School of Medicine, 513 Parnassus Ave, San Francisco, CA, 94143, USA
- Department of Dermatology, UCSF School of Medicine, 1701 Divisadero Street, San Francisco, CA, 94115, USA
| | - Charles Lee
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, 06032, USA.
| | - Jennifer G Mulle
- Department of Psychiatry, Robert Wood Johnson Medical School, Rutgers Biomedical and Health Sciences, Rutgers University, 671 Hoes Lane, New Brunswick, NJ, 08901, USA.
| |
Collapse
|
2
|
Chander V, Mahmoud M, Hu J, Dardas Z, Grochowski CM, Dawood M, Khayat MM, Li H, Li S, Jhangiani S, Korchina V, Shen H, Weissenberger G, Meng Q, Gingras MC, Muzny DM, Doddapaneni H, Posey JE, Lupski JR, Sabo A, Murdock DR, Sedlazeck FJ, Gibbs RA. Long read sequencing and expression studies of AHDC1 deletions in Xia-Gibbs syndrome reveal a novel genetic regulatory mechanism. Hum Mutat 2022; 43:2033-2053. [PMID: 36054313 PMCID: PMC10167679 DOI: 10.1002/humu.24461] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Revised: 08/17/2022] [Accepted: 08/30/2022] [Indexed: 01/25/2023]
Abstract
Xia-Gibbs syndrome (XGS; MIM# 615829) is a rare mendelian disorder characterized by Development Delay (DD), intellectual disability (ID), and hypotonia. Individuals with XGS typically harbor de novo protein-truncating mutations in the AT-Hook DNA binding motif containing 1 (AHDC1) gene, although some missense mutations can also cause XGS. Large de novo heterozygous deletions that encompass the AHDC1 gene have also been ascribed as diagnostic for the disorder, without substantial evidence to support their pathogenicity. We analyzed 19 individuals with large contiguous deletions involving AHDC1, along with other genes. One individual bore the smallest known contiguous AHDC1 deletion (∼350 Kb), encompassing eight other genes within chr1p36.11 (Feline Gardner-Rasheed, IFI6, FAM76A, STX12, PPP1R8, THEMIS2, RPA2, SMPDL3B) and terminating within the first intron of AHDC1. The breakpoint junctions and phase of the deletion were identified using both short and long read sequencing (Oxford Nanopore). Quantification of RNA expression patterns in whole blood revealed that AHDC1 exhibited a mono-allelic expression pattern with no deficiency in overall AHDC1 expression levels, in contrast to the other deleted genes, which exhibited a 50% reduction in mRNA expression. These results suggest that AHDC1 expression in this individual is compensated by a novel regulatory mechanism and advances understanding of mutational and regulatory mechanisms in neurodevelopmental disorders.
Collapse
Affiliation(s)
- Varuna Chander
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Medhat Mahmoud
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Jianhong Hu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Zain Dardas
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | | | - Moez Dawood
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Michael M. Khayat
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - He Li
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
| | - Shoudong Li
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
| | - Shalini Jhangiani
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
| | - Viktoriya Korchina
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
| | - Hua Shen
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
| | | | - Qingchang Meng
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
| | - Marie-Claude Gingras
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Donna M. Muzny
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Harsha Doddapaneni
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
| | - Jennifer E. Posey
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - James R. Lupski
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
- Texas Children’s Hospital, Houston, Texas, USA
- Department of Pediatrics, Baylor College of Medicine, Houston, Texas, USA
| | - Aniko Sabo
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
| | - David R. Murdock
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Fritz J. Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
- Department of Computer Science, Rice University, Houston, Texas, USA
| | - Richard A. Gibbs
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| |
Collapse
|
3
|
Mary L, Leclerc D, Gilot D, Belaud-Rotureau MA, Jaillard S. The TALE never ends: A comprehensive overview of the role of PBX1, a TALE transcription factor, in human developmental defects. Hum Mutat 2022; 43:1125-1148. [PMID: 35451537 DOI: 10.1002/humu.24388] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 03/25/2022] [Accepted: 04/20/2022] [Indexed: 11/07/2022]
Abstract
PBX1 is a highly conserved atypical homeodomain transcription factor (TF) belonging to the TALE (three amino acid loop extension) family. Dimerized with other TALE proteins, it can interact with numerous partners and reach dozens of regulating sequences, suggesting its role as a pioneer factor. PBX1 is expressed throughout the embryonic stages (as early as the blastula stage) in vertebrates. In human, PBX1 germline variations are linked to syndromic renal anomalies (CAKUTHED). In this review, we summarized available data on PBX1 functions, PBX1-deficient animal models, and PBX1 germline variations in humans. Two types of genetic alterations were identified in PBX1 gene. PBX1 missense variations generate a severe phenotype including lung hypoplasia, cardiac malformations, and sexual development defects (DSDs). Conversely, truncating variants generate milder phenotypes (mainly cryptorchidism and deafness). We suggest that defects in PBX1 interactions with various partners, including proteins from the HOX (HOXA7, HOXA10, etc.), WNT (WNT9B, WNT3), and Polycomb (BMI1, EED) families are responsible for abnormal proliferation and differentiation of the embryonic mesenchyme. These alterations could explain most of the defects observed in humans. However, some phenotype variability (especially DSDs) remains poorly understood. Further studies are needed to explore the TALE family in greater depth.
Collapse
Affiliation(s)
- Laura Mary
- Service de Cytogénétique et Biologie Cellulaire, CHU Rennes, Rennes, France
- INSERM, EHESP, IRSET (Institut de recherche en santé, environnement et travail)- UMR_S 1085, Université Rennes 1, Rennes, France
| | - Delphine Leclerc
- Inserm U1242, Centre de lutte contre le cancer Eugène Marquis, Université de Rennes, Rennes, France
| | - David Gilot
- Service de Cytogénétique et Biologie Cellulaire, CHU Rennes, Rennes, France
- Inserm U1242, Centre de lutte contre le cancer Eugène Marquis, Université de Rennes, Rennes, France
| | - Marc-Antoine Belaud-Rotureau
- Service de Cytogénétique et Biologie Cellulaire, CHU Rennes, Rennes, France
- INSERM, EHESP, IRSET (Institut de recherche en santé, environnement et travail)- UMR_S 1085, Université Rennes 1, Rennes, France
| | - Sylvie Jaillard
- Service de Cytogénétique et Biologie Cellulaire, CHU Rennes, Rennes, France
- INSERM, EHESP, IRSET (Institut de recherche en santé, environnement et travail)- UMR_S 1085, Université Rennes 1, Rennes, France
| |
Collapse
|
4
|
Duan X, Pan M, Fan S. Comprehensive evaluation of structural variant genotyping methods based on long-read sequencing data. BMC Genomics 2022; 23:324. [PMID: 35461238 PMCID: PMC9034514 DOI: 10.1186/s12864-022-08548-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Accepted: 04/11/2022] [Indexed: 12/28/2022] Open
Abstract
Background Structural variants (SVs) play a crucial role in gene regulation, trait association, and disease in humans. SV genotyping has been extensively applied in genomics research and clinical diagnosis. Although a growing number of SV genotyping methods for long reads have been developed, a comprehensive performance assessment of these methods has yet to be done. Results Based on one simulated and three real SV datasets, we performed an in-depth evaluation of five SV genotyping methods, including cuteSV, LRcaller, Sniffles, SVJedi, and VaPoR. The results show that for insertions and deletions, cuteSV and LRcaller have similar F1 scores (cuteSV, insertions: 0.69–0.90, deletions: 0.77–0.90 and LRcaller, insertions: 0.67–0.87, deletions: 0.74–0.91) and are superior to other methods. For duplications, inversions, and translocations, LRcaller yields the most accurate genotyping results (0.84, 0.68, and 0.47, respectively). When genotyping SVs located in tandem repeat region or with imprecise breakpoints, cuteSV (insertions and deletions) and LRcaller (duplications, inversions, and translocations) are better than other methods. In addition, we observed a decrease in F1 scores when the SV size increased. Finally, our analyses suggest that the F1 scores of these methods reach the point of diminishing returns at 20× depth of coverage. Conclusions We present an in-depth benchmark study of long-read SV genotyping methods. Our results highlight the advantages and disadvantages of each genotyping method, which provide practical guidance for optimal application selection and prospective directions for tool improvement. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08548-y.
Collapse
Affiliation(s)
- Xiaoke Duan
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, 200438, China.,MOE Key Laboratory of Contemporary Anthropology, Department of Anthropology and Human Genetics, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Mingpei Pan
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, 200438, China.,MOE Key Laboratory of Contemporary Anthropology, Department of Anthropology and Human Genetics, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Shaohua Fan
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, 200438, China.
| |
Collapse
|
5
|
Mostovoy Y, Yilmaz F, Chow SK, Chu C, Lin C, Geiger EA, Meeks NJL, Chatfield KC, Coughlin CR, Surti U, Kwok PY, Shaikh TH. Genomic regions associated with microdeletion/microduplication syndromes exhibit extreme diversity of structural variation. Genetics 2021; 217:6066166. [PMID: 33724415 DOI: 10.1093/genetics/iyaa038] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2020] [Accepted: 12/18/2020] [Indexed: 11/12/2022] Open
Abstract
Segmental duplications (SDs) are a class of long, repetitive DNA elements whose paralogs share a high level of sequence similarity with each other. SDs mediate chromosomal rearrangements that lead to structural variation in the general population as well as genomic disorders associated with multiple congenital anomalies, including the 7q11.23 (Williams-Beuren Syndrome, WBS), 15q13.3, and 16p12.2 microdeletion syndromes. Population-level characterization of SDs has generally been lacking because most techniques used for analyzing these complex regions are both labor and cost intensive. In this study, we have used a high-throughput technique to genotype complex structural variation with a single molecule, long-range optical mapping approach. We characterized SDs and identified novel structural variants (SVs) at 7q11.23, 15q13.3, and 16p12.2 using optical mapping data from 154 phenotypically normal individuals from 26 populations comprising five super-populations. We detected several novel SVs for each locus, some of which had significantly different prevalence between populations. Additionally, we localized the microdeletion breakpoints to specific paralogous duplicons located within complex SDs in two patients with WBS, one patient with 15q13.3, and one patient with 16p12.2 microdeletion syndromes. The population-level data presented here highlights the extreme diversity of large and complex SVs within SD-containing regions. The approach we outline will greatly facilitate the investigation of the role of inter-SD structural variation as a driver of chromosomal rearrangements and genomic disorders.
Collapse
Affiliation(s)
- Yulia Mostovoy
- Cardiovascular Research Institute, UCSF School of Medicine, San Francisco, CA 94143, USA
| | - Feyza Yilmaz
- Department of Integrative Biology, University of Colorado Denver, Denver, CO 80204, USA.,Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Stephen K Chow
- Cardiovascular Research Institute, UCSF School of Medicine, San Francisco, CA 94143, USA
| | - Catherine Chu
- Cardiovascular Research Institute, UCSF School of Medicine, San Francisco, CA 94143, USA
| | - Chin Lin
- Cardiovascular Research Institute, UCSF School of Medicine, San Francisco, CA 94143, USA
| | - Elizabeth A Geiger
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Naomi J L Meeks
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Kathryn C Chatfield
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA.,Department of Pediatrics, Section of Cardiology, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Curtis R Coughlin
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | - Urvashi Surti
- Department of Pathology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Pui-Yan Kwok
- Cardiovascular Research Institute, UCSF School of Medicine, San Francisco, CA 94143, USA.,Department of Dermatology, UCSF School of Medicine, San Francisco, CA 94143, USA.,Institute for Human Genetics, UCSF School of Medicine, San Francisco, CA 94143, USA
| | - Tamim H Shaikh
- Department of Pediatrics, Section of Clinical Genetics and Metabolism, University of Colorado School of Medicine, Aurora, CO 80045, USA
| |
Collapse
|
6
|
Eaaswarkhanth M, Pathak AK, Ongaro L, Montinaro F, Hebbar P, Alsmadi O, Metspalu M, Al-Mulla F, Thanaraj TA. Unraveling a fine-scale high genetic heterogeneity and recent continental connections of an Arabian Peninsula population. Eur J Hum Genet 2021; 30:307-319. [PMID: 33753911 PMCID: PMC8904638 DOI: 10.1038/s41431-021-00861-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Revised: 02/09/2021] [Accepted: 03/03/2021] [Indexed: 11/19/2022] Open
Abstract
Recent studies have showed the diverse genetic architecture of the highly consanguineous populations inhabiting the Arabian Peninsula. Consanguinity coupled with heterogeneity is complex and makes it difficult to understand the bases of population-specific genetic diseases in the region. Therefore, comprehensive genetic characterization of the populations at the finest scale is warranted. Here, we revisit the genetic structure of the Kuwait population by analyzing genome-wide single nucleotide polymorphisms data from 583 Kuwaiti individuals sorted into three subgroups. We envisage a diverse demographic genetic history among the three subgroups based on drift and allelic sharing with modern and ancient individuals. Furthermore, our comprehensive haplotype-based analyses disclose a high genetic heterogeneity among the Kuwaiti populations. We infer the major sources of ancestry within the newly defined groups; one with an obvious predominance of sub-Saharan/Western Africa mostly comprising Kuwait-B individuals, and other with West Eurasia including Kuwait-P and Kuwait-S individuals. Overall, our results recapitulate the historical population movements and reaffirm the genetic imprints of the legacy of continental trading in the region. Such deciphering of fine-scale population structure and their regional genetic heterogeneity would provide clues to the uncharted areas of disease-gene discovery and related associations in populations inhabiting the Arabian Peninsula.
Collapse
Affiliation(s)
| | - Ajai K Pathak
- Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu, Estonia.,Department of Evolutionary Biology, Institute of Molecular and Cell Biology, Tartu, Estonia
| | - Linda Ongaro
- Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu, Estonia.,Department of Evolutionary Biology, Institute of Molecular and Cell Biology, Tartu, Estonia
| | - Francesco Montinaro
- Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu, Estonia.,Department of Biology-Genetics, University of Bari, Bari, Italy
| | - Prashantha Hebbar
- Department of Genetics and Bioinformatics, Dasman Diabetes Institute, Dasman, Kuwait
| | - Osama Alsmadi
- Department of Genetics and Bioinformatics, Dasman Diabetes Institute, Dasman, Kuwait.,Department of Cell Therapy and Applied Genomics, King Hussein Cancer Center, Amman, Jordan
| | - Mait Metspalu
- Estonian Biocentre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Fahd Al-Mulla
- Department of Genetics and Bioinformatics, Dasman Diabetes Institute, Dasman, Kuwait.
| | | |
Collapse
|
7
|
Dashti M, Alsaleh H, Eaaswarkhanth M, John SE, Nizam R, Melhem M, Hebbar P, Sharma P, Al-Mulla F, Thanaraj TA. Delineation of Mitochondrial DNA Variants From Exome Sequencing Data and Association of Haplogroups With Obesity in Kuwait. Front Genet 2021; 12:626260. [PMID: 33659027 PMCID: PMC7920096 DOI: 10.3389/fgene.2021.626260] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Accepted: 01/13/2021] [Indexed: 11/15/2022] Open
Abstract
BACKGROUND/OBJECTIVES Whole-exome sequencing is a valuable tool to determine genetic variations that are associated with rare and common health conditions. A limited number of studies demonstrated that mitochondrial DNA can be captured using whole-exome sequencing. Previous studies have suggested that mitochondrial DNA variants and haplogroup lineages are associated with obesity. Therefore, we investigated the role of mitochondrial variants and haplogroups contributing to the risk of obesity in Arabs in Kuwait using exome sequencing data. SUBJECTS/METHODS Indirect mitochondrial genomes were extracted from exome sequencing data from 288 unrelated native Arab individuals from Kuwait. The cohort was divided into obese [body mass index (BMI) ≥ 30 kg/m2] and non-obese (BMI < 30 kg/m2) groups. Mitochondrial variants were identified, and haplogroups were classified and compared with other sequencing technologies. Statistical analysis was performed to determine associations and identify mitochondrial variants and haplogroups affecting obesity. RESULTS Haplogroup R showed a protective effect on obesity [odds ratio (OR) = 0.311; P = 0.006], whereas haplogroup L individuals were at high risk of obesity (OR = 2.285; P = 0.046). Significant differences in mitochondrial variants between the obese and non-obese groups were mainly haplogroup-defining mutations and were involved in processes in energy generation. The majority of mitochondrial variants and haplogroups extracted from exome were in agreement with technical replica from Sanger and whole-genome sequencing. CONCLUSIONS This is the first to utilize whole-exome data to extract entire mitochondrial haplogroups to study its association with obesity in an Arab population.
Collapse
Affiliation(s)
- Mohammed Dashti
- Genetics and Bioinformatics Department, Dasman Diabetes Institute, Kuwait City, Kuwait
| | - Hussain Alsaleh
- Kuwait Identification DNA Laboratory, General Department of Criminal Evidence, Ministry of Interior, Kuwait City, Kuwait
| | | | - Sumi Elsa John
- Genetics and Bioinformatics Department, Dasman Diabetes Institute, Kuwait City, Kuwait
| | - Rasheeba Nizam
- Genetics and Bioinformatics Department, Dasman Diabetes Institute, Kuwait City, Kuwait
| | - Motasem Melhem
- Genetics and Bioinformatics Department, Dasman Diabetes Institute, Kuwait City, Kuwait
| | - Prashantha Hebbar
- Genetics and Bioinformatics Department, Dasman Diabetes Institute, Kuwait City, Kuwait
| | - Prem Sharma
- Department Special Services Facilities, Dasman Diabetes Institute, Kuwait City, Kuwait
| | - Fahd Al-Mulla
- Genetics and Bioinformatics Department, Dasman Diabetes Institute, Kuwait City, Kuwait
| | | |
Collapse
|
8
|
Jiang T, Liu Y, Jiang Y, Li J, Gao Y, Cui Z, Liu Y, Liu B, Wang Y. Long-read-based human genomic structural variation detection with cuteSV. Genome Biol 2020; 21:189. [PMID: 32746918 PMCID: PMC7477834 DOI: 10.1186/s13059-020-02107-y] [Citation(s) in RCA: 132] [Impact Index Per Article: 33.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Accepted: 07/14/2020] [Indexed: 01/01/2023] Open
Abstract
Long-read sequencing is promising for the comprehensive discovery of structural variations (SVs). However, it is still non-trivial to achieve high yields and performance simultaneously due to the complex SV signatures implied by noisy long reads. We propose cuteSV, a sensitive, fast, and scalable long-read-based SV detection approach. cuteSV uses tailored methods to collect the signatures of various types of SVs and employs a clustering-and-refinement method to implement sensitive SV detection. Benchmarks on simulated and real long-read sequencing datasets demonstrate that cuteSV has higher yields and scaling performance than state-of-the-art tools. cuteSV is available at https://github.com/tjiangHIT/cuteSV.
Collapse
Affiliation(s)
- Tao Jiang
- Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, Heilongjiang, China
| | - Yongzhuang Liu
- Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, Heilongjiang, China
| | - Yue Jiang
- Nebula Genomics, Harbin, 150030, Heilongjiang, China
| | - Junyi Li
- School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, 518055, Guangdong, China
| | - Yan Gao
- Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, Heilongjiang, China
| | - Zhe Cui
- Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, Heilongjiang, China
| | - Yadong Liu
- Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, Heilongjiang, China
| | - Bo Liu
- Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, Heilongjiang, China.
| | - Yadong Wang
- Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, Heilongjiang, China.
| |
Collapse
|
9
|
Daw Elbait G, Henschel A, Tay GK, Al Safar HS. Whole Genome Sequencing of Four Representatives From the Admixed Population of the United Arab Emirates. Front Genet 2020; 11:681. [PMID: 32754195 PMCID: PMC7367215 DOI: 10.3389/fgene.2020.00681] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2020] [Accepted: 06/03/2020] [Indexed: 01/21/2023] Open
Abstract
Whole genome sequences (WGS) of four nationals of the United Arab Emirates (UAE) at an average coverage of 33X have been completed and described. The selection of suitable subpopulation representatives was informed by a preceding comprehensive population structure analysis. Representatives were chosen based on their central location within the subpopulation on a principal component analysis (PCA) and the degree to which they were admixed. Novel genomic variations among the different subgroups of the UAE population are reported here. Specifically, the WGS analysis identified 4,161,067-4,798,806 variants in the four individual samples, where approximately 80% were single nucleotide polymorphisms (SNPs) and 20% were insertions or deletions (indels). An average of 2.75% was found to be novel variants according to dbSNP (build 151). This is the first report of structural variants (SV) from WGS data from UAE nationals. There were 15,677-20,339 called SVs, of which around 13.5% were novel. The four samples shared 1,399,178 variants, each with distinct variants as follows: 1,085,524 (for the individual denoted as UAE S011), 1,228,559 (UAE S012), 791,072 (UAE S013), and 906,818 (UAE S014). These results show a previously unappreciated population diversity in the region. The synergy of WGS and genotype array data was demonstrated through variant annotation of the former using 2.3 million allele frequencies for the local population derived from the latter technology platform. This novel approach of combining breadth and depth of array and WGS technologies has guided the choice of population genetic representatives and provides complementary, regionalized allele frequency annotation to new genomes comprising millions of loci.
Collapse
Affiliation(s)
- Gihan Daw Elbait
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Andreas Henschel
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.,Department of Computer Science, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Guan K Tay
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.,Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.,Division of Psychiatry, Faculty of Health and Medical Sciences, The University of Western Australia, Crawley, WA, Australia.,School of Medical and Health Sciences, Edith Cowan University, Joondalup, WA, Australia
| | - Habiba S Al Safar
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.,Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.,Department of Genetics and Molecular Biology, Collage of Medicine and Health Sciences, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| |
Collapse
|
10
|
Tay GK, Henschel A, Daw Elbait G, Al Safar HS. Genetic Diversity and Low Stratification of the Population of the United Arab Emirates. Front Genet 2020; 11:608. [PMID: 32595703 PMCID: PMC7304494 DOI: 10.3389/fgene.2020.00608] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2020] [Accepted: 05/19/2020] [Indexed: 01/09/2023] Open
Abstract
With high consanguinity rates on the Arabian Peninsula, it would not have been unexpected if the population of the United Arab Emirates (UAE) was shown to be relatively homogenous. However, this study of 1000 UAE nationals provided a contrasting perspective, one of a relatively heterogeneous population. Located at the apex of Europe, Asia, and Africa, the observed diversity could be explained by a plethora of migration patterns since the first Out-of-Africa movement. A strategy to explore the extent of genetic variation of the population of the UAE is presented. The first step involved a comprehensive population stratification study that was instructive for subsequent whole genome sequencing (WGS) of suitable representatives (which is described elsewhere). When these UAE data were compared to previous smaller studies from the region, the findings were consistent with a population that is a diverse and admixed group of people. However, rather than sharp and distinctive clusters, cluster analysis reveals low levels of stratification throughout the population. UAE emirates exhibit high within-Emirate-distance/among-Emirate distance ratios. Supervised admixture analysis showed a continuous gradient of ancestral populations, suggesting that admixture on the south eastern tip of the Arabian Peninsula occurred gradually. When visualized using a unique technique that combined admixture ratios and principal component analysis (PCA), unappreciated diversity was revealed while mitigating projection bias of conventional PCA. We observe low population stratification in the UAE in terms of homozygosity versus separation cluster coefficients. This holds for the UAE in a global context as well as for isolated cluster analysis of the Emirati birthplaces. However, the subtle clustering observed in the Emirates reflects geographic proximity and historic migration events. The analytical strategy used here highlights the complementary nature of data from genotype array and WGS for anthropological studies. Specifically, genotype array data were instructive to select representative subjects for WGS. Furthermore, from the 2.3 million allele frequencies obtained from genotype arrays, we identified 46,481 loci with allele frequencies that were significantly different with respect to other world populations. This comparison of allele frequencies facilitates variant prioritization in common diseases. In addition, these loci bear great potential as biomarkers in anthropological and forensic studies.
Collapse
Affiliation(s)
- Guan K Tay
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.,Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.,Faculty of Health and Medical Sciences, UWA Medical School, The University of Western Australia, Crawley, WA, Australia.,School of Medical and Health Sciences, Edith Cowan University, Joondalup, WA, Australia
| | - Andreas Henschel
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.,Department of Computer Science, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Gihan Daw Elbait
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Habiba S Al Safar
- Center for Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.,Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.,Department of Genetics and Molecular Biology, College of Medicine and Health Sciences, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| |
Collapse
|
11
|
Eaaswarkhanth M, dos Santos ALC, Gokcumen O, Al-Mulla F, Thanaraj TA. Genome-Wide Selection Scan in an Arabian Peninsula Population Identifies a TNKS Haplotype Linked to Metabolic Traits and Hypertension. Genome Biol Evol 2020; 12:77-87. [PMID: 32068798 PMCID: PMC7093833 DOI: 10.1093/gbe/evaa033] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/13/2020] [Indexed: 12/12/2022] Open
Abstract
Despite the extreme and varying environmental conditions prevalent in the Arabian Peninsula, it has experienced several waves of human migrations following the out-of-Africa diaspora. Eventually, the inhabitants of the peninsula region adapted to the hot and dry environment. The adaptation and natural selection that shaped the extant human populations of the Arabian Peninsula region have been scarcely studied. In an attempt to explore natural selection in the region, we analyzed 662,750 variants in 583 Kuwaiti individuals. We searched for regions in the genome that display signatures of positive selection in the Kuwaiti population using an integrative approach in a conservative manner. We highlight a haplotype overlapping TNKS that showed strong signals of positive selection based on the results of the multiple selection tests conducted (integrated Haplotype Score, Cross Population Extended Haplotype Homozygosity, Population Branch Statistics, and log-likelihood ratio scores). Notably, the TNKS haplotype under selection potentially conferred a fitness advantage to the Kuwaiti ancestors for surviving in the harsh environment while posing a major health risk to present-day Kuwaitis.
Collapse
Affiliation(s)
| | - Andre Luiz Campelo dos Santos
- Department of Biological Sciences, State University of New York at Buffalo, Buffalo
- Department of Archeology, Federal University of Pernambuco, Recife, Brazil
| | - Omer Gokcumen
- Department of Biological Sciences, State University of New York at Buffalo, Buffalo
| | - Fahd Al-Mulla
- Department of Genetics and Bioinformatics, Dasman Diabetes Institute, Kuwait City, Kuwait
| | | |
Collapse
|
12
|
AlSafar HS, Al-Ali M, Elbait GD, Al-Maini MH, Ruta D, Peramo B, Henschel A, Tay GK. Introducing the first whole genomes of nationals from the United Arab Emirates. Sci Rep 2019; 9:14725. [PMID: 31604968 PMCID: PMC6789106 DOI: 10.1038/s41598-019-50876-9] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2018] [Accepted: 09/20/2019] [Indexed: 12/30/2022] Open
Abstract
Whole Genome Sequencing (WGS) provides an in depth description of genome variation. In the era of large-scale population genome projects, the assembly of ethnic-specific genomes combined with mapping human reference genomes of underrepresented populations has improved the understanding of human diversity and disease associations. In this study, for the first time, whole genome sequences of two nationals of the United Arab Emirates (UAE) at >27X coverage are reported. The two Emirati individuals were predominantly of Central/South Asian ancestry. An in-house customized pipeline using BWA, Picard followed by the GATK tools to map the raw data from whole genome sequences of both individuals was used. A total of 3,994,521 variants (3,350,574 Single Nucleotide Polymorphisms (SNPs) and 643,947 indels) were identified for the first individual, the UAE S001 sample. A similar number of variants, 4,031,580 (3,373,501 SNPs and 658,079 indels), were identified for UAE S002. Variants that are associated with diabetes, hypertension, increased cholesterol levels, and obesity were also identified in these individuals. These Whole Genome Sequences has provided a starting point for constructing a UAE reference panel which will lead to improvements in the delivery of precision medicine, quality of life for affected individuals and a reduction in healthcare costs. The information compiled will likely lead to the identification of target genes that could potentially lead to the development of novel therapeutic modalities.
Collapse
Affiliation(s)
- Habiba S AlSafar
- Center of Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.,Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.,College of Medicine and Health Sciences, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Mariam Al-Ali
- Center of Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.,Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Gihan Daw Elbait
- Center of Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | | | - Dymitr Ruta
- Etisalat-British Telecom Innovation Center, Abu Dhabi, United Arab Emirates
| | | | - Andreas Henschel
- Center of Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates.,Department of Computer Science, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
| | - Guan K Tay
- Center of Biotechnology, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates. .,Department of Biomedical Engineering, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates. .,College of Medicine and Health Sciences, Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates. .,School of Psychiatry and Clinical Neurosciences, University of Western Australia, Nedlands, Australia. .,School of Medical and Health Sciences, Edith Cowan University, Joondalup, Australia.
| |
Collapse
|
13
|
Amin AM, Mostafa H, Arif NH, Abdul Kader MASK, Kah Hay Y. Metabolomics profiling and pathway analysis of human plasma and urine reveal further insights into the multifactorial nature of coronary artery disease. Clin Chim Acta 2019; 493:112-122. [DOI: 10.1016/j.cca.2019.02.030] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2019] [Revised: 02/25/2019] [Accepted: 02/27/2019] [Indexed: 12/21/2022]
|
14
|
Hebbar P, Abubaker JA, Abu-Farha M, Tuomilehto J, Al-Mulla F, Thanaraj TA. A Perception on Genome-Wide Genetic Analysis of Metabolic Traits in Arab Populations. Front Endocrinol (Lausanne) 2019; 10:8. [PMID: 30761081 PMCID: PMC6362414 DOI: 10.3389/fendo.2019.00008] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Accepted: 01/09/2019] [Indexed: 12/16/2022] Open
Abstract
Despite dedicated nation-wide efforts to raise awareness against the harmful effects of fast-food consumption and sedentary lifestyle, the Arab population continues to struggle with an increased risk for metabolic disorders. Unlike the European population, the Arab population lacks well-established genetic risk determinants for metabolic disorders, and the transferability of established risk loci to this population has not been satisfactorily demonstrated. The most recent findings have identified over 240 genetic risk loci (with ~400 independent association signals) for type 2 diabetes, but thus far only 25 risk loci (ADAMTS9, ALX4, BCL11A, CDKAL1, CDKN2A/B, COL8A1, DUSP9, FTO, GCK, GNPDA2, HMG20A, HNF1A, HNF1B, HNF4A, IGF2BP2, JAZF1, KCNJ11, KCNQ1, MC4R, PPARγ, SLC30A8, TCF7L2, TFAP2B, TP53INP1, and WFS1) have been replicated in Arab populations. To our knowledge, large-scale population- or family-based association studies are non-existent in this region. Recently, we conducted genome-wide association studies on Arab individuals from Kuwait to delineate the genetic determinants for quantitative traits associated with anthropometry, lipid profile, insulin resistance, and blood pressure levels. Although these studies led to the identification of novel recessive variants, they failed to reproduce the established loci. However, they provided insights into the genetic architecture of the population, the applicability of genetic models based on recessive mode of inheritance, the presence of genetic signatures of inbreeding due to the practice of consanguinity, and the pleiotropic effects of rare disorders on complex metabolic disorders. This perspective presents analysis strategies and study designs for identifying genetic risk variants associated with diabetes and related traits in Arab populations.
Collapse
Affiliation(s)
- Prashantha Hebbar
- Genetics and Bioinformatics Unit, Dasman Diabetes Institute, Kuwait City, Kuwait
- Doctoral Program in Population Health, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Jehad Ahmed Abubaker
- Genetics and Bioinformatics Unit, Dasman Diabetes Institute, Kuwait City, Kuwait
| | - Mohamed Abu-Farha
- Genetics and Bioinformatics Unit, Dasman Diabetes Institute, Kuwait City, Kuwait
| | - Jaakko Tuomilehto
- Genetics and Bioinformatics Unit, Dasman Diabetes Institute, Kuwait City, Kuwait
| | - Fahd Al-Mulla
- Genetics and Bioinformatics Unit, Dasman Diabetes Institute, Kuwait City, Kuwait
- *Correspondence: Fahd Al-Mulla
| | | |
Collapse
|
15
|
Abstract
BACKGROUND Numerous different types of variations can occur in DNA and have diverse effects and consequences. The Variation Ontology (VariO) was developed for systematic descriptions of variations and their effects at DNA, RNA and protein levels. RESULTS VariO use and terms for DNA variations are described in depth. VariO provides systematic names for variation types and detailed descriptions for changes in DNA function, structure and properties. The principles of VariO are presented along with examples from published articles or databases, most often in relation to human diseases. VariO terms describe local DNA changes, chromosome number and structure variants, chromatin alterations, as well as genomic changes, whether of genetic or non-genetic origin. CONCLUSIONS DNA variation systematics facilitates unambiguous descriptions of variations and their effects and further reuse and integration of data from different sources by both human and computers.
Collapse
Affiliation(s)
- Mauno Vihinen
- Department of Experimental Medical Science, Lund University, BMC B13, SE-22184, Lund, Sweden.
| |
Collapse
|
16
|
Assessment of coding region variants in Kuwaiti population: implications for medical genetics and population genomics. Sci Rep 2018; 8:16583. [PMID: 30409984 PMCID: PMC6224454 DOI: 10.1038/s41598-018-34815-8] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2017] [Accepted: 10/16/2018] [Indexed: 02/07/2023] Open
Abstract
Consanguineous populations of the Arabian Peninsula have been underrepresented in global efforts that catalogue human exome variability. We sequenced 291 whole exomes of unrelated, healthy native Arab individuals from Kuwait to a median coverage of 45X and characterised 170,508 single-nucleotide variants (SNVs), of which 21.7% were ‘personal’. Up to 12% of the SNVs were novel and 36% were population-specific. Half of the SNVs were rare and 54% were missense variants. The study complemented the Greater Middle East Variome by way of reporting many additional Arabian exome variants. The study corroborated Kuwaiti population genetic substructures previously derived using genome-wide genotype data and illustrated the genetic relatedness among Kuwaiti population subgroups, Middle Eastern, European and Ashkenazi Jewish populations. The study mapped 112 rare and frequent functional variants relating to pharmacogenomics and disorders (recessive and common) to the phenotypic characteristics of Arab population. Comparative allele frequency data and carrier distributions of known Arab mutations for 23 disorders seen among Arabs, of putative OMIM-listed causal mutations for 12 disorders observed among Arabs but not yet characterized for genetic basis in Arabs, and of 17 additional putative mutations for disorders characterized for genetic basis in Arab populations are presented for testing in future Arab studies.
Collapse
|
17
|
Dron JS, Wang J, Berberich AJ, Iacocca MA, Cao H, Yang P, Knoll J, Tremblay K, Brisson D, Netzer C, Gouni-Berthold I, Gaudet D, Hegele RA. Large-scale deletions of the ABCA1 gene in patients with hypoalphalipoproteinemia. J Lipid Res 2018; 59:1529-1535. [PMID: 29866657 PMCID: PMC6071767 DOI: 10.1194/jlr.p086280] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2018] [Revised: 05/21/2018] [Indexed: 01/07/2023] Open
Abstract
Copy-number variations (CNVs) have been studied in the context of familial hypercholesterolemia but have not yet been evaluated in patients with extreme levels of HDL cholesterol. We evaluated targeted, next-generation sequencing data from patients with very low levels of HDL cholesterol (i.e., hypoalphalipoproteinemia) with the VarSeq-CNV® caller algorithm to screen for CNVs that disrupted the ABCA1, LCAT, or APOA1 genes. In four individuals, we found three unique deletions in ABCA1: a heterozygous deletion of exon 4, a heterozygous deletion that spanned exons 8 to 31, and a heterozygous deletion of the entire ABCA1 gene. Breakpoints were identified with Sanger sequencing, and the full-gene deletion was confirmed by using exome sequencing and the Affymetrix CytoScan HD array. Previously, large-scale deletions in candidate HDL genes had not been associated with hypoalphalipoproteinemia; our findings indicate that CNVs in ABCA1 may be a previously unappreciated genetic determinant of low levels of HDL cholesterol. By coupling bioinformatic analyses with next-generation sequencing data, we can successfully assess the spectrum of genetic determinants of many dyslipidemias, including hypoalphalipoproteinemia.
Collapse
Affiliation(s)
- Jacqueline S Dron
- Robarts Research Institute, Schulich School of Medicine and Dentistry, Western University, London ON, Canada.,Department of Biochemistry, Schulich School of Medicine and Dentistry, Western University, London ON, Canada
| | - Jian Wang
- Robarts Research Institute, Schulich School of Medicine and Dentistry, Western University, London ON, Canada
| | - Amanda J Berberich
- Robarts Research Institute, Schulich School of Medicine and Dentistry, Western University, London ON, Canada.,Department of Biochemistry, Schulich School of Medicine and Dentistry, Western University, London ON, Canada.,Department of Medicine, Schulich School of Medicine and Dentistry, Western University, London ON, Canada
| | - Michael A Iacocca
- Robarts Research Institute, Schulich School of Medicine and Dentistry, Western University, London ON, Canada.,Department of Biochemistry, Schulich School of Medicine and Dentistry, Western University, London ON, Canada
| | - Henian Cao
- Robarts Research Institute, Schulich School of Medicine and Dentistry, Western University, London ON, Canada
| | - Ping Yang
- Department of Pathology and Laboratory Medicine, Schulich School of Medicine and Dentistry, Western University, London ON, Canada
| | - Joan Knoll
- Department of Pathology and Laboratory Medicine, Schulich School of Medicine and Dentistry, Western University, London ON, Canada
| | - Karine Tremblay
- Lipidology Unit, Community Genomic Medicine Centre and ECOGENE-21, Department of Medicine, Université de Montréal, Saguenay QC, Canada
| | - Diane Brisson
- Lipidology Unit, Community Genomic Medicine Centre and ECOGENE-21, Department of Medicine, Université de Montréal, Saguenay QC, Canada
| | | | - Ioanna Gouni-Berthold
- Polyclinic for Endocrinology, Diabetes and Preventive Medicine, University of Cologne, Germany
| | - Daniel Gaudet
- Lipidology Unit, Community Genomic Medicine Centre and ECOGENE-21, Department of Medicine, Université de Montréal, Saguenay QC, Canada
| | - Robert A Hegele
- Robarts Research Institute, Schulich School of Medicine and Dentistry, Western University, London ON, Canada .,Department of Biochemistry, Schulich School of Medicine and Dentistry, Western University, London ON, Canada.,Department of Medicine, Schulich School of Medicine and Dentistry, Western University, London ON, Canada
| |
Collapse
|
18
|
Blue EE, Bis JC, Dorschner MO, Tsuang D, Barral SM, Beecham G, Below JE, Bush WS, Butkiewicz M, Cruchaga C, DeStefano A, Farrer LA, Goate A, Haines J, Jaworski J, Jun G, Kunkle B, Kuzma A, Lee JJ, Lunetta K, Ma Y, Martin E, Naj A, Nato AQ, Navas P, Nguyen H, Reitz C, Reyes D, Salerno W, Schellenberg GD, Seshadri S, Sohi H, Thornton TA, Valladares O, van Duijn C, Vardarajan BN, Wang LS, Boerwinkle E, Dupuis J, Pericak-Vance MA, Mayeux R, Wijsman EM. Genetic Variation in Genes Underlying Diverse Dementias May Explain a Small Proportion of Cases in the Alzheimer's Disease Sequencing Project. Dement Geriatr Cogn Disord 2018; 45:1-17. [PMID: 29486463 PMCID: PMC5971141 DOI: 10.1159/000485503] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/29/2017] [Accepted: 11/20/2017] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND/AIMS The Alzheimer's Disease Sequencing Project (ADSP) aims to identify novel genes influencing Alzheimer's disease (AD). Variants within genes known to cause dementias other than AD have previously been associated with AD risk. We describe evidence of co-segregation and associations between variants in dementia genes and clinically diagnosed AD within the ADSP. METHODS We summarize the properties of known pathogenic variants within dementia genes, describe the co-segregation of variants annotated as "pathogenic" in ClinVar and new candidates observed in ADSP families, and test for associations between rare variants in dementia genes in the ADSP case-control study. The participants were clinically evaluated for AD, and they represent European, Caribbean Hispanic, and isolate Dutch populations. RESULTS/CONCLUSIONS Pathogenic variants in dementia genes were predominantly rare and conserved coding changes. Pathogenic variants within ARSA, CSF1R, and GRN were observed, and candidate variants in GRN and CHMP2B were nominated in ADSP families. An independent case-control study provided evidence of an association between variants in TREM2, APOE, ARSA, CSF1R, PSEN1, and MAPT and risk of AD. Variants in genes which cause dementing disorders may influence the clinical diagnosis of AD in a small proportion of cases within the ADSP.
Collapse
Affiliation(s)
| | | | | | - Debby Tsuang
- University of Washington
- Veterans Administration Puget Sound Health Care
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Eric Boerwinkle
- Baylor College of Medicine
- University of Texas Health Sciences Center at Houston
| | | | | | | | | |
Collapse
|
19
|
Henderson LM, Claw KG, Woodahl EL, Robinson RF, Boyer BB, Burke W, Thummel KE. P450 Pharmacogenetics in Indigenous North American Populations. J Pers Med 2018; 8:jpm8010009. [PMID: 29389890 PMCID: PMC5872083 DOI: 10.3390/jpm8010009] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2017] [Revised: 01/19/2018] [Accepted: 01/22/2018] [Indexed: 12/14/2022] Open
Abstract
Indigenous North American populations, including American Indian and Alaska Native peoples in the United States, the First Nations, Métis and Inuit peoples in Canada and Amerindians in Mexico, are historically under-represented in biomedical research, including genomic research on drug disposition and response. Without adequate representation in pharmacogenetic studies establishing genotype-phenotype relationships, Indigenous populations may not benefit fully from new innovations in precision medicine testing to tailor and improve the safety and efficacy of drug treatment, resulting in health care disparities. The purpose of this review is to summarize and evaluate what is currently known about cytochrome P450 genetic variation in Indigenous populations in North America and to highlight the importance of including these groups in future pharmacogenetic studies for implementation of personalized drug therapy.
Collapse
Affiliation(s)
- Lindsay M Henderson
- Departments of Pharmaceutics, University of Washington, Seattle, WA 98195, USA.
| | - Katrina G Claw
- Departments of Pharmaceutics, University of Washington, Seattle, WA 98195, USA.
| | - Erica L Woodahl
- Department of Biomedical and Pharmaceutical Sciences, University of Montana, Missoula, MT 59812, USA.
| | - Renee F Robinson
- Southcentral Foundation, Anchorage, AK 99508, USA.
- United States Public Health Service, Department of Human Services, Washington, DC 20201, USA.
| | - Bert B Boyer
- Center for Alaska Native Health Research, University of Alaska Fairbanks, Fairbanks, AK 99775, USA.
| | - Wylie Burke
- Bioethics & Humanities, University of Washington, Seattle, WA 98195, USA.
| | - Kenneth E Thummel
- Departments of Pharmaceutics, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
20
|
Jithesh PV, Scaria V. From genomes to genomic medicine: enabling personalized and precision medicine in the Middle East. Per Med 2017; 14:377-382. [DOI: 10.2217/pme-2017-0048] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Affiliation(s)
- Puthen Veettil Jithesh
- Division of Biomedical Informatics Research, Sidra Medical & Research Center, OPC Building, PO Box 26999, Doha, Qatar
| | - Vinod Scaria
- GN Ramachandran Knowledge Center for Genome Informatics, CSIR Institute of Genomics & Integrative Biology, Mathura Road, Delhi 110 025, India
| |
Collapse
|
21
|
Koshy R, Ranawat A, Scaria V. al mena: a comprehensive resource of human genetic variants integrating genomes and exomes from Arab, Middle Eastern and North African populations. J Hum Genet 2017. [PMID: 28638141 DOI: 10.1038/jhg.2017.67] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Middle East and North Africa (MENA) encompass very unique populations, with a rich history and encompasses characteristic ethnic, linguistic and genetic diversity. The genetic diversity of MENA region has been largely unknown. The recent availability of whole-exome and whole-genome sequences from the region has made it possible to collect population-specific allele frequencies. The integration of data sets from this region would provide insights into the landscape of genetic variants in this region. We integrated genetic variants from multiple data sets systematically, available from this region to create a compendium of over 26 million genetic variations. The variants were systematically annotated and their allele frequencies in the data sets were computed and available as a web interface which enables quick query. As a proof of principle for application of the compendium for genetic epidemiology, we analyzed the allele frequencies for variants in transglutaminase 1 (TGM1) gene, associated with autosomal recessive lamellar ichthyosis. Our analysis revealed that the carrier frequency of selected variants differed widely with significant interethnic differences. To the best of our knowledge, al mena is the first and most comprehensive repertoire of genetic variations from the Arab, Middle Eastern and North African region. We hope al mena would accelerate Precision Medicine in the region.
Collapse
Affiliation(s)
- Remya Koshy
- GN Ramachandran Knowledge Center for Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, Delhi, India
| | - Anop Ranawat
- GN Ramachandran Knowledge Center for Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, Delhi, India
| | - Vinod Scaria
- GN Ramachandran Knowledge Center for Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, Delhi, India.,The Academy of Scientific and Innovative Research (AcSIR), CSIR-Institute of Genomics and Integrative Biology, Delhi, India
| |
Collapse
|
22
|
John SE, Antony D, Eaaswarkhanth M, Hebbar P, Alkayal F, Tuomilehto J, Alsmadi O, Thanaraj TA. Genetic variants associated with warfarin dosage in Kuwaiti population. Pharmacogenomics 2017; 18:757-764. [PMID: 28592190 DOI: 10.2217/pgs-2017-0020] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Assessing the distinct prevalence or absence of genetic variants associated with differential response to the anticoagulant medication of warfarin in different population groups is actively pursued by pharmacogenomics community. Populations from Arabian Peninsula are underrepresented in such studies. By way of examining exome- and genome-wide genotype data from 1395 Arab individuals in Kuwait, we report distinct occurrence of warfarin response-related variants rs12460590_A/CYP2A7, rs2108622_T/CYP4F2, rs2884737_C/VKORC1 and distinct absence of rs11150606_C/PRSS53 in Kuwaiti population. The presented results in conjunction with similar literature reports on Qatari population enhance the worldwide understanding on population-specific distributions of genetic variants associated with warfarin drug dosage.
Collapse
Affiliation(s)
- Sumi Elsa John
- Population Genomics & Genetic Epidemiology Unit, Dasman Diabetes Institute, P.O. Box 1180, Dasman, 15462, Kuwait
| | - Dinu Antony
- Functional Genomics Unit, Dasman Diabetes Institute, P.O. Box 1180, Dasman, 15462, Kuwait
| | - Muthukrishnan Eaaswarkhanth
- Population Genomics & Genetic Epidemiology Unit, Dasman Diabetes Institute, P.O. Box 1180, Dasman, 15462, Kuwait
| | - Prashantha Hebbar
- Population Genomics & Genetic Epidemiology Unit, Dasman Diabetes Institute, P.O. Box 1180, Dasman, 15462, Kuwait
| | - Fadi Alkayal
- Population Genomics & Genetic Epidemiology Unit, Dasman Diabetes Institute, P.O. Box 1180, Dasman, 15462, Kuwait
| | - Jaakko Tuomilehto
- Population Genomics & Genetic Epidemiology Unit, Dasman Diabetes Institute, P.O. Box 1180, Dasman, 15462, Kuwait
| | - Osama Alsmadi
- Functional Genomics Unit, Dasman Diabetes Institute, P.O. Box 1180, Dasman, 15462, Kuwait
| | - Thangavel Alphonse Thanaraj
- Population Genomics & Genetic Epidemiology Unit, Dasman Diabetes Institute, P.O. Box 1180, Dasman, 15462, Kuwait
| |
Collapse
|
23
|
Hebbar P, Alkayal F, Nizam R, Melhem M, Elkum N, John SE, Abufarha M, Alsmadi O, Thanaraj TA. The TCN2 variant of rs9606756 [Ile23Val] acts as risk loci for obesity-related traits and mediates by interacting with Apo-A1. Obesity (Silver Spring) 2017; 25:1098-1108. [PMID: 28417558 DOI: 10.1002/oby.21826] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/13/2016] [Revised: 02/02/2017] [Accepted: 02/22/2017] [Indexed: 02/04/2023]
Abstract
OBJECTIVE Despite alarming obesity levels in the Arabian Peninsula, its population lacks convincingly identified genetic determinants of obesity. A genome-wide association study was performed for obesity-related anthropometric traits in Arabs and to decipher mechanisms by which the variants mediate traits. METHODS The Illumina HumanOmniExpress BeadChip was used to genotype 1,353 Arab individuals (largely with Class I obesity) from Kuwait. Genome-wide association tests for obesity-related anthropometric traits were performed. Top associations were tested for replication in an independent cohort (1,176 unrelated Arabs). Resultant variants were investigated for interactions with obesity-related plasma biomarkers. Pathway analysis was performed on genes harboring markers in linkage disequilibrium (LD) with identified variants. RESULTS The rs9606756[c.67A>G,p.Ile23Val] variant from TCN2 was associated with waist circumference (WC) at nearly genome-wide significance (P = 8.92E-08). WC was inversely related with Apo-A1 or high-density lipoprotein levels; individuals with the AG genotype exhibited stronger relationship than those with the reference AA genotype. Interaction involving the AG genotype (effect allele = G) significantly contributed to an increase in anthropometric traits (particularly WC). Genes harboring single-nucleotide polymorphisms in LD with rs9606756 mapped onto an interaction network (with TP53 as central element) of established obesity/diabetes-related protein components. CONCLUSIONS The TCN2 variant acts as a risk factor for WC in the Arab population. The variant mediates obesity-related anthropometric traits via interactions with Apo-A1/high-density lipoprotein or TP53.
Collapse
Affiliation(s)
| | | | | | | | - Naser Elkum
- Sidra Medical and Research Center, Research Department Doha, Qatar
| | | | | | | | | |
Collapse
|
24
|
Kohailan M, Alanazi M, Rouabhia M, Al Amri A, Parine NR, Semlali A. Two SNPs in the promoter region of Toll-like receptor 4 gene are not associated with smoking in Saudi Arabia. Onco Targets Ther 2017; 10:745-752. [PMID: 28223830 PMCID: PMC5308598 DOI: 10.2147/ott.s111971] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Defects in the innate immune system, particularly in Toll-like receptors (TLRs), have been reported in several cigarette smoke-promoted diseases. The aim of this study was to examine the impact of tobacco smoke on allelic frequencies of TLR4 single-nucleotide polymorphisms (SNPs) and to compare the genotypic distribution of these SNPs in a Saudi Arabian population with that in previously studied populations. DNA was extracted from 303 saliva samples collected from smokers and nonsmokers. Two transitional SNPs in the promoter region of TLR4 were selected, rs2770150 (T/C) and rs10759931 (G/A). Genotype frequencies were determined using quantitative polymerase chain reaction. Our results showed a slight effect of smoking on the distribution of rs2770150 and rs10759931. However, the differences were not significant. Thus, we conclude that the SNPs selected for this study were independent of smoking and may not be related to smoking-induced diseases.
Collapse
Affiliation(s)
- Muhammad Kohailan
- Department of Biochemistry, College of Science, King Saud University, Riyadh, Kingdom of Saudi Arabia
| | - Mohammad Alanazi
- Department of Biochemistry, College of Science, King Saud University, Riyadh, Kingdom of Saudi Arabia
| | - Mahmoud Rouabhia
- Groupe de Recherche en Écologie Buccale, Département de Stomatologie, Faculté de Médecine Dentaire, Université Laval, Québec, QC, Canada
| | - Abdullah Al Amri
- Department of Biochemistry, College of Science, King Saud University, Riyadh, Kingdom of Saudi Arabia
| | - Narasimha Reddy Parine
- Department of Biochemistry, College of Science, King Saud University, Riyadh, Kingdom of Saudi Arabia
| | - Abdelhabib Semlali
- Department of Biochemistry, College of Science, King Saud University, Riyadh, Kingdom of Saudi Arabia
| |
Collapse
|
25
|
Abstract
The 22 Arab nations have a unique genetic structure, which reflects both conserved and diverse gene pools due to the prevalent endogamous and consanguineous marriage culture and the long history of admixture among different ethnic subcultures descended from the Asian, European, and African continents. Human genome sequencing has enabled large-scale genomic studies of different populations and has become a powerful tool for studying disease predictions and diagnosis. Despite the importance of the Arab genome for better understanding the dynamics of the human genome, discovering rare genetic variations, and studying early human migration out of Africa, it is poorly represented in human genome databases, such as HapMap and the 1000 Genomes Project. In this review, I demonstrate the significance of sequencing the Arab genome and setting an Arab genome reference(s) for better understanding the molecular pathogenesis of genetic diseases, discovering novel/rare variants, and identifying a meaningful genotype-phenotype correlation for complex diseases.
Collapse
Affiliation(s)
- Hatem Zayed
- College of Health and Sciences, Biomedical Sciences Department, Qatar University, PO Box 2713, Doha, Qatar.
| |
Collapse
|
26
|
Al-Hebshi NN, Li S, Nasher AT, El-Setouhy M, Alsanosi R, Blancato J, Loffredo C. Exome sequencing of oral squamous cell carcinoma in users of Arabian snuff reveals novel candidates for driver genes. Int J Cancer 2016; 139:363-72. [PMID: 26934577 DOI: 10.1002/ijc.30068] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2015] [Accepted: 02/16/2016] [Indexed: 02/06/2023]
Abstract
The study sought to identify genetic aberrations driving oral squamous cell carcinoma (OSCC) development among users of shammah, an Arabian preparation of smokeless tobacco. Twenty archival OSCC samples, 15 of which with a history of shammah exposure, were whole-exome sequenced at an average depth of 127×. Somatic mutations were identified using a novel, matched controls-independent filtration algorithm. CODEX and Exomedepth coupled with a novel, Database of Genomic Variant-based filter were employed to call somatic gene-copy number variations. Significantly mutated genes were identified with Oncodrive FM and the Youn and Simon's method. Candidate driver genes were nominated based on Gene Set Enrichment Analysis. The observed mutational spectrum was similar to that reported by the TCGA project. In addition to confirming known genes of OSCC (TP53, CDKNA2, CASP8, PIK3CA, HRAS, FAT1, TP63, CCND1 and FADD) the analysis identified several candidate novel driver events including mutations of NOTCH3, CSMD3, CRB1, CLTCL1, OSMR and TRPM2, amplification of the proto-oncogenes FOSL1, RELA, TRAF6, MDM2, FRS2 and BAG1, and deletion of the recently described tumor suppressor SMARCC1. Analysis also revealed significantly altered pathways not previously implicated in OSCC including Oncostatin-M signalling pathway, AP-1 and C-MYB transcription networks and endocytosis. There was a trend for higher number of mutations, amplifications and driver events in samples with history of shammah exposure particularly those that tested EBV positive, suggesting an interaction between tobacco exposure and EBV. The work provides further evidence for the genetic heterogeneity of oral cancer and suggests shammah-associated OSCC is characterized by extensive amplification of oncogenes.
Collapse
Affiliation(s)
- Nezar Noor Al-Hebshi
- Department of Preventive Dentistry, College of Dentistry, Jazan University, Jazan, Kingdom of Saudi Arabia
| | - Shiyong Li
- Department of Oncology and Pharmacogenomics, Beijing Genome Institute (BGI), Shenzhen, Republic of China
| | - Akram Thabet Nasher
- Department of Oral and Maxillofacial Surgery, Faculty of Dentistry, Sana'a University, Yemen
| | - Maged El-Setouhy
- Substance Abuse Research Center (SARC), Jazan University, Jazan, Kingdom of Saudi Arabia
| | - Rashad Alsanosi
- Substance Abuse Research Center (SARC), Jazan University, Jazan, Kingdom of Saudi Arabia
| | - Jan Blancato
- Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC, USA
| | - Christopher Loffredo
- Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC, USA
| |
Collapse
|
27
|
Rodriguez-Flores JL, Fakhro K, Agosto-Perez F, Ramstetter MD, Arbiza L, Vincent TL, Robay A, Malek JA, Suhre K, Chouchane L, Badii R, Al-Nabet Al-Marri A, Abi Khalil C, Zirie M, Jayyousi A, Salit J, Keinan A, Clark AG, Crystal RG, Mezey JG. Indigenous Arabs are descendants of the earliest split from ancient Eurasian populations. Genome Res 2016; 26:151-62. [PMID: 26728717 PMCID: PMC4728368 DOI: 10.1101/gr.191478.115] [Citation(s) in RCA: 71] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2015] [Accepted: 12/15/2015] [Indexed: 12/26/2022]
Abstract
An open question in the history of human migration is the identity of the earliest Eurasian populations that have left contemporary descendants. The Arabian Peninsula was the initial site of the out-of-Africa migrations that occurred between 125,000 and 60,000 yr ago, leading to the hypothesis that the first Eurasian populations were established on the Peninsula and that contemporary indigenous Arabs are direct descendants of these ancient peoples. To assess this hypothesis, we sequenced the entire genomes of 104 unrelated natives of the Arabian Peninsula at high coverage, including 56 of indigenous Arab ancestry. The indigenous Arab genomes defined a cluster distinct from other ancestral groups, and these genomes showed clear hallmarks of an ancient out-of-Africa bottleneck. Similar to other Middle Eastern populations, the indigenous Arabs had higher levels of Neanderthal admixture compared to Africans but had lower levels than Europeans and Asians. These levels of Neanderthal admixture are consistent with an early divergence of Arab ancestors after the out-of-Africa bottleneck but before the major Neanderthal admixture events in Europe and other regions of Eurasia. When compared to worldwide populations sampled in the 1000 Genomes Project, although the indigenous Arabs had a signal of admixture with Europeans, they clustered in a basal, outgroup position to all 1000 Genomes non-Africans when considering pairwise similarity across the entire genome. These results place indigenous Arabs as the most distant relatives of all other contemporary non-Africans and identify these people as direct descendants of the first Eurasian populations established by the out-of-Africa migrations.
Collapse
Affiliation(s)
- Juan L Rodriguez-Flores
- Department of Genetic Medicine, Weill Cornell Medical College, New York, New York 10065, USA
| | - Khalid Fakhro
- Sidra Medical and Research Center, Doha, Qatar; Department of Genetic Medicine, Weill Cornell Medical College-Qatar, Doha, Qatar
| | - Francisco Agosto-Perez
- Department of Genetic Medicine, Weill Cornell Medical College, New York, New York 10065, USA; Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14850, USA
| | - Monica D Ramstetter
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14850, USA
| | - Leonardo Arbiza
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14850, USA
| | - Thomas L Vincent
- Department of Genetic Medicine, Weill Cornell Medical College, New York, New York 10065, USA
| | - Amal Robay
- Department of Genetic Medicine, Weill Cornell Medical College-Qatar, Doha, Qatar
| | - Joel A Malek
- Department of Genetic Medicine, Weill Cornell Medical College-Qatar, Doha, Qatar
| | - Karsten Suhre
- Bioinformatics Core, Weill Cornell Medical College-Qatar, Doha, Qatar
| | - Lotfi Chouchane
- Department of Genetic Medicine, Weill Cornell Medical College-Qatar, Doha, Qatar
| | - Ramin Badii
- Laboratory Medicine and Pathology, Hamad Medical Corporation, Doha, Qatar
| | | | - Charbel Abi Khalil
- Department of Genetic Medicine, Weill Cornell Medical College-Qatar, Doha, Qatar
| | - Mahmoud Zirie
- Department of Medicine, Hamad Medical Corporation, Doha, Qatar
| | - Amin Jayyousi
- Department of Medicine, Hamad Medical Corporation, Doha, Qatar
| | - Jacqueline Salit
- Department of Genetic Medicine, Weill Cornell Medical College, New York, New York 10065, USA
| | - Alon Keinan
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14850, USA
| | - Andrew G Clark
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14850, USA
| | - Ronald G Crystal
- Department of Genetic Medicine, Weill Cornell Medical College, New York, New York 10065, USA
| | - Jason G Mezey
- Department of Genetic Medicine, Weill Cornell Medical College, New York, New York 10065, USA; Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14850, USA
| |
Collapse
|
28
|
Fohner AE, Robinson R, Yracheta J, Dillard DA, Schilling B, Khan B, Hopkins S, Boyer B, Black J, Wiener H, Tiwari HK, Gordon A, Nickerson D, Tsai JM, Farin FM, Thornton TA, Rettie AE, Thummel KE. Variation in genes controlling warfarin disposition and response in American Indian and Alaska Native people: CYP2C9, VKORC1, CYP4F2, CYP4F11, GGCX. Pharmacogenet Genomics 2015; 25:343-353. [PMID: 25946405 PMCID: PMC4461509 DOI: 10.1097/fpc.0000000000000143] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
OBJECTIVES Pharmacogenetic testing is projected to improve health outcomes and reduce the cost of care by increasing therapeutic efficacy and minimizing drug toxicity. American Indian and Alaska Native (AI/AN) people historically have been excluded from pharmacogenetic research and its potential benefits, a deficiency we sought to address. The vitamin K antagonist warfarin is prescribed for prevention of thromboembolic events, although its narrow therapeutic index and wide interindividual variability necessitate close monitoring of drug response. Therefore, we were interested in variation in CYP2C9, VKORC1, CYP4F2, CYP4F11, and GGCX, which encode enzymes important for the activity of warfarin and synthesis of vitamin K-dependent blood clotting factors. METHODS We resequenced these genes in 188 AI/AN people in partnership with Southcentral Foundation in Anchorage, Alaska and 94 Yup'ik people living in the Yukon-Kuskokwim Delta of southwest Alaska to identify known or novel function-disrupting variation. We conducted genotyping for specific single nucleotide polymorphisms in larger cohorts of each study population (380 and 350, respectively). RESULTS We identified high frequencies of the lower-warfarin dose VKORC1 haplotype (-1639G>A and 1173C>T) and the higher-warfarin dose CYP4F2*3 variant. We also identified two relatively common, novel, and potentially function-disrupting variants in CYP2C9 (M1L and N218I), which, along with CYP2C9*3, CYP2C9*2, and CYP2C9*29, predict that a significant proportion of AI/AN people will have decreased CYP2C9 activity. CONCLUSION Overall, we predict a lower average warfarin dose requirement in AI/AN populations in Alaska than that seen in non-AI/AN populations of the USA, a finding consistent with clinical experience in Alaska.
Collapse
Affiliation(s)
- Alison E Fohner
- Department of Pharmaceutics, University of Washington, Seattle, WA
| | | | - Joseph Yracheta
- Department of Pharmaceutics, University of Washington, Seattle, WA
| | | | | | | | - Scarlett Hopkins
- Center for Alaska Native Health Research, University of Alaska Fairbanks, Fairbanks, AK
| | - Bert Boyer
- Center for Alaska Native Health Research, University of Alaska Fairbanks, Fairbanks, AK
| | - Jynene Black
- Center for Alaska Native Health Research, University of Alaska Fairbanks, Fairbanks, AK
| | - Howard Wiener
- School of Public Health, University of Alabama at Birmingham, Birmingham, AL
| | - Hemant K Tiwari
- School of Public Health, University of Alabama at Birmingham, Birmingham, AL
| | - Adam Gordon
- Department of Genome Sciences, University of Washington, Seattle WA
| | | | - Jesse M Tsai
- Department of Environmental and Occupational Health Sciences, University of Washington
| | - Federico M Farin
- Department of Environmental and Occupational Health Sciences, University of Washington
| | | | - Allan E Rettie
- Department of Medicinal Chemistry, University of Washington
| | | |
Collapse
|
29
|
John SE, Thareja G, Hebbar P, Behbehani K, Thanaraj TA, Alsmadi O. Kuwaiti population subgroup of nomadic Bedouin ancestry-Whole genome sequence and analysis. GENOMICS DATA 2015; 3:116-27. [PMID: 26484159 PMCID: PMC4535864 DOI: 10.1016/j.gdata.2014.11.016] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/30/2014] [Revised: 11/27/2014] [Accepted: 11/28/2014] [Indexed: 12/21/2022]
Abstract
Kuwaiti native population comprises three distinct genetic subgroups of Persian, "city-dwelling" Saudi Arabian tribe, and nomadic "tent-dwelling" Bedouin ancestry. Bedouin subgroup is characterized by presence of 17% African ancestry; it owes it origin to nomadic tribes of the deserts of Arabian Peninsula and North Africa. By sequencing whole genome of a Kuwaiti male from this subgroup at 41X coverage, we report 3,752,878 SNPs, 411,839 indels, and 8451 structural variations. Neighbor-joining tree, based on shared variant positions carrying disease-risk alleles between the Bedouin and other continental genomes, places Bedouin genome at the nexus of African, Asian, and European genomes in concordance with geographical location of Kuwait and Peninsula. In congruence with participant's medical history for morbid obesity and bronchial asthma, risk alleles are seen at deleterious SNPs associated with obesity and asthma. Many of the observed deleterious 'novel' variants lie in genes associated with autosomal recessive disorders characteristic of the region.
Collapse
Affiliation(s)
| | | | | | | | | | - Osama Alsmadi
- Corresponding author. Tel.: + 965 2224 2999x4343(work); fax: + 965 2249 2406.
| |
Collapse
|