1
|
Punniyamoorthy D, Souframanien J. Gamma-rays induced genome wide stable mutations in cowpea deciphered through whole genome sequencing. Int J Radiat Biol 2024:1-13. [PMID: 38683196 DOI: 10.1080/09553002.2024.2345087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 03/30/2024] [Indexed: 05/01/2024]
Abstract
PURPOSE Gamma rays are the most widely exploited physical mutagen in plant mutation breeding. They are known to be involved in the development of more than 60% of global cowpea (Vigna unguiculata (L.) Walp.) mutant varieties. Nevertheless, the nature and type of genome-wide mutations induced by gamma rays have not been studied in cowpea and therefore, the present investigation was undertaken. MATERIALS AND METHODS Genomic DNAs from three stable gamma rays-induced mutants (large seed size, small seed size and disease resistant mutant) of cowpea cultivar 'CPD103' in M6 generation along with its progenitor were used for Illumina-based whole-genome resequencing. RESULTS Gamma rays induced a relatively higher frequency (88.9%) of single base substitutions (SBSs) with an average transition to transversion ratio (Ti/Tv) of 3.51 in M6 generation. A > G transitions, including its complementary T > C transitions, predominated the transition mutations, while all four types of transversion mutations were detected with frequencies over 6.5%. Indels (small insertions and deletions) constituted about 11% of the total induced variations, wherein small insertions (6.3%) were relatively more prominent than small deletions (4.8%). Among the indels, single-base indels and, in particular, those involving A/T bases showed a preponderance, albeit indels of up to three bases were detected in low proportions. Distributed across all 11 chromosomes, only a fraction of SBSs (19.45%) and indels (20.2%) potentially altered the encoded amino acids/peptides. The inherent mutation rate induced by gamma rays in cowpea was observed to be in the order of 1.4 × 10-7 per base pair in M6 generation. CONCLUSION Gamma-rays with a greater tendency to induce SBSs and, to a lesser extent, indels could be efficiently and effectively exploited in cowpea mutation breeding.
Collapse
Affiliation(s)
| | - Jegadeesan Souframanien
- Nuclear Agriculture and Biotechnology Division, Bhabha Atomic Research Centre, Mumbai, India
| |
Collapse
|
2
|
Han S, Zhang S, Yi R, Bi D, Ding H, Yang J, Ye Y, Xu W, Wu L, Zhuo R, Kan X. Phylogenomics and plastomics offer new evolutionary perspectives on Kalanchoideae (Crassulaceae). Ann Bot 2024; 133:585-604. [PMID: 38359907 PMCID: PMC11037489 DOI: 10.1093/aob/mcae017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Accepted: 02/08/2024] [Indexed: 02/17/2024]
Abstract
BACKGROUND AND AIMS Kalanchoideae is one of three subfamilies within Crassulaceae and contains four genera. Despite previous efforts, the phylogeny of Kalanchoideae remains inadequately resolved with persistent issues including low support, unstructured topologies and polytomies. This study aimed to address two central objectives: (1) resolving the pending phylogenetic questions within Kalanchoideae by using organelle-scale 'barcodes' (plastomes) and nuclear data; and (2) investigating interspecific diversity patterns among Kalanchoideae plastomes. METHODS To explore the plastome evolution in Kalanchoideae, we newly sequenced 38 plastomes representing all four constituent genera (Adromischus, Cotyledon, Kalanchoe and Tylecodon). We performed comparative analyses of plastomic features, including GC and gene contents, gene distributions at the IR (inverted repeat) boundaries, nucleotide divergence, plastomic tRNA (pttRNA) structures and codon aversions. Additionally, phylogenetic inferences were inferred using both the plastomic dataset (79 genes) and nuclear dataset (1054 genes). KEY RESULTS Significant heterogeneities were observed in plastome lengths among Kalanchoideae, strongly correlated with LSC (large single copy) lengths. Informative diversities existed in the gene content at SSC/IRa (small single copy/inverted repeat a), with unique patterns individually identified in Adromischus leucophyllus and one major Kalanchoe clade. The ycf1 gene was assessed as a shared hypervariable region among all four genera, containing nine lineage-specific indels. Three pttRNAs exhibited unique structures specific to Kalanchoideae and the genera Adromischus and Kalanchoe. Moreover, 24 coding sequences revealed a total of 41 lineage-specific unused codons across all four constituent genera. The phyloplastomic inferences clearly depicted internal branching patterns in Kalanchoideae. Most notably, by both plastid- and nuclear-based phylogenies, our research offers the first evidence that Kalanchoe section Eukalanchoe is not monophyletic. CONCLUSIONS This study conducted comprehensive analyses on 38 newly reported Kalanchoideae plastomes. Importantly, our results not only reconstructed well-resolved phylogenies within Kalanchoideae, but also identified highly informative unique markers at the subfamily, genus and species levels. These findings significantly enhance our understanding of the evolutionary history of Kalanchoideae.
Collapse
Affiliation(s)
- Shiyun Han
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Sijia Zhang
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Ran Yi
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - De Bi
- Suzhou Polytechnic Institute of Agriculture, Suzhou 215000, China
| | - Hengwu Ding
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Jianke Yang
- School of Basic Medical Sciences, Wannan Medical College, Wuhu 241000, China
| | - Yuanxin Ye
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| | - Wenzhong Xu
- Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Longhua Wu
- CAS Key Laboratory of Soil Environment and Pollution Remediation, Institute of Soil Science, Chinese Academy of Sciences, Nanjing 210008, China
| | - Renying Zhuo
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding of Zhejiang Province, Research Institute of Subtropical Forestry, Chinese Academy of Forestry, Hangzhou 311400, China
| | - Xianzhao Kan
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
| |
Collapse
|
3
|
Ely B, Hils M, Clarke A, Albert M, Holness N, Lenski J, Mohammadi T. New Genera and Species of Caulobacter and Brevundimonas Bacteriophages Provide Insights into Phage Genome Evolution. Viruses 2024; 16:641. [PMID: 38675982 PMCID: PMC11053796 DOI: 10.3390/v16040641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Revised: 04/16/2024] [Accepted: 04/18/2024] [Indexed: 04/28/2024] Open
Abstract
Previous studies have identified diverse bacteriophages that infect Caulobacter vibrioides strain CB15 ranging from small RNA phages to four genera of jumbo phages. In this study, we focus on 20 bacteriophages whose genomes range from 40 to 60 kb in length. Genome comparisons indicated that these diverse phages represent six Caulobacter phage genera and one additional genus that includes both Caulobacter and Brevundimonas phages. Within species, comparisons revealed that both single base changes and inserted or deleted genetic material cause the genomes of closely related phages to diverge. Among genera, the basic gene order and the orientation of key genes were retained with most of the observed variation occurring at ends of the genomes. We hypothesize that the nucleotide sequences of the ends of these phage genomes are less important than the need to maintain the size of the genome and the stability of the corresponding mRNAs.
Collapse
Affiliation(s)
- Bert Ely
- Department of Biological Sciences, University of South Carolina, Columbia, SC 29208, USA (A.C.); (M.A.); (T.M.)
| | | | | | | | | | | | | |
Collapse
|
4
|
Alexandrino AO, Oliveira AR, Jean G, Fertin G, Dias U, Dias Z. Reversal and Transposition Distance on Unbalanced Genomes Using Intergenic Information. J Comput Biol 2023; 30:861-876. [PMID: 37222724 DOI: 10.1089/cmb.2023.0087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2023] Open
Abstract
The most common way to calculate the rearrangement distance between two genomes is to use the size of a minimum length sequence of rearrangements that transforms one of the two given genomes into the other, where the genomes are represented as permutations using only their gene order, based on the assumption that genomes have the same gene content. With the advance of research in genome rearrangements, new works extended the classical models by either considering genomes with different gene content (unbalanced genomes) or including more genomic characteristics to the mathematical representation of the genomes, such as the distribution of intergenic regions sizes. In this study, we study the Reversal, Transposition, and Indel (Insertion and Deletion) Distance using intergenic information, which allows comparing unbalanced genomes, because indels are included in the rearrangement model (i.e., the set of possible rearrangements allowed when we compute the distance). For the particular case of transpositions and indels on unbalanced genomes, we present a 4-approximation algorithm, improving a previous 4.5 approximation. This algorithm is extended so as to deal with gene orientation and to maintain the 4-approximation factor for the Reversal, Transposition, and Indel Distance on unbalanced genomes. Furthermore, we evaluate the proposed algorithms using experiments on simulated data.
Collapse
Affiliation(s)
| | | | - Géraldine Jean
- Nantes Université, École Centrale Nantes, CNRS, LS2N, UMR 6004, Nantes, France
| | - Guillaume Fertin
- Nantes Université, École Centrale Nantes, CNRS, LS2N, UMR 6004, Nantes, France
| | - Ulisses Dias
- School of Technology, University of Campinas, Limeira, Brazil
| | - Zanoni Dias
- Institute of Computing, University of Campinas, Campinas, Brazil
| |
Collapse
|
5
|
Wang Y, Obbard DJ. Experimental estimates of germline mutation rate in eukaryotes: a phylogenetic meta-analysis. Evol Lett 2023; 7:216-226. [PMID: 37475753 PMCID: PMC10355183 DOI: 10.1093/evlett/qrad027] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Revised: 05/08/2023] [Accepted: 06/08/2023] [Indexed: 07/22/2023] Open
Abstract
Mutation is the ultimate source of all genetic variation, and over the last 10 years the ready availability of whole-genome sequencing has permitted direct estimation of mutation rate for many non-model species across the tree of life. In this meta-analysis, we make a comprehensive search of the literature for mutation rate estimates in eukaryotes, identifying 140 mutation accumulation (MA) and parent-offspring (PO) sequencing studies covering 134 species. Based on these data, we revisit differences in the single-nucleotide mutation (SNM) rate between different phylogenetic lineages and update the known relationships between mutation rate and generation time, genome size, and nucleotide diversity-while accounting for phylogenetic nonindependence. We do not find a significant difference between MA and PO in estimated mutation rates, but we confirm that mammal and plant lineages have higher mutation rates than arthropods and that unicellular eukaryotes have the lowest mutation rates. We find that mutation rates are higher in species with longer generation times and larger genome sizes, even when accounting for phylogenetic relationships. Moreover, although nucleotide diversity is positively correlated with mutation rate, the gradient of the relationship is significantly less than one (on a logarithmic scale), consistent with higher mutation rates in populations with smaller effective size. For the 29 species for which data are available, we find that indel mutation rates are positively correlated with nucleotide mutation rates and that short deletions are generally more common than short insertions. Nevertheless, despite recent progress, no estimates of either SNM or indel mutation rates are available for the majority of deeply branching eukaryotic lineages-or even for most animal phyla. Even among charismatic megafauna, experimental mutation rate estimates remain unknown for amphibia and scarce for reptiles and fish.
Collapse
Affiliation(s)
- Yiguan Wang
- Corresponding author: Institute of Ecology and Evolution, University of Edinburgh, Charlotte Auerbach Road, Edinburgh EH9 3FL, United Kingdom.
| | - Darren J Obbard
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh, United Kingdom
| |
Collapse
|
6
|
Nakamura S, Inada E, Saitoh I, Sato M. Recent Genome-Editing Approaches toward Post-Implanted Fetuses in Mice. BioTech (Basel) 2023; 12:biotech12020037. [PMID: 37218754 DOI: 10.3390/biotech12020037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 04/25/2023] [Accepted: 05/08/2023] [Indexed: 05/24/2023] Open
Abstract
Genome editing, as exemplified by the CRISPR/Cas9 system, has recently been employed to effectively generate genetically modified animals and cells for the purpose of gene function analysis and disease model creation. There are at least four ways to induce genome editing in individuals: the first is to perform genome editing at the early preimplantation stage, such as fertilized eggs (zygotes), for the creation of whole genetically modified animals; the second is at post-implanted stages, as exemplified by the mid-gestational stages (E9 to E15), for targeting specific cell populations through in utero injection of viral vectors carrying genome-editing components or that of nonviral vectors carrying genome-editing components and subsequent in utero electroporation; the third is at the mid-gestational stages, as exemplified by tail-vein injection of genome-editing components into the pregnant females through which the genome-editing components can be transmitted to fetal cells via a placenta-blood barrier; and the last is at the newborn or adult stage, as exemplified by facial or tail-vein injection of genome-editing components. Here, we focus on the second and third approaches and will review the latest techniques for various methods concerning gene editing in developing fetuses.
Collapse
Affiliation(s)
- Shingo Nakamura
- Division of Biomedical Engineering, National Defense Medical College Research Institute, Saitama 359-8513, Japan
| | - Emi Inada
- Department of Pediatric Dentistry, Graduate School of Medical and Dental Sciences, Kagoshima University, Kagoshima 890-8544, Japan
| | - Issei Saitoh
- Department of Pediatric Dentistry, Asahi University School of Dentistry, Mizuho-shi 501-0296, Japan
| | - Masahiro Sato
- Department of Genome Medicine, National Center for Child Health and Development, Tokyo 157-8535, Japan
| |
Collapse
|
7
|
Kim WJ, Kang BH, Moon CY, Kang S, Shin S, Chowdhury S, Choi MS, Park SK, Moon JK, Ha BK. Quantitative Trait Loci (QTL) Analysis of Seed Protein and Oil Content in Wild Soybean ( Glycine soja). Int J Mol Sci 2023; 24:ijms24044077. [PMID: 36835486 PMCID: PMC9959443 DOI: 10.3390/ijms24044077] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 02/10/2023] [Accepted: 02/13/2023] [Indexed: 02/22/2023] Open
Abstract
Soybean seeds consist of approximately 40% protein and 20% oil, making them one of the world's most important cultivated legumes. However, the levels of these compounds are negatively correlated with each other and regulated by quantitative trait loci (QTL) that are controlled by several genes. In this study, a total of 190 F2 and 90 BC1F2 plants derived from a cross of Daepung (Glycine max) with GWS-1887 (G. soja, a source of high protein), were used for the QTL analysis of protein and oil content. In the F2:3 populations, the average protein and oil content was 45.52% and 11.59%, respectively. A QTL associated with protein levels was detected at Gm20_29512680 on chr. 20 with a likelihood of odds (LOD) of 9.57 and an R2 of 17.2%. A QTL associated with oil levels was also detected at Gm15_3621773 on chr. 15 (LOD: 5.80; R2: 12.2%). In the BC1F2:3 populations, the average protein and oil content was 44.25% and 12.14%, respectively. A QTL associated with both protein and oil content was detected at Gm20_27578013 on chr. 20 (LOD: 3.77 and 3.06; R2 15.8% and 10.7%, respectively). The crossover to the protein content of BC1F3:4 population was identified by SNP marker Gm20_32603292. Based on these results, two genes, Glyma.20g088000 (S-adenosyl-l-methionine-dependent methyltransferases) and Glyma.20g088400 (oxidoreductase, 2-oxoglutarate-Fe(II) oxygenase family protein), in which the amino acid sequence had changed and a stop codon was generated due to an InDel in the exon region, were identified.
Collapse
Affiliation(s)
- Woon Ji Kim
- Department of Applied Plant Science, Chonnam National University, Gwangju 61186, Republic of Korea
| | - Byeong Hee Kang
- Department of Applied Plant Science, Chonnam National University, Gwangju 61186, Republic of Korea
- BK21 FOUR Center for IT-Bio Convergence System Agriculture, Chonnam National University, Gwangju 61186, Republic of Korea
| | - Chang Yeok Moon
- Department of Applied Plant Science, Chonnam National University, Gwangju 61186, Republic of Korea
- BK21 FOUR Center for IT-Bio Convergence System Agriculture, Chonnam National University, Gwangju 61186, Republic of Korea
| | - Sehee Kang
- Department of Applied Plant Science, Chonnam National University, Gwangju 61186, Republic of Korea
- BK21 FOUR Center for IT-Bio Convergence System Agriculture, Chonnam National University, Gwangju 61186, Republic of Korea
| | - Seoyoung Shin
- Department of Applied Plant Science, Chonnam National University, Gwangju 61186, Republic of Korea
| | - Sreeparna Chowdhury
- Department of Applied Plant Science, Chonnam National University, Gwangju 61186, Republic of Korea
| | - Man-Soo Choi
- National Institute of Crop Science, Rural Development Administration (RDA), Wanju 55365, Republic of Korea
| | - Soo-Kwon Park
- National Institute of Crop Science, Rural Development Administration (RDA), Wanju 55365, Republic of Korea
| | - Jung-Kyung Moon
- National Institute of Crop Science, Rural Development Administration (RDA), Wanju 55365, Republic of Korea
| | - Bo-Keun Ha
- Department of Applied Plant Science, Chonnam National University, Gwangju 61186, Republic of Korea
- BK21 FOUR Center for IT-Bio Convergence System Agriculture, Chonnam National University, Gwangju 61186, Republic of Korea
- Correspondence: ; Tel.: +82-62-530-2055
| |
Collapse
|
8
|
Ciccozzi M, Pascarella S. Two sides of the same coin: the N-terminal and the receptor binding domains of SARS-CoV-2 Spike. Future Virol 2023:10.2217/fvl-2022-0181. [PMID: 36896145 PMCID: PMC9987531 DOI: 10.2217/fvl-2022-0181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 01/31/2023] [Indexed: 03/08/2023]
Abstract
The SARS-CoV-2 Spike receptor binding domain and N-terminal domain interact with each other in an intricate mechanism. Mutations modulate the interplay between the Spike and host molecules. This editorial comments on the intricacies of SARS-CoV-2 Spike interactions.
Collapse
Affiliation(s)
- Massimo Ciccozzi
- Medical Statistic & Molecular Epidemiology Unit, University of Biomedical Campus, Rome, Italy
| | - Stefano Pascarella
- Department of Biochemical Sciences 'A Rossi Fanelli', Sapienza Università di Roma, Rome, 00185, Italy
| |
Collapse
|
9
|
Takahashi R, Takahashi G, Kameyama Y, Sato M, Ohtsuka M, Wada K. Gender-Difference in Hair Length as Revealed by Crispr-Based Production of Long-Haired Mice with Dysfunctional FGF5 Mutations. Int J Mol Sci 2022; 23:ijms231911855. [PMID: 36233155 PMCID: PMC9569730 DOI: 10.3390/ijms231911855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 09/27/2022] [Accepted: 10/03/2022] [Indexed: 11/16/2022] Open
Abstract
Fibroblast growth factor 5 (FGF5) is an important molecule required for the transition from anagen to catagen phase of the mammalian hair cycle. We previously reported that Syrian hamsters harboring a 1-bp deletion in the Fgf5 gene exhibit excessive hair growth in males. Herein, we generated Fgf5 mutant mice using genome editing via oviductal nucleic acid delivery (GONAD)/improved GONAD (i-GONAD), an in vivo genome editing system used to target early embryos present in the oviductal lumen, to study gender differences in hair length in mutant mice. The two lines (Fgf5go-malc), one with a 2-bp deletion (c.552_553del) and the other with a 1-bp insertion (c.552_553insA) in exon 3 of Fgf5, were successfully established. Each mutation was predicted to disrupt a part of the FGF domain through frameshift mutation (p.Glu184ValfsX128 or p.Glu184ArgfsX128). Fgf5go-malc1 mice had heterogeneously distributed longer hairs than wild-type mice (C57BL/6J). Notably, this change was more evident in males than in females (p < 0.0001). Immunohistochemical analysis revealed the presence of FGF5 protein in the dermal papilla and outer root sheath of the hair follicles from C57BL/6J and Fgf5go-malc1 mice. Histological analysis revealed that the prolonged anagen phase might be the cause of accelerated hair growth in Fgf5go-malc1 mice.
Collapse
Affiliation(s)
- Ryo Takahashi
- Graduate School of Bioindustry, Tokyo University of Agriculture, Abashiri 099-2493, Japan
| | - Gou Takahashi
- Regenerative Medicine Project, Tokyo Metropolitan Institute of Medical Science, Tokyo 156-8506, Japan
| | - Yuichi Kameyama
- Graduate School of Bioindustry, Tokyo University of Agriculture, Abashiri 099-2493, Japan
| | - Masahiro Sato
- Department of Genome Medicine, National Center for Child Health and Development, Tokyo 157-8535, Japan
| | - Masato Ohtsuka
- Department of Molecular Life Science, Division of Basic Medical Science and Molecular Medicine, Tokai University School of Medicine, Isehara 259-1193, Japan
- Center for Matrix Biology and Medicine, Graduate School of Medicine, Tokai University, Isehara 259-1193, Japan
- The Institute of Medical Sciences, Tokai University, Isehara 259-1193, Japan
| | - Kenta Wada
- Graduate School of Bioindustry, Tokyo University of Agriculture, Abashiri 099-2493, Japan
- Correspondence: ; Tel.: +81-152-48-3827
| |
Collapse
|
10
|
Gress A, Srikakulam SK, Keller S, Ramensky V, Kalinina OV. d-StructMAn: Containerized structural annotation on the scale from genetic variants to whole proteomes. Gigascience 2022; 11:6706670. [PMID: 36130085 PMCID: PMC9487898 DOI: 10.1093/gigascience/giac086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Revised: 07/06/2022] [Accepted: 08/18/2022] [Indexed: 11/30/2022] Open
Abstract
Background Structural annotation of genetic variants in the context of intermolecular interactions and protein stability can shed light onto mechanisms of disease-related phenotypes. Three-dimensional structures of related proteins in complexes with other proteins, nucleic acids, or ligands enrich such functional interpretation, since intermolecular interactions are well conserved in evolution. Results We present d-StructMAn, a novel computational method that enables structural annotation of local genetic variants, such as single-nucleotide variants and in-frame indels, and implements it in a highly efficient and user-friendly tool provided as a Docker container. Using d-StructMAn, we annotated several very large sets of human genetic variants, including all variants from ClinVar and all amino acid positions in the human proteome. We were able to provide annotation for more than 46% of positions in the human proteome representing over 60% proteins. Conclusions d-StructMAn is the first of its kind and a highly efficient tool for structural annotation of protein-coding genetic variation in the context of observed and potential intermolecular interactions. d-StructMAn is readily applicable to proteome-scale datasets and can be an instrumental building machine-learning tool for predicting genotype-to-phenotype relationships.
Collapse
Affiliation(s)
- Alexander Gress
- Correspondence address. Alexander Gress, Campus Saarland University 66123 Saarbrücken Building E2.1 Room 101; E-mail:
| | - Sanjay K Srikakulam
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS)/Helmholtz Centre for Infection Research (HZI), Saarbrücken 8: 66123, Germany
- Graduate School of Computer Science, Saarland University, Saarbrücken 5: 101990, Germany
- Interdisciplinary Graduate School of Natural Product Research, Saarland University, Saarbrücken 6: 119991, Germany
| | - Sebastian Keller
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS)/Helmholtz Centre for Infection Research (HZI), Saarbrücken 8: 66123, Germany
- Graduate School of Computer Science, Saarland University, Saarbrücken 5: 101990, Germany
- Research Group Computational Biology, Max Planck Institute for Informatics, Saarbrücken 7: 66421, Germany
| | - Vasily Ramensky
- National Medical Research Center for Therapy and Preventive Medicine of the Ministry of Healthcare of Russian Federation, Moscow, Russia
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
| | - Olga V Kalinina
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS)/Helmholtz Centre for Infection Research (HZI), Saarbrücken 8: 66123, Germany
- Medical Faculty, Saarland University, Homburg, Germany
- Center for Bioinformatics, Saarland Informatics Campus, Saarbrücken, Germany
| |
Collapse
|
11
|
Bruno S, Landi V, Senczuk G, Brooks SA, Almathen F, Faye B, Gaouar SSB, Piro M, Kim KS, David X, Eggen A, Burger P, Ciani E. Refining the Camelus dromedarius Myostatin Gene Polymorphism through Worldwide Whole-Genome Sequencing. Animals (Basel) 2022; 12:2068. [PMID: 36009658 PMCID: PMC9404819 DOI: 10.3390/ani12162068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 08/07/2022] [Accepted: 08/11/2022] [Indexed: 11/16/2022] Open
Abstract
Myostatin (MSTN) is a highly conserved negative regulator of skeletal muscle in mammals. Inactivating mutations results in a hyper-muscularity phenotype known as "double muscling" in several livestock and model species. In Camelus dromedarius, the gene structure organization and the sequence polymorphisms have been previously investigated, using Sanger and Next-Generation Sequencing technologies on a limited number of animals. Here, we carried out a follow-up study with the aim to further expand our knowledge about the sequence polymorphisms at the myostatin locus, through the whole-genome sequencing data of 183 samples representative of the geographical distribution range for this species. We focused our polymorphism analysis on the ±5 kb upstream and downstream region of the MSTN gene. A total of 99 variants (77 Single Nucleotide Polymorphisms and 22 indels) were observed. These were mainly located in intergenic and intronic regions, with only six synonymous Single Nucleotide Polymorphisms in exons. A sequence comparative analysis among the three species within the Camelus genus confirmed the expected higher genetic distance of C. dromedarius from the wild and domestic two-humped camels compared to the genetic distance between C. bactrianus and C. ferus. In silico functional prediction highlighted: (i) 213 differential putative transcription factor-binding sites, out of which 41 relative to transcription factors, with known literature evidence supporting their involvement in muscle metabolism and/or muscle development; and (ii) a number of variants potentially disrupting the canonical MSTN splicing elements, out of which two are discussed here for their potential ability to generate a prematurely truncated (inactive) form of the protein. The distribution of the considered variants in the studied cohort is discussed in light of the peculiar evolutionary history of this species and the hypothesis that extremely high muscularity, associated with a homozygous condition for mutated (inactivating) alleles at the myostatin locus, may represent, in arid desert conditions, a clear metabolic disadvantage, emphasizing the thermoregulatory and water availability challenges typical of these habitats.
Collapse
Affiliation(s)
- Silvia Bruno
- Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari “Aldo Moro”, 70126 Bari, Italy
| | - Vincenzo Landi
- Department of Veterinary Medicine, University of Bari “Aldo Moro”, Valenzano, 70010 Bari, Italy
| | - Gabriele Senczuk
- Department of Agricultural, Environmental and Food Sciences, University of Molise, 86100 Campobasso, Italy
| | - Samantha Ann Brooks
- Department of Animal Sciences, University of Florida, Gainesville, FL 32610, USA
| | - Faisal Almathen
- Department of Public Health, College of Veterinary Medicine, King Faisal University, Al-Ahsa 31982, Saudi Arabia
- Camel Research Center, King Faisal University, Al-Ahsa 31982, Saudi Arabia
| | | | | | - Mohammed Piro
- Department of Medicine, Surgery and Reproduction, Institut Agronomique et Vétérinaire Hassan II, Rabat BP 6202, Morocco
| | - Kwan Suk Kim
- Department of Animal Sciences, Chungbuk National University, Chungbuk 28644, Korea
| | | | | | - Pamela Burger
- Research Institute of Wildlife Ecology, Vetmeduni, 1160 Vienna, Austria
| | - Elena Ciani
- Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari “Aldo Moro”, 70126 Bari, Italy
| |
Collapse
|
12
|
Greco S, Gerdol M. Independent acquisition of short insertions at the RIR1 site in the spike N-terminal domain of the SARS-CoV-2 BA.2 lineage. Transbound Emerg Dis 2022; 69:e3408-e3415. [PMID: 35908169 PMCID: PMC9353284 DOI: 10.1111/tbed.14672] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Revised: 07/17/2022] [Accepted: 07/27/2022] [Indexed: 11/28/2022]
Abstract
Although the major SARS‐CoV‐2 omicron lineages share over 30 non‐synonymous substitutions in the spike glycoprotein, they show several unique mutations that were acquired after their ancestral split. One of the most intriguing mutations associated with BA.1 is the presence of the inserted tripeptide Glu‐Pro‐Glu within the N‐terminal domain, at a site that had previously independently acquired short insertions in several other SARS‐CoV‐2 lineages. Although the functional implications of the small nucleotide sequences found at this insertion hotspot, named RIR1, are still unclear, we have previously hypothesized that they may play a compensatory role in counterbalancing minor fitness deficits associated with other co‐occurring spike non‐synonymous mutations. Here we show that similar insertion events have independently occurred at RIR1 at least 20 times in early 2022 within the BA.2 lineage, being occasionally associated with significant community transmission. One of these omicron sublineages, characterized by a Ser‐Gly‐Arg insertion in position 212, has been responsible of over 4,000 documented covid‐19 cases worldwide between January and July 2022, for the most part concentrated in Denmark, where it reached a national prevalence close to 4% (10% in the Nordjylland region) in mid‐May. Although the concurrent spread of the BA.2.12.1, BA.4 and BA.5 lineages led to the rapid decline of this BA.2 sublineage, the independent acquisition of several other RIR1 insertions on a BA.2 genomic background suggests that these events may provide a slight fitness advantage. Therefore, we they should be carefully monitored in the upcoming months in other emerging omicron‐related lineages, including BA.5. This article is protected by copyright. All rights reserved
Collapse
Affiliation(s)
| | - Marco Gerdol
- Department of Life Sciences, University of Trieste
| |
Collapse
|
13
|
Abstract
Antibodies are important immune molecules that are elicited by B cells to protect our bodies during viral infections or vaccinations. In humans, the antibody repertoire is diversified by programmed DNA lesion processes to ensure specific and high affinity binding to various antigens. Broadly neutralizing antibodies (bnAbs) are antibodies that have strong neutralizing activities against different variants of a virus. bnAbs such as anti-HIV bnAbs often have special characteristics including insertions and deletions, long complementarity determining region 3 (CDR3), and high frequencies of mutations, often at improbable sites of the variable regions. These unique features are rare mutational outcomes that are acquired during antibody diversification processes. In this review, we will discuss possible mechanisms that generate these rare antibody mutational outcomes. The understanding of the mechanisms that generate these rare mutational outcomes during antibody diversification will have implications in vaccine design strategies to elicit bnAbs.
Collapse
|
14
|
Carrington B, Bishop K, Sood R. A Comprehensive Review of Indel Detection Methods for Identification of Zebrafish Knockout Mutants Generated by Genome-Editing Nucleases. Genes (Basel) 2022; 13:857. [PMID: 35627242 PMCID: PMC9141975 DOI: 10.3390/genes13050857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Revised: 05/06/2022] [Accepted: 05/10/2022] [Indexed: 11/16/2022] Open
Abstract
The use of zebrafish in functional genomics and disease modeling has become popular due to the ease of targeted mutagenesis with genome editing nucleases, i.e., zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and clustered regularly interspaced short palindromic repeats/Cas9 (CRISPR/Cas9). These nucleases, specifically CRISPR/Cas9, are routinely used to generate gene knockout mutants by causing a double stranded break at the desired site in the target gene and selecting for frameshift insertions or deletions (indels) caused by the errors during the repair process. Thus, a variety of methods have been developed to identify fish with indels during the process of mutant generation and phenotypic analysis. These methods range from PCR and gel-based low-throughput methods to high-throughput methods requiring specific reagents and/or equipment. Here, we provide a comprehensive review of currently used indel detection methods in zebrafish. By discussing the molecular basis for each method as well as their pros and cons, we hope that this review will serve as a comprehensive resource for zebrafish researchers, allowing them to choose the most appropriate method depending upon their budget, access to required equipment and the throughput needs of the projects.
Collapse
Affiliation(s)
| | | | - Raman Sood
- Zebrafish Core, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA; (B.C.); (K.B.)
| |
Collapse
|
15
|
Carballar-Lejarazú R, Tushar T, Pham TB, James AA. Cas9-mediated maternal-effect and derived resistance alleles in a gene-drive strain of the African malaria vector mosquito, Anopheles gambiae. Genetics 2022; 221:6564662. [PMID: 35389492 PMCID: PMC9157122 DOI: 10.1093/genetics/iyac055] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Accepted: 03/30/2022] [Indexed: 11/24/2022] Open
Abstract
CRISPR/Cas9 technologies are important tools for the development of gene-drive systems to modify mosquito vector populations to control the transmission of pathogens that cause diseases such as malaria. However, one of the challenges for current Cas9-based drive systems is their ability to produce drive-resistant alleles resulting from insertions and deletions (indels) caused principally by nonhomologous end-joining following chromosome cleavage. Rapid increases in the frequency of such alleles may impair gene-drive dynamics. We explored the generation of indels in the germline and somatic cells in female gene-drive lineages using a series of selective crosses between a gene-drive line, AgNosCd-1, and wild-type mosquitoes. We find that potential drive-resistant mutant alleles are generated largely during embryonic development, most likely caused by deposition of the Cas9 endonuclease and guide RNAs in oocytes and resulting embryos by homozygous and hemizygous gene-drive mothers.
Collapse
Affiliation(s)
- Rebeca Carballar-Lejarazú
- Department of Microbiology & Molecular Genetics, University of California, Irvine, Irvine, CA 92697-4025, USA
| | - Taylor Tushar
- Department of Microbiology & Molecular Genetics, University of California, Irvine, Irvine, CA 92697-4025, USA
| | - Thai Binh Pham
- Department of Molecular Biology & Biochemistry, University of California, Irvine, Irvine, CA 92697-3900, USA
| | - Anthony A James
- Department of Microbiology & Molecular Genetics, University of California, Irvine, Irvine, CA 92697-4025, USA.,Department of Molecular Biology & Biochemistry, University of California, Irvine, Irvine, CA 92697-3900, USA
| |
Collapse
|
16
|
Gala M, Pristaš P, Žoldák G. Allosteric Inter-Domain Contacts in Bacterial Hsp70 Are Located in Regions That Avoid Insertion and Deletion Events. Int J Mol Sci 2022; 23:2788. [PMID: 35269930 DOI: 10.3390/ijms23052788] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Revised: 02/24/2022] [Accepted: 02/25/2022] [Indexed: 02/04/2023] Open
Abstract
Heat shock proteins 70 (Hsp70) are chaperones consisting of a nucleotide-binding domain (NBD) and a substrate-binding domain (SBD), the latter of which binds protein clients. After ATP binds to the NBD, the SBD α/β subdomains’ shared interface opens, and the open SBD docks to the NBD. Such allosteric effects are stabilized by the newly formed NBD-SBD interdomain contacts. In this paper, we examined how such an opening and formation of subdomain interfaces is affected during the evolution of Hsp70. In particular, insertion and deletion events (indels) can be highly disruptive for the mechanical events since such changes introduce a collective shift in the pairing interactions at communicating interfaces. Based on a multiple sequence alignment analysis of data collected from Swiss-Prot/UniProt database, we find several indel-free regions (IFR) in Hsp70. The two largest IFRs are located in interdomain regions that participate in allosteric structural changes. We speculate that the reason why the indels have a lower likelihood of occurrence in these regions is that indel events in these regions cause dysfunction in the protein due to perturbations of the mechanical balance. Thus, the development of functional allosteric machines requires including in the rational design a concept of the balance between structural elements.
Collapse
|
17
|
Mustapha UF, Assan D, Huang YQ, Li GL, Jiang DN. High Polymorphism in the Dmrt2a Gene Is Incompletely Sex-Linked in Spotted Scat, Scatophagus argus. Animals (Basel) 2022; 12:ani12050613. [PMID: 35268179 PMCID: PMC8909180 DOI: 10.3390/ani12050613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Revised: 02/14/2022] [Accepted: 02/25/2022] [Indexed: 12/10/2022] Open
Abstract
Unlike mammals and birds, many fishes have young sex chromosomes, providing excellent models to study sex chromosome differentiation at early stages. Previous studies showed that spotted scat possesses an XX-XY sex determination system. The X has a complete Dmrt3 copy (termed normal) and a truncated copy of Dmrt1 (called Dmrt1b), while the Y has the opposite (normal Dmrt1, which is male-specific, and a truncated Dmrt3 called Dmrt3△-Y). Dmrt1 is the candidate sex determination gene, while the differentiation of other sex-linked genes remains unknown. The spotted scat has proven to be a good model to study the evolution of sex chromosomes in vertebrates. Herein, we sequenced a neighbor gene of this family, Dmrt2, positioned farther from Dmrt1 and closer to Dmrt3 in the spotted scat, and analyzed its sequence variation and expression profiles. The physical locations of the three genes span across an estimated size of >40 kb. The open reading frames of Dmrt2a and its paralog Dmrt2b are 1578 bp and 1311 bp, encoding peptides of 525 and 436 amino acid residues, respectively. Dmrt2a is positioned close to Dmrt3 but farther from Dmrt1 on the same chromosome, while Dmrt2b is not. Sequence analysis revealed several mutations; insertions, and deletions (indels) on Dmrt2a non-coding regions and single-nucleotide polymorphisms (SNPs) on the Dmrt2a transcript. These indels and SNPs are sex-linked and showed high male heterogeneity but do not affect gene translation. The markers designed to span the mutation sites tested on four different populations showed varied concordance with the genetic sexes. Dmrt2a is transcribed solely in the gonads and gills, while Dmrt2b exists in the gonads, hypothalamus, gills, heart, and spleen. The Dmrt2a and Dmrt2b transcripts are profoundly expressed in the male gonads. Analyses of the transcriptome data from five other fish species (Hainan medaka (Oryzias curvinotus), silver sillago (Sillago sihama), Nile tilapia (Oreochromis niloticus), Hong Kong catfish (Clarias fuscus), and spot-fin porcupine fish (Diodon hystrix)) revealed testes-biased expression of Dmrt1 in all, similar to spotted scat. Additionally, the expression of Dmrt2a is higher in the testes than the ovaries in spotted scat and Hainan medaka. The Dmrt2a transcript was not altered in the coding regions as found in Dmrt1 and Dmrt3 in spotted scat. This could be due to the functional importance of Dmrt2a in development. Another possibility is that because Dmrt2a is positioned farther from Dmrt1 and the chromosome is still young, meaning it is only a matter of time before it differentiates. This study undeniably will aid in understanding the functional divergence of the sex-linked genes in fish.
Collapse
|
18
|
Raymond PW, Velie BD, Wade CM. Forensic DNA phenotyping: Canis familiaris breed classification and skeletal phenotype prediction using functionally significant skeletal SNPs and indels. Anim Genet 2021; 53:247-263. [PMID: 34963196 DOI: 10.1111/age.13165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Revised: 11/30/2021] [Accepted: 12/12/2021] [Indexed: 11/29/2022]
Abstract
This review highlights a novel application of breed identification and prediction of skeletal traits in forensic investigations using canine DNA evidence. Currently, genotyping methods used for canine breed classification involve the application of highly polymorphic short tandem repeats in addition to larger commercially available SNP arrays. Both applications face technical challenges. An additional approach to breed identification could be through genotyping SNPs and indels that characterise the array of skeletal differences displayed across domestic dog populations. Research has shown that a small number of genetic variants of large effect drive differences in skeletal phenotypes among domestic dog breeds. This feature makes functionally significant canine skeletal variants a cost-effective target for forensic investigators to classify individuals according to their breed. Further analysis of these skeletal variants would enable the prediction of external appearance. To date, functionally significant genes with genetic variants associated with differences in size, bulk, skull shape, ear shape, limb length, digit type, and tail morphology have been uncovered. Recommendations of a cost-effective genotyping method that can be readily designed and applied by forensic investigators have been given. Further advances to improve the field of canine skeletal forensic DNA phenotyping include the refinement of phenotyping methods, further biological validation of the skeletal genetic variants and establishing a publicly available database for storage of allele frequencies of the skeletal genetic variants in the wider domestic dog population.
Collapse
Affiliation(s)
- Patrick W Raymond
- School of Life and Environmental Sciences, University of Sydney, Sydney, Australia
| | - Brandon D Velie
- School of Life and Environmental Sciences, University of Sydney, Sydney, Australia
| | - Claire M Wade
- School of Life and Environmental Sciences, University of Sydney, Sydney, Australia
| |
Collapse
|
19
|
Rao RSP, Ahsan N, Xu C, Su L, Verburgt J, Fornelli L, Kihara D, Xu D. Evolutionary Dynamics of Indels in SARS-CoV-2 Spike Glycoprotein. Evol Bioinform Online 2021; 17:11769343211064616. [PMID: 34898980 PMCID: PMC8655444 DOI: 10.1177/11769343211064616] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2021] [Accepted: 11/12/2021] [Indexed: 01/28/2023] Open
Abstract
SARS-CoV-2, responsible for the current COVID-19 pandemic that claimed over 5.0 million lives, belongs to a class of enveloped viruses that undergo quick evolutionary adjustments under selection pressure. Numerous variants have emerged in SARS-CoV-2, posing a serious challenge to the global vaccination effort and COVID-19 management. The evolutionary dynamics of this virus are only beginning to be explored. In this work, we have analysed 1.79 million spike glycoprotein sequences of SARS-CoV-2 and found that the virus is fine-tuning the spike with numerous amino acid insertions and deletions (indels). Indels seem to have a selective advantage as the proportions of sequences with indels steadily increased over time, currently at over 89%, with similar trends across countries/variants. There were as many as 420 unique indel positions and 447 unique combinations of indels. Despite their high frequency, indels resulted in only minimal alteration of N-glycosylation sites, including both gain and loss. As indels and point mutations are positively correlated and sequences with indels have significantly more point mutations, they have implications in the evolutionary dynamics of the SARS-CoV-2 spike glycoprotein.
Collapse
Affiliation(s)
- R Shyama Prasad Rao
- Biostatistics and Bioinformatics Division, Yenepoya Research Center, Yenepoya University, Mangaluru, Karnataka, India
| | - Nagib Ahsan
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, OK, USA
- Mass Spectrometry, Proteomics and Metabolomics Core Facility, Stephenson Life Sciences Research Center, University of Oklahoma, Norman, OK, USA
| | - Chunhui Xu
- Department of Electrical Engineering and Computer Science, Informatics Institute, and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Lingtao Su
- Department of Electrical Engineering and Computer Science, Informatics Institute, and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Jacob Verburgt
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Luca Fornelli
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, OK, USA
- Department of Biology, University of Oklahoma, Norman, OK, USA
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Dong Xu
- Department of Electrical Engineering and Computer Science, Informatics Institute, and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| |
Collapse
|
20
|
Abstract
Recently, we proposed an efficient ILP formulation [Rubert DP, Martinez FV, Braga MDV, Natural family-free genomic distance, Algorithms Mol Biol 16:4, 2021] for exactly computing the rearrangement distance of two genomes in a family-free setting. In such a setting, neither prior classification of genes into families, nor further restrictions on the genomes are imposed. Given two genomes, the mentioned ILP computes an optimal matching of the genes taking into account simultaneously local mutations, given by gene similarities, and large-scale genome rearrangements. Here, we explore the potential of using this ILP for inferring groups of orthologs across several species. More precisely, given a set of genomes, our method first computes all pairwise optimal gene matchings, which are then integrated into gene families in the second step. Our approach is implemented into a pipeline incorporating the pre-computation of gene similarities. It can be downloaded from gitlab.ub.uni-bielefeld.de/gi/FFGC. We obtained promising results with experiments on both simulated and real data.
Collapse
Affiliation(s)
- Diego P Rubert
- Faculdade de Computação, Universidade Federal de Mato Grosso do Sul, Campo Grande, Brazil
| | - Daniel Doerr
- Faculty of Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Marília D V Braga
- Faculty of Technology and CeBiTec, Bielefeld University, Bielefeld, Germany
| |
Collapse
|
21
|
Alexandrino AO, Oliveira AR, Dias U, Dias Z. Incorporating intergenic regions into reversal and transposition distances with indels. J Bioinform Comput Biol 2021; 19:2140011. [PMID: 34775923 DOI: 10.1142/s0219720021400114] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Problems in the genome rearrangement field are often formulated in terms of pairwise genome comparison: given two genomes [Formula: see text] and [Formula: see text], find the minimum number of genome rearrangements that may have occurred during the evolutionary process. This broad definition lacks at least two important considerations: the first being which features are extracted from genomes to create a useful mathematical model, and the second being which types of genome rearrangement events should be represented. Regarding the first consideration, seminal works in the genome rearrangement field solely used gene order to represent genomes as permutations of integer numbers, neglecting many important aspects like gene duplication, intergenic regions, and complex interactions between genes. Regarding the second consideration, some rearrangement events are widely studied such as reversals and transpositions. In this paper, we shed light on the first consideration and created a model that takes into account gene order and the number of nucleotides in intergenic regions. In addition, we consider events of reversals, transpositions, and indels (insertions and deletions) of genomic material. We present a 4-approximation algorithm for reversals and indels, a [Formula: see text]-approximation algorithm for transpositions and indels, and a 6-approximation for reversals, transpositions, and indels.
Collapse
Affiliation(s)
| | - Andre Rodrigues Oliveira
- Institute of Computing, University of Campinas, 1251 Albert Einstein Ave., 13083-852 Campinas, São Paulo, Brazil
| | - Ulisses Dias
- School of Technology, University of Campinas, 1888 Paschoal Marmo St., 13484-332 Limeira, São Paulo, Brazil
| | - Zanoni Dias
- Institute of Computing, University of Campinas, 1251 Albert Einstein Ave., 13083-852 Campinas, São Paulo, Brazil
| |
Collapse
|
22
|
Alexandrino AO, Oliveira AR, Dias U, Dias Z. Labeled Cycle Graph for Transposition and Indel Distance. J Comput Biol 2021; 29:243-256. [PMID: 34724796 DOI: 10.1089/cmb.2021.0279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In the comparative genomics field, one way to infer the evolutionary distance between two organisms of related species is by finding the minimum number of large-scale mutations, called genome rearrangements, that transform one genome into the other. This number is referred to as the rearrangement distance. Since problems in this area emerged in the mid-1990s, several genome rearrangements have been proposed. Rearrangements that do not alter the genome content are called conservative, and in this group we have the following: the reversal, which inverts a segment of the genome; the transposition, which exchanges two consecutive segments; and the double cut and join, which cuts two different pairs of adjacent blocks and joins them differently. Seminal works compared genomes sharing the same set of conserved blocks, but nowadays, researchers started looking at genomes with unequal gene content, by allowing the use of nonconservative rearrangements such as insertion and deletion (jointly called indel). The transposition distance and the transposition and indel distance are both NP-hard. We investigate the transposition and indel distance and present a structure called labeled cycle graph, representing an instance of rearrangement distance problems for genomes with unequal gene content. This structure is used to devise a lower bound and a 2-approximation algorithm for the transposition and indel distance.
Collapse
Affiliation(s)
| | | | - Ulisses Dias
- School of Technology, University of Campinas, Limeira, Brazil
| | - Zanoni Dias
- Institute of Computing, University of Campinas, Campinas, Brazil
| |
Collapse
|
23
|
Foster PL, Niccum BA, Lee H. DNA Replication-Transcription Conflicts Do Not Significantly Contribute to Spontaneous Mutations Due to Replication Errors in Escherichia coli. mBio 2021; 12:e0250321. [PMID: 34634932 DOI: 10.1128/mBio.02503-21] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Encounters between DNA replication and transcription can cause genomic disruption, particularly when the two meet head-on. Whether these conflicts produce point mutations is debated. This paper presents detailed analyses of a large collection of mutations generated during mutation accumulation experiments with mismatch repair (MMR)-defective Escherichia coli. With MMR absent, mutations are primarily due to DNA replication errors. Overall, there were no differences in the frequencies of base pair substitutions or small indels (i.e., insertion and deletions of ≤4 bp) in the coding sequences or promoters of genes oriented codirectionally versus head-on to replication. Among a subset of highly expressed genes, there was a 2- to 3-fold bias for indels in genes oriented head-on to replication, but this difference was almost entirely due to the asymmetrical genomic locations of tRNA genes containing mononucleotide runs, which are hot spots for indels. No additional orientation bias in mutation frequencies occurred when MMR− strains were also defective for transcription-coupled repair (TCR). However, in contrast to other reports, loss of TCR slightly increased the overall mutation rate, meaning that TCR is antimutagenic. There was no orientation bias in mutation frequencies among the stress response genes that are regulated by RpoS or induced by DNA damage. Thus, biases in the locations of mutational targets can account for most, if not all, apparent biases in mutation frequencies between genes oriented head-on versus codirectional to replication. In addition, the data revealed a strong correlation of the frequency of base pair substitutions with gene length but no correlation with gene expression levels.
Collapse
|
24
|
Loewenthal G, Rapoport D, Avram O, Moshe A, Wygoda E, Itzkovitch A, Israeli O, Azouri D, Cartwright RA, Mayrose I, Pupko T. A probabilistic model for indel evolution: differentiating insertions from deletions. Mol Biol Evol 2021; 38:5769-5781. [PMID: 34469521 PMCID: PMC8662616 DOI: 10.1093/molbev/msab266] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Insertions and deletions (indels) are common molecular evolutionary events. However, probabilistic models for indel evolution are under-developed due to their computational complexity. Here, we introduce several improvements to indel modeling: 1) While previous models for indel evolution assumed that the rates and length distributions of insertions and deletions are equal, here we propose a richer model that explicitly distinguishes between the two; 2) we introduce numerous summary statistics that allow approximate Bayesian computation-based parameter estimation; 3) we develop a method to correct for biases introduced by alignment programs, when inferring indel parameters from empirical data sets; and 4) using a model-selection scheme, we test whether the richer model better fits biological data compared with the simpler model. Our analyses suggest that both our inference scheme and the model-selection procedure achieve high accuracy on simulated data. We further demonstrate that our proposed richer model better fits a large number of empirical data sets and that, for the majority of these data sets, the deletion rate is higher than the insertion rate.
Collapse
Affiliation(s)
- Gil Loewenthal
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Dana Rapoport
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Oren Avram
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Asher Moshe
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Elya Wygoda
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Alon Itzkovitch
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Omer Israeli
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Dana Azouri
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel.,School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Reed A Cartwright
- The Biodesign Institute, Arizona State University, Tempe, Arizona, USA.,School of Life Sciences, Arizona State University, Tempe, Arizona, USA
| | - Itay Mayrose
- School of Plant Sciences and Food Security, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | - Tal Pupko
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| |
Collapse
|
25
|
Abstract
Aim: To demonstrate that MSI-WES is an accurate testing method for microsatellite instability (MSI). Materials & methods: Microsatellite-based indels were counted in the variant call-formatted whole exome sequencing (WES) data of 441 gastric cancer cases using Unix-based algorithms, and the counts expressed as a fraction of the genome sequenced to obtain next-generation sequencing-based MSI indices. Results: The next-generation sequencing-based MSI indices showed a near-perfect concordance with PCR-based MSI status, and moderate to good correlations with the molecular targets of MSI index, MLH1 expression and MLH1 methylation status, at a level comparable to the strengths of correlation between PCR-based MSI status and molecular targets of MSI index/MLH1 expression and methylation. Conclusion: MSI-WES is a valid, adequate and sensitive approach for testing MSI in cancer.
Collapse
Affiliation(s)
- Henry O Ebili
- Division of Cancer & Stem Cell, University of Nottingham, Nottingham, NG7 2UH, UK.,Department of Morbid Anatomy & Histopathology, Olabisi Onabanjo University, Ago-Iwoye, Nigeria
| | - Adedeji Oj Agboola
- Department of Morbid Anatomy & Histopathology, Olabisi Onabanjo University, Ago-Iwoye, Nigeria
| | - Emad Rakha
- Division of Cancer & Stem Cell, University of Nottingham, Nottingham, NG7 2UH, UK
| |
Collapse
|
26
|
Kulski JK, Suzuki S, Shiina T. Haplotype Shuffling and Dimorphic Transposable Elements in the Human Extended Major Histocompatibility Complex Class II Region. Front Genet 2021; 12:665899. [PMID: 34122517 PMCID: PMC8193847 DOI: 10.3389/fgene.2021.665899] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 04/12/2021] [Indexed: 12/26/2022] Open
Abstract
The major histocompatibility complex (MHC) on chromosome 6p21 is one of the most single-nucleotide polymorphism (SNP)-dense regions of the human genome and a prime model for the study and understanding of conserved sequence polymorphisms and structural diversity of ancestral haplotypes/conserved extended haplotypes. This study aimed to follow up on a previous analysis of the MHC class I region by using the same set of 95 MHC haplotype sequences downloaded from a publicly available BioProject database at the National Center for Biotechnology Information to identify and characterize the polymorphic human leukocyte antigen (HLA)-class II genes, the MTCO3P1 pseudogene alleles, the indels of transposable elements as haplotypic lineage markers, and SNP-density crossover (XO) loci at haplotype junctions in DNA sequence alignments of different haplotypes across the extended class II region (∼1 Mb) from the telomeric PRRT1 gene in class III to the COL11A2 gene at the centromeric end of class II. We identified 42 haplotypic indels (20 Alu, 7 SVA, 13 LTR or MERs, and 2 indels composed of a mosaic of different transposable elements) linked to particular HLA-class II alleles. Comparative sequence analyses of 136 haplotype pairs revealed 98 unique XO sites between SNP-poor and SNP-rich genomic segments with considerable haplotype shuffling located in the proximity of putative recombination hotspots. The majority of XO sites occurred across various regions including in the vicinity of MTCO3P1 between HLA-DQB1 and HLA-DQB3, between HLA-DQB2 and HLA-DOB, between DOB and TAP2, and between HLA-DOA and HLA-DPA1, where most XOs were within a HERVK22 sequence. We also determined the genomic positions of the PRDM9-recombination suppression sequence motif ATCCATG/CATGGAT and the PRDM9 recombination activation partial binding motif CCTCCCCT/AGGGGAG in the class II region of the human reference genome (NC_ 000006) relative to published meiotic recombination positions. Both the recombination and anti-recombination PRDM9 binding motifs were widely distributed throughout the class II genomic regions with 50% or more found within repeat elements; the anti-recombination motifs were found mostly in L1 fragmented repeats. This study shows substantial haplotype shuffling between different polymorphic blocks and confirms the presence of numerous putative ancestral recombination sites across the class II region between various HLA class II genes.
Collapse
Affiliation(s)
- Jerzy K Kulski
- Faculty of Health and Medical Sciences, The University of Western Australia, Crawley, WA, Australia.,Department of Molecular Life Sciences, Division of Basic Medical Science and Molecular Medicine, Tokai University School of Medicine, Isehara, Japan
| | - Shingo Suzuki
- Department of Molecular Life Sciences, Division of Basic Medical Science and Molecular Medicine, Tokai University School of Medicine, Isehara, Japan
| | - Takashi Shiina
- Department of Molecular Life Sciences, Division of Basic Medical Science and Molecular Medicine, Tokai University School of Medicine, Isehara, Japan
| |
Collapse
|
27
|
Mathioudakis MM, Maliogka VI, Candresse T, Nickel O, Fajardo TVM, Budzyńska D, Hasiów-Jaroszewska B, Katis NI. Molecular Characterization of the Coat Protein Gene of Greek Apple Stem Pitting Virus Isolates: Evolution through Deletions, Insertions, and Recombination Events. Plants (Basel) 2021; 10:917. [PMID: 34063623 DOI: 10.3390/plants10050917] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Revised: 04/26/2021] [Accepted: 04/27/2021] [Indexed: 01/12/2023]
Abstract
A RT–PCR assay developed to amplify the full coat protein (CP) gene of apple stem pitting virus (ASPV) was evaluated using 180 Greek apple and pear samples and showed a broad detection range. This method was used to investigate the presence of ASPV in quince in Greece and showed a high incidence of 52%. The sequences of 14 isolates from various hosts with a distinct RFLP profile were determined. ASPV population genetics and the factors driving ASPV evolution were analyzed using the Greek ASPV sequences, novel sequences from Brazilian apple trees and Chinese botanical Pyrus species, and homologous sequences retrieved from GenBank. Fourteen variant types of Greek, Brazilian and botanical isolates, which differ in CP gene length and presence of indels, were identified. In addition, these analyses showed high intra- and inter-group variation among isolates from different countries and hosts, indicating the significant variability present in ASPV. Recombination events were detected in four isolates originating from Greek pear and quince and two from Brazilian apples. In a phylogenetic analysis, there was a tendency for isolates to cluster together based on CP gene length, the isolation host, and the detection method applied. Although there was no strict clustering based on geographical origin, most isolates from a given country tended to regroup in specific clusters. Interestingly, it was found that the phylogeny was correlated to the type, position, and pattern of indels, which represent hallmarks of specific lineages and indicate their possible role in virus diversification, rather than the CP size itself. Evidence of recombination between isolates from botanical and cultivated species and the clustering of isolates from botanical species and isolates from cultivated species suggest the existence of a possible undetermined transmission mechanism allowing the exchange of ASPV isolates between the cultivated and wild/ornamental hosts.
Collapse
|
28
|
Nasir A, Mughal F, Caetano-Anollés G. The tree of life describes a tripartite cellular world. Bioessays 2021; 43:e2000343. [PMID: 33837594 DOI: 10.1002/bies.202000343] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2020] [Revised: 03/11/2021] [Accepted: 03/15/2021] [Indexed: 12/28/2022]
Abstract
The canonical view of a 3-domain (3D) tree of life was recently challenged by the discovery of Asgardarchaeota encoding eukaryote signature proteins (ESPs), which were treated as missing links of a 2-domain (2D) tree. Here we revisit the debate. We discuss methodological limitations of building trees with alignment-dependent approaches, which often fail to satisfactorily address the problem of ''gaps.'' In addition, most phylogenies are reconstructed unrooted, neglecting the power of direct rooting methods. Alignment-free methodologies lift most difficulties but require employing realistic evolutionary models. We argue that the discoveries of Asgards and ESPs, by themselves, do not rule out the 3D tree, which is strongly supported by comparative and evolutionary genomic analyses and vast genomic and biochemical superkingdom distinctions. Given uncertainties of retrodiction and interpretation difficulties, we conclude that the 3D view has not been falsified but instead has been strengthened by genomic analyses. In turn, the objections to the 2D model have not been lifted. The debate remains open. Also see the video abstract here: https://youtu.be/-6TBN0bubI8.
Collapse
Affiliation(s)
- Arshan Nasir
- Theoretical Biology and Biophysics (T-6), Los Alamos National Laboratory, Los Alamos, New Mexico, USA
| | - Fizza Mughal
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Gustavo Caetano-Anollés
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| |
Collapse
|
29
|
Kulski JK, Suzuki S, Shiina T. SNP-Density Crossover Maps of Polymorphic Transposable Elements and HLA Genes Within MHC Class I Haplotype Blocks and Junction. Front Genet 2021; 11:594318. [PMID: 33537058 PMCID: PMC7848197 DOI: 10.3389/fgene.2020.594318] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Accepted: 11/24/2020] [Indexed: 12/12/2022] Open
Abstract
The genomic region (~4 Mb) of the human major histocompatibility complex (MHC) on chromosome 6p21 is a prime model for the study and understanding of conserved polymorphic sequences (CPSs) and structural diversity of ancestral haplotypes (AHs)/conserved extended haplotypes (CEHs). The aim of this study was to use a set of 95 MHC genomic sequences downloaded from a publicly available BioProject database at NCBI to identify and characterise polymorphic human leukocyte antigen (HLA) class I genes and pseudogenes, MICA and MICB, and retroelement indels as haplotypic lineage markers, and single-nucleotide polymorphism (SNP) crossover loci in DNA sequence alignments of different haplotypes across the Olfactory Receptor (OR) gene region (~1.2 Mb) and the MHC class I region (~1.8 Mb) from the GPX5 to the MICB gene. Our comparative sequence analyses confirmed the identity of 12 haplotypic retroelement markers and revealed that they partitioned the HLA-A/B/C haplotypes into distinct evolutionary lineages. Crossovers between SNP-poor and SNP-rich regions defined the sequence range of haplotype blocks, and many of these crossover junctions occurred within particular transposable elements, lncRNA, OR12D2, MUC21, MUC22, PSORS1A3, HLA-C, HLA-B, and MICA. In a comparison of more than 250 paired sequence alignments, at least 38 SNP-density crossover sites were mapped across various regions from GPX5 to MICB. In a homology comparison of 16 different haplotypes, seven CEH/AH (7.1, 8.1, 18.2, 51.x, 57.1, 62.x, and 62.1) had no detectable SNP-density crossover junctions and were SNP poor across the entire ~2.8 Mb of sequence alignments. Of the analyses between different recombinant haplotypes, more than half of them had SNP crossovers within 10 kb of LTR16B/ERV3-16A3_I, MLT1, Charlie, and/or THE1 sequences and were in close vicinity to structurally polymorphic Alu and SVA insertion sites. These studies demonstrate that (1) SNP-density crossovers are associated with putative ancestral recombination sites that are widely spread across the MHC class I genomic region from at least the telomeric OR12D2 gene to the centromeric MICB gene and (2) the genomic sequences of MHC homozygous cell lines are useful for analysing haplotype blocks, ancestral haplotypic landscapes and markers, CPSs, and SNP-density crossover junctions.
Collapse
Affiliation(s)
- Jerzy K. Kulski
- Faculty of Health and Medical Sciences, Medical School, The University of Western Australia, Crawley, WA, Australia
- Division of Basic Medical Science and Molecular Medicine, Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Japan
| | - Shingo Suzuki
- Division of Basic Medical Science and Molecular Medicine, Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Japan
| | - Takashi Shiina
- Division of Basic Medical Science and Molecular Medicine, Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Japan
| |
Collapse
|
30
|
Oleksyk TK, Wolfsberger WW, Weber AM, Shchubelka K, Oleksyk OT, Levchuk O, Patrus A, Lazar N, Castro-Marquez SO, Hasynets Y, Boldyzhar P, Neymet M, Urbanovych A, Stakhovska V, Malyar K, Chervyakova S, Podoroha O, Kovalchuk N, Rodriguez-Flores JL, Zhou W, Medley S, Battistuzzi F, Liu R, Hou Y, Chen S, Yang H, Yeager M, Dean M, Mills RE, Smolanka V. Genome diversity in Ukraine. Gigascience 2021; 10:6079618. [PMID: 33438729 PMCID: PMC7804371 DOI: 10.1093/gigascience/giaa159] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Revised: 08/21/2020] [Accepted: 12/15/2020] [Indexed: 01/21/2023] Open
Abstract
Background The main goal of this collaborative effort is to provide genome-wide data for the previously underrepresented population in Eastern Europe, and to provide cross-validation of the data from genome sequences and genotypes of the same individuals acquired by different technologies. We collected 97 genome-grade DNA samples from consented individuals representing major regions of Ukraine that were consented for public data release. BGISEQ-500 sequence data and genotypes by an Illumina GWAS chip were cross-validated on multiple samples and additionally referenced to 1 sample that has been resequenced by Illumina NovaSeq6000 S4 at high coverage. Results The genome data have been searched for genomic variation represented in this population, and a number of variants have been reported: large structural variants, indels, copy number variations, single-nucletide polymorphisms, and microsatellites. To our knowledge, this study provides the largest to-date survey of genetic variation in Ukraine, creating a public reference resource aiming to provide data for medical research in a large understudied population. Conclusions Our results indicate that the genetic diversity of the Ukrainian population is uniquely shaped by evolutionary and demographic forces and cannot be ignored in future genetic and biomedical studies. These data will contribute a wealth of new information bringing forth a wealth of novel, endemic and medically related alleles.
Collapse
Affiliation(s)
- Taras K Oleksyk
- Department of Biological Sciences, Uzhhorod National University, 32 Voloshyna Str., Uzhhorod 88000, Ukraine.,Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA.,Departamento de Biología, Universidad de Puerto Rico, Mayagüez, PR 00682, USA
| | - Walter W Wolfsberger
- Department of Biological Sciences, Uzhhorod National University, 32 Voloshyna Str., Uzhhorod 88000, Ukraine.,Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA.,Departamento de Biología, Universidad de Puerto Rico, Mayagüez, PR 00682, USA
| | - Alexandra M Weber
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Khrystyna Shchubelka
- Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA.,Departamento de Biología, Universidad de Puerto Rico, Mayagüez, PR 00682, USA.,Department of Medicine, Uzhhorod National University, Uzhhorod 88000, Ukraine
| | - Olga T Oleksyk
- A. Novak Transcarpathian Regional Clinical Hospital, Uzhhorod 88000, Ukraine
| | | | | | | | - Stephanie O Castro-Marquez
- Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA.,Departamento de Biología, Universidad de Puerto Rico, Mayagüez, PR 00682, USA
| | - Yaroslava Hasynets
- Department of Biological Sciences, Uzhhorod National University, 32 Voloshyna Str., Uzhhorod 88000, Ukraine
| | - Patricia Boldyzhar
- Department of Medicine, Uzhhorod National University, Uzhhorod 88000, Ukraine
| | - Mikhailo Neymet
- Velyka Kopanya Family Hospital, Transcarpatia 90330, Ukraine
| | | | | | - Kateryna Malyar
- I.I.Mechnikov Dnipro Regional Clinical Hospital, Dnipro 49000, Ukraine
| | | | | | - Natalia Kovalchuk
- Rivne Regional Specialized Hospital of Radiation Protection, Rivne 33028, Ukraine
| | | | - Weichen Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Sarah Medley
- Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA
| | - Fabia Battistuzzi
- Department of Biological Sciences,Oakland University, Dodge Hall, 118 Library Dr., Rochester, MI 48309, USA
| | - Ryan Liu
- BGI Shenzhen, Shenzhen, 518083, China
| | - Yong Hou
- BGI Shenzhen, Shenzhen, 518083, China
| | - Siru Chen
- BGI Shenzhen, Shenzhen, 518083, China
| | | | - Meredith Yeager
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
| | - Michael Dean
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
| | - Ryan E Mills
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.,Department of Human Genetics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Volodymyr Smolanka
- Department of Medicine, Uzhhorod National University, Uzhhorod 88000, Ukraine
| |
Collapse
|
31
|
Abstract
We introduce a systematic method of approximating finite-time transition probabilities for continuous-time insertion-deletion models on sequences. The method uses automata theory to describe the action of an infinitesimal evolutionary generator on a probability distribution over alignments, where both the generator and the alignment distribution can be represented by pair hidden Markov models (HMMs). In general, combining HMMs in this way induces a multiplication of their state spaces; to control this, we introduce a coarse-graining operation to keep the state space at a constant size. This leads naturally to ordinary differential equations for the evolution of the transition probabilities of the approximating pair HMM. The TKF91 model emerges as an exact solution to these equations for the special case of single-residue indels. For the more general case of multiple-residue indels, the equations can be solved by numerical integration. Using simulated data, we show that the resulting distribution over alignments, when compared to previous approximations, is a better fit over a broader range of parameters. We also propose a related approach to develop differential equations for sufficient statistics to estimate the underlying instantaneous indel rates by expectation maximization. Our code and data are available at https://github.com/ihh/trajectory-likelihood.
Collapse
Affiliation(s)
- Ian Holmes
- Department of Bioengineering, University of California, Berkeley, California 94720
| |
Collapse
|
32
|
Bi Y, Chen Y, Xin D, Liu T, He L, Kang Y, Pan C, Shen W, Lan X, Liu M. Effect of indel variants within the sorting nexin 29 (SNX29) gene on growth traits of goats. Anim Biotechnol 2020; 33:914-919. [PMID: 33208046 DOI: 10.1080/10495398.2020.1846547] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
The sorting nexin 29 gene (SNX29) is a well-known regulator of myocyte differentiation and proliferation. In this work, two indels (17-bp and 21-bp) were identified in the goat SNX29 gene, and their effects on the growth traits of 1,759 Shaanbei white cashmere (SBWC) goats were analyzed. Both indels had three genotypes [homozygote wild type (II), heterozygote (ID), and homozygote mutation (DD)] and displayed medium genetic diversity (0.25 < polymorphism information content (PIC) < 0.50) in the population. The 17-bp indel was significantly associated with chest width (p = 0.009), body weight (p = 0.021), and chest depth (p = 0.032), with the II genotype dominant. The 21-bp indel was significantly associated with chest width (p = 0.001), chest depth (p = 4.8E-5), heart girth (p = 0.007), and hip width (p = 0.002). Because the two indels were in the upstream (17-bp) and intron (21-bp) regions of the SNX29 gene, transcription factor binding sites were predicted. The IRF5 and MYC could bind with the 17-bp indel and 21-bp indel sequences, respectively. This study indicates that SNX29 is a promising candidate gene that can be used to improve meat production in goat breeding.
Collapse
Affiliation(s)
- Yi Bi
- Animal Nutritional Genome and Germplasm Innovation Research Center, College of Animal Science and Technology, Hunan Agricultural University, Changsha, China.,College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Yuhan Chen
- Animal Nutritional Genome and Germplasm Innovation Research Center, College of Animal Science and Technology, Hunan Agricultural University, Changsha, China
| | - Dongyun Xin
- College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Tingting Liu
- College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Libang He
- Animal Nutritional Genome and Germplasm Innovation Research Center, College of Animal Science and Technology, Hunan Agricultural University, Changsha, China
| | - Yuxin Kang
- College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Chuanying Pan
- College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Weijun Shen
- Animal Nutritional Genome and Germplasm Innovation Research Center, College of Animal Science and Technology, Hunan Agricultural University, Changsha, China
| | - Xianyong Lan
- College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Mei Liu
- Animal Nutritional Genome and Germplasm Innovation Research Center, College of Animal Science and Technology, Hunan Agricultural University, Changsha, China
| |
Collapse
|
33
|
Abstract
The rearrangement distance is a well-known problem in the field of comparative genomics. Given two genomes, the rearrangement distance is the minimum number of rearrangements in a set of allowed rearrangements (rearrangement model), which transforms one genome into the other. In rearrangement distance problems, a genome is modeled as a string, where each element represents a conserved region within the two genomes. When the orientation of the genes is known, it is represented by (plus or minus) signs assigned to the elements of the string. Two of the most studied rearrangements are reversals, which invert a segment of the genome, and transpositions, which exchange the relative positions of two adjacent segments of the genome. The first works in genome rearrangements considered that the genomes being compared had the same genetic material and that rearrangement events were restricted to reversals, transpositions, or both. El-Mabrouk extended the reversal model on signed strings to include the operations of insertion and deletion of segments in the genome, which allowed the comparison of genomes with different genetic material. Other studies also addressed this problem and, recently, this problem was proved to be solvable in polynomial time by Willing et al. For unsigned strings, we still observe a lack of results. That said, in this study we prove that computing the rearrangement distance for the following models is NP-Hard: reversals and indels on unsigned strings; transpositions and indels on unsigned strings; and reversals, transpositions, and indels on signed and unsigned strings. Along with the NP-hardness proofs, we present a 2-approximation algorithm for reversals on unsigned strings and 3-approximation algorithms for the other models.
Collapse
Affiliation(s)
| | | | - Ulisses Dias
- School of Technology, University of Campinas, Limeira, Brazil
| | - Zanoni Dias
- Institute of Computing, University of Campinas, Campinas, Brazil
| |
Collapse
|
34
|
Mehmood F, Abdullah, Ubaid Z, Bao Y, Poczai P, Mirza B. Comparative Plastomics of Ashwagandha ( Withania, Solanaceae) and Identification of Mutational Hotspots for Barcoding Medicinal Plants. Plants (Basel) 2020; 9:E752. [PMID: 32549379 PMCID: PMC7355740 DOI: 10.3390/plants9060752] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/09/2020] [Revised: 06/10/2020] [Accepted: 06/12/2020] [Indexed: 01/04/2023]
Abstract
Within the family Solanaceae, Withania is a small genus belonging to the Solanoideae subfamily. Here, we report the de novo assembled chloroplast genome sequences of W. coagulans, W. adpressa, and W. riebeckii. The length of these genomes ranged from 154,162 to 154,364 base pairs (bp). These genomes contained a pair of inverted repeats (IRa and IRb) ranging from 25,029 to 25,071 bp that were separated by a large single-copy (LSC) region of 85,635-85,765 bp and a small single-copy (SSC) region of 18,457-18,469 bp. We analyzed the structural organization, gene content and order, guanine-cytosine content, codon usage, RNA-editing sites, microsatellites, oligonucleotide and tandem repeats, and substitutions of Withania plastomes, which revealed high similarities among the species. Comparative analysis among the Withania species also highlighted 10 divergent hotspots that could potentially be used for molecular marker development, phylogenetic analysis, and species identification. Furthermore, our analyses showed that even three mutational hotspots (rps4-trnT, trnM-atpE, and rps15) were sufficient to discriminate the Withania species included in current study.
Collapse
Affiliation(s)
- Furrukh Mehmood
- Department of Biochemistry, Quaid-i-Azam University, Islamabad 45320, Pakistan; (F.M.); (A.); (Z.U.)
- Botany Unit, Finnish Museum of Natural History, University of Helsinki, P.O. Box 7, FI-00014 Helsinki, Finland
| | - Abdullah
- Department of Biochemistry, Quaid-i-Azam University, Islamabad 45320, Pakistan; (F.M.); (A.); (Z.U.)
| | - Zartasha Ubaid
- Department of Biochemistry, Quaid-i-Azam University, Islamabad 45320, Pakistan; (F.M.); (A.); (Z.U.)
| | - Yiming Bao
- National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, and China National Center for Bioinformation, Beijing 100101, China;
- School of Future Technology, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Peter Poczai
- Botany Unit, Finnish Museum of Natural History, University of Helsinki, P.O. Box 7, FI-00014 Helsinki, Finland
| | - Bushra Mirza
- Department of Biochemistry, Quaid-i-Azam University, Islamabad 45320, Pakistan; (F.M.); (A.); (Z.U.)
- Vice Chancellor of Lahore College for Women University, Lahore 54000, Pakistan
| |
Collapse
|
35
|
Yang X, Wang N, Cao X, Bie P, Xing Z, Yin S, Jiang H, Wu Q. First isolation and characterization of Brucella suis from yak. Genome 2020; 63:397-405. [PMID: 32384250 DOI: 10.1139/gen-2019-0101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Brucella spp., facultative intracellular pathogens that can persistently colonize animal host cells and cause zoonosis, affect public health and safety. A Brucella strain was isolated from yak in Qinghai Province. To detect whether this isolate could cause an outbreak of brucellosis and to reveal its genetic characteristics, several typing and whole-genome sequencing methods were applied to identify its species and genetic characteristics. Phylogenetic analysis based on MLVA and whole-genome sequencing revealed the genetic characteristics of the isolated strain. The results showed that the isolated strain is a B. suis biovar 1 smooth strain, and this isolate was named B. suis QH05. The results of comparative genomics and MLVA showed that B. suis QH05 is not a vaccine strain. Comparison with other B. suis strains isolated from humans and animals indicated that B. suis QH05 may be linked to specific animal and human sources. In conclusion, B. suis QH05 does not belong to the Brucella epidemic species in China, and as the first isolation of B. suis from yak, this strain expands the host range of B. suis.
Collapse
Affiliation(s)
- Xiaowen Yang
- State Key Laboratory for Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing 102206, China.,Key Laboratory of Animal Epidemiology and Zoonosis of the Ministry of Agriculture, College of Veterinary Medicine, China Agricultural University, Beijing 100193, China
| | - Ning Wang
- Key Laboratory of Animal Epidemiology and Zoonosis of the Ministry of Agriculture, College of Veterinary Medicine, China Agricultural University, Beijing 100193, China
| | - Xiaofang Cao
- Key Laboratory of Animal Epidemiology and Zoonosis of the Ministry of Agriculture, College of Veterinary Medicine, China Agricultural University, Beijing 100193, China
| | - Pengfei Bie
- Key Laboratory of Animal Epidemiology and Zoonosis of the Ministry of Agriculture, College of Veterinary Medicine, China Agricultural University, Beijing 100193, China
| | - Zhifeng Xing
- Heilongjiang Provincial Center for Disease Control and Prevention, Haerbin 150030, China
| | - Shihui Yin
- Heilongjiang Provincial Center for Disease Control and Prevention, Haerbin 150030, China
| | - Hai Jiang
- State Key Laboratory for Infectious Disease Prevention and Control, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing 102206, China
| | - Qingmin Wu
- Key Laboratory of Animal Epidemiology and Zoonosis of the Ministry of Agriculture, College of Veterinary Medicine, China Agricultural University, Beijing 100193, China
| |
Collapse
|
36
|
Mahmoud M, Gracz-Bernaciak J, Żywicki M, Karłowski W, Twardowski T, Tyczewska A. Identification of Structural Variants in Two Novel Genomes of Maize Inbred Lines Possibly Related to Glyphosate Tolerance. Plants (Basel) 2020; 9:E523. [PMID: 32325671 DOI: 10.3390/plants9040523] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Revised: 03/29/2020] [Accepted: 04/14/2020] [Indexed: 12/30/2022]
Abstract
To study genetic variations between genomes of plants that are naturally tolerant and sensitive to glyphosate, we used two Zea mays L. lines traditionally bred in Poland. To overcome the complexity of the maize genome, two sequencing technologies were employed: Illumina and Single Molecule Real-Time (SMRT) PacBio. Eleven thousand structural variants, 4 million SNPs and approximately 800 thousand indels differentiating the two genomes were identified. Detailed analyses allowed to identify 20 variations within the EPSPS gene, but all of them were predicted to have moderate or unknown effects on gene expression. Other genes of the shikimate pathway encoding bifunctional 3-dehydroquinate dehydratase/shikimate dehydrogenase and chorismate synthase were altered by variants predicted to have a high impact on gene expression. Additionally, high-impact variants located within the genes involved in the active transport of glyphosate through the cell membrane encoding phosphate transporters as well as multidrug and toxic compound extrusion have been identified.
Collapse
|
37
|
Sato M, Miyagasako R, Takabayashi S, Ohtsuka M, Hatada I, Horii T. Sequential i-GONAD: An Improved In Vivo Technique for CRISPR/Cas9-Based Genetic Manipulations in Mice. Cells 2020; 9:E546. [PMID: 32110989 DOI: 10.3390/cells9030546] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2020] [Revised: 02/22/2020] [Accepted: 02/25/2020] [Indexed: 12/25/2022] Open
Abstract
Improved genome-editing via oviductal nucleic acid delivery (i-GONAD) is a technique capable of inducing genomic changes in preimplantation embryos (zygotes) present within the oviduct of a pregnant female. i-GONAD involves intraoviductal injection of a solution containing genome-editing components via a glass micropipette under a dissecting microscope, followed by in vivo electroporation using tweezer-type electrodes. i-GONAD does not involve ex vivo handling of embryos (isolation of zygotes, microinjection or electroporation of zygotes, and egg transfer of the treated embryos to the oviducts of a recipient female), which is required for in vitro genome-editing of zygotes. i-GONAD enables the generation of indels, knock-in (KI) of ~ 1 kb sequence of interest, and large deletion at a target locus. i-GONAD is usually performed on Day 0.7 of pregnancy, which corresponds to the late zygote stage. During the initial development of this technique, we performed i-GONAD on Days 1.4–1.5 (corresponding to the 2-cell stage). Theoretically, this means that at least two GONAD steps (on Day 0.7 and Day 1.4–1.5) must be performed. If this is practically demonstrated, it provides additional options for various clustered regularly interspaced palindrome repeats (CRISPR)/Caspase 9 (Cas9)-based genetic manipulations. For example, it is usually difficult to induce two independent indels at the target sites, which are located very close to each other, by simultaneous transfection of two guide RNAs and Cas9 protein. However, the sequential induction of indels at a target site may be possible when repeated i-GONAD is performed on different days. Furthermore, simultaneous introduction of two mutated lox sites (to which Cre recombinase bind) for making a floxed allele is reported to be difficult, as it often causes deletion of a sequence between the two gRNA target sites. However, differential KI of lox sites may be possible when repeated i-GONAD is performed on different days. In this study, we performed proof-of-principle experiments to demonstrate the feasibility of the proposed approach called “sequential i-GONAD (si-GONAD).”
Collapse
|
38
|
Guan Q, Almutairi TS, Alhalouli T, Pain A, Alasmari F. Metagenomics of Imported Multidrug-Resistant Mycobacterium leprae, Saudi Arabia, 2017. Emerg Infect Dis 2020; 26:615-617. [PMID: 32091380 PMCID: PMC7045828 DOI: 10.3201/eid2603.190661] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Using shotgun metagenomics, we identified an imported case of multidrug-resistant Mycobacterium leprae in a Filipino resident of Saudi Arabia in 2017. We determined the phylogenomic lineage (3K1) and identified mutations in rpoB and rrs corresponding to the multidrug-resistance phenotype clinically observed. Metagenomics sequencing can be used to identify multidrug-resistant M. leprae.
Collapse
|
39
|
Li DM, Zhu GF, Xu YC, Ye YJ, Liu JM. Complete Chloroplast Genomes of Three Medicinal Alpinia Species: Genome Organization, Comparative Analyses and Phylogenetic Relationships in Family Zingiberaceae. Plants (Basel) 2020; 9:E286. [PMID: 32102387 PMCID: PMC7076362 DOI: 10.3390/plants9020286] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/06/2020] [Revised: 02/18/2020] [Accepted: 02/20/2020] [Indexed: 12/17/2022]
Abstract
Alpinia katsumadai (A. katsumadai), Alpinia oxyphylla (A. oxyphylla) and Alpinia pumila (A. pumila), which belong to the family Zingiberaceae, exhibit multiple medicinal properties. The chloroplast genome of a non-model plant provides valuable information for species identification and phylogenetic analysis. Here, we sequenced three complete chloroplast genomes of A. katsumadai, A. oxyphylla sampled from Guangdong and A. pumila, and analyzed the published chloroplast genomes of Alpinia zerumbet (A. zerumbet) and A. oxyphylla sampled from Hainan to retrieve useful chloroplast molecular resources for Alpinia. The five Alpinia chloroplast genomes possessed typical quadripartite structures comprising of a large single copy (LSC, 87,248-87,667 bp), a small single copy (SSC, 15,306-18,295 bp) and a pair of inverted repeats (IR, 26,917-29,707 bp). They had similar gene contents, gene orders and GC contents, but were slightly different in the numbers of small sequence repeats (SSRs) and long repeats. Interestingly, fifteen highly divergent regions (rpl36, ycf1, rps15, rpl22, infA, psbT-psbN, accD-psaI, petD-rpoA, psaC-ndhE, ccsA-ndhD, ndhF-rpl32, rps11-rpl36, infA-rps8, psbC-psbZ, and rpl32-ccsA), which could be suitable for species identification and phylogenetic studies, were detected in the Alpinia chloroplast genomes. Comparative analyses among the five chloroplast genomes indicated that 1891 mutational events, including 304 single nucleotide polymorphisms (SNPs) and 118 insertion/deletions (indels) between A. pumila and A. katsumadai, 367 SNPs and 122 indels between A. pumila and A. oxyphylla sampled from Guangdong, 331 SNPs and 115 indels between A. pumila and A. zerumbet, 371 SNPs and 120 indels between A. pumila and A. oxyphylla sampled from Hainan, and 20 SNPs and 23 indels between the two accessions of A. oxyphylla, were accurately located. Additionally, phylogenetic relationships based on SNP matrix among 28 whole chloroplast genomes showed that Alpinia was a sister branch to Amomum in the family Zingiberaceae, and that the five Alpinia accessions were divided into three groups, one including A. pumila, another including A. zerumbet and A. katsumadai, and the other including two accessions of A. oxyphylla. In conclusion, the complete chloroplast genomes of the three medicinal Alpinia species in this study provided valuable genomic resources for further phylogeny and species identification in the family Zingiberaceae.
Collapse
Affiliation(s)
- Dong-Mei Li
- Guangdong Key Lab of Ornamental Plant Germplasm Innovation and Utilization, Environmental Horticulture Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou 510640, China; (Y.-C.X.); (Y.-J.Y.)
| | - Gen-Fa Zhu
- Guangdong Key Lab of Ornamental Plant Germplasm Innovation and Utilization, Environmental Horticulture Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou 510640, China; (Y.-C.X.); (Y.-J.Y.)
| | | | | | | |
Collapse
|
40
|
Palmer DJ, Turner DL, Ng P. A Single "All-in-One" Helper-Dependent Adenovirus to Deliver Donor DNA and CRISPR/Cas9 for Efficient Homology-Directed Repair. Mol Ther Methods Clin Dev 2020; 17:441-447. [PMID: 32154329 PMCID: PMC7058846 DOI: 10.1016/j.omtm.2020.01.014] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/04/2019] [Accepted: 01/28/2020] [Indexed: 11/19/2022]
Abstract
In this study, we developed a single helper-dependent adenovirus (HDAd) to deliver all of the components (donor DNA, CRISPR-associated protein 9 [Cas9], and guide RNA [gRNA]) needed to achieve high-efficiency gene targeting and homology-directed repair in transduced cells. We show that these "all-in-one" HDAds are up to 117-fold more efficient at gene targeting than donor HDAds that do not express CRISPR/Cas9 in human induced pluripotent stem cells (iPSCs). The vast majority (>90%) of targeted recombinants had only one allele targeted, and this was accompanied by high-frequency indel formation in the non-targeted allele at the site of Cas9 cleavage. These indels varied in size and nature, and included large deletions of ∼8 kb. The remaining minority of recombinants had both alleles targeted (so-called bi-allelic targeting). These all-in-one HDAds represent an important platform for accomplishing and expanding the utility of homology-directed repair, especially for difficult-to-transfect cells and for in vivo applications.
Collapse
Affiliation(s)
- Donna J. Palmer
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA
| | - Dustin L. Turner
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA
| | - Philip Ng
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA
- Corresponding author: Philip Ng, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA.
| |
Collapse
|
41
|
Seiden AH, Richter F, Patel N, Rodriguez OL, Deikus G, Shah H, Smith M, Roberts A, King EC, Sebra RP, Sharp AJ, Gelb BD. Elucidation of de novo small insertion/deletion biology with parent-of-origin phasing. Hum Mutat 2020; 41:800-806. [PMID: 31898844 DOI: 10.1002/humu.23971] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2019] [Revised: 11/24/2019] [Accepted: 12/24/2019] [Indexed: 12/30/2022]
Abstract
The mechanisms underlying de novo insertion/deletion (indel) genesis, such as polymerase slippage, have been hypothesized but not well characterized in the human genome. We implemented two methodological improvements, which were leveraged to dissect indel mutagenesis. We assigned de novo variants to parent-of-origin (i.e., phasing) with low-coverage long-read whole-genome sequencing, achieving better phasing compared to short-read sequencing (medians of 84% and 23%, respectively). We then wrote an application programming interface to classify indels into three subtypes according to sequence context. Across three cohorts with different phasing methods (Ntrios = 540, all cohorts), we observed that one de novo indel subtype, change in copy count (CCC), was significantly correlated with father's (p = 7.1 × 10-4 ) but not mother's (p = .45) age at conception. We replicated this effect in three cohorts without de novo phasing (ppaternal = 1.9 × 10-9 , pmaternal = .61; Ntrios = 3,391, all cohorts). Although this is consistent with polymerase slippage during spermatogenesis, the percentage of variance explained by paternal age was low, and we did not observe an association with replication timing. These results suggest that spermatogenesis-specific events have a minor role in CCC indel mutagenesis, one not observed for other indel subtypes nor for maternal age in general. These results have implications for indel modeling in evolution and disease.
Collapse
Affiliation(s)
- Allison H Seiden
- Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Felix Richter
- Graduate School of Biomedical Sciences, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Nihir Patel
- Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Oscar L Rodriguez
- Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, New York.,Graduate School of Biomedical Sciences, Icahn School of Medicine at Mount Sinai, New York, New York.,Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Gintaras Deikus
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York.,Icahn Institute for Data Science and Genomics Technology, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Hardik Shah
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York.,Icahn Institute for Data Science and Genomics Technology, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Melissa Smith
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York.,Icahn Institute for Data Science and Genomics Technology, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Amy Roberts
- Division of Genetics, Department of Pediatrics and Department of Cardiology, Boston Children's Hospital, Boston, Massachusetts
| | - Eileen C King
- Division of Biostatistics and Epidemiology, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
| | - Robert P Sebra
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York.,Icahn Institute for Data Science and Genomics Technology, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Andrew J Sharp
- Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, New York.,Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Bruce D Gelb
- Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, New York.,Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York.,Department of Pediatrics, Icahn School of Medicine at Mount Sinai, New York, New York
| |
Collapse
|
42
|
Alioto T, Alexiou KG, Bardil A, Barteri F, Castanera R, Cruz F, Dhingra A, Duval H, Fernández i Martí Á, Frias L, Galán B, García JL, Howad W, Gómez‐Garrido J, Gut M, Julca I, Morata J, Puigdomènech P, Ribeca P, Rubio Cabetas MJ, Vlasova A, Wirthensohn M, Garcia‐Mas J, Gabaldón T, Casacuberta JM, Arús P. Transposons played a major role in the diversification between the closely related almond and peach genomes: results from the almond genome sequence. Plant J 2020; 101:455-472. [PMID: 31529539 PMCID: PMC7004133 DOI: 10.1111/tpj.14538] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Revised: 08/29/2019] [Accepted: 09/02/2019] [Indexed: 05/19/2023]
Abstract
We sequenced the genome of the highly heterozygous almond Prunus dulcis cv. Texas combining short- and long-read sequencing. We obtained a genome assembly totaling 227.6 Mb of the estimated almond genome size of 238 Mb, of which 91% is anchored to eight pseudomolecules corresponding to its haploid chromosome complement, and annotated 27 969 protein-coding genes and 6747 non-coding transcripts. By phylogenomic comparison with the genomes of 16 additional close and distant species we estimated that almond and peach (Prunus persica) diverged around 5.88 million years ago. These two genomes are highly syntenic and show a high degree of sequence conservation (20 nucleotide substitutions per kb). However, they also exhibit a high number of presence/absence variants, many attributable to the movement of transposable elements (TEs). Transposable elements have generated an important number of presence/absence variants between almond and peach, and we show that the recent history of TE movement seems markedly different between them. Transposable elements may also be at the origin of important phenotypic differences between both species, and in particular for the sweet kernel phenotype, a key agronomic and domestication character for almond. Here we show that in sweet almond cultivars, highly methylated TE insertions surround a gene involved in the biosynthesis of amygdalin, whose reduced expression has been correlated with the sweet almond phenotype. Altogether, our results suggest a key role of TEs in the recent history and diversification of almond and its close relative peach.
Collapse
Affiliation(s)
- Tyler Alioto
- CNAG‐CRG, Centre for Genomic Regulation (CRG)Barcelona Institute of Science and Technology (BIST)Baldiri i Reixac 408028BarcelonaSpain
- Universitat Pompeu Fabra (UPF)08005BarcelonaSpain
| | - Konstantinos G. Alexiou
- IRTA, Campus UABEdifici CRAGCerdanyola del Vallès (Bellaterra)08193BarcelonaSpain
- Centre for Research in Agricultural Genomics (CRAG)CSIC‐IRTA‐UAB‐UB, Campus UABEdifici CRAGCerdanyola del Vallès (Bellaterra)08193BarcelonaSpain
| | - Amélie Bardil
- Centre for Research in Agricultural Genomics (CRAG)CSIC‐IRTA‐UAB‐UB, Campus UABEdifici CRAGCerdanyola del Vallès (Bellaterra)08193BarcelonaSpain
| | - Fabio Barteri
- Centre for Research in Agricultural Genomics (CRAG)CSIC‐IRTA‐UAB‐UB, Campus UABEdifici CRAGCerdanyola del Vallès (Bellaterra)08193BarcelonaSpain
| | - Raúl Castanera
- Centre for Research in Agricultural Genomics (CRAG)CSIC‐IRTA‐UAB‐UB, Campus UABEdifici CRAGCerdanyola del Vallès (Bellaterra)08193BarcelonaSpain
| | - Fernando Cruz
- CNAG‐CRG, Centre for Genomic Regulation (CRG)Barcelona Institute of Science and Technology (BIST)Baldiri i Reixac 408028BarcelonaSpain
- Universitat Pompeu Fabra (UPF)08005BarcelonaSpain
| | - Amit Dhingra
- Department of HorticultureWashington State University99164-6414PullmanWAUSA
| | - Henri Duval
- INRA, UR1052Unité de Génétique et Amélioration des Fruits et Légumes (GAFL)Domaine St. Maurice CS 6009484143Montfavet CedexFrance
| | - Ángel Fernández i Martí
- Department of Environmental Science Policy and ManagementUniversity of CaliforniaBerkeley94720CAUSA
- Innovative Genomics Institute (IGI)94720BerkeleyCAUSA
| | - Leonor Frias
- CNAG‐CRG, Centre for Genomic Regulation (CRG)Barcelona Institute of Science and Technology (BIST)Baldiri i Reixac 408028BarcelonaSpain
- Universitat Pompeu Fabra (UPF)08005BarcelonaSpain
| | - Beatriz Galán
- Department of Environmental BiologyCenter for Biological Research (CIB‐CSIC)Spanish National Research Council (CSIC)Ramiro de Maeztu 928040MadridSpain
| | - José L. García
- Department of Environmental BiologyCenter for Biological Research (CIB‐CSIC)Spanish National Research Council (CSIC)Ramiro de Maeztu 928040MadridSpain
| | - Werner Howad
- IRTA, Campus UABEdifici CRAGCerdanyola del Vallès (Bellaterra)08193BarcelonaSpain
- Centre for Research in Agricultural Genomics (CRAG)CSIC‐IRTA‐UAB‐UB, Campus UABEdifici CRAGCerdanyola del Vallès (Bellaterra)08193BarcelonaSpain
| | - Jèssica Gómez‐Garrido
- CNAG‐CRG, Centre for Genomic Regulation (CRG)Barcelona Institute of Science and Technology (BIST)Baldiri i Reixac 408028BarcelonaSpain
- Universitat Pompeu Fabra (UPF)08005BarcelonaSpain
| | - Marta Gut
- CNAG‐CRG, Centre for Genomic Regulation (CRG)Barcelona Institute of Science and Technology (BIST)Baldiri i Reixac 408028BarcelonaSpain
- Universitat Pompeu Fabra (UPF)08005BarcelonaSpain
| | - Irene Julca
- Universitat Pompeu Fabra (UPF)08005BarcelonaSpain
- Bioinformatics and Genomics ProgrammeCentre for Genomic Regulation (CRG)Dr Aiguader, 8808003BarcelonaSpain
| | - Jordi Morata
- Centre for Research in Agricultural Genomics (CRAG)CSIC‐IRTA‐UAB‐UB, Campus UABEdifici CRAGCerdanyola del Vallès (Bellaterra)08193BarcelonaSpain
| | - Pere Puigdomènech
- Centre for Research in Agricultural Genomics (CRAG)CSIC‐IRTA‐UAB‐UB, Campus UABEdifici CRAGCerdanyola del Vallès (Bellaterra)08193BarcelonaSpain
| | - Paolo Ribeca
- CNAG‐CRG, Centre for Genomic Regulation (CRG)Barcelona Institute of Science and Technology (BIST)Baldiri i Reixac 408028BarcelonaSpain
- Universitat Pompeu Fabra (UPF)08005BarcelonaSpain
- The Pirbright InstituteWokingSurreyGU24 0NFUK
| | - María J. Rubio Cabetas
- Centro de Investigación y Tecnología Agroalimentaria de Aragón (CITA)Unidad de HortofruticulturaGobierno de Aragón, Avda. Montañana 93050059ZaragozaSpain
- Instituto Agroalimentario de Aragón – IA2 (CITA‐Universidad de Zaragoza)Calle Miguel Servet 17750013ZaragozaSpain
| | - Anna Vlasova
- Bioinformatics and Genomics ProgrammeCentre for Genomic Regulation (CRG)Dr Aiguader, 8808003BarcelonaSpain
| | - Michelle Wirthensohn
- University of AdelaideWaite Research InstituteSchool of Agriculture, Food and WinePMB 1Glen OsmondSA5064Australia
| | - Jordi Garcia‐Mas
- IRTA, Campus UABEdifici CRAGCerdanyola del Vallès (Bellaterra)08193BarcelonaSpain
- Centre for Research in Agricultural Genomics (CRAG)CSIC‐IRTA‐UAB‐UB, Campus UABEdifici CRAGCerdanyola del Vallès (Bellaterra)08193BarcelonaSpain
| | - Toni Gabaldón
- Universitat Pompeu Fabra (UPF)08005BarcelonaSpain
- Bioinformatics and Genomics ProgrammeCentre for Genomic Regulation (CRG)Dr Aiguader, 8808003BarcelonaSpain
- Institució Catalana de Recerca i Estudis Avançats (ICREA)Pg Lluís Companys 2308010BarcelonaSpain
| | - Josep M. Casacuberta
- Centre for Research in Agricultural Genomics (CRAG)CSIC‐IRTA‐UAB‐UB, Campus UABEdifici CRAGCerdanyola del Vallès (Bellaterra)08193BarcelonaSpain
| | - Pere Arús
- IRTA, Campus UABEdifici CRAGCerdanyola del Vallès (Bellaterra)08193BarcelonaSpain
- Centre for Research in Agricultural Genomics (CRAG)CSIC‐IRTA‐UAB‐UB, Campus UABEdifici CRAGCerdanyola del Vallès (Bellaterra)08193BarcelonaSpain
| |
Collapse
|
43
|
Catanach A, Crowhurst R, Deng C, David C, Bernatchez L, Wellenreuther M. The genomic pool of standing structural variation outnumbers single nucleotide polymorphism by threefold in the marine teleost Chrysophrys auratus. Mol Ecol 2019; 28:1210-1223. [PMID: 30770610 DOI: 10.1111/mec.15051] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2018] [Revised: 01/31/2019] [Accepted: 02/01/2019] [Indexed: 12/22/2022]
Abstract
Recent studies have highlighted an important role of structural variation (SV) in ecological and evolutionary processes, but few have studied nonmodel species in the wild. As part of our long-term research programme on the nonmodel teleost fish Australasian snapper (Chrysophrys auratus), we aim to build one of the first catalogues of genomic variants (SNPs and indels, and deletions, duplications and inversions) in fishes and evaluate overlap of genomic variants with regions under putative selection (Tajima's D and π), and coding sequences (genes). For this, we analysed six males and six females from three locations in New Zealand and generated a high-resolution genomic variation catalogue. We characterized 20,385 SVs and found they intersected with almost a third of all annotated genes. Together with small indels, SVs account for three times more variation in the genome in terms of bases affected compared to SNPs. We found that a sizeable portion of detected SVs was in the upper and lower genomic regions of Tajima's D and π, indicating that some of these have an effect on the phenotype. Together, these results shed light on the often neglected genomic variation that is produced by SVs and highlights the need to go beyond the mere measure of SNPs when investigating evolutionary processes, such as species diversification and adaptation.
Collapse
Affiliation(s)
- Andrew Catanach
- The New Zealand Institute for Plant & Food Research Ltd, Lincoln, New Zealand
| | - Ross Crowhurst
- The New Zealand Institute for Plant & Food Research Ltd, Auckland, New Zealand
| | - Cecilia Deng
- The New Zealand Institute for Plant & Food Research Ltd, Auckland, New Zealand
| | - Charles David
- The New Zealand Institute for Plant & Food Research Ltd, Lincoln, New Zealand
| | - Louis Bernatchez
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec City, Quebec, Canada
| | - Maren Wellenreuther
- The New Zealand Institute for Plant & Food Research Ltd, Nelson, New Zealand.,School of Biological Sciences, University of Auckland, Auckland, New Zealand
| |
Collapse
|
44
|
LaCava MEF, Aikens EO, Megna LC, Randolph G, Hubbard C, Buerkle CA. Accuracy of de novo assembly of DNA sequences from double-digest libraries varies substantially among software. Mol Ecol Resour 2019; 20:360-370. [PMID: 31665547 DOI: 10.1111/1755-0998.13108] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2019] [Revised: 10/21/2019] [Accepted: 10/23/2019] [Indexed: 11/29/2022]
Abstract
Advances in DNA sequencing have made it feasible to gather genomic data for non-model organisms and large sets of individuals, often using methods for sequencing subsets of the genome. Several of these methods sequence DNA associated with endonuclease restriction sites (various RAD and GBS methods). For use in taxa without a reference genome, these methods rely on de novo assembly of fragments in the sequencing library. Many of the software options available for this application were originally developed for other assembly types and we do not know their accuracy for reduced representation libraries. To address this important knowledge gap, we simulated data from the Arabidopsis thaliana and Homo sapiens genomes and compared de novo assemblies by six software programs that are commonly used or promising for this purpose (ABySS, CD-HIT, Stacks, Stacks2, Velvet and VSEARCH). We simulated different mutation rates and types of mutations, and then applied the six assemblers to the simulated data sets, varying assembly parameters. We found substantial variation in software performance across simulations and parameter settings. ABySS failed to recover any true genome fragments, and Velvet and VSEARCH performed poorly for most simulations. Stacks and Stacks2 produced accurate assemblies of simulations containing SNPs, but the addition of insertion and deletion mutations decreased their performance. CD-HIT was the only assembler that consistently recovered a high proportion of true genome fragments. Here, we demonstrate the substantial difference in the accuracy of assemblies from different software programs and the importance of comparing assemblies that result from different parameter settings.
Collapse
Affiliation(s)
- Melanie E F LaCava
- Program in Ecology, University of Wyoming, Laramie, WY, USA.,Wildlife Genomics and Disease Ecology Laboratory, Department of Veterinary Sciences, University of Wyoming, Laramie, WY, USA
| | - Ellen O Aikens
- Program in Ecology, University of Wyoming, Laramie, WY, USA.,Wyoming Cooperative Fish and Wildlife Research Unit, Department of Zoology and Physiology, University of Wyoming, Laramie, WY, USA
| | - Libby C Megna
- Program in Ecology, University of Wyoming, Laramie, WY, USA.,Department of Zoology and Physiology, University of Wyoming, Laramie, WY, USA
| | - Gregg Randolph
- Genome Technologies Lab, University of Wyoming, Laramie, WY, USA
| | - Charley Hubbard
- Program in Ecology, University of Wyoming, Laramie, WY, USA.,Department of Botany, University of Wyoming, Laramie, WY, USA
| | - C Alex Buerkle
- Program in Ecology, University of Wyoming, Laramie, WY, USA.,Department of Botany, University of Wyoming, Laramie, WY, USA
| |
Collapse
|
45
|
Román-Rodríguez FJ, Ugalde L, Álvarez L, Díez B, Ramírez MJ, Risueño C, Cortón M, Bogliolo M, Bernal S, March F, Ayuso C, Hanenberg H, Sevilla J, Rodríguez-Perales S, Torres-Ruiz R, Surrallés J, Bueren JA, Río P. NHEJ-Mediated Repair of CRISPR-Cas9-Induced DNA Breaks Efficiently Corrects Mutations in HSPCs from Patients with Fanconi Anemia. Cell Stem Cell 2019; 25:607-621.e7. [PMID: 31543367 DOI: 10.1016/j.stem.2019.08.016] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2018] [Revised: 06/24/2019] [Accepted: 08/26/2019] [Indexed: 12/26/2022]
Abstract
Non-homologous end-joining (NHEJ) is the preferred mechanism used by hematopoietic stem cells (HSCs) to repair double-stranded DNA breaks and is particularly increased in cells deficient in the Fanconi anemia (FA) pathway. Here, we show feasible correction of compromised functional phenotypes in hematopoietic cells from multiple FA complementation groups, including FA-A, FA-C, FA-D1, and FA-D2. NHEJ-mediated repair of targeted CRISPR-Cas9-induced DNA breaks generated compensatory insertions and deletions that restore the coding frame of the mutated gene. NHEJ-mediated editing efficacy was initially verified in FA lymphoblastic cell lines and then in primary FA patient-derived CD34+ cells, which showed marked proliferative advantage and phenotypic correction both in vitro and after transplantation. Importantly, and in contrast to homologous directed repair, NHEJ efficiently targeted primitive human HSCs, indicating that NHEJ editing approaches may constitute a sound alternative for editing self-renewing human HSCs and consequently for treatment of FA and other monogenic diseases affecting the hematopoietic system.
Collapse
Affiliation(s)
- Francisco José Román-Rodríguez
- Division of Hematopoietic Innovative Therapies, Centro de Investigaciones Energéticas Medioambientales y Tecnológicas (CIEMAT), Madrid 28040, Spain; Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Madrid 28040, Spain; Advanced Therapies Unit, Instituto de Investigación Sanitaria Fundación Jiménez Díaz (IIS-FJD/UAM), Madrid 28040, Spain
| | - Laura Ugalde
- Division of Hematopoietic Innovative Therapies, Centro de Investigaciones Energéticas Medioambientales y Tecnológicas (CIEMAT), Madrid 28040, Spain; Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Madrid 28040, Spain; Advanced Therapies Unit, Instituto de Investigación Sanitaria Fundación Jiménez Díaz (IIS-FJD/UAM), Madrid 28040, Spain
| | - Lara Álvarez
- Division of Hematopoietic Innovative Therapies, Centro de Investigaciones Energéticas Medioambientales y Tecnológicas (CIEMAT), Madrid 28040, Spain; Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Madrid 28040, Spain; Advanced Therapies Unit, Instituto de Investigación Sanitaria Fundación Jiménez Díaz (IIS-FJD/UAM), Madrid 28040, Spain
| | - Begoña Díez
- Division of Hematopoietic Innovative Therapies, Centro de Investigaciones Energéticas Medioambientales y Tecnológicas (CIEMAT), Madrid 28040, Spain; Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Madrid 28040, Spain; Advanced Therapies Unit, Instituto de Investigación Sanitaria Fundación Jiménez Díaz (IIS-FJD/UAM), Madrid 28040, Spain
| | - María José Ramírez
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Madrid 28040, Spain; Genome Instability and DNA Repair Syndromes Group, Department of Genetics and Microbiology, Universitat Autònoma de Barcelona (UAB), Barcelona 08193, Spain; Servicio de Genética e Instituto de Investigaciones Biomédicas del Hospital de Sant Pau, Barcelona 08025, Spain
| | - Cristina Risueño
- Division of Hematopoietic Innovative Therapies, Centro de Investigaciones Energéticas Medioambientales y Tecnológicas (CIEMAT), Madrid 28040, Spain; Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Madrid 28040, Spain; Advanced Therapies Unit, Instituto de Investigación Sanitaria Fundación Jiménez Díaz (IIS-FJD/UAM), Madrid 28040, Spain
| | - Marta Cortón
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Madrid 28040, Spain; Department of Genetics, Hospital Universitario Instituto de Investigación Sanitaria Fundación Jiménez Díaz (IIS-FJD, UAM), Madrid 28040, Spain
| | - Massimo Bogliolo
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Madrid 28040, Spain; Genome Instability and DNA Repair Syndromes Group, Department of Genetics and Microbiology, Universitat Autònoma de Barcelona (UAB), Barcelona 08193, Spain; Servicio de Genética e Instituto de Investigaciones Biomédicas del Hospital de Sant Pau, Barcelona 08025, Spain
| | - Sara Bernal
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Madrid 28040, Spain; Servicio de Genética e Instituto de Investigaciones Biomédicas del Hospital de Sant Pau, Barcelona 08025, Spain
| | - Francesca March
- Servicio de Genética e Instituto de Investigaciones Biomédicas del Hospital de Sant Pau, Barcelona 08025, Spain
| | - Carmen Ayuso
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Madrid 28040, Spain; Department of Genetics, Hospital Universitario Instituto de Investigación Sanitaria Fundación Jiménez Díaz (IIS-FJD, UAM), Madrid 28040, Spain
| | - Helmut Hanenberg
- Department of Otorhinolaryngology and Head/Neck Surgery, Heinrich Heine University, Düsseldorf 40225, Germany; Department of Pediatrics III, University Children's Hospital Essen, University of Duisburg-Essen, Essen 45122, Germany
| | | | - Sandra Rodríguez-Perales
- Molecular Cytogenetics Group, Human Cancer Genetics Program, Centro Nacional de Investigaciones Oncológicas (CNIO), Madrid 28029, Spain
| | - Raúl Torres-Ruiz
- Molecular Cytogenetics Group, Human Cancer Genetics Program, Centro Nacional de Investigaciones Oncológicas (CNIO), Madrid 28029, Spain; Josep Carreras Leukemia Research Institute and Department of Biomedicine, School of Medicine, University of Barcelona, Barcelona 08036, Spain
| | - Jordi Surrallés
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Madrid 28040, Spain; Genome Instability and DNA Repair Syndromes Group, Department of Genetics and Microbiology, Universitat Autònoma de Barcelona (UAB), Barcelona 08193, Spain; Servicio de Genética e Instituto de Investigaciones Biomédicas del Hospital de Sant Pau, Barcelona 08025, Spain
| | - Juan Antonio Bueren
- Division of Hematopoietic Innovative Therapies, Centro de Investigaciones Energéticas Medioambientales y Tecnológicas (CIEMAT), Madrid 28040, Spain; Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Madrid 28040, Spain; Advanced Therapies Unit, Instituto de Investigación Sanitaria Fundación Jiménez Díaz (IIS-FJD/UAM), Madrid 28040, Spain
| | - Paula Río
- Division of Hematopoietic Innovative Therapies, Centro de Investigaciones Energéticas Medioambientales y Tecnológicas (CIEMAT), Madrid 28040, Spain; Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Madrid 28040, Spain; Advanced Therapies Unit, Instituto de Investigación Sanitaria Fundación Jiménez Díaz (IIS-FJD/UAM), Madrid 28040, Spain.
| |
Collapse
|
46
|
Nakamura S, Ishihara M, Ando N, Watanabe S, Sakurai T, Sato M. Transplacental delivery of genome editing components causes mutations in embryonic cardiomyocytes of mid-gestational murine fetuses. IUBMB Life 2019; 71:835-844. [PMID: 30635953 DOI: 10.1002/iub.2004] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Revised: 12/11/2018] [Accepted: 12/17/2018] [Indexed: 12/19/2022]
Abstract
Genome editing, as exemplified by CRISPR/Cas9, is now recognized as a powerful tool for the engineering of endogenous target genes. It employs only two components, namely, Cas9 in the form of DNA, mRNA, or protein; and guide RNA (gRNA), which is specific to a target gene. When these components are transferred to cells, they create insertion/deletion mutations (indels) within a target gene. Therefore, when fetuses within the uteri of pregnant murine females are exposed to these reagents, fetal cells incorporating them should show mutations in the target gene. To examine a possible genome editing of fetal cells in vivo, we intravenously administered a solution containing plasmid DNA-FuGENE complex to pregnant wild-type female mice [which had been successfully mated with enhanced green fluorescent protein (EGFP)-expressing male transgenic mice] on day 12.5 of gestation. The plasmid DNA induces the expression of gRNA, which was targeted at the EGFP cDNA, and that of the Cas9 gene. All fetuses in the pregnant females should express EGFP systemically, since they are heterozygous (Tg/+) for the transgene. Thus, the delivery of CRISPR system targeted at EGFP in the fetuses will cause a reduced expression of EGFP as a result of the genome editing of EGFP genomic sequence. Of the 24 fetuses isolated from three pregnant females 2 days after gene delivery, 3 were found to have reduced fluorescence in their hearts. Genotyping of the dissected hearts revealed the presence of the transgene construct (Cas9 gene) in all the samples. Furthermore, all the three samples exhibited mutations at the target loci, although normal cells were also present. Thus, transplacental delivery of gene editing components may be a useful tool for developing animal models with heart disorder for heart-related disease research, and gene therapy in congenital heart defects such as hypertrophic cardiomyopathy (HCM). © 2019 IUBMB Life, 9999(9999):1-10, 2019.
Collapse
Affiliation(s)
- Shingo Nakamura
- Division of Biomedical Engineering, National Defense Medical College Research Institute, Tokorozawa, Saitama, 359-8513, Japan
| | - Masayuki Ishihara
- Division of Biomedical Engineering, National Defense Medical College Research Institute, Tokorozawa, Saitama, 359-8513, Japan
| | - Naoko Ando
- Division of Biomedical Engineering, National Defense Medical College Research Institute, Tokorozawa, Saitama, 359-8513, Japan
| | - Satoshi Watanabe
- Animal Genome Unit, Institute of Livestock and Grassland Science, National Agriculture and Food Research Organization (NARO), Tsukuba, Ibaraki, 305-0901, Japan
| | - Takayuki Sakurai
- Department of Cardiovascular Research, School of Medicine, Shinshu University, Matsumoto, Nagano, 390-8621, Japan
| | - Masahiro Sato
- Section of Gene Expression Regulation, Frontier Science Research Center, Kagoshima University, Kagoshima, Kagoshima, 890-8544, Japan
| |
Collapse
|
47
|
Avdeyev P, Jiang S, Alekseyev MA. Linearization of Median Genomes Under the Double-Cut-and-Join-Indel Model. Evol Bioinform Online 2019; 15:1176934318820534. [PMID: 31217687 PMCID: PMC6557028 DOI: 10.1177/1176934318820534] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Accepted: 11/27/2018] [Indexed: 11/17/2022] Open
Abstract
Reconstruction of the median genome consisting of linear chromosomes from three given genomes is known to be intractable. There exist efficient methods for solving a relaxed version of this problem, where the median genome is allowed to have circular chromosomes. We propose a method for construction of an approximate solution to the original problem from a solution to the relaxed problem and prove a bound on its approximation error. Our method also provides insights into the combinatorial structure of genome transformations with respect to appearance of circular chromosomes.
Collapse
Affiliation(s)
- Pavel Avdeyev
- Computational Biology Institute, The George Washington University, Washington, DC, USA
| | - Shuai Jiang
- Department of Computer Science and Engineering, University of South Carolina, Columbia, SC, USA
| | - Max A Alekseyev
- Computational Biology Institute, The George Washington University, Washington, DC, USA
| |
Collapse
|
48
|
Brew-Appiah RAT, Peracchi LM, Sanguinet KA. Never the Two Shall Mix: Robust Indel Markers to Ensure the Fidelity of Two Pivotal and Closely-Related Accessions of Brachypodium distachyon. Plants (Basel) 2019; 8:plants8060153. [PMID: 31174296 PMCID: PMC6630600 DOI: 10.3390/plants8060153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/20/2019] [Revised: 05/30/2019] [Accepted: 06/05/2019] [Indexed: 11/25/2022]
Abstract
Brachypodium distachyon is an established model for monocotyledonous plants. Numerous markers intended for gene discovery and population genetics have been designed. However to date, very few indel markers with larger and easily scored length polymorphism differences, that distinguish between the two morphologically similar and highly utilized B. distachyon accessions, Bd21, the reference genome accession, and Bd21-3, the transformation-optimal accession, are publically available. In this study, 22 indel markers were designed and utilized to produce length polymorphism differences of 150 bp or more, for easy discrimination between Bd21 and Bd21-3. When tested on four other B. distachyon accessions, one case of multiallelism was observed. It was also shown that the markers could be used to determine homozygosity and heterozygosity at specific loci in a Bd21 x Bd3-1 F2 population. The work done in this study allows researchers to maintain the fidelity of Bd21 and Bd21-3 stocks for both transgenic and nontransgenic studies. It also provides markers that can be utilized in conjunction with others already available for further research on population genetics, gene discovery and gene characterization, all of which are necessary for the relevance of B. distachyon as a model species.
Collapse
Affiliation(s)
- Rhoda A T Brew-Appiah
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA 99164-6420, USA.
| | - Luigi M Peracchi
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA 99164-6420, USA.
| | - Karen A Sanguinet
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA 99164-6420, USA.
| |
Collapse
|
49
|
Wang H, Li F, Liu N, Liu X, Yang X, Guo Y, Bei J, Zeng Y, Shao J. Prognostic implications of a molecular classifier derived from whole-exome sequencing in nasopharyngeal carcinoma. Cancer Med 2019; 8:2705-2716. [PMID: 30950204 PMCID: PMC6558473 DOI: 10.1002/cam4.2146] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2018] [Revised: 03/04/2019] [Accepted: 03/07/2019] [Indexed: 12/30/2022] Open
Abstract
The aim of this study was to use whole-exome sequencing to derive a molecular classifier for nasopharyngeal carcinoma (NPC) and evaluate its clinical performance. We performed whole-exome sequencing on 82 primary NPC tumors from Sun Yat-sen University Cancer Center (Guangzhou cohort) to obtain somatic single-nucleotide variants, indels, and copy number variants. A novel molecular classifier was then developed and validated in another NPC cohort (Hong Kong cohort, n = 99). Survival analysis was estimated by the Kaplan-Meier method and compared using the log-rank test. Cox proportional hazards model was adopted for univariate and multivariate analyses. We identified three prominent NPC genetic subtypes: RAS/PI3K/AKT (based on RAS, AKT1, and PIK3CA mutations), cell-cycle (based on CDKN2A/CDKN2B deletions, and CDKN1B and CCND1 amplifications), and unclassified (based on dominant mutations in epigenetic regulators, such as KMT2C/2D, or the Notch signaling pathway, such as NOTCH1/2). These subtypes differed in survival analysis, with good, intermediate, and poor progression-free survival in the unclassified, cell-cycle, and RAS/PI3K/AKT subgroups, respectively, among the Guangzhou, Hong Kong, and combined cohorts (n = 82, P = 0.0342; n = 99, P = 0.0372; and n = 181, P = 0.0023; log-rank test). We have uncovered genetic subtypes of NPC with distinct mutations and/or copy number changes, reflecting discrete paths of NPC tumorigenesis and providing a roadmap for developing new prognostic biomarkers and targeted therapies.
Collapse
Affiliation(s)
- Hai‐Yun Wang
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangdong Key Laboratory of Nasopharyngeal Carcinoma Diagnosis and TherapySun Yat‐sen University Cancer CenterGuangzhouP. R. China
- Department of Molecular DiagnosticsSun Yat‐sen University Cancer CenterGuangzhouP. R. China
| | - Fugen Li
- Research and Development Institute of Precision Medicine3D Medicine Inc.ShanghaiP. R. China
| | - Na Liu
- BGI Genomics, BGI‐ShenzhenShenzhenP. R. China
| | - Xiao‐Yun Liu
- Department of Molecular DiagnosticsSun Yat‐sen University Cancer CenterGuangzhouP. R. China
| | - Xin‐Hua Yang
- Department of Molecular DiagnosticsSun Yat‐sen University Cancer CenterGuangzhouP. R. China
| | - Yun‐Miao Guo
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangdong Key Laboratory of Nasopharyngeal Carcinoma Diagnosis and TherapySun Yat‐sen University Cancer CenterGuangzhouP. R. China
- Department of Experiment ResearchSun Yat‐sen University Cancer CenterGuangzhouP. R. China
| | - Jin‐Xin Bei
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangdong Key Laboratory of Nasopharyngeal Carcinoma Diagnosis and TherapySun Yat‐sen University Cancer CenterGuangzhouP. R. China
- Department of Experiment ResearchSun Yat‐sen University Cancer CenterGuangzhouP. R. China
| | - Yi‐Xin Zeng
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangdong Key Laboratory of Nasopharyngeal Carcinoma Diagnosis and TherapySun Yat‐sen University Cancer CenterGuangzhouP. R. China
- Department of Experiment ResearchSun Yat‐sen University Cancer CenterGuangzhouP. R. China
| | - Jian‐Yong Shao
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangdong Key Laboratory of Nasopharyngeal Carcinoma Diagnosis and TherapySun Yat‐sen University Cancer CenterGuangzhouP. R. China
- Department of Molecular DiagnosticsSun Yat‐sen University Cancer CenterGuangzhouP. R. China
- School of Laboratory MedicineWannan Medical CollegeWuhu, Anhui ProvinceP. R. China
| |
Collapse
|
50
|
Palmer DJ, Turner DL, Ng P. Production of CRISPR/Cas9-Mediated Self-Cleaving Helper-Dependent Adenoviruses. Mol Ther Methods Clin Dev 2019; 13:432-439. [PMID: 31080846 PMCID: PMC6506437 DOI: 10.1016/j.omtm.2019.04.003] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/30/2019] [Accepted: 04/08/2019] [Indexed: 12/21/2022]
Abstract
Prolonged expression of CRISPR/Cas9 raises concerns about off-target cleavage, cytotoxicity, and immune responses. To address these issues, we have developed a system to produce helper-dependent adenoviruses that express CRISPR/Cas9 to direct cleavage of the vectors’ own genome after transduction of target cells. To prevent self-cleavage during vector production, it was necessary to downregulate Cas9 mRNA as well as inhibit Cas9 protein activity. Cas9 mRNA downregulation was achieved by inserting the target sequences for the helper-virus-encoded miRNA, mivaRNAI, and producer-cell-encoded miRNAs, hsa-miR183-5p, and hsa-miR218-5p, into the 3′ UTR of the HDAd-encoded Cas9 expression cassette. Cas9 protein activity was inhibited by expressing anti-CRISPR proteins AcrIIA2 and AcrAII4 from both the producer cells and the helper virus. After purification, these helper-dependent adenoviruses will perform CRISPR/Cas9-mediated self-cleavage in the transduced target cells, thereby limiting the duration of Cas9 expression and thus represent an important platform for improving the safety of gene editing by CRISPR/Cas9.
Collapse
Affiliation(s)
- Donna J Palmer
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA
| | - Dustin L Turner
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA
| | - Philip Ng
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA
| |
Collapse
|