1
|
Ramakrishnan Chandra J, Kalidass M, Demidov D, Dabravolski SA, Lermontova I. The role of centromeric repeats and transcripts in kinetochore assembly and function. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024; 118:982-996. [PMID: 37665331 DOI: 10.1111/tpj.16445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 08/09/2023] [Accepted: 08/18/2023] [Indexed: 09/05/2023]
Abstract
Centromeres are the chromosomal domains, where the kinetochore protein complex is formed, mediating proper segregation of chromosomes during cell division. Although the function of centromeres has remained conserved during evolution, centromeric DNA is highly variable, even in closely related species. In addition, the composition of the kinetochore complexes varies among organisms. Therefore, it is assumed that the centromeric position is determined epigenetically, and the centromeric histone H3 (CENH3) serves as an epigenetic marker. The loading of CENH3 onto centromeres depends on centromere-licensing factors, chaperones, and transcription of centromeric repeats. Several proteins that regulate CENH3 loading and kinetochore assembly interact with the centromeric transcripts and DNA in a sequence-independent manner. However, the functional aspects of these interactions are not fully understood. This review discusses the variability of centromeric sequences in different organisms and the regulation of their transcription through the RNA Pol II and RNAi machinery. The data suggest that the interaction of proteins involved in CENH3 loading and kinetochore assembly with centromeric DNA and transcripts plays a role in centromere, and possibly neocentromere, formation in a sequence-independent manner.
Collapse
Affiliation(s)
| | - Manikandan Kalidass
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstrasse 3, D-06466, Seeland, Germany
| | - Dmitri Demidov
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstrasse 3, D-06466, Seeland, Germany
| | - Siarhei A Dabravolski
- Department of Biotechnology Engineering, Braude Academic College of Engineering, Snunit 51, Karmiel, 2161002, Israel
| | - Inna Lermontova
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstrasse 3, D-06466, Seeland, Germany
| |
Collapse
|
2
|
English AC, Dolzhenko E, Ziaei Jam H, McKenzie SK, Olson ND, De Coster W, Park J, Gu B, Wagner J, Eberle MA, Gymrek M, Chaisson MJP, Zook JM, Sedlazeck FJ. Analysis and benchmarking of small and large genomic variants across tandem repeats. Nat Biotechnol 2024:10.1038/s41587-024-02225-z. [PMID: 38671154 DOI: 10.1038/s41587-024-02225-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Accepted: 03/28/2024] [Indexed: 04/28/2024]
Abstract
Tandem repeats (TRs) are highly polymorphic in the human genome, have thousands of associated molecular traits and are linked to over 60 disease phenotypes. However, they are often excluded from at-scale studies because of challenges with variant calling and representation, as well as a lack of a genome-wide standard. Here, to promote the development of TR methods, we created a catalog of TR regions and explored TR properties across 86 haplotype-resolved long-read human assemblies. We curated variants from the Genome in a Bottle (GIAB) HG002 individual to create a TR dataset to benchmark existing and future TR analysis methods. We also present an improved variant comparison method that handles variants greater than 4 bp in length and varying allelic representation. The 8.1% of the genome covered by the TR catalog holds ~24.9% of variants per individual, including 124,728 small and 17,988 large variants for the GIAB HG002 'truth-set' TR benchmark. We demonstrate the utility of this pipeline across short-read and long-read technologies.
Collapse
Affiliation(s)
- Adam C English
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.
| | | | - Helyaneh Ziaei Jam
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA
| | | | - Nathan D Olson
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Wouter De Coster
- Applied and Translational Neurogenomics Group, VIB Center for Molecular Neurology, VIB, Antwerp, Belgium
- Applied and Translational Neurogenomics Group, Department of Biomedical Sciences, University of Antwerp, Antwerp, Belgium
| | - Jonghun Park
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA
| | - Bida Gu
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Justin Wagner
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | | | - Melissa Gymrek
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA
- Department of Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Mark J P Chaisson
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Justin M Zook
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
- Department of Computer Science, Rice University, Houston, TX, USA.
| |
Collapse
|
3
|
Hu L, Meng A, Tu Z, Jia S, Liu Q, Chen F. The analysis of complete genome sequence and comparative genomics of Vibrio parahaemolyticus LF1113 in Hainan. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2024; 119:105574. [PMID: 38373468 DOI: 10.1016/j.meegid.2024.105574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/04/2023] [Revised: 01/28/2024] [Accepted: 02/16/2024] [Indexed: 02/21/2024]
Abstract
Vibrio parahaemolyticus is a Gram-negative, halophilic and polymorphic coccobacillus. It is world-widely distributed and has resulted in great economic losses since its first appearance. In this study, a pathogenic strain was isolated from diseased pearl gentian grouper and identified as V. parahaemolyticus based on the sequencing results of 16S rDNA gene. In order to gain a comprehensive understanding of this isolation, the whole genome sequencing was conducted. Phylogenetic analysis of the complete genomes of 16 Vibrio species showed that LF1113, ATCC17802, ATCC33787, 2210633, FORC 004, and 160807 were the most closely related. Animal experiments demonstrated that the isolated LF1113 strain was pathogenic in a fish model. This study is the first study to describe the complete genome sequence of a V. parahaemolyticus isolate, which infected pearl gentian grouper from an outbreak in a fish factory farm in Hainan. The results will expand our understanding of genetic characteristics, pathogenesis, diagnostics and disease prevention of V. parahaemolyticus, and lay the foundation for further study.
Collapse
Affiliation(s)
- Linlin Hu
- Hainan Provincial Key Laboratory of Tropical Maricultural Technologies, Hainan Academy of Ocean and Fisheries Sciences, Haikou 571126, China; Key Laboratory of Utilization and Conservation for Tropical Marine Bioresources of Ministry of Education, Hainan Tropical Ocean University, Sanya 572022, China, Haikou 570228, China.
| | - Aiyun Meng
- Hainan Provincial Key Laboratory of Tropical Maricultural Technologies, Hainan Academy of Ocean and Fisheries Sciences, Haikou 571126, China; Key Laboratory of Utilization and Conservation for Tropical Marine Bioresources of Ministry of Education, Hainan Tropical Ocean University, Sanya 572022, China, Haikou 570228, China
| | - Zhigang Tu
- Hainan Provincial Key Laboratory of Tropical Maricultural Technologies, Hainan Academy of Ocean and Fisheries Sciences, Haikou 571126, China; Key Laboratory of Utilization and Conservation for Tropical Marine Bioresources of Ministry of Education, Hainan Tropical Ocean University, Sanya 572022, China, Haikou 570228, China
| | - Shuwen Jia
- Hainan Provincial Key Laboratory of Tropical Maricultural Technologies, Hainan Academy of Ocean and Fisheries Sciences, Haikou 571126, China
| | - Qingming Liu
- Hainan Provincial Key Laboratory of Tropical Maricultural Technologies, Hainan Academy of Ocean and Fisheries Sciences, Haikou 571126, China
| | - Fuxiao Chen
- Hainan Provincial Key Laboratory of Tropical Maricultural Technologies, Hainan Academy of Ocean and Fisheries Sciences, Haikou 571126, China; Key Laboratory of Utilization and Conservation for Tropical Marine Bioresources of Ministry of Education, Hainan Tropical Ocean University, Sanya 572022, China, Haikou 570228, China
| |
Collapse
|
4
|
Lauber E, González-Fuente M, Escouboué M, Vicédo C, Luneau JS, Pouzet C, Jauneau A, Gris C, Zhang ZM, Pichereaux C, Carrère S, Deslandes L, Noël LD. Bacterial host adaptation through sequence and structural variations of a single type III effector gene. iScience 2024; 27:109224. [PMID: 38439954 PMCID: PMC10909901 DOI: 10.1016/j.isci.2024.109224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 12/02/2023] [Accepted: 02/08/2024] [Indexed: 03/06/2024] Open
Abstract
Molecular mechanisms underlying quantitative variations of pathogenicity remain elusive. Here, we identified the Xanthomonas campestris XopJ6 effector that triggers disease resistance in cauliflower and Arabidopsis thaliana. XopJ6 is a close homolog of the Ralstoniapseudosolanacearum PopP2 YopJ family acetyltransferase. XopJ6 is recognized by the RRS1-R/RPS4 NLR pair that integrates a WRKY decoy domain mimicking effector targets. We identified a XopJ6 natural variant carrying a single residue substitution in XopJ6 WRKY-binding site that disrupts interaction with WRKY proteins. This mutation allows XopJ6 to evade immune perception while retaining some XopJ6 virulence functions. Interestingly, xopJ6 resides in a Tn3-family transposon likely contributing to xopJ6 copy number variation (CNV). Using synthetic biology, we demonstrate that xopJ6 CNV tunes pathogen virulence on Arabidopsis through gene dosage-mediated modulation of xopJ6 expression. Together, our findings highlight how sequence and structural genetic variations restricted at a particular effector gene contribute to bacterial host adaptation.
Collapse
Affiliation(s)
- Emmanuelle Lauber
- Laboratoire des Interactions Plantes-Microbes-Environnement (LIPME), Université de Toulouse, INRAE, CNRS, F-31326 Castanet-Tolosan, France
| | - Manuel González-Fuente
- Laboratoire des Interactions Plantes-Microbes-Environnement (LIPME), Université de Toulouse, INRAE, CNRS, F-31326 Castanet-Tolosan, France
| | - Maxime Escouboué
- Laboratoire des Interactions Plantes-Microbes-Environnement (LIPME), Université de Toulouse, INRAE, CNRS, F-31326 Castanet-Tolosan, France
| | - Céline Vicédo
- Laboratoire des Interactions Plantes-Microbes-Environnement (LIPME), Université de Toulouse, INRAE, CNRS, F-31326 Castanet-Tolosan, France
| | - Julien S. Luneau
- Laboratoire des Interactions Plantes-Microbes-Environnement (LIPME), Université de Toulouse, INRAE, CNRS, F-31326 Castanet-Tolosan, France
| | - Cécile Pouzet
- TRI-FRAIB Imaging Platform Facilities, FRAIB, Université de Toulouse, CNRS, UPS, 31320 Castanet-Tolosan, France
| | - Alain Jauneau
- TRI-FRAIB Imaging Platform Facilities, FRAIB, Université de Toulouse, CNRS, UPS, 31320 Castanet-Tolosan, France
| | - Carine Gris
- Laboratoire des Interactions Plantes-Microbes-Environnement (LIPME), Université de Toulouse, INRAE, CNRS, F-31326 Castanet-Tolosan, France
| | - Zhi-Min Zhang
- International Cooperative Laboratory of Traditional Chinese Medicine Modernization and Innovative Drug Development of Chinese Ministry of Education (MOE), College of Pharmacy, Jinan University, Guangzhou 510632, China
| | - Carole Pichereaux
- Fédération de Recherche Agrobiosciences, Interactions et Biodiversité (FRAIB), Université de Toulouse, CNRS, Université de Toulouse III - Paul Sabatier (UT3), Auzeville-Tolosane, France
- Institut de Pharmacologie et de Biologie Structurale (IPBS), Université de Toulouse, CNRS, Université de Toulouse III - Paul Sabatier (UT3), Toulouse, France
- Infrastructure nationale de protéomique, ProFI, FR 2048, Toulouse, France
| | - Sébastien Carrère
- Laboratoire des Interactions Plantes-Microbes-Environnement (LIPME), Université de Toulouse, INRAE, CNRS, F-31326 Castanet-Tolosan, France
| | - Laurent Deslandes
- Laboratoire des Interactions Plantes-Microbes-Environnement (LIPME), Université de Toulouse, INRAE, CNRS, F-31326 Castanet-Tolosan, France
| | - Laurent D. Noël
- Laboratoire des Interactions Plantes-Microbes-Environnement (LIPME), Université de Toulouse, INRAE, CNRS, F-31326 Castanet-Tolosan, France
| |
Collapse
|
5
|
English A, Dolzhenko E, Jam HZ, Mckenzie S, Olson ND, De Coster W, Park J, Gu B, Wagner J, Eberle MA, Gymrek M, Chaisson MJP, Zook JM, Sedlazeck FJ. Benchmarking of small and large variants across tandem repeats. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.29.564632. [PMID: 37961319 PMCID: PMC10634962 DOI: 10.1101/2023.10.29.564632] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Tandem repeats (TRs) are highly polymorphic in the human genome, have thousands of associated molecular traits, and are linked to over 60 disease phenotypes. However, their complexity often excludes them from at-scale studies due to challenges with variant calling, representation, and lack of a genome-wide standard. To promote TR methods development, we create a comprehensive catalog of TR regions and explore its properties across 86 samples. We then curate variants from the GIAB HG002 individual to create a tandem repeat benchmark. We also present a variant comparison method that handles small and large alleles and varying allelic representation. The 8.1% of the genome covered by the TR catalog holds ∼24.9% of variants per individual, including 124,728 small and 17,988 large variants for the GIAB HG002 TR benchmark. We work with the GIAB community to demonstrate the utility of this benchmark across short and long read technologies.
Collapse
|
6
|
Liu J, Ni Y, Liu C. Polymeric structure of the Cannabis sativa L. mitochondrial genome identified with an assembly graph model. Gene 2023; 853:147081. [PMID: 36470482 DOI: 10.1016/j.gene.2022.147081] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Revised: 11/14/2022] [Accepted: 11/28/2022] [Indexed: 12/03/2022]
Abstract
Cannabis sativa L. belongs to the family Cannabaceae in Rosales. It has been widely used as medicines, building materials, and textiles. Elucidating its genome is critical for molecular breeding and synthetic biology study. Many studies have shown that the mitochondrial genomes (mitogenomes) and even chloroplast genomes (plastomes) had complex polymeric structures. Using the Nanopore sequencing platform, we sequenced, assembled, and analyzed its mitogenome and plastome. The resulting unitig graph suggested that the mitogenome had a complex polymeric structure. However, a gap-free, circular sequence was further assembled from the unitig graph. In contrast, a circular sequence representing the plastome was obtained. The mitogenome major conformation was 415,837 bp long, and the plastome was 153,927 bp long. To test if the repeat sequences promote recombination, which corresponds to the branch points in the structure, we tested the sequences around repeats by long-read mapping. Among 208 pairs of predicted repeats, the mapping results supported the presence of cross-over around 25 pairs of repeats. Subsequent PCR amplification confirmed the presence of cross-over around 15 of the 25 repeats. By comparing the mitogenome and plastome sequences, we identified 19 mitochondria plastid DNAs, including seven complete genes (trnW-CCA, trnP-UGG, psbJ, trnN-GUU, trnD-GUC, trnH-GUG, trnM-CAU) and nine gene fragments. Furthermore, the selective pressure analysis results showed that five genes (atp1, ccmB, ccmC, cox1, nad7) had 19 positively selected sites. Lastly, we predicted 28 RNA editing sites. A total of 8 RNA editing sites located in the coding regions were successfully validated by PCR amplification and Sanger sequencing, of which four were synonymous, and four were nonsynonymous. In particular, the RNA editing events appeared to be tissue-specific in C. sativa mitogenome. In summary, we have confirmed the major confirmation of C. sativa mitogenome and characterized its structural features in detail. These results provide critical information for future variety breeding and resource development for C. sativa.
Collapse
Affiliation(s)
- Jingting Liu
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100193, PR China
| | - Yang Ni
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100193, PR China
| | - Chang Liu
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100193, PR China.
| |
Collapse
|
7
|
Liu J, Maxwell M, Cuddihy T, Crawford T, Bassetti M, Hyde C, Peigneur S, Tytgat J, Undheim EAB, Mobli M. ScrepYard: An online resource for disulfide-stabilized tandem repeat peptides. Protein Sci 2023; 32:e4566. [PMID: 36644825 PMCID: PMC9885460 DOI: 10.1002/pro.4566] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 01/05/2023] [Accepted: 01/12/2023] [Indexed: 01/17/2023]
Abstract
Receptor avidity through multivalency is a highly sought-after property of ligands. While readily available in nature in the form of bivalent antibodies, this property remains challenging to engineer in synthetic molecules. The discovery of several bivalent venom peptides containing two homologous and independently folded domains (in a tandem repeat arrangement) has provided a unique opportunity to better understand the underpinning design of multivalency in multimeric biomolecules, as well as how naturally occurring multivalent ligands can be identified. In previous work, we classified these molecules as a larger class termed secreted cysteine-rich repeat-proteins (SCREPs). Here, we present an online resource; ScrepYard, designed to assist researchers in identification of SCREP sequences of interest and to aid in characterizing this emerging class of biomolecules. Analysis of sequences within the ScrepYard reveals that two-domain tandem repeats constitute the most abundant SCREP domain architecture, while the interdomain "linker" regions connecting the functional domains are found to be abundant in amino acids with short or polar sidechains and contain an unusually high abundance of proline residues. Finally, we demonstrate the utility of ScrepYard as a virtual screening tool for discovery of putatively multivalent peptides, by using it as a resource to identify a previously uncharacterized serine protease inhibitor and confirm its predicted activity using an enzyme assay.
Collapse
Affiliation(s)
- Junyu Liu
- Centre for Advanced ImagingThe University of QueenslandSt. LuciaQueenslandAustralia
| | - Michael Maxwell
- Centre for Advanced ImagingThe University of QueenslandSt. LuciaQueenslandAustralia
| | - Thom Cuddihy
- Queensland Cyber Infrastructure Foundation Ltd.The University of QueenslandSt. LuciaQueenslandAustralia,Centre for Clinical ResearchThe University of QueenslandSt. LuciaQueenslandAustralia
| | - Theo Crawford
- Centre for Advanced ImagingThe University of QueenslandSt. LuciaQueenslandAustralia
| | - Madeline Bassetti
- Queensland Cyber Infrastructure Foundation Ltd.The University of QueenslandSt. LuciaQueenslandAustralia
| | - Cameron Hyde
- Queensland Cyber Infrastructure Foundation Ltd.The University of QueenslandSt. LuciaQueenslandAustralia,University of the Sunshine CoastMaroochydoreQueenslandAustralia
| | - Steve Peigneur
- Toxicology and PharmacologyUniversity of Leuven (KU Leuven)LeuvenBelgium
| | - Jan Tytgat
- Toxicology and PharmacologyUniversity of Leuven (KU Leuven)LeuvenBelgium
| | - Eivind A. B. Undheim
- Centre for Advanced ImagingThe University of QueenslandSt. LuciaQueenslandAustralia,Centre for Ecological and Evolutionary Synthesis, Department of BiosciencesUniversity of OsloOsloNorway
| | - Mehdi Mobli
- Centre for Advanced ImagingThe University of QueenslandSt. LuciaQueenslandAustralia
| |
Collapse
|
8
|
Remnants of SIRE1 retrotransposons in human genome? J Genet 2022. [DOI: 10.1007/s12041-022-01398-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
9
|
Boukaba A, Wu Q, Liu J, Chen C, Liang J, Li J, Strunnikov A. Mapping separase-mediated cleavage in situ. NAR Genom Bioinform 2022; 4:lqac085. [PMID: 36415827 PMCID: PMC9673495 DOI: 10.1093/nargab/lqac085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 10/13/2022] [Accepted: 10/25/2022] [Indexed: 11/21/2022] Open
Abstract
Separase is a protease that performs critical functions in the maintenance of genetic homeostasis. Among them, the cleavage of the meiotic cohesin during meiosis is a key step in producing gametes in eukaryotes. However, the exact chromosomal localization of this proteolytic cleavage was not addressed due to the lack of experimental tools. To this end, we developed a method based on monoclonal antibodies capable of recognizing the predicted neo-epitopes produced by separase-mediated proteolysis in the RAD21 and REC8 cohesin subunits. To validate the epigenomic strategy of mapping cohesin proteolysis, anti-RAD21 neo-epitopes antibodies were used in ChIP-On-ChEPseq analysis of human cells undergoing mitotic anaphase. Second, a similar analysis applied for mapping of REC8 cleavage in germline cells in Macaque showed a correlation with a subset of alpha-satellites and other repeats, directly demonstrating that the site-specific mei-cohesin proteolysis hotspots are coincident but not identical with centromeres. The sequences for the corresponding immunoglobulin genes show a convergence of antibodies with close specificity. This approach could be potentially used to investigate cohesin ring opening events in other chromosomal locations, if applied to single cells.
Collapse
Affiliation(s)
- Abdelhalim Boukaba
- Molecular Epigenetics Laboratory, Guangzhou Institutes of Biomedicine and Health , Guangzhou , Guangdong , 510530 , China
| | - Qiongfang Wu
- Molecular Epigenetics Laboratory, Guangzhou Institutes of Biomedicine and Health , Guangzhou , Guangdong , 510530 , China
| | - Jian Liu
- Molecular Epigenetics Laboratory, Guangzhou Institutes of Biomedicine and Health , Guangzhou , Guangdong , 510530 , China
| | - Cheng Chen
- Molecular Epigenetics Laboratory, Guangzhou Institutes of Biomedicine and Health , Guangzhou , Guangdong , 510530 , China
| | - Jierong Liang
- Molecular Epigenetics Laboratory, Guangzhou Institutes of Biomedicine and Health , Guangzhou , Guangdong , 510530 , China
| | - Jingjing Li
- Molecular Epigenetics Laboratory, Guangzhou Institutes of Biomedicine and Health , Guangzhou , Guangdong , 510530 , China
| | - Alexander V Strunnikov
- Molecular Epigenetics Laboratory, Guangzhou Institutes of Biomedicine and Health , Guangzhou , Guangdong , 510530 , China
| |
Collapse
|
10
|
Trinca V, Uliana JVC, Ribeiro GKS, Torres TT, Monesi N. Characterization of the mitochondrial genomes of Bradysia hygida, Phytosciara flavipes and Trichosia splendens (Diptera: Sciaridae) and novel insights on the control region of sciarid mitogenomes. INSECT MOLECULAR BIOLOGY 2022; 31:482-496. [PMID: 35332955 DOI: 10.1111/imb.12774] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Revised: 02/12/2022] [Accepted: 03/21/2022] [Indexed: 06/14/2023]
Abstract
Sciarids, also called "fungus gnats" are small, almost entirely dark-coloured insects. Sciarid larvae feed on different substrates and can infest agricultural crops and mushroom nurseries, causing economic losses. Of the 2174 Diptera mitogenome sequences currently available in GenBank, only eight are from the Sciaridae family, none of which are complete circular molecules. Here we describe the mitogenome sequences of three sciarid species: Phytosciara flavipes, Trichosia splendens and Bradysia hygida and provide novel insights on the control region of sciarid mitogenomes. The assembled mitogenomes range from 16,062 bp in P. flavipes to 17,095 bp in B. hygida. All 13 protein coding genes, 22 tRNAs and 2 rRNAs characteristic of insect mitogenomes were identified, but the sequence of the control region could not be determined. Experimental results suggest that the B. hygida control region is about 21 kb long resulting in a 37 kb long mitogenome which constitutes the largest insect mitochondrial genome described so far. Phylogenetic analysis using all Bibionomorpha mitogenome sequences available in GenBank strongly supports the Sciaridae monophyly and led to the identification of species and subfamily specific gene rearrangements. Our study extends the knowledge of this large and diverse insect family that includes agricultural pest species.
Collapse
Affiliation(s)
- Vitor Trinca
- Programa de Biologia Celular e Molecular, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, Brazil
| | - João Vitor Cardoso Uliana
- Programa de Biologia Celular e Molecular, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, Brazil
| | - Geyza Katrinny Sousa Ribeiro
- Departamento de Análises Clínicas, Toxicológicas e Bromatológicas, Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, Brazil
| | - Tatiana Teixeira Torres
- Departamento de Genética e Biologia Evolutiva, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil
| | - Nadia Monesi
- Programa de Biologia Celular e Molecular, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, Brazil
- Departamento de Análises Clínicas, Toxicológicas e Bromatológicas, Faculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, Brazil
| |
Collapse
|
11
|
Hilário S, Gonçalves MFM, Fidalgo C, Tacão M, Alves A. Genome Analyses of Two Blueberry Pathogens: Diaporthe amygdali CAA958 and Diaporthe eres CBS 160.32. J Fungi (Basel) 2022; 8:jof8080804. [PMID: 36012791 PMCID: PMC9409727 DOI: 10.3390/jof8080804] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Revised: 07/27/2022] [Accepted: 07/28/2022] [Indexed: 02/04/2023] Open
Abstract
The genus Diaporthe includes pathogenic species distributed worldwide and affecting a wide variety of hosts. Diaporthe amygdali and Diaporthe eres have been found to cause cankers, dieback, or twig blights on economically important crops such as soybean, almond, grapevine, and blueberry. Despite their importance as plant pathogens, the strategies of species of Diaporthe to infect host plants are poorly explored. To provide a genomic basis of pathogenicity, the genomes of D. amygdali CAA958 and D. eres CBS 160.32 were sequenced and analyzed. Cellular transporters involved in the transport of toxins, ions, sugars, effectors, and genes implicated in pathogenicity were detected in both genomes. Hydrolases and oxidoreductases were the most prevalent carbohydrate-active enzymes (CAZymes). However, analyses of the secreted proteins revealed that the secretome of D. eres CBS 160.32 is represented by 5.4% of CAZymes, whereas the secreted CAZymes repertoire of D. amygdali CAA958 represents 29.1% of all secretomes. Biosynthetic gene clusters (BGCs) encoding compounds related to phytotoxins and mycotoxins were detected in D. eres and D. amygdali genomes. The core gene clusters of the phytotoxin Fusicoccin A in D. amygdali are reported here through a genome-scale assembly. Comparative analyses of the genomes from 11 Diaporthe species revealed an average of 874 CAZymes, 101 secondary metabolite BGCs, 1640 secreted proteins per species, and genome sizes ranging from 51.5 to 63.6 Mbp. This study offers insights into the overall features and characteristics of Diaporthe genomes. Our findings enrich the knowledge about D. eres and D. amygdali, which will facilitate further research into the pathogenicity mechanisms of these species.
Collapse
|
12
|
Sui Y, Epstein A, Dominska M, Zheng DQ, Petes T, Klein H. Ribodysgenesis: sudden genome instability in the yeast Saccharomyces cerevisiae arising from RNase H2 cleavage at genomic-embedded ribonucleotides. Nucleic Acids Res 2022; 50:6890-6902. [PMID: 35748861 PMCID: PMC9262587 DOI: 10.1093/nar/gkac536] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Revised: 05/27/2022] [Accepted: 06/07/2022] [Indexed: 12/24/2022] Open
Abstract
Ribonucleotides can be incorporated into DNA during replication by the replicative DNA polymerases. These aberrant DNA subunits are efficiently recognized and removed by Ribonucleotide Excision Repair, which is initiated by the heterotrimeric enzyme RNase H2. While RNase H2 is essential in higher eukaryotes, the yeast Saccharomyces cerevisiae can survive without RNase H2 enzyme, although the genome undergoes mutation, recombination and other genome instability events at an increased rate. Although RNase H2 can be considered as a protector of the genome from the deleterious events that can ensue from recognition and removal of embedded ribonucleotides, under conditions of high ribonucleotide incorporation and retention in the genome in a RNase H2-negative strain, sudden introduction of active RNase H2 causes massive DNA breaks and genome instability in a condition which we term 'ribodysgenesis'. The DNA breaks and genome instability arise solely from RNase H2 cleavage directed to the ribonucleotide-containing genome. Survivors of ribodysgenesis have massive loss of heterozygosity events stemming from recombinogenic lesions on the ribonucleotide-containing DNA, with increases of over 1000X from wild-type. DNA breaks are produced over one to two divisions and subsequently cells adapt to RNase H2 and ribonucleotides in the genome and grow with normal levels of genome instability.
Collapse
Affiliation(s)
- Yang Sui
- State Key Laboratory of Motor Vehicle Biofuel Technology, Ocean College, Zhejiang University, Zhoushan 316021, China,Department of Molecular Genetics and Microbiology, Duke University School of Medicine, Durham, NC 27710, USA
| | - Anastasiya Epstein
- Department of Biochemistry and Molecular Pharmacology, New York University Grossman School of Medicine, New York, NY 10016, USA
| | - Margaret Dominska
- Department of Molecular Genetics and Microbiology, Duke University School of Medicine, Durham, NC 27710, USA
| | - Dao-Qiong Zheng
- State Key Laboratory of Motor Vehicle Biofuel Technology, Ocean College, Zhejiang University, Zhoushan 316021, China,Hainan Institute of Zhejiang University, Sanya 572000, China,ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou 311200, China
| | - Thomas D Petes
- Department of Molecular Genetics and Microbiology, Duke University School of Medicine, Durham, NC 27710, USA
| | - Hannah L Klein
- To whom correspondence should be addressed. Tel: +1 212 263 5778;
| |
Collapse
|
13
|
Yang L, Zhang L, Yin P, Ding H, Xiao Y, Zeng J, Wang W, Zhou H, Wang Q, Zhang Y, Chen Z, Yang M, Feng Y. Insights into the inhibition of type I-F CRISPR-Cas system by a multifunctional anti-CRISPR protein AcrIF24. Nat Commun 2022; 13:1931. [PMID: 35411005 PMCID: PMC9001735 DOI: 10.1038/s41467-022-29581-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Accepted: 03/16/2022] [Indexed: 11/18/2022] Open
Abstract
CRISPR-Cas systems are prokaryotic adaptive immune systems and phages use anti-CRISPR proteins (Acrs) to counteract these systems. Here, we report the structures of AcrIF24 and its complex with the crRNA-guided surveillance (Csy) complex. The HTH motif of AcrIF24 can bind the Acr promoter region and repress its transcription, suggesting its role as an Aca gene in self-regulation. AcrIF24 forms a homodimer and further induces dimerization of the Csy complex. Apart from blocking the hybridization of target DNA to the crRNA, AcrIF24 also induces the binding of non-sequence-specific dsDNA to the Csy complex, similar to AcrIF9, although this binding seems to play a minor role in AcrIF24 inhibitory capacity. Further structural and biochemical studies of the Csy-AcrIF24-dsDNA complexes and of AcrIF24 mutants reveal that the HTH motif of AcrIF24 and the PAM recognition loop of the Csy complex are structural elements essential for this non-specific dsDNA binding. Moreover, AcrIF24 and AcrIF9 display distinct characteristics in inducing non-specific DNA binding. Together, our findings highlight a multifunctional Acr and suggest potential wide distribution of Acr-induced non-specific DNA binding. Phages use anti-CRISPR proteins (Acrs) to counteract the bacterial CRISPR-Cas systems. Here, the authors characterize AcrIF24, which functions as an Aca (Acr-associated) to repress and regulate its own transcription, dimerizes the Csy complex, blocks the hybridization of target DNA, and tethers non-sequence-specific DNA to the Csy complex.
Collapse
|
14
|
Bacterial N4-methylcytosine as an epigenetic mark in eukaryotic DNA. Nat Commun 2022; 13:1072. [PMID: 35228526 PMCID: PMC8885841 DOI: 10.1038/s41467-022-28471-w] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Accepted: 01/21/2022] [Indexed: 01/04/2023] Open
Abstract
DNA modifications are used to regulate gene expression and defend against invading genetic elements. In eukaryotes, modifications predominantly involve C5-methylcytosine (5mC) and occasionally N6-methyladenine (6mA), while bacteria frequently use N4-methylcytosine (4mC) in addition to 5mC and 6mA. Here we report that 4mC can serve as an epigenetic mark in eukaryotes. Bdelloid rotifers, tiny freshwater invertebrates with transposon-poor genomes rich in foreign genes, lack canonical eukaryotic C5-methyltransferases for 5mC addition, but encode an amino-methyltransferase, N4CMT, captured from bacteria >60 Mya. N4CMT deposits 4mC at active transposons and certain tandem repeats, and fusion to a chromodomain shapes its “histone-read-DNA-write” architecture recognizing silent chromatin marks. Furthermore, amplification of SETDB1 H3K9me3 histone methyltransferases yields variants preferentially binding 4mC-DNA, suggesting “DNA-read-histone-write” partnership to maintain chromatin-based silencing. Our results show how non-native DNA methyl groups can reshape epigenetic systems to silence transposons and demonstrate the potential of horizontal gene transfer to drive regulatory innovation in eukaryotes. Eukaryotic DNA can be methylated as 5-methylcytosine and N6-methyladenine, but whether other forms of DNA methylation occur has been controversial. Here the authors show that a bacterial DNA methyltransferase was acquired >60 Mya in bdelloid rotifers that catalyzes N4-methylcytosine addition and is involved in suppression of transposon proliferation.
Collapse
|
15
|
The Genome of Rhyzopertha dominica (Fab.) (Coleoptera: Bostrichidae): Adaptation for Success. Genes (Basel) 2022; 13:genes13030446. [PMID: 35328000 PMCID: PMC8956072 DOI: 10.3390/genes13030446] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 02/16/2022] [Accepted: 02/21/2022] [Indexed: 12/15/2022] Open
Abstract
The lesser grain borer, Rhyzopertha dominica (F.) (Coleoptera: Bostrichidae), is a major global pest of cereal grains. Infestations are difficult to control as larvae feed inside grain kernels, and many populations are resistant to both contact insecticides and fumigants. We sequenced the genome of R. dominica to identify genes responsible for important biological functions and develop more targeted and efficacious management strategies. The genome was assembled from long read sequencing and long-range scaffolding technologies. The genome assembly is 479.1 Mb, close to the predicted genome size of 480.4 Mb by flow cytometry. This assembly is among the most contiguous beetle assemblies published to date, with 139 scaffolds, an N50 of 53.6 Mb, and L50 of 4, indicating chromosome-scale scaffolds. Predicted genes from biologically relevant groups were manually annotated using transcriptome data from adults and different larval tissues to guide annotation. The expansion of carbohydrase and serine peptidase genes suggest that they combine to enable efficient digestion of cereal proteins. A reduction in the copy number of several detoxification gene families relative to other coleopterans may reflect the low selective pressure on these genes in an insect that spends most of its life feeding internally. Chemoreceptor genes contain elevated numbers of pseudogenes for odorant receptors that also may be related to the recent ontogenetic shift of R. dominica to a diet consisting primarily of stored grains. Analysis of repetitive sequences will further define the evolution of bostrichid beetles compared to other species. The data overall contribute significantly to coleopteran genetic research.
Collapse
|
16
|
Tutaj H, Pirog A, Tomala K, Korona R. Genome-scale patterns in the loss of heterozygosity incidence in Saccharomyces cerevisiae. Genetics 2022; 221:6536968. [PMID: 35212738 PMCID: PMC9071580 DOI: 10.1093/genetics/iyac032] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Accepted: 02/17/2022] [Indexed: 02/07/2023] Open
Abstract
Former studies have established that loss of heterozygosity can be a key driver of sequence evolution in unicellular eukaryotes and tissues of metazoans. However, little is known about whether the distribution of loss of heterozygosity events is largely random or forms discernible patterns across genomes. To initiate our experiments, we introduced selectable markers to both arms of all chromosomes of the budding yeast. Subsequent extensive assays, repeated over several genetic backgrounds and environments, provided a wealth of information on the genetic and environmental determinants of loss of heterozygosity. Three findings stand out. First, the number of loss of heterozygosity events per unit time was more than 25 times higher for growing than starving cells. Second, loss of heterozygosity was most frequent when regions of homology around a recombination site were identical, about a half-% sequence divergence was sufficient to reduce its incidence. Finally, the density of loss of heterozygosity events was highly dependent on the genome's physical architecture. It was several-fold higher on short chromosomal arms than on long ones. Comparably large differences were seen within a single arm where regions close to a centromere were visibly less affected than regions close, though usually not strictly adjacent, to a telomere. We suggest that the observed uneven distribution of loss of heterozygosity events could have been caused not only by an uneven density of initial DNA damages. Location-depended differences in the mode of DNA repair, or its effect on fitness, were likely to operate as well.
Collapse
Affiliation(s)
- Hanna Tutaj
- Institute of Environmental Sciences, Jagiellonian University, 30-387 Cracow, Poland
| | - Adrian Pirog
- Institute of Environmental Sciences, Jagiellonian University, 30-387 Cracow, Poland
| | - Katarzyna Tomala
- Institute of Environmental Sciences, Jagiellonian University, 30-387 Cracow, Poland
| | - Ryszard Korona
- Institute of Environmental Sciences, Jagiellonian University, 30-387 Cracow, Poland,Corresponding author: Institute of Environmental Sciences, Jagiellonian University, Gronostajowa Street 7, 30-387 Krakow, Poland.
| |
Collapse
|
17
|
Geng J, Liu H, Chen S, Long J, Jin Y, Yang H, Duan G. Comparative genomic analysis of Escherichia coli strains obtained from continuous imipenem stress evolution. FEMS Microbiol Lett 2022; 369:6526866. [PMID: 35147175 DOI: 10.1093/femsle/fnac015] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Revised: 01/07/2022] [Accepted: 02/09/2022] [Indexed: 11/13/2022] Open
Abstract
The carbapenem-resistant Escherichia coli (E. coli) has aroused increasing attention worldwide, especially in terms of imipenem (IMP) resistance. The molecular mechanism of IMP resistance remains unclear. This study aimed to explore the resistance mechanisms of IMP in E. coli. Susceptible Sx181-0-1 strain was induced into resistance strains by adaptive laboratory evolution. The drug resistance spectrum was measured using the disk diffusion and microbroth dilution methods. Whole-genome sequencing and resequencing were used to analyze the non-synonymous single-nucleotide polymorphisms (nsSNPs) between the primary susceptible strain and resistant strains. The expression levels of these genes with nsSNPs were identified by real-time quantitative PCR (RT-qPCR). Resistance phenotype appeared in the induced 15th generation (induction time = 183 h). Sx181-32 and Sx181-256, which had minimum inhibitory concentrations of IMP of 8 and 64 µg mL-1, were isolated during continuous subculture exposed to increasing concentrations of IMP, respectively. Nineteen nsSNPs were observed both in Sx181-32 and Sx181-256, including rpsU, sdaC, zwf, ttuC, araJ, dacC, mrdA, secF, dacD, lpxD, mrcB, ftsI, envZ, and two unknown function genes (orf01892 and orf01933). Among these 15 genes, five genes (dacC, mrdA, lpxD, mrcB, and ftsI) were mainly involved in cell wall synthesis. The mrdA (V338A, L378P, and M574I) and mrcB (P784L, A736V, and T708A) had three amino acid substitutions, respectively. The expression levels of rpsU, ttuC and orf01933 were elevated in both Sx181-32 and Sx181-256 compared to Sx181-0-1. The expression levels of these genes were elevated in Sx181-256, except for araJ. Bacteria developed resistance to antimicrobials by regulating various biological processes, among which the most involved is the cell wall synthesis (dacC, mrdA, lpxD, mrcB, and ftsI). The combination mutations of mrdA, envZ, and ftsI genes may increase the resistance to IMP. Our study could improve the understanding of the molecular mechanism underlying the IMP resistance of E. coli.
Collapse
Affiliation(s)
- Juan Geng
- Department of Epidemiology and Health Statistics, College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Huiying Liu
- People's Hospital of Henan University of Chinese Medicine, Zhengzhou, China.,People's Hospital of Zhengzhou, Zhengzhou, China
| | - Shuaiyin Chen
- Department of Epidemiology and Health Statistics, College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Jinzhao Long
- Department of Epidemiology and Health Statistics, College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Yuefei Jin
- Department of Epidemiology and Health Statistics, College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Haiyan Yang
- Department of Epidemiology and Health Statistics, College of Public Health, Zhengzhou University, Zhengzhou, China
| | - Guangcai Duan
- Department of Epidemiology and Health Statistics, College of Public Health, Zhengzhou University, Zhengzhou, China
| |
Collapse
|
18
|
Tang R, Li Y, Han F, Li Z, Lin X, Sun H, Zhang X, Jiang Q, Nie H, Li Y. A CTCF-Binding Element and Histone Deacetylation Cooperatively Maintain Chromatin Loops, Linking to Long-Range Gene Regulation in Cancer Genomes. Front Oncol 2022; 11:821495. [PMID: 35127534 PMCID: PMC8813737 DOI: 10.3389/fonc.2021.821495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 12/16/2021] [Indexed: 11/26/2022] Open
Abstract
Background Genes spanning long chromosomal domains are coordinately regulated in human genome, which contribute to global gene dysregulation and carcinogenesis in cancer. It has been noticed that epigenetic modification and chromatin architecture may participate in the regulation process. However, the regulation patterns and functional elements of long-range gene regulation are unclear. Methods Based on the clinical transcriptome data from different tumor sets, a novel expressional correlation analysis pipeline was performed to classify the co-regulated regions and subsets of intercorrelated regions. The GLAM2 program was used to predict conserved DNA elements that enriched in regions. Two conserved elements were selected to delete in Ishikawa and HeLa cells by CRISPR-Cas9. SAHA treatment and HDAC knockdown were used to change the histone acetylation status. Using qPCR, MTT, and scratch healing assay, we evaluate the effect on gene expression and cancer cell phenotype. By DNA pull-down and ChIP, the element-binding proteins were testified. 3C and 3D-FISH were performed to depict the alteration in chromatin architecture. Results In multiple cancer genomes, we classified subsets of coordinately regulated regions (sub-CRRs) that possibly shared the same regulatory mechanisms and exhibited similar expression patterns. A new conserved DNA element (CRE30) was enriched in sub-CRRs and associated with cancer patient survival. CRE30 could restrict gene regulation in sub-CRRs and affect cancer cell phenotypes. DNA pull-down showed that multiple proteins including CTCF were recruited on the CRE30 locus, and ChIP assay confirmed the CTCF-binding signals. Subsequent results uncovered that as an essential element, CRE30 maintained chromatin loops and mediated a compact chromatin architecture. Moreover, we found that blocking global histone deacetylation induced chromatin loop disruption and CTCF dropping in the region containing CRE30, linked to promoted gene regulation. Additionally, similar effects were observed with CRE30 deletion in another locus of chromosome 8. Conclusions Our research clarified a new functional element that recruits CTCF and collaborates with histone deacetylation to maintain high-order chromatin organizations, linking to long-range gene regulation in cancer genomes. The findings highlight a close relationship among conserved DNA element, epigenetic modification, and chromatin architecture in long-range gene regulation process.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Huan Nie
- *Correspondence: Yu Li, ; Huan Nie,
| | - Yu Li
- *Correspondence: Yu Li, ; Huan Nie,
| |
Collapse
|
19
|
Genomic and Metabolomic Analyses of the Marine Fungus Emericellopsis cladophorae: Insights into Saltwater Adaptability Mechanisms and Its Biosynthetic Potential. J Fungi (Basel) 2021; 8:jof8010031. [PMID: 35049971 PMCID: PMC8780691 DOI: 10.3390/jof8010031] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 12/18/2021] [Accepted: 12/27/2021] [Indexed: 01/01/2023] Open
Abstract
The genus Emericellopsis is found in terrestrial, but mainly in marine, environments with a worldwide distribution. Although Emericellopsis has been recognized as an important source of bioactive compounds, the range of metabolites expressed by the species of this genus, as well as the genes involved in their production are still poorly known. Untargeted metabolomics, using UPLC- QToF–MS/MS, and genome sequencing (Illumina HiSeq) was performed to unlock E. cladophorae MUM 19.33 chemical diversity. The genome of E. cladophorae is 26.9 Mb and encodes 8572 genes. A large set of genes encoding carbohydrate-active enzymes (CAZymes), secreted proteins, transporters, and secondary metabolite biosynthetic gene clusters were identified. Our analysis also revealed genomic signatures that may reflect a certain fungal adaptability to the marine environment, such as genes encoding for (1) the high-osmolarity glycerol pathway; (2) osmolytes’ biosynthetic processes; (3) ion transport systems, and (4) CAZymes classes allowing the utilization of marine polysaccharides. The fungal crude extract library constructed revealed a promising source of antifungal (e.g., 9,12,13-Trihydroxyoctadec-10-enoic acid, hymeglusin), antibacterial (e.g., NovobiocinA), anticancer (e.g., daunomycinone, isoreserpin, flavopiridol), and anti-inflammatory (e.g., 2’-O-Galloylhyperin) metabolites. We also detected unknown compounds with no structural match in the databases used. The metabolites’ profiles of E. cladophorae MUM 19.33 fermentations were salt dependent. The results of this study contribute to unravel aspects of the biology and ecology of this marine fungus. The genome and metabolome data are relevant for future biotechnological exploitation of the species.
Collapse
|
20
|
Gonçalves MFM, Hilário S, Tacão M, Van de Peer Y, Alves A, Esteves AC. Genome and Metabolome MS-Based Mining of a Marine Strain of Aspergillus affinis. J Fungi (Basel) 2021; 7:1091. [PMID: 34947073 PMCID: PMC8709101 DOI: 10.3390/jof7121091] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Revised: 12/17/2021] [Accepted: 12/17/2021] [Indexed: 01/09/2023] Open
Abstract
Aspergillus section Circumdati encompasses several species that express both beneficial (e.g., biochemical transformation of steroids and alkaloids, enzymes and metabolites) and harmful compounds (e.g., production of ochratoxin A (OTA)). Given their relevance, it is important to analyze the genetic and metabolic diversity of the species of this section. We sequenced the genome of Aspergillus affinis CMG 70, isolated from sea water, and compared it with the genomes of species from section Circumdati, including A. affinis's strain type. The A. affinis genome was characterized considering secondary metabolites biosynthetic gene clusters (BGCs), carbohydrate-active enzymes (CAZymes), and transporters. To uncover the biosynthetic potential of A. affinis CMG 70, an untargeted metabolomics (LC-MS/MS) approach was used. Cultivating the fungus in the presence and absence of sea salt showed that A. affinis CMG 70 metabolite profiles are salt dependent. Analyses of the methanolic crude extract revealed the presence of both unknown and well-known Aspergillus compounds, such as ochratoxin A, anti-viral (e.g., 3,5-Di-tert-butyl-4-hydroxybenzoic acid and epigallocatechin), anti-bacterial (e.g., 3-Hydroxybenzyl alcohol, l-pyroglutamic acid, lecanoric acid), antifungal (e.g., lpyroglutamic acid, 9,12,13-Trihydroxyoctadec-10-enoic acid, hydroxyferulic acid), and chemotherapeutic (e.g., daunomycinone, mitoxantrone) related metabolites. Comparative analysis of 17 genomes from 16 Aspergillus species revealed abundant CAZymes (568 per species), secondary metabolite BGCs (73 per species), and transporters (1359 per species). Some BGCs are highly conserved in this section (e.g., pyranonigrin E and UNII-YC2Q1O94PT (ACR toxin I)), while others are incomplete or completely lost among species (e.g., bikaverin and chaetoglobosins were found exclusively in series Sclerotiorum, while asperlactone seemed completely lost). The results of this study, including genome analysis and metabolome characterization, emphasize the molecular diversity of A. affinis CMG 70, as well as of other species in the section Circumdati.
Collapse
Affiliation(s)
- Micael F. M. Gonçalves
- CESAM, Department of Biology, University of Aveiro, 3810-193 Aveiro, Portugal; (M.F.M.G.); (S.H.); (M.T.); (A.C.E.)
| | - Sandra Hilário
- CESAM, Department of Biology, University of Aveiro, 3810-193 Aveiro, Portugal; (M.F.M.G.); (S.H.); (M.T.); (A.C.E.)
| | - Marta Tacão
- CESAM, Department of Biology, University of Aveiro, 3810-193 Aveiro, Portugal; (M.F.M.G.); (S.H.); (M.T.); (A.C.E.)
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium;
- Center for Plant Systems Biology, VIB, 9052 Ghent, Belgium
- Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria 0028, South Africa
- College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing 210095, China
| | - Artur Alves
- CESAM, Department of Biology, University of Aveiro, 3810-193 Aveiro, Portugal; (M.F.M.G.); (S.H.); (M.T.); (A.C.E.)
| | - Ana C. Esteves
- CESAM, Department of Biology, University of Aveiro, 3810-193 Aveiro, Portugal; (M.F.M.G.); (S.H.); (M.T.); (A.C.E.)
| |
Collapse
|
21
|
MicroSEC filters sequence errors for formalin-fixed and paraffin-embedded samples. Commun Biol 2021; 4:1396. [PMID: 34912045 PMCID: PMC8674242 DOI: 10.1038/s42003-021-02930-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Accepted: 11/30/2021] [Indexed: 12/14/2022] Open
Abstract
The clinical sequencing of tumors is usually performed on formalin-fixed, paraffin-embedded samples and results in many sequencing errors. We identified that most of these errors are detected in chimeric reads caused by single-strand DNA molecules with microhomology. During the end-repair step of library preparation, mutations are introduced by the mis-annealing of two single-strand DNA molecules comprising homologous sequences. The mutated bases are distributed unevenly near the ends in the individual reads. Our filtering pipeline, MicroSEC, focuses on the uneven distribution of mutations in each read and removes the sequencing errors in formalin-fixed, paraffin-embedded samples without over-eliminating the mutations detected also in fresh frozen samples. Amplicon-based sequencing using 97 mutations confirmed that the sensitivity and specificity of MicroSEC were 97% (95% confidence interval: 82–100%) and 96% (95% confidence interval: 88–99%), respectively. Our pipeline will increase the reliability of the clinical sequencing and advance the cancer research using formalin-fixed, paraffin-embedded samples. Masachika Ikegami and Shinji Kohsaka et al. develop MicroSEC, a computational pipeline to filter sequencing artifacts from archival formalin-fixed and paraffin-embedded samples. Given that archival FFPE tissue is of great interest for genomic analysis, but difficult to reliably analyze, this tool may improve the ability of researchers to probe sequencing data from these samples.
Collapse
|
22
|
Bora CAF, Varghese A, Deepa CK, Nandini A, Malangmei L, Kumar KGA, Raina OK, John L, Prasanna P, Asaf M, Kumar GS, Hembram PK, Vergis J, Juliet S, Ravindran R. Sequence and phylogenetic analysis of the thrombospondin-related adhesive protein gene of Babesia gibsoni isolates in dogs in South India. Parasitol Int 2021; 86:102477. [PMID: 34619383 DOI: 10.1016/j.parint.2021.102477] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2021] [Revised: 09/26/2021] [Accepted: 09/29/2021] [Indexed: 10/20/2022]
Abstract
Babesia gibsoni, the causative agent of canine piroplasmosis, is a tick-borne intraerythrocytic protozoan parasite predominantly reported in Asian countries. The present study aimed at genotypic characterization of B. gibsoni isolates prevalent in dogs in Kerala, a southern state of India. Blood samples were collected from 272 dogs in Kerala and B. gibsoni infection was detected by microscopy and polymerase chain reaction (PCR). Molecular confirmation of B. gibsoni parasites was carried out by 18S rRNA nested-PCR, followed by sequencing. Nested-PCR detected a higher percentage of dogs (40.44%) positive for B. gibsoni infection than microscopy where 15.81% dogs were detected positive for infection. Genetic characterization of B. gibsoni isolates (n = 11) prevalent in dogs in the state of Kerala was carried out by PCR amplification and sequencing of the 855 bp thrombospondin-related adhesive protein (TRAP) gene fragment. Phylogenetic analysis of the B. gibsoni TRAP (BgTRAP) gene revealed that B. gibsoni isolates from Kerala formed a distinct cluster with the isolates from north India and Bangladesh, away from other East Asian isolates. Nucleotide analysis of the tandem repeats of BgTRAP gene showed considerable genetic variation among Indian isolates that was shared by B. gibsoni isolates of Bangladesh but not by the isolates of East Asian countries. The results of the present study further confirmed that B. gibsoni parasites in a distinct genetic clade are endemic in dogs in India and Bangladesh. However, elaborate studies are required for better understanding of the genetic diversity of B. gibsoni.
Collapse
Affiliation(s)
- Christophe Angeline Felicia Bora
- Department of Veterinary Parasitology, College of Veterinary and Animal Sciences, Pookode, Kerala Veterinary and Animal Sciences University, Wayanad, Kerala, India
| | - Anju Varghese
- Department of Veterinary Parasitology, College of Veterinary and Animal Sciences, Pookode, Kerala Veterinary and Animal Sciences University, Wayanad, Kerala, India.
| | - Chundayil Kalarickal Deepa
- Department of Veterinary Parasitology, College of Veterinary and Animal Sciences, Pookode, Kerala Veterinary and Animal Sciences University, Wayanad, Kerala, India
| | - Ashwathappa Nandini
- Department of Veterinary Parasitology, College of Veterinary and Animal Sciences, Pookode, Kerala Veterinary and Animal Sciences University, Wayanad, Kerala, India
| | - Lanchalung Malangmei
- Department of Veterinary Parasitology, College of Veterinary and Animal Sciences, Pookode, Kerala Veterinary and Animal Sciences University, Wayanad, Kerala, India
| | - Karapparambu Gopalan Ajith Kumar
- Department of Veterinary Parasitology, College of Veterinary and Animal Sciences, Pookode, Kerala Veterinary and Animal Sciences University, Wayanad, Kerala, India
| | - Opinder Krishen Raina
- Division of Parasitology, ICAR-Indian Veterinary Research Institute, Izatnagar, Uttar Pradesh, Bareilly, India
| | - Lijo John
- Department of Veterinary Biochemistry, College of Veterinary and Animal Sciences, Pookode, Kerala Veterinary and Animal Sciences University, Wayanad, Kerala, India
| | - Preena Prasanna
- Department of Animal Husbandry, District Veterinary Centre, Kannur, Kerala, India
| | - Muhasin Asaf
- Department of Animal Breeding and Genetics, College of Veterinary and Animal Sciences, Pookode, Kerala Veterinary and Animal Sciences University, Wayanad, Kerala, India
| | - Gatchanda Shravan Kumar
- Department of Veterinary Parasitology, College of Veterinary and Animal Sciences, Pookode, Kerala Veterinary and Animal Sciences University, Wayanad, Kerala, India
| | - Prabodh Kumar Hembram
- Department of Veterinary Parasitology, College of Veterinary and Animal Sciences, Pookode, Kerala Veterinary and Animal Sciences University, Wayanad, Kerala, India
| | - Jess Vergis
- Department of Veterinary Public health, College of Veterinary and Animal Sciences, Pookode, Kerala Veterinary and Animal Sciences University, Wayanad, Kerala, India
| | - Sanis Juliet
- Department of Pharmacology and Toxicology, College of Veterinary and Animal Sciences, Pookode, Kerala Veterinary and Animal Sciences University, Wayanad, Kerala, India
| | - Reghu Ravindran
- Department of Veterinary Parasitology, College of Veterinary and Animal Sciences, Pookode, Kerala Veterinary and Animal Sciences University, Wayanad, Kerala, India
| |
Collapse
|
23
|
Eslami Rasekh M, Hernández Y, Drinan SD, Fuxman Bass J, Benson G. Genome-wide characterization of human minisatellite VNTRs: population-specific alleles and gene expression differences. Nucleic Acids Res 2021; 49:4308-4324. [PMID: 33849068 PMCID: PMC8096271 DOI: 10.1093/nar/gkab224] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Revised: 03/06/2021] [Accepted: 03/18/2021] [Indexed: 11/12/2022] Open
Abstract
Variable Number Tandem Repeats (VNTRs) are tandem repeat (TR) loci that vary in copy number across a population. Using our program, VNTRseek, we analyzed human whole genome sequencing datasets from 2770 individuals in order to detect minisatellite VNTRs, i.e., those with pattern sizes ≥7 bp. We detected 35 638 VNTR loci and classified 5676 as commonly polymorphic (i.e. with non-reference alleles occurring in >5% of the population). Commonly polymorphic VNTR loci were found to be enriched in genomic regions with regulatory function, i.e. transcription start sites and enhancers. Investigation of the commonly polymorphic VNTRs in the context of population ancestry revealed that 1096 loci contained population-specific alleles and that those could be used to classify individuals into super-populations with near-perfect accuracy. Search for quantitative trait loci (eQTLs), among the VNTRs proximal to genes, indicated that in 187 genes expression differences correlated with VNTR genotype. We validated our predictions in several ways, including experimentally, through the identification of predicted alleles in long reads, and by comparisons showing consistency between sequencing platforms. This study is the most comprehensive analysis of minisatellite VNTRs in the human population to date.
Collapse
Affiliation(s)
| | - Yözen Hernández
- Graduate Program in Bioinformatics, Boston University, Boston, MA 02215, USA
| | | | - Juan I Fuxman Bass
- Graduate Program in Bioinformatics, Boston University, Boston, MA 02215, USA
- Department of Biology, Boston University, Boston, MA 02215, USA
| | - Gary Benson
- Graduate Program in Bioinformatics, Boston University, Boston, MA 02215, USA
- Department of Biology, Boston University, Boston, MA 02215, USA
- Department of Computer Science, Boston University, Boston, MA 02215, USA
| |
Collapse
|
24
|
Field MJ, Kumar R, Hackett A, Kayumi S, Shoubridge CA, Ewans LJ, Ivancevic AM, Dudding-Byth T, Carroll R, Kroes T, Gardner AE, Sullivan P, Ha TT, Schwartz CE, Cowley MJ, Dinger ME, Palmer EE, Christie L, Shaw M, Roscioli T, Gecz J, Corbett MA. Different types of disease-causing noncoding variants revealed by genomic and gene expression analyses in families with X-linked intellectual disability. Hum Mutat 2021; 42:835-847. [PMID: 33847015 DOI: 10.1002/humu.24207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2020] [Revised: 03/19/2021] [Accepted: 04/08/2021] [Indexed: 11/06/2022]
Abstract
The pioneering discovery research of X-linked intellectual disability (XLID) genes has benefitted thousands of individuals worldwide; however, approximately 30% of XLID families still remain unresolved. We postulated that noncoding variants that affect gene regulation or splicing may account for the lack of a genetic diagnosis in some cases. Detecting pathogenic, gene-regulatory variants with the same sensitivity and specificity as structural and coding variants is a major challenge for Mendelian disorders. Here, we describe three pedigrees with suggestive XLID where distinctive phenotypes associated with known genes guided the identification of three different noncoding variants. We used comprehensive structural, single-nucleotide, and repeat expansion analyses of genome sequencing. RNA-Seq from patient-derived cell lines, reverse-transcription polymerase chain reactions, Western blots, and reporter gene assays were used to confirm the functional effect of three fundamentally different classes of pathogenic noncoding variants: a retrotransposon insertion, a novel intronic splice donor, and a canonical splice variant of an untranslated exon. In one family, we excluded a rare coding variant in ARX, a known XLID gene, in favor of a regulatory noncoding variant in OFD1 that correlated with the clinical phenotype. Our results underscore the value of genomic research on unresolved XLID families to aid novel, pathogenic noncoding variant discovery.
Collapse
Affiliation(s)
- Michael J Field
- NSW Genetics of Learning Disability Service, Newcastle, New South Wales, Australia
| | - Raman Kumar
- Adelaide Medical School and Robinson Research Institute, University of Adelaide, Adelaide, South Australia, Australia
| | - Anna Hackett
- NSW Genetics of Learning Disability Service, Newcastle, New South Wales, Australia.,School of Biomedical Sciences and Pharmacy, University of Newcastle, Newcastle, New South Wales, Australia
| | - Sayaka Kayumi
- Adelaide Medical School and Robinson Research Institute, University of Adelaide, Adelaide, South Australia, Australia
| | - Cheryl A Shoubridge
- Adelaide Medical School and Robinson Research Institute, University of Adelaide, Adelaide, South Australia, Australia
| | - Lisa J Ewans
- St Vincent's Clinical School, University of New South Wales, Darlinghurst, Australia.,Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, New South Wales, Australia
| | - Atma M Ivancevic
- Department of Molecular, Cellular and Developmental Biology, University of Colorado, Boulder, Colorado, USA
| | - Tracy Dudding-Byth
- NSW Genetics of Learning Disability Service, Newcastle, New South Wales, Australia.,School of Biomedical Sciences and Pharmacy, University of Newcastle, Newcastle, New South Wales, Australia
| | - Renée Carroll
- Adelaide Medical School and Robinson Research Institute, University of Adelaide, Adelaide, South Australia, Australia
| | - Thessa Kroes
- Adelaide Medical School and Robinson Research Institute, University of Adelaide, Adelaide, South Australia, Australia
| | - Alison E Gardner
- Adelaide Medical School and Robinson Research Institute, University of Adelaide, Adelaide, South Australia, Australia
| | - Patricia Sullivan
- Children's Cancer Institute, University of New South Wales, Kensington, New South Wales, Australia
| | - Thuong T Ha
- Molecular Pathology Department, Centre for Cancer Biology, SA Pathology, Adelaide, South Australia, Australia
| | | | - Mark J Cowley
- NSW Genetics of Learning Disability Service, Newcastle, New South Wales, Australia.,Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, New South Wales, Australia.,Children's Cancer Institute, University of New South Wales, Kensington, New South Wales, Australia
| | - Marcel E Dinger
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Kensington, New South Wales, Australia
| | - Elizabeth E Palmer
- NSW Genetics of Learning Disability Service, Newcastle, New South Wales, Australia.,School of Women's and Children's Health, University of New South Wales, Kensington, Sydney, New South Wales, Australia
| | - Louise Christie
- NSW Genetics of Learning Disability Service, Newcastle, New South Wales, Australia
| | - Marie Shaw
- Adelaide Medical School and Robinson Research Institute, University of Adelaide, Adelaide, South Australia, Australia
| | - Tony Roscioli
- NeuRA, University of New South Wales, Sydney, New South Wales, Australia.,Centre for Clinical Genetics, Sydney Children's Hospital, Randwick, Sydney, New South Wales, Australia
| | - Jozef Gecz
- Adelaide Medical School and Robinson Research Institute, University of Adelaide, Adelaide, South Australia, Australia.,South Australian Health and Medical Research Institute, Adelaide, South Australia, Australia
| | - Mark A Corbett
- Adelaide Medical School and Robinson Research Institute, University of Adelaide, Adelaide, South Australia, Australia
| |
Collapse
|
25
|
Peñaloza C, Gutierrez AP, Eöry L, Wang S, Guo X, Archibald AL, Bean TP, Houston RD. A chromosome-level genome assembly for the Pacific oyster Crassostrea gigas. Gigascience 2021; 10:6187865. [PMID: 33764468 PMCID: PMC7992393 DOI: 10.1093/gigascience/giab020] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Revised: 02/10/2021] [Accepted: 03/03/2021] [Indexed: 02/06/2023] Open
Abstract
Background The Pacific oyster (Crassostrea gigas) is a bivalve mollusc with vital roles in coastal ecosystems and aquaculture globally. While extensive genomic tools are available for C. gigas, highly contiguous reference genomes are required to support both fundamental and applied research. Herein we report the creation and annotation of a chromosome-level assembly for C. gigas. Findings High-coverage long- and short-read sequence data generated on Pacific Biosciences and Illumina platforms were used to generate an initial assembly, which was then scaffolded into 10 pseudo-chromosomes using both Hi-C sequencing and a high-density linkage map. The assembly has a scaffold N50 of 58.4 Mb and a contig N50 of 1.8 Mb, representing a step advance on the previously published C. gigas assembly. Annotation based on Pacific Biosciences Iso-Seq and Illumina RNA-Seq resulted in identification of ∼30,000 putative protein-coding genes. Annotation of putative repeat elements highlighted an enrichment of Helitron rolling-circle transposable elements, suggesting their potential role in shaping the evolution of the C. gigas genome. Conclusions This new chromosome-level assembly will be an enabling resource for genetics and genomics studies to support fundamental insight into bivalve biology, as well as for selective breeding of C. gigas in aquaculture.
Collapse
Affiliation(s)
- Carolina Peñaloza
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK
| | - Alejandro P Gutierrez
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK
| | - Lél Eöry
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK
| | - Shan Wang
- Haskin Shellfish Research Laboratory, Department of Marine and Coastal Sciences, Rutgers University, 6959 Miller Avenue, Port Norris, NJ 08349, USA
| | - Ximing Guo
- Haskin Shellfish Research Laboratory, Department of Marine and Coastal Sciences, Rutgers University, 6959 Miller Avenue, Port Norris, NJ 08349, USA
| | - Alan L Archibald
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK
| | - Tim P Bean
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK
| | - Ross D Houston
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK
| |
Collapse
|
26
|
Ma Q, Srivastav SP, Gamez S, Dayama G, Feitosa-Suntheimer F, Patterson EI, Johnson RM, Matson EM, Gold AS, Brackney DE, Connor JH, Colpitts TM, Hughes GL, Rasgon JL, Nolan T, Akbari OS, Lau NC. A mosquito small RNA genomics resource reveals dynamic evolution and host responses to viruses and transposons. Genome Res 2021; 31:512-528. [PMID: 33419731 PMCID: PMC7919454 DOI: 10.1101/gr.265157.120] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2020] [Accepted: 01/06/2021] [Indexed: 12/14/2022]
Abstract
Although mosquitoes are major transmission vectors for pathogenic arboviruses, viral infection has little impact on mosquito health. This immunity is caused in part by mosquito RNA interference (RNAi) pathways that generate antiviral small interfering RNAs (siRNAs) and Piwi-interacting RNAs (piRNAs). RNAi also maintains genome integrity by potently repressing mosquito transposon activity in the germline and soma. However, viral and transposon small RNA regulatory pathways have not been systematically examined together in mosquitoes. Therefore, we developed an integrated mosquito small RNA genomics (MSRG) resource that analyzes the transposon and virus small RNA profiles in mosquito cell cultures and somatic and gonadal tissues across four medically important mosquito species. Our resource captures both somatic and gonadal small RNA expression profiles within mosquito cell cultures, and we report the evolutionary dynamics of a novel Mosquito-Conserved piRNA Cluster Locus (MCpiRCL) made up of satellite DNA repeats. In the larger culicine mosquito genomes we detected highly regular periodicity in piRNA biogenesis patterns coinciding with the expansion of Piwi pathway genes. Finally, our resource enables detection of cross talk between piRNA and siRNA populations in mosquito cells during a response to virus infection. The MSRG resource will aid efforts to dissect and combat the capacity of mosquitoes to tolerate and spread arboviruses.
Collapse
Affiliation(s)
- Qicheng Ma
- Department of Biochemistry, Boston University School of Medicine, Boston, Massachusetts 02118, USA
| | - Satyam P Srivastav
- Department of Biochemistry, Boston University School of Medicine, Boston, Massachusetts 02118, USA
| | - Stephanie Gamez
- Division of Biological Sciences, Section of Cell and Developmental Biology, University of California San Diego, La Jolla, California 92093, USA
| | - Gargi Dayama
- Department of Biochemistry, Boston University School of Medicine, Boston, Massachusetts 02118, USA
| | - Fabiana Feitosa-Suntheimer
- Department of Microbiology and the National Emerging Infectious Disease Laboratory, Boston University School of Medicine, Boston, Massachusetts 02118, USA
| | - Edward I Patterson
- Departments of Vector Biology and Tropical Disease Biology, Centre for Neglected Tropical Diseases, Liverpool School of Tropical Medicine, Liverpool L3 5QA, United Kingdom
| | - Rebecca M Johnson
- Department of Entomology, Center for Infectious Disease Dynamics, and the Huck Institutes for the Life Sciences, Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Erik M Matson
- Department of Biochemistry, Boston University School of Medicine, Boston, Massachusetts 02118, USA
| | - Alexander S Gold
- Department of Microbiology and the National Emerging Infectious Disease Laboratory, Boston University School of Medicine, Boston, Massachusetts 02118, USA
| | - Douglas E Brackney
- Department of Environmental Sciences, The Connecticut Agricultural Experiment Station, New Haven, Connecticut 06511, USA
| | - John H Connor
- Department of Microbiology and the National Emerging Infectious Disease Laboratory, Boston University School of Medicine, Boston, Massachusetts 02118, USA
| | - Tonya M Colpitts
- Department of Microbiology and the National Emerging Infectious Disease Laboratory, Boston University School of Medicine, Boston, Massachusetts 02118, USA
| | - Grant L Hughes
- Departments of Vector Biology and Tropical Disease Biology, Centre for Neglected Tropical Diseases, Liverpool School of Tropical Medicine, Liverpool L3 5QA, United Kingdom
| | - Jason L Rasgon
- Department of Entomology, Center for Infectious Disease Dynamics, and the Huck Institutes for the Life Sciences, Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Tony Nolan
- Departments of Vector Biology and Tropical Disease Biology, Centre for Neglected Tropical Diseases, Liverpool School of Tropical Medicine, Liverpool L3 5QA, United Kingdom
| | - Omar S Akbari
- Division of Biological Sciences, Section of Cell and Developmental Biology, University of California San Diego, La Jolla, California 92093, USA
| | - Nelson C Lau
- Department of Biochemistry, Boston University School of Medicine, Boston, Massachusetts 02118, USA
- Boston University Genome Science Institute and the National Emerging Infectious Disease Laboratory, Boston, Massachusetts 02118, USA
| |
Collapse
|
27
|
Ahmad SF, Singchat W, Jehangir M, Suntronpong A, Panthum T, Malaivijitnond S, Srikulnath K. Dark Matter of Primate Genomes: Satellite DNA Repeats and Their Evolutionary Dynamics. Cells 2020; 9:E2714. [PMID: 33352976 PMCID: PMC7767330 DOI: 10.3390/cells9122714] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 12/15/2020] [Accepted: 12/16/2020] [Indexed: 12/12/2022] Open
Abstract
A substantial portion of the primate genome is composed of non-coding regions, so-called "dark matter", which includes an abundance of tandemly repeated sequences called satellite DNA. Collectively known as the satellitome, this genomic component offers exciting evolutionary insights into aspects of primate genome biology that raise new questions and challenge existing paradigms. A complete human reference genome was recently reported with telomere-to-telomere human X chromosome assembly that resolved hundreds of dark regions, encompassing a 3.1 Mb centromeric satellite array that had not been identified previously. With the recent exponential increase in the availability of primate genomes, and the development of modern genomic and bioinformatics tools, extensive growth in our knowledge concerning the structure, function, and evolution of satellite elements is expected. The current state of knowledge on this topic is summarized, highlighting various types of primate-specific satellite repeats to compare their proportions across diverse lineages. Inter- and intraspecific variation of satellite repeats in the primate genome are reviewed. The functional significance of these sequences is discussed by describing how the transcriptional activity of satellite repeats can affect gene expression during different cellular processes. Sex-linked satellites are outlined, together with their respective genomic organization. Mechanisms are proposed whereby satellite repeats might have emerged as novel sequences during different evolutionary phases. Finally, the main challenges that hinder the detection of satellite DNA are outlined and an overview of the latest methodologies to address technological limitations is presented.
Collapse
Affiliation(s)
- Syed Farhan Ahmad
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Worapong Singchat
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Maryam Jehangir
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Department of Structural and Functional Biology, Institute of Bioscience at Botucatu, São Paulo State University (UNESP), Botucatu, São Paulo 18618-689, Brazil
| | - Aorarat Suntronpong
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Thitipong Panthum
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Suchinda Malaivijitnond
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Department of Biology, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand
| | - Kornsorn Srikulnath
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Center of Excellence on Agricultural Biotechnology (AG-BIO/PERDO-CHE), Bangkok 10900, Thailand
- Omics Center for Agriculture, Bioresources, Food and Health, Kasetsart University (OmiKU), Bangkok 10900, Thailand
| |
Collapse
|
28
|
Genetic Diversity of the Noncoding Control Region of the Novel Human Polyomaviruses. Viruses 2020; 12:v12121406. [PMID: 33297530 PMCID: PMC7762344 DOI: 10.3390/v12121406] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2020] [Accepted: 12/02/2020] [Indexed: 02/07/2023] Open
Abstract
The genomes of polyomaviruses are characterized by their tripartite organization with an early region, a late region and a noncoding control region (NCCR). The early region encodes proteins involved in replication and transcription of the viral genome, while expression of the late region generates the capsid proteins. Transcription regulatory sequences for expression of the early and late genes, as well as the origin of replication are encompassed in the NCCR. Cell tropism of polyomaviruses not only depends on the appropriate receptors on the host cell, but cell-specific expression of the viral genes is also governed by the NCCR. Thus far, 15 polyomaviruses have been isolated from humans, though it remains to be established whether all of them are genuine human polyomaviruses (HPyVs). The sequences of the NCCR of these HPyVs show high genetic variability and have been best studied in the human polyomaviruses BK and JC. Rearranged NCCRs in BKPyV and JCPyV, the first HPyVs to be discovered approximately 30 years ago, have been associated with the pathogenic properties of these viruses in nephropathy and progressive multifocal leukoencephalopathy, respectively. Since 2007, thirteen novel PyVs have been isolated from humans: KIPyV, WUPyV, MCPyV, HPyV6, HPyV7, TSPyV, HPyV9, HPyV10, STLPyV, HPyV12, NJPyV, LIPyV and QPyV. This review describes all NCCR variants of the new HPyVs that have been reported in the literature and discusses the possible consequences of NCCR diversity in terms of promoter strength, putative transcription factor binding sites and possible association with diseases.
Collapse
|
29
|
Yu Z, Wang E, Geng Y, Wang K, Chen D, Huang X, Ouyang P, Zuo Z, Huang C, Fang J, Yin L, Guo H, Zhong Z. Complete genome analysis of Vibrio mimicus strain SCCF01, a highly virulent isolate from the freshwater catfish. Virulence 2020; 11:23-31. [PMID: 31826705 PMCID: PMC6961728 DOI: 10.1080/21505594.2019.1702797] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Revised: 10/14/2019] [Accepted: 10/14/2019] [Indexed: 12/02/2022] Open
Abstract
Vibrio mimicus is a foodborne pathogen, which is widely distributed in the aquatic environment. Moreover, it is often involved in aquatic animal diseases. In recent years, V. mimicus is an emerging pathogen in some species of Siluriformes. The strain SCCF01 was isolated from yellow catfish (Pelteobagrus fulvidraco). In this study, we aimed to perform genomic analysis of V. mimicus strain SCCF01 to identify genetic features and evolutionary relationships. Information on gene function and classification was obtained by functional annotation, and circular graph of strain SCCF01 genome, which was created by Circos v0.64. Information on virulence genes (adhesion, flagellum system, exotoxin, and secretory system, etc.) was obtained by virulence genes annotation. Genome element prediction showed that most of the mobile elements were distributed in chromosome I. Therefore, chromosome I of SCCF01 genome has more plasticity than chromosome II and might be larger in size. Genomic linear relationship between the strain of V. mimicus and strain SCCF01 was analyzed by linear pairwise comparison but was unable to determine the relationship. Gene family analysis predicted that the evolutionary direction of strain SCCF01 was: clinical strain → environmental strain → SCCF01 strain. Phylogenetic analysis showed that the strain SCCF01 was more closely related to environmental strains. According to gene family analysis and phylogenetic analysis, we speculated that strain SCCF01 has probably diverged from environmental strains.
Collapse
Affiliation(s)
- Zehui Yu
- College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, P. R. China
- Laboratory Animal Center, Southwest Medical University, LuZhou, Sichuan, P. R. China
| | - Erlong Wang
- College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, P. R. China
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi, P.R. China
| | - Yi Geng
- College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, P. R. China
| | - Kaiyu Wang
- College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, P. R. China
| | - Defang Chen
- College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan, P. R. China
| | - Xiaoli Huang
- College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, Sichuan, P. R. China
| | - Ping Ouyang
- College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, P. R. China
| | - Zhicai Zuo
- College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, P. R. China
| | - Chao Huang
- College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, P. R. China
| | - Jing Fang
- College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, P. R. China
| | - Lizi Yin
- College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, P. R. China
| | - Hongrui Guo
- College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, P. R. China
| | - Zhijun Zhong
- College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, P. R. China
| |
Collapse
|
30
|
Satellite DNA-like repeats are dispersed throughout the genome of the Pacific oyster Crassostrea gigas carried by Helentron non-autonomous mobile elements. Sci Rep 2020; 10:15107. [PMID: 32934255 PMCID: PMC7492417 DOI: 10.1038/s41598-020-71886-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Accepted: 08/11/2020] [Indexed: 01/31/2023] Open
Abstract
Satellite DNAs (satDNAs) are long arrays of tandem repeats typically located in heterochromatin and span the centromeres of eukaryotic chromosomes. Despite the wealth of knowledge about satDNAs, little is known about a fraction of short, satDNA-like arrays dispersed throughout the genome. Our survey of the Pacific oyster Crassostrea gigas sequenced genome revealed genome assembly replete with satDNA-like tandem repeats. We focused on the most abundant arrays, grouped according to sequence similarity into 13 clusters, and explored their flanking sequences. Structural analysis showed that arrays of all 13 clusters represent central repeats of 11 non-autonomous elements named Cg_HINE, which are classified into the Helentron superfamily of DNA transposons. Each of the described elements is formed by a unique combination of flanking sequences and satDNA-like central repeats, coming from one, exceptionally two clusters in a consecutive order. While some of the detected Cg_HINE elements are related according to sequence similarities in flanking and repetitive modules, others evidently arose in independent events. In addition, some of the Cg_HINE's central repeats are related to the classical C. gigas satDNA, interconnecting mobile elements and satDNAs. Genome-wide distribution of Cg_HINE implies non-autonomous Helentrons as a dynamic system prone to efficiently propagate tandem repeats in the C. gigas genome.
Collapse
|
31
|
Kim M, Cha IT, Lee KE, Lee EY, Park SJ. Genomics Reveals the Metabolic Potential and Functions in the Redistribution of Dissolved Organic Matter in Marine Environments of the Genus Thalassotalea. Microorganisms 2020; 8:microorganisms8091412. [PMID: 32937826 PMCID: PMC7564069 DOI: 10.3390/microorganisms8091412] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Accepted: 09/11/2020] [Indexed: 11/16/2022] Open
Abstract
Members of the bacterial genus Thalassotalea have been isolated recently from various marine environments, including marine invertebrates. A metagenomic study of the Deepwater Horizon oil plume has identified genes involved in aromatic hydrocarbon degradation in the Thalassotalea genome, shedding light on its potential role in the degradation of crude oils. However, the genomic traits of the genus are not well-characterized, despite the ability of the species to degrade complex natural compounds, such as agar, gelatin, chitin, or starch. Here, we obtained a complete genome of a new member of the genus, designated PS06, isolated from marine sediments containing dead marine benthic macroalgae. Unexpectedly, strain PS06 was unable to grow using most carbohydrates as sole carbon sources, which is consistent with the finding of few ABC transporters in the PS06 genome. A comparative analysis of 12 Thalassotalea genomes provided insights into their metabolic potential (e.g., microaerobic respiration and carbohydrate utilization) and evolutionary stability [including a low abundance of clustered regularly interspaced short palindromic repeats (CRISPR) loci and prophages]. The diversity and frequency of genes encoding extracellular enzymes for carbohydrate metabolism in the 12 genomes suggest that members of Thalassotalea contribute to nutrient cycling by the redistribution of dissolved organic matter in marine environments. Our study improves our understanding of the ecological and genomic properties of the genus Thalassotalea.
Collapse
Affiliation(s)
- Minji Kim
- Department of Biology, Jeju National University, 102 Jejudaehak-ro, Jeju 63243, Korea;
| | - In-Tae Cha
- Microorganism Resources Division, National Institute of Biological Resources, Incheon 22689, Korea; (I.-T.C.); (K.-E.L.)
| | - Ki-Eun Lee
- Microorganism Resources Division, National Institute of Biological Resources, Incheon 22689, Korea; (I.-T.C.); (K.-E.L.)
| | - Eun-Young Lee
- Exhibition & Education Division, National Institute of Biological Resources, Incheon 22689, Korea;
| | - Soo-Je Park
- Department of Biology, Jeju National University, 102 Jejudaehak-ro, Jeju 63243, Korea;
- Correspondence: ; Tel.: +82-64-753-3524; Fax: +82-64-756-3541
| |
Collapse
|
32
|
Molecular detection and genetic characteristics of Babesia gibsoni in dogs in Shaanxi Province, China. Parasit Vectors 2020; 13:366. [PMID: 32698848 PMCID: PMC7376908 DOI: 10.1186/s13071-020-04232-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2020] [Accepted: 07/15/2020] [Indexed: 11/10/2022] Open
Abstract
Background Several members of genus Babesia are important pathogens causing babesiosis in dogs. In China, at least five Babesia species have been described in dogs or ticks. This study sought to determine the prevalence and molecular characteristics of various Babesia spp. in dogs in cities in Shaanxi Province in China, including Xi’an and Hanzhong. Methods A total of 371 blood samples were collected from pet dogs presenting to veterinary clinics in the cities of Xi’an and Hanzhong in Shaanxi, China. Babesia spp. DNA was detected via amplification of partial 18S rRNA genes by semi-nested PCR. Almost full-length 18S rRNA, ITS, partial TRAP and complete cytb genes were recovered for analysis of the genetic characteristics and relationships with known isolates. Results A single species, Babesia gibsoni, was identified in dogs in Xi’an and Hanzhong. Consistently, B. gibsoni was also detected in 14 ticks collected from positive dogs. Sequence similarities and phylogenetic analysis suggested that the isolates identified herein showed a closer genetic relationship with isolates from East Asian countries rather than India, Bangladesh, or the USA. Sequence analysis based on tandem repeat analysis of the TRAP gene further revealed that specific haplotypes were circulating in both Xi’an and Hanzhong, with no specific regionality. In addition, 10.9% of all isolates with atovaquone (ATV)-resistance were identified because of M121I mutation in the deduced cytb protein. Conclusions This study revealed a high prevalence rate of Babesia infection. Babesia gibsoni was the only Babesia species identified in cases of canine babesiosis in the cities of Xi’an and Hanzhong cities in Shaanxi, China. In addition, the TRAP gene presented high genetic diversity across isolates. Such information is useful for elucidating the epidemiological characteristics of canine babesiosis, as well as the overall genetic diversity of Babesia spp. circulating in dog populations in Shaanxi Province.![]()
Collapse
|
33
|
Avvaru AK, Sharma D, Verma A, Mishra RK, Sowpati DT. MSDB: a comprehensive, annotated database of microsatellites. Nucleic Acids Res 2020; 48:D155-D159. [PMID: 31599331 PMCID: PMC6943038 DOI: 10.1093/nar/gkz886] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2019] [Revised: 09/28/2019] [Accepted: 10/01/2019] [Indexed: 11/18/2022] Open
Abstract
Microsatellites are short tandem repeats of 1–6 nucleotide motifs, studied for their utility as genome markers and in forensics. Recent evidence points to the role of microsatellites in important regulatory functions, and their length polymorphisms at coding regions are linked to various neurodegenerative disorders in humans. Microsatellites show a taxon-specific enrichment in eukaryotic genomes, and their evolution remains poorly understood. Though other databases of microsatellites exist, they fall short on several fronts. MSDB (MicroSatellite DataBase) is a collection of >4 billion microsatellites from 37 680 genomes presented in a user-friendly web portal for easy, interactive analysis and visualization. This is by far the most comprehensive, annotated, updated database to access and analyze microsatellite data of multiple species. The features of MSDB enable users to explore the data as tables that can be filtered and exported, and also as interactive charts to view and compare the data of multiple species simultaneously. Its modularity and architecture permit seamless updates with new data, making it a powerful tool and useful resource to researchers working on this important class of DNA elements, particularly in context of their evolution and emerging roles in genome organization and gene regulation.
Collapse
Affiliation(s)
- Akshay Kumar Avvaru
- CSIR-Centre for Cellular and Molecular Biology, Hyderabad - 500007, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad - 201002, India
| | - Deepak Sharma
- CSIR-Centre for Cellular and Molecular Biology, Hyderabad - 500007, India
| | - Archana Verma
- CSIR-Centre for Cellular and Molecular Biology, Hyderabad - 500007, India
| | - Rakesh K Mishra
- CSIR-Centre for Cellular and Molecular Biology, Hyderabad - 500007, India
| | - Divya Tej Sowpati
- CSIR-Centre for Cellular and Molecular Biology, Hyderabad - 500007, India
| |
Collapse
|
34
|
Phylogenetic implications of mitogenome rearrangements in East Asian potamiscine freshwater crabs (Brachyura: Potamidae). Mol Phylogenet Evol 2020; 143:106669. [DOI: 10.1016/j.ympev.2019.106669] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2019] [Revised: 10/31/2019] [Accepted: 10/31/2019] [Indexed: 11/18/2022]
|
35
|
Repetti SI, Jackson CJ, Judd LM, Wick RR, Holt KE, Verbruggen H. The inflated mitochondrial genomes of siphonous green algae reflect processes driving expansion of noncoding DNA and proliferation of introns. PeerJ 2020; 8:e8273. [PMID: 31915577 PMCID: PMC6944098 DOI: 10.7717/peerj.8273] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2019] [Accepted: 11/22/2019] [Indexed: 02/06/2023] Open
Abstract
Within the siphonous green algal order Bryopsidales, the size and gene arrangement of chloroplast genomes has been examined extensively, while mitochondrial genomes have been mostly overlooked. The recently published mitochondrial genome of Caulerpa lentillifera is large with expanded noncoding DNA, but it remains unclear if this is characteristic of the entire order. Our study aims to evaluate the evolutionary forces shaping organelle genome dynamics in the Bryopsidales based on the C. lentillifera and Ostreobium quekettii mitochondrial genomes. In this study, the mitochondrial genome of O. quekettii was characterised using a combination of long and short read sequencing, and bioinformatic tools for annotation and sequence analyses. We compared the mitochondrial and chloroplast genomes of O. quekettii and C. lentillifera to examine hypotheses related to genome evolution. The O. quekettii mitochondrial genome is the largest green algal mitochondrial genome sequenced (241,739 bp), considerably larger than its chloroplast genome. As with the mtDNA of C. lentillifera, most of this excess size is from the expansion of intergenic DNA and proliferation of introns. Inflated mitochondrial genomes in the Bryopsidales suggest effective population size, recombination and/or mutation rate, influenced by nuclear-encoded proteins, differ between the genomes of mitochondria and chloroplasts, reducing the strength of selection to influence evolution of their mitochondrial genomes.
Collapse
Affiliation(s)
- Sonja I Repetti
- School of BioSciences, University of Melbourne, Melbourne, VIC, Australia
| | | | - Louise M Judd
- Department of Infectious Diseases, Monash University, Melbourne, VIC, Australia
| | - Ryan R Wick
- Department of Infectious Diseases, Monash University, Melbourne, VIC, Australia
| | - Kathryn E Holt
- Department of Infectious Diseases, Monash University, Melbourne, VIC, Australia
| | - Heroen Verbruggen
- School of BioSciences, University of Melbourne, Melbourne, VIC, Australia
| |
Collapse
|
36
|
Tørresen OK, Star B, Mier P, Andrade-Navarro MA, Bateman A, Jarnot P, Gruca A, Grynberg M, Kajava AV, Promponas VJ, Anisimova M, Jakobsen KS, Linke D. Tandem repeats lead to sequence assembly errors and impose multi-level challenges for genome and protein databases. Nucleic Acids Res 2019; 47:10994-11006. [PMID: 31584084 PMCID: PMC6868369 DOI: 10.1093/nar/gkz841] [Citation(s) in RCA: 155] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2019] [Revised: 09/03/2019] [Accepted: 10/01/2019] [Indexed: 12/13/2022] Open
Abstract
The widespread occurrence of repetitive stretches of DNA in genomes of organisms across the tree of life imposes fundamental challenges for sequencing, genome assembly, and automated annotation of genes and proteins. This multi-level problem can lead to errors in genome and protein databases that are often not recognized or acknowledged. As a consequence, end users working with sequences with repetitive regions are faced with 'ready-to-use' deposited data whose trustworthiness is difficult to determine, let alone to quantify. Here, we provide a review of the problems associated with tandem repeat sequences that originate from different stages during the sequencing-assembly-annotation-deposition workflow, and that may proliferate in public database repositories affecting all downstream analyses. As a case study, we provide examples of the Atlantic cod genome, whose sequencing and assembly were hindered by a particularly high prevalence of tandem repeats. We complement this case study with examples from other species, where mis-annotations and sequencing errors have propagated into protein databases. With this review, we aim to raise the awareness level within the community of database users, and alert scientists working in the underlying workflow of database creation that the data they omit or improperly assemble may well contain important biological information valuable to others.
Collapse
Affiliation(s)
- Ole K Tørresen
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, NO-0316 Oslo, Norway
| | - Bastiaan Star
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, NO-0316 Oslo, Norway
| | - Pablo Mier
- Faculty of Biology, Johannes Gutenberg University Mainz, Hans-Dieter-Husch-Weg 15, 55128 Mainz, Germany
| | - Miguel A Andrade-Navarro
- Faculty of Biology, Johannes Gutenberg University Mainz, Hans-Dieter-Husch-Weg 15, 55128 Mainz, Germany
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton. CB10 1SD, UK
| | - Patryk Jarnot
- Institute of Informatics, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland
| | - Aleksandra Gruca
- Institute of Informatics, Silesian University of Technology, Akademicka 16, 44-100 Gliwice, Poland
| | - Marcin Grynberg
- Institute of Biochemistry and Biophysics PAS, Pawińskiego 5A, 02-106 Warsaw, Poland
| | - Andrey V Kajava
- Centre de Recherche en Biologie cellulaire de Montpellier, UMR 5237 CNRS, Universite Montpellier 1919 Route de Mende, CEDEX 5, 34293 Montpellier, France
- Institut de Biologie Computationnelle, 34095 Montpellier, France
| | - Vasilis J Promponas
- Bioinformatics Research Laboratory, Department of Biological Sciences, University of Cyprus, PO Box 20537, CY 1678 Nicosia, Cyprus
| | - Maria Anisimova
- Institute of Applied Simulations, School of Life Sciences and Facility Management, Zurich University of Applied Sciences (ZHAW), Wädenswil, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Kjetill S Jakobsen
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, NO-0316 Oslo, Norway
| | - Dirk Linke
- Section for Genetics and Evolutionary Biology, Department of Biosciences, University of Oslo, NO-0316 Oslo, Norway
| |
Collapse
|
37
|
K-mer-Based Motif Analysis in Insect Species across Anopheles, Drosophila, and Glossina Genera and Its Application to Species Classification. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2019; 2019:4259479. [PMID: 31827584 PMCID: PMC6881769 DOI: 10.1155/2019/4259479] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/21/2019] [Revised: 09/18/2019] [Accepted: 09/28/2019] [Indexed: 11/17/2022]
Abstract
Short k-mer sequences from DNA are both conserved and diverged across species owing to their functional significance in speciation, which enables their use in many species classification algorithms. In the present study, we developed a methodology to analyze the DNA k-mers of whole genome, 5' UTR, intron, and 3' UTR regions from 58 insect species belonging to three genera of Diptera that include Anopheles, Drosophila, and Glossina. We developed an improved algorithm to predict and score k-mers based on a scheme that normalizes k-mer scores in different genomic subregions. This algorithm takes advantage of the information content of the whole genome as opposed to other algorithms or studies that analyze only a small group of genes. Our algorithm uses k-mers of lengths 7-9 bp for the whole genome, 5' and 3' UTR regions as well as the intronic regions. Taxonomical relationships based on the whole-genome k-mer signatures showed that species of the three genera clustered together quite visibly. We also improved the scoring and filtering of these k-mers for accurate species identification. The whole-genome k-mer content correlation algorithm showed that species within a single genus correlated tightly with each other as compared to other genera. The genomes of two Aedes and one Culex species were also analyzed to demonstrate how newly sequenced species can be classified using the algorithm. Furthermore, working with several dozen species has enabled us to assign a whole-genome k-mer signature for each of the 58 Dipteran species by making all-to-all pairwise comparison of the k-mer content. These signatures were used to compare the similarity between species and to identify clusters of species displaying similar signatures.
Collapse
|
38
|
Beh CW, Zhang Y, Zheng YL, Sun B, Wang TH. Fluorescence spectroscopic detection and measurement of single telomere molecules. Nucleic Acids Res 2019; 46:e117. [PMID: 30010842 PMCID: PMC6212783 DOI: 10.1093/nar/gky627] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2017] [Accepted: 06/28/2018] [Indexed: 01/26/2023] Open
Abstract
Telomeres are the end-caps of chromosomes that serve to protect the integrity of the genome. Below certain critical lengths, the telomeres can no longer fulfill their protective function, and chromosomal instability ensues. Telomeres shorten during normal cell division due to the end replication problem and are implicated in the development of various aging-associated diseases, including cancer. Telomere length has the potential to serve as a useful biomarker in the field of aging and cancer. However, existing methods of telomere measurement are either too laborious, unable to provide absolute measurement of individual telomere lengths, or limited to certain chromosomes or cell types. Here, we describe an easy single-molecule, fluorescence spectroscopic method for measuring the length of telomeres that permits the profiling of absolute telomere lengths in any DNA sample. We have demonstrated the accurate detection of telomeres as short as 100 bp using cloned telomere standards, and have profiled telomere lengths in human cancer cell lines and primary cells. Since this method allows direct comparison between samples, it could greatly improve the clinical utility of telomere biomarkers.
Collapse
Affiliation(s)
- Cyrus W Beh
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Ye Zhang
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Yun-Ling Zheng
- Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University Medical Center, Washington, DC 20057, USA
| | - Bing Sun
- Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University Medical Center, Washington, DC 20057, USA
| | - Tza-Huei Wang
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA.,Department of Mechanical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA.,Institute for NanoBioTechnology, Johns Hopkins University, Baltimore, MD 21218, USA
| |
Collapse
|
39
|
RB1 Deletion in Retinoblastoma Protein Pathway-Disrupted Cells Results in DNA Damage and Cancer Progression. Mol Cell Biol 2019; 39:MCB.00105-19. [PMID: 31138663 DOI: 10.1128/mcb.00105-19] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2019] [Accepted: 05/20/2019] [Indexed: 12/12/2022] Open
Abstract
Proliferative control in cancer cells is frequently disrupted by mutations in the retinoblastoma protein (RB) pathway. Intriguingly, RB1 mutations can arise late in tumorigenesis in cancer cells whose RB pathway is already compromised by another mutation. In this study, we present evidence for increased DNA damage and instability in cancer cells with RB pathway defects when RB1 mutations are induced. We generated isogenic RB1 mutant genotypes with CRISPR/Cas9 in a number of cell lines. Cells with even one mutant copy of RB1 have increased basal levels of DNA damage and increased mitotic errors. Elevated levels of reactive oxygen species as well as impaired homologous recombination repair underlie this DNA damage. When xenografted into immunocompromised mice, RB1 mutant cells exhibit an elevated propensity to seed new tumors in recipient lungs. This study offers evidence that late-arising RB1 mutations can facilitate genome instability and cancer progression that are beyond the preexisting proliferative control deficit.
Collapse
|
40
|
Farnoud F, Schwartz M, Bruck J. Estimation of duplication history under a stochastic model for tandem repeats. BMC Bioinformatics 2019; 20:64. [PMID: 30727948 PMCID: PMC6364452 DOI: 10.1186/s12859-019-2603-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2018] [Accepted: 01/03/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Tandem repeat sequences are common in the genomes of many organisms and are known to cause important phenomena such as gene silencing and rapid morphological changes. Due to the presence of multiple copies of the same pattern in tandem repeats and their high variability, they contain a wealth of information about the mutations that have led to their formation. The ability to extract this information can enhance our understanding of evolutionary mechanisms. RESULTS We present a stochastic model for the formation of tandem repeats via tandem duplication and substitution mutations. Based on the analysis of this model, we develop a method for estimating the relative mutation rates of duplications and substitutions, as well as the total number of mutations, in the history of a tandem repeat sequence. We validate our estimation method via Monte Carlo simulation and show that it outperforms the state-of-the-art algorithm for discovering the duplication history. We also apply our method to tandem repeat sequences in the human genome, where it demonstrates the different behaviors of micro- and mini-satellites and can be used to compare mutation rates across chromosomes. It is observed that chromosomes that exhibit the highest mutation activity in tandem repeat regions are the same as those thought to have the highest overall mutation rates. However, unlike previous works that rely on comparing human and chimpanzee genomes to measure mutation rates, the proposed method allows us to find chromosomes with the highest mutation activity based on a single genome, in essence by comparing (approximate) copies of the pattern in tandem repeats. CONCLUSION The prevalence of tandem repeats in most organisms and the efficiency of the proposed method enable studying various aspects of the formation of tandem repeats and the surrounding sequences in a wide range of settings. AVAILABILITY The implementation of the estimation method is available at http://ips.lab.virginia.edu/smtr .
Collapse
Affiliation(s)
- Farzad Farnoud
- Department of Electrical and Computer Engineering, Department of Computer Science, University of Virginia, Charlottesville, USA
| | - Moshe Schwartz
- Department of Electrical and Computer Engineering, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Jehoshua Bruck
- Department of Electrical Engineering, California Institute of Technology, Pasadena, USA
| |
Collapse
|
41
|
Christmas MJ, Wallberg A, Bunikis I, Olsson A, Wallerman O, Webster MT. Chromosomal inversions associated with environmental adaptation in honeybees. Mol Ecol 2018; 28:1358-1374. [DOI: 10.1111/mec.14944] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2018] [Revised: 11/07/2018] [Accepted: 11/07/2018] [Indexed: 01/03/2023]
Affiliation(s)
- Matthew J. Christmas
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory Uppsala University Uppsala Sweden
| | - Andreas Wallberg
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory Uppsala University Uppsala Sweden
| | - Ignas Bunikis
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory Uppsala University Uppsala Sweden
| | - Anna Olsson
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory Uppsala University Uppsala Sweden
| | - Ola Wallerman
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory Uppsala University Uppsala Sweden
| | - Matthew T. Webster
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory Uppsala University Uppsala Sweden
| |
Collapse
|
42
|
Zhao X, Su L, Schaack S, Sadd BM, Sun C. Tandem Repeats Contribute to Coding Sequence Variation in Bumblebees (Hymenoptera: Apidae). Genome Biol Evol 2018; 10:3176-3187. [PMID: 30398620 PMCID: PMC6286909 DOI: 10.1093/gbe/evy244] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/03/2018] [Indexed: 01/02/2023] Open
Abstract
Tandem repeats (TRs) are highly dynamic regions of the genome. Mutations at these loci represent a significant source of genetic variation and can facilitate rapid adaptation. Bumblebees are important pollinating insects occupying a wide range of habitats. However, to date, molecular mechanisms underlying the potential adaptation of bumblebees to diverse habitats are largely unknown. In the present study, we investigate how TRs contribute to genetic variation in bumblebees, thus potentially facilitating adaptation. We identified 26,595 TRs from the assembled 18 chromosome sequences of the buff-tailed bumblebee (Bombus terrestris), 66.7% of which reside in genic regions. We also compared TRs found in B. terrestris with those present in the assembled genome sequence of a congener, B. impatiens. We found that a total of 1,137 TRs were variable in length between the two sequenced bumblebee species, and further analysis reveals that 101 of them are located within coding regions. These 101 TRs are responsible for coding sequence variation and correspond to protein sequence length variation between the two bumblebee species. The variability of identified TRs in coding regions between bumblebees was confirmed by PCR amplification of a subset of loci. Functional classification of bumblebee genes where coding sequences include variable-length TRs suggests that a majority of genes (87%) that could be assigned to a protein class are related to transcriptional regulation. Our results show that TRs contribute to coding sequence variation in bumblebees, and thus may facilitate the adaptation of bumblebees through diversifying proteins involved in controlling gene expression.
Collapse
Affiliation(s)
- Xiaomeng Zhao
- Key Laboratory of Pollinating Insect Biology of the Ministry of Agriculture, Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Long Su
- Key Laboratory of Pollinating Insect Biology of the Ministry of Agriculture, Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Sarah Schaack
- Department of Biology, Reed College, Portland, Oregon, USA
| | - Ben M Sadd
- School of Biological Sciences, Illinois State University, Normal, Illinois, USA
| | - Cheng Sun
- Key Laboratory of Pollinating Insect Biology of the Ministry of Agriculture, Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| |
Collapse
|
43
|
Lee J, Kim H, Lee JE, Shin SJ, Oh S, Kwon G, Kim H, Choi YY, White MA, Paik S, Cheong JH, Kim HS. Selective Cytotoxicity of the NAMPT Inhibitor FK866 Toward Gastric Cancer Cells With Markers of the Epithelial-Mesenchymal Transition, Due to Loss of NAPRT. Gastroenterology 2018; 155:799-814.e13. [PMID: 29775598 DOI: 10.1053/j.gastro.2018.05.024] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/23/2017] [Revised: 05/03/2018] [Accepted: 05/09/2018] [Indexed: 01/06/2023]
Abstract
BACKGROUND & AIMS Markers of the epithelial-to-mesenchymal transition (EMT) in gastric tumor tissues are associated with poor patient outcomes. We performed a screen to identify pharmacologic compounds that kill gastric cancer cells with EMT-associated gene expression patterns and investigate their mechanisms. METHODS We identified 29 gastric cancer cell lines with a gene expression signature previously associated with an EMT subtype, based on data from RNA sequence analyses, and confirmed the mesenchymal phenotypes of 7 lines (Hs746T, SNU1750, MKN1, SK4, SNU484, SNU668, and YCC11), based on invasive activity and protein markers. We screened 1,345 compounds for their ability to kill cells with the EMT signature compared with cell lines without this pattern. We tested the effects of identified compounds in BALB/c nude mice bearing GA077 tumors; mice were given intraperitoneal injections of the compound or vehicle (control) twice daily for 24 days and tumor growth was monitored. Proteins associated with the toxicity of the compounds were overexpressed in MKN1 and SNU484 cells or knocked down in MKN45 and SNU719 using small interfering RNAs. We performed immunohistochemical analyses of 942 gastric cancer tissues and investigated associations between EMT markers and protein expression patterns. RESULTS The nicotinamide phosphoribosyltransferase inhibitor FK866 killed 6 of 7 gastric cancer cell lines with EMT-associated gene expression signatures but not gastric cancer cells without this signature. The 6 EMT-subtype gastric cell lines expressed significantly low levels of nicotinic acid phosphoribosyltransferase (NAPRT), which makes the cells hypersensitive to nicotinamide phosphoribosyltransferase inhibition. Gastric cell lines that expressed higher levels of NAPRT, regardless of EMT markers, were sensitized to FK866 after knockdown of NAPRT, whereas overexpression of NAPRT in deficient EMT cell lines protected them from FK866-mediated toxicity. Administration of FK866 to nude mice with tumors grown from GA077 cells (human gastric cancer tumors of the EMT subtype) led to tumor regression in 2 weeks; FK866 did not affect tumors grown from MKN45 cells without the EMT expression signature. Loss of NAPRT might promote the EMT, because it stabilizes β-catenin. We correlated the EMT gene expression signature with lower levels of NAPRT in 942 gastric tumors from patients; we also found lower levels of NAPRT mRNA in colorectal, pancreatic, and lung adenocarcinoma tissues with the EMT gene expression signature. CONCLUSIONS FK866 selectively kills gastric cancer cells with an EMT gene expression signature by inhibiting nicotinamide phosphoribosyltransferase in cells with NAPRT deficiency. Loss of NAPRT expression, frequently through promoter hypermethylation, is observed in many gastric tumors of the EMT subtype. FK866 might be used to treat patients with tumors of this subtype.
Collapse
Affiliation(s)
- Jooyoung Lee
- Severance Biomedical Science Institute, Yonsei University College of Medicine, Seoul, Korea
| | - Hyosil Kim
- Severance Biomedical Science Institute, Yonsei University College of Medicine, Seoul, Korea
| | - Jae Eun Lee
- Severance Biomedical Science Institute, Yonsei University College of Medicine, Seoul, Korea; Department of Surgery, Yonsei University College of Medicine, Seoul, Korea
| | - Su-Jin Shin
- Department of Pathology, Hanyang University College of Medicine, Seoul, Korea
| | - Sejin Oh
- Severance Biomedical Science Institute, Yonsei University College of Medicine, Seoul, Korea; Brain Korea 21 Plus Project for Medical Science, Yonsei University College of Medicine, Seoul, Korea
| | - Gino Kwon
- Graduate Program for Nanomedical Science, Yonsei University, Seoul, Korea
| | - Hakhyun Kim
- Severance Biomedical Science Institute, Yonsei University College of Medicine, Seoul, Korea; Brain Korea 21 Plus Project for Medical Science, Yonsei University College of Medicine, Seoul, Korea
| | - Yoon Young Choi
- Severance Biomedical Science Institute, Yonsei University College of Medicine, Seoul, Korea; Department of Surgery, Yonsei University College of Medicine, Seoul, Korea
| | - Michael A White
- Department of Cell Biology, University of Texas Southwestern Medical Center, Dallas, Texas
| | - Soonmyung Paik
- Severance Biomedical Science Institute, Yonsei University College of Medicine, Seoul, Korea
| | - Jae-Ho Cheong
- Severance Biomedical Science Institute, Yonsei University College of Medicine, Seoul, Korea; Department of Surgery, Yonsei University College of Medicine, Seoul, Korea; Brain Korea 21 Plus Project for Medical Science, Yonsei University College of Medicine, Seoul, Korea.
| | - Hyun Seok Kim
- Severance Biomedical Science Institute, Yonsei University College of Medicine, Seoul, Korea; Brain Korea 21 Plus Project for Medical Science, Yonsei University College of Medicine, Seoul, Korea.
| |
Collapse
|
44
|
Ganesamoorthy D, Cao MD, Duarte T, Chen W, Coin L. GtTR: Bayesian estimation of absolute tandem repeat copy number using sequence capture and high throughput sequencing. BMC Bioinformatics 2018; 19:267. [PMID: 30012093 PMCID: PMC6048696 DOI: 10.1186/s12859-018-2282-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2018] [Accepted: 07/09/2018] [Indexed: 11/27/2022] Open
Abstract
BACKGROUND Tandem repeats comprise significant proportion of the human genome including coding and regulatory regions. They are highly prone to repeat number variation and nucleotide mutation due to their repetitive and unstable nature, making them a major source of genomic variation between individuals. Despite recent advances in high throughput sequencing, analysis of tandem repeats in the context of complex diseases is still hindered by technical limitations. We report a novel targeted sequencing approach, which allows simultaneous analysis of hundreds of repeats. We developed a Bayesian algorithm, namely - GtTR - which combines information from a reference long-read dataset with a short read counting approach to genotype tandem repeats at population scale. PCR sizing analysis was used for validation. RESULTS We used a PacBio long-read sequenced sample to generate a reference tandem repeat genotype dataset with on average 13% absolute deviation from PCR sizing results. Using this reference dataset GtTR generated estimates of VNTR copy number with accuracy within 95% high posterior density (HPD) intervals of 68 and 83% for capture sequence data and 200X WGS data respectively, improving to 87 and 94% with use of a PCR reference. We show that the genotype resolution increases as a function of depth, such that the median 95% HPD interval lies within 25, 14, 12 and 8% of the its midpoint copy number value for 30X, 200X WGS, 395X and 800X capture sequence data respectively. We validated nine targets by PCR sizing analysis and genotype estimates from sequencing results correlated well with PCR results. CONCLUSIONS The novel genotyping approach described here presents a new cost-effective method to explore previously unrecognized class of repeat variation in GWAS studies of complex diseases at the population level. Further improvements in accuracy can be obtained by improving accuracy of the reference dataset.
Collapse
Affiliation(s)
- Devika Ganesamoorthy
- Institute for Molecular Biosciences, University of Queensland, Brisbane, Australia
| | - Minh Duc Cao
- Institute for Molecular Biosciences, University of Queensland, Brisbane, Australia
| | - Tania Duarte
- Institute for Molecular Biosciences, University of Queensland, Brisbane, Australia
| | - Wenhan Chen
- Institute for Molecular Biosciences, University of Queensland, Brisbane, Australia
| | - Lachlan Coin
- Institute for Molecular Biosciences, University of Queensland, Brisbane, Australia
| |
Collapse
|
45
|
Lee-Kirsch M. Molekulargenetische Diagnostik. Monatsschr Kinderheilkd 2018. [DOI: 10.1007/s00112-018-0491-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
46
|
Podgornaya OI, Ostromyshenskii DI, Enukashvily NI. Who Needs This Junk, or Genomic Dark Matter. BIOCHEMISTRY (MOSCOW) 2018; 83:450-466. [PMID: 29626931 DOI: 10.1134/s0006297918040156] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Centromeres (CEN), pericentromeric regions (periCEN), and subtelomeric regions (subTel) comprise the areas of constitutive heterochromatin (HChr). Tandem repeats (TRs or satellite DNA) are the main components of HChr forming no less than 10% of the mouse and human genome. HChr is assembled within distinct structures in the interphase nuclei of many species - chromocenters. In this review, the main classes of HChr repeat sequences are considered in the order of their number increase in the sequencing reads of the mouse chromocenters (ChrmC). TRs comprise ~70% of ChrmC occupying the first place. Non-LTR (-long terminal repeat) retroposons (mainly LINE, long interspersed nuclear element) are the next (~11%), and endogenous retroviruses (ERV; LTR-containing) are in the third position (~9%). HChr is not enriched with ERV in comparison with the whole genome, but there are differences in distribution of certain elements: while MaLR-like elements (ERV3) are dominant in the whole genome, intracisternal A-particles and corresponding LTR (ERV2) are prevalent in HChr. Most of LINE in ChrmC is represented by the 2-kb fragment at the end of the 2nd open reading frame and its flanking regions. Almost all tandem repeats classified as CEN or periCEN are contained in ChrmC. Our previous classification revealed 60 new mouse TR families with 29 of them being absent in ChrmC, which indicates their location on chromosome arms. TR transcription is necessary for maintenance of heterochromatic status of the HChr genome part. A burst of TR transcription is especially important in embryogenesis and other cases of radical changes in the cell program, including carcinogenesis. The recently discovered mechanism of epigenetic regulation with noncoding sequences transcripts, long noncoding RNA, and its role in embryogenesis and pluripotency maintenance is discussed.
Collapse
Affiliation(s)
- O I Podgornaya
- Institute of Cytology, Russian Academy of Sciences, St. Petersburg, 194064, Russia.
| | | | | |
Collapse
|
47
|
Stanyte R, Nuebler J, Blaukopf C, Hoefler R, Stocsits R, Peters JM, Gerlich DW. Dynamics of sister chromatid resolution during cell cycle progression. J Cell Biol 2018; 217:1985-2004. [PMID: 29695489 PMCID: PMC5987726 DOI: 10.1083/jcb.201801157] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2018] [Revised: 03/16/2018] [Accepted: 04/11/2018] [Indexed: 01/04/2023] Open
Abstract
Faithful genome transmission in dividing cells requires that the two copies of each chromosome's DNA package into separate but physically linked sister chromatids. The linkage between sister chromatids is mediated by cohesin, yet where sister chromatids are linked and how they resolve during cell cycle progression has remained unclear. In this study, we investigated sister chromatid organization in live human cells using dCas9-mEGFP labeling of endogenous genomic loci. We detected substantial sister locus separation during G2 phase irrespective of the proximity to cohesin enrichment sites. Almost all sister loci separated within a few hours after their respective replication and then rapidly equilibrated their average distances within dynamic chromatin polymers. Our findings explain why the topology of sister chromatid resolution in G2 largely reflects the DNA replication program. Furthermore, these data suggest that cohesin enrichment sites are not persistent cohesive sites in human cells. Rather, cohesion might occur at variable genomic positions within the cell population.
Collapse
Affiliation(s)
- Rugile Stanyte
- Institute of Molecular Biotechnology of the Austrian Academy of Sciences, Vienna BioCenter, Vienna, Austria
| | - Johannes Nuebler
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA
| | - Claudia Blaukopf
- Institute of Molecular Biotechnology of the Austrian Academy of Sciences, Vienna BioCenter, Vienna, Austria
| | - Rudolf Hoefler
- Institute of Molecular Biotechnology of the Austrian Academy of Sciences, Vienna BioCenter, Vienna, Austria
| | - Roman Stocsits
- Research Institute of Molecular Pathology, Vienna BioCenter, Vienna, Austria
| | - Jan-Michael Peters
- Research Institute of Molecular Pathology, Vienna BioCenter, Vienna, Austria
| | - Daniel W Gerlich
- Institute of Molecular Biotechnology of the Austrian Academy of Sciences, Vienna BioCenter, Vienna, Austria
| |
Collapse
|
48
|
De Roeck A, Duchateau L, Van Dongen J, Cacace R, Bjerke M, Van den Bossche T, Cras P, Vandenberghe R, De Deyn PP, Engelborghs S, Van Broeckhoven C, Sleegers K. An intronic VNTR affects splicing of ABCA7 and increases risk of Alzheimer's disease. Acta Neuropathol 2018; 135:827-837. [PMID: 29589097 PMCID: PMC5954066 DOI: 10.1007/s00401-018-1841-z] [Citation(s) in RCA: 54] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2018] [Revised: 03/22/2018] [Accepted: 03/23/2018] [Indexed: 01/09/2023]
Abstract
Mutations leading to premature termination codons in ATP-Binding Cassette Subfamily A Member 7 (ABCA7) are high penetrant risk factors of Alzheimer’s disease (AD). The influence of other genetic variants in ABCA7 and downstream functional mechanisms, however, is poorly understood. To address this knowledge gap, we investigated tandem repetitive regions in ABCA7 in a Belgian cohort of 1529 AD patients and control individuals and identified an intronic variable number tandem repeat (VNTR). We observed strong association between VNTR length and a genome-wide associated signal for AD in the ABCA7 locus. Expanded VNTR alleles were highly enriched in AD patients [odds ratio = 4.5 (1.3–24.2)], and VNTR length inversely correlated with amyloid β1–42 in cerebrospinal fluid and ABCA7 expression. In addition, we identified three novel ABCA7 alternative splicing events. One isoform in particular—which is formed through exon 19 skipping—lacks the first nucleotide binding domain of ABCA7 and is abundant in brain tissue. We observed a tight correlation between exon 19 skipping and VNTR length. Our findings underline the importance of studying repetitive DNA in complex disorders and expand the contribution of genetic and transcript variation in ABCA7 to AD.
Collapse
|
49
|
Feng X, Zhou X, Zhou S, Wang J, Hu W. Analysis of microRNA profile of Anopheles sinensis by deep sequencing and bioinformatic approaches. Parasit Vectors 2018. [PMID: 29530087 PMCID: PMC5848538 DOI: 10.1186/s13071-018-2734-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND microRNAs (miRNAs) are small non-coding RNAs widely identified in many mosquitoes. They are reported to play important roles in development, differentiation and innate immunity. However, miRNAs in Anopheles sinensis, one of the Chinese malaria mosquitoes, remain largely unknown. METHODS We investigated the global miRNA expression profile of An. sinensis using Illumina Hiseq 2000 sequencing. Meanwhile, we applied a bioinformatic approach to identify potential miRNAs in An. sinensis. The identified miRNA profiles were compared and analyzed by two approaches. The selected miRNAs from the sequencing result and the bioinformatic approach were confirmed with qRT-PCR. Moreover, target prediction, GO annotation and pathway analysis were carried out to understand the role of miRNAs in An. sinensis. RESULTS We identified 49 conserved miRNAs and 12 novel miRNAs by next-generation high-throughput sequencing technology. In contrast, 43 miRNAs were predicted by the bioinformatic approach, of which two were assigned as novel. Comparative analysis of miRNA profiles by two approaches showed that 21 miRNAs were shared between them. Twelve novel miRNAs did not match any known miRNAs of any organism, indicating that they are possibly species-specific. Forty miRNAs were found in many mosquito species, indicating that these miRNAs are evolutionally conserved and may have critical roles in the process of life. Both the selected known and novel miRNAs (asi-miR-281, asi-miR-184, asi-miR-14, asi-miR-nov5, asi-miR-nov4, asi-miR-9383, and asi-miR-2a) could be detected by quantitative real-time PCR (qRT-PCR) in the sequenced sample, and the expression patterns of these miRNAs measured by qRT-PCR were in concordance with the original miRNA sequencing data. The predicted targets for the known and the novel miRNAs covered many important biological roles and pathways indicating the diversity of miRNA functions. We also found 21 conserved miRNAs and eight counterparts of target immune pathway genes in An. sinensis based on the analysis of An. gambiae. CONCLUSIONS Our results provide the first lead to the elucidation of the miRNA profile in An. sinensis. Unveiling the roles of mosquito miRNAs will undoubtedly lead to a better understanding of mosquito biology and mosquito-pathogen interactions. This work lays the foundation for the further functional study of An. sinensis miRNAs and will facilitate their application in vector control.
Collapse
Affiliation(s)
- Xinyu Feng
- National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention, Key Laboratory of Parasite and Vector Biology, National Health and Family Planning Commission, WHO Collaborating Center for Tropical Diseases, National Center for International Research on Tropical Diseases, Shanghai, 200025, People's Republic of China.,Joint Research Laboratory of Genetics and Ecology on Parasites-hosts Interaction, National Institute of Parasitic Diseases - Fudan University, Shanghai, 200025, China
| | - Xiaojian Zhou
- Institute of Software Engineering, Zhejiang University, Hangzhou, 310011, China
| | - Shuisen Zhou
- National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention, Key Laboratory of Parasite and Vector Biology, National Health and Family Planning Commission, WHO Collaborating Center for Tropical Diseases, National Center for International Research on Tropical Diseases, Shanghai, 200025, People's Republic of China.
| | - Jingwen Wang
- State Key Laboratory of Genetic Engineering, Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, 200438, People's Republic of China.
| | - Wei Hu
- National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention, Key Laboratory of Parasite and Vector Biology, National Health and Family Planning Commission, WHO Collaborating Center for Tropical Diseases, National Center for International Research on Tropical Diseases, Shanghai, 200025, People's Republic of China. .,State Key Laboratory of Genetic Engineering, Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, 200438, People's Republic of China.
| |
Collapse
|
50
|
Farrer RA, Fisher MC. Describing Genomic and Epigenomic Traits Underpinning Emerging Fungal Pathogens. ADVANCES IN GENETICS 2017; 100:73-140. [PMID: 29153405 DOI: 10.1016/bs.adgen.2017.09.009] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
An unprecedented number of pathogenic fungi are emerging and causing disease in animals and plants, putting the resilience of wild and managed ecosystems in jeopardy. While the past decades have seen an increase in the number of pathogenic fungi, they have also seen the birth of new big data technologies and analytical approaches to tackle these emerging pathogens. We review how the linked fields of genomics and epigenomics are transforming our ability to address the challenge of emerging fungal pathogens. We explore the methodologies and bioinformatic toolkits that currently exist to rapidly analyze the genomes of unknown fungi, then discuss how these data can be used to address key questions that shed light on their epidemiology. We show how genomic approaches are leading a revolution into our understanding of emerging fungal diseases and speculate on future approaches that will transform our ability to tackle this increasingly important class of emerging pathogens.
Collapse
|