1
|
Ni BB, Liu H, Wang ZS, Zhang GY, Sang ZY, Liu JJ, He CY, Zhang JG. A chromosome-scale genome of Rhus chinensis Mill. provides new insights into plant-insect interaction and gallotannins biosynthesis. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024; 118:766-786. [PMID: 38271098 DOI: 10.1111/tpj.16631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 12/26/2023] [Accepted: 01/02/2024] [Indexed: 01/27/2024]
Abstract
Rhus chinensis Mill., an economically valuable Anacardiaceae species, is parasitized by the galling aphid Schlechtendalia chinensis, resulting in the formation of the Chinese gallnut (CG). Here, we report a chromosomal-level genome assembly of R. chinensis, with a total size of 389.40 Mb and scaffold N50 of 23.02 Mb. Comparative genomic and transcriptome analysis revealed that the enhanced structure of CG and nutritional metabolism contribute to improving the adaptability of R. chinensis to S. chinensis by supporting CG and galling aphid growth. CG was observed to be abundant in hydrolysable tannins (HT), particularly gallotannin and its isomers. Tandem repeat clusters of dehydroquinate dehydratase/shikimate dehydrogenase (DQD/SDH) and serine carboxypeptidase-like (SCPL) and their homologs involved in HT production were determined as specific to HT-rich species. The functional differentiation of DQD/SDH tandem duplicate genes and the significant contraction in the phenylalanine ammonia-lyase (PAL) gene family contributed to the accumulation of gallic acid and HT while minimizing the production of shikimic acid, flavonoids, and condensed tannins in CG. Furthermore, we identified one UDP glucosyltransferase (UGT84A), three carboxylesterase (CXE), and six SCPL genes from conserved tandem repeat clusters that are involved in gallotannin biosynthesis and hydrolysis in CG. We then constructed a regulatory network of these genes based on co-expression and transcription factor motif analysis. Our findings provide a genomic resource for the exploration of the underlying mechanisms of plant-galling insect interaction and highlight the importance of the functional divergence of tandem duplicate genes in the accumulation of secondary metabolites.
Collapse
Affiliation(s)
- Bing-Bing Ni
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation, National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
- Collaborative Innovation Center of Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, 210037, China
| | - Hong Liu
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation, National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
| | - Zhao-Shan Wang
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation, National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
| | - Guo-Yun Zhang
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation, National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
| | - Zi-Yang Sang
- Forest Enterprise of Wufeng County in Hubei Province, Wufeng, 443400, Hubei, China
| | - Juan-Juan Liu
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation, National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
| | - Cai-Yun He
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation, National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
- Collaborative Innovation Center of Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, 210037, China
| | - Jian-Guo Zhang
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation, National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
- Collaborative Innovation Center of Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, 210037, China
| |
Collapse
|
2
|
Chen Z, Ain NU, Zhao Q, Zhang X. From tradition to innovation: conventional and deep learning frameworks in genome annotation. Brief Bioinform 2024; 25:bbae138. [PMID: 38581418 PMCID: PMC10998533 DOI: 10.1093/bib/bbae138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 03/08/2024] [Accepted: 03/10/2024] [Indexed: 04/08/2024] Open
Abstract
Following the milestone success of the Human Genome Project, the 'Encyclopedia of DNA Elements (ENCODE)' initiative was launched in 2003 to unearth information about the numerous functional elements within the genome. This endeavor coincided with the emergence of numerous novel technologies, accompanied by the provision of vast amounts of whole-genome sequences, high-throughput data such as ChIP-Seq and RNA-Seq. Extracting biologically meaningful information from this massive dataset has become a critical aspect of many recent studies, particularly in annotating and predicting the functions of unknown genes. The core idea behind genome annotation is to identify genes and various functional elements within the genome sequence and infer their biological functions. Traditional wet-lab experimental methods still rely on extensive efforts for functional verification. However, early bioinformatics algorithms and software primarily employed shallow learning techniques; thus, the ability to characterize data and features learning was limited. With the widespread adoption of RNA-Seq technology, scientists from the biological community began to harness the potential of machine learning and deep learning approaches for gene structure prediction and functional annotation. In this context, we reviewed both conventional methods and contemporary deep learning frameworks, and highlighted novel perspectives on the challenges arising during annotation underscoring the dynamic nature of this evolving scientific landscape.
Collapse
Affiliation(s)
- Zhaojia Chen
- National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangzhou 518120, China
- College of Biomedical Engineering, Taiyuan University of Technology, Jinzhong 030600, China
| | - Noor ul Ain
- National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangzhou 518120, China
| | - Qian Zhao
- State Key Laboratory for Ecological Pest Control of Fujian/Taiwan Crops and College of Life Science, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Xingtan Zhang
- National Key Laboratory for Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangzhou 518120, China
| |
Collapse
|
3
|
Li X, Dang Z, Tang W, Zhang H, Shao J, Jiang R, Zhang X, Huang F. Detection of Parasites in the Field: The Ever-Innovating CRISPR/Cas12a. BIOSENSORS 2024; 14:145. [PMID: 38534252 DOI: 10.3390/bios14030145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Revised: 03/11/2024] [Accepted: 03/12/2024] [Indexed: 03/28/2024]
Abstract
The rapid and accurate identification of parasites is crucial for prompt therapeutic intervention in parasitosis and effective epidemiological surveillance. For accurate and effective clinical diagnosis, it is imperative to develop a nucleic-acid-based diagnostic tool that combines the sensitivity and specificity of nucleic acid amplification tests (NAATs) with the speed, cost-effectiveness, and convenience of isothermal amplification methods. A new nucleic acid detection method, utilizing the clustered regularly interspaced short palindromic repeats (CRISPR)-associated (Cas) nuclease, holds promise in point-of-care testing (POCT). CRISPR/Cas12a is presently employed for the detection of Plasmodium falciparum, Toxoplasma gondii, Schistosoma haematobium, and other parasites in blood, urine, or feces. Compared to traditional assays, the CRISPR assay has demonstrated notable advantages, including comparable sensitivity and specificity, simple observation of reaction results, easy and stable transportation conditions, and low equipment dependence. However, a common issue arises as both amplification and cis-cleavage compete in one-pot assays, leading to an extended reaction time. The use of suboptimal crRNA, light-activated crRNA, and spatial separation can potentially weaken or entirely eliminate the competition between amplification and cis-cleavage. This could lead to enhanced sensitivity and reduced reaction times in one-pot assays. Nevertheless, higher costs and complex pre-test genome extraction have hindered the popularization of CRISPR/Cas12a in POCT.
Collapse
Affiliation(s)
- Xin Li
- School of Life Science and Engineering, Foshan University, Foshan 528225, China
| | - Zhisheng Dang
- National Institute of Parasitic Diseases, Chinese Center for Diseases Control and Prevention (Chinese Center for Tropical Diseases Research), Key Laboratory of Parasite and Vector Biology, National Health Commission of the People's Republic of China (NHC), World Health Organization (WHO) Collaborating Center for Tropical Diseases, National Center for International Research on Tropical Diseases, Shanghai 200025, China
| | - Wenqiang Tang
- State Key Laboratory of Hulless Barley and Yak Germplasm Resources and Genetic Improvement, Lhasa 850002, China
- Tibet Academy of Agriculture and Animal Husbandry Sciences, Lhasa 850002, China
| | - Haoji Zhang
- School of Life Science and Engineering, Foshan University, Foshan 528225, China
| | - Jianwei Shao
- School of Life Science and Engineering, Foshan University, Foshan 528225, China
| | - Rui Jiang
- College of Animal Science and Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
| | - Xu Zhang
- School of Life Science and Engineering, Foshan University, Foshan 528225, China
| | - Fuqiang Huang
- School of Life Science and Engineering, Foshan University, Foshan 528225, China
| |
Collapse
|
4
|
Mecca M, Picerno S, Cortellino S. The Killer's Web: Interconnection between Inflammation, Epigenetics and Nutrition in Cancer. Int J Mol Sci 2024; 25:2750. [PMID: 38473997 DOI: 10.3390/ijms25052750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 02/21/2024] [Accepted: 02/23/2024] [Indexed: 03/14/2024] Open
Abstract
Inflammation is a key contributor to both the initiation and progression of tumors, and it can be triggered by genetic instability within tumors, as well as by lifestyle and dietary factors. The inflammatory response plays a critical role in the genetic and epigenetic reprogramming of tumor cells, as well as in the cells that comprise the tumor microenvironment. Cells in the microenvironment acquire a phenotype that promotes immune evasion, progression, and metastasis. We will review the mechanisms and pathways involved in the interaction between tumors, inflammation, and nutrition, the limitations of current therapies, and discuss potential future therapeutic approaches.
Collapse
Affiliation(s)
- Marisabel Mecca
- Laboratory of Preclinical and Translational Research, Centro di Riferimento Oncologico della Basilicata (IRCCS-CROB), 85028 Rionero in Vulture, PZ, Italy
| | - Simona Picerno
- Laboratory of Preclinical and Translational Research, Centro di Riferimento Oncologico della Basilicata (IRCCS-CROB), 85028 Rionero in Vulture, PZ, Italy
| | - Salvatore Cortellino
- Laboratory of Preclinical and Translational Research, Responsible Research Hospital, 86100 Campobasso, CB, Italy
- Scuola Superiore Meridionale (SSM), Clinical and Translational Oncology, 80138 Naples, NA, Italy
- S.H.R.O. Italia Foundation ETS, 10060 Candiolo, TO, Italy
| |
Collapse
|
5
|
Gao J, Sun W, Li J, Ban H, Zhang T, Liao J, Kim N, Lee SH, Dong Q, Madramootoo R, Chen Y, Li F. Rex1BD and the 14-3-3 protein control heterochromatin organization at tandem repeats by linking RNAi and HDAC. Proc Natl Acad Sci U S A 2023; 120:e2309359120. [PMID: 38048463 PMCID: PMC10723143 DOI: 10.1073/pnas.2309359120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2023] [Accepted: 10/30/2023] [Indexed: 12/06/2023] Open
Abstract
Tandem DNA repeats are often organized into heterochromatin that is crucial for genome organization and stability. Recent studies revealed that individual repeats within tandem DNA repeats can behave very differently. How DNA repeats are assembled into distinct heterochromatin structures remains poorly understood. Here, we developed a genome-wide genetic screen using a reporter gene at different units in a repeat array. This screen led to identification of a conserved protein Rex1BD required for heterochromatin silencing. Our structural analysis revealed that Rex1BD forms a four-helix bundle structure with a distinct charged electrostatic surface. Mechanistically, Rex1BD facilitates the recruitment of Clr6 histone deacetylase (HDAC) by interacting with histones. Interestingly, Rex1BD also interacts with the 14-3-3 protein Rad25, which is responsible for recruiting the RITS (RNA-induced transcriptional silencing) complex to DNA repeats. Our results suggest that coordinated action of Rex1BD and Rad25 mediates formation of distinct heterochromatin structure at DNA repeats via linking RNAi and HDAC pathways.
Collapse
Affiliation(s)
- Jinxin Gao
- Department of Biology, New York University, New York, NY10003
| | - Wenqi Sun
- Key Laboratory of Epigenetic Regulation and Intervention, State Key Laboratory of Molecular Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai200031, China
| | - Jie Li
- National Facility for Protein Science Shanghai, Zhangjiang Lab, Shanghai Advanced Research Institute, Chinese Academy of Science, Shanghai201210, China
| | - Hyoju Ban
- Department of Biology, New York University, New York, NY10003
| | - Tuokai Zhang
- Department of Biology, New York University, New York, NY10003
| | - Junwei Liao
- Department of Biology, New York University, New York, NY10003
| | - Namho Kim
- Department of Biology, New York University, New York, NY10003
| | - Soon Hoo Lee
- Department of Biology, New York University, New York, NY10003
| | - Qianhua Dong
- Department of Biology, New York University, New York, NY10003
| | | | - Yong Chen
- Key Laboratory of Epigenetic Regulation and Intervention, State Key Laboratory of Molecular Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai200031, China
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou310024, China
| | - Fei Li
- Department of Biology, New York University, New York, NY10003
| |
Collapse
|
6
|
Bi Y, Hu J, Zeng L, Chen G, Cai H, Cao H, Ma Q, Wu X. Characteristics of HPV integration in cervical adenocarcinoma and squamous carcinoma. J Cancer Res Clin Oncol 2023; 149:17973-17986. [PMID: 37966613 PMCID: PMC10725361 DOI: 10.1007/s00432-023-05494-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Accepted: 10/25/2023] [Indexed: 11/16/2023]
Abstract
PURPOSE HPV integration usually occurs in HPV-related cancer, and is the main cause of cancer. But the carcinogenic mechanism of HPV integration is unclear. The study aims to provide a theoretical basis for understanding the pathogenesis of cervical adenocarcinoma (AC) and cervical squamous carcinoma (SCC). METHODS We used HPV capture sequencing to obtain HPV integration sites in AC and SCC, and analyzed cytobands, distribution of genetic and genomic elements, identified integration hotspot genes, clinicopathological parameters, breakpoints of HPV16 and performed pathway analysis. Then we conducted immunohistochemical (IHC) assay to preliminarily verify the expression of most frequently integrated genes in AC, STARD3 and ERBB2. RESULTS The results revealed that the most frequently observed integrated cytoband was 17q12 in AC and 21p11.2 in SCC, respectively. The breakpoints in both AC and SCC were more tended to occur within gene regions, compared to intergenetic regions. Compared to SCC samples, AC samples had a higher prevalence of genomic elements. In AC, HPV integration has no significantly difference with clinicopathological parameters, but in SCC integration correlated with differentiation (P < 0.05). Breakpoints of HPV in SCC located in LCR more frequently compared to AC, which destroyed the activation of promoter p97. Hotspot genes of HPV integration were STARD3 and ERBB2 in AC, and RNA45S rDNA and MIR3648-1 in SCC, respectively. Meanwhile, we preliminarily proved that the expression of STARD3 and ERBB2, the most frequently integrated genes, would increase after integration. CONCLUSION These results suggested that HPV may utilize the powerful hosts' promoters to express viral oncogenes and overexpression of viral oncogenes plays a significant role in the carcinogenesis of SCC. In AC, HPV integration may affect hosts' oncogenes, and the dysregulation of oncogenes may primarily contribute to progression of AC.
Collapse
Affiliation(s)
- Yuxin Bi
- Maternal and Child Health Hospital of Hubei Province, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Hubei Clinical Medical Research Center for Gynecologic Malignancy, Wuhan, China
| | - Junbo Hu
- Department of Pathology, Maternal and Child Health Hospital of Hubei Province, Wuhan, China
| | - Ling Zeng
- Maternal and Child Health Hospital of Hubei Province, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Hubei Provincial Center for Medical Genetics, Wuhan, China
| | - Gang Chen
- Department of Obstetrics and Gynecology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Hongning Cai
- Maternal and Child Health Hospital of Hubei Province, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Hubei Clinical Medical Research Center for Gynecologic Malignancy, Wuhan, China
| | - Huang Cao
- Maternal and Child Health Hospital of Hubei Province, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Hubei Clinical Medical Research Center for Gynecologic Malignancy, Wuhan, China
- Hubei Provincial Center for Medical Genetics, Wuhan, China
| | - Quanfu Ma
- Maternal and Child Health Hospital of Hubei Province, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- Hubei Clinical Medical Research Center for Gynecologic Malignancy, Wuhan, China
| | - Xufeng Wu
- Maternal and Child Health Hospital of Hubei Province, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.
- Hubei Clinical Medical Research Center for Gynecologic Malignancy, Wuhan, China.
| |
Collapse
|
7
|
Monzon AM, Arrías PN, Elofsson A, Mier P, Andrade-Navarro MA, Bevilacqua M, Clementel D, Bateman A, Hirsh L, Fornasari MS, Parisi G, Piovesan D, Kajava AV, Tosatto SCE. A STRP-ed definition of Structured Tandem Repeats in Proteins. J Struct Biol 2023; 215:108023. [PMID: 37652396 DOI: 10.1016/j.jsb.2023.108023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Revised: 07/31/2023] [Accepted: 08/28/2023] [Indexed: 09/02/2023]
Abstract
Tandem Repeat Proteins (TRPs) are a class of proteins with repetitive amino acid sequences that have been studied extensively for over two decades. Different features at the level of sequence, structure, function and evolution have been attributed to them by various authors. And yet many of its salient features appear only when looking at specific subclasses of protein tandem repeats. Here, we attempt to rationalize the existing knowledge on Tandem Repeat Proteins (TRPs) by pointing out several dichotomies. The emerging picture is more nuanced than generally assumed and allows us to draw some boundaries of what is not a "proper" TRP. We conclude with an operational definition of a specific subset, which we have denominated STRPs (Structural Tandem Repeat Proteins), which separates a subclass of tandem repeats with distinctive features from several other less well-defined types of repeats. We believe that this definition will help researchers in the field to better characterize the biological meaning of this large yet largely understudied group of proteins.
Collapse
Affiliation(s)
- Alexander Miguel Monzon
- Dept. of Information Engineering, University of Padova, via Giovanni Gradenigo 6/B, 35131 Padova, Italy
| | - Paula Nazarena Arrías
- Dept. of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
| | - Arne Elofsson
- Dept. of Biochemistry and Biophysics and Science for Life Laboratory, Stockholm University, Tomtebodavägen 23, 171 21 Solna, Sweden
| | - Pablo Mier
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University of Mainz, Hanns-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
| | - Miguel A Andrade-Navarro
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University of Mainz, Hanns-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
| | - Martina Bevilacqua
- Dept. of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
| | - Damiano Clementel
- Dept. of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Layla Hirsh
- Dept. of Engineering, Faculty of Science and Engineering, Pontifical Catholic University of Peru, Av. Universitaria 1801 San Miguel, Lima 32, Lima, Peru
| | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina
| | - Damiano Piovesan
- Dept. of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
| | - Andrey V Kajava
- Centre de Recherche en Biologie cellulaire de Montpellier (CRBM), UMR 5237 CNRS, Université Montpellier, 1919 Route de Mende, Cedex 5, 34293 Montpellier, France
| | - Silvio C E Tosatto
- Dept. of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy.
| |
Collapse
|
8
|
Zhang GJ, Jia KL, Wang J, Gao WJ, Li SF. Genome-wide analysis of transposable elements and satellite DNA in Humulus scandens, a dioecious plant with XX/XY 1Y 2 chromosomes. FRONTIERS IN PLANT SCIENCE 2023; 14:1230250. [PMID: 37908838 PMCID: PMC10614002 DOI: 10.3389/fpls.2023.1230250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/28/2023] [Accepted: 10/04/2023] [Indexed: 11/02/2023]
Abstract
Transposable elements (TEs) and satellite DNAs, two major categories of repetitive sequences, are expected to accumulate in non-recombining genome regions, including sex-linked regions, and contribute to sex chromosome evolution. The dioecious plant, Humulus scandens, can be used for studying the evolution of the XX/XY1Y2 sex chromosomes. In this study, we thoroughly examined the repetitive components of male and female H. scandens using next-generation sequencing data followed by bioinformatics analysis and florescence in situ hybridization (FISH). The H. scandens genome has a high overall repetitive sequence composition, 68.30% in the female and 66.78% in the male genome, with abundant long terminal repeat (LTR) retrotransposons (RTs), including more Ty3/Gypsy than Ty1/Copia elements, particularly two Ty3/Gypsy lineages, Tekay and Retand. Most LTR-RT lineages were found dispersed across the chromosomes, though CRM and Athila elements were predominately found within the centromeres and the pericentromeric regions. The Athila elements also showed clearly higher FISH signal intensities in the Y1 and Y2 chromosomes than in the X or autosomes. Three novel satellite DNAs were specifically distributed in the centromeric and/or telomeric regions, with markedly different distributions on the X, Y1, and Y2 chromosomes. Combined with FISH using satellite DNAs to stain chromosomes during meiotic diakinesis, we determined the synapsis pattern and distinguish pseudoautosomal regions (PARs). The results indicate that the XY1Y2 sex chromosomes of H. scandens might have originated from a centric fission event. This study improves our understanding of the repetitive sequence organization of H. scandens genome and provides a basis for further analysis of their chromosome evolution process.
Collapse
Affiliation(s)
- Guo-Jun Zhang
- School of Basic Medical Sciences, Xinxiang Medical University, Xinxiang, China
- College of Life Sciences, Henan Normal University, Xinxiang, China
| | - Ke-Li Jia
- College of Life Sciences, Henan Normal University, Xinxiang, China
- SanQuan Medical College, Xinxiang Medical University, Xinxiang, China
| | - Jin Wang
- College of Life Sciences, Henan Normal University, Xinxiang, China
| | - Wu-Jun Gao
- College of Life Sciences, Henan Normal University, Xinxiang, China
| | - Shu-Fen Li
- College of Life Sciences, Henan Normal University, Xinxiang, China
| |
Collapse
|
9
|
Lakhotia SC. C-value paradox: Genesis in misconception that natural selection follows anthropocentric parameters of 'economy' and 'optimum'. BBA ADVANCES 2023; 4:100107. [PMID: 37868661 PMCID: PMC10587719 DOI: 10.1016/j.bbadva.2023.100107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2023] [Revised: 10/11/2023] [Accepted: 10/12/2023] [Indexed: 10/24/2023] Open
Abstract
C-value paradox refers to the lack of correlation between biological complexity and the intuitively expected protein-coding genomic information or DNA content. Here I discuss five questions about this paradox: i) Do biologically complex organisms carry more protein-coding genes? ii) Does variable accumulation of selfish/ junk/ parasitic DNA underlie the c-value paradox? iii) Can nucleoskeletal or nucleotypic function of DNA explain the enigma of orders of magnitude high levels of DNA in some 'lower' taxa or in taxonomically related species? iv) Can the newly understood noncoding but functional DNA explain the c-value paradox? and, v) Does natural selection uniformly apply the anthropocentric parameters for 'optimum' and 'economy'? Answers to Q.1-5 are largely negative. Biology presents numerous 'anomalous' examples where the same end function/ phenotype is attained in different organisms through astoundingly diverse ways that appear 'illogical' in our perceptions. Such evolutionary oddities exist because natural selection, unlike a designer, exploits random and stochastic events to modulate the existing system. Consequently, persistence of the new-found 'solution/s' often appear bizarre, uneconomic, and therefore, paradoxical to human logic. The unexpectedly high c-values in diverse organisms are irreversible evolutionary accidents that persisted, and the additional DNA often got repurposed over the evolutionary time scale. Therefore, the c-value paradox is a redundant issue. Future integrative biological studies should address evolutionary mechanisms and processes underlying sporadic DNA expansions/ contractions, and how the newly acquired DNA content has been repurposed in diverse groups.
Collapse
Affiliation(s)
- Subhash C. Lakhotia
- Cytogenetics Laboratory, Department of Zoology, Institute of Science, Banaras Hindu University, Varanasi 221005, India
| |
Collapse
|
10
|
Rutz C, Bonassin L, Kress A, Francesconi C, Boštjančić LL, Merlat D, Theissinger K, Lecompte O. Abundance and Diversification of Repetitive Elements in Decapoda Genomes. Genes (Basel) 2023; 14:1627. [PMID: 37628678 PMCID: PMC10454600 DOI: 10.3390/genes14081627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 08/05/2023] [Accepted: 08/12/2023] [Indexed: 08/27/2023] Open
Abstract
Repetitive elements are a major component of DNA sequences due to their ability to propagate through the genome. Characterization of Metazoan repetitive profiles is improving; however, current pipelines fail to identify a significant proportion of divergent repeats in non-model organisms. The Decapoda order, for which repeat content analyses are largely lacking, is characterized by extremely variable genome sizes that suggest an important presence of repetitive elements. Here, we developed a new standardized pipeline to annotate repetitive elements in non-model organisms, which we applied to twenty Decapoda and six other Crustacea genomes. Using this new tool, we identified 10% more repetitive elements than standard pipelines. Repetitive elements were more abundant in Decapoda species than in other Crustacea, with a very large number of highly repeated satellite DNA families. Moreover, we demonstrated a high correlation between assembly size and transposable elements and different repeat dynamics between Dendrobranchiata and Reptantia. The patterns of repetitive elements largely reflect the phylogenetic relationships of Decapoda and the distinct evolutionary trajectories within Crustacea. In summary, our results highlight the impact of repetitive elements on genome evolution in Decapoda and the value of our novel annotation pipeline, which will provide a baseline for future comparative analyses.
Collapse
Affiliation(s)
- Christelle Rutz
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
| | - Lena Bonassin
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberg Biodiversity and Climate Research Centre, Georg-Voigt-Str. 14-16, 60325 Frankfurt am Main, Germany; (C.F.); (K.T.)
- Department of Molecular Ecology, Institute for Environmental Sciences, Rhineland-Palatinate Technical University Kaiserslautern Landau, Fortstr. 7, 76829 Landau, Germany
| | - Arnaud Kress
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
| | - Caterina Francesconi
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberg Biodiversity and Climate Research Centre, Georg-Voigt-Str. 14-16, 60325 Frankfurt am Main, Germany; (C.F.); (K.T.)
- Department of Molecular Ecology, Institute for Environmental Sciences, Rhineland-Palatinate Technical University Kaiserslautern Landau, Fortstr. 7, 76829 Landau, Germany
| | - Ljudevit Luka Boštjančić
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberg Biodiversity and Climate Research Centre, Georg-Voigt-Str. 14-16, 60325 Frankfurt am Main, Germany; (C.F.); (K.T.)
- Department of Molecular Ecology, Institute for Environmental Sciences, Rhineland-Palatinate Technical University Kaiserslautern Landau, Fortstr. 7, 76829 Landau, Germany
| | - Dorine Merlat
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
| | - Kathrin Theissinger
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberg Biodiversity and Climate Research Centre, Georg-Voigt-Str. 14-16, 60325 Frankfurt am Main, Germany; (C.F.); (K.T.)
| | - Odile Lecompte
- Department of Computer Science, ICube, UMR 7357, University of Strasbourg, CNRS, Rue Eugène Boeckel 1, 67000 Strasbourg, France; (C.R.); (L.B.); (A.K.); (L.L.B.); (D.M.)
| |
Collapse
|
11
|
McCallum GE, Rossiter AE, Quraishi MN, Iqbal TH, Kuehne SA, van Schaik W. Noise reduction strategies in metagenomic chromosome confirmation capture to link antibiotic resistance genes to microbial hosts. Microb Genom 2023; 9:mgen001030. [PMID: 37272920 PMCID: PMC10327510 DOI: 10.1099/mgen.0.001030] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Accepted: 04/11/2023] [Indexed: 06/06/2023] Open
Abstract
The gut microbiota is a reservoir for antimicrobial resistance genes (ARGs). With current sequencing methods, it is difficult to assign ARGs to their microbial hosts, particularly if these ARGs are located on plasmids. Metagenomic chromosome conformation capture approaches (meta3C and Hi-C) have recently been developed to link bacterial genes to phylogenetic markers, thus potentially allowing the assignment of ARGs to their hosts on a microbiome-wide scale. Here, we generated a meta3C dataset of a human stool sample and used previously published meta3C and Hi-C datasets to investigate bacterial hosts of ARGs in the human gut microbiome. Sequence reads mapping to repetitive elements were found to cause problematic noise in, and may importantly skew interpretation of, meta3C and Hi-C data. We provide a strategy to improve the signal-to-noise ratio by discarding reads that map to insertion sequence elements and to the end of contigs. We also show the importance of using spike-in controls to quantify whether the cross-linking step in meta3C and Hi-C protocols has been successful. After filtering to remove artefactual links, 87 ARGs were assigned to their bacterial hosts across all datasets, including 27 ARGs in the meta3C dataset we generated. We show that commensal gut bacteria are an important reservoir for ARGs, with genes coding for aminoglycoside and tetracycline resistance being widespread in anaerobic commensals of the human gut.
Collapse
Affiliation(s)
- Gregory E. McCallum
- Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Amanda E. Rossiter
- Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | | | - Tariq H. Iqbal
- Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
| | - Sarah A. Kuehne
- Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- School of Dentistry, Institute of Clinical Sciences, University of Birmingham, Birmingham, UK
| | - Willem van Schaik
- Institute of Microbiology and Infection, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| |
Collapse
|
12
|
Lu F, Ruan S, Li Y, Wang Y, Xie P, Zhao X, Chao J, Ma H. Assessment of DNA mutagenicity induced by He-Ne laser using Salmonella typhimurium strains. Appl Microbiol Biotechnol 2023:10.1007/s00253-023-12566-5. [PMID: 37231160 DOI: 10.1007/s00253-023-12566-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 04/23/2023] [Accepted: 04/25/2023] [Indexed: 05/27/2023]
Abstract
Helium-neon (He-Ne) laser mutagenesis is widely used in microbiology and plant breeding. In this study, two frameshift mutant representative strains of Salmonella typhimurium TA97a and TA98 and two base pair substitution types TA100 and TA102 were employed as model microorganisms to assess DNA mutagenicity induced by He-Ne laser (3 J·cm-2·s-1, 632.8 nm) for 10, 20, and 30 min. The results revealed that the optimal laser application was 6 h in the mid-logarithmic growth stage. Low-power He-Ne laser for short treatment inhibited cell growth, and continued treatment stimulated the metabolism. The effects of the laser on TA98 and TA100 were the most prominent. Sequencing results from 1500 TA98 revertants showed that there were 88 insertion and deletion (InDel) types in the hisD3052 gene, of which the InDels unique to laser were 21 more than that of the control. Sequencing results from 760 TA100 revertants indicated that laser treatment created Pro (CCC) in the product of the hisG46 gene more likely to be replaced by His (CAC) or Ser (TCC) than by Leu (CTC). Two unique non-classical base substitutions, CCC → TAC and CCC → CAA, also appeared in the laser group. These findings will provide a theoretical basis for further exploration of laser mutagenesis breeding. KEY POINTS: • Salmonella typhimurium served as model organism for laser mutagenesis study. • Laser promoted the occurrence of InDels in the hisD3052 gene of TA98. • Laser promoted the occurrence of base substitution in the hisG46 gene of TA100.
Collapse
Affiliation(s)
- Feng Lu
- School of Food and Biological Engineering, Jiangsu University, 301 Xuefu Road, Zhenjiang City, 212013, Jiangsu, China
| | - Siyu Ruan
- College of Tea and Food Science Technology, Jiangsu Polytechnic College of Agriculture and Forestry, 19 Wenchangdong Road, Jurong City, 212400, Jiangsu, China
| | - Yunliang Li
- School of Food and Biological Engineering, Jiangsu University, 301 Xuefu Road, Zhenjiang City, 212013, Jiangsu, China
- Institute of Food Physical Processing, Jiangsu University, 301 Xuefu Road, Zhenjiang City, 212013, Jiangsu, China
| | - Yining Wang
- School of Food and Biological Engineering, Jiangsu University, 301 Xuefu Road, Zhenjiang City, 212013, Jiangsu, China
| | - Pengfei Xie
- School of Food and Biological Engineering, Jiangsu University, 301 Xuefu Road, Zhenjiang City, 212013, Jiangsu, China
| | - Xiaoxue Zhao
- School of Food and Biological Engineering, Jiangsu University, 301 Xuefu Road, Zhenjiang City, 212013, Jiangsu, China
| | - Jiapin Chao
- School of Food and Biological Engineering, Jiangsu University, 301 Xuefu Road, Zhenjiang City, 212013, Jiangsu, China
| | - Haile Ma
- School of Food and Biological Engineering, Jiangsu University, 301 Xuefu Road, Zhenjiang City, 212013, Jiangsu, China.
- Institute of Food Physical Processing, Jiangsu University, 301 Xuefu Road, Zhenjiang City, 212013, Jiangsu, China.
| |
Collapse
|
13
|
Ahmad W, Asaf S, Al-Rawahi A, Al-Harrasi A, Khan AL. Comparative plastome genomics, taxonomic delimitation and evolutionary divergences of Tetraena hamiensis var. qatarensis and Tetraena simplex (Zygophyllaceae). Sci Rep 2023; 13:7436. [PMID: 37156827 PMCID: PMC10167353 DOI: 10.1038/s41598-023-34477-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Accepted: 05/02/2023] [Indexed: 05/10/2023] Open
Abstract
The Zygophyllum and Tetraena genera are intriguingly important ecologically and medicinally. Based on morphological characteristics, T. hamiensis var. qatarensis, and T. simplex were transferred from Zygophyllum to Tetraena with the least genomic datasets available. Hence, we sequenced the T. hamiensis and T. simplex and performed in-depth comparative genomics, phylogenetic analysis, and estimated time divergences. The complete plastomes ranged between 106,720 and 106,446 bp-typically smaller than angiosperms plastomes. The plastome circular genomes are divided into large single-copy regions (~ 80,964 bp), small single-copy regions (~ 17,416 bp), and two inverted repeats regions (~ 4170 bp) in both Tetraena species. An unusual shrinkage of IR regions 16-24 kb was identified. This resulted in the loss of 16 genes, including 11 ndh genes which encode the NADH dehydrogenase subunits, and a significant size reduction of Tetraena plastomes compared to other angiosperms. The inter-species variations and similarities were identified using genome-wide comparisons. Phylogenetic trees generated by analyzing the whole plastomes, protein-coding genes, matK, rbcL, and cssA genes exhibited identical topologies, indicating that both species are sisters to the genus Tetraena and may not belong to Zygophyllum. Similarly, based on the entire plastome and proteins coding genes datasets, the time divergence of Zygophyllum and Tetraena was 36.6 Ma and 34.4 Ma, respectively. Tetraena stem ages were 31.7 and 18.2 Ma based on full plastome and protein-coding genes. The current study presents the plastome as a distinguishing and identification feature among the closely related Tetraena and Zygophyllum species. It can be potentially used as a universal super-barcode for identifying plants.
Collapse
Affiliation(s)
- Waqar Ahmad
- Natural and Medical Sciences Research Centre, University of Nizwa, Nizwa, 616, Oman
| | - Sajjad Asaf
- Natural and Medical Sciences Research Centre, University of Nizwa, Nizwa, 616, Oman
| | - Ahmed Al-Rawahi
- Natural and Medical Sciences Research Centre, University of Nizwa, Nizwa, 616, Oman
| | - Ahmed Al-Harrasi
- Natural and Medical Sciences Research Centre, University of Nizwa, Nizwa, 616, Oman.
| | - Abdul Latif Khan
- Department of Engineering Technology, University of Houston, Sugar Land, TX, 77479, USA.
- Department of Biology and Biochemistry, University of Houston, Houston, USA.
| |
Collapse
|
14
|
Ilan Y. Constrained disorder principle-based variability is fundamental for biological processes: Beyond biological relativity and physiological regulatory networks. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2023; 180-181:37-48. [PMID: 37068713 DOI: 10.1016/j.pbiomolbio.2023.04.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Revised: 03/26/2023] [Accepted: 04/14/2023] [Indexed: 04/19/2023]
Abstract
The constrained disorder principle (CDP) defines systems based on their degree of disorder bounded by dynamic boundaries. The principle explains stochasticity in living and non-living systems. Denis Noble described the importance of stochasticity in biology, emphasizing stochastic processes at molecular, cellular, and higher levels in organisms as having a role beyond simple noise. The CDP and Noble's theories (NT) claim that biological systems use stochasticity. This paper presents the CDP and NT, discussing common notions and differences between the two theories. The paper presents the CDP-based concept of taking the disorder beyond its role in nature to correct malfunctions of systems and improve the efficiency of biological systems. The use of CDP-based algorithms embedded in second-generation artificial intelligence platforms is described. In summary, noise is inherent to complex systems and has a functional role. The CDP provides the option of using noise to improve functionality.
Collapse
Affiliation(s)
- Yaron Ilan
- Faculty of Medicine, Hebrew University, Department of Medicine, Hadassah Medical Center, Jerusalem, Israel.
| |
Collapse
|
15
|
Manee MM, Alqahtani FH, Al-Shomrani BM, El-Shafie HAF, Dias GB. Omics in the Red Palm Weevil Rhynchophorus ferrugineus (Olivier) (Coleoptera: Curculionidae): A Bridge to the Pest. INSECTS 2023; 14:255. [PMID: 36975940 PMCID: PMC10054242 DOI: 10.3390/insects14030255] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 02/23/2023] [Accepted: 03/02/2023] [Indexed: 06/18/2023]
Abstract
The red palm weevil (RPW), Rhynchophorus ferrugineus (Coleoptera: Curculionidae), is the most devastating pest of palm trees worldwide. Mitigation of the economic and biodiversity impact it causes is an international priority that could be greatly aided by a better understanding of its biology and genetics. Despite its relevance, the biology of the RPW remains poorly understood, and research on management strategies often focuses on outdated empirical methods that produce sub-optimal results. With the development of omics approaches in genetic research, new avenues for pest control are becoming increasingly feasible. For example, genetic engineering approaches become available once a species's target genes are well characterized in terms of their sequence, but also population variability, epistatic interactions, and more. In the last few years alone, there have been major advances in omics studies of the RPW. Multiple draft genomes are currently available, along with short and long-read transcriptomes, and metagenomes, which have facilitated the identification of genes of interest to the RPW scientific community. This review describes omics approaches previously applied to RPW research, highlights findings that could be impactful for pest management, and emphasizes future opportunities and challenges in this area of research.
Collapse
Affiliation(s)
- Manee M. Manee
- National Center for Bioinformatics, King Abdulaziz City for Science and Technology, Riyadh 11442, Saudi Arabia
- Institute of Advanced Agricultural and Food Technologies, King Abdulaziz City for Science and Technology, Riyadh 11442, Saudi Arabia
| | - Fahad H. Alqahtani
- National Center for Bioinformatics, King Abdulaziz City for Science and Technology, Riyadh 11442, Saudi Arabia
- Institute of Advanced Agricultural and Food Technologies, King Abdulaziz City for Science and Technology, Riyadh 11442, Saudi Arabia
| | - Badr M. Al-Shomrani
- National Center for Bioinformatics, King Abdulaziz City for Science and Technology, Riyadh 11442, Saudi Arabia
- Institute of Advanced Agricultural and Food Technologies, King Abdulaziz City for Science and Technology, Riyadh 11442, Saudi Arabia
| | | | | |
Collapse
|
16
|
Villarreal L, Witzany G. Self-empowerment of life through RNA networks, cells and viruses. F1000Res 2023; 12:138. [PMID: 36785664 PMCID: PMC9918806 DOI: 10.12688/f1000research.130300.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/20/2023] [Indexed: 01/05/2024] Open
Abstract
Our understanding of the key players in evolution and of the development of all organisms in all domains of life has been aided by current knowledge about RNA stem-loop groups, their proposed interaction motifs in an early RNA world and their regulative roles in all steps and substeps of nearly all cellular processes, such as replication, transcription, translation, repair, immunity and epigenetic marking. Cooperative evolution was enabled by promiscuous interactions between single-stranded regions in the loops of naturally forming stem-loop structures in RNAs. It was also shown that cooperative RNA stem-loops outcompete selfish ones and provide foundational self-constructive groups (ribosome, editosome, spliceosome, etc.). Self-empowerment from abiotic matter to biological behavior does not just occur at the beginning of biological evolution; it is also essential for all levels of socially interacting RNAs, cells and viruses.
Collapse
Affiliation(s)
- Luis Villarreal
- Center for Virus Research, University of California, Irvine, California, USA
| | - Guenther Witzany
- Telos - Philosophische Praxis, Buermoos, Salzburg, 5111, Austria
| |
Collapse
|
17
|
Villarreal L, Witzany G. Self-empowerment of life through RNA networks, cells and viruses. F1000Res 2023; 12:138. [PMID: 36785664 PMCID: PMC9918806 DOI: 10.12688/f1000research.130300.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/23/2023] [Indexed: 03/08/2023] Open
Abstract
Our understanding of the key players in evolution and of the development of all organisms in all domains of life has been aided by current knowledge about RNA stem-loop groups, their proposed interaction motifs in an early RNA world and their regulative roles in all steps and substeps of nearly all cellular processes, such as replication, transcription, translation, repair, immunity and epigenetic marking. Cooperative evolution was enabled by promiscuous interactions between single-stranded regions in the loops of naturally forming stem-loop structures in RNAs. It was also shown that cooperative RNA stem-loops outcompete selfish ones and provide foundational self-constructive groups (ribosome, editosome, spliceosome, etc.). Self-empowerment from abiotic matter to biological behavior does not just occur at the beginning of biological evolution; it is also essential for all levels of socially interacting RNAs, cells and viruses.
Collapse
Affiliation(s)
- Luis Villarreal
- Center for Virus Research, University of California, Irvine, California, USA
| | - Guenther Witzany
- Telos - Philosophische Praxis, Buermoos, Salzburg, 5111, Austria
| |
Collapse
|
18
|
Chen W, Chen H, Liao J, Tang M, Qin H, Zhao Z, Liu X, Wu Y, Jiang L, Zhang L, Fang B, Feng X, Zhang B, Reid K, Merilä J. Chromosome-level genome assembly of a high-altitude-adapted frog (Rana kukunoris) from the Tibetan plateau provides insight into amphibian genome evolution and adaptation. Front Zool 2023; 20:1. [PMID: 36604706 PMCID: PMC9817415 DOI: 10.1186/s12983-022-00482-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Accepted: 12/22/2022] [Indexed: 01/07/2023] Open
Abstract
BACKGROUND The high-altitude-adapted frog Rana kukunoris, occurring on the Tibetan plateau, is an excellent model to study life history evolution and adaptation to harsh high-altitude environments. However, genomic resources for this species are still underdeveloped constraining attempts to investigate the underpinnings of adaptation. RESULTS The R. kukunoris genome was assembled to a size of 4.83 Gb and the contig N50 was 1.80 Mb. The 6555 contigs were clustered and ordered into 12 pseudo-chromosomes covering ~ 93.07% of the assembled genome. In total, 32,304 genes were functionally annotated. Synteny analysis between the genomes of R. kukunoris and a low latitude species Rana temporaria showed a high degree of chromosome level synteny with one fusion event between chr11 and chr13 forming pseudo-chromosome 11 in R. kukunoris. Characterization of features of the R. kukunoris genome identified that 61.5% consisted of transposable elements and expansions of gene families related to cell nucleus structure and taste sense were identified. Ninety-five single-copy orthologous genes were identified as being under positive selection and had functions associated with the positive regulation of proteins in the catabolic process and negative regulation of developmental growth. These gene family expansions and positively selected genes indicate regions for further interrogation to understand adaptation to high altitude. CONCLUSIONS Here, we reported a high-quality chromosome-level genome assembly of a high-altitude amphibian species using a combination of Illumina, PacBio and Hi-C sequencing technologies. This genome assembly provides a valuable resource for subsequent research on R. kukunoris genomics and amphibian genome evolution in general.
Collapse
Affiliation(s)
- Wei Chen
- grid.252245.60000 0001 0085 4987School of Resources and Environmental Engineering, Anhui University, Hefei, 230601 China ,Anhui Shengjin Lake Wetland Ecology National Long-Term Scientific Research Base, Dongzhi, 247230 China ,grid.252245.60000 0001 0085 4987Anhui Province Key Laboratory of Wetland Ecosystem Protection and Restoration, Anhui University, Hefei, 230601 China
| | - Hongzhou Chen
- grid.252245.60000 0001 0085 4987School of Resources and Environmental Engineering, Anhui University, Hefei, 230601 China
| | - Jiahong Liao
- grid.464385.80000 0004 1804 2321School of Life Science and Technology, Mianyang Normal University, Mianyang, 621000 Sichuan China
| | - Min Tang
- grid.464385.80000 0004 1804 2321School of Life Science and Technology, Mianyang Normal University, Mianyang, 621000 Sichuan China
| | - Haifen Qin
- grid.464385.80000 0004 1804 2321School of Life Science and Technology, Mianyang Normal University, Mianyang, 621000 Sichuan China
| | - Zhenkun Zhao
- grid.464385.80000 0004 1804 2321School of Life Science and Technology, Mianyang Normal University, Mianyang, 621000 Sichuan China
| | - Xueyan Liu
- grid.252245.60000 0001 0085 4987School of Resources and Environmental Engineering, Anhui University, Hefei, 230601 China
| | - Yanfang Wu
- grid.252245.60000 0001 0085 4987School of Resources and Environmental Engineering, Anhui University, Hefei, 230601 China
| | - Lichun Jiang
- grid.464385.80000 0004 1804 2321School of Life Science and Technology, Mianyang Normal University, Mianyang, 621000 Sichuan China
| | - Lixia Zhang
- grid.462338.80000 0004 0605 6769Department of Ecology, College of Life Sciences, Henan Normal University, Xinxiang, 453007 China
| | - Bohao Fang
- grid.38142.3c000000041936754XDepartment of Organismic and Evolutionary Biology and Museum of Comparative Zoology, Harvard University, 26 Oxford Street, Cambridge, MA USA
| | - Xueyun Feng
- grid.7737.40000 0004 0410 2071Ecological Genetics Research Unit, Research Programme in Organismal and Evolutionary Biology, Faculty of Biological and Environmental Sciences, University of Helsinki, 00014 Helsinki, Finland
| | - Baowei Zhang
- grid.252245.60000 0001 0085 4987School of Life Sciences, Anhui University, Hefei, 230601 China
| | - Kerry Reid
- grid.194645.b0000000121742757Area of Ecology and Biodiversity, School of Biological Sciences, The University of Hong Kong, Hong Kong SAR, China
| | - Juha Merilä
- grid.7737.40000 0004 0410 2071Ecological Genetics Research Unit, Research Programme in Organismal and Evolutionary Biology, Faculty of Biological and Environmental Sciences, University of Helsinki, 00014 Helsinki, Finland ,grid.194645.b0000000121742757Area of Ecology and Biodiversity, School of Biological Sciences, The University of Hong Kong, Hong Kong SAR, China
| |
Collapse
|
19
|
Li Y, Ruan S, Lu F, Xie P, Liu X, Ma H. Studies on ultrasound-mediated insertion-deletion polymorphisms of DNA and underlying mechanisms based on Ames tester strains. ULTRASONICS SONOCHEMISTRY 2023; 92:106270. [PMID: 36543046 PMCID: PMC9794972 DOI: 10.1016/j.ultsonch.2022.106270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 12/05/2022] [Accepted: 12/14/2022] [Indexed: 06/17/2023]
Abstract
Low-lethality ultrasound technology has received more and more attention in regulating microorganisms of fermentation industry. Herein, two representative Ames tester strains TA97a and TA98 as model organisms were used to explore the effects of ultrasound on insertion-deletion (InDel) polymorphisms of microbial DNA and its underlying mechanisms. Results revealed that a promotion was observed in the reversion mutation of TA98 upon sonication. Sequencing results from 1752 TA98 revertants showed that there was a total of 127 InDels, of which the InDels unique to ultrasound were 36 more than that of the control. Compared with the control, ultrasound-mediated InDels of DNA displayed additional -29 bp deletion and +7 ∼ +43 bp insertions of direct repeat sequences. Combined with the analysis of transcriptomics and prediction of secondary structure of single-stranded DNA from InDels core region (No. 832 ∼ 915 bp) in hisD3052 gene of TA98 strain, ultrasound-mediated "thermal breathing" mechanism was proposed based on the formation of DNA hairpin structure with micro-homologous sequence. This finding implied that low-intensity ultrasound is expected to be developed a new low-lethal mutagenic technology for continuous mutagenesis.
Collapse
Affiliation(s)
- Yunliang Li
- School of Food and Biological Engineering, Jiangsu University, 301 Xuefu Road, Zhenjiang 212013, PR China; Institute of Food Physical Processing, Jiangsu University, 301 Xuefu Road, Zhenjiang, Jiangsu 212013, China
| | - Siyu Ruan
- College of Tea and Food Science Technology, Jiangsu Polytechnic College of Agriculture and Forestry, 19 Wenchangdong Road, Jurong, Jiangsu 212400, PR China.
| | - Feng Lu
- School of Food and Biological Engineering, Jiangsu University, 301 Xuefu Road, Zhenjiang 212013, PR China
| | - Pengfei Xie
- School of Food and Biological Engineering, Jiangsu University, 301 Xuefu Road, Zhenjiang 212013, PR China
| | - Xiaoshuang Liu
- School of Food and Biological Engineering, Jiangsu University, 301 Xuefu Road, Zhenjiang 212013, PR China
| | - Haile Ma
- School of Food and Biological Engineering, Jiangsu University, 301 Xuefu Road, Zhenjiang 212013, PR China; Institute of Food Physical Processing, Jiangsu University, 301 Xuefu Road, Zhenjiang, Jiangsu 212013, China.
| |
Collapse
|
20
|
Kukkar D, Sharma PK, Kim KH. Recent advances in metagenomic analysis of different ecological niches for enhanced biodegradation of recalcitrant lignocellulosic biomass. ENVIRONMENTAL RESEARCH 2022; 215:114369. [PMID: 36165858 DOI: 10.1016/j.envres.2022.114369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 09/06/2022] [Accepted: 09/15/2022] [Indexed: 06/16/2023]
Abstract
Lignocellulose wastes stemming from agricultural residues can offer an excellent opportunity as alternative energy solutions in addition to fossil fuels. Besides, the unrestrained burning of agricultural residues can lead to the destruction of the soil microflora and associated soil sterilization. However, the difficulties associated with the biodegradation of lignocellulose biomasses remain as a formidable challenge for their sustainable management. In this respect, metagenomics can be used as an effective option to resolve such dilemma because of its potential as the next generation sequencing technology and bioinformatics tools to harness novel microbial consortia from diverse environments (e.g., soil, alpine forests, and hypersaline/acidic/hot sulfur springs). In light of the challenges associated with the bulk-scale biodegradation of lignocellulose-rich agricultural residues, this review is organized to help delineate the fundamental aspects of metagenomics towards the assessment of the microbial consortia and novel molecules (such as biocatalysts) which are otherwise unidentifiable by conventional laboratory culturing techniques. The discussion is extended further to highlight the recent advancements (e.g., from 2011 to 2022) in metagenomic approaches for the isolation and purification of lignocellulolytic microbes from different ecosystems along with the technical challenges and prospects associated with their wide implementation and scale-up. This review should thus be one of the first comprehensive reports on the metagenomics-based analysis of different environmental samples for the isolation and purification of lignocellulose degrading enzymes.
Collapse
Affiliation(s)
- Deepak Kukkar
- Department of Biotechnology, Chandigarh University, Gharuan, Mohali - 140413, Punjab, India; University Centre for Research and Development, Chandigarh University, Gharuan, Mohali - 140413, Punjab, India.
| | | | - Ki-Hyun Kim
- Department of Civil and Environmental Engineering, Hanyang University, Seongdong-gu, Wangsimni-ro, Seoul - 04763, South Korea.
| |
Collapse
|
21
|
Nevo E, Li K. Sympatric Speciation in Mole Rats and Wild Barley and Their Genome Repeatome Evolution: A Commentary. ADVANCED GENETICS (HOBOKEN, N.J.) 2022; 3:2200009. [PMID: 36911292 PMCID: PMC9993473 DOI: 10.1002/ggn2.202200009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2022] [Revised: 07/16/2022] [Indexed: 11/05/2022]
Abstract
The theories of sympatric speciation (SS) and coding and noncoding (cd and ncd =repeatome) genome function are still contentious. Studies on SS in our two new models, "Evolution Canyon" and "Evolution Plateau", in Israel, divergent microclimatically and geologically-edaphically, respectively, indicated that in ecologically divergent microsites SS is a common speciation model across life from bacteria to mammals. Genomically, the intergenic ncd repeatome was and is still regarded by many biologists as "selfish," "junk," and non-functional. In contrast, it is considered by the encyclopedia of DNA elements discovery as biochemically functional and regulatory, and the transposable elements were considered earlier by Barbara McClintock as "controlling elements" of genes. Remarkably, it is found that repeated elements can statistically identify significantly, the five species of subterranean mole rats of Spalax ehrenbergi superspecies adapted to increasingly arid climatic trend southward in Israel. Moreover, it is first discovered in the SS studies in two distant taxa, subterranean mole rats and wild barley, and later also in spiny mice in Israel and subterranean zokors in China, that the noncoding repeatome is genomically mirroring the image of the protein-coding genome in divergent ecologies. It is shown that this mirroring image is statistically significant both within and between the ecologically divergent taxa supporting the hypothesis that much of the repeatome might be regulatory and selected as the protein-coding genome by the same ecological stresses.
Collapse
Affiliation(s)
- Eviatar Nevo
- Institute of EvolutionUniversity of HaifaHaifa3498838Israel
| | - Kexin Li
- State Key Laboratory of Grassland Agro‐ecosystemCollege of EcologyLanzhou UniversityLanzhou730000China
| |
Collapse
|
22
|
Balachandra S, Sarkar S, Amodeo AA. The Nuclear-to-Cytoplasmic Ratio: Coupling DNA Content to Cell Size, Cell Cycle, and Biosynthetic Capacity. Annu Rev Genet 2022; 56:165-185. [PMID: 35977407 PMCID: PMC10165727 DOI: 10.1146/annurev-genet-080320-030537] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Though cell size varies between different cells and across species, the nuclear-to-cytoplasmic (N/C) ratio is largely maintained across species and within cell types. A cell maintains a relatively constant N/C ratio by coupling DNA content, nuclear size, and cell size. We explore how cells couple cell division and growth to DNA content. In some cases, cells use DNA as a molecular yardstick to control the availability of cell cycle regulators. In other cases, DNA sets a limit for biosynthetic capacity. Developmentally programmed variations in the N/C ratio for a given cell type suggest that a specific N/C ratio is required to respond to given physiological demands. Recent observations connecting decreased N/C ratios with cellular senescence indicate that maintaining the proper N/C ratio is essential for proper cellular functioning. Together, these findings suggest a causative, not simply correlative, role for the N/C ratio in regulating cell growth and cell cycle progression.
Collapse
Affiliation(s)
- Shruthi Balachandra
- Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire, USA; ,
| | - Sharanya Sarkar
- Department of Microbiology and Immunology, Dartmouth College, Hanover, New Hampshire, USA;
| | - Amanda A Amodeo
- Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire, USA; ,
| |
Collapse
|
23
|
Wang Q, Xiong F, Wu G, Liu W, Chen J, Wang B, Chen Y. Gene body methylation in cancer: molecular mechanisms and clinical applications. Clin Epigenetics 2022; 14:154. [PMID: 36443876 PMCID: PMC9706891 DOI: 10.1186/s13148-022-01382-9] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 11/18/2022] [Indexed: 11/29/2022] Open
Abstract
DNA methylation is an important epigenetic mechanism that regulates gene expression. To date, most DNA methylation studies have focussed on CpG islands in the gene promoter region, and the mechanism of methylation and the regulation of gene expression after methylation have been clearly elucidated. However, genome-wide methylation studies have shown that DNA methylation is widespread not only in promoters but also in gene bodies. Gene body methylation is widely involved in the expression regulation of many genes and is closely related to the occurrence and progression of malignant tumours. This review focusses on the formation of gene body methylation patterns, its regulation of transcription, and its relationship with tumours, providing clues to explore the mechanism of gene body methylation in regulating gene transcription and its significance and application in the field of oncology.
Collapse
Affiliation(s)
- Qi Wang
- grid.33199.310000 0004 0368 7223Department of Biliary-Pancreatic Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, No. 1095 Jiefang Road, Wuhan, 430074 Hubei Province China
| | - Fei Xiong
- grid.33199.310000 0004 0368 7223Department of Biliary-Pancreatic Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, No. 1095 Jiefang Road, Wuhan, 430074 Hubei Province China
| | - Guanhua Wu
- grid.33199.310000 0004 0368 7223Department of Biliary-Pancreatic Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, No. 1095 Jiefang Road, Wuhan, 430074 Hubei Province China
| | - Wenzheng Liu
- grid.33199.310000 0004 0368 7223Department of Biliary-Pancreatic Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, No. 1095 Jiefang Road, Wuhan, 430074 Hubei Province China
| | - Junsheng Chen
- grid.33199.310000 0004 0368 7223Department of Biliary-Pancreatic Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, No. 1095 Jiefang Road, Wuhan, 430074 Hubei Province China
| | - Bing Wang
- grid.33199.310000 0004 0368 7223Department of Biliary-Pancreatic Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, No. 1095 Jiefang Road, Wuhan, 430074 Hubei Province China
| | - Yongjun Chen
- grid.33199.310000 0004 0368 7223Department of Biliary-Pancreatic Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, No. 1095 Jiefang Road, Wuhan, 430074 Hubei Province China
| |
Collapse
|
24
|
Roy R, Marakkar S, Vayalil MP, Shahanaz A, Anil AP, Kunnathpeedikayil S, Rawal I, Shetty K, Shameer Z, Sathees S, Prasannakumar AP, Mathew OK, Subramanian L, Shameer K, Yadav KK. Drug-food Interactions in the Era of Molecular Big Data, Machine Intelligence, and Personalized Health. RECENT ADVANCES IN FOOD, NUTRITION & AGRICULTURE 2022; 13:27-50. [PMID: 36173075 DOI: 10.2174/2212798412666220620104809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 03/04/2022] [Accepted: 03/30/2022] [Indexed: 12/29/2022]
Abstract
The drug-food interaction brings forth changes in the clinical effects of drugs. While favourable interactions bring positive clinical outcomes, unfavourable interactions may lead to toxicity. This article reviews the impact of food intake on drug-food interactions, the clinical effects of drugs, and the effect of drug-food in correlation with diet and precision medicine. Emerging areas in drug-food interactions are the food-genome interface (nutrigenomics) and nutrigenetics. Understanding the molecular basis of food ingredients, including genomic sequencing and pharmacological implications of food molecules, helps to reduce the impact of drug-food interactions. Various strategies are being leveraged to alleviate drug-food interactions; measures including patient engagement, digital health, approaches involving machine intelligence, and big data are a few of them. Furthermore, delineating the molecular communications across dietmicrobiome- drug-food-drug interactions in a pharmacomicrobiome framework may also play a vital role in personalized nutrition. Determining nutrient-gene interactions aids in making nutrition deeply personalized and helps mitigate unwanted drug-food interactions, chronic diseases, and adverse events from their onset. Translational bioinformatics approaches could play an essential role in the next generation of drug-food interaction research. In this landscape review, we discuss important tools, databases, and approaches along with key challenges and opportunities in drug-food interaction and its immediate impact on precision medicine.
Collapse
Affiliation(s)
- Romy Roy
- Molecular Robotics, Cochin, Kerala, India
| | | | | | - Alisha Shahanaz
- Molecular Robotics, Cochin, Kerala, India.,Sanaria Inc, Rockville, MD, USA
| | - Athira Panicker Anil
- Molecular Robotics, Cochin, Kerala, India.,Mar Athanasious College for Advanced Studies, Tiruvalla, India
| | - Shameer Kunnathpeedikayil
- Molecular Robotics, Cochin, Kerala, India.,Thiruvalla, Kerala; People Care Health LLP Thrissur, Kerala, India
| | | | | | | | - Saraswathi Sathees
- Molecular Robotics, Cochin, Kerala, India.,University of Washington Seattle, Washington WA, USA
| | | | | | - Lakshminarayanan Subramanian
- Department of Computer Science, Courant Institute of Mathematical Sciences, New York University, New York, NY, USA
| | - Khader Shameer
- Northwell Health, New York, NY, USA and Faculty of Medicine, Imperial College London, London, UK
| | - Kamlesh K Yadav
- School of Engineering Medicine, Center for Genomic and Precision Medicine, Texas A&M University, Houston, TX 77030, USA.,Department of Translational Medical Sciences, Center for Genomic and Precision Medicine, Texas A&M University, Houston, TX 77030, USA
| |
Collapse
|
25
|
Kroupin PY, Badaeva ED, Sokolova VM, Chikida NN, Belousova MK, Surzhikov SA, Nikitina EA, Kocheshkova AA, Ulyanov DS, Ermolaev AS, Khuat TML, Razumova OV, Yurkina AI, Karlov GI, Divashuk MG. Aegilops crassa Boiss. repeatome characterized using low-coverage NGS as a source of new FISH markers: Application in phylogenetic studies of the Triticeae. FRONTIERS IN PLANT SCIENCE 2022; 13:980764. [PMID: 36325551 PMCID: PMC9621091 DOI: 10.3389/fpls.2022.980764] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 08/29/2022] [Indexed: 06/13/2023]
Abstract
Aegilops crassa Boiss. is polyploid grass species that grows in the eastern part of the Fertile Crescent, Afghanistan, and Middle Asia. It consists of tetraploid (4x) and hexaploid (6x) cytotypes (2n = 4x = 28, D1D (Abdolmalaki et al., 2019) XcrXcr and 2n = 6x = 42, D1D (Abdolmalaki et al., 2019) XcrXcrD2D (Adams and Wendel, 2005), respectively) that are similar morphologically. Although many Aegilops species were used in wheat breeding, the genetic potential of Ae. crassa has not yet been exploited due to its uncertain origin and significant genome modifications. Tetraploid Ae. crassa is thought to be the oldest polyploid Aegilops species, the subgenomes of which still retain some features of its ancient diploid progenitors. The D1 and D2 subgenomes of Ae. crassa were contributed by Aegilops tauschii (2n = 2x = 14, DD), while the Xcr subgenome donor is still unknown. Owing to its ancient origin, Ae. crassa can serve as model for studying genome evolution. Despite this, Ae. crassa is poorly studied genetically and no genome sequences were available for this species. We performed low-coverage genome sequencing of 4x and 6x cytotypes of Ae. crassa, and four Ae. tauschii accessions belonging to different subspecies; diploid wheatgrass Thinopyrum bessarabicum (Jb genome), which is phylogenetically close to D (sub)genome species, was taken as an outgroup. Subsequent data analysis using the pipeline RepeatExplorer2 allowed us to characterize the repeatomes of these species and identify several satellite sequences. Some of these sequences are novel, while others are found to be homologous to already known satellite sequences of Triticeae species. The copy number of satellite repeats in genomes of different species and their subgenome (D1 or Xcr) affinity in Ae. crassa were assessed by means of comparative bioinformatic analysis combined with quantitative PCR (qPCR). Fluorescence in situ hybridization (FISH) was performed to map newly identified satellite repeats on chromosomes of common wheat, Triticum aestivum, 4x and 6x Ae. crassa, Ae. tauschii, and Th. bessarabicum. The new FISH markers can be used in phylogenetic analyses of the Triticeae for chromosome identification and the assessment of their subgenome affinities and for evaluation of genome/chromosome constitution of wide hybrids or polyploid species.
Collapse
Affiliation(s)
- Pavel Yu. Kroupin
- All-Russia Research Institute of Agricultural Biotechnology, Kurchatov Genomics Centre – ARRIAB, Moscow, Russia
| | - Ekaterina D. Badaeva
- N.I.Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - Victoria M. Sokolova
- All-Russia Research Institute of Agricultural Biotechnology, Kurchatov Genomics Centre – ARRIAB, Moscow, Russia
| | - Nadezhda N. Chikida
- All-Russian Institute of Plant Genetic Resources (VIR), Department of Wheat Genetic Resources, St. Petersburg, Russia
| | - Maria Kh. Belousova
- All-Russian Institute of Plant Genetic Resources (VIR), Department of Wheat Genetic Resources, St. Petersburg, Russia
| | - Sergei A. Surzhikov
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - Ekaterina A. Nikitina
- All-Russia Research Institute of Agricultural Biotechnology, Kurchatov Genomics Centre – ARRIAB, Moscow, Russia
| | - Alina A. Kocheshkova
- All-Russia Research Institute of Agricultural Biotechnology, Kurchatov Genomics Centre – ARRIAB, Moscow, Russia
| | - Daniil S. Ulyanov
- All-Russia Research Institute of Agricultural Biotechnology, Kurchatov Genomics Centre – ARRIAB, Moscow, Russia
| | - Aleksey S. Ermolaev
- All-Russia Research Institute of Agricultural Biotechnology, Kurchatov Genomics Centre – ARRIAB, Moscow, Russia
| | - Thi Mai Luong Khuat
- Agricultural Genetics Institute, Department of Molecular Biology, Hanoi, Vietnam
| | - Olga V. Razumova
- All-Russia Research Institute of Agricultural Biotechnology, Kurchatov Genomics Centre – ARRIAB, Moscow, Russia
| | - Anna I. Yurkina
- All-Russia Research Institute of Agricultural Biotechnology, Kurchatov Genomics Centre – ARRIAB, Moscow, Russia
| | - Gennady I. Karlov
- All-Russia Research Institute of Agricultural Biotechnology, Kurchatov Genomics Centre – ARRIAB, Moscow, Russia
| | - Mikhail G. Divashuk
- All-Russia Research Institute of Agricultural Biotechnology, Kurchatov Genomics Centre – ARRIAB, Moscow, Russia
| |
Collapse
|
26
|
Yan S, Liu X, Li C, Jiang Z, Li D, Zhu L. Genomic virulence genes profile analysis of Salmonella enterica isolates from animal and human in China from 2004 to 2019. Microb Pathog 2022; 173:105808. [PMID: 36183957 DOI: 10.1016/j.micpath.2022.105808] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2022] [Revised: 09/07/2022] [Accepted: 09/26/2022] [Indexed: 11/16/2022]
Abstract
Salmonella is a momentously zoonotic and food-borne pathogen that seriously threats human and animal health around the world. Salmonella pathogenicity is closely related to its virulence genes profile. However, conventional virulence gene analysis methods cannot truly reveal whole virulence genes carried by Salmonella. In this study, whole genome sequencing in combination with Virulence Factor Database were applied to investigate whole virulence gene profiles of 243 Salmonella isolates from animals and humans in China from 2004 to 2019. The results showed that a total of 670 virulence genes were identified in Salmonella, among them, 319 virulence genes were found in all the Salmonella tested isolates, and 9 virulence genes were unique to Salmonella. The 670 virulence genes were classified into 14 categories according to their functions, and the genes related to adherence, effector delivery system, immune modulation, motility and nutritional/metabolic factors accounted for 84.63%. Relationships between virulence genes and serovars, sequence types indicated that strains belonged to the same serovar or sequence type had similar virulence genes profiles, however, isolates from different sources, years and locations of isolation had variable virulence gene profiles. In addition, copy number of virulence genes and homologous virulence genes shared with other pathogens were also analyzed in this study. In summary, we investigated pan-genomic virulence gene profiles and molecular epidemiology of Salmonella isolates from humans and animals in China from 2004 to 2019. These findings are beneficial for pathogenic monitoring, investigation of virulence evolution as well as prevention and control of Salmonella.
Collapse
Affiliation(s)
- Shigan Yan
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan, 25053, China
| | - Xu Liu
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan, 25053, China
| | - Chengyu Li
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan, 25053, China
| | - Zhaoxu Jiang
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan, 25053, China
| | - Donghui Li
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan, 25053, China
| | - Liping Zhu
- School of Bioengineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan, 25053, China.
| |
Collapse
|
27
|
Dong X, Mkala EM, Mutinda ES, Yang JX, Wanga VO, Oulo MA, Onjolo VO, Hu GW, Wang QF. Taxonomy, comparative genomics of Mullein (Verbascum, Scrophulariaceae), with implications for the evolution of Verbascum and Lamiales. BMC Genomics 2022; 23:566. [PMID: 35941527 PMCID: PMC9358837 DOI: 10.1186/s12864-022-08799-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Accepted: 07/28/2022] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND The genus Verbascum L. (Scrophulariaceae) is distributed in Africa, Europe, and parts of Asia, with the Mediterranean having the most species variety. Several researchers have already worked on the phylogenetic and taxonomic analysis of Verbascum by using ITS data and chloroplast genome fragments and have produced different conclusions. The taxonomy and phylogenetic relationships of this genus are unclear. RESULTS The complete plastomes (cp) lengths for V. chaixii, V. songaricum, V. phoeniceum, V. blattaria, V. sinaiticum, V. thapsus, and V. brevipedicellatum ranged from 153,014 to 153,481 bp. The cp coded 114 unique genes comprising of 80 protein-coding genes, four ribosomal RNA (rRNA), and 30 tRNA genes. We detected variations in the repeat structures, gene expansion on the inverted repeat, and single copy (IR/SC) boundary regions. The substitution rate analysis indicated that some genes were under purifying selection pressure. Phylogenetic analysis supported the sister relationship of (Lentibulariaceae + Acanthaceae + Bignoniaceae + Verbenaceae + Pedaliaceae) and (Lamiaceae + Phyrymaceae + Orobanchaceae + Paulowniaceae + Mazaceae) in Lamiales. Within Scrophulariaceae, Verbascum was sister to Scrophularia, while Buddleja formed a monophyletic clade from (Scrophularia + Verbascum) with high bootstrap support values. The relationship of the nine species within Verbascum was highly supported. CONCLUSION Based on the phylogenetic results, we proposed to reinstate the species status of V. brevipedicellatum (Engl.) Hub.-Mor. Additionally, three genera (Mazus, Lancea, and Dodartia) placed in the Phyrymaceae family formed a separate clade within Lamiaceae. The classification of the three genera was supported by previous studies. Thus, the current study also suggests the circumscription of these genera as documented previously to be reinstated. The divergence time of Lamiales was approximated to be 86.28 million years ago (Ma) (95% highest posterior density (HPD), 85.12-89.91 Ma). The complete plastomes sequence data of the Verbascum species will be important for understanding the Verbascum phylogenetic relationships and evolution in order Lamiales.
Collapse
Affiliation(s)
- Xiang Dong
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China.,Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, CN-430074, China.,University of Chinese Academy of Sciences, Beijing, CN-100049, China
| | - Elijah Mbandi Mkala
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China.,Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, CN-430074, China.,University of Chinese Academy of Sciences, Beijing, CN-100049, China.,East African Herbarium, National Museums of Kenya, P.O Box 451660-0100, Nairobi, Kenya
| | - Elizabeth Syowai Mutinda
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China.,Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, CN-430074, China.,University of Chinese Academy of Sciences, Beijing, CN-100049, China
| | - Jia-Xin Yang
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China.,Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, CN-430074, China.,University of Chinese Academy of Sciences, Beijing, CN-100049, China
| | - Vincent Okelo Wanga
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China.,Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, CN-430074, China.,University of Chinese Academy of Sciences, Beijing, CN-100049, China
| | - Millicent Akinyi Oulo
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China.,Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, CN-430074, China.,University of Chinese Academy of Sciences, Beijing, CN-100049, China
| | - Victor Omondi Onjolo
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China.,Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, CN-430074, China.,University of Chinese Academy of Sciences, Beijing, CN-100049, China
| | - Guang-Wan Hu
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China. .,Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, CN-430074, China. .,University of Chinese Academy of Sciences, Beijing, CN-100049, China.
| | - Qing-Feng Wang
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, China.,Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan, CN-430074, China
| |
Collapse
|
28
|
Podgornaya OI. Nuclear organization by satellite DNA, SAF-A/hnRNPU and matrix attachment regions. Semin Cell Dev Biol 2022; 128:61-68. [PMID: 35484025 DOI: 10.1016/j.semcdb.2022.04.018] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Revised: 04/18/2022] [Accepted: 04/18/2022] [Indexed: 12/15/2022]
Abstract
The need of large-scale chromatin organization in the nucleus has become more and more appreciated. The higher order nuclear organization ultimately regulate a plethora of biological processes including transcription, DNA replication, and DNA repair. In this context, it is of critical importance to understand the mechanisms that allow higher order nuclear organization. Scaffold Attachment Factor A (SAF-A/hnRNPU), which was originally identified as the component of nuclear matrix, has emerged as an important regulator of higher order nuclear organization. It is shown that SAF-A/hnRNPU binds to tandem repeats (TRs) and scaffold/matrix attachment regions (S/MAR) in a sequence-non-specific, but structure-specific manner (e.g. DNA curvature). Recent studies showed that SAF-A interacts with chromatin-associated RNAs (caRNAs) to regulate interphase chromatin structures in a transcription-dependent manner. It is proposed that SAF-A/hnRNPU and caRNAs form a dynamic, transcriptionally responsive chromatin mesh that organizes chromatin in a large scale. The common structural features of S/MAR and pericentromeric (periCEN) TR promotes SAF-A-mediated association with each other. Collectively a model is presented wherein SAF-A/hnRNPU and periCEN TR are the key players in large-scale nuclear organization that supports general transcription.
Collapse
Affiliation(s)
- O I Podgornaya
- Institute of Cytology RAS, St. Petersburg State University, Russia.
| |
Collapse
|
29
|
Whole-genome survey and phylogenetic analysis of Gadus macrocephalus. Biosci Rep 2022; 42:231542. [PMID: 35788826 PMCID: PMC9289796 DOI: 10.1042/bsr20221037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 06/24/2022] [Accepted: 07/05/2022] [Indexed: 11/17/2022] Open
Abstract
Gadus macrocephalus (Pacific cod) is an economically important species on the northern coast of the Pacific. Although numerous studies on G. macrocephalus exist, there are few reports on its genomic data. Here, we used whole-genome sequencing data to elucidate the genomic characteristics and phylogenetic relationship of G. macrocephalus. From the 19-mer frequency distribution, the genome size was estimated to be 658.22 Mb. The heterozygosity, repetitive sequence content and GC content were approximately 0.62%, 27.50% and 44.73%, respectively. The draft genome sequences were initially assembled, yielding a total of 500,760 scaffolds (N50 = 3565 bp). A total of 789,860 microsatellite motifs were identified from the genomic data, and dinucleotide repeat was the most dominant simple sequence repeat motif. As a byproduct of whole-genome sequencing, the mitochondrial genome was assembled to investigate the evolutionary relationships between G. macrocephalus and its relatives. On the basis of 13 protein-coding gene sequences of the mitochondrial genome of Gadidae species, the maximum likelihood phylogenetic tree showed that complicated relationships and divergence times among Gadidae species. Demographic history analysis revealed changes in the G. macrocephalus population during the Pleistocene by using the pairwise sequentially Markovian coalescent model. These findings supplement the genomic data of G. macrocephalus, and make a valuable contribution to the whole-genome studies on G. macrocephalus.
Collapse
|
30
|
Yandım C, Karakülah G. Repeat expression is linked to patient survival and exhibits single nucleotide variation in pancreatic cancer revealing LTR70:r.879A>G. Gene X 2022; 822:146344. [PMID: 35183687 DOI: 10.1016/j.gene.2022.146344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Revised: 02/03/2022] [Accepted: 02/14/2022] [Indexed: 11/04/2022] Open
Abstract
Despite an overwhelming number of cancer literature reporting the links between patient survival and the expression levels of genes or mutations/single nucleotide variations (SNVs) on them, there is only limited information on repeat elements, which make at least half the human genome. Here, we analysed RNA-seq data obtained from primary pancreatic cancer tissues of 51 patients and revealed that two transposons, HERVI-int and X6A_LINE, showed an upregulation trend in the patients who lived shorter, along with 56 other potential repeats which were linked to survival. We also detected expressed single nucleotide variations (SNVs) on repeats, among which LTR70:r.879A>G stands out with the effect of its presence on this particular repeat's expression levels and a significant link to overall patient survival. Interestingly, the expression of LTR70:r.879A>G correlated with different cancer genes in comparison to its reference version highlighting the involvement of BRAF and Fumerate Hydratase with this expressed SNV. This is one of the first studies revealing possible links between repeat expression and survival in cancer and it warrants further research in this avenue.
Collapse
Affiliation(s)
- Cihangir Yandım
- İzmir University of Economics, Faculty of Engineering, Department of Genetics and Bioengineering, 35330 Balçova, İzmir, Turkey; İzmir Biomedicine and Genome Center (IBG), Dokuz Eylül University Health Campus, 35340 İnciraltı, İzmir, Turkey
| | - Gökhan Karakülah
- İzmir Biomedicine and Genome Center (IBG), Dokuz Eylül University Health Campus, 35340 İnciraltı, İzmir, Turkey; İzmir International Biomedicine and Genome Institute, Dokuz Eylül University, 35340 İnciraltı, İzmir, Turkey.
| |
Collapse
|
31
|
Multiple heterochromatin diversification events in the genome of fungus-farming ants: insights from repetitive sequences. Chromosoma 2022; 131:59-75. [PMID: 35325297 DOI: 10.1007/s00412-022-00770-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Revised: 01/18/2022] [Accepted: 02/21/2022] [Indexed: 11/03/2022]
Abstract
A substantial portion of the eukaryotic genome includes repetitive DNA, which is important for its stability, regulation, and architecture. Fungus-farming ant genomes show remarkable structural rearrangement rates that were necessary for the establishment of their agriculture-based lifestyle, highlighting the relevance of this peculiar group in understanding the repetitive portion of ant genome. Chromosomal banding studies are in accordance with genomic data because they show that repetitive heterochromatic sequences of basal and derivative Attina species are GC-rich, an uncommon trait in Formicidae. To understand the evolutionary dynamics of heterochromatin in Attina, we compared GC-rich heterochromatin patterns between the Paleoattina and Neoattina clades of this subtribe. To this end, we hybridized the Mrel-C0t probe (highly and moderately repetitive DNA) obtained from Mycetomoellerius relictus, Neoattina with GC-rich heterochromatin, in karyotypes of Paleoattina and Neoattina species. Additionally, we mapped the repetitive sequences (GA)15 and (TTAGG)6 in species of the two clades to investigate their organization and evolutionary patterns in the genome of Attina. The Mrel-C0t probe marked the heterochromatin in M. relictus, in other Mycetomoellerius spp., and in species of Mycetarotes, Cyphomyrmex, and Sericomyrmex (Neoattina). In Mycetomoellerius urichii, only pericentromeric heterochromatin was marked with Mrel-C0t. No marking was observed in Paleoattina species or in Atta and Acromyrmex (Neoattina). These results indicated that different evolutionary events led to heterochromatin differentiation in Attina. The most likely hypothesis is that GC-rich heterochromatin arose in the common ancestor of the two clades and accumulated various changes throughout evolution. The sequences (GA)15 and (TTAGG)6 located in euchromatin and telomeres, respectively, showed more homogeneous results among the species.
Collapse
|
32
|
Liao X, Hu K, Salhi A, Zou Y, Wang J, Gao X. msRepDB: a comprehensive repetitive sequence database of over 80 000 species. Nucleic Acids Res 2021; 50:D236-D245. [PMID: 34850956 PMCID: PMC8728181 DOI: 10.1093/nar/gkab1089] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2021] [Revised: 10/18/2021] [Accepted: 11/30/2021] [Indexed: 11/13/2022] Open
Abstract
Repeats are prevalent in the genomes of all bacteria, plants and animals, and they cover nearly half of the Human genome, which play indispensable roles in the evolution, inheritance, variation and genomic instability, and serve as substrates for chromosomal rearrangements that include disease-causing deletions, inversions, and translocations. Comprehensive identification, classification and annotation of repeats in genomes can provide accurate and targeted solutions towards understanding and diagnosis of complex diseases, optimization of plant properties and development of new drugs. RepBase and Dfam are two most frequently used repeat databases, but they are not sufficiently complete. Due to the lack of a comprehensive repeat database of multiple species, the current research in this field is far from being satisfactory. LongRepMarker is a new framework developed recently by our group for comprehensive identification of genomic repeats. We here propose msRepDB based on LongRepMarker, which is currently the most comprehensive multi-species repeat database, covering >80 000 species. Comprehensive evaluations show that msRepDB contains more species, and more complete repeats and families than RepBase and Dfam databases. (https://msrepdb.cbrc.kaust.edu.sa/pages/msRepDB/index.html).
Collapse
Affiliation(s)
- Xingyu Liao
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia.,Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, P.R. China
| | - Kang Hu
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, P.R. China
| | - Adil Salhi
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia
| | - You Zou
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, P.R. China
| | - Jianxin Wang
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, P.R. China
| | - Xin Gao
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia
| |
Collapse
|
33
|
Karakülah G, Yandim C. Identification of differentially expressed genomic repeats in primary hepatocellular carcinoma and their potential links to biological processes and survival. Turk J Biol 2021; 45:599-612. [PMID: 34803457 PMCID: PMC8574195 DOI: 10.3906/biy-2104-13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Accepted: 06/19/2021] [Indexed: 11/05/2022] Open
Abstract
Hepatocellular carcinoma (HCC) is one of the deadliest cancers. Research on HCC so far primarily focused on genes and provided limited information on genomic repeats, which constitute more than half of the human genome and contribute to genomic stability. In line with this, repeat dysregulation was significantly shown to be pathological in various cancers and other diseases. In this study, we aimed to determine the full repeat expression profile of HCC for the first time. We utilised two independent RNA-seq datasets obtained from primary HCC tumours with matched normal tissues of 20 and 17 HCC patients, respectively. We quantified repeat expressions and analysed their differential expression. We also identified repeats that are cooperatively expressed with genes by constructing a gene coexpression network. Our results indicated that HCC tumours in both datasets harbour 24 differentially expressed repeats and even more elements were coexpressed with genes involved in various metabolic pathways. We discovered that two L1 elements (L1M3b, L1M3de) were downregulated and a handful of HERV subfamily repeats (HERV-Fc1-int, HERV3-int, HERVE_a-int, HERVK11D-int, HERVK14C-int, HERVL18-int) were upregulated with the exception of HERV1_LTRc, which was downregulated. Various LTR elements (LTR32, LTR9, LTR4, LTR52-int, LTR70) and MER elements (MER11C, MER11D, MER57C1, MER9a1, MER74C) were implicated along with few other subtypes including Charlie12, MLT2A2, Tigger15a, Tigger 17b. The only satellite repeat differentially expressed in both datasets was GSATII, whose expression was upregulated in 33 (>90%) out of 37 patients. Notably, GSATII expression correlated with HCC survival genes. Elements discovered here promise future studies to be considered for biomarker and HCC therapy research. The coexpression pattern of the GSATII satellite with HCC survival genes and the fact that it has been upregulated in the vast majority of patients make this repeat particularly stand out for HCC.
Collapse
Affiliation(s)
- Gökhan Karakülah
- İzmir Biomedicine and Genome Center (İBG), İzmir Turkey.,İzmir International Biomedicine and Genome Institute (İBG-İzmir), Dokuz Eylül University, İzmir Turkey
| | - Cihangir Yandim
- İzmir Biomedicine and Genome Center (İBG), İzmir Turkey.,Department of Genetics and Bioengineering, Faculty of Engineering, İzmir University of Economics, İzmir Turkey
| |
Collapse
|
34
|
Tibatan MA, Sarısaman M. Unitary structure of palindromes in DNA. Biosystems 2021; 211:104565. [PMID: 34740704 DOI: 10.1016/j.biosystems.2021.104565] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2021] [Revised: 10/20/2021] [Accepted: 10/20/2021] [Indexed: 12/13/2022]
Abstract
We investigate the quantum behavior encountered in palindromes within DNA structure. In particular, we reveal the unitary structure of usual palindromic sequences found in genomic DNAs of all living organisms, using the Schwinger's approach. We clearly demonstrate the role played by palindromic configurations with special emphasis on physical symmetries, in particular subsymmetries of unitary structure. We unveil the prominence of unitary structure in palindromic sequences in the sense that vitally significant information endowed within DNA could be transformed unchangeably in the process of transcription. We introduce a new symmetry relation, namely purine-purine or pyrimidine-pyrimidine symmetries (p-symmetry) in addition to the already known symmetry relation of purine-pyrimidine symmetries (pp-symmetry) given by Chargaff's rule. Therefore, important vital functions of a living organisms are protected by means of these symmetric features. It is understood that higher order palindromic sequences could be generated in terms of the basis of the highest prime numbers that make up the palindrome sequence number. We propose that violation of this unitary structure of palindromic sequences by means of our proposed symmetries leads to a mutation in DNA, which could offer a new perspective in the scientific studies on the origin and cause of mutation.
Collapse
Affiliation(s)
- Mehmet Ali Tibatan
- Department of Biotechnology, Istanbul University, 34134, Vezneciler, Istanbul, Turkey.
| | - Mustafa Sarısaman
- Department of Physics, Istanbul University, 34134, Vezneciler, Istanbul, Turkey.
| |
Collapse
|
35
|
The large bat Helitron DNA transposase forms a compact monomeric assembly that buries and protects its covalently bound 5'-transposon end. Mol Cell 2021; 81:4271-4286.e4. [PMID: 34403695 DOI: 10.1016/j.molcel.2021.07.028] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Revised: 07/23/2021] [Accepted: 07/23/2021] [Indexed: 12/22/2022]
Abstract
Helitrons are widespread eukaryotic DNA transposons that have significantly contributed to genome variability and evolution, in part because of their distinctive, replicative rolling-circle mechanism, which often mobilizes adjacent genes. Although most eukaryotic transposases form oligomers and use RNase H-like domains to break and rejoin double-stranded DNA (dsDNA), Helitron transposases contain a single-stranded DNA (ssDNA)-specific HUH endonuclease domain. Here, we report the cryo-electron microscopy structure of a Helitron transposase bound to the 5'-transposon end, providing insight into its multidomain architecture and function. The monomeric transposase forms a tightly packed assembly that buries the covalently attached cleaved end, protecting it until the second end becomes available. The structure reveals unexpected architectural similarity to TraI, a bacterial relaxase that also catalyzes ssDNA movement. The HUH active site suggests how two juxtaposed tyrosines, a feature of many replication initiators that use HUH nucleases, couple the conformational shift of an α-helix to control strand cleavage and ligation reactions.
Collapse
|
36
|
Liao X, Li M, Hu K, Wu FX, Gao X, Wang J. A sensitive repeat identification framework based on short and long reads. Nucleic Acids Res 2021; 49:e100. [PMID: 34214175 PMCID: PMC8464074 DOI: 10.1093/nar/gkab563] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2020] [Revised: 06/08/2021] [Accepted: 06/18/2021] [Indexed: 12/11/2022] Open
Abstract
Numerous studies have shown that repetitive regions in genomes play indispensable roles in the evolution, inheritance and variation of living organisms. However, most existing methods cannot achieve satisfactory performance on identifying repeats in terms of both accuracy and size, since NGS reads are too short to identify long repeats whereas SMS (Single Molecule Sequencing) long reads are with high error rates. In this study, we present a novel identification framework, LongRepMarker, based on the global de novo assembly and k-mer based multiple sequence alignment for precisely marking long repeats in genomes. The major characteristics of LongRepMarker are as follows: (i) by introducing barcode linked reads and SMS long reads to assist the assembly of all short paired-end reads, it can identify the repeats to a greater extent; (ii) by finding the overlap sequences between assemblies or chomosomes, it locates the repeats faster and more accurately; (iii) by using the multi-alignment unique k-mers rather than the high frequency k-mers to identify repeats in overlap sequences, it can obtain the repeats more comprehensively and stably; (iv) by applying the parallel alignment model based on the multi-alignment unique k-mers, the efficiency of data processing can be greatly optimized and (v) by taking the corresponding identification strategies, structural variations that occur between repeats can be identified. Comprehensive experimental results show that LongRepMarker can achieve more satisfactory results than the existing de novo detection methods (https://github.com/BioinformaticsCSU/LongRepMarker).
Collapse
Affiliation(s)
- Xingyu Liao
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, P.R. China
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia
| | - Min Li
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, P.R. China
| | - Kang Hu
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, P.R. China
| | - Fang-Xiang Wu
- Department of Mechanical Engineering and Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, SK S7N5A9, Canada
| | - Xin Gao
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia
| | - Jianxin Wang
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, P.R. China
| |
Collapse
|
37
|
Kostanjšek R, Diderichsen B, Recknagel H, Gunde-Cimerman N, Gostinčar C, Fan G, Kordiš D, Trontelj P, Jiang H, Bolund L, Luo Y. Toward the massive genome of Proteus anguinus-illuminating longevity, regeneration, convergent evolution, and metabolic disorders. Ann N Y Acad Sci 2021; 1507:5-11. [PMID: 34480358 DOI: 10.1111/nyas.14686] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Revised: 08/13/2021] [Accepted: 08/17/2021] [Indexed: 12/27/2022]
Abstract
Deciphering the genetic code of organisms with unusual phenotypes can help answer fundamental biological questions and provide insight into mechanisms relevant to human biomedical research. The cave salamander Proteus anguinus (Urodela: Proteidae), also known as the olm, is an example of a species with unique morphological and physiological adaptations to its subterranean environment, including regenerative abilities, resistance to prolonged starvation, and a life span of more than 100 years. However, the structure and sequence of the olm genome is still largely unknown owing to its enormous size, estimated at nearly 50 gigabases. An international Proteus Genome Research Consortium has been formed to decipher the olm genome. This perspective provides the scientific and biomedical rationale for exploring the olm genome and outlines potential outcomes, challenges, and methodological approaches required to analyze and annotate the genome of this unique amphibian.
Collapse
Affiliation(s)
- Rok Kostanjšek
- Department of Biology, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Børge Diderichsen
- Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| | - Hans Recknagel
- Department of Biology, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Nina Gunde-Cimerman
- Department of Biology, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Cene Gostinčar
- Department of Biology, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia.,Lars Bolund Institute of Regenerative Medicine, Qingdao-Europe Advanced Institute for Life Sciences, BGI-Qingdao, BGI-Shenzhen, Qingdao, China
| | - Guangyi Fan
- Lars Bolund Institute of Regenerative Medicine, Qingdao-Europe Advanced Institute for Life Sciences, BGI-Qingdao, BGI-Shenzhen, Qingdao, China
| | - Dušan Kordiš
- Department of Molecular and Biomedical Sciences, Jožef Stefan Institute, Ljubljana, Slovenia
| | - Peter Trontelj
- Department of Biology, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | | | - Lars Bolund
- Lars Bolund Institute of Regenerative Medicine, Qingdao-Europe Advanced Institute for Life Sciences, BGI-Qingdao, BGI-Shenzhen, Qingdao, China.,Department of Biomedicine, Aarhus University, Aarhus, Denmark
| | - Yonglun Luo
- Lars Bolund Institute of Regenerative Medicine, Qingdao-Europe Advanced Institute for Life Sciences, BGI-Qingdao, BGI-Shenzhen, Qingdao, China.,Department of Biomedicine, Aarhus University, Aarhus, Denmark.,Steno Diabetes Center Aarhus, Aarhus University Hospital, Aarhus, Denmark
| |
Collapse
|
38
|
Joseph SJ, Park S, Kelley A, Roy S, Cope JR, Ali IKM. Comparative Genomic and Transcriptomic Analysis of Naegleria fowleri Clinical and Environmental Isolates. mSphere 2021; 6:e0063721. [PMID: 34378985 PMCID: PMC8386437 DOI: 10.1128/msphere.00637-21] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 07/20/2021] [Indexed: 11/21/2022] Open
Abstract
Out of over 40 species of Naegleria, which are free-living thermophilic amebae found in freshwater and soil worldwide, only Naegleria fowleri infects humans, causing primary amebic meningoencephalitis (PAM), a typically fatal brain disease. To understand the population structure of Naegleria species and the genetic relationships between N. fowleri isolates and to detect pathogenic factors, we characterized 52 novel clinical and environmental N. fowleri genomes and a single Naegleria lovaniensis strain, along with transcriptomic data for a subset of 37 N. fowleri isolates. Whole-genome analysis of 56 isolates from three Naegleria species (N. fowleri, N. lovaniensis, and Naegleria gruberi) identified several genes unique to N. fowleri that have previously been linked to the pathogenicity of N. fowleri, while other unique genes could be associated with novel pathogenicity factors in this highly fatal pathogen. Population structure analysis estimated the presence of 10 populations within the three Naegleria species, of which 7 populations were within N. fowleri. The whole-nuclear-genome (WNG) phylogenetic analysis showed an overall geographical clustering of N. fowleri isolates, with few exceptions, and provided higher resolution in identifying potential clusters of isolates beyond that of the traditional locus typing. There were only 34 genes that showed significant differences in gene expression between the clinical and environmental isolates. Genomic data generated in this study can be used for developing rapid molecular assays and to conduct future population-based global genomic analysis and will also be a valuable addition to genomic reference databases, where shotgun metagenomics data from routine water samples could be searched for the presence of N. fowleri strains. IMPORTANCE N. fowleri, the only known Naegleria species to infect humans, causes fatal brain disease. PAM cases from 1965 to 2016 showed <20 cases per year globally. Out of approximately 150 cases in North America since 1962, only four PAM survivors are known, yielding a >97% case fatality rate, which is critically high. Although the pathogenesis of N. fowleri has been studied for the last 50 years, pathogenetic factors that lead to human infection and breaching the blood-brain barrier remain unknown. In addition, little is known regarding the genomic diversity both within N. fowleri isolates and among Naegleria species. In this study, we generated novel genome sequences and performed comparative genomic and transcriptomic analysis of a set of 52 N. fowleri draft genome sequences from clinical and environmental isolates derived from all over the world in the last 53 years, which will help shape future genome-wide studies and develop sensitive assays for routine surveillance.
Collapse
Affiliation(s)
- Sandeep J. Joseph
- Waterborne Disease Prevention Branch, Division of Foodborne, Waterborne, and Environmental Diseases, National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
| | - Subin Park
- Eagle Medical Services, Atlanta, Georgia, USA
| | | | - Shantanu Roy
- Waterborne Disease Prevention Branch, Division of Foodborne, Waterborne, and Environmental Diseases, National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
| | - Jennifer R. Cope
- Waterborne Disease Prevention Branch, Division of Foodborne, Waterborne, and Environmental Diseases, National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
| | - Ibne Karim M. Ali
- Waterborne Disease Prevention Branch, Division of Foodborne, Waterborne, and Environmental Diseases, National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
| |
Collapse
|
39
|
Valeri MP, Dias GB, do Espírito Santo AA, Moreira CN, Yonenaga-Yassuda Y, Sommer IB, Kuhn GCS, Svartman M. First Description of a Satellite DNA in Manatees' Centromeric Regions. Front Genet 2021; 12:694866. [PMID: 34504514 PMCID: PMC8421680 DOI: 10.3389/fgene.2021.694866] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Accepted: 07/30/2021] [Indexed: 11/18/2022] Open
Abstract
Trichechus manatus and Trichechus inunguis are the two Sirenia species that occur in the Americas. Despite their increasing extinction risk, many aspects of their biology remain understudied, including the repetitive DNA fraction of their genomes. Here we used the sequenced genome of T. manatus and TAREAN to identify satellite DNAs (satDNAs) in this species. We report the first description of TMAsat, a satDNA comprising ~0.87% of the genome, with ~684bp monomers and centromeric localization. In T. inunguis, TMAsat showed similar monomer length, chromosome localization and conserved CENP-B box-like motifs as in T. manatus. We also detected this satDNA in the Dugong dugon and in the now extinct Hydrodamalis gigas genomes. The neighbor-joining tree shows that TMAsat sequences from T. manatus, T. inunguis, D. dugon, and H. gigas lack species-specific clusters, which disagrees with the predictions of concerted evolution. We detected a divergent TMAsat-like homologous sequence in elephants and hyraxes, but not in other mammals, suggesting this sequence was already present in the common ancestor of Paenungulata, and later became a satDNA in the Sirenians. This is the first description of a centromeric satDNA in manatees and will facilitate the inclusion of Sirenia in future studies of centromeres and satDNA biology.
Collapse
Affiliation(s)
- Mirela Pelizaro Valeri
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Guilherme Borges Dias
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA, United States
| | - Alice Alves do Espírito Santo
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Camila Nascimento Moreira
- Departamento de Genética e Biologia Evolutiva, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil
| | - Yatiyo Yonenaga-Yassuda
- Departamento de Genética e Biologia Evolutiva, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil
| | - Iara Braga Sommer
- Centro Nacional de Pesquisa e Conservação da Biodiversidade Marinha do Nordeste, Instituto Chico Mendes de Conservação da Biodiversidade, Brasília, Brazil
| | - Gustavo C. S. Kuhn
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Marta Svartman
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| |
Collapse
|
40
|
Hausmann F, Kurtz S. DeepGRP: engineering a software tool for predicting genomic repetitive elements using Recurrent Neural Networks with attention. Algorithms Mol Biol 2021; 16:20. [PMID: 34425870 PMCID: PMC8381506 DOI: 10.1186/s13015-021-00199-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Accepted: 08/03/2021] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Repetitive elements contribute a large part of eukaryotic genomes. For example, about 40 to 50% of human, mouse and rat genomes are repetitive. So identifying and classifying repeats is an important step in genome annotation. This annotation step is traditionally performed using alignment based methods, either in a de novo approach or by aligning the genome sequence to a species specific set of repetitive sequences. Recently, Li (Bioinformatics 35:4408-4410, 2019) developed a novel software tool dna-brnn to annotate repetitive sequences using a recurrent neural network trained on sample annotations of repetitive elements. RESULTS We have developed the methods of dna-brnn further and engineered a new software tool DeepGRP. This combines the basic concepts of Li (Bioinformatics 35:4408-4410, 2019) with current techniques developed for neural machine translation, the attention mechanism, for the task of nucleotide-level annotation of repetitive elements. An evaluation on the human genome shows a 20% improvement of the Matthews correlation coefficient for the predictions delivered by DeepGRP, when compared to dna-brnn. DeepGRP predicts two additional classes of repeats (compared to dna-brnn) and is able to transfer repeat annotations, using RepeatMasker-based training data to a different species (mouse). Additionally, we could show that DeepGRP predicts repeats annotated in the Dfam database, but not annotated by RepeatMasker. DeepGRP is highly scalable due to its implementation in the TensorFlow framework. For example, the GPU-accelerated version of DeepGRP is approx. 1.8 times faster than dna-brnn, approx. 8.6 times faster than RepeatMasker and over 100 times faster than HMMER searching for models of the Dfam database. CONCLUSIONS By incorporating methods from neural machine translation, DeepGRP achieves a consistent improvement of the quality of the predictions compared to dna-brnn. Improved running times are obtained by employing TensorFlow as implementation framework and the use of GPUs. By incorporating two additional classes of repeats, DeepGRP provides more complete annotations, which were evaluated against three state-of-the-art tools for repeat annotation.
Collapse
Affiliation(s)
- Fabian Hausmann
- Institute of Medical Systems Biology, University Medical Center Hamburg-Eppendorf, Falkenried 94, 20251 Hamburg, Germany
| | - Stefan Kurtz
- ZBH - Center for Bioinformatics, MIN-Fakultät, Universität Hamburg, Bundesstrasse 43, 20146 Hamburg, Germany
| |
Collapse
|
41
|
Complete Chloroplast Genome Sequence of Fortunella venosa (Champ. ex Benth.) C.C.Huang (Rutaceae): Comparative Analysis, Phylogenetic Relationships, and Robust Support for Its Status as an Independent Species. FORESTS 2021. [DOI: 10.3390/f12080996] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Fortunella venosa (Rutaceae) is an endangered species endemic to China and its taxonomic status has been controversial. The genus Fortunella contains a variety of important economic plants with high value in food, medicine, and ornamental. However, the placement of Genus Fortunella into Genus Citrus has led to controversy on its taxonomy and Systematics. In this present research, the Chloroplast genome of F. venosa was sequenced using the second-generation sequencing, and its structure and phylogenetic relationship analyzed. The results showed that the Chloroplast genome size of F. venosa was 160,265 bp, with a typical angiosperm four-part ring structure containing a large single copy region (LSC) (87,597 bp), a small single copy region (SSC) (18,732 bp), and a pair of inverted repeat regions (IRa\IRb) (26,968 bp each). There are 134 predicted genes in Chloroplast genome, including 89 protein-coding genes, 8 rRNAs, and 37 tRNAs. The GC-content of the whole Chloroplast genome was 43%, with the IR regions having a higher GC content than the LSC and the SSC regions. There were no rearrangements present in the Chloroplast genome; however, the IR regions showed obvious contraction and expansion. A total of 108 simple sequence repeats (SSRs) were present in the entire chloroplast genome and the nucleotide polymorphism was high in LSC and SSC. In addition, there is a preference for codon usage with the non-coding regions being more conserved than the coding regions. Phylogenetic analysis showed that species of Fortunella are nested in the genus of Citrus and the independent species status of F. venosa is supported robustly, which is significantly different from F. japonica. These findings will help in the development of DNA barcodes that can be useful in the study of the systematics and evolution of the genus Fortunella and the family Rutaceae.
Collapse
|
42
|
Pappalardo XG, Barra V. Losing DNA methylation at repetitive elements and breaking bad. Epigenetics Chromatin 2021; 14:25. [PMID: 34082816 PMCID: PMC8173753 DOI: 10.1186/s13072-021-00400-z] [Citation(s) in RCA: 44] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2021] [Accepted: 05/21/2021] [Indexed: 02/08/2023] Open
Abstract
Background DNA methylation is an epigenetic chromatin mark that allows heterochromatin formation and gene silencing. It has a fundamental role in preserving genome stability (including chromosome stability) by controlling both gene expression and chromatin structure. Therefore, the onset of an incorrect pattern of DNA methylation is potentially dangerous for the cells. This is particularly important with respect to repetitive elements, which constitute the third of the human genome. Main body Repetitive sequences are involved in several cell processes, however, due to their intrinsic nature, they can be a source of genome instability. Thus, most repetitive elements are usually methylated to maintain a heterochromatic, repressed state. Notably, there is increasing evidence showing that repetitive elements (satellites, long interspersed nuclear elements (LINEs), Alus) are frequently hypomethylated in various of human pathologies, from cancer to psychiatric disorders. Repetitive sequences’ hypomethylation correlates with chromatin relaxation and unscheduled transcription. If these alterations are directly involved in human diseases aetiology and how, is still under investigation. Conclusions Hypomethylation of different families of repetitive sequences is recurrent in many different human diseases, suggesting that the methylation status of these elements can be involved in preservation of human health. This provides a promising point of view towards the research of therapeutic strategies focused on specifically tuning DNA methylation of DNA repeats.
Collapse
Affiliation(s)
- Xena Giada Pappalardo
- Department of Biomedical and Biotechnological Sciences (BIOMETEC), University of Catania, 95125, Catania, Italy.,National Council of Research, Institute for Biomedical Research and Innovation (IRIB), Unit of Catania, 95125, Catania, Italy
| | - Viviana Barra
- Department of Biological, Chemical and Pharmaceutical Sciences and Technologies (STEBICEF), University of Palermo, 90128, Palermo, Italy.
| |
Collapse
|
43
|
What prevents mainstream evolutionists teaching the whole truth about how genomes evolve? PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2021; 165:140-152. [PMID: 33933502 DOI: 10.1016/j.pbiomolbio.2021.04.004] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/31/2020] [Revised: 03/31/2021] [Accepted: 04/26/2021] [Indexed: 01/24/2023]
Abstract
The common belief that the neo-Darwinian Modern Synthesis (MS) was buttressed by the discoveries of molecular biology is incorrect. On the contrary those discoveries have undermined the MS. This article discusses the many processes revealed by molecular studies and genome sequencing that contribute to evolution but nonetheless lie beyond the strict confines of the MS formulated in the 1940s. The core assumptions of the MS that molecular studies have discredited include the idea that DNA is intrinsically a faithful self-replicator, the one-way transfer of heritable information from nucleic acids to other cell molecules, the myth of "selfish DNA", and the existence of an impenetrable Weismann Barrier separating somatic and germ line cells. Processes fundamental to modern evolutionary theory include symbiogenesis, biosphere interactions between distant taxa (including viruses), horizontal DNA transfers, natural genetic engineering, organismal stress responses that activate intrinsic genome change operators, and macroevolution by genome restructuring (distinct from the gradual accumulation of local microevolutionary changes in the MS). These 21st Century concepts treat the evolving genome as a highly formatted and integrated Read-Write (RW) database rather than a Read-Only Memory (ROM) collection of independent gene units that change by random copying errors. Most of the discoverers of these macroevolutionary processes have been ignored in mainstream textbooks and popularizations of evolutionary biology, as we document in some detail. Ironically, we show that the active view of evolution that emerges from genomics and molecular biology is much closer to the 19th century ideas of both Darwin and Lamarck. The capacity of cells to activate evolutionary genome change under stress can account for some of the most negative clinical results in oncology, especially the sudden appearance of treatment-resistant and more aggressive tumors following therapies intended to eradicate all cancer cells. Knowing that extreme stress can be a trigger for punctuated macroevolutionary change suggests that less lethal therapies may result in longer survival times.
Collapse
|
44
|
Seferbekova Z, Zabelkin A, Yakovleva Y, Afasizhev R, Dranenko NO, Alexeev N, Gelfand MS, Bochkareva OO. High Rates of Genome Rearrangements and Pathogenicity of Shigella spp. Front Microbiol 2021; 12:628622. [PMID: 33912145 PMCID: PMC8072062 DOI: 10.3389/fmicb.2021.628622] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 03/22/2021] [Indexed: 02/01/2023] Open
Abstract
Shigella are pathogens originating within the Escherichia lineage but frequently classified as a separate genus. Shigella genomes contain numerous insertion sequences (ISs) that lead to pseudogenisation of affected genes and an increase of non-homologous recombination. Here, we study 414 genomes of E. coli and Shigella strains to assess the contribution of genomic rearrangements to Shigella evolution. We found that Shigella experienced exceptionally high rates of intragenomic rearrangements and had a decreased rate of homologous recombination compared to pathogenic and non-pathogenic E. coli. The high rearrangement rate resulted in independent disruption of syntenic regions and parallel rearrangements in different Shigella lineages. Specifically, we identified two types of chromosomally encoded E3 ubiquitin-protein ligases acquired independently by all Shigella strains that also showed a high level of sequence conservation in the promoter and further in the 5′-intergenic region. In the only available enteroinvasive E. coli (EIEC) strain, which is a pathogenic E. coli with a phenotype intermediate between Shigella and non-pathogenic E. coli, we found a rate of genome rearrangements comparable to those in other E. coli and no functional copies of the two Shigella-specific E3 ubiquitin ligases. These data indicate that the accumulation of ISs influenced many aspects of genome evolution and played an important role in the evolution of intracellular pathogens. Our research demonstrates the power of comparative genomics-based on synteny block composition and an important role of non-coding regions in the evolution of genomic islands.
Collapse
Affiliation(s)
- Zaira Seferbekova
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia.,Institute for Information Transmission Problems (The Kharkevich Institute, RAS), Moscow, Russia
| | - Alexey Zabelkin
- Computer Technologies Laboratory, ITMO University, Saint Petersburg, Russia.,JetBrains Research, Saint Petersburg, Russia.,Bioinformatics Institute, Saint Petersburg, Russia
| | - Yulia Yakovleva
- Bioinformatics Institute, Saint Petersburg, Russia.,Department of Cytology and Histology, Saint Petersburg State University, Saint Petersburg, Russia
| | - Robert Afasizhev
- Institute for Information Transmission Problems (The Kharkevich Institute, RAS), Moscow, Russia
| | - Natalia O Dranenko
- Institute for Information Transmission Problems (The Kharkevich Institute, RAS), Moscow, Russia
| | - Nikita Alexeev
- Computer Technologies Laboratory, ITMO University, Saint Petersburg, Russia
| | - Mikhail S Gelfand
- Institute for Information Transmission Problems (The Kharkevich Institute, RAS), Moscow, Russia.,Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Olga O Bochkareva
- Institute for Information Transmission Problems (The Kharkevich Institute, RAS), Moscow, Russia.,Institute of Science and Technology (IST Austria), Klosterneuburg, Austria
| |
Collapse
|
45
|
Telonis AG, Rigoutsos I. The transcriptional trajectories of pluripotency and differentiation comprise genes with antithetical architecture and repetitive-element content. BMC Biol 2021; 19:60. [PMID: 33765992 PMCID: PMC7995781 DOI: 10.1186/s12915-020-00928-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Accepted: 11/18/2020] [Indexed: 12/12/2022] Open
Abstract
Background Extensive molecular differences exist between proliferative and differentiated cells. Here, we conduct a meta-analysis of publicly available transcriptomic datasets from preimplantation and differentiation stages examining the architectural properties and content of genes whose abundance changes significantly across developmental time points. Results Analysis of preimplantation embryos from human and mouse showed that short genes whose introns are enriched in Alu (human) and B (mouse) elements, respectively, have higher abundance in the blastocyst compared to the zygote. These highly expressed genes encode ribosomal proteins or metabolic enzymes. On the other hand, long genes whose introns are depleted in repetitive elements have lower abundance in the blastocyst and include genes from signaling pathways. Additionally, the sequences of the genes that are differentially expressed between the blastocyst and the zygote contain distinct collections of pyknon motifs that differ between up- and down-regulated genes. Further examination of the genes that participate in the stem cell-specific protein interaction network shows that their introns are short and enriched in Alu (human) and B (mouse) elements. As organogenesis progresses, in both human and mouse, we find that the primarily short and repeat-rich expressed genes make way for primarily longer, repeat-poor genes. With that in mind, we used a machine learning-based approach to identify gene signatures able to classify human adult tissues: we find that the most discriminatory genes comprising these signatures have long introns that are repeat-poor and include transcription factors and signaling-cascade genes. The introns of widely expressed genes across human tissues, on the other hand, are short and repeat-rich, and coincide with those with the highest expression at the blastocyst stage. Conclusions Protein-coding genes that are characteristic of each trajectory, i.e., proliferation/pluripotency or differentiation, exhibit antithetical biases in their intronic and exonic lengths and in their repetitive-element content. While the respective human and mouse gene signatures are functionally and evolutionarily conserved, their introns and exons are enriched or depleted in organism-specific repetitive elements. We posit that these organism-specific repetitive sequences found in exons and introns are used to effect the corresponding genes’ regulation. Supplementary Information The online version contains supplementary material available at 10.1186/s12915-020-00928-8.
Collapse
Affiliation(s)
- Aristeidis G Telonis
- Computational Medicine Center, Sidney Kimmel College of Medicine, Thomas Jefferson University, 1020 Locust Street, Suite M81, Philadelphia, PA, 19107, USA. .,Department of Human Genetics, Miller School of Medicine, University of Miami, Miami, FL, 33136, USA.
| | - Isidore Rigoutsos
- Computational Medicine Center, Sidney Kimmel College of Medicine, Thomas Jefferson University, 1020 Locust Street, Suite M81, Philadelphia, PA, 19107, USA.
| |
Collapse
|
46
|
Su X, Jiao R, Liu Z, Xia Y, Cao Y. Functional and characteristic analysis of an appressorium-specific promoter PMagas1 in Metarhizium acridum. J Invertebr Pathol 2021; 182:107565. [PMID: 33676966 DOI: 10.1016/j.jip.2021.107565] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Revised: 01/07/2021] [Accepted: 02/15/2021] [Indexed: 11/17/2022]
Abstract
Entomopathogenic fungi have been used as important biological control agents throughout the world. To improve the biocontrol efficacy of entomopathogenic fungi, many genes have been used to improve fungal virulence or tolerance to adverse conditions via modulating their expression with strong promoters. The Magas1 gene is specifically expressed during appressorium formation and contributes to the virulence in Metarhizium acridum. In this study, we analyzed the functional region of the promoter of Magas1 gene (PMagas1) in M. acridum using 5'-deletion technique with enhanced green fluoresces protein (EGFP) as a reporter. Results showed the full length of the PMagas1 was at least 897 bp. Two regions (-897 to -611 bp and -392 to -328 bp) were essential for the activity of PMagas1. An engineered M. acridum strain was constructed with PMagas1 driving the expression of a subtilisin-like proteinase gene Pr1A (PMagas1-PR1A). Bioassay showed that the virulence was significantly increased in PMagas1-PR1A strain compared to wild type strain. Pmagas1 promoter is suitable for the overexpression of some genes during the infection of entomopathogenic fungi, which avoids the waste of nutritional resources and the influence on other fungal characteristics during the saprophytic process of constitutive promoter.
Collapse
Affiliation(s)
- Xueling Su
- School of Life Sciences, Chongqing University, Chongqing 401331, People's Republic of China; Chongqing Engineering Research Center for Fungal Insecticides, Chongqing 401331, People's Republic of China; Key Laboratory of Gene Function and Regulation Technologies under Chongqing Municipal Education Commission, Chongqing, People's Republic of China
| | - Run Jiao
- School of Life Sciences, Chongqing University, Chongqing 401331, People's Republic of China; Chongqing Engineering Research Center for Fungal Insecticides, Chongqing 401331, People's Republic of China; Key Laboratory of Gene Function and Regulation Technologies under Chongqing Municipal Education Commission, Chongqing, People's Republic of China
| | - Zhe Liu
- School of Life Sciences, Chongqing University, Chongqing 401331, People's Republic of China; Chongqing Engineering Research Center for Fungal Insecticides, Chongqing 401331, People's Republic of China; Key Laboratory of Gene Function and Regulation Technologies under Chongqing Municipal Education Commission, Chongqing, People's Republic of China
| | - Yuxian Xia
- School of Life Sciences, Chongqing University, Chongqing 401331, People's Republic of China; Chongqing Engineering Research Center for Fungal Insecticides, Chongqing 401331, People's Republic of China; Key Laboratory of Gene Function and Regulation Technologies under Chongqing Municipal Education Commission, Chongqing, People's Republic of China
| | - Yueqing Cao
- School of Life Sciences, Chongqing University, Chongqing 401331, People's Republic of China; Chongqing Engineering Research Center for Fungal Insecticides, Chongqing 401331, People's Republic of China; Key Laboratory of Gene Function and Regulation Technologies under Chongqing Municipal Education Commission, Chongqing, People's Republic of China.
| |
Collapse
|
47
|
Yang W, Guo Y, Ni W, Tian T, Jin L, Liu J, Li Z, Ren A, Wang L. Hypermethylation of WNT3A gene and non-syndromic cleft lip and/or palate in association with in utero exposure to lead: A mediation analysis. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2021; 208:111415. [PMID: 33091767 DOI: 10.1016/j.ecoenv.2020.111415] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Revised: 09/21/2020] [Accepted: 09/23/2020] [Indexed: 06/11/2023]
Abstract
OBJECTIVES We aim to investigate association between WNT3A methylation and risk of non-syndromic cleft lip and/or palate (NSCL/P), and examine mediating effect of WNT3A methylation on the association of NSCL/P and lead (Pb) exposure in fetuses. METHODS DNA methylation of WNT3A in umbilical cord blood was determined among 59 NSCL/P cases and 118 non-malformed controls. Mediation analysis was performed to evaluate the potential mediating effect of WNT3A methylation on association between concentrations of Pb in umbilical cord and risk for NSCL/P. Additionally, an animal experiment in which cleft palates were induced by lead acetate was conducted. RESULTS The overall average methylation level of WNT3A was significant higher in NSCL/P cases as compared to controls. The risk for NSCL/P was increased by 1.90-fold with hypermethylation of WNT3A. Significant correlation was observed between concentrations of Pb in umbilical cord and methylation level of WNT3A. The hypermethylation of WNT3A had a mediating effect by 9.32% of total effect of Pb on NSCL/P risk. Gender-specific association between WNT3A methylation and NSCL/P was observed in male fetuses, and the percentage of the mediating effect increased to 14.28%. Animal experiment of mice showed that maternal oral exposure to lead acetate may result in cleft palate in offspring. CONCLUSION Hypermethylation of WNT3A was associated with the risk for NSCL/P and may be partly explain the association between exposure to Pb and risk for NSCL/P. The teratogenic and fetotoxic effects of Pb were found in mice.
Collapse
Affiliation(s)
- Wenlei Yang
- Institute of Reproductive and Child Health, NHC Key Laboratory of Reproductive Health, Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, Beijing, China
| | - Yingnan Guo
- Institute of Reproductive and Child Health, NHC Key Laboratory of Reproductive Health, Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, Beijing, China
| | - Wenli Ni
- Institute of Reproductive and Child Health, NHC Key Laboratory of Reproductive Health, Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, Beijing, China
| | - Tian Tian
- Institute of Reproductive and Child Health, NHC Key Laboratory of Reproductive Health, Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, Beijing, China
| | - Lei Jin
- Institute of Reproductive and Child Health, NHC Key Laboratory of Reproductive Health, Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, Beijing, China
| | - Jufen Liu
- Institute of Reproductive and Child Health, NHC Key Laboratory of Reproductive Health, Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, Beijing, China
| | - Zhiwen Li
- Institute of Reproductive and Child Health, NHC Key Laboratory of Reproductive Health, Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, Beijing, China
| | - Aiguo Ren
- Institute of Reproductive and Child Health, NHC Key Laboratory of Reproductive Health, Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, Beijing, China
| | - Linlin Wang
- Institute of Reproductive and Child Health, NHC Key Laboratory of Reproductive Health, Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, Beijing, China.
| |
Collapse
|
48
|
Itsko M, Abu YB. Novel type of coagulation equation and its application to DNA repeat expansion process. J Theor Biol 2020; 511:110555. [PMID: 33346021 DOI: 10.1016/j.jtbi.2020.110555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Revised: 11/04/2020] [Accepted: 12/02/2020] [Indexed: 11/18/2022]
Abstract
DNA molecules containing repetitive motifs are prone to expand in their lengths. Once there appear a head to tail tandem of two identical DNA sequences in the system, they can propagate indefinitely by the mechanism involving cycles of staggered annealing of complementary DNA strands of variable lengths and polymerase mediated filling-in of the generated overhangs. Microgene Polymerization Reaction (MPR) is an experimental model for expansion of short repetitive DNA to longer lengths. The testable kinetic model of (MPR) was formulated and solved numerically by Itsko et al. in Kinetics of Repeat Propagation in the Microgene Polymerization Reaction (2009). Here, the simple cases of MPR were solved analytically using modified Smoluchowski coagulation equation. It was found that the repeats propagate according to Gumbel probability density function when the distribution of lengths of obtained polymers follows inverted Gumbel probability density function.
Collapse
Affiliation(s)
- Mark Itsko
- WDS Inc., Contractor to Centers for Disease Control and Prevention, 1600 Clifton Road, Atlanta, GA 30033, USA.
| | - Yuval Ben Abu
- Department of Physics and Project Unit, Sapir Academic College, Sderot, Hof Ashkelon 79165, Israel; Clarendon Laboratory, Department of Physics, University of Oxford, UK.
| |
Collapse
|
49
|
Penke L, Denissen JJA, Miller GF. The evolutionary genetics of personality. EUROPEAN JOURNAL OF PERSONALITY 2020. [DOI: 10.1002/per.629] [Citation(s) in RCA: 391] [Impact Index Per Article: 97.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Genetic influences on personality differences are ubiquitous, but their nature is not well understood. A theoretical framework might help, and can be provided by evolutionary genetics. We assess three evolutionary genetic mechanisms that could explain genetic variance in personality differences: selective neutrality, mutation‐selection balance, and balancing selection. Based on evolutionary genetic theory and empirical results from behaviour genetics and personality psychology, we conclude that selective neutrality is largely irrelevant, that mutation‐selection balance seems best at explaining genetic variance in intelligence, and that balancing selection by environmental heterogeneity seems best at explaining genetic variance in personality traits. We propose a general model of heritable personality differences that conceptualises intelligence as fitness components and personality traits as individual reaction norms of genotypes across environments, with different fitness consequences in different environmental niches. We also discuss the place of mental health in the model. This evolutionary genetic framework highlights the role of gene‐environment interactions in the study of personality, yields new insight into the person‐situation‐debate and the structure of personality, and has practical implications for both quantitative and molecular genetic studies of personality. Copyright © 2007 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Lars Penke
- Humboldt University, Berlin, Germany
- International Max Planck Research School LIFE, Berlin, Germany
| | | | | |
Collapse
|
50
|
Pereira F. Evolutionary dynamics of the SARS-CoV-2 ORF8 accessory gene. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2020; 85:104525. [PMID: 32890763 PMCID: PMC7467077 DOI: 10.1016/j.meegid.2020.104525] [Citation(s) in RCA: 72] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/30/2020] [Revised: 08/28/2020] [Accepted: 08/29/2020] [Indexed: 01/08/2023]
Abstract
The new SARS-CoV-2 poses a significant threat to human health but many aspects of its basic biology remain unknown. Its genome encodes accessory genes that differ significantly within coronaviruses and contribute to the virus pathogenicity. Among accessory genes, open reading frame 8 (ORF8) stands out by being highly variable and showing structural changes suspected to be related with the virus ability to spread. However, the function of ORF8 remains to be elucidated, making it less studied than other SARS-CoV-2 genes. Here I show that ORF8 is poorly conserved among related coronaviruses. The ORF8 phylogeny built using 11,113 SARS-CoV-2 sequences revealed traces of a typical expanding population with a small number of highly frequent lineages. Interestingly, I detected several nonsense mutations and three main deletions in the ORF8 gene that either remove or significantly change the ORF8 protein. These findings suggest that SARS-CoV-2 can persist without a functional ORF8 protein. Deletion breakpoints were found located in predicted hairpins suggesting a possible involvement of these elements in the rearrangement process. Although the function of ORF8 remains to be elucidated, its structural plasticity and high diversity suggest an important role in SARS-CoV-2 pathogenicity.
Collapse
Affiliation(s)
- Filipe Pereira
- Departamento de Ciências da Vida, Universidade de Coimbra. Calçada Martim de Freitas, 3000-456 Coimbra, Portugal; IDENTIFICA, Science and Technology Park of the University of Porto - UPTEC, Rua Alfredo Allen, N.°455/461, 4200-135 Porto, Portugal..
| |
Collapse
|