Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Gough J, Karplus K, Hughey R, Chothia C. Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure. J Mol Biol 2001;313:903-19. [PMID: 11697912 DOI: 10.1006/jmbi.2001.5080] [Citation(s) in RCA: 854] [Impact Index Per Article: 37.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

For:	Gough J, Karplus K, Hughey R, Chothia C. Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure. J Mol Biol 2001;313:903-19. [PMID: 11697912 DOI: 10.1006/jmbi.2001.5080] [Citation(s) in RCA: 854] [Impact Index Per Article: 37.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Number

Cited by Other Article(s)

Jebastin T, Syed Abuthakir M, Santhoshi I, Gnanaraj M, Gatasheh MK, Ahamed A, Sharmila V. Unveiling the mysteries: Functional insights into hypothetical proteins from Bacteroides fragilis 638R. Heliyon 2024;10:e31713. [PMID: 38832264 PMCID: PMC11145332 DOI: 10.1016/j.heliyon.2024.e31713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Revised: 05/21/2024] [Accepted: 05/21/2024] [Indexed: 06/05/2024] Open

Zhang X, Liu M, Li Z, Zhuo L, Fu X, Zou Q. Fusion of multi-source relationships and topology to infer lncRNA-protein interactions. MOLECULAR THERAPY. NUCLEIC ACIDS 2024;35:102187. [PMID: 38706631 PMCID: PMC11066462 DOI: 10.1016/j.omtn.2024.102187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 04/03/2024] [Indexed: 05/07/2024]

Elisée E, Ducrot L, Méheust R, Bastard K, Fossey-Jouenne A, Grogan G, Pelletier E, Petit JL, Stam M, de Berardinis V, Zaparucha A, Vallenet D, Vergne-Vaxelaire C. A refined picture of the native amine dehydrogenase family revealed by extensive biodiversity screening. Nat Commun 2024;15:4933. [PMID: 38858403 PMCID: PMC11164908 DOI: 10.1038/s41467-024-49009-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 05/20/2024] [Indexed: 06/12/2024] Open

Affiliation(s)

Eddy Elisée Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
Laurine Ducrot Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
Raphaël Méheust Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
Karine Bastard School of Pharmacy, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, 2006, Australia
Aurélie Fossey-Jouenne Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
Gideon Grogan York Structural Biology Laboratory, Department of Chemistry, University of York, Heslington, York, YO10 5DD, UK
Eric Pelletier Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
Jean-Louis Petit Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
Mark Stam Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
Véronique de Berardinis Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
Anne Zaparucha Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
David Vallenet Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France.
Carine Vergne-Vaxelaire Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France.

Collapse

Joglekar A, Hu W, Zhang B, Narykov O, Diekhans M, Marrocco J, Balacco J, Ndhlovu LC, Milner TA, Fedrigo O, Jarvis ED, Sheynkman G, Korkin D, Ross ME, Tilgner HU. Single-cell long-read sequencing-based mapping reveals specialized splicing patterns in developing and adult mouse and human brain. Nat Neurosci 2024;27:1051-1063. [PMID: 38594596 PMCID: PMC11156538 DOI: 10.1038/s41593-024-01616-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2023] [Accepted: 03/07/2024] [Indexed: 04/11/2024]

Affiliation(s)

Anoushka Joglekar Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA Center for Neurogenetics, Weill Cornell Medicine, New York, NY, USA New York Genome Center, New York, NY, USA
Wen Hu Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA Center for Neurogenetics, Weill Cornell Medicine, New York, NY, USA
Bei Zhang Spatial Genomics, Inc., Pasadena, CA, USA
Oleksandr Narykov Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA, USA Computer Science Department, Worcester Polytechnic Institute, Worcester, MA, USA Data Science Program, Worcester Polytechnic Institute, Worcester, MA, USA
Mark Diekhans UC Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
Jordan Marrocco Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA Department of Biology, Touro University, New York, NY, USA Laboratory of Neuroendocrinology, The Rockefeller University, New York, NY, USA
Jennifer Balacco Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
Lishomwa C Ndhlovu Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA Department of Medicine, Division of Infectious Diseases, Weill Cornell Medicine, New York, NY, USA
Teresa A Milner Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA
Olivier Fedrigo Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
Erich D Jarvis Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA Howard Hughes Medical Institute, Chevy Chase, MD, USA
Gloria Sheynkman Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA UVA Comprehensive Cancer Center, University of Virginia, Charlottesville, VA, USA
Dmitry Korkin Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA, USA Computer Science Department, Worcester Polytechnic Institute, Worcester, MA, USA Data Science Program, Worcester Polytechnic Institute, Worcester, MA, USA
M Elizabeth Ross Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA Center for Neurogenetics, Weill Cornell Medicine, New York, NY, USA
Hagen U Tilgner Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA. Center for Neurogenetics, Weill Cornell Medicine, New York, NY, USA.

Collapse

Xu X, Yin K, Wu R. Systematic Investigation of the Trafficking of Glycoproteins on the Cell Surface. Mol Cell Proteomics 2024;23:100761. [PMID: 38593903 PMCID: PMC11087972 DOI: 10.1016/j.mcpro.2024.100761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Revised: 03/30/2024] [Accepted: 04/03/2024] [Indexed: 04/11/2024] Open

Abstract

Glycoproteins located on the cell surface play a pivotal role in nearly every extracellular activity. N-glycosylation is one of the most common and important protein modifications in eukaryotic cells, and it often regulates protein folding and trafficking. Glycosylation of cell-surface proteins undergoes meticulous regulation by various enzymes in the endoplasmic reticulum (ER) and the Golgi, ensuring their proper folding and trafficking to the cell surface. However, the impacts of protein N-glycosylation, N-glycan maturity, and protein folding status on the trafficking of cell-surface glycoproteins remain to be explored. In this work, we comprehensively and site-specifically studied the trafficking of cell-surface glycoproteins in human cells. Integrating metabolic labeling, bioorthogonal chemistry, and multiplexed proteomics, we investigated 706 N-glycosylation sites on 396 cell-surface glycoproteins in monocytes, either by inhibiting protein N-glycosylation, disturbing N-glycan maturation, or perturbing protein folding in the ER. The current results reveal their distinct impacts on the trafficking of surface glycoproteins. The inhibition of protein N-glycosylation dramatically suppresses the trafficking of many cell-surface glycoproteins. The N-glycan immaturity has more substantial effects on proteins with high N-glycosylation site densities, while the perturbation of protein folding in the ER exerts a more pronounced impact on surface glycoproteins with larger sizes. Furthermore, for N-glycosylated proteins, their trafficking to the cell surface is related to the secondary structures and adjacent amino acid residues of glycosylation sites. Systematic analysis of surface glycoprotein trafficking advances our understanding of the mechanisms underlying protein secretion and surface presentation.

Collapse

Caetano-Anollés K, Aziz MF, Mughal F, Caetano-Anollés G. On Protein Loops, Prior Molecular States and Common Ancestors of Life. J Mol Evol 2024:10.1007/s00239-024-10167-y. [PMID: 38652291 DOI: 10.1007/s00239-024-10167-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Accepted: 03/22/2024] [Indexed: 04/25/2024]

Peters DL, Gaudreault F, Chen W. Functional domains of Acinetobacter bacteriophage tail fibers. Front Microbiol 2024;15:1230997. [PMID: 38690360 PMCID: PMC11058221 DOI: 10.3389/fmicb.2024.1230997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Accepted: 03/08/2024] [Indexed: 05/02/2024] Open

Álvarez-Campos P, García-Castro H, Emili E, Pérez-Posada A, Del Olmo I, Peron S, Salamanca-Díaz DA, Mason V, Metzger B, Bely AE, Kenny NJ, Özpolat BD, Solana J. Annelid adult cell type diversity and their pluripotent cellular origins. Nat Commun 2024;15:3194. [PMID: 38609365 PMCID: PMC11014941 DOI: 10.1038/s41467-024-47401-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Accepted: 03/27/2024] [Indexed: 04/14/2024] Open

Affiliation(s)

Patricia Álvarez-Campos Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK. Centro de Investigación en Biodiversidad y Cambio Global (CIBC-UAM) & Departamento de Biología (Zoología), Facultad de Ciencias, Universidad Autónoma de Madrid, Madrid, Spain.
Helena García-Castro Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK Living Systems Institute, University of Exeter, Exeter, UK
Elena Emili Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
Alberto Pérez-Posada Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK Living Systems Institute, University of Exeter, Exeter, UK
Irene Del Olmo Centro de Investigación en Biodiversidad y Cambio Global (CIBC-UAM) & Departamento de Biología (Zoología), Facultad de Ciencias, Universidad Autónoma de Madrid, Madrid, Spain
Sophie Peron Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK Living Systems Institute, University of Exeter, Exeter, UK
David A Salamanca-Díaz Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK Living Systems Institute, University of Exeter, Exeter, UK
Vincent Mason Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK
Bria Metzger Eugene Bell Center for Regenerative Biology and Tissue Engineering, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA, 05432, USA Department of Biology, Washington University in St. Louis. 1 Brookings Dr. Saint Louis, Saint Louis, MO, 63130, USA
Alexandra E Bely Department of Biology, University of Maryland, College Park, MD, 20742, USA
Nathan J Kenny Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK Department of Biochemistry, University of Otago, P.O. Box 56, Dunedin, Aotearoa, New Zealand
B Duygu Özpolat Eugene Bell Center for Regenerative Biology and Tissue Engineering, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA, 05432, USA. Department of Biology, Washington University in St. Louis. 1 Brookings Dr. Saint Louis, Saint Louis, MO, 63130, USA.
Jordi Solana Department of Biological and Medical Sciences, Oxford Brookes University, Oxford, UK. Living Systems Institute, University of Exeter, Exeter, UK.

Collapse

Frey B, Aiesi M, Rast BM, Rüthi J, Julmi J, Stierli B, Qi W, Brunner I. Searching for new plastic-degrading enzymes from the plastisphere of alpine soils using a metagenomic mining approach. PLoS One 2024;19:e0300503. [PMID: 38578779 PMCID: PMC10997104 DOI: 10.1371/journal.pone.0300503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 02/28/2024] [Indexed: 04/07/2024] Open

Abstract

Plastic materials, including microplastics, accumulate in all types of ecosystems, even in remote and cold environments such as the European Alps. This pollution poses a risk for the environment and humans and needs to be addressed. Using shotgun DNA metagenomics of soils collected in the eastern Swiss Alps at about 3,000 m a.s.l., we identified genes and their proteins that potentially can degrade plastics. We screened the metagenomes of the plastisphere and the bulk soil with a differential abundance analysis, conducted similarity-based screening with specific databases dedicated to putative plastic-degrading genes, and selected those genes with a high probability of signal peptides for extracellular export and a high confidence for functional domains. This procedure resulted in a final list of nine candidate genes. The lengths of the predicted proteins were between 425 and 845 amino acids, and the predicted genera producing these proteins belonged mainly to Caballeronia and Bradyrhizobium. We applied functional validation, using heterologous expression followed by enzymatic assays of the supernatant. Five of the nine proteins tested showed significantly increased activities when we used an esterase assay, and one of these five proteins from candidate genes, a hydrolase-type esterase, clearly had the highest activity, by more than double. We performed the fluorescence assays for plastic degradation of the plastic types BI-OPL and ecovio® only with proteins from the five candidate genes that were positively active in the esterase assay, but like the negative controls, these did not show any significantly increased activity. In contrast, the activity of the positive control, which contained a PLA-degrading gene insert known from the literature, was more than 20 times higher than that of the negative controls. These findings suggest that in silico screening followed by functional validation is suitable for finding new plastic-degrading enzymes. Although we only found one new esterase enzyme, our approach has the potential to be applied to any type of soil and to plastics in various ecosystems to search rapidly and efficiently for new plastic-degrading enzymes.

Collapse

Shen C, Mao D, Tang J, Liao Z, Chen S. Prediction of LncRNA-Protein Interactions Based on Kernel Combinations and Graph Convolutional Networks. IEEE J Biomed Health Inform 2024;28:1937-1948. [PMID: 37327093 DOI: 10.1109/jbhi.2023.3286917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]

Rondón JJ, Pisarenco VA, Ramón Pardos-Blas J, Sánchez-Gracia A, Zardoya R, Rozas J. Comparative genomic analysis of chemosensory-related gene families in gastropods. Mol Phylogenet Evol 2024;192:107986. [PMID: 38142794 DOI: 10.1016/j.ympev.2023.107986] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 11/24/2023] [Accepted: 12/07/2023] [Indexed: 12/26/2023]

Botkin JR, Farmer AD, Young ND, Curtin SJ. Genome assembly of Medicago truncatula accession SA27063 provides insight into spring black stem and leaf spot disease resistance. BMC Genomics 2024;25:204. [PMID: 38395768 PMCID: PMC10885650 DOI: 10.1186/s12864-024-10112-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Accepted: 02/10/2024] [Indexed: 02/25/2024] Open

Abstract

Medicago truncatula, model legume and alfalfa relative, has served as an essential resource for advancing our understanding of legume physiology, functional genetics, and crop improvement traits. Necrotrophic fungus, Ascochyta medicaginicola, the causal agent of spring black stem (SBS) and leaf spot is a devasting foliar disease of alfalfa affecting stand survival, yield, and forage quality. Host resistance to SBS disease is poorly understood, and control methods rely on cultural practices. Resistance has been observed in M. truncatula accession SA27063 (HM078) with two recessively inherited quantitative-trait loci (QTL), rnpm1 and rnpm2, previously reported. To shed light on host resistance, we carried out a de novo genome assembly of HM078. The genome, referred to as MtHM078 v1.0, is comprised of 23 contigs totaling 481.19 Mbp. Notably, this assembly contains a substantial amount of novel centromere-related repeat sequences due to deep long-read sequencing. Genome annotation resulted in 98.4% of BUSCO fabales proteins being complete. The assembly enabled sequence-level analysis of rnpm1 and rnpm2 for gene content, synteny, and structural variation between SBS-resistant accession SA27063 (HM078) and SBS-susceptible accession A17 (HM101). Fourteen candidate genes were identified, and some have been implicated in resistance to necrotrophic fungi. Especially interesting candidates include loss-of-function events in HM078 because they fit the inverse gene-for-gene model, where resistance is recessively inherited. In rnpm1, these include a loss-of-function in a disease resistance gene due to a premature stop codon, and a 10.85 kbp retrotransposon-like insertion disrupting a ubiquitin conjugating E2. In rnpm2, we identified a frameshift mutation causing a loss-of-function in a glycosidase, as well as a missense and frameshift mutation altering an F-box family protein. This study generated a high-quality genome of HM078 and has identified promising candidates, that once validated, could be further studied in alfalfa to enhance disease resistance.

Collapse

Schaeffer RD, Zhang J, Medvedev KE, Kinch LN, Cong Q, Grishin NV. ECOD domain classification of 48 whole proteomes from AlphaFold Structure Database using DPAM2. PLoS Comput Biol 2024;20:e1011586. [PMID: 38416793 PMCID: PMC10927120 DOI: 10.1371/journal.pcbi.1011586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 03/11/2024] [Accepted: 02/20/2024] [Indexed: 03/01/2024] Open

Viner C, Ishak CA, Johnson J, Walker NJ, Shi H, Sjöberg-Herrera MK, Shen SY, Lardo SM, Adams DJ, Ferguson-Smith AC, De Carvalho DD, Hainer SJ, Bailey TL, Hoffman MM. Modeling methyl-sensitive transcription factor motifs with an expanded epigenetic alphabet. Genome Biol 2024;25:11. [PMID: 38191487 PMCID: PMC10773111 DOI: 10.1186/s13059-023-03070-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 09/21/2023] [Indexed: 01/10/2024] Open

Dobson L, Gerdán C, Tusnády S, Szekeres L, Kuffa K, Langó T, Zeke A, Tusnády GE. UniTmp: unified resources for transmembrane proteins. Nucleic Acids Res 2024;52:D572-D578. [PMID: 37870462 PMCID: PMC10767979 DOI: 10.1093/nar/gkad897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 10/03/2023] [Accepted: 10/04/2023] [Indexed: 10/24/2023] Open

Ali A, Unar A, Muhammad Z, Dil S, Zhang B, Sadaf H, Khan M, Ali M, Khan R, Shah KMB, Ma A, Jiang X, Zhang Y, Zhang H, Shi Q. A novel NPHP4 homozygous missense variant identified in infertile brothers with multiple morphological abnormalities of the sperm flagella. J Assist Reprod Genet 2024;41:109-120. [PMID: 37831349 PMCID: PMC10789708 DOI: 10.1007/s10815-023-02966-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Accepted: 10/03/2023] [Indexed: 10/14/2023] Open

Affiliation(s)

Asim Ali Division of Reproduction and Genetics, The First Affiliated Hospital of University of Science and Technology of China, School of Basic Medical Sciences, Division of Life Sciences and Medicine, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China, Hefei, 230027, China. Department of Biotechnology, COMSATS University Islamabad, Abbottabad Campus, Abbottabad, 22060, Pakistan.
Ahsanullah Unar Division of Reproduction and Genetics, The First Affiliated Hospital of University of Science and Technology of China, School of Basic Medical Sciences, Division of Life Sciences and Medicine, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China, Hefei, 230027, China
Zubair Muhammad Division of Reproduction and Genetics, The First Affiliated Hospital of University of Science and Technology of China, School of Basic Medical Sciences, Division of Life Sciences and Medicine, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China, Hefei, 230027, China
Sobia Dil Division of Reproduction and Genetics, The First Affiliated Hospital of University of Science and Technology of China, School of Basic Medical Sciences, Division of Life Sciences and Medicine, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China, Hefei, 230027, China
Beibei Zhang Division of Reproduction and Genetics, The First Affiliated Hospital of University of Science and Technology of China, School of Basic Medical Sciences, Division of Life Sciences and Medicine, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China, Hefei, 230027, China
Humaira Sadaf Department of Obstetrics and Gynecology, Ayub Medical Hospital Complex, Abbottabad, Pakistan
Manan Khan Department of Biotechnology and Genetic Engineering, Hazara University, Mansehra, Pakistan
Muhammad Ali Department of Biotechnology, COMSATS University Islamabad, Abbottabad Campus, Abbottabad, 22060, Pakistan
Ranjha Khan Division of Reproduction and Genetics, The First Affiliated Hospital of University of Science and Technology of China, School of Basic Medical Sciences, Division of Life Sciences and Medicine, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China, Hefei, 230027, China
Kakakhel Mian Basit Shah Department of Biotechnology, COMSATS University Islamabad, Abbottabad Campus, Abbottabad, 22060, Pakistan
Ao Ma Division of Reproduction and Genetics, The First Affiliated Hospital of University of Science and Technology of China, School of Basic Medical Sciences, Division of Life Sciences and Medicine, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China, Hefei, 230027, China
Xiaohua Jiang Division of Reproduction and Genetics, The First Affiliated Hospital of University of Science and Technology of China, School of Basic Medical Sciences, Division of Life Sciences and Medicine, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China, Hefei, 230027, China
Yuanwei Zhang Division of Reproduction and Genetics, The First Affiliated Hospital of University of Science and Technology of China, School of Basic Medical Sciences, Division of Life Sciences and Medicine, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China, Hefei, 230027, China
Huan Zhang Division of Reproduction and Genetics, The First Affiliated Hospital of University of Science and Technology of China, School of Basic Medical Sciences, Division of Life Sciences and Medicine, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China, Hefei, 230027, China
Qinghua Shi Division of Reproduction and Genetics, The First Affiliated Hospital of University of Science and Technology of China, School of Basic Medical Sciences, Division of Life Sciences and Medicine, Biomedical Sciences and Health Laboratory of Anhui Province, University of Science and Technology of China, Hefei, 230027, China.

Collapse

Romei M, Carpentier M, Chomilier J, Lecointre G. Origins and Functional Significance of Eukaryotic Protein Folds. J Mol Evol 2023;91:854-864. [PMID: 38060007 DOI: 10.1007/s00239-023-10136-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 10/03/2023] [Indexed: 12/08/2023]

Si D, Sun J, Guo L, Yang F, Tian X, He S, Li J. Hypothetical Proteins of Mycoplasma synoviae Reannotation and Expression Changes Identified via RNA-Sequencing. Microorganisms 2023;11:2716. [PMID: 38004728 PMCID: PMC10673309 DOI: 10.3390/microorganisms11112716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 10/25/2023] [Accepted: 11/01/2023] [Indexed: 11/26/2023] Open

Aziz MF, Mughal F, Caetano-Anollés G. Tracing the birth of structural domains from loops during protein evolution. Sci Rep 2023;13:14688. [PMID: 37673948 PMCID: PMC10482863 DOI: 10.1038/s41598-023-41556-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2022] [Accepted: 08/28/2023] [Indexed: 09/08/2023] Open

Mayo-Pérez S, Gama-Martínez Y, Dávila S, Rivera N, Hernández-Lucas I. LysR-type transcriptional regulators: state of the art. Crit Rev Microbiol 2023:1-33. [PMID: 37635411 DOI: 10.1080/1040841x.2023.2247477] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Revised: 08/03/2023] [Accepted: 08/08/2023] [Indexed: 08/29/2023]

Riley R, Bowers RM, Camargo AP, Campbell A, Egan R, Eloe-Fadrosh EA, Foster B, Hofmeyr S, Huntemann M, Kellom M, Kimbrel JA, Oliker L, Yelick K, Pett-Ridge J, Salamov A, Varghese NJ, Clum A. Terabase-Scale Coassembly of a Tropical Soil Microbiome. Microbiol Spectr 2023;11:e0020023. [PMID: 37310219 PMCID: PMC10434106 DOI: 10.1128/spectrum.00200-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 05/24/2023] [Indexed: 06/14/2023] Open

Abstract

Petabases of environmental metagenomic data are publicly available, presenting an opportunity to characterize complex environments and discover novel lineages of life. Metagenome coassembly, in which many metagenomic samples from an environment are simultaneously analyzed to infer the underlying genomes' sequences, is an essential tool for achieving this goal. We applied MetaHipMer2, a distributed metagenome assembler that runs on supercomputing clusters, to coassemble 3.4 terabases (Tbp) of metagenome data from a tropical soil in the Luquillo Experimental Forest (LEF), Puerto Rico. The resulting coassembly yielded 39 high-quality (>90% complete, <5% contaminated, with predicted 23S, 16S, and 5S rRNA genes and ≥18 tRNAs) metagenome-assembled genomes (MAGs), including two from the candidate phylum Eremiobacterota. Another 268 medium-quality (≥50% complete, <10% contaminated) MAGs were extracted, including the candidate phyla Dependentiae, Dormibacterota, and Methylomirabilota. In total, 307 medium- or higher-quality MAGs were assigned to 23 phyla, compared to 294 MAGs assigned to nine phyla in the same samples individually assembled. The low-quality (<50% complete, <10% contaminated) MAGs from the coassembly revealed a 49% complete rare biosphere microbe from the candidate phylum FCPU426 among other low-abundance microbes, an 81% complete fungal genome from the phylum Ascomycota, and 30 partial eukaryotic MAGs with ≥10% completeness, possibly representing protist lineages. A total of 22,254 viruses, many of them low abundance, were identified. Estimation of metagenome coverage and diversity indicates that we may have characterized ≥87.5% of the sequence diversity in this humid tropical soil and indicates the value of future terabase-scale sequencing and coassembly of complex environments. IMPORTANCE Petabases of reads are being produced by environmental metagenome sequencing. An essential step in analyzing these data is metagenome assembly, the computational reconstruction of genome sequences from microbial communities. "Coassembly" of metagenomic sequence data, in which multiple samples are assembled together, enables more complete detection of microbial genomes in an environment than "multiassembly," in which samples are assembled individually. To demonstrate the potential for coassembling terabases of metagenome data to drive biological discovery, we applied MetaHipMer2, a distributed metagenome assembler that runs on supercomputing clusters, to coassemble 3.4 Tbp of reads from a humid tropical soil environment. The resulting coassembly, its functional annotation, and analysis are presented here. The coassembly yielded more, and phylogenetically more diverse, microbial, eukaryotic, and viral genomes than the multiassembly of the same data. Our resource may facilitate the discovery of novel microbial biology in tropical soils and demonstrates the value of terabase-scale metagenome sequencing.

Collapse

Affiliation(s)

Robert Riley Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
Robert M. Bowers Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
Antonio Pedro Camargo Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
Ashley Campbell Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California, USA
Rob Egan Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
Emiley A. Eloe-Fadrosh Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
Brian Foster Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
Steven Hofmeyr Applied Math and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Marcel Huntemann Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
Matthew Kellom Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
Jeffrey A. Kimbrel Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California, USA
Leonid Oliker Applied Math and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
Katherine Yelick Applied Math and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, California, USA
Jennifer Pett-Ridge Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California, USA Life & Environmental Sciences Department, University of California Merced, Merced, California, USA
Asaf Salamov Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
Neha J. Varghese Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
Alicia Clum Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA

Collapse

Bao C, Lu C, Lin J, Gough J, Fang H. The dcGO Domain-Centric Ontology Database in 2023: New Website and Extended Annotations for Protein Structural Domains. J Mol Biol 2023;435:168093. [PMID: 37061086 PMCID: PMC7614987 DOI: 10.1016/j.jmb.2023.168093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2022] [Revised: 03/24/2023] [Accepted: 04/06/2023] [Indexed: 04/17/2023]

Masum MHU, Rajia S, Bristi UP, Akter MS, Amin MR, Shishir TA, Ferdous J, Ahmed F, Rahaman MM, Saha O. In Silico Functional Characterization of a Hypothetical Protein From Pasteurella Multocida Reveals a Novel S-Adenosylmethionine-Dependent Methyltransferase Activity. Bioinform Biol Insights 2023;17:11779322231184024. [PMID: 37424709 PMCID: PMC10328030 DOI: 10.1177/11779322231184024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2023] [Accepted: 06/06/2023] [Indexed: 07/11/2023] Open

Abstract

Genomes may now be sequenced in a matter of weeks, leading to an influx of "hypothetical" proteins (HP) whose activities remain a mystery in GenBank. The information included inside these genes has quickly grown in prominence. Thus, we selected to look closely at the structure and function of an HP (AFF25514.1; 246 residues) from Pasteurella multocida (PM) subsp. multocida str. HN06. Possible insights into bacterial adaptation to new environments and metabolic changes might be gained by studying the functions of this protein. The PM HN06 2293 gene encodes an alkaline cytoplasmic protein with a molecular weight of 28352.60 Da, an isoelectric point (pI) of 9.18, and an overall average hydropathicity of around -0.565. One of its functional domains, tRNA (adenine (37)-N6)-methyltransferase TrmO, is a S-adenosylmethionine (SAM)-dependent methyltransferase (MTase), suggesting that it belongs to the Class VIII SAM-dependent MTase family. The tertiary structures represented by HHpred and I-TASSER models were found to be flawless. We predicted the model's active site using the Computed Atlas of Surface Topography of Proteins (CASTp) and FTSite servers, and then displayed it in 3 dimensional (3D) using PyMOL and BIOVIA Discovery Studio. Based on molecular docking (MD) results, we know that HP interacts with SAM and S-adenosylhomocysteine (SAH), 2 crucial metabolites in the tRNA methylation process, with binding affinities of 7.4 and 7.5 kcal/mol, respectively. Molecular dynamic simulations (MDS) of the docked complex, which included only modest structural adjustments, corroborated the strong binding affinity of SAM and SAH to the HP. Evidence for HP's possible role as an SAM-dependent MTase was therefore given by the findings of Multiple sequence alignment (MSA), MD, and molecular dynamic modeling. These in silico data suggest that the investigated HP might be used as a useful adjunct in the investigation of Pasteurella infections and the development of drugs to treat zoonotic pasteurellosis.

Collapse

Wang Y, Zhao D, Zhang W, Wang S, Wu Y, Wang S, Yang Y, Guo B. Four PQQ-Dependent Alcohol Dehydrogenases Responsible for the Oxidative Detoxification of Deoxynivalenol in a Novel Bacterium Ketogulonicigenium vulgare D3_3 Originated from the Feces of Tenebrio molitor Larvae. Toxins (Basel) 2023;15:367. [PMID: 37368668 PMCID: PMC10301637 DOI: 10.3390/toxins15060367] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 05/25/2023] [Accepted: 05/26/2023] [Indexed: 06/29/2023] Open

Álvarez-Campos P, García-Castro H, Emili E, Pérez-Posada A, Salamanca-Díaz DA, Mason V, Metzger B, Bely AE, Kenny N, Özpolat BD, Solana J. Annelid adult cell type diversity and their pluripotent cellular origins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.25.537979. [PMID: 37163014 PMCID: PMC10168269 DOI: 10.1101/2023.04.25.537979] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]

Abstract

Annelids are a broadly distributed, highly diverse, economically and environmentally important group of animals. Most species can regenerate missing body parts, and many are able to reproduce asexually. Therefore, many annelids can generate all adult cell types in adult stages. However, the putative adult stem cell populations involved in these processes, as well as the diversity of adult cell types generated by them, are still unknown. Here, we recover 75,218 single cell transcriptomes of Pristina leidyi, a highly regenerative and asexually-reproducing freshwater annelid. We characterise all major annelid adult cell types, and validate many of our observations by HCR in situ hybridisation. Our results uncover complex patterns of regionally expressed genes in the annelid gut, as well as neuronal, muscle and epidermal specific genes. We also characterise annelid-specific cell types such as the chaetal sacs and globin+ cells, and novel cell types of enigmatic affinity, including a vigilin+ cell type, a lumbrokinase+ cell type, and a diverse set of metabolic cells. Moreover, we characterise transcription factors and gene networks that are expressed specifically in these populations. Finally, we uncover a broadly abundant cluster of putative stem cells with a pluripotent signature. This population expresses well-known stem cell markers such as vasa, piwi and nanos homologues, but also shows heterogeneous expression of differentiated cell markers and their transcription factors. In these piwi+ cells, we also find conserved expression of pluripotency regulators, including multiple chromatin remodelling and epigenetic factors. Finally, lineage reconstruction analyses reveal the existence of differentiation trajectories from piwi+ cells to diverse adult types. Our data reveal the cell type diversity of adult annelids for the first time and serve as a resource for studying annelid cell types and their evolution. On the other hand, our characterisation of a piwi+ cell population with a pluripotent stem cell signature will serve as a platform for the study of annelid stem cells and their role in regeneration.

Collapse

Rahman A, Sarker MT, Islam MA, Hossain MU, Hasan M, Susmi TF. Targeting Essential Hypothetical Proteins of Pseudomonas aeruginosa PAO1 for Mining of Novel Therapeutics: An In Silico Approach. BIOMED RESEARCH INTERNATIONAL 2023;2023:1787485. [PMID: 37090194 PMCID: PMC10119676 DOI: 10.1155/2023/1787485] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/26/2022] [Revised: 01/24/2023] [Accepted: 02/06/2023] [Indexed: 04/25/2023]

Joglekar A, Hu W, Zhang B, Narykov O, Diekhans M, Balacco J, Ndhlovu LC, Milner TA, Fedrigo O, Jarvis ED, Sheynkman G, Korkin D, Ross ME, Tilgner HU. Single-cell long-read mRNA isoform regulation is pervasive across mammalian brain regions, cell types, and development. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.02.535281. [PMID: 37066387 PMCID: PMC10103983 DOI: 10.1101/2023.04.02.535281] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/22/2023]

Abstract

RNA isoforms influence cell identity and function. Until recently, technological limitations prevented a genome-wide appraisal of isoform influence on cell identity in various parts of the brain. Using enhanced long-read single-cell isoform sequencing, we comprehensively analyze RNA isoforms in multiple mouse brain regions, cell subtypes, and developmental timepoints from postnatal day 14 (P14) to adult (P56). For 75% of genes, full-length isoform expression varies along one or more axes of phenotypic origin, underscoring the pervasiveness of isoform regulation across multiple scales. As expected, splicing varies strongly between cell types. However, certain gene classes including neurotransmitter release and reuptake as well as synapse turnover, harbor significant variability in the same cell type across anatomical regions, suggesting differences in network activity may influence cell-type identity. Glial brain-region specificity in isoform expression includes strong poly(A)-site regulation, whereas neurons have stronger TSS regulation. Furthermore, developmental patterns of cell-type specific splicing are especially pronounced in the murine adolescent transition from P21 to P28. The same cell type traced across development shows more isoform variability than across adult anatomical regions, indicating a coordinated modulation of functional programs dictating neural development. As most cell-type specific exons in P56 mouse hippocampus behave similarly in newly generated data from human hippocampi, these principles may be extrapolated to human brain. However, human brains have evolved additional cell-type specificity in splicing, suggesting gain-of-function isoforms. Taken together, we present a detailed single-cell atlas of full-length brain isoform regulation across development and anatomical regions, providing a previously unappreciated degree of isoform variability across multiple scales of the brain.

Collapse

Affiliation(s)

Anoushka Joglekar Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA Center for Neurogenetics, Weill Cornell Medicine, New York, NY, USA
Wen Hu Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA Center for Neurogenetics, Weill Cornell Medicine, New York, NY, USA
Bei Zhang Spatial Genomics, Inc. Pasadena, CA
Oleksandr Narykov Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA, USA Computer Science Department, Worcester Polytechnic Institute, Worcester, MA, USA Data Science Program, Worcester Polytechnic Institute, Worcester, MA, USA
Mark Diekhans UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
Jennifer Balacco Vertebrate Genome Lab, the Rockefeller University, New York, NY
Lishomwa C Ndhlovu Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA Department of Medicine, Division of Infectious Diseases, Weill Cornell Medicine, New York, NY, USA
Teresa A Milner Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA
Olivier Fedrigo Vertebrate Genome Lab, the Rockefeller University, New York, NY
Erich D Jarvis Vertebrate Genome Lab, the Rockefeller University, New York, NY Laboratory of Neurogenetics of Language, the Rockefeller University, New York, NY Howard Hughes Medical Institute, Chevy Chase, MD
Gloria Sheynkman Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, Virginia, USA Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA UVA Comprehensive Cancer Center, University of Virginia, Charlottesville, Virginia, USA
Dmitry Korkin Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA, USA Computer Science Department, Worcester Polytechnic Institute, Worcester, MA, USA Data Science Program, Worcester Polytechnic Institute, Worcester, MA, USA
M Elizabeth Ross Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA Center for Neurogenetics, Weill Cornell Medicine, New York, NY, USA
Hagen U Tilgner Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA Center for Neurogenetics, Weill Cornell Medicine, New York, NY, USA

Collapse

Makafe GG, Cole L, Roberts A, Muncil S, Patwardhan A, Bernacki D, Chojnacki M, Weinrick B, Sheinerman F. A novel chemogenomic discovery platform identifies bioactive hits with rapid bactericidal activity against Mycobacteroides Abscessus. Tuberculosis (Edinb) 2023;139:102317. [PMID: 36736037 DOI: 10.1016/j.tube.2023.102317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 01/16/2023] [Accepted: 01/21/2023] [Indexed: 01/26/2023]

Genomic Survey of Flavin Monooxygenases in Wild and Cultivated Rice Provides Insight into Evolution and Functional Diversities. Int J Mol Sci 2023;24:ijms24044190. [PMID: 36835601 PMCID: PMC9960948 DOI: 10.3390/ijms24044190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Revised: 01/08/2023] [Accepted: 01/12/2023] [Indexed: 02/22/2023] Open

Liu J, Maxwell M, Cuddihy T, Crawford T, Bassetti M, Hyde C, Peigneur S, Tytgat J, Undheim EAB, Mobli M. ScrepYard: An online resource for disulfide-stabilized tandem repeat peptides. Protein Sci 2023;32:e4566. [PMID: 36644825 PMCID: PMC9885460 DOI: 10.1002/pro.4566] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 01/05/2023] [Accepted: 01/12/2023] [Indexed: 01/17/2023]

Addressing the pervasive scarcity of structural annotation in eukaryotic algae. Sci Rep 2023;13:1687. [PMID: 36717613 PMCID: PMC9886943 DOI: 10.1038/s41598-023-27881-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Accepted: 01/09/2023] [Indexed: 02/01/2023] Open

Billaud M, Petit MA, Lossouarn J. The Clostridium-infecting filamentous phage CAK1 genome analysis allows to define a new potential clade of Tubulavirales. FEMS Microbiol Lett 2023;370:fnad099. [PMID: 37791400 DOI: 10.1093/femsle/fnad099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 09/21/2023] [Accepted: 10/02/2023] [Indexed: 10/05/2023] Open

Hagadorn MA, Hunter FK, DeLory T, Johnson MM, Pitts-Singer TL, Kapheim KM. Maternal body condition and season influence RNA deposition in the oocytes of alfalfa leafcutting bees (Megachile rotundata). Front Genet 2023;13:1064332. [PMID: 36685934 PMCID: PMC9845908 DOI: 10.3389/fgene.2022.1064332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Accepted: 11/28/2022] [Indexed: 01/06/2023] Open

Nambiar A, Liu S, Heflin M, Forsyth JM, Maslov S, Hopkins M, Ritz A. Transformer Neural Networks for Protein Family and Interaction Prediction Tasks. J Comput Biol 2023;30:95-111. [PMID: 35950958 DOI: 10.1089/cmb.2022.0132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open

Genome-wide subcellular protein map for the flagellate parasite Trypanosoma brucei. Nat Microbiol 2023;8:533-547. [PMID: 36804636 PMCID: PMC9981465 DOI: 10.1038/s41564-022-01295-6] [Citation(s) in RCA: 39] [Impact Index Per Article: 39.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Accepted: 11/21/2022] [Indexed: 02/22/2023]

Patel VK, Das A, Kumari R, Kajla S. In silico Analysis of Diverse Endo-β-1,4-glucanases Reveals Their Molecular Evolution. J EVOL BIOCHEM PHYS+ 2023. [DOI: 10.1134/s0022093023010088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/16/2023]

Ben Boubaker R, Tiss A, Henrion D, Chabbert M. Homology Modeling in the Twilight Zone: Improved Accuracy by Sequence Space Analysis. Methods Mol Biol 2023;2627:1-23. [PMID: 36959439 DOI: 10.1007/978-1-0716-2974-1_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/25/2023]

Kaur H, Singh V, Kalia M, Mohan B, Taneja N. Identification and functional annotation of hypothetical proteins of uropathogenic Escherichia coli strain CFT073 towards designing antimicrobial drug targets. J Biomol Struct Dyn 2022;40:14084-14095. [PMID: 34751095 DOI: 10.1080/07391102.2021.2000499] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]

Abstract

Urinary tract infections are a serious health concern worldwide, especially in developing countries. Escherichia coli strain CFT073 is a highly virulent pathogenic bacterial strain. CFT073 proteome contains 4897 proteins, out of which 992 have been classified as hypothetical proteins. Identification and characterization of hypothetical proteins can aid in the selection of targets for drug design. In this study, we studied the hypothetical proteins from the UPEC strain CFT073 using various computational tools. By NCBI-CDD, 376 protein sequences showed conserved domains. Based on the functional motifs in their primary sequences, we classified these 376 hypothetical proteins into 7 functional categories. Further KEGG database was used to find the roles of these hypothetical proteins in several pathways. Protein interaction network analysis of hypothetical proteins identified 53 proteins as highly interacting metabolic proteins. Virulence factor analysis of the proteins identified 8 proteins as virulent. We conducted a non-homology search for the identified proteins of UPEC in the available human proteome. We observed that 35 proteins are non-homologous to humans and hence could be selected for drug designing targets. Qualitative characterization of the selected 35 non-homologous hypothetical proteins including essentiality analysis and evaluation of druggability by similarity search against drug bank database was performed. Out of these 35 proteins, three-dimensional structures of six proteins (NP_752562.1, NP_756345.1, NP_754893.1, NP_756600.2, NP_755264.1 and NP_752994.1) could be successfully modelled. These new annotations can help to better understand disease mechanisms at the molecular level, as well as provide new targets for drug development against the UPEC strain CFT073.Communicated by Ramaswamy H. Sarma.

Collapse

Miller J, Zimin AV, Gordus A. Chromosome-level genome and the identification of sex chromosomes in Uloborus diversus. Gigascience 2022;12:giad002. [PMID: 36762707 PMCID: PMC9912274 DOI: 10.1093/gigascience/giad002] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Revised: 11/18/2022] [Accepted: 01/03/2023] [Indexed: 02/11/2023] Open

Genomic basis of the giga-chromosomes and giga-genome of tree peony Paeonia ostii. Nat Commun 2022;13:7328. [PMID: 36443323 PMCID: PMC9705720 DOI: 10.1038/s41467-022-35063-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2022] [Accepted: 11/17/2022] [Indexed: 11/29/2022] Open

Stam M, Lelièvre P, Hoebeke M, Corre E, Barbeyron T, Michel G. SulfAtlas, the sulfatase database: state of the art and new developments. Nucleic Acids Res 2022;51:D647-D653. [PMID: 36318251 PMCID: PMC9825549 DOI: 10.1093/nar/gkac977] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Revised: 10/14/2022] [Accepted: 10/17/2022] [Indexed: 11/06/2022] Open

Rahman MA, Heme UH, Parvez MAK. In silico functional annotation of hypothetical proteins from the Bacillus paralicheniformis strain Bac84 reveals proteins with biotechnological potentials and adaptational functions to extreme environments. PLoS One 2022;17:e0276085. [PMID: 36228026 PMCID: PMC9560612 DOI: 10.1371/journal.pone.0276085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Accepted: 09/28/2022] [Indexed: 11/26/2022] Open

Abstract

Members of the Bacillus genus are industrial cell factories due to their capacity to secrete significant quantities of biomolecules with industrial applications. The Bacillus paralicheniformis strain Bac84 was isolated from the Red Sea and it shares a close evolutionary relationship with Bacillus licheniformis. However, a significant number of proteins in its genome are annotated as functionally uncharacterized hypothetical proteins. Investigating these proteins' functions may help us better understand how bacteria survive extreme environmental conditions and to find novel targets for biotechnological applications. Therefore, the purpose of our research was to functionally annotate the hypothetical proteins from the genome of B. paralicheniformis strain Bac84. We employed a structured in-silico approach incorporating numerous bioinformatics tools and databases for functional annotation, physicochemical characterization, subcellular localization, protein-protein interactions, and three-dimensional structure determination. Sequences of 414 hypothetical proteins were evaluated and we were able to successfully attribute a function to 37 hypothetical proteins. Moreover, we performed receiver operating characteristic analysis to assess the performance of various tools used in this present study. We identified 12 proteins having significant adaptational roles to unfavorable environments such as sporulation, formation of biofilm, motility, regulation of transcription, etc. Additionally, 8 proteins were predicted with biotechnological potentials such as coenzyme A biosynthesis, phenylalanine biosynthesis, rare-sugars biosynthesis, antibiotic biosynthesis, bioremediation, and others. Evaluation of the performance of the tools showed an accuracy of 98% which represented the rationality of the tools used. This work shows that this annotation strategy will make the functional characterization of unknown proteins easier and can find the target for further investigation. The knowledge of these hypothetical proteins' potential functions aids B. paralicheniformis strain Bac84 in effectively creating a new biotechnological target. In addition, the results may also facilitate a better understanding of the survival mechanisms in harsh environmental conditions.

Collapse

Banik A, Ahmed SR, Sajib EH, Deb A, Sinha S, Azim KF. Identification of potential inhibitory analogs of metastasis tumor antigens (MTAs) using bioactive compounds: revealing therapeutic option to prevent malignancy. Mol Divers 2022;26:2473-2502. [PMID: 34743299 DOI: 10.1007/s11030-021-10345-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Accepted: 10/24/2021] [Indexed: 12/31/2022]

Organizing the bacterial annotation space with amino acid sequence embeddings. BMC Bioinformatics 2022;23:385. [PMID: 36151519 PMCID: PMC9502642 DOI: 10.1186/s12859-022-04930-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 08/11/2022] [Indexed: 11/10/2022] Open

Gagalova KK, Warren RL, Coombe L, Wong J, Nip KM, Yuen MMS, Whitehill JGA, Celedon JM, Ritland C, Taylor GA, Cheng D, Plettner P, Hammond SA, Mohamadi H, Zhao Y, Moore RA, Mungall AJ, Boyle B, Laroche J, Cottrell J, Mackay JJ, Lamothe M, Gérardi S, Isabel N, Pavy N, Jones SJM, Bohlmann J, Bousquet J, Birol I. Spruce giga-genomes: structurally similar yet distinctive with differentially expanding gene families and rapidly evolving genes. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022;111:1469-1485. [PMID: 35789009 DOI: 10.1111/tpj.15889] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 06/22/2022] [Accepted: 06/27/2022] [Indexed: 06/15/2023]

Abstract

Spruces (Picea spp.) are coniferous trees widespread in boreal and mountainous forests of the northern hemisphere, with large economic significance and enormous contributions to global carbon sequestration. Spruces harbor very large genomes with high repetitiveness, hampering their comparative analysis. Here, we present and compare the genomes of four different North American spruces: the genome assemblies for Engelmann spruce (Picea engelmannii) and Sitka spruce (Picea sitchensis) together with improved and more contiguous genome assemblies for white spruce (Picea glauca) and for a naturally occurring introgress of these three species known as interior spruce (P. engelmannii × glauca × sitchensis). The genomes were structurally similar, and a large part of scaffolds could be anchored to a genetic map. The composition of the interior spruce genome indicated asymmetric contributions from the three ancestral genomes. Phylogenetic analysis of the nuclear and organelle genomes revealed a topology indicative of ancient reticulation. Different patterns of expansion of gene families among genomes were observed and related with presumed diversifying ecological adaptations. We identified rapidly evolving genes that harbored high rates of non-synonymous polymorphisms relative to synonymous ones, indicative of positive selection and its hitchhiking effects. These gene sets were mostly distinct between the genomes of ecologically contrasted species, and signatures of convergent balancing selection were detected. Stress and stimulus response was identified as the most frequent function assigned to expanding gene families and rapidly evolving genes. These two aspects of genomic evolution were complementary in their contribution to divergent evolution of presumed adaptive nature. These more contiguous spruce giga-genome sequences should strengthen our understanding of conifer genome structure and evolution, as their comparison offers clues into the genetic basis of adaptation and ecology of conifers at the genomic level. They will also provide tools to better monitor natural genetic diversity and improve the management of conifer forests. The genomes of four closely related North American spruces indicate that their high similarity at the morphological level is paralleled by the high conservation of their physical genome structure. Yet, the evidence of divergent evolution is apparent in their rapidly evolving genomes, supported by differential expansion of key gene families and large sets of genes under positive selection, largely in relation to stimulus and environmental stress response.

Collapse

Affiliation(s)

Kristina K Gagalova Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
René L Warren Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
Lauren Coombe Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
Johnathan Wong Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
Ka Ming Nip Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
Macaire Man Saint Yuen Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
Justin G A Whitehill Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
Jose M Celedon Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
Carol Ritland Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
Greg A Taylor Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
Dean Cheng Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
Patrick Plettner Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
S Austin Hammond Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada Next-Generation Sequencing Facility, University of Saskatchewan, Saskatoon, SK, S7N 5E5, Canada
Hamid Mohamadi Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
Yongjun Zhao Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
Richard A Moore Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
Andrew J Mungall Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
Brian Boyle Institute for Systems and Integrative Biology, Université Laval, Québec, QC, GIV 0A6, Canada
Jérôme Laroche Institute for Systems and Integrative Biology, Université Laval, Québec, QC, GIV 0A6, Canada
Joan Cottrell Forest Research, U.K. Forestry Commission, Northern Research Station, Roslin, EH25 9SY, Midlothian, UK
John J Mackay Department of Plant Sciences, University of Oxford, Oxford, OX1 3RB, UK
Manuel Lamothe Natural Resources Canada, Canadian Forest Service, Laurentian Forestry Centre, Québec, QC, G1V 4C7, Canada
Sébastien Gérardi Institute for Systems and Integrative Biology, Université Laval, Québec, QC, GIV 0A6, Canada Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, QC, G1V 0A6, Canada
Nathalie Isabel Natural Resources Canada, Canadian Forest Service, Laurentian Forestry Centre, Québec, QC, G1V 4C7, Canada Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, QC, G1V 0A6, Canada
Nathalie Pavy Institute for Systems and Integrative Biology, Université Laval, Québec, QC, GIV 0A6, Canada Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, QC, G1V 0A6, Canada
Steven J M Jones Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
Joerg Bohlmann Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
Jean Bousquet Institute for Systems and Integrative Biology, Université Laval, Québec, QC, GIV 0A6, Canada Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, QC, G1V 0A6, Canada
Inanc Birol Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada

Collapse

Garg P, Vanamamalai VK, Jali I, Sharma S. In silico prediction of the animal susceptibility and virtual screening of natural compounds against SARS-CoV-2: Molecular dynamics simulation based analysis. Front Genet 2022;13:906955. [PMID: 36110222 PMCID: PMC9468858 DOI: 10.3389/fgene.2022.906955] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Accepted: 08/03/2022] [Indexed: 11/17/2022] Open

Abstract

COVID-19 is an infectious disease caused by the SARS-CoV-2 virus. It has six open reading frames (orf1ab, orf3a, orf6, orf7a, orf8, and orf10), a spike protein, a membrane protein, an envelope small membrane protein, and a nucleocapsid protein, out of which, orf1ab is the largest ORF coding different important non-structural proteins. In this study, an effort was made to evaluate the susceptibility of different animals against SARS-CoV-2 by analyzing the interactions of Spike and ACE2 proteins of the animals and propose a list of potential natural compounds binding to orf1ab of SARS-CoV-2. Here, we analyzed structural interactions between spike proteins of SARS-CoV-2 and the ACE2 receptor of 16 different hosts. A simulation for 50 ns was performed on these complexes. Based on post-simulation analysis, Chelonia mydas was found to have a more stable complex, while Bubalus bubalis, Aquila chrysaetos chrysaetos, Crocodylus porosus, and Loxodonta africana were found to have the least stable complexes with more fluctuations than all other organisms. Apart from that, we performed domain assignment of orf1ab of SARS-CoV-2 and identified 14 distinct domains. Out of these, Domain 3 (DNA/RNA polymerases) was selected as a target, as it showed no similarities with host proteomes and was validated in silico. Then, the top 10 molecules were selected from the virtual screening of ∼1.8 lakh molecules from the ZINC database, based on binding energy, and validated for ADME and toxicological properties. Three molecules were selected and analyzed further. The structural analysis showed that these molecules were residing within the pocket of the receptor. Finally, a simulation for 200 ns was performed on complexes with three selected molecules. Based on post-simulation analysis (RMSD, RMSF, Rg, SASA, and energies), the molecule ZINC000103666966 was found as the most suitable inhibitory compound against Domain 3. As this is an in silico prediction, further experimental studies could unravel the potential of the proposed molecule against SARS-CoV-2.

Collapse

Climate-Endangered Arctic Epishelf Lake Harbors Viral Assemblages with Distinct Genetic Repertoires. Appl Environ Microbiol 2022;88:e0022822. [PMID: 36005820 PMCID: PMC9469726 DOI: 10.1128/aem.00228-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open

Abstract

Milne Fiord, located on the coastal margin of the Last Ice Area (LIA) in the High Arctic (82°N, Canada), harbors an epishelf lake, a rare type of ice-dependent ecosystem in which a layer of freshwater overlies marine water connected to the open ocean. This microbe-dominated ecosystem faces catastrophic change due to the deterioration of its ice environment related to warming temperatures. We produced the first assessment of viral abundance, diversity, and distribution in this vulnerable ecosystem and explored the niches available for viral taxa and the functional genes underlying their distribution. We found that the viral community in the freshwater layer was distinct from, and more diverse than, the community in the underlying seawater and contained a different set of putative auxiliary metabolic genes, including the sulfur starvation-linked gene tauD and the gene coding for patatin-like phospholipase. The halocline community resembled the freshwater more than the marine community, but harbored viral taxa unique to this layer. We observed distinct viral assemblages immediately below the halocline, at a depth that was associated with a peak of prasinophyte algae and the viral family Phycodnaviridae. We also assembled 15 complete circular genomes, including a putative Pelagibacter phage with a marine distribution. It appears that despite its isolated and precarious situation, the varied niches in this epishelf lake support a diverse viral community, highlighting the importance of characterizing underexplored microbiota in the Last Ice Area before these ecosystems undergo irreversible change.

IMPORTANCE Viruses are key to understanding polar aquatic ecosystems, which are dominated by microorganisms. However, studies of viral communities are challenging to interpret because the vast majority of viruses are known only from sequence fragments, and their taxonomy, hosts, and genetic repertoires are unknown. Our study establishes a basis for comparison that will advance understanding of viral ecology in diverse global environments, particularly in the High Arctic. Rising temperatures in this region mean that researchers have limited time remaining to understand the biodiversity and biogeochemical cycles of ice-dependent environments and the consequences of these rapid, irreversible changes. The case of the Milne Fiord epishelf lake has special urgency because of the rarity of this type of “floating lake” ecosystem and its location in the Last Ice Area, a region of thick sea ice with global importance for conservation efforts.

Collapse

Dwivedi A, Moirangthem A, Pandey H, Sharma P, Srivastava P, Yadav P, Saxena D, Phadke S, Dabadghao P, Gupta N, Kabra M, Goyal R, Biswas R, Mangaraj S, Bhar D, Chowdhury S, Agarwal A, Mandal K. Von Hippel–Lindau (VHL) disease and VHL-associated tumors in Indian subjects: VHL gene testing in a resource constraint setting. EGYPTIAN JOURNAL OF MEDICAL HUMAN GENETICS 2022. [DOI: 10.1186/s43042-022-00338-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Abstract Abstract Background Von Hippel–Lindau (VHL) syndrome is a familial cancer syndrome caused by mutations in VHL gene. It is characterized by the formation of benign and malignant tumors like retinal angioma, cerebellar hemangioblastoma, spinal hemangioblastoma, renal cell carcinoma, pheochromocytoma, pancreatic and renal cysts, and endolymphatic sac tumors. Germline mutations in VHL gene have also been reported in isolated VHL-associated tumors. VHL gene is a small gene with 3 coding exons and can be easily tested even in a resource constraint setting. Objective To describe clinical presentation and estimate the diagnostic yield of in VHL and VHL-associated tumors. Methods This is a descriptive study in a hospital setting. Here, we describe the clinical and molecular data of 69 patients with suspected VHL or having VHL-associated tumors. Sanger sequencing of coding sequences and conserved splice sites of VHL gene were done in all patients. Multiplex ligation-dependent probe amplification (MLPA) of VHL gene to detect large deletions/duplications was performed for 18 patients with no pathogenic sequence variations. Results Among tumor types at presentation, pheochromocytoma was seen in 49% (34/69), hemangioblastoma was seen in 30% (21/69), and renal cell carcinoma was seen in 7% (5/69). Rest had other tumors like paraganglioma, endolymphatic sac papillary tumors, cerebellar astrocytoma and pancreatic cyst. Seven patients (10%) had more than one tumor at the time of diagnosis. Pathogenic variations in VHL gene were identified in 31probands by Sanger sequencing; 18 were missense, 2 nonsense and 2 small indels. A heterozygous deletion of exon 3 was detected by MLPA in one patient among 18 patients for whom MLPA was done. Overall, the molecular yield was 46% cases (32/69). Family history was present in 7 mutation positive cases (22%). Overall, 11 families (16%) opted for pre-symptomatic mutation testing in the family. Conclusions Mutation testing is indicated in VHL and VHL-associated tumors. The testing facility is easy and can be adopted easily in developing countries like India. The yield is good, and with fairly high incidence of familial cases, molecular testing can help in pre-symptomatic testing and surveillance. Collapse

Ferrer-Bonsoms JA, Gimeno M, Olaverri D, Sacristan P, Lobato C, Castilla C, Carazo F, Rubio A. EventPointer 3.0: flexible and accurate splicing analysis that includes studying the differential usage of protein-domains. NAR Genom Bioinform 2022;4:lqac067. [PMID: 36128425 PMCID: PMC9477077 DOI: 10.1093/nargab/lqac067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 07/29/2022] [Accepted: 09/07/2022] [Indexed: 12/05/2022] Open

Bashiri R, Curtis TP, Ofiţeru ID. The limitations of the current protein classification tools in identifying lipolytic features in putative bacterial lipase sequences. J Biotechnol 2022;351:30-37. [PMID: 35523393 DOI: 10.1016/j.jbiotec.2022.04.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 04/26/2022] [Accepted: 04/26/2022] [Indexed: 11/19/2022]

Abstract

Metagenomics sequencing has generated millions of new protein sequences, most of them with unknown functions. A relatively quick first step for function assignment is to use the existing public protein databases and their scanning tools. However, to date these tools are not able to identify all sequence features like conserved motifs or patterns. In this study we evaluated the capability of several protein public databases (e.g., InterPro, PROSITE, ESTHER, pfam, AlphaFold etc) and their scanning tools for identifying lipolytic features in 78 putative cold-adapted bacterial lipase sequences. Novel lipases that can tolerate extreme conditions have great biotechnological importance. We obtained the putative cold-adapted lipolytic sequences from the metagenomic study of anaerobic psychrophilic microbial community treating domestic wastewater at 4 and 15 ℃. Both newer and conventional protein classifiers failed to find lipolytic features for most of the putative lipases. InterProScan predicted lipase family membership for only 18 of the putative lipase sequences. For more than half of them (41 out of 78) InterProScan could not predict any protein family membership, let alone find lipolytic features in them. However, when the Lipase Engineering Database and AlphaFold were used, half of those sequences were classified. Conventional databases like PROSITE could find lipolytic patterns for 9 of the putative lipolytic sequences of which only one was identified by InterProScan as a lipase. Moreover, different scanning tools made different and inconsistent predictions for a certain putative lipase sequence. Even InterProScan, which integrates predictions from 13 protein member databases, did not have a consensus prediction for a certain lipase sequence. Our study shows that there is lack of information in public protein databases about bacterial lipase sequences and this limits their lipolytic feature prediction and biotechnological application. The integration of AlphaFold within the InterPro can improve the lipase identification and classification significantly.

Collapse