1
|
Zhao P, Yang H, Sun Y, Zhang J, Gao K, Wu J, Zhu C, Yin C, Chen X, Liu Q, Xia Q, Li Q, Xiao H, Sun HX, Zhang X, Yi L, Zhou C, Kliebenstein DJ, Fang R, Wang X, Ye J. Targeted MYC2 stabilization confers citrus Huanglongbing resistance. Science 2025; 388:191-198. [PMID: 40208996 DOI: 10.1126/science.adq7203] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Revised: 12/19/2024] [Accepted: 02/11/2025] [Indexed: 04/12/2025]
Abstract
Huanglongbing (HLB) is a devastating citrus disease. In this work, we report an HLB resistance regulatory circuit in Citrus composed of an E3 ubiquitin ligase, PUB21, and its substrate, the MYC2 transcription factor, which regulates jasmonate-mediated defense responses. A helitron insertion in the PUB21 promoter introduced multiple MYC2-binding cis-elements to create a regulatory circuit linking the PUB21 activity with MYC2 degradation. Ectopic expression of a natural dominant-negative PUB21 paralog discovered in distant Citrus relatives stabilized MYC2 and conferred resistance to HLB. Antiproteolysis peptides (APPs), identified by artificial intelligence, stabilized MYC2 by binding and inhibiting PUB21 activity. A 14-amino acid peptide, APP3-14, molecularly controlled HLB in greenhouse and field trials. This approach represents a strategy to combat uncultivable pathogens through targeted disease resistance protein stabilization.
Collapse
Affiliation(s)
- Pingzhi Zhao
- State Key Laboratory of Microbial Diversity and Innovative Utilization, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| | - Huan Yang
- State Key Laboratory of Microbial Diversity and Innovative Utilization, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| | - Yanwei Sun
- State Key Laboratory of Microbial Diversity and Innovative Utilization, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Jingyin Zhang
- State Key Laboratory of Microbial Diversity and Innovative Utilization, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Kaixing Gao
- State Key Laboratory of Microbial Diversity and Innovative Utilization, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| | - Jinbao Wu
- State Key Laboratory of Microbial Diversity and Innovative Utilization, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Chengrong Zhu
- State Key Laboratory of Microbial Diversity and Innovative Utilization, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Cece Yin
- State Key Laboratory of Microbial Diversity and Innovative Utilization, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Xiaoyue Chen
- State Key Laboratory of Microbial Diversity and Innovative Utilization, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Qi Liu
- State Key Laboratory of Microbial Diversity and Innovative Utilization, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Qiudong Xia
- State Key Laboratory of Microbial Diversity and Innovative Utilization, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| | - Qiong Li
- State Key Laboratory of Microbial Diversity and Innovative Utilization, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Han Xiao
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
- BGI Research, Beijing, China
| | | | - Xiaoxiao Zhang
- Guangxi Key Laboratory of Agro-environment and Agro-product Safety, College of Agriculture, Guangxi University, Nanning, China
| | - Long Yi
- College of Life Science, Gannan Normal University, Ganzhou, China
| | - Changyong Zhou
- National Citrus Engineering Research Center, Citrus Research Institute, Southwest University, Chongqing, China
| | | | - Rongxiang Fang
- State Key Laboratory of Microbial Diversity and Innovative Utilization, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| | - Xuefeng Wang
- National Citrus Engineering Research Center, Citrus Research Institute, Southwest University, Chongqing, China
| | - Jian Ye
- State Key Laboratory of Microbial Diversity and Innovative Utilization, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
2
|
Froschauer K, Svensson SL, Gelhausen R, Fiore E, Kible P, Klaude A, Kucklick M, Fuchs S, Eggenhofer F, Yang C, Falush D, Engelmann S, Backofen R, Sharma CM. Complementary Ribo-seq approaches map the translatome and provide a small protein census in the foodborne pathogen Campylobacter jejuni. Nat Commun 2025; 16:3078. [PMID: 40159498 PMCID: PMC11955535 DOI: 10.1038/s41467-025-58329-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Accepted: 03/18/2025] [Indexed: 04/02/2025] Open
Abstract
In contrast to transcriptome maps, bacterial small protein (≤50-100 aa) coding landscapes, including overlapping genes, are poorly characterized. However, an emerging number of small proteins have crucial roles in bacterial physiology and virulence. Here, we present a Ribo-seq-based high-resolution translatome map for the major foodborne pathogen Campylobacter jejuni. Besides conventional Ribo-seq, we employed translation initiation site (TIS) profiling to map start codons and also developed a translation termination site (TTS) profiling approach, which revealed stop codons not apparent from the reference genome in virulence loci. Our integrated approach combined with independent validation expanded the small proteome by two-fold, including CioY, a new 34 aa component of the CioAB oxidase. Overall, our study generates a high-resolution annotation of the C. jejuni coding landscape, provided in an interactive browser, and showcases a strategy for applying integrated Ribo-seq to other species to enrich our understanding of small proteomes.
Collapse
Affiliation(s)
- Kathrin Froschauer
- University of Würzburg, Institute of Molecular Infection Biology, Department of Molecular Infection Biology II, Würzburg, Germany
| | - Sarah L Svensson
- University of Würzburg, Institute of Molecular Infection Biology, Department of Molecular Infection Biology II, Würzburg, Germany
- The Center for Microbes, Development and Health, CAS Key Laboratory of Molecular Virology and Immunology, Shanghai Institute of Immunity and Infection, Chinese Academy of Sciences, Shanghai, China
| | - Rick Gelhausen
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Freiburg, Germany
| | - Elisabetta Fiore
- University of Würzburg, Institute of Molecular Infection Biology, Department of Molecular Infection Biology II, Würzburg, Germany
| | - Philipp Kible
- University of Würzburg, Institute of Molecular Infection Biology, Department of Molecular Infection Biology II, Würzburg, Germany
| | - Alicia Klaude
- Technische Universität Braunschweig, Institute for Microbiology, Braunschweig, Germany
- Helmholtz Centre for Infection Research (HZI), Braunschweig, Germany
| | - Martin Kucklick
- Technische Universität Braunschweig, Institute for Microbiology, Braunschweig, Germany
- Helmholtz Centre for Infection Research (HZI), Braunschweig, Germany
| | - Stephan Fuchs
- Robert Koch Institute, Methodenentwicklung und Forschungsinfrastruktur (MF), Berlin, Germany
| | - Florian Eggenhofer
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Freiburg, Germany
| | - Chao Yang
- The Center for Microbes, Development and Health, CAS Key Laboratory of Molecular Virology and Immunology, Shanghai Institute of Immunity and Infection, Chinese Academy of Sciences, Shanghai, China
| | - Daniel Falush
- The Center for Microbes, Development and Health, CAS Key Laboratory of Molecular Virology and Immunology, Shanghai Institute of Immunity and Infection, Chinese Academy of Sciences, Shanghai, China
| | - Susanne Engelmann
- Technische Universität Braunschweig, Institute for Microbiology, Braunschweig, Germany
- Helmholtz Centre for Infection Research (HZI), Braunschweig, Germany
| | - Rolf Backofen
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Freiburg, Germany
- Signalling Research Centre CIBSS, University of Freiburg, Freiburg, Germany
| | - Cynthia M Sharma
- University of Würzburg, Institute of Molecular Infection Biology, Department of Molecular Infection Biology II, Würzburg, Germany.
| |
Collapse
|
3
|
Fesenko I, Sahakyan H, Dhyani R, Shabalina SA, Storz G, Koonin EV. The hidden bacterial microproteome. Mol Cell 2025; 85:1024-1041.e6. [PMID: 39978337 PMCID: PMC11890958 DOI: 10.1016/j.molcel.2025.01.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2024] [Revised: 11/05/2024] [Accepted: 01/22/2025] [Indexed: 02/22/2025]
Abstract
Microproteins encoded by small open reading frames comprise the "dark matter" of proteomes. Although microproteins have been detected in diverse organisms from all three domains of life, many more remain to be identified, and only a few have been functionally characterized. In this comprehensive study of intergenic small open reading frames (ismORFs, 15-70 codons) in 5,668 bacterial genomes of the family Enterobacteriaceae, we identify 67,297 clusters of ismORFs subject to purifying selection. Expression of tagged Escherichia coli microproteins is detected for 11 of the 16 tested, validating the predictions. Although the ismORFs mainly code for hydrophobic, potentially transmembrane, unstructured, or minimally structured microproteins, some globular folds, oligomeric structures, and possible interactions with proteins encoded by neighboring genes are predicted. Complete information on the predicted microprotein families, including evidence of transcription and translation, and structure predictions are available as an easily searchable resource for investigation of microprotein functions.
Collapse
Affiliation(s)
- Igor Fesenko
- Computational Biology Branch, Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Harutyun Sahakyan
- Computational Biology Branch, Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Rajat Dhyani
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD 20892, USA
| | - Svetlana A Shabalina
- Computational Biology Branch, Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Gisela Storz
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD 20892, USA.
| | - Eugene V Koonin
- Computational Biology Branch, Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| |
Collapse
|
4
|
Galeota-Sprung B, Bhatt AS, de la Fuente-Nunez C. Microproteins: emerging roles as antibiotics. Trends Genet 2025; 41:104-106. [PMID: 39809670 DOI: 10.1016/j.tig.2024.12.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2024] [Revised: 12/06/2024] [Accepted: 12/06/2024] [Indexed: 01/16/2025]
Abstract
Recent advances in computational prediction and experimental techniques have detected previously unknown microproteins, particularly in the human microbiome. These small proteins, produced by diverse microbial species, are emerging as promising candidates for new antibiotics.
Collapse
Affiliation(s)
- Benjamin Galeota-Sprung
- Machine Biology Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Department of Microbiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Department of Bioengineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA; Department of Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA; Department of Chemistry, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, USA; Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, USA
| | - Ami S Bhatt
- Department of Medicine (Hematology; Blood and Marrow Transplantation), Stanford University, Stanford, CA, USA; Department of Genetics, Stanford University, Stanford, CA, USA
| | - Cesar de la Fuente-Nunez
- Machine Biology Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Department of Microbiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Department of Bioengineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA; Department of Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA; Department of Chemistry, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA, USA; Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
5
|
Huang J, Yang P, Pan W, Wu F, Qiu J, Ma Z. The role of polypeptides encoded by ncRNAs in cancer. Gene 2024; 928:148817. [PMID: 39098512 DOI: 10.1016/j.gene.2024.148817] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Revised: 07/22/2024] [Accepted: 07/31/2024] [Indexed: 08/06/2024]
Abstract
It was previously thought that ncRNA could not encode polypeptides, but recent reports have challenged this notion. As research into ncRNA progresses, it is increasingly clear that it serves roles beyond traditional mechanisms, playing significant regulatory roles in various diseases, notably cancer, which is responsible for 70% of human deaths. Numerous studies have highlighted the diverse regulatory mechanisms of ncRNA that are pivotal in cancer initiation and progression. The role of ncRNA-encoded polypeptides in cancer regulation has gained prominence. This article explores the newly identified regulatory functions of these polypeptides in three types of ncRNA-lncRNA, pri-miRNA, and circRNA. These polypeptides can interact with proteins, influence signaling pathways, enhance miRNA stability, and regulate cancer progression, malignancy, resistance, and other clinical challenges. Furthermore, we discuss the evolutionary significance of these polypeptides in the transition from RNA to protein, examining their emergence and conservation throughout evolution.
Collapse
Affiliation(s)
- Jiayuan Huang
- Lab for Noncoding RNA & Cancer, School of Life Sciences, Shanghai University, Shanghai 200444, China
| | - Ping Yang
- Department of Gynecology, The Third Affiliated Hospital of Kunming Medical University, Yunnan Cancer Hospital, Kunming 650118,China
| | - Wei Pan
- Lab for Noncoding RNA & Cancer, School of Life Sciences, Shanghai University, Shanghai 200444, China
| | - Fan Wu
- Lab for Noncoding RNA & Cancer, School of Life Sciences, Shanghai University, Shanghai 200444, China
| | - Jianhua Qiu
- Department of Anesthesiology, Ruijin Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 201800, China.
| | - Zhongliang Ma
- Lab for Noncoding RNA & Cancer, School of Life Sciences, Shanghai University, Shanghai 200444, China.
| |
Collapse
|
6
|
Torres MDT, Brooks EF, Cesaro A, Sberro H, Gill MO, Nicolaou C, Bhatt AS, de la Fuente-Nunez C. Mining human microbiomes reveals an untapped source of peptide antibiotics. Cell 2024; 187:5453-5467.e15. [PMID: 39163860 DOI: 10.1016/j.cell.2024.07.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Revised: 05/09/2024] [Accepted: 07/17/2024] [Indexed: 08/22/2024]
Abstract
Drug-resistant bacteria are outpacing traditional antibiotic discovery efforts. Here, we computationally screened 444,054 previously reported putative small protein families from 1,773 human metagenomes for antimicrobial properties, identifying 323 candidates encoded in small open reading frames (smORFs). To test our computational predictions, 78 peptides were synthesized and screened for antimicrobial activity in vitro, with 70.5% displaying antimicrobial activity. As these compounds were different compared with previously reported antimicrobial peptides, we termed them smORF-encoded peptides (SEPs). SEPs killed bacteria by targeting their membrane, synergizing with each other, and modulating gut commensals, indicating a potential role in reconfiguring microbiome communities in addition to counteracting pathogens. The lead candidates were anti-infective in both murine skin abscess and deep thigh infection models. Notably, prevotellin-2 from Prevotella copri presented activity comparable to the commonly used antibiotic polymyxin B. Our report supports the existence of hundreds of antimicrobials in the human microbiome amenable to clinical translation.
Collapse
Affiliation(s)
- Marcelo D T Torres
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA 19104, USA; Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Chemistry, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Erin F Brooks
- Department of Medicine (Hematology; Blood and Marrow Transplantation), Stanford University, Stanford, CA 94305, USA
| | - Angela Cesaro
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA 19104, USA; Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Chemistry, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Hila Sberro
- Department of Medicine (Hematology; Blood and Marrow Transplantation), Stanford University, Stanford, CA 94305, USA
| | - Matthew O Gill
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Cosmos Nicolaou
- Department of Medicine (Hematology; Blood and Marrow Transplantation), Stanford University, Stanford, CA 94305, USA
| | - Ami S Bhatt
- Department of Medicine (Hematology; Blood and Marrow Transplantation), Stanford University, Stanford, CA 94305, USA; Department of Genetics, Stanford University, Stanford, CA 94305, USA.
| | - Cesar de la Fuente-Nunez
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA 19104, USA; Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Chemistry, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA 19104, USA.
| |
Collapse
|
7
|
Abstract
How did specific useful protein sequences arise from simpler molecules at the origin of life? This seemingly needle-in-a-haystack problem has remarkably close resemblance to the old Protein Folding Problem, for which the solution is now known from statistical physics. Based on the logic that Origins must have come only after there was an operative evolution mechanism-which selects on phenotype, not genotype-we give a perspective that proteins and their folding processes are likely to have been the primary driver of the early stages of the origin of life.
Collapse
Affiliation(s)
- Charles D. Kocher
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY11794
- Department of Physics and Astronomy, Stony Brook University, Stony Brook, NY11794
| | - Ken A. Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY11794
- Department of Physics and Astronomy, Stony Brook University, Stony Brook, NY11794
- Department of Chemistry, Stony Brook University, Stony Brook, NY11794
| |
Collapse
|
8
|
Zlitni S, Bowden S, Sberro H, Torres MDT, Vaughan JM, Pinto AFM, Pinto Y, Fernandez D, Röst H, Saghatelian A, de la Fuente-Nunez C, Bhatt AS. Dual quorum-sensing control of purine biosynthesis drives pathogenic fitness of Enterococcus faecalis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.13.607696. [PMID: 39185165 PMCID: PMC11343167 DOI: 10.1101/2024.08.13.607696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/27/2024]
Abstract
Enterococcus faecalis is a resident of the human gut, though upon translocation to the blood or body tissues, it can be pathogenic. Here we discover and characterize two peptide-based quorum-sensing systems that transcriptionally modulate de novo purine biosynthesis in E. faecalis. Using a comparative genomic analysis, we find that most enterococcal species do not encode this system; E. moraviensis, E. haemoperoxidus and E. caccae, three species that are closely related to E. faecalis, encode one of the two systems, and only E. faecalis encodes both systems. We show that these systems are important for the intracellular survival of E. faecalis within macrophages and for the fitness of E. faecalis in a murine wound infection model. Taken together, we combine comparative genomics, microbiological, bacterial genetics, transcriptomics, targeted proteomics and animal model experiments to describe a paired quorum sensing mechanism that directly influences central metabolism and impacts the pathogenicity of E. faecalis.
Collapse
Affiliation(s)
- Soumaya Zlitni
- Department of Genetics, Stanford University, Stanford, CA, USA
- Department of Medicine (Hematology, Blood and Marrow Transplantation), Stanford University, Stanford, CA, USA
| | - Sierra Bowden
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Hila Sberro
- Department of Genetics, Stanford University, Stanford, CA, USA
- Department of Medicine (Hematology, Blood and Marrow Transplantation), Stanford University, Stanford, CA, USA
| | - Marcelo D. T. Torres
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania; Philadelphia, Pennsylvania 19104, USA
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania; Philadelphia, Pennsylvania 19104, USA
- Department of Chemistry, School of Arts and Sciences, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
- Penn Institute for Computational Science, University of Pennsylvania; Philadelphia, Pennsylvania 19104, USA
| | - Joan M Vaughan
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, San Diego, CA, USA
| | - Antonio F M Pinto
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, San Diego, CA, USA
| | - Yishay Pinto
- Department of Genetics, Stanford University, Stanford, CA, USA
- Department of Medicine (Hematology, Blood and Marrow Transplantation), Stanford University, Stanford, CA, USA
| | - Daniel Fernandez
- Program in Chemistry, Engineering, and Medicine for Human Health (ChEM-H), Stanford University, Stanford, CA 94305, USA
- Sarafan ChEM-H Macromolecular Structure Knowledge Center, Stanford University, Stanford, CA 94305, USA
| | - Hannes Röst
- Department of Molecular Genetics, Donnelly Centre for Cellular and Biomolecular Research, The University of Toronto, Toronto, ON, Canada
| | - Alan Saghatelian
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, San Diego, CA, USA
| | - Cesar de la Fuente-Nunez
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania; Philadelphia, Pennsylvania 19104, USA
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania; Philadelphia, Pennsylvania 19104, USA
- Department of Chemistry, School of Arts and Sciences, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
- Penn Institute for Computational Science, University of Pennsylvania; Philadelphia, Pennsylvania 19104, USA
| | - Ami S. Bhatt
- Department of Genetics, Stanford University, Stanford, CA, USA
- Department of Medicine (Hematology, Blood and Marrow Transplantation), Stanford University, Stanford, CA, USA
- Lead contact
| |
Collapse
|
9
|
Weston M, Hu H, Li X. PSPI: A deep learning approach for prokaryotic small protein identification. Front Genet 2024; 15:1439423. [PMID: 39050248 PMCID: PMC11266045 DOI: 10.3389/fgene.2024.1439423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Accepted: 06/18/2024] [Indexed: 07/27/2024] Open
Abstract
Small Proteins (SPs) are pivotal in various cellular functions such as immunity, defense, and communication. Despite their significance, identifying them is still in its infancy. Existing computational tools are tailored to specific eukaryotic species, leaving only a few options for SP identification in prokaryotes. In addition, these existing tools still have suboptimal performance in SP identification. To fill this gap, we introduce PSPI, a deep learning-based approach designed specifically for predicting prokaryotic SPs. We showed that PSPI had a high accuracy in predicting generalized sets of prokaryotic SPs and sets specific to the human metagenome. Compared with three existing tools, PSPI was faster and showed greater precision, sensitivity, and specificity not only for prokaryotic SPs but also for eukaryotic ones. We also observed that the incorporation of (n, k)-mers greatly enhances the performance of PSPI, suggesting that many SPs may contain short linear motifs. The PSPI tool, which is freely available at https://www.cs.ucf.edu/∼xiaoman/tools/PSPI/, will be useful for studying SPs as a tool for identifying prokaryotic SPs and it can be trained to identify other types of SPs as well.
Collapse
Affiliation(s)
- Matthew Weston
- Department of Computer Science, University of Central Florida, Orlando, FL, United States
| | - Haiyan Hu
- Department of Computer Science, University of Central Florida, Orlando, FL, United States
| | - Xiaoman Li
- Burnett School of Biomedical Science, College of Medicine, University of Central Florida, Orlando, FL, United States
| |
Collapse
|
10
|
Coelho LP, Santos-Júnior CD, de la Fuente-Nunez C. Challenges in computational discovery of bioactive peptides in 'omics data. Proteomics 2024; 24:e2300105. [PMID: 38458994 PMCID: PMC11537280 DOI: 10.1002/pmic.202300105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 02/06/2024] [Accepted: 02/06/2024] [Indexed: 03/10/2024]
Abstract
Peptides have a plethora of activities in biological systems that can potentially be exploited biotechnologically. Several peptides are used clinically, as well as in industry and agriculture. The increase in available 'omics data has recently provided a large opportunity for mining novel enzymes, biosynthetic gene clusters, and molecules. While these data primarily consist of DNA sequences, other types of data provide important complementary information. Due to their size, the approaches proven successful at discovering novel proteins of canonical size cannot be naïvely applied to the discovery of peptides. Peptides can be encoded directly in the genome as short open reading frames (smORFs), or they can be derived from larger proteins by proteolysis. Both of these peptide classes pose challenges as simple methods for their prediction result in large numbers of false positives. Similarly, functional annotation of larger proteins, traditionally based on sequence similarity to infer orthology and then transferring functions between characterized proteins and uncharacterized ones, cannot be applied for short sequences. The use of these techniques is much more limited and alternative approaches based on machine learning are used instead. Here, we review the limitations of traditional methods as well as the alternative methods that have recently been developed for discovering novel bioactive peptides with a focus on prokaryotic genomes and metagenomes.
Collapse
Affiliation(s)
- Luis Pedro Coelho
- Centre for Microbiome Research, School of Biomedical Sciences, Queensland University of Technology, Woolloongabba, Queensland, Australia
- Institute of Science and Technology for Brain-Inspired Intelligence – ISTBI, Fudan University, Shanghai, China
| | - Célio Dias Santos-Júnior
- Institute of Science and Technology for Brain-Inspired Intelligence – ISTBI, Fudan University, Shanghai, China
- Laboratory of Microbial Processes & Biodiversity – LMPB, Hydrobiology Department, Federal University of São Carlos – UFSCar, São Paulo, Brazil
| | - Cesar de la Fuente-Nunez
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Department of Chemistry, School of Arts and Sciences, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
11
|
Beals J, Hu H, Li X. A survey of experimental and computational identification of small proteins. Brief Bioinform 2024; 25:bbae345. [PMID: 39007598 PMCID: PMC11247407 DOI: 10.1093/bib/bbae345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 05/27/2024] [Accepted: 07/02/2024] [Indexed: 07/16/2024] Open
Abstract
Small proteins (SPs) are typically characterized as eukaryotic proteins shorter than 100 amino acids and prokaryotic proteins shorter than 50 amino acids. Historically, they were disregarded because of the arbitrary size thresholds to define proteins. However, recent research has revealed the existence of many SPs and their crucial roles. Despite this, the identification of SPs and the elucidation of their functions are still in their infancy. To pave the way for future SP studies, we briefly introduce the limitations and advancements in experimental techniques for SP identification. We then provide an overview of available computational tools for SP identification, their constraints, and their evaluation. Additionally, we highlight existing resources for SP research. This survey aims to initiate further exploration into SPs and encourage the development of more sophisticated computational tools for SP identification in prokaryotes and microbiomes.
Collapse
Affiliation(s)
- Joshua Beals
- Burnett School of Biomedical Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL 32816, United States
| | - Haiyan Hu
- Department of Computer Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL 32816, United States
| | - Xiaoman Li
- Burnett School of Biomedical Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL 32816, United States
| |
Collapse
|
12
|
Miravet-Verde S, Mazzolini R, Segura-Morales C, Broto A, Lluch-Senar M, Serrano L. ProTInSeq: transposon insertion tracking by ultra-deep DNA sequencing to identify translated large and small ORFs. Nat Commun 2024; 15:2091. [PMID: 38453908 PMCID: PMC10920889 DOI: 10.1038/s41467-024-46112-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Accepted: 02/14/2024] [Indexed: 03/09/2024] Open
Abstract
Identifying open reading frames (ORFs) being translated is not a trivial task. ProTInSeq is a technique designed to characterize proteomes by sequencing transposon insertions engineered to express a selection marker when they occur in-frame within a protein-coding gene. In the bacterium Mycoplasma pneumoniae, ProTInSeq identifies 83% of its annotated proteins, along with 5 proteins and 153 small ORF-encoded proteins (SEPs; ≤100 aa) that were not previously annotated. Moreover, ProTInSeq can be utilized for detecting translational noise, as well as for relative quantification and transmembrane topology estimation of fitness and non-essential proteins. By integrating various identification approaches, the number of initially annotated SEPs in this bacterium increases from 27 to 329, with a quarter of them predicted to possess antimicrobial potential. Herein, we describe a methodology complementary to Ribo-Seq and mass spectroscopy that can identify SEPs while providing other insights in a proteome with a flexible and cost-effective DNA ultra-deep sequencing approach.
Collapse
Affiliation(s)
- Samuel Miravet-Verde
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, 08003, Barcelona, Spain.
- Department of Biology, Institute of Microbiology and Swiss Institute of Bioinformatics, ETH Zurich, Zurich, Switzerland.
| | | | - Carolina Segura-Morales
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, 08003, Barcelona, Spain
| | - Alicia Broto
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, 08003, Barcelona, Spain
| | - Maria Lluch-Senar
- Pulmobiotics, Dr Aiguader 88, 08003, Barcelona, Spain.
- Institute of Biotechnology and Biomedicine "Vicent Villar Palasi" (IBB), Universitat Autònoma de Barcelona, Barcelona, Spain.
| | - Luis Serrano
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, 08003, Barcelona, Spain.
- Universitat Pompeu Fabra (UPF), Barcelona, Spain.
- ICREA, Pg. Lluis Companys 23, 08010, Barcelona, Spain.
| |
Collapse
|
13
|
Fesenko I, Sahakyan H, Shabalina SA, Koonin EV. The Cryptic Bacterial Microproteome. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.17.580829. [PMID: 38903115 PMCID: PMC11188072 DOI: 10.1101/2024.02.17.580829] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/22/2024]
Abstract
Microproteins encoded by small open reading frames (smORFs) comprise the "dark matter" of proteomes. Although functional microproteins were identified in diverse organisms from all three domains of life, bacterial smORFs remain poorly characterized. In this comprehensive study of intergenic smORFs (ismORFs, 15-70 codons) in 5,668 bacterial genomes of the family Enterobacteriaceae, we identified 67,297 clusters of ismORFs subject to purifying selection. The ismORFs mainly code for hydrophobic, potentially transmembrane, unstructured, or minimally structured microproteins. Using AlphaFold Multimer, we predicted interactions of some of the predicted microproteins encoded by transcribed ismORFs with proteins encoded by neighboring genes, revealing the potential of microproteins to regulate the activity of various proteins, particularly, under stress. We compiled a catalog of predicted microprotein families with different levels of evidence from synteny analysis, structure prediction, and transcription and translation data. This study offers a resource for investigation of biological functions of microproteins.
Collapse
Affiliation(s)
- Igor Fesenko
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Harutyun Sahakyan
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Svetlana A. Shabalina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Eugene V. Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
14
|
Mohsen JJ, Martel AA, Slavoff SA. Microproteins-Discovery, structure, and function. Proteomics 2023; 23:e2100211. [PMID: 37603371 PMCID: PMC10841188 DOI: 10.1002/pmic.202100211] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 08/03/2023] [Accepted: 08/10/2023] [Indexed: 08/22/2023]
Abstract
Advances in proteogenomic technologies have revealed hundreds to thousands of translated small open reading frames (sORFs) that encode microproteins in genomes across evolutionary space. While many microproteins have now been shown to play critical roles in biology and human disease, a majority of recently identified microproteins have little or no experimental evidence regarding their functionality. Computational tools have some limitations for analysis of short, poorly conserved microprotein sequences, so additional approaches are needed to determine the role of each member of this recently discovered polypeptide class. A currently underexplored avenue in the study of microproteins is structure prediction and determination, which delivers a depth of functional information. In this review, we provide a brief overview of microprotein discovery methods, then examine examples of microprotein structures (and, conversely, intrinsic disorder) that have been experimentally determined using crystallography, cryo-electron microscopy, and NMR, which provide insight into their molecular functions and mechanisms. Additionally, we discuss examples of predicted microprotein structures that have provided insight or context regarding their function. Analysis of microprotein structure at the angstrom level, and confirmation of predicted structures, therefore, has potential to identify translated microproteins that are of biological importance and to provide molecular mechanism for their in vivo roles.
Collapse
Affiliation(s)
- Jessica J. Mohsen
- Department of Chemistry, Yale University, New Haven, CT, USA
- Institute of Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
| | - Alina A. Martel
- Institute of Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
| | - Sarah A. Slavoff
- Department of Chemistry, Yale University, New Haven, CT, USA
- Institute of Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| |
Collapse
|
15
|
Zhao B, Zhao J, Wang M, Guo Y, Mehmood A, Wang W, Xiong Y, Luo S, Wei DQ, Zhao XQ, Wang Y. Exploring microproteins from various model organisms using the mip-mining database. BMC Genomics 2023; 24:661. [PMID: 37919660 PMCID: PMC10623795 DOI: 10.1186/s12864-023-09735-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Accepted: 10/12/2023] [Indexed: 11/04/2023] Open
Abstract
Microproteins, prevalent across all kingdoms of life, play a crucial role in cell physiology and human health. Although global gene transcription is widely explored and abundantly available, our understanding of microprotein functions using transcriptome data is still limited. To mitigate this problem, we present a database, Mip-mining ( https://weilab.sjtu.edu.cn/mipmining/ ), underpinned by high-quality RNA-sequencing data exclusively aimed at analyzing microprotein functions. The Mip-mining hosts 336 sets of high-quality transcriptome data from 8626 samples and nine representative living organisms, including microorganisms, plants, animals, and humans, in our Mip-mining database. Our database specifically provides a focus on a range of diseases and environmental stress conditions, taking into account chemical, physical, biological, and diseases-related stresses. Comparatively, our platform enables customized analysis by inputting desired data sets with self-determined cutoff values. The practicality of Mip-mining is demonstrated by identifying essential microproteins in different species and revealing the importance of ATP15 in the acetic acid stress tolerance of budding yeast. We believe that Mip-mining will facilitate a greater understanding and application of microproteins in biotechnology. Moreover, it will be beneficial for designing therapeutic strategies under various biological conditions.
Collapse
Affiliation(s)
- Bowen Zhao
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Jing Zhao
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Muyao Wang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Yangfan Guo
- Central Laboratory of Yan'an Hospital Affiliated to Kunming Medical University, Kunming, 650051, China
| | - Aamir Mehmood
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Weibin Wang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Yi Xiong
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
- Shanghai Artificial Intelligence Laboratory, Shanghai, 200232, China
| | - Shenggan Luo
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Dong-Qing Wei
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
- Zhongjing Research and Industrialization Institute of Chinese Medicine, Zhongguancun Scientific Park, Meixi, Nayang, Henan, 473006, China.
- Peng Cheng Laboratory, Vanke Cloud City Phase I Building 8, Xili Street, Nanshan District, Shenzhen, 518055, Guangdong, China.
| | - Xin-Qing Zhao
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
| | - Yanjing Wang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
- Engineering Research Center of Cell & Therapeutic Antibody, School of Pharmacy, Shanghai Jiao Tong University, Shanghai, 200240, China.
| |
Collapse
|
16
|
Torres MDT, Brooks E, Cesaro A, Sberro H, Nicolaou C, Bhatt AS, de la Fuente-Nunez C. Human gut metagenomic mining reveals an untapped source of peptide antibiotics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.31.555711. [PMID: 37693399 PMCID: PMC10491270 DOI: 10.1101/2023.08.31.555711] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Drug-resistant bacteria are outpacing traditional antibiotic discovery efforts. Here, we computationally mined 444,054 families of putative small proteins from 1,773 human gut metagenomes, identifying 323 peptide antibiotics encoded in small open reading frames (smORFs). To test our computational predictions, 78 peptides were synthesized and screened for antimicrobial activity in vitro, with 59% displaying activity against either pathogens or commensals. Since these peptides were unique compared to previously reported antimicrobial peptides, we termed them smORF-encoded peptides (SEPs). SEPs killed bacteria by targeting their membrane, synergized with each other, and modulated gut commensals, indicating that they may play a role in reconfiguring microbiome communities in addition to counteracting pathogens. The lead candidates were anti-infective in both murine skin abscess and deep thigh infection models. Notably, prevotellin-2 from Prevotella copri presented activity comparable to the commonly used antibiotic polymyxin B. We report the discovery of hundreds of peptide sequences in the human gut.
Collapse
Affiliation(s)
- Marcelo D. T. Torres
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States of America
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States of America
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States of America
| | - Erin Brooks
- Department of Medicine (Hematology; Blood and Marrow Transplantation), Stanford University, Stanford, CA, United States of America
| | - Angela Cesaro
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States of America
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States of America
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States of America
| | - Hila Sberro
- Department of Medicine (Hematology; Blood and Marrow Transplantation), Stanford University, Stanford, CA, United States of America
| | - Cosmos Nicolaou
- Department of Medicine (Hematology; Blood and Marrow Transplantation), Stanford University, Stanford, CA, United States of America
| | - Ami S. Bhatt
- Department of Medicine (Hematology; Blood and Marrow Transplantation), Stanford University, Stanford, CA, United States of America
- Department of Genetics, Stanford University, Stanford, CA, United States of America
| | - Cesar de la Fuente-Nunez
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States of America
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States of America
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States of America
| |
Collapse
|
17
|
Zhao S, Meng J, Wekesa JS, Luan Y. Identification of small open reading frames in plant lncRNA using class-imbalance learning. Comput Biol Med 2023; 157:106773. [PMID: 36924731 DOI: 10.1016/j.compbiomed.2023.106773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 02/21/2023] [Accepted: 03/09/2023] [Indexed: 03/12/2023]
Abstract
Recently, small open reading frames (sORFs) in long noncoding RNA (lncRNA) have been demonstrated to encode small peptides that can help study the mechanisms of growth and development in organisms. Since machine learning-based computational methods are less costly compared with biological experiments, they can be used to identify sORFs and provide a basis for biological experiments. However, few computational methods and data resources have been exploited for identifying sORFs in plant lncRNA. Besides, machine learning models produce underperforming classifiers when faced with a class-imbalance problem. In this study, an alternative method called SMOTE based on weighted cosine distance (WCDSMOTE) which enables interaction with feature selection is put forward to synthesize minority class samples and weighted edited nearest neighbor (WENN) is applied to clean up majority class samples, thus, hybrid sampling WCDSMOTE-ENN is proposed to deal with imbalanced datasets with the multi-angle feature. A heterogeneous classifier ensemble is introduced to complete the classification task. Therefore, a novel computational method that is based on class-imbalance learning to identify the sORFs with coding potential in plant lncRNA (sORFplnc) is presented. Experimental results manifest that sORFplnc outperforms existing computational methods in identifying sORFs with coding potential. We anticipate that the proposed work can be a reference for relevant research and contribute to agriculture and biomedicine.
Collapse
Affiliation(s)
- Siyuan Zhao
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning, 116024, China
| | - Jun Meng
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning, 116024, China.
| | - Jael Sanyanda Wekesa
- Department of Information Technology, Jomo Kenyatta University of Agriculture and Technology, Nairobi, 62000-00200, Kenya
| | - Yushi Luan
- School of Bioengineering, Dalian University of Technology, Dalian, Liaoning, 116024, China
| |
Collapse
|
18
|
Characterization and Spatial Mapping of the Human Gut Metasecretome. mSystems 2022; 7:e0071722. [PMID: 36468852 PMCID: PMC9765747 DOI: 10.1128/msystems.00717-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/10/2022] Open
Abstract
Bacterially secreted proteins play an important role in microbial physiology and ecology in many environments, including the mammalian gut. While gut microbes have been extensively studied over the past decades, little is known about the proteins that they secrete into the gastrointestinal tract. In this study, we developed and applied a computational pipeline to a comprehensive catalog of human-associated metagenome-assembled genomes in order to predict and analyze the bacterial metasecretome of the human gut, i.e., the collection of proteins secreted out of the cytoplasm by human gut bacteria. We identified the presence of large and diverse families of secreted carbohydrate-active enzymes and assessed their phylogenetic distributions across different taxonomic groups, which revealed an enrichment in Bacteroidetes and Verrucomicrobia. By mapping secreted proteins to available metagenomic data from endoscopic sampling of the human gastrointestinal tract, we specifically pinpointed regions in the upper and lower intestinal tract along the lumen and mucosa where specific glycosidases are secreted by resident microbes. The metasecretome analyzed in this study constitutes the most comprehensive list of secreted proteins produced by human gut bacteria reported to date and serves as a useful resource for the microbiome research community. IMPORTANCE Bacterially secreted proteins are necessary for the proper functioning of bacterial cells and communities. Secreted proteins provide bacterial cells with the ability to harvest resources from the exterior, import these resources into the cell, and signal to other bacteria. In the human gut microbiome, these actions impact host health and allow the maintenance of a healthy gut bacterial community. We utilized computational tools to identify the major components of human gut bacterially secreted proteins and determined their spatial distribution in the gastrointestinal tract. Our analysis of human gut bacterial secreted proteins will allow a better understanding of the impact of gut bacteria on human health and represents a step toward identifying new protein functions with interesting applications in biomedicine and industry.
Collapse
|
19
|
Liu Y, Zeng S, Wu M. Novel insights into noncanonical open reading frames in cancer. Biochim Biophys Acta Rev Cancer 2022; 1877:188755. [PMID: 35777601 DOI: 10.1016/j.bbcan.2022.188755] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 06/11/2022] [Accepted: 06/23/2022] [Indexed: 12/12/2022]
Abstract
With technological advances, previously neglected noncanonical open reading frames (nORFs) are drawing ever-increasing attention. However, the translation potential of numerous putative nORFs remains elusive, and the functions of noncanonical peptides have not been systemically summarized. Moreover, the relationship between noncanonical peptides and their counterpart protein or RNA products remains elusive and the clinical implementation of noncanonical peptides has not been explored. In this review, we highlight how recent technological advances such as ribosome profiling, bioinformatics approaches and CRISPR/Cas9 facilitate the research of noncanonical peptides. We delineate the features of each nORF category and the evolutionary process underneath the nORFs. Most importantly, we summarize the diversified functions of noncanonical peptides in cancer based on their subcellular location, which reflect their extensive participation in key pathways and essential cellular activities in cancer cells. Meanwhile, the equilibrium between noncanonical peptides and their corresponding transcripts or counterpart products may be dysregulated under pathological states, which is essential for their roles in cancer. Lastly, we explore their underestimated potential in clinical application as diagnostic biomarkers and treatment targets against cancer.
Collapse
Affiliation(s)
- Yihan Liu
- Hunan Cancer Hospital and the Affiliated Cancer Hospital of Xiangya School of Medicine, Central South University, Changsha 410013, Hunan, China; The Key Laboratory of Carcinogenesis of the Chinese Ministry of Health, The Key Laboratory of Carcinogenesis and Cancer Invasion of the Chinese Ministry of Education, Cancer Research Institute, Central South University, Changsha, Hunan 410008, China; Department of Oncology, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China; Key Laboratory for Molecular Radiation Oncology of Hunan Province, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China; National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China
| | - Shan Zeng
- Department of Oncology, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China; Key Laboratory for Molecular Radiation Oncology of Hunan Province, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China; National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, Hunan 410008, China.
| | - Minghua Wu
- Hunan Cancer Hospital and the Affiliated Cancer Hospital of Xiangya School of Medicine, Central South University, Changsha 410013, Hunan, China; The Key Laboratory of Carcinogenesis of the Chinese Ministry of Health, The Key Laboratory of Carcinogenesis and Cancer Invasion of the Chinese Ministry of Education, Cancer Research Institute, Central South University, Changsha, Hunan 410008, China.
| |
Collapse
|
20
|
Thousands of small, novel genes predicted in global phage genomes. Cell Rep 2022; 39:110984. [PMID: 35732113 DOI: 10.1016/j.celrep.2022.110984] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 02/14/2022] [Accepted: 05/27/2022] [Indexed: 11/22/2022] Open
Abstract
Small genes (<150 nucleotides) have been systematically overlooked in phage genomes. We employ a large-scale comparative genomics approach to predict >40,000 small-gene families in ∼2.3 million phage genome contigs. We find that small genes in phage genomes are approximately 3-fold more prevalent than in host prokaryotic genomes. Our approach enriches for small genes that are translated in microbiomes, suggesting the small genes identified are coding. More than 9,000 families encode potentially secreted or transmembrane proteins, more than 5,000 families encode predicted anti-CRISPR proteins, and more than 500 families encode predicted antimicrobial proteins. By combining homology and genomic-neighborhood analyses, we reveal substantial novelty and diversity within phage biology, including small phage genes found in multiple host phyla, small genes encoding proteins that play essential roles in host infection, and small genes that share genomic neighborhoods and whose encoded proteins may share related functions.
Collapse
|
21
|
Cancer-related micropeptides encoded by ncRNAs: Promising drug targets and prognostic biomarkers. Cancer Lett 2022; 547:215723. [DOI: 10.1016/j.canlet.2022.215723] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 04/14/2022] [Accepted: 05/01/2022] [Indexed: 02/07/2023]
|
22
|
Gelhausen R, Müller T, Svensson SL, Alkhnbashi OS, Sharma CM, Eggenhofer F, Backofen R. RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria. Brief Bioinform 2022; 23:bbab549. [PMID: 35037022 PMCID: PMC8921622 DOI: 10.1093/bib/bbab549] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2021] [Revised: 11/22/2021] [Accepted: 11/29/2021] [Indexed: 11/19/2022] Open
Abstract
Small proteins encoded by short open reading frames (ORFs) with 50 codons or fewer are emerging as an important class of cellular macromolecules in diverse organisms. However, they often evade detection by proteomics or in silico methods. Ribosome profiling (Ribo-seq) has revealed widespread translation in genomic regions previously thought to be non-coding, driving the development of ORF detection tools using Ribo-seq data. However, only a handful of tools have been designed for bacteria, and these have not yet been systematically compared. Here, we aimed to identify tools that use Ribo-seq data to correctly determine the translational status of annotated bacterial ORFs and also discover novel translated regions with high sensitivity. To this end, we generated a large set of annotated ORFs from four diverse bacterial organisms, manually labeled for their translation status based on Ribo-seq data, which are available for future benchmarking studies. This set was used to investigate the predictive performance of seven Ribo-seq-based ORF detection tools (REPARATION_blast, DeepRibo, Ribo-TISH, PRICE, smORFer, ribotricer and SPECtre), as well as IRSOM, which uses coding potential and RNA-seq coverage only. DeepRibo and REPARATION_blast robustly predicted translated ORFs, including sORFs, with no significant difference for ORFs in close proximity to other genes versus stand-alone genes. However, no tool predicted a set of novel, experimentally verified sORFs with high sensitivity. Start codon predictions with smORFer show the value of initiation site profiling data to further improve the sensitivity of ORF prediction tools in bacteria. Overall, we find that bacterial tools perform well for sORF detection, although there is potential for improving their performance, applicability, usability and reproducibility.
Collapse
Affiliation(s)
- Rick Gelhausen
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Georges-Köhler-Allee 106, 79110, Freiburg, Germany
| | - Teresa Müller
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Georges-Köhler-Allee 106, 79110, Freiburg, Germany
| | - Sarah L Svensson
- Department of Molecular Infection Biology II, Institute of Molecular Infection Biology (IMIB), University of Würzburg, Josef-Schneider-Str. 2 / D15, 97080, Würzburg, Germany
| | - Omer S Alkhnbashi
- Information and Computer Science Department, King Fahd University of Petroleum and Minerals, Saudi Arabia
- SDAIA-KFUPM Joint Research Center for Artificial Intelligence (JRC-AI), King Fahd University of Petroleum and Minerals, Saudi Arabia
| | - Cynthia M Sharma
- Department of Molecular Infection Biology II, Institute of Molecular Infection Biology (IMIB), University of Würzburg, Josef-Schneider-Str. 2 / D15, 97080, Würzburg, Germany
| | - Florian Eggenhofer
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Georges-Köhler-Allee 106, 79110, Freiburg, Germany
| | - Rolf Backofen
- Bioinformatics Group, Department of Computer Science, University of Freiburg, Georges-Köhler-Allee 106, 79110, Freiburg, Germany
- Signalling Research Centres BIOSS and CIBSS, University of Freiburg, Schänzlestr. 18, 79104, State, Germany
| |
Collapse
|
23
|
Phan QV, Bogdanow B, Wyler E, Landthaler M, Liu F, Hagemeier C, Wiebusch L. Engineering, decoding and systems-level characterization of chimpanzee cytomegalovirus. PLoS Pathog 2022; 18:e1010193. [PMID: 34982803 PMCID: PMC8759705 DOI: 10.1371/journal.ppat.1010193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 01/14/2022] [Accepted: 12/09/2021] [Indexed: 11/19/2022] Open
Abstract
The chimpanzee cytomegalovirus (CCMV) is the closest relative of human CMV (HCMV). Because of the high conservation between these two species and the ability of human cells to fully support CCMV replication, CCMV holds great potential as a model system for HCMV. To make the CCMV genome available for precise and rapid gene manipulation techniques, we captured the genomic DNA of CCMV strain Heberling as a bacterial artificial chromosome (BAC). Selected BAC clones were reconstituted to infectious viruses, growing to similar high titers as parental CCMV. DNA sequencing confirmed the integrity of our clones and led to the identification of two polymorphic loci and a deletion-prone region within the CCMV genome. To re-evaluate the CCMV coding potential, we analyzed the viral transcriptome and proteome and identified several novel ORFs, splice variants, and regulatory RNAs. We further characterized the dynamics of CCMV gene expression and found that viral proteins cluster into five distinct temporal classes. In addition, our datasets revealed that the host response to CCMV infection and the de-regulation of cellular pathways are in line with known hallmarks of HCMV infection. In a first functional experiment, we investigated a proposed frameshift mutation in UL128 that was suspected to restrict CCMV's cell tropism. In fact, repair of this frameshift re-established productive CCMV infection in endothelial and epithelial cells, expanding the options of CCMV as an infection model. Thus, BAC-cloned CCMV can serve as a powerful tool for systematic approaches in comparative functional genomics, exploiting the close phylogenetic relationship between CCMV and HCMV.
Collapse
Affiliation(s)
- Quang Vinh Phan
- Department of Pediatric Oncology/Hematology, Charité—Universitätsmedizin Berlin, Berlin, Germany
| | - Boris Bogdanow
- Department of Structural Biology, Leibniz-Forschungsinstitut für Molekulare Pharmakologie, Berlin, Germany
| | - Emanuel Wyler
- Berlin Institute for Medical Systems Biology, Max-Delbrück-Center for Molecular Medicine, Berlin, Germany
| | - Markus Landthaler
- Berlin Institute for Medical Systems Biology, Max-Delbrück-Center for Molecular Medicine, Berlin, Germany
| | - Fan Liu
- Department of Structural Biology, Leibniz-Forschungsinstitut für Molekulare Pharmakologie, Berlin, Germany
| | - Christian Hagemeier
- Department of Pediatric Oncology/Hematology, Charité—Universitätsmedizin Berlin, Berlin, Germany
| | - Lüder Wiebusch
- Department of Pediatric Oncology/Hematology, Charité—Universitätsmedizin Berlin, Berlin, Germany
| |
Collapse
|
24
|
Schwengers O, Jelonek L, Dieckmann MA, Beyvers S, Blom J, Goesmann A. Bakta: rapid and standardized annotation of bacterial genomes via alignment-free sequence identification. Microb Genom 2021; 7:000685. [PMID: 34739369 PMCID: PMC8743544 DOI: 10.1099/mgen.0.000685] [Citation(s) in RCA: 316] [Impact Index Per Article: 79.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Accepted: 09/08/2021] [Indexed: 12/21/2022] Open
Abstract
Command-line annotation software tools have continuously gained popularity compared to centralized online services due to the worldwide increase of sequenced bacterial genomes. However, results of existing command-line software pipelines heavily depend on taxon-specific databases or sufficiently well annotated reference genomes. Here, we introduce Bakta, a new command-line software tool for the robust, taxon-independent, thorough and, nonetheless, fast annotation of bacterial genomes. Bakta conducts a comprehensive annotation workflow including the detection of small proteins taking into account replicon metadata. The annotation of coding sequences is accelerated via an alignment-free sequence identification approach that in addition facilitates the precise assignment of public database cross-references. Annotation results are exported in GFF3 and International Nucleotide Sequence Database Collaboration (INSDC)-compliant flat files, as well as comprehensive JSON files, facilitating automated downstream analysis. We compared Bakta to other rapid contemporary command-line annotation software tools in both targeted and taxonomically broad benchmarks including isolates and metagenomic-assembled genomes. We demonstrated that Bakta outperforms other tools in terms of functional annotations, the assignment of functional categories and database cross-references, whilst providing comparable wall-clock runtimes. Bakta is implemented in Python 3 and runs on MacOS and Linux systems. It is freely available under a GPLv3 license at https://github.com/oschwengers/bakta. An accompanying web version is available at https://bakta.computational.bio.
Collapse
Affiliation(s)
- Oliver Schwengers
- Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen 35392, Germany
| | - Lukas Jelonek
- Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen 35392, Germany
| | - Marius Alfred Dieckmann
- Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen 35392, Germany
| | - Sebastian Beyvers
- Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen 35392, Germany
| | - Jochen Blom
- Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen 35392, Germany
| | - Alexander Goesmann
- Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen 35392, Germany
| |
Collapse
|
25
|
Fremin BJ, Nicolaou C, Bhatt AS. Simultaneous ribosome profiling of hundreds of microbes from the human microbiome. Nat Protoc 2021; 16:4676-4691. [PMID: 34381207 PMCID: PMC8750612 DOI: 10.1038/s41596-021-00592-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Accepted: 06/15/2021] [Indexed: 02/08/2023]
Abstract
Ribosome profiling enables sequencing of ribosome-bound fragments of RNA, revealing which transcripts are being translated as well as the position of ribosomes along mRNAs. Although ribosome profiling has been applied to cultured bacterial isolates, its application to uncultured, mixed communities has been challenging. We present MetaRibo-Seq, a protocol that enables the application of ribosome profiling directly to the human fecal microbiome. MetaRibo-Seq is a benchmarked method that includes several modifications to existing ribosome profiling protocols, specifically addressing challenges involving fecal sample storage, purity and input requirements. We also provide a computational workflow to quality control and trim reads, de novo assemble a reference metagenome with metagenomic reads, align MetaRibo-Seq reads to the reference, and assess MetaRibo-Seq library quality ( https://github.com/bhattlab/bhattlab_workflows/tree/master/metariboseq ). This MetaRibo-Seq protocol enables researchers in standard molecular biology laboratories to study translation in the fecal microbiome in ~5 d.
Collapse
Affiliation(s)
- Brayon J Fremin
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Cosmos Nicolaou
- Divisions of Hematology and Blood & Marrow Transplantation, Department of Medicine, Stanford University, Stanford, CA, USA
| | - Ami S Bhatt
- Department of Genetics, Stanford University, Stanford, CA, USA.
- Divisions of Hematology and Blood & Marrow Transplantation, Department of Medicine, Stanford University, Stanford, CA, USA.
| |
Collapse
|
26
|
Fremin BJ, Bhatt AS. Comparative genomics identifies thousands of candidate structured RNAs in human microbiomes. Genome Biol 2021; 22:100. [PMID: 33845850 PMCID: PMC8040213 DOI: 10.1186/s13059-021-02319-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Accepted: 03/19/2021] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND Structured RNAs play varied bioregulatory roles within microbes. To date, hundreds of candidate structured RNAs have been predicted using informatic approaches that search for motif structures in genomic sequence data. The human microbiome contains thousands of species and strains of microbes. Yet, much of the metagenomic data from the human microbiome remains unmined for structured RNA motifs primarily due to computational limitations. RESULTS We sought to apply a large-scale, comparative genomics approach to these organisms to identify candidate structured RNAs. With a carefully constructed, though computationally intensive automated analysis, we identify 3161 conserved candidate structured RNAs in intergenic regions, as well as 2022 additional candidate structured RNAs that may overlap coding regions. We validate the RNA expression of 177 of these candidate structures by analyzing small fragment RNA-seq data from four human fecal samples. CONCLUSIONS This approach identifies a wide variety of candidate structured RNAs, including tmRNAs, antitoxins, and likely ribosome protein leaders, from a wide variety of taxa. Overall, our pipeline enables conservative predictions of thousands of novel candidate structured RNAs from human microbiomes.
Collapse
Affiliation(s)
- Brayon J Fremin
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA
| | - Ami S Bhatt
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA.
- Department of Medicine (Hematology), Stanford University, Stanford, CA, 94305, USA.
| |
Collapse
|