1
|
Bigiş EZ, Yıldız E, Tagka A, Pavlopoulou A, Chrousos GP, Geronikolou S. Novel Minimal Absent Words Detected in Influenza A Virus. Viruses 2025; 17:659. [PMID: 40431670 PMCID: PMC12116108 DOI: 10.3390/v17050659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2025] [Revised: 04/26/2025] [Accepted: 04/28/2025] [Indexed: 05/29/2025] Open
Abstract
Influenza is a communicable disease caused by RNA viruses. Strains A (affecting animals, humans), B (affecting humans), C (affecting rarely humans and pigs), and D (affecting cattle) comprise a variety of substrains each. Influenza A strain, affecting both humans and animals, is considered the most infectious, causing pandemics. There is an emerging need for the accurate classification of the different influenza A virus (IAV) subtypes, elucidating their mode of infection, as well as their fast and accurate diagnosis. Notably, in recent years, oligomeric sequences (words) that are present in the pathogen genomes and entirely absent from the host human genome were suggested to provide robust biomarkers for virus classification and rapid detection. To this end, we performed updated phylogenetic analyses of the IAV hemagglutinin genes, focusing on the sub H1N1 and H5N1. More importantly, we applied in silico methods to identify minimum length "words" that exist consistently in the IAV genomes and are entirely absent from the human genome; these sequences identified in our current analysis may represent minimal signatures that can be utilized to distinguish IAV from other influenza viruses, as well as to perform rapid diagnostic tests.
Collapse
Affiliation(s)
- Elif Zülal Bigiş
- Izmir Biomedicine and Genome Center, 35340 Balçova, Izmir, Türkiye
- Izmir International Biomedicine and Genome Institute, Dokuz Eylül University, 35340 Balçova, Izmir, Türkiye
| | - Elif Yıldız
- Izmir Biomedicine and Genome Center, 35340 Balçova, Izmir, Türkiye
- Izmir International Biomedicine and Genome Institute, Dokuz Eylül University, 35340 Balçova, Izmir, Türkiye
| | - Anna Tagka
- First Department of Dermatology-Venereology, Medical School, National and Kapodistrian University of Athens, Andreas Syggros Hospital of Venereal & Dermatological Diseases, 10527 Athens, Greece
| | - Athanasia Pavlopoulou
- Izmir Biomedicine and Genome Center, 35340 Balçova, Izmir, Türkiye
- Izmir International Biomedicine and Genome Institute, Dokuz Eylül University, 35340 Balçova, Izmir, Türkiye
| | - George P. Chrousos
- University Research Institute of Maternal & Child Health & Precision Medicine, Medical School, National and Kapodistrian University of Athens, Levadeias 8, 10527 Athens, Greece
| | - Styliani Geronikolou
- University Research Institute of Maternal & Child Health & Precision Medicine, Medical School, National and Kapodistrian University of Athens, Levadeias 8, 10527 Athens, Greece
| |
Collapse
|
2
|
Shave S, Isaksson R, Pham NT, Elliott RJR, Dawson JC, Soudant J, Carragher NO, Auer M. Cellular Activity of CQWW Nullomer-Derived Peptides. ACS OMEGA 2025; 10:6794-6800. [PMID: 40028100 PMCID: PMC11865978 DOI: 10.1021/acsomega.4c08860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Revised: 01/07/2025] [Accepted: 01/23/2025] [Indexed: 03/05/2025]
Abstract
Analysis of observed protein sequences across all species within the UniProtKB/Swiss-Prot data set reveals CQWW as the shortest absent stretch of amino acids. While DNA can be found encoding the CQWW sequence, it has never been observed to be translated or included in manually curated sets of proteins, existing only in predicted, tentative sequences and in a single mature antibody sequence. We have synthesized this "nullomer" peptide, along with 13 derivatives, reversed, truncated, stereoisomers, and alanine-scanning peptides, conjugated to polyarginine stretches to increase cellular uptake. We observed their impact against a healthy neuronal line and six patient-derived glioblastoma cell lines spanning three clinical subtypes. Results reveal IC50 values averaging 4.9 μM for inhibition of cell survival across tested oncogenic cell lines. High-content phenotypic analysis of cellular features and reverse-phase protein arrays failed to discern a clear mode of action for the nullomer peptide but suggests mitochondrial impairment through the inhibition of GSK3 and isoforms, supported by observations of reduced mitochondrial stain intensities. With a recent increase in interest in nullomer peptides, we see the results in this study as a starting point for further investigation into this potentially therapeutic peptide class.
Collapse
Affiliation(s)
- Steven Shave
- Edinburgh
Cancer Research, Cancer Research UK Scotland Centre, Institute of
Genetics and Cancer, University of Edinburgh, Crewe Road South, Edinburgh EH4 2XR, U.K.
- School
of Biological Sciences, University of Edinburgh, The King’s Buildings, Edinburgh EH9 3BF, U.K.
| | - Rebecka Isaksson
- School
of Biological Sciences, University of Edinburgh, The King’s Buildings, Edinburgh EH9 3BF, U.K.
- Department
of Chemistry, University College London, 20 Gordon Street, London WC1H 0AJ, U.K.
| | - Nhan T. Pham
- Edinburgh
Cancer Research, Cancer Research UK Scotland Centre, Institute of
Genetics and Cancer, University of Edinburgh, Crewe Road South, Edinburgh EH4 2XR, U.K.
- School
of Biological Sciences, University of Edinburgh, The King’s Buildings, Edinburgh EH9 3BF, U.K.
- College
of Medicine and Veterinary Medicine, University
of Edinburgh, Institute for Regeneration and Repair, 4-5 Little France Drive, Edinburgh EH16 4UU, U.K.
| | - Richard J. R. Elliott
- Edinburgh
Cancer Research, Cancer Research UK Scotland Centre, Institute of
Genetics and Cancer, University of Edinburgh, Crewe Road South, Edinburgh EH4 2XR, U.K.
| | - John C. Dawson
- Edinburgh
Cancer Research, Cancer Research UK Scotland Centre, Institute of
Genetics and Cancer, University of Edinburgh, Crewe Road South, Edinburgh EH4 2XR, U.K.
| | - Julius Soudant
- Edinburgh
Cancer Research, Cancer Research UK Scotland Centre, Institute of
Genetics and Cancer, University of Edinburgh, Crewe Road South, Edinburgh EH4 2XR, U.K.
- Departamento
de Farmacologia, Facultad de Medicina, Universidad
Autónoma de Madrid, Calle Arzobispo Morcillo 4, Madrid 28029, Spain
| | - Neil O. Carragher
- Edinburgh
Cancer Research, Cancer Research UK Scotland Centre, Institute of
Genetics and Cancer, University of Edinburgh, Crewe Road South, Edinburgh EH4 2XR, U.K.
| | - Manfred Auer
- School
of Biological Sciences, University of Edinburgh, The King’s Buildings, Edinburgh EH9 3BF, U.K.
| |
Collapse
|
3
|
Bochalis E, Patsakis M, Chantzi N, Mouratidis I, Chartoumpekis D, Georgakopoulos-Soares I. Unraveling diversity by isolating peptide sequences specific to distinct taxonomic groups. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.02.05.636664. [PMID: 39975352 PMCID: PMC11839104 DOI: 10.1101/2025.02.05.636664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]
Abstract
The identification of succinct, universal fingerprints that enable the characterization of individual taxonomies can reveal insights into trait development and can have widespread applications in pathogen diagnostics, human healthcare, ecology and the characterization of biomes. Here, we investigated the existence of peptide k-mer sequences that are exclusively present in a specific taxonomy and absent in every other taxonomic level, termed taxonomic quasi-primes. By analyzing proteomes across 24,073 species, we identified quasi-prime peptides specific to superkingdoms, kingdoms, and phyla, uncovering their taxonomic distributions and functional relevance. These peptides exhibit remarkable sequence uniqueness at six- and seven-amino-acid lengths, offering insights into evolutionary divergence and lineage-specific adaptations. Moreover, we show that human quasi-prime loci are more prone to harboring pathogenic variants, underscoring their functional significance. This study introduces taxonomic quasi-primes and offers insights into their contributions to proteomic diversity, evolutionary pathways, and functional adaptations across the tree of life, while emphasizing their potential impact on human health and disease.
Collapse
Affiliation(s)
- Eleftherios Bochalis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Department of Internal Medicine, Division of Endocrinology, Medical School, University of Patras, Patras, Greece
| | - Michail Patsakis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Nikol Chantzi
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Ioannis Mouratidis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Dionysios Chartoumpekis
- Department of Internal Medicine, Division of Endocrinology, Medical School, University of Patras, Patras, Greece
| | - Ilias Georgakopoulos-Soares
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| |
Collapse
|
4
|
Chan CS, Mouratidis I, Montgomery A, Tsiatsianis GC, Chantzi N, Hemberg M, Ahituv N, Georgakopoulos-Soares I. The topography of nullomer-emerging mutations and their relevance to human disease. Comput Struct Biotechnol J 2024; 30:1-11. [PMID: 39839549 PMCID: PMC11745800 DOI: 10.1016/j.csbj.2024.12.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2024] [Revised: 12/20/2024] [Accepted: 12/22/2024] [Indexed: 01/23/2025] Open
Abstract
Nullomers are short DNA sequences (11-18 base pairs) that are absent from a genome; however, they can emerge due to mutations. Here, we characterize all possible putative human nullomer-emerging single base pair mutations, population variants and disease-causing mutations. We find that the primary determinants of nullomer emergence in the human genome are the presence of CpG dinucleotides and methylated cytosines. Putative nullomer-emerging mutations are enriched at specific genomic elements, including transcription start and end sites, splice sites and transcription factor binding sites. We also observe that putative nullomer-emerging mutations are more frequent in highly conserved regions and show preferential location at nucleosomes. Among repeat elements, Alu repeats exhibit pronounced enrichment for putative nullomer-emerging mutations at specific positions. Finally, we find that disease-associated pathogenic mutations are significantly more likely to cause emergence of nullomers than their benign counterparts.
Collapse
Affiliation(s)
- Candace S.Y. Chan
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Ioannis Mouratidis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Austin Montgomery
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Georgios Christos Tsiatsianis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Nikol Chantzi
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Martin Hemberg
- Gene Lay Institute of Immunology and Inflammation, Brigham and Women’s Hospital, Massachusetts General Hospital, and Harvard Medical School, Boston, MA, USA
| | - Nadav Ahituv
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| | - Ilias Georgakopoulos-Soares
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| |
Collapse
|
5
|
Moeckel C, Mareboina M, Konnaris MA, Chan CS, Mouratidis I, Montgomery A, Chantzi N, Pavlopoulos GA, Georgakopoulos-Soares I. A survey of k-mer methods and applications in bioinformatics. Comput Struct Biotechnol J 2024; 23:2289-2303. [PMID: 38840832 PMCID: PMC11152613 DOI: 10.1016/j.csbj.2024.05.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 05/14/2024] [Accepted: 05/15/2024] [Indexed: 06/07/2024] Open
Abstract
The rapid progression of genomics and proteomics has been driven by the advent of advanced sequencing technologies, large, diverse, and readily available omics datasets, and the evolution of computational data processing capabilities. The vast amount of data generated by these advancements necessitates efficient algorithms to extract meaningful information. K-mers serve as a valuable tool when working with large sequencing datasets, offering several advantages in computational speed and memory efficiency and carrying the potential for intrinsic biological functionality. This review provides an overview of the methods, applications, and significance of k-mers in genomic and proteomic data analyses, as well as the utility of absent sequences, including nullomers and nullpeptides, in disease detection, vaccine development, therapeutics, and forensic science. Therefore, the review highlights the pivotal role of k-mers in addressing current genomic and proteomic problems and underscores their potential for future breakthroughs in research.
Collapse
Affiliation(s)
- Camille Moeckel
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Manvita Mareboina
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Maxwell A. Konnaris
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Candace S.Y. Chan
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Ioannis Mouratidis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Penn State University, University Park, Pennsylvania, USA
| | - Austin Montgomery
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Nikol Chantzi
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | | | - Ilias Georgakopoulos-Soares
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institute of the Life Sciences, Penn State University, University Park, Pennsylvania, USA
| |
Collapse
|
6
|
Mouratidis I, Baltoumas FA, Chantzi N, Patsakis M, Chan CS, Montgomery A, Konnaris MA, Aplakidou E, Georgakopoulos GC, Das A, Chartoumpekis DV, Kovac J, Pavlopoulos GA, Georgakopoulos-Soares I. kmerDB: A database encompassing the set of genomic and proteomic sequence information for each species. Comput Struct Biotechnol J 2024; 23:1919-1928. [PMID: 38711760 PMCID: PMC11070822 DOI: 10.1016/j.csbj.2024.04.050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 04/17/2024] [Accepted: 04/18/2024] [Indexed: 05/08/2024] Open
Abstract
The decrease in sequencing expenses has facilitated the creation of reference genomes and proteomes for an expanding array of organisms. Nevertheless, no established repository that details organism-specific genomic and proteomic sequences of specific lengths, referred to as kmers, exists to our knowledge. In this article, we present kmerDB, a database accessible through an interactive web interface that provides kmer-based information from genomic and proteomic sequences in a systematic way. kmerDB currently contains 202,340,859,107 base pairs and 19,304,903,356 amino acids, spanning 54,039 and 21,865 reference genomes and proteomes, respectively, as well as 6,905,362 and 149,305,183 genomic and proteomic species-specific sequences, termed quasi-primes. Additionally, we provide access to 5,186,757 nucleic and 214,904,089 peptide sequences absent from every genome and proteome, termed primes. kmerDB features a user-friendly interface offering various search options and filters for easy parsing and searching. The service is available at: www.kmerdb.com.
Collapse
Affiliation(s)
- Ioannis Mouratidis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, USA
| | - Fotis A. Baltoumas
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, 16672, Greece
| | - Nikol Chantzi
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Michail Patsakis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Candace S.Y. Chan
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Austin Montgomery
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Maxwell A. Konnaris
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, USA
- Department of Statistics, The Pennsylvania State University, University Park, PA, USA
| | - Eleni Aplakidou
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, 16672, Greece
- Department of Basic Sciences, School of Medicine, University of Crete, Heraklion, Greece
| | - George C. Georgakopoulos
- National Technical University of Athens, School of Electrical and Computer Engineering, Athens, Greece
| | - Anshuman Das
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Dionysios V. Chartoumpekis
- Service of Endocrinology, Diabetology and Metabolism, Lausanne University Hospital, Lausanne, Switzerland
| | - Jasna Kovac
- Department of Food Science, The Pennsylvania State University, University Park, PA 16802, USA
| | - Georgios A. Pavlopoulos
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, 16672, Greece
- Center for New Biotechnologies and Precision Medicine, School of Medicine, National and Kapodistrian University of Athens, Athens, 11527, Greece
| | - Ilias Georgakopoulos-Soares
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| |
Collapse
|
7
|
Montgomery A, Tsiatsianis GC, Mouratidis I, Chan CSY, Athanasiou M, Papanastasiou AD, Kantere V, Syrigos N, Vathiotis I, Syrigos K, Yee NS, Georgakopoulos-Soares I. Utilizing nullomers in cell-free RNA for early cancer detection. Cancer Gene Ther 2024; 31:861-870. [PMID: 38351138 PMCID: PMC11192629 DOI: 10.1038/s41417-024-00741-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 01/25/2024] [Accepted: 01/26/2024] [Indexed: 06/23/2024]
Abstract
Early detection of cancer can significantly improve patient outcomes; however, sensitive and highly specific biomarkers for cancer detection are currently missing. Nullomers are the shortest sequences that are absent from the human genome but can emerge due to somatic mutations in cancer. We examine over 10,000 whole exome sequencing matched tumor-normal samples to characterize nullomer emergence across exonic regions of the genome. We also identify nullomer emerging mutational hotspots within tumor genes. Finally, we provide evidence for the identification of nullomers in cell-free RNA from peripheral blood samples, enabling detection of multiple tumor types. We show multiple tumor classification models with an AUC greater than 0.9, including a hepatocellular carcinoma classifier with an AUC greater than 0.99.
Collapse
Affiliation(s)
- Austin Montgomery
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Georgios Christos Tsiatsianis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- School of Electrical and Computer Engineering, National Technical University of Athens, Athens, Greece
| | - Ioannis Mouratidis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Candace S Y Chan
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| | - Maria Athanasiou
- School of Electrical and Computer Engineering, National Technical University of Athens, Athens, Greece
| | | | - Verena Kantere
- School of Electrical and Computer Engineering, National Technical University of Athens, Athens, Greece
| | - Nikos Syrigos
- Third Department of Internal Medicine, Sotiria Hospital, National and Kapodistrian University of Athens, School of Medicine, Athens, Greece
| | - Ioannis Vathiotis
- Third Department of Internal Medicine, Sotiria Hospital, National and Kapodistrian University of Athens, School of Medicine, Athens, Greece
| | - Konstantinos Syrigos
- Third Department of Internal Medicine, Sotiria Hospital, National and Kapodistrian University of Athens, School of Medicine, Athens, Greece
| | - Nelson S Yee
- Next Generation Therapies Program, Penn State Cancer Institute; Division of Hematology-Oncology, Department of Medicine, Penn State Health Milton S. Hershey Medical Center, Hershey, PA, USA
| | - Ilias Georgakopoulos-Soares
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA.
| |
Collapse
|
8
|
Tsiatsianis GC, Chan CSY, Mouratidis I, Chantzi N, Tsiatsiani AM, Yee NS, Zaravinos A, Kantere V, Georgakopoulos-Soares I. Peptide absent sequences emerging in human cancers. Eur J Cancer 2024; 196:113421. [PMID: 37952501 DOI: 10.1016/j.ejca.2023.113421] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 11/01/2023] [Accepted: 11/01/2023] [Indexed: 11/14/2023]
Abstract
Early diagnosis of cancer can significantly improve survival of cancer patients; however sensitive and highly specific biomarkers for cancer detection are currently lacking for most cancer types. Nullpeptides are short peptides that are absent from the human proteome. Here, we examined the emergence of nullpeptides during cancer development. We analyzed 3,600,964 somatic mutations across 10,064 whole exome sequencing tumor samples spanning 32 cancer types. We analyze RNA-seq data from primary tumor samples to identify the subset of nullpeptides that emerge in highly expresed genes. We show that nullpeptides, and particularly the subset that is highly recurrent across cancer patients, can be identified in tumor biopsy samples. We find that cancer genes show an excess of nullpeptides and detect nullpeptide hotspots in specific loci of oncogenes and tumor suppressors. We also observe that recurrent nullpeptides are more likely to be found in neoantigens, which have been shown to be effective targets for immunotherapy, suggesting that they can be used to prioritize candidates. Our findings provide evidence for the utility of nullpeptides as cancer detection and therapeutic biomarkers.
Collapse
Affiliation(s)
- Georgios Christos Tsiatsianis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA; National Technical University of Athens, School of Electrical and Computer Engineering, Athens, Greece
| | - Candace S Y Chan
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Ioannis Mouratidis
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Nikol Chantzi
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Anna Maria Tsiatsiani
- National Technical University of Athens, School of Electrical and Computer Engineering, Athens, Greece; School of Medicine, National and Kapodistrian University of Athens, Athens, Greece
| | - Nelson S Yee
- Division of Hematology-Oncology, Department of Medicine, Penn State Health Milton S. Hershey Medical Center, Next-Generation Therapies Program, Penn State Cancer Institute, Hershey, PA, USA
| | - Apostolos Zaravinos
- Department of Life Sciences, School of Sciences, European University Cyprus, Nicosia 1516, Cyprus; Cancer Genetics, Genomics and Systems Biology Laboratory, Basic and Translational Cancer Research Center (BTCRC), Nicosia 1516, Cyprus
| | - Verena Kantere
- School of Electrical Engineering and Computer Science, Faculty of Engineering, University of Ottawa, Canada
| | - Ilias Georgakopoulos-Soares
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA.
| |
Collapse
|
9
|
Ali N, Wolf C, Kanchan S, Veerabhadraiah SR, Bond L, Turner MW, Jorcyk CL, Hampikian G. 9S1R nullomer peptide induces mitochondrial pathology, metabolic suppression, and enhanced immune cell infiltration, in triple-negative breast cancer mouse model. Biomed Pharmacother 2024; 170:115997. [PMID: 38118350 PMCID: PMC10872342 DOI: 10.1016/j.biopha.2023.115997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2023] [Revised: 12/04/2023] [Accepted: 12/06/2023] [Indexed: 12/22/2023] Open
Abstract
Nullomers are the shortest strings of absent amino acid (aa) sequences in a species or group of species. Primes are those nullomers that have not been detected in the genome of any species. 9S1R is a 5-aa peptide prime sequence attached to 5-arginine aa, used to treat triple negative breast cancer (TNBC) in an in vivo mouse model. This unique peptide, administered with a trehalose carrier (9S1R-NulloPT), offers enhanced solubility and exhibits distinct anti-cancer effects against TNBC. In our study, we investigated the effect of 9S1R-NulloPT on tumor growth, metabolism, metastatic burden, tumor immune-microenvironment (TME), and transcriptome of aggressive mouse TNBC tumors. Notably, treated mice had smaller tumors in the initial phase of the treatment, as compared to untreated control, and diminished in vivo and ex vivo bioluminescence at later-stages - indicative of metabolically quiescent, dying tumors. The treatment also caused changes in TME with increased infiltration of immune cells and altered tumor transcriptome, with 365 upregulated genes and 710 downregulated genes. Consistent with in vitro data, downregulated genes were enriched in cellular metabolic processes (179), specifically mitochondrial TCA cycle/oxidative phosphorylation (44), and translation machinery/ribosome biogenesis (45). The upregulated genes were associated with the developmental (13), ECM organization (12) and focal adhesion pathways (7). In conclusion, our study demonstrates that 9S1R-NulloPT effectively reduced tumor growth during its initial phase, altering the TME and tumor transcriptome. The treatment induced mitochondrial pathology which led to a metabolic deceleration in tumors, aligning with in vitro observations.
Collapse
Affiliation(s)
- Nilufar Ali
- Department of Biological Sciences, Boise State University, Boise, ID, USA.
| | - Cody Wolf
- Department of Biological Sciences, Boise State University, Boise, ID, USA; Biomolecular Sciences Graduate Programs, Boise State University, Boise, ID, USA
| | - Swarna Kanchan
- Department of Biological Sciences, Boise State University, Boise, ID, USA; Department of Biomedical Sciences, Jaon C. Edwards School of Medicine, Marshall University, Huntington, WV, USA
| | - Shivakumar R Veerabhadraiah
- Department of Orthopaedics, University of Utah, Salt Lake City, UT, USA; Biomolecular Sciences Graduate Programs, Boise State University, Boise, ID, USA
| | - Laura Bond
- Center of Biomedical Research Excellence in Matrix Biology, Boise State University, Boise, ID, USA
| | - Matthew W Turner
- Biomolecular Research Center, Boise State University, Boise, ID, USA; Biomolecular Sciences Graduate Programs, Boise State University, Boise, ID, USA
| | - Cheryl L Jorcyk
- Department of Biological Sciences, Boise State University, Boise, ID, USA; Biomolecular Research Center, Boise State University, Boise, ID, USA; Biomolecular Sciences Graduate Programs, Boise State University, Boise, ID, USA
| | - Greg Hampikian
- Department of Biological Sciences, Boise State University, Boise, ID, USA.
| |
Collapse
|
10
|
Ali N, Wolf C, Kanchan S, Veerabhadraiah SR, Bond L, Turner MW, Jorcyk CL, Hampikian G. Nullomer peptide increases immune cell infiltration and reduces tumor metabolism in triple negative breast cancer mouse model. RESEARCH SQUARE 2023:rs.3.rs-3097552. [PMID: 37461536 PMCID: PMC10350184 DOI: 10.21203/rs.3.rs-3097552/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/27/2023]
Abstract
Background Nullomers are the shortest strings of absent amino acid (aa) sequences in a species or group of species. Primes are those nullomers that have not been detected in the genome of any species. 9S1R is a 5-aa peptide derived from a prime sequence that is tagged with 5 arginine aa, used to treat triple negative breast cancer (TNBC) in an in vivo TNBC mouse model. 9S1R is administered in trehalose (9S1R-NulloPT), which enhances solubility and exhibits some independent effects against tumor growth and is thus an important component in the drug preparation. Method We examined the effect of 9S1R-NulloPT on tumor growth, metabolism, metastatic burden, necrosis, tumor immune microenvironment, and the transcriptome of aggressive mouse TNBC tumors. Results The peptide-treated mice had smaller tumors in the initial phase of the treatment, as compared to the untreated control, and reduced in vivo bioluminescence at later stages, which is indicative of metabolically inactive tumors. A decrease in ex vivo bioluminescence was also observed in the excised tumors of treated mice, but not in the secondary metastasis in the lungs. The treatment also caused changes in tumor immune microenvironment with increased infiltration of immune cells and margin inflammation. The treatment upregulated 365 genes and downregulated 710 genes in tumors compared to the untreated group. Consistent with in vitro findings in breast cancer cell lines, downregulated genes in the treated TNBC tumors include Cellular Metabolic Process Related genes (179), specifically mitochondrial genes associated with TCA cycle/oxidative phosphorylation (44), and translation machinery/ribosome biogenesis genes (45). Among upregulated genes, the Developmental Pathway (13), ECM Organization (12) and Focal Adhesion Related Pathways (7) were noteworthy. We also present data from a pilot study using a bilateral BC mouse model, which supports our findings. Conclusion In conclusion, although 9S1R-NulloPT was moderate at reducing the tumor volume, it altered the tumor immune microenvironment as well as the tumor transcriptome, rendering tumors metabolically less active by downregulating the mitochondrial function and ribosome biogenesis. This corroborates previously published in vitro findings.
Collapse
|
11
|
Mouratidis I, Chan CY, Chantzi N, Tsiatsianis G, Hemberg M, Ahituv N, Georgakopoulos-Soares I. Quasi-prime peptides: identification of the shortest peptide sequences unique to a species. NAR Genom Bioinform 2023; 5:lqad039. [PMID: 37101657 PMCID: PMC10124967 DOI: 10.1093/nargab/lqad039] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Revised: 03/02/2023] [Accepted: 04/06/2023] [Indexed: 04/28/2023] Open
Abstract
Determining the organisms present in a biosample has many important applications in agriculture, wildlife conservation, and healthcare. Here, we develop a universal fingerprint based on the identification of short peptides that are unique to a specific organism. We define quasi-prime peptides as sequences that are found in only one species, and we analyzed proteomes from 21 875 species, from viruses to humans, and annotated the smallest peptide kmer sequences that are unique to a species and absent from all other proteomes. We also perform simulations across all reference proteomes and observe a lower than expected number of peptide kmers across species and taxonomies, indicating an enrichment for nullpeptides, sequences absent from a proteome. For humans, we find that quasi-primes are found in genes enriched for specific gene ontology terms, including proteasome and ATP and GTP catalysis. We also provide a set of quasi-prime peptides for a number of human pathogens and model organisms and further showcase its utility via two case studies for Mycobacterium tuberculosis and Vibrio cholerae, where we identify quasi-prime peptides in two transmembrane and extracellular proteins with relevance for pathogen detection. Our catalog of quasi-prime peptides provides the smallest unit of information that is specific to a single organism at the protein level, providing a versatile tool for species identification.
Collapse
Affiliation(s)
- Ioannis Mouratidis
- Department of Biochemistry and Molecular Biology, Penn State College of Medicine, Hershey, PA, USA
- Department of Engineering Science, KU Leuven, Leuven, Belgium
| | - Candace S Y Chan
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| | - Nikol Chantzi
- Department of Biochemistry and Molecular Biology, Penn State College of Medicine, Hershey, PA, USA
| | - Georgios Christos Tsiatsianis
- Department of Biochemistry and Molecular Biology, Penn State College of Medicine, Hershey, PA, USA
- National Technical University of Athens, School of Electrical and Computer Engineering, Athens, Greece
| | - Martin Hemberg
- Evergrande Center for Immunologic Diseases, Harvard Medical School and Brigham and Women's Hospital, Boston, USA
| | - Nadav Ahituv
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| | | |
Collapse
|
12
|
Santonia D, Felici G. An immunological glimpse of human virus peptides: distance from self, MHC class I binding, Proteasome Cleveage, TAP Transport and sequence composition entropy. Virus Res 2022; 317:198814. [PMID: 35588940 DOI: 10.1016/j.virusres.2022.198814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 05/13/2022] [Accepted: 05/15/2022] [Indexed: 10/18/2022]
Abstract
Adaptive immune response is triggered when specific pathogen peptides called epitopes are recognised as exogenous according to the paradigm of self/non-self. To be recognized by immune cells, epitopes have to be exposed (presented) on the surface of the cell. Predicting if a peptide is exposed is important to shed light on the rules that govern immune response, and thus to identify potential targets, and to design vaccine and drugs. We focused on peptides exposed on cell surface and made accessible to immune system through the MHC Class I complex. Before this can happen, three successive selection steps have to take place: a) Proteasome cleveage, b) TAP Transport, and c) binding to MHC-class I. Starting from a set of 211 host human reference viruses, we computed the set of unique peptides occurring in the correspondent proteomes. Then, we obtained the probability values of Proteasome Cleveage, TAP Transport and Binding to MHC Class I associated to those peptides through established prediction software tools. Such values were analysed in conjunction with two other features that could play a major role: the distance from self, strictly linked to the concept of nullomers, and the sequence entropy, measuring the complexity of the peptide amino acid composition. The analysis confirmed and extended previous results on a larger, more significant and consistent data set; we showed that the higher the distances from self, the higher the score of TAP Transport and binding to MHC class I; no significant association was instead found between distance from self and Proteasome Cleveage. Additionally, amino acid peptide composition entropy was significantly associated with the other features. In particular, higher entropies were linked with higher scores of Proteasome Cleveage, TAP Transport, Binding to MHC Class I, and higher distance from self. The relationship among the three selection steps provided evidence of a tight correlation among them, clearly suggesting it could be the product of a co-evolutive process. We believe that these results give new insights on the complex processes that regulate peptide presentation through MHC class I, and unveil the mechanisms the allow the immune system to distinguish self and viral non-self peptides.
Collapse
Affiliation(s)
- Daniele Santonia
- Institute for System Analysis and Computer Science "Antonio Ruberti", National Research Council of Italy, Via dei Taurini 19, Rome 00185, Italy.
| | - Giovanni Felici
- Institute for System Analysis and Computer Science "Antonio Ruberti", National Research Council of Italy, Via dei Taurini 19, Rome 00185, Italy
| |
Collapse
|
13
|
Georgakopoulos-Soares I, Yizhar-Barnea O, Mouratidis I, Hemberg M, Ahituv N. Absent from DNA and protein: genomic characterization of nullomers and nullpeptides across functional categories and evolution. Genome Biol 2021; 22:245. [PMID: 34433494 PMCID: PMC8386077 DOI: 10.1186/s13059-021-02459-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Accepted: 08/09/2021] [Indexed: 11/13/2022] Open
Abstract
Nullomers and nullpeptides are short DNA or amino acid sequences that are absent from a genome or proteome, respectively. One potential cause for their absence could be their having a detrimental impact on an organism. RESULTS: Here, we identify all possible nullomers and nullpeptides in the genomes and proteomes of thirty eukaryotes and demonstrate that a significant proportion of these sequences are under negative selection. We also identify nullomers that are unique to specific functional categories: coding sequences, exons, introns, 5'UTR, 3'UTR, promoters, and show that coding sequence and promoter nullomers are most likely to be selected against. By analyzing all protein sequences across the tree of life, we further identify 36,081 peptides up to six amino acids in length that do not exist in any known organism, termed primes. We next characterize all possible single base pair mutations that can lead to the appearance of a nullomer in the human genome, observing a significantly higher number of mutations than expected by chance for specific nullomer sequences in transposable elements, likely due to their suppression. We also annotate nullomers that appear due to naturally occurring variants and show that a subset of them can be used to distinguish between different human populations. Analysis of nullomers and nullpeptides across vertebrate evolution shows they can also be used as phylogenetic classifiers. CONCLUSIONS: We provide a catalog of nullomers and nullpeptides in distinct functional categories, develop methods to systematically study them, and highlight the use of variability in these sequences in other analyses.
Collapse
Affiliation(s)
- Ilias Georgakopoulos-Soares
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| | - Ofer Yizhar-Barnea
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| | - Ioannis Mouratidis
- Department of Computer Science, Katholieke Universiteit Leuven, Leuven, Belgium
| | - Martin Hemberg
- Evergrande Center for Immunologic Diseases, Harvard Medical School and Brigham and Women's Hospital, Boston, MA, USA.
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK.
| | - Nadav Ahituv
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA.
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA.
| |
Collapse
|
14
|
Koulouras G, Frith MC. Significant non-existence of sequences in genomes and proteomes. Nucleic Acids Res 2021; 49:3139-3155. [PMID: 33693858 PMCID: PMC8034619 DOI: 10.1093/nar/gkab139] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Revised: 02/11/2021] [Accepted: 02/25/2021] [Indexed: 12/22/2022] Open
Abstract
Minimal absent words (MAWs) are minimal-length oligomers absent from a genome or proteome. Although some artificially synthesized MAWs have deleterious effects, there is still a lack of a strategy for the classification of non-occurring sequences as potentially malicious or benign. In this work, by using Markovian models with multiple-testing correction, we reveal significant absent oligomers, which are statistically expected to exist. This suggests that their absence is due to negative selection. We survey genomes and proteomes covering the diversity of life and find thousands of significant absent sequences. Common significant MAWs are often mono- or dinucleotide tracts, or palindromic. Significant viral MAWs are often restriction sites and may indicate unknown restriction motifs. Surprisingly, significant mammal genome MAWs are often present, but rare, in other mammals, suggesting that they are suppressed but not completely forbidden. Significant human MAWs are frequently present in prokaryotes, suggesting immune function, but rarely present in human viruses, indicating viral mimicry of the host. More than one-fourth of human proteins are one substitution away from containing a significant MAW, with the majority of replacements being predicted harmful. We provide a web-based, interactive database of significant MAWs across genomes and proteomes.
Collapse
Affiliation(s)
- Grigorios Koulouras
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-3-26 Aomi, Koto-ku, Tokyo 135-0064, Japan
| | - Martin C Frith
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-3-26 Aomi, Koto-ku, Tokyo 135-0064, Japan
- Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba, Japan
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), AIST, Shinjuku-ku, Tokyo, Japan
| |
Collapse
|
15
|
Vergni D, Gaudio R, Santoni D. The farther the better: Investigating how distance from human self affects the propensity of a peptide to be presented on cell surface by MHC class I molecules, the case of Trypanosoma cruzi. PLoS One 2020; 15:e0243285. [PMID: 33284846 PMCID: PMC7721184 DOI: 10.1371/journal.pone.0243285] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2020] [Accepted: 11/19/2020] [Indexed: 12/04/2022] Open
Abstract
More than twenty years ago the reverse vaccinology paradigm came to light trying to design new vaccines based on the analysis of genomic information in order to select those pathogen peptides able to trigger an immune response. In this context, focusing on the proteome of Trypanosoma cruzi, we investigated the link between the probabilities for pathogen peptides to be presented on a cell surface and their distance from human self. We found a reasonable but, as far as we know, undiscovered property: the farther the distance between a peptide and the human-self the higher the probability for that peptide to be presented on a cell surface. We also found that the most distant peptides from human self bind, on average, a broader collection of HLAs than expected, implying a potential immunological role in a large portion of individuals. Finally, introducing a novel quantitative indicator for a peptide to measure its potential immunological role, we proposed a pool of peptides that could be potential epitopes and that can be suitable for experimental testing. The software to compute peptide classes according to the distance from human self is free available at http://www.iasi.cnr.it/~dsantoni/nullomers.
Collapse
Affiliation(s)
- Davide Vergni
- Istituto per le Applicazioni del Calcolo “Mauro Picone” - CNR, Rome, Italy
| | - Rosanna Gaudio
- Department of Biology, University Tor Vergata, Rome, Italy
| | - Daniele Santoni
- Istituto di Analisi dei Sistemi ed Informatica “Antonio Ruberti” - CNR, Rome, Italy
- * E-mail:
| |
Collapse
|
16
|
In the search of potential epitopes for Wuhan seafood market pneumonia virus using high order nullomers. J Immunol Methods 2020; 481-482:112787. [PMID: 32335161 PMCID: PMC7179506 DOI: 10.1016/j.jim.2020.112787] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2020] [Accepted: 03/22/2020] [Indexed: 11/21/2022]
Abstract
Alarms periodically emerge for viral pneumonia infections due to coronavirus. In all cases, these are zoonoses passing the barrier between species and infect humans. The legitimate concern of the international community is due to the fact that the new identified coronavirus, named SARS-CoV-2 (previously called 2019-nCoV), has a quite high mortality rate, around 2%, and a strong ability to spread, with an estimated reproduction number higher than 2. Even though all countries are doing their utmost to stop the pandemic, the only reliable solution to tackle the infection is the rapid development of a vaccine. For this purpose, the means of bioinformatics, applied in the context of reverse-vaccinology paradigm, can be of fundamental help to select the most promising peptides able to trigger an effective immune response. In this short report, using the concept of nullomer and introducing a distance from human self, we provide a list of peptides that could deserve experimental investigation in the view of a potential vaccine for SARS-CoV-2.
Collapse
|
17
|
Mittal A, Changani AM, Taparia S. What limits the primary sequence space of natural proteins? J Biomol Struct Dyn 2019; 38:4579-4583. [DOI: 10.1080/07391102.2019.1682051] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Affiliation(s)
- Aditya Mittal
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi (IIT Delhi), New Delhi, India
| | | | - Sakshi Taparia
- Department of Mathematics, Bachelors Program in Mathematics & Computing, Indian Institute of Technology Delhi (IIT Delhi), New Delhi, India
| |
Collapse
|
18
|
Viral peptides-MHC interaction: Binding probability and distance from human peptides. J Immunol Methods 2018; 459:35-43. [PMID: 29800577 DOI: 10.1016/j.jim.2018.05.009] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Revised: 03/26/2018] [Accepted: 05/09/2018] [Indexed: 11/23/2022]
Abstract
Identification of peptides binding to MHC class I complex can play a crucial role in retrieving potential targets able to trigger an immune response. Affinity binding of viral peptides can be estimated through effective computational methods that in the most of cases are based on machine learning approach. Achieving a better insight into peptide features that impact on the affinity binding rate is a challenging issue. In the present work we focused on 9-mer peptides of Human immunodeficiency virus type 1 and Human herpes simplex virus 1, studying their binding to MHC class I. Viral 9-mers were partitioned into different classes, where each class is characterized by how far (in terms of mutation steps) the peptides belonging to that class are from human 9-mers. Viral 9-mers were partitioned in different classes, based on the number of mutation steps they are far from human 9-mers. We showed that the overall binding probability significantly differs among classes, and it typically increases as the distance, computed in terms of number of mutation steps from the human set of 9-mers, increases. The binding probability is particularly high when considering viral 9-mers that are far from all human 9-mers more than three mutation steps. A further evidence, providing significance to those special viral peptides and suggesting a potential role they can play, comes from the analysis of their distribution along viral genomes, as it revealed they are not randomly located, but they preferentially occur in specific genes.
Collapse
|