1
|
Tharanga S, Ünlü ES, Hu Y, Sjaugi MF, Çelik MA, Hekimoğlu H, Miotto O, Öncel MM, Khan AM. DiMA: sequence diversity dynamics analyser for viruses. Brief Bioinform 2024; 26:bbae607. [PMID: 39592151 PMCID: PMC11596295 DOI: 10.1093/bib/bbae607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Revised: 09/22/2024] [Accepted: 11/13/2024] [Indexed: 11/28/2024] Open
Abstract
Sequence diversity is one of the major challenges in the design of diagnostic, prophylactic, and therapeutic interventions against viruses. DiMA is a novel tool that is big data-ready and designed to facilitate the dissection of sequence diversity dynamics for viruses. DiMA stands out from other diversity analysis tools by offering various unique features. DiMA provides a quantitative overview of sequence (DNA/RNA/protein) diversity by use of Shannon's entropy corrected for size bias, applied via a user-defined k-mer sliding window to an input alignment file, and each k-mer position is dissected to various diversity motifs. The motifs are defined based on the probability of distinct sequences at a given k-mer alignment position, whereby an index is the predominant sequence, while all the others are (total) variants to the index. The total variants are sub-classified into the major (most common) variant, minor variants (occurring more than once and of incidence lower than the major), and the unique (singleton) variants. DiMA allows user-defined, sequence metadata enrichment for analyses of the motifs. The application of DiMA was demonstrated for the alignment data of the relatively conserved Spike protein (2,106,985 sequences) of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and the relatively highly diverse pol gene (2637) of the human immunodeficiency virus-1 (HIV-1). The tool is publicly available as a web server (https://dima.bezmialem.edu.tr), as a Python library (via PyPi) and as a command line client (via GitHub).
Collapse
Affiliation(s)
- Shan Tharanga
- Centre for Bioinformatics, School of Data Sciences, Perdana University, MAEPS Building, Jalan MAEPS Perdana, Serdang, Kuala Lumpur 50490, Malaysia
| | - Eyyüb Selim Ünlü
- Istanbul Faculty of Medicine, Istanbul University, Turgut Özal Millet St, Topkapi, Istanbul 34093, Türkiye
- Genome Surveillance Unit, Wellcome Sanger Institute, Mill Ln, Hinxton, Saffron Walden CB10 1SA, United Kingdom
| | - Yongli Hu
- Centre for Bioinformatics, School of Data Sciences, Perdana University, MAEPS Building, Jalan MAEPS Perdana, Serdang, Kuala Lumpur 50490, Malaysia
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, 8 Medical Drive, Singapore 117597, Singapore
| | - Muhammad Farhan Sjaugi
- Centre for Bioinformatics, School of Data Sciences, Perdana University, MAEPS Building, Jalan MAEPS Perdana, Serdang, Kuala Lumpur 50490, Malaysia
| | - Muhammet A Çelik
- Celik Sarayı, Yeni Elektrik Santral St. No:29/2, Meram, Konya 42090, Türkiye
| | - Hilal Hekimoğlu
- Beykoz Institute of Life Sciences and Biotechnology, Bezmialem Vakif University, Ali Ihsan Kalmaz St., No.10 Beykoz, Istanbul 34820, Türkiye
| | - Olivo Miotto
- Nuffield Department of Clinical Medicine, University of Oxford, Old Road, Old Road Campus, Oxford OX3 7LF, United Kingdom
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, 420/6 Ratchawithi Rd., Ratchathewi District, Bangkok 10400, Thailand
| | - Muhammed Miran Öncel
- Beykoz Institute of Life Sciences and Biotechnology, Bezmialem Vakif University, Ali Ihsan Kalmaz St., No.10 Beykoz, Istanbul 34820, Türkiye
| | - Asif M Khan
- Centre for Bioinformatics, School of Data Sciences, Perdana University, MAEPS Building, Jalan MAEPS Perdana, Serdang, Kuala Lumpur 50490, Malaysia
- Beykoz Institute of Life Sciences and Biotechnology, Bezmialem Vakif University, Ali Ihsan Kalmaz St., No.10 Beykoz, Istanbul 34820, Türkiye
- College of Computing and Information Technology, University of Doha for Science and Technology, Jelaiah Street, Duhail North, Doha, Qatar
| |
Collapse
|
2
|
Cao T, Li Q, Huang Y, Li A. plotnineSeqSuite: a Python package for visualizing sequence data using ggplot2 style. BMC Genomics 2023; 24:585. [PMID: 37789265 PMCID: PMC10546746 DOI: 10.1186/s12864-023-09677-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 09/14/2023] [Indexed: 10/05/2023] Open
Abstract
BACKGROUND The visual sequence logo has been a hot area in the development of bioinformatics tools. ggseqlogo written in R language has been the most popular API since it was published. With the popularity of artificial intelligence and deep learning, Python is currently the most popular programming language. The programming language used by bioinformaticians began to shift to Python. Providing APIs in Python that are similar to those in R can reduce the learning cost of relearning a programming language. And compared to ggplot2 in R, drawing framework is not as easy to use in Python. The appearance of plotnine (ggplot2 in Python version) makes it possible to unify the programming methods of bioinformatics visualization tools between R and Python. RESULTS Here, we introduce plotnineSeqSuite, a new plotnine-based Python package provides a ggseqlogo-like API for programmatic drawing of sequence logos, sequence alignment diagrams and sequence histograms. To be more precise, it supports custom letters, color themes, and fonts. Moreover, the class for drawing layers is based on object-oriented design so that users can easily encapsulate and extend it. CONCLUSIONS plotnineSeqSuite is the first ggplot2-style package to implement visualization of sequence -related graphs in Python. It enhances the uniformity of programmatic plotting between R and Python. Compared with tools appeared already, the categories supported by plotnineSeqSuite are much more complete. The source code of plotnineSeqSuite can be obtained on GitHub ( https://github.com/caotianze/plotnineseqsuite ) and PyPI ( https://pypi.org/project/plotnineseqsuite ), and the documentation homepage is freely available on GitHub at ( https://caotianze.github.io/plotnineseqsuite/ ).
Collapse
Affiliation(s)
- Tianze Cao
- School of Mathematics, Hangzhou Normal University, Hangzhou, Zhejiang Province, China
| | - Qian Li
- Department of Rehabilitation, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei Province, China
| | - Yuexia Huang
- School of Mathematics, Hangzhou Normal University, Hangzhou, Zhejiang Province, China.
| | - Anshui Li
- Department of Statistics, Shaoxing University, Shaoxing, Zhejiang Province, China.
| |
Collapse
|
3
|
Lankipalli S, H S MS, Selvam D, Samanta D, Nair D, Ramagopal UA. Cryptic association of B7-2 molecules and its implication for clustering. Protein Sci 2021; 30:1958-1973. [PMID: 34191384 PMCID: PMC8376414 DOI: 10.1002/pro.4151] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Revised: 06/16/2021] [Accepted: 06/22/2021] [Indexed: 12/23/2022]
Abstract
T-cell co-stimulation through CD28/CTLA4:B7-1/B7-2 axis is one of the extensively studied pathways that resulted in the discovery of several FDA-approved drugs for autoimmunity and cancer. However, many aspects of the signaling mechanism remain elusive, including oligomeric association and clustering of B7-2 on the cell surface. Here, we describe the structure of the IgV domain of B7-2 and its cryptic association into 1D arrays that appear to represent the pre-signaling state of B7-2 on the cell membrane. Super-resolution microscopy experiments on heterologous cells expressing B7-2 and B7-1 suggest, B7-2 form relatively elongated and larger clusters compared to B7-1. The sequence and structural comparison of other B7 family members, B7-1:CTLA4 and B7-2:CTLA-4 complex structures, support our view that the observed B7-2 1D zipper array is physiologically important. This observed 1D zipper-like array also provides an explanation for its clustering, and upright orientation on the cell surface, and avoidance of spurious signaling.
Collapse
Affiliation(s)
- Swetha Lankipalli
- Biological Sciences DivisionPoornaprajna Institute of Scientific Research (PPISR)BengaluruIndia
- Manipal Academy of Higher EducationManipalKarnatakaIndia
| | | | - Deepak Selvam
- Jawaharlal Nehru Center for Advance Scientific ResearchBengaluruKarnatakaIndia
- National Institute for Research in TuberculosisChennaiIndia
| | - Dibyendu Samanta
- School of Bioscience, Sir J. C. Bose Laboratory ComplexIndian Institute of Technology KharagpurKharagpurIndia
| | - Deepak Nair
- Centre for NeuroscienceIndian Institute of ScienceBangaloreIndia
| | - Udupi A. Ramagopal
- Biological Sciences DivisionPoornaprajna Institute of Scientific Research (PPISR)BengaluruIndia
| |
Collapse
|
4
|
Tareen A, Kinney JB. Logomaker: beautiful sequence logos in Python. Bioinformatics 2020; 36:2272-2274. [PMID: 31821414 PMCID: PMC7141850 DOI: 10.1093/bioinformatics/btz921] [Citation(s) in RCA: 257] [Impact Index Per Article: 51.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Revised: 11/14/2019] [Accepted: 12/06/2019] [Indexed: 01/09/2023] Open
Abstract
Summary Sequence logos are visually compelling ways of illustrating the biological properties of DNA, RNA and protein sequences, yet it is currently difficult to generate and customize such logos within the Python programming environment. Here we introduce Logomaker, a Python API for creating publication-quality sequence logos. Logomaker can produce both standard and highly customized logos from either a matrix-like array of numbers or a multiple-sequence alignment. Logos are rendered as native matplotlib objects that are easy to stylize and incorporate into multi-panel figures. Availability and implementation Logomaker can be installed using the pip package manager and is compatible with both Python 2.7 and Python 3.6. Documentation is provided at http://logomaker.readthedocs.io; source code is available at http://github.com/jbkinney/logomaker.
Collapse
Affiliation(s)
- Ammar Tareen
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Justin B Kinney
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| |
Collapse
|
5
|
Liang PK, Zheng C, Xu XF, Zhao ZZ, Zhao CS, Li CH, Couvin D, Reynaud Y, Zozio T, Rastogi N, Sun Q. Local adaptive evolution of two distinct clades of Beijing and T families of Mycobacterium tuberculosis in Chongqing: a Bayesian population structure and phylogenetic study. Infect Dis Poverty 2020; 9:59. [PMID: 32487156 PMCID: PMC7268252 DOI: 10.1186/s40249-020-00674-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2020] [Accepted: 05/14/2020] [Indexed: 11/21/2022] Open
Abstract
Background Beijing sub-pedigree 2 (BSP2) and T sub-lineage 6 (TSL6) are two clades belonging to Beijing and T family of Mycobacterium tuberculosis (MTB), respectively, defined by Bayesian population structure analysis based on 24-loci mycobacterial interspersed repetitive unit-variable number of tandem repeats (MIRU-VNTR). Globally, over 99% of BSP2 and 89% of TSL6 isolates were distributed in Chongqing, suggesting their possible local adaptive evolution. The objective of this paper is to explore whether BSP2 and TSL6 originated by their local adaptive evolution from the specific isolates of Beijing and T families in Chongqing. Methods The genotyping data of 16 090 MTB isolates were collected from laboratory collection, published literatures and SITVIT database before subjected to Bayesian population structure analysis based on 24-loci MIRU-VNTR. Spacer Oligonucleotide Forest (Spoligoforest) and 24-loci MIRU-VNTR-based minimum spanning tree (MST) were used to explore their phylogenetic pathways, with Bayesian demographic analysis for exploring the recent demographic change of TSL6. Results Phylogenetic analysis suggested that BSP2 and TSL6 in Chongqing may evolve from BSP4 and TSL5, respectively, which were locally predominant in Tibet and Jiangsu, respectively. Spoligoforest showed that Beijing and T families were genetically distant, while the convergence of the MIRU-VNTR pattern of BSP2 and TSL6 was revealed by WebLogo. The demographic analysis concluded that the recent demographic change of TSL6 might take 111.25 years. Conclusions BSP2 and TSL6 clades might originate from BSP4 and TSL5, respectively, by their local adaptive evolution in Chongqing. Our study suggests MIRU-VNTR be combined with other robust markers for a more comprehensive genotyping approach, especially for families of clades with the same MIRU-VNTR pattern.
Collapse
Affiliation(s)
- Peng-Kuan Liang
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, 610065, People's Republic of China
| | - Chao Zheng
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, 610065, People's Republic of China.,Bacteriology & Antibacterial Resistance Surveillance Laboratory, Shenzhen Institute of Respiratory Disease, Shenzhen People's Hospital, Second Clinical Medical College of Jinan University, First Affiliated Hospital of SUSTC, Shenzhen, 518020, China.,Integrated Chinese and Western Medicine Postdoctoral Research Station, Jinan University, Guangzhou, 510632, China
| | - Xiao-Fang Xu
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, 610065, People's Republic of China
| | - Zhe-Ze Zhao
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, 610065, People's Republic of China.,School of Life Sciences and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China
| | - Chang-Song Zhao
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, 610065, People's Republic of China
| | - Chang-He Li
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, 610065, People's Republic of China
| | - David Couvin
- WHO Supranational TB Reference Laboratory, Unité de la Tuberculose et des Mycobactéries, Institut Pasteur de Guadeloupe, Abymes Cedex, Guadeloupe, France
| | - Yann Reynaud
- WHO Supranational TB Reference Laboratory, Unité de la Tuberculose et des Mycobactéries, Institut Pasteur de Guadeloupe, Abymes Cedex, Guadeloupe, France
| | - Thierry Zozio
- WHO Supranational TB Reference Laboratory, Unité de la Tuberculose et des Mycobactéries, Institut Pasteur de Guadeloupe, Abymes Cedex, Guadeloupe, France
| | - Nalin Rastogi
- WHO Supranational TB Reference Laboratory, Unité de la Tuberculose et des Mycobactéries, Institut Pasteur de Guadeloupe, Abymes Cedex, Guadeloupe, France
| | - Qun Sun
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, 610065, People's Republic of China.
| |
Collapse
|
6
|
Nerstedt A, Kurhe Y, Cansby E, Caputo M, Gao L, Vorontsov E, Ståhlman M, Nuñez-Durán E, Borén J, Marschall HU, Mashek DG, Saunders DN, Sihlbom C, Hoy AJ, Mahlapuu M. Lipid droplet-associated kinase STK25 regulates peroxisomal activity and metabolic stress response in steatotic liver. J Lipid Res 2019; 61:178-191. [PMID: 31857389 DOI: 10.1194/jlr.ra119000316] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Revised: 11/05/2019] [Indexed: 12/18/2022] Open
Abstract
Nonalcoholic fatty liver disease (NAFLD) and nonalcoholic steatohepatitis (NASH) are emerging as leading causes of liver disease worldwide and have been recognized as one of the major unmet medical needs of the 21st century. Our recent translational studies in mouse models, human cell lines, and well-characterized patient cohorts have identified serine/threonine kinase (STK)25 as a protein that coats intrahepatocellular lipid droplets (LDs) and critically regulates liver lipid homeostasis and progression of NAFLD/NASH. Here, we studied the mechanism-of-action of STK25 in steatotic liver by relative quantification of the hepatic LD-associated phosphoproteome from high-fat diet-fed Stk25 knockout mice compared with their wild-type littermates. We observed a total of 131 proteins and 60 phosphoproteins that were differentially represented in STK25-deficient livers. Most notably, a number of proteins involved in peroxisomal function, ubiquitination-mediated proteolysis, and antioxidant defense were coordinately regulated in Stk25 -/- versus wild-type livers. We confirmed attenuated peroxisomal biogenesis and protection against oxidative and ER stress in STK25-deficient human liver cells, demonstrating the hepatocyte-autonomous manner of STK25's action. In summary, our results suggest that regulation of peroxisomal function and metabolic stress response may be important molecular mechanisms by which STK25 controls the development and progression of NAFLD/NASH.
Collapse
Affiliation(s)
- Annika Nerstedt
- Departments of Chemistry and Molecular Biology University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Yeshwant Kurhe
- Departments of Chemistry and Molecular Biology University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Emmelie Cansby
- Departments of Chemistry and Molecular Biology University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Mara Caputo
- Departments of Chemistry and Molecular Biology University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Lei Gao
- Departments of Chemistry and Molecular Biology University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Egor Vorontsov
- Proteomics Core Facility, University of Gothenburg, Gothenburg, Sweden
| | - Marcus Ståhlman
- Molecular and Clinical Medicine/Wallenberg Laboratory, Institute of Medicine, University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Esther Nuñez-Durán
- Departments of Chemistry and Molecular Biology University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Jan Borén
- Molecular and Clinical Medicine/Wallenberg Laboratory, Institute of Medicine, University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Hanns-Ulrich Marschall
- Molecular and Clinical Medicine/Wallenberg Laboratory, Institute of Medicine, University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Douglas G Mashek
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, MN
| | - Darren N Saunders
- School of Medical Sciences, Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia
| | - Carina Sihlbom
- Proteomics Core Facility, University of Gothenburg, Gothenburg, Sweden
| | - Andrew J Hoy
- Discipline of Physiology, School of Medical Sciences, Charles Perkins Centre, University of Sydney, Sydney, NSW, Australia
| | - Margit Mahlapuu
- Departments of Chemistry and Molecular Biology University of Gothenburg and Sahlgrenska University Hospital, Gothenburg, Sweden
| |
Collapse
|
7
|
Peng G, Wilson R, Tang Y, Lam TT, Nairn AC, Williams K, Zhao H. ProteomicsBrowser: MS/proteomics data visualization and investigation. Bioinformatics 2019; 35:2313-2314. [PMID: 30462190 DOI: 10.1093/bioinformatics/bty958] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2018] [Revised: 10/26/2018] [Accepted: 11/20/2018] [Indexed: 11/12/2022] Open
Abstract
SUMMARY Large-scale, quantitative proteomics data are being generated at ever increasing rates by high-throughput, mass spectrometry technologies. However, due to the complexity of these large datasets as well as the increasing numbers of post-translational modifications (PTMs) that are being identified, developing effective methods for proteomic visualization has been challenging. ProteomicsBrowser was designed to meet this need for comprehensive data visualization. Using peptide information files exported from mass spectrometry search engines or quantitative tools as input, the peptide sequences are aligned to an internal protein database such as UniProtKB. Each identified peptide ion including those with PTMs is then visualized along the parent protein in the Browser. A unique property of ProteomicsBrowser is the ability to combine overlapping peptides in different ways to focus analysis of sequence coverage, charge state or PTMs. ProteomicsBrowser includes other useful functions, such as a data filtering tool and basic statistical analyses to qualify quantitative data. AVAILABILITY AND IMPLEMENTATION ProteomicsBrowser is implemented in Java8 and is available at https://medicine.yale.edu/keck/nida/proteomicsbrowser.aspx and https://github.com/peng-gang/ProteomicsBrowser. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Gang Peng
- Department of Biostatistics, School of Medicine, Yale University, New Haven, CT, USA.,Department of Genetics, School of Medicine, Yale University, New Haven, CT, USA
| | - Rashaun Wilson
- Department of Psychiatry, School of Medicine, Yale University, New Haven, CT, USA
| | - Yishuo Tang
- Department of Genetics, School of Medicine, Yale University, New Haven, CT, USA
| | - TuKiet T Lam
- Department of Molecular Biophysics and Biochemistry, School of Medicine, Yale University, New Haven, CT, USA.,MS & Proteomics Resource, W.M. Keck Biotechnology Resource Laboratory, School of Medicine, Yale University, New Haven, CT, USA
| | - Angus C Nairn
- Department of Psychiatry, School of Medicine, Yale University, New Haven, CT, USA
| | - Kenneth Williams
- Department of Molecular Biophysics and Biochemistry, School of Medicine, Yale University, New Haven, CT, USA
| | - Hongyu Zhao
- Department of Biostatistics, School of Medicine, Yale University, New Haven, CT, USA.,Department of Genetics, School of Medicine, Yale University, New Haven, CT, USA
| |
Collapse
|
8
|
Kalmykova SD, Arapidi GP, Urban AS, Osetrova MS, Gordeeva VD, Ivanov VT, Govorun VM. In Silico Analysis of Peptide Potential Biological Functions. RUSSIAN JOURNAL OF BIOORGANIC CHEMISTRY 2018. [DOI: 10.1134/s106816201804009x] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
9
|
Clonal analysis of Salmonella-specific effector T cells reveals serovar-specific and cross-reactive T cell responses. Nat Immunol 2018; 19:742-754. [PMID: 29925993 DOI: 10.1038/s41590-018-0133-z] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2017] [Accepted: 04/19/2018] [Indexed: 12/20/2022]
Abstract
To tackle the complexity of cross-reactive and pathogen-specific T cell responses against related Salmonella serovars, we used mass cytometry, unbiased single-cell cloning, live fluorescence barcoding, and T cell-receptor sequencing to reconstruct the Salmonella-specific repertoire of circulating effector CD4+ T cells, isolated from volunteers challenged with Salmonella enterica serovar Typhi (S. Typhi) or Salmonella Paratyphi A (S. Paratyphi). We describe the expansion of cross-reactive responses against distantly related Salmonella serovars and of clonotypes recognizing immunodominant antigens uniquely expressed by S. Typhi or S. Paratyphi A. In addition, single-amino acid variations in two immunodominant proteins, CdtB and PhoN, lead to the accumulation of T cells that do not cross-react against the different serovars, thus demonstrating how minor sequence variations in a complex microorganism shape the pathogen-specific T cell repertoire. Our results identify immune-dominant, serovar-specific, and cross-reactive T cell antigens, which should aid in the design of T cell-vaccination strategies against Salmonella.
Collapse
|
10
|
Singh T, Fakiola M, Oommen J, Singh AP, Singh AK, Smith N, Chakravarty J, Sundar S, Blackwell JM. Epitope-Binding Characteristics for Risk versus Protective DRB1 Alleles for Visceral Leishmaniasis. THE JOURNAL OF IMMUNOLOGY 2018; 200:2727-2737. [PMID: 29507109 DOI: 10.4049/jimmunol.1701764] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/26/2017] [Accepted: 02/08/2018] [Indexed: 11/19/2022]
Abstract
HLA-DRB1 is the major genetic risk factor for visceral leishmaniasis (VL). We used SNP2HLA to impute HLA-DRB1 alleles and SNPTEST to carry out association analyses in 889 human cases and 977 controls from India. NetMHCIIpan 2.1 was used to map epitopes and binding affinities across 49 Leishmania vaccine candidates, as well as across peptide epitopes captured from dendritic cells treated with crude Leishmania Ag and identified using mass spectrometry and alignment to amino acid sequences of a reference Leishmania genome. Cytokines were measured in peptide-stimulated whole blood from 26 cured VL cases and eight endemic healthy controls. HLA-DRB1*1501 and DRB1*1404/DRB1*1301 were the most significant protective and risk alleles, respectively, with specific residues at aa positions 11 and 13 unique to protective alleles. We observed greater peptide promiscuity in sequence motifs for 9-mer core epitopes predicted to bind to risk (*1404/*1301) compared with protective (*1501) DRB1 alleles. There was a higher frequency of basic amino acids in DRB1*1404/*1301-specific epitopes compared with hydrophobic and polar amino acids in DRB1*1501-specific epitopes at anchor residues pocket 4 and pocket 6, which interact with residues at DRB1 positions 11 and 13. Cured VL patients made variable, but robust, IFN-γ, TNF, and IL-10 responses to 20-mer peptides based on captured epitopes, with peptides based on DRB1*1501-captured epitopes resulting in a higher proportion (odds ratio 2.23, 95% confidence interval 1.17-4.25, p = 0.017) of patients with IFN-γ/IL-10 ratios > 2-fold compared with peptides based on DRB1*1301-captured epitopes. Our data provide insight into the molecular mechanisms underpinning the association of HLA-DRB1 alleles with risk versus protection in VL in humans.
Collapse
Affiliation(s)
- Toolika Singh
- Institute of Medical Sciences, Banaras Hindu University, Varanasi OS 221 005, India
| | - Michaela Fakiola
- Department of Pathology, University of Cambridge, Cambridge CB2 1QP, United Kingdom
| | - Joyce Oommen
- Telethon Kids Institute, The University of Western Australia, Subiaco, Western Australia 6008, Australia; and
| | - Akhil Pratap Singh
- Institute of Medical Sciences, Banaras Hindu University, Varanasi OS 221 005, India
| | - Abhishek K Singh
- Institute of Medical Sciences, Banaras Hindu University, Varanasi OS 221 005, India
| | - Noel Smith
- Lonza Biologics PLC, Great Abington, Cambridge CB21 6GS, United Kingdom
| | - Jaya Chakravarty
- Institute of Medical Sciences, Banaras Hindu University, Varanasi OS 221 005, India
| | - Shyam Sundar
- Institute of Medical Sciences, Banaras Hindu University, Varanasi OS 221 005, India
| | - Jenefer M Blackwell
- Department of Pathology, University of Cambridge, Cambridge CB2 1QP, United Kingdom; .,Telethon Kids Institute, The University of Western Australia, Subiaco, Western Australia 6008, Australia; and
| |
Collapse
|
11
|
Sommer R, Makshakova ON, Wohlschlager T, Hutin S, Marsh M, Titz A, Künzler M, Varrot A. Crystal Structures of Fungal Tectonin in Complex with O-Methylated Glycans Suggest Key Role in Innate Immune Defense. Structure 2018; 26:391-402.e4. [DOI: 10.1016/j.str.2018.01.003] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2017] [Revised: 11/24/2017] [Accepted: 01/05/2018] [Indexed: 12/18/2022]
|
12
|
Zheng C, Reynaud Y, Zhao C, Zozio T, Li S, Luo D, Sun Q, Rastogi N. New Mycobacterium tuberculosis Beijing clonal complexes in China revealed by phylogenetic and Bayesian population structure analyses of 24-loci MIRU-VNTRs. Sci Rep 2017; 7:6065. [PMID: 28729708 PMCID: PMC5519585 DOI: 10.1038/s41598-017-06346-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2016] [Accepted: 06/12/2017] [Indexed: 11/09/2022] Open
Abstract
Beijing lineage of Mycobacterium tuberculosis constitutes the most predominant lineage in East Asia. Beijing epidemiology, evolutionary history, genetics are studied in details for years revealing probable origin from China followed by worldwide expansion, partially linked to higher mutation rate, hypervirulence, drug-resistance, and association with cases of mixed infections. Considering huge amount of data available for 24-loci Mycobacterial Interspersed Repetitive Units-Variable Number of Tandem Repeats, we performed detailed phylogenetic and Bayesian population structure analyses of Beijing lineage strains in mainland China and Taiwan using available 24-loci MIRU-VNTR data extracted from publications or the SITVIT2 database (n = 1490). Results on genetic structuration were compared to previously published data. A total of three new Beijing clonal complexes tentatively named BSP1, BPS2 and BSP3 were revealed with surprising phylogeographical specificities to previously unstudied regions in Sichuan, Chongqing and Taiwan, proving the need for continued investigations with extended datasets. Such geographical restriction could correspond to local adaptation of these “ecological specialist” Beijing isolates to local human host populations in contrast with “generalist pathogens” able to adapt to several human populations and to spread worldwide.
Collapse
Affiliation(s)
- Chao Zheng
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, 610065, PR China.,WHO Supranational TB Reference Laboratory, Tuberculosis and Mycobacteria Unit, Institut Pasteur de la Guadeloupe, Morne Jolivière, 97183, Abymes, Guadeloupe, France
| | - Yann Reynaud
- WHO Supranational TB Reference Laboratory, Tuberculosis and Mycobacteria Unit, Institut Pasteur de la Guadeloupe, Morne Jolivière, 97183, Abymes, Guadeloupe, France.
| | - Changsong Zhao
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, 610065, PR China
| | - Thierry Zozio
- WHO Supranational TB Reference Laboratory, Tuberculosis and Mycobacteria Unit, Institut Pasteur de la Guadeloupe, Morne Jolivière, 97183, Abymes, Guadeloupe, France
| | - Song Li
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, 610065, PR China
| | - Dongxia Luo
- Public Health Clinical Center of Chengdu, Chengdu, Sichuan, 610000, PR China
| | - Qun Sun
- Key Laboratory of Bio-resources and Eco-environment of the Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, Sichuan, 610065, PR China.
| | - Nalin Rastogi
- WHO Supranational TB Reference Laboratory, Tuberculosis and Mycobacteria Unit, Institut Pasteur de la Guadeloupe, Morne Jolivière, 97183, Abymes, Guadeloupe, France.
| |
Collapse
|
13
|
Reynaud Y, Zheng C, Wu G, Sun Q, Rastogi N. Bayesian population structure analysis reveals presence of phylogeographically specific sublineages within previously ill-defined T group of Mycobacterium tuberculosis. PLoS One 2017; 12:e0171584. [PMID: 28166309 PMCID: PMC5293260 DOI: 10.1371/journal.pone.0171584] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2016] [Accepted: 01/22/2017] [Indexed: 11/23/2022] Open
Abstract
Mycobacterium tuberculosis genetic structure, and evolutionary history have been studied for years by several genotyping approaches, but delineation of a few sublineages remains controversial and needs better characterization. This is particularly the case of T group within lineage 4 (L4) which was first described using spoligotyping to pool together a number of strains with ill-defined signatures. Although T strains were not traditionally considered as a real phylogenetic group, they did contain a few phylogenetically meaningful sublineages as shown using SNPs. We therefore decided to investigate if this observation could be corroborated using other robust genetic markers. We consequently made a first assessment of genetic structure using 24-loci MIRU-VNTRs data extracted from the SITVIT2 database (n = 607 clinical isolates collected in Russia, Albania, Turkey, Iraq, Brazil and China). Combining Minimum Spanning Trees and Bayesian population structure analyses (using STRUCTURE and TESS softwares), we distinctly identified eight tentative phylogenetic groups (T1-T8) with a remarkable correlation with geographical origin. We further compared the present structure observed with other L4 sublineages (n = 416 clinical isolates belonging to LAM, Haarlem, X, S sublineages), and showed that 5 out of 8 T groups seemed phylogeographically well-defined as opposed to the remaining 3 groups that partially mixed with other L4 isolates. These results provide with novel evidence about phylogeographically specificity of a proportion of ill-defined T group of M. tuberculosis. The genetic structure observed will now be further validated on an enlarged worldwide dataset using Whole Genome Sequencing (WGS).
Collapse
Affiliation(s)
- Yann Reynaud
- WHO Supranational TB Reference Laboratory, Tuberculosis and Mycobacteria Unit, Institut Pasteur de la Guadeloupe, Morne Jolivière Abymes, Guadeloupe, France
| | - Chao Zheng
- WHO Supranational TB Reference Laboratory, Tuberculosis and Mycobacteria Unit, Institut Pasteur de la Guadeloupe, Morne Jolivière Abymes, Guadeloupe, France
- College of Life Sciences, Sichuan University, Chengdu, Sichuan, China
| | - Guihui Wu
- Chengdu Public Health Clinical Center, Chengdu, Sichuan, China
| | - Qun Sun
- College of Life Sciences, Sichuan University, Chengdu, Sichuan, China
| | - Nalin Rastogi
- WHO Supranational TB Reference Laboratory, Tuberculosis and Mycobacteria Unit, Institut Pasteur de la Guadeloupe, Morne Jolivière Abymes, Guadeloupe, France
| |
Collapse
|
14
|
Olsen LR, Simon C, Kudahl UJ, Bagger FO, Winther O, Reinherz EL, Zhang GL, Brusic V. A computational method for identification of vaccine targets from protein regions of conserved human leukocyte antigen binding. BMC Med Genomics 2015; 8 Suppl 4:S1. [PMID: 26679766 PMCID: PMC4682376 DOI: 10.1186/1755-8794-8-s4-s1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background Computational methods for T cell-based vaccine target discovery focus on selection of highly conserved peptides identified across pathogen variants, followed by prediction of their binding of human leukocyte antigen molecules. However, experimental studies have shown that T cells often target diverse regions in highly variable viral pathogens and this diversity may need to be addressed through redefinition of suitable peptide targets. Methods We have developed a method for antigen assessment and target selection for polyvalent vaccines, with which we identified immune epitopes from variable regions, where all variants bind HLA. These regions, although variable, can thus be considered stable in terms of HLA binding and represent valuable vaccine targets. Results We applied this method to predict CD8+ T-cell targets in influenza A H7N9 hemagglutinin and significantly increased the number of potential vaccine targets compared to the number of targets discovered using the traditional approach where low-frequency peptides are excluded. Conclusions We developed a webserver with an intuitive visualization scheme for summarizing the T cell-based antigenic potential of any given protein or proteome using human leukocyte antigen binding predictions and made a web-accessible software implementation freely available at http://met-hilab.cbs.dtu.dk/blockcons/.
Collapse
|
15
|
Sun J, Brusic V. A systematic analysis of a broadly neutralizing antibody AR3C epitopes on Hepatitis C virus E2 envelope glycoprotein and their cross-reactivity. BMC Med Genomics 2015; 8 Suppl 4:S6. [PMID: 26681161 PMCID: PMC4682370 DOI: 10.1186/1755-8794-8-s4-s6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Background Hepatitis C virus (HCV) belongs to Flaviviridae family of viruses. HCV represents a major challenge to public health since its estimated global prevalence is 2.8% of the world's human population. The design and development of HCV vaccine has been hampered by rapid evolution of viral quasispecies resulting in antibody escape variants. HCV envelope glycoprotein E1 and E2 that mediate fusion and entry of the virus into host cells are primary targets of the host immune responses. Results Structural characterization of E2 core protein and a broadly neutralizing antibody AR3C together with E1E2 sequence information enabled the analysis of B-cell epitope variability. The E2 binding site by AR3C and its surrounding area were identified from the crystal structure of E2c-AR3C complex. We clustered HCV strains using the concept of "discontinuous motif/peptide" and classified B-cell epitopes based on their similarity. Conclusions The assessment of antibody neutralizing coverage provides insights into potential cross-reactivity of the AR3C neutralizing antibody across a large number of HCV variants.
Collapse
|
16
|
Sobhy H. Shetti, a simple tool to parse, manipulate and search large datasets of sequences. Microb Genom 2015; 1:e000035. [PMID: 28348820 PMCID: PMC5320677 DOI: 10.1099/mgen.0.000035] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 09/21/2015] [Indexed: 11/20/2022] Open
Abstract
Parsing and manipulating long and/or multiple protein or gene sequences can be a challenging process for experimental biologists and microbiologists lacking prior knowledge of bioinformatics and programming. Here we present a simple, easy, user-friendly and versatile tool to parse, manipulate and search within large datasets of long and multiple protein or gene sequences. The Shetti tool can be used to search for a sequence, species, protein/gene or pattern/motif. Moreover, it can also be used to construct a universal consensus or molecular signatures for proteins based on their physical characteristics. Shetti is an efficient and fast tool that can deal with large sets of long sequences efficiently. Shetti parses UniProt Knowledgebase and NCBI GenBank flat files and visualizes them as a table.
Collapse
Affiliation(s)
- Haitham Sobhy
- Dalian Institute of Chemical Physics, CAS, Dalian, PR China
| |
Collapse
|
17
|
Simon C, Kudahl UJ, Sun J, Olsen LR, Zhang GL, Reinherz EL, Brusic V. FluKB: A Knowledge-Based System for Influenza Vaccine Target Discovery and Analysis of the Immunological Properties of Influenza Viruses. J Immunol Res 2015; 2015:380975. [PMID: 26504853 PMCID: PMC4609449 DOI: 10.1155/2015/380975] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2015] [Accepted: 03/12/2015] [Indexed: 01/01/2023] Open
Abstract
FluKB is a knowledge-based system focusing on data and analytical tools for influenza vaccine discovery. The main goal of FluKB is to provide access to curated influenza sequence and epitope data and enhance the analysis of influenza sequence diversity and the analysis of targets of immune responses. FluKB consists of more than 400,000 influenza protein sequences, known epitope data (357 verified T-cell epitopes, 685 HLA binders, and 16 naturally processed MHC ligands), and a collection of 28 influenza antibodies and their structurally defined B-cell epitopes. FluKB was built using a modular framework allowing the implementation of analytical workflows and includes standard search tools, such as keyword search and sequence similarity queries, as well as advanced tools for the analysis of sequence variability. The advanced analytical tools for vaccine discovery include visual mapping of T- and B-cell vaccine targets and assessment of neutralizing antibody coverage. FluKB supports the discovery of vaccine targets and the analysis of viral diversity and its implications for vaccine discovery as well as potential T-cell breadth and antibody cross neutralization involving multiple strains. FluKB is representation of a new generation of databases that integrates data, analytical tools, and analytical workflows that enable comprehensive analysis and automatic generation of analysis reports.
Collapse
Affiliation(s)
- Christian Simon
- Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, 2800 Lyngby, Denmark
- Department of Disease Systems Biology, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Ulrich J. Kudahl
- Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, 2800 Lyngby, Denmark
- Cancer Vaccine Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Jing Sun
- Cancer Vaccine Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Lars Rønn Olsen
- Cancer Vaccine Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02115, USA
- Bioinformatics Centre, Department of Biology, University of Copenhagen, 1017 Copenhagen, Denmark
| | - Guang Lan Zhang
- Cancer Vaccine Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02115, USA
- Department of Computer Science, Metropolitan College, Boston University, Boston, MA 02215, USA
- Department of Medicine, Harvard Medical School, Boston, MA 02115, USA
| | - Ellis L. Reinherz
- Cancer Vaccine Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02115, USA
- Department of Medicine, Harvard Medical School, Boston, MA 02115, USA
- Laboratory of Immunobiology, Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA 02115, USA
| | - Vladimir Brusic
- Cancer Vaccine Center, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02115, USA
- Department of Computer Science, Metropolitan College, Boston University, Boston, MA 02215, USA
- Department of Medicine, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
18
|
Anderson D, Fakiola M, Hales BJ, Pennell CE, Thomas WR, Blackwell JM. Genome-wide association study of IgG1 responses to the choline-binding protein PspC of Streptococcus pneumoniae. Genes Immun 2015; 16:289-96. [PMID: 25928883 DOI: 10.1038/gene.2015.12] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2014] [Revised: 02/12/2015] [Accepted: 03/04/2015] [Indexed: 11/09/2022]
Abstract
Streptococcus pneumoniae causes invasive pneumococcal disease. Delayed development of antibodies to S. pneumoniae in infancy is associated with the development of atopy and asthma. Pneumococcal surface protein C (PspC) is a vaccine candidate and variation in its choline-binding region is associated with invasive disease. This study examined 523 060 single-nucleotide polymorphisms in The Western Australian Pregnancy Cohort (Raine) Study to find loci influencing immunoglobulin G1 (IgG1) responses to PspC measured at age 14 years (n=1152). Genome-wide significance (top SNP rs9275596; P=3.1 × 10(-14)) was only observed at human leucocyte antigen (HLA). Imputed HLA amino-acid polymorphisms showed the strongest associations at positions DRB1 47 (P=3.2 × 10(-11)), 13SRG (P=9.8 × 10(-10)) and 11SP (P=9.8 × 10(-10)), and at DQA1 34 (P=6.4 × 10(-10)), DQB1 167R (P=9.3 × 10(-6)) and HLA-B 95 W (P=1.2 × 10(-9)). Conditional analyses showed independent contributions from DRB1 47 and DQB1 167R to the signal at rs9275596, supported by an omnibus test showing a strong signal for the haplotype DRB1_47_DQB1_167 (P=9.02 × 10(-15)). In silico analysis showed that DRB1 four-digit allele groups defined by DRB1 47F bind to a greater complexity of core 9-mer epitopes compared with DRB1 47Y, especially across repeats in the C-term choline-binding region. Consequent differences in CD4 T-cell help for IgG1 to PspC could have implications for vaccine design. Further analysis in other cohorts is merited.
Collapse
Affiliation(s)
- D Anderson
- Telethon Kids Institute, The University of Western Australia, Subiaco, Western Australia, Australia
| | - M Fakiola
- Department of Pathology, University of Cambridge, Cambridge, UK
| | - B J Hales
- Telethon Kids Institute, The University of Western Australia, Subiaco, Western Australia, Australia
| | - C E Pennell
- School of Women's and Infants' Health, University of Western Australia, Perth, Australia
| | - W R Thomas
- Telethon Kids Institute, The University of Western Australia, Subiaco, Western Australia, Australia
| | - J M Blackwell
- Telethon Kids Institute, The University of Western Australia, Subiaco, Western Australia, Australia
| |
Collapse
|
19
|
Dolan PT, Roth AP, Xue B, Sun R, Dunker AK, Uversky VN, LaCount DJ. Intrinsic disorder mediates hepatitis C virus core-host cell protein interactions. Protein Sci 2014; 24:221-35. [PMID: 25424537 DOI: 10.1002/pro.2608] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2014] [Accepted: 11/19/2014] [Indexed: 12/18/2022]
Abstract
Viral proteins bind to numerous cellular and viral proteins throughout the infection cycle. However, the mechanisms by which viral proteins interact with such large numbers of factors remain unknown. Cellular proteins that interact with multiple, distinct partners often do so through short sequences known as molecular recognition features (MoRFs) embedded within intrinsically disordered regions (IDRs). In this study, we report the first evidence that MoRFs in viral proteins play a similar role in targeting the host cell. Using a combination of evolutionary modeling, protein-protein interaction analyses and forward genetic screening, we systematically investigated two computationally predicted MoRFs within the N-terminal IDR of the hepatitis C virus (HCV) Core protein. Sequence analysis of the MoRFs showed their conservation across all HCV genotypes and the canine and equine Hepaciviruses. Phylogenetic modeling indicated that the Core MoRFs are under stronger purifying selection than the surrounding sequence, suggesting that these modules have a biological function. Using the yeast two-hybrid assay, we identified three cellular binding partners for each HCV Core MoRF, including two previously characterized cellular targets of HCV Core (DDX3X and NPM1). Random and site-directed mutagenesis demonstrated that the predicted MoRF regions were required for binding to the cellular proteins, but that different residues within each MoRF were critical for binding to different partners. This study demonstrated that viruses may use intrinsic disorder to target multiple cellular proteins with the same amino acid sequence and provides a framework for characterizing the binding partners of other disordered regions in viral and cellular proteomes.
Collapse
Affiliation(s)
- Patrick T Dolan
- Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West Lafayette, Indiana, 47907
| | | | | | | | | | | | | |
Collapse
|
20
|
Olsen LR, Campos B, Barnkob MS, Winther O, Brusic V, Andersen MH. Bioinformatics for cancer immunotherapy target discovery. Cancer Immunol Immunother 2014; 63:1235-49. [PMID: 25344903 PMCID: PMC11029190 DOI: 10.1007/s00262-014-1627-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2013] [Accepted: 10/08/2014] [Indexed: 12/13/2022]
Abstract
The mechanisms of immune response to cancer have been studied extensively and great effort has been invested into harnessing the therapeutic potential of the immune system. Immunotherapies have seen significant advances in the past 20 years, but the full potential of protective and therapeutic cancer immunotherapies has yet to be fulfilled. The insufficient efficacy of existing treatments can be attributed to a number of biological and technical issues. In this review, we detail the current limitations of immunotherapy target selection and design, and review computational methods to streamline therapy target discovery in a bioinformatics analysis pipeline. We describe specialized bioinformatics tools and databases for three main bottlenecks in immunotherapy target discovery: the cataloging of potentially antigenic proteins, the identification of potential HLA binders, and the selection epitopes and co-targets for single-epitope and multi-epitope strategies. We provide examples of application to the well-known tumor antigen HER2 and suggest bioinformatics methods to ameliorate therapy resistance and ensure efficient and lasting control of tumors.
Collapse
Affiliation(s)
- Lars Rønn Olsen
- Department of Biology, Bioinformatics Centre, University of Copenhagen, Ole Maaløes Vej 5, 2200, Copenhagen, Denmark,
| | | | | | | | | | | |
Collapse
|
21
|
Big data analytics in immunology: a knowledge-based approach. BIOMED RESEARCH INTERNATIONAL 2014; 2014:437987. [PMID: 25045677 PMCID: PMC4090507 DOI: 10.1155/2014/437987] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/31/2014] [Accepted: 05/07/2014] [Indexed: 01/27/2023]
Abstract
With the vast amount of immunological data available, immunology research is entering the big data era. These data vary in granularity, quality, and complexity and are stored in various formats, including publications, technical reports, and databases. The challenge is to make the transition from data to actionable knowledge and wisdom and bridge the knowledge gap and application gap. We report a knowledge-based approach based on a framework called KB-builder that facilitates data mining by enabling fast development and deployment of web-accessible immunological data knowledge warehouses. Immunological knowledge discovery relies heavily on both the availability of accurate, up-to-date, and well-organized data and the proper analytics tools. We propose the use of knowledge-based approaches by developing knowledgebases combining well-annotated data with specialized analytical tools and integrating them into analytical workflow. A set of well-defined workflow types with rich summarization and visualization capacity facilitates the transformation from data to critical information and knowledge. By using KB-builder, we enabled streamlining of normally time-consuming processes of database development. The knowledgebases built using KB-builder will speed up rational vaccine design by providing accurate and well-annotated data coupled with tailored computational analysis tools and workflow.
Collapse
|
22
|
Sun J, Kudahl UJ, Simon C, Cao Z, Reinherz EL, Brusic V. Large-scale analysis of B-cell epitopes on influenza virus hemagglutinin - implications for cross-reactivity of neutralizing antibodies. Front Immunol 2014; 5:38. [PMID: 24570677 PMCID: PMC3916768 DOI: 10.3389/fimmu.2014.00038] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2013] [Accepted: 01/22/2014] [Indexed: 11/13/2022] Open
Abstract
Influenza viruses continue to cause substantial morbidity and mortality worldwide. Fast gene mutation on surface proteins of influenza virus result in increasing resistance to current vaccines and available antiviral drugs. Broadly neutralizing antibodies (bnAbs) represent targets for prophylactic and therapeutic treatments of influenza. We performed a systematic bioinformatics study of cross-reactivity of neutralizing antibodies (nAbs) against influenza virus surface glycoprotein hemagglutinin (HA). This study utilized the available crystal structures of HA complexed with the antibodies for the analysis of tens of thousands of HA sequences. The detailed description of B-cell epitopes, measurement of epitope area similarity among different strains, and estimation of antibody neutralizing coverage provide insights into cross-reactivity status of existing nAbs against influenza virus. We have developed a method to assess the likely cross-reactivity potential of bnAbs for influenza strains, either newly emerged or existing. Our method catalogs influenza strains by a new concept named discontinuous peptide, and then provide assessment of cross-reactivity. Potentially cross-reactive strains are those that share 100% identity with experimentally verified neutralized strains. By cataloging influenza strains and their B-cell epitopes for known bnAbs, our method provides guidance for selection of representative strains for further experimental design. The knowledge of sequences, their B-cell epitopes, and differences between historical influenza strains, we enhance our preparedness and the ability to respond to the emerging pandemic threats.
Collapse
Affiliation(s)
- Jing Sun
- Cancer Vaccine Center, Dana-Farber Cancer Institute, Harvard Medical School , Boston, MA , USA ; Department of Medicine, Harvard Medical School , Boston, MA , USA
| | - Ulrich J Kudahl
- Cancer Vaccine Center, Dana-Farber Cancer Institute, Harvard Medical School , Boston, MA , USA ; Center for Biological Sequence Analysis, Technical University of Denmark , Lyngby , Denmark
| | - Christian Simon
- Cancer Vaccine Center, Dana-Farber Cancer Institute, Harvard Medical School , Boston, MA , USA ; Center for Biological Sequence Analysis, Technical University of Denmark , Lyngby , Denmark
| | - Zhiwei Cao
- School of Life Sciences and Technology, Tongji University , Shanghai , China
| | - Ellis L Reinherz
- Cancer Vaccine Center, Dana-Farber Cancer Institute, Harvard Medical School , Boston, MA , USA ; Department of Medicine, Harvard Medical School , Boston, MA , USA ; Laboratory of Immunobiology, Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School , Boston, MA , USA
| | - Vladimir Brusic
- Cancer Vaccine Center, Dana-Farber Cancer Institute, Harvard Medical School , Boston, MA , USA ; Department of Medicine, Harvard Medical School , Boston, MA , USA
| |
Collapse
|