1
|
Balachandran K, Ramli R, Karsani SA, Abdul Rahman M. Identification of Potential Biomarkers and Small Molecule Drugs for Bisphosphonate-Related Osteonecrosis of the Jaw (BRONJ): An Integrated Bioinformatics Study Using Big Data. Int J Mol Sci 2023; 24:ijms24108635. [PMID: 37239981 DOI: 10.3390/ijms24108635] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 04/18/2023] [Accepted: 04/25/2023] [Indexed: 05/28/2023] Open
Abstract
This study aimed to identify potential molecular mechanisms and therapeutic targets for bisphosphonate-related osteonecrosis of the jaw (BRONJ), a rare but serious side effect of bisphosphonate therapy. This study analyzed a microarray dataset (GSE7116) of multiple myeloma patients with BRONJ (n = 11) and controls (n = 10), and performed gene ontology, a pathway enrichment analysis, and a protein-protein interaction network analysis. A total of 1481 differentially expressed genes were identified, including 381 upregulated and 1100 downregulated genes, with enriched functions and pathways related to apoptosis, RNA splicing, signaling pathways, and lipid metabolism. Seven hub genes (FN1, TNF, JUN, STAT3, ACTB, GAPDH, and PTPRC) were also identified using the cytoHubba plugin in Cytoscape. This study further screened small-molecule drugs using CMap and verified the results using molecular docking methods. This study identified 3-(5-(4-(Cyclopentyloxy)-2-hydroxybenzoyl)-2-((3-hydroxybenzo[d]isoxazol-6-yl) methoxy) phenyl) propanoic acid as a potential drug treatment and prognostic marker for BRONJ. The findings of this study provide reliable molecular insight for biomarker validation and potential drug development for the screening, diagnosis, and treatment of BRONJ. Further research is needed to validate these findings and develop an effective biomarker for BRONJ.
Collapse
Affiliation(s)
- Kumarendran Balachandran
- Department of Craniofacial Diagnostics and Biosciences, Faculty of Dentistry, University Kebangsaan Malaysia, Jalan Raja Muda Abdul Aziz, Kuala Lumpur 50300, Malaysia
| | - Roszalina Ramli
- Department of Oral and Maxillofacial Surgery, Faculty of Dentistry, Universiti Kebangsaan Malaysia, Kuala Lumpur 50300, Malaysia
| | - Saiful Anuar Karsani
- Institute of Biological Sciences, Faculty of Science, Universiti Malaya, Kuala Lumpur 50603, Malaysia
| | - Mariati Abdul Rahman
- Department of Craniofacial Diagnostics and Biosciences, Faculty of Dentistry, University Kebangsaan Malaysia, Jalan Raja Muda Abdul Aziz, Kuala Lumpur 50300, Malaysia
| |
Collapse
|
2
|
Chicco D, Jurman G. Ten simple rules for providing bioinformatics support within a hospital. BioData Min 2023; 16:6. [PMID: 36823520 PMCID: PMC9948383 DOI: 10.1186/s13040-023-00326-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Accepted: 02/17/2023] [Indexed: 02/25/2023] Open
Abstract
Bioinformatics has become a key aspect of the biomedical research programmes of many hospitals' scientific centres, and the establishment of bioinformatics facilities within hospitals has become a common practice worldwide. Bioinformaticians working in these facilities provide computational biology support to medical doctors and principal investigators who are daily dealing with data of patients to analyze. These bioinformatics analysts, although pivotal, usually do not receive formal training for this job. We therefore propose these ten simple rules to guide these bioinformaticians in their work: ten pieces of advice on how to provide bioinformatics support to medical doctors in hospitals. We believe these simple rules can help bioinformatics facility analysts in producing better scientific results and work in a serene and fruitful environment.
Collapse
Affiliation(s)
- Davide Chicco
- Institute of Health Policy Management and Evaluation, University of Toronto, 155 College Street, M5T 3M7 Toronto, Ontario Canada
| | - Giuseppe Jurman
- Data Science for Health Unit, Fondazione Bruno Kessler, Via Sommarive 18, 38123 Povo, Trento, Italy
| |
Collapse
|
3
|
Jahantigh HR, Ahmadi N, Shahbazi B, Lovreglio P, Habibi M, Stufano A, Gouklani H, Ahmadi K. Evaluation of the dual effects of antiviral drugs on SARS-CoV-2 receptors and the ACE2 receptor using structure-based virtual screening and molecular dynamics simulation. J Biomol Struct Dyn 2022:1-23. [DOI: 10.1080/07391102.2022.2103735] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Affiliation(s)
- Hamid Reza Jahantigh
- Interdisciplinary Department of Medicine - Section of Occupational Medicine, University of Bari, Bari, Italy
- Animal Health and Zoonosis PhD Course, Department of Veterinary Medicine, University of Bari, Bari, Italy
| | - Nahid Ahmadi
- Department of Pharmaceutical Chemistry, School of Pharmacy, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Behzad Shahbazi
- Molecular Medicine Department, Biotechnology Research Center, Pasteur Institute of Iran, Tehran, Iran
| | - Piero Lovreglio
- Interdisciplinary Department of Medicine - Section of Occupational Medicine, University of Bari, Bari, Italy
| | - Mehri Habibi
- Department of Molecular Biology, Pasteur Institute of Iran, Tehran, Iran
| | - Angela Stufano
- Interdisciplinary Department of Medicine - Section of Occupational Medicine, University of Bari, Bari, Italy
- Animal Health and Zoonosis PhD Course, Department of Veterinary Medicine, University of Bari, Bari, Italy
| | - Hamed Gouklani
- Infectious and Tropical Diseases Research Center, Hormozgan Health Institute, Hormozgan University of Medical Sciences, Bandar Abbas, Iran
| | - Khadijeh Ahmadi
- Infectious and Tropical Diseases Research Center, Hormozgan Health Institute, Hormozgan University of Medical Sciences, Bandar Abbas, Iran
| |
Collapse
|
4
|
Azzahra SNA, Hanif N, Hermawan A. MDM2 is a Potential Target Gene of Glycyrrhizic Acid for Circumventing Breast Cancer Resistance to Tamoxifen: Integrative Bioinformatics Analysis. Asian Pac J Cancer Prev 2022; 23:2341-2350. [PMID: 35901340 PMCID: PMC9727350 DOI: 10.31557/apjcp.2022.23.7.2341] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Accepted: 07/18/2022] [Indexed: 06/15/2023] Open
Abstract
BACKGROUND Tamoxifen is the drug of choice for treating breast cancer, particularly the estrogen receptor-positive luminal A subtype. However, the increased occurrence of Tamoxifen resistance highlights the need to develop an agent to enhance the effectiveness of this drug. OBJECTIVE Although glycyrrhizic acid (GA) is known to exhibit cytotoxic effects on Michigan Cancer Foundation-7 cells, the specific gene targets and pathways it employs to overcome Tamoxifen resistance are incompletely understood. Therefore, the goal of the present research is to discover the potential targets and pathways of GA by using a bioinformatics approach. METHODS Differentially expressed genes (DEGs) were identified in the Gene Expression Omnibus NCBI database using microarray data from GSE67916 and GSE85871. Further analyses were performed on these DEGs by using DAVID v6.8, STRING-DB v11.0, and Cytoscape v3.8.0. Analysis of gene alterations was performed using cBioPortal for target validation, and the relevant interaction process was examined via the molecular docking method. RESULTS Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses identified the PI3K-AKT signaling as the potential target mechanism. Construction of the protein-protein interaction network and analysis of hub genes identified the top 25 hub genes. Genetic alterations were observed in six potential target genes, such as CDK2, MDM2, NF1, SMAD3, PTPN11, and CALM1. Molecular docking analysis demonstrated that the docking score of GA is lower than that of the native ligand of p53. More importantly, 3n the PI3K-AKT signaling pathway is a potential target for overcoming Tamoxifen resistance in breast cancer. CONCLUSION MDM2 may be a potential gene target of GA and the PI3K-AKT signaling may be a prospective mechanism for overcoming Tamoxifen resistance in breast cancer cells. Additional research is required to validate the findings of this study.
Collapse
Affiliation(s)
- Salma Nur Azizah Azzahra
- Laboratory of Macromolecular Engineering, Department of Pharmaceutical Chemistry, Faculty of Pharmacy, Universitas Gadjah Mada Sekip Utara II, 55281 Yogyakarta, Indonesia.
| | - Naufa Hanif
- Cancer Chemoprevention Research Center, Faculty of Pharmacy, Universitas Gadjah Mada Sekip Utara II, 55281 Yogyakarta, Indonesia.
| | - Adam Hermawan
- Laboratory of Macromolecular Engineering, Department of Pharmaceutical Chemistry, Faculty of Pharmacy, Universitas Gadjah Mada Sekip Utara II, 55281 Yogyakarta, Indonesia.
- Cancer Chemoprevention Research Center, Faculty of Pharmacy, Universitas Gadjah Mada Sekip Utara II, 55281 Yogyakarta, Indonesia.
| |
Collapse
|
5
|
Tantoso E, Wong WC, Tay WH, Lee J, Sinha S, Eisenhaber B, Eisenhaber F. Hypocrisy Around Medical Patient Data: Issues of Access for Biomedical Research, Data Quality, Usefulness for the Purpose and Omics Data as Game Changer. Asian Bioeth Rev 2019; 11:189-207. [PMID: 33717311 PMCID: PMC7747340 DOI: 10.1007/s41649-019-00085-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Revised: 04/23/2019] [Accepted: 04/30/2019] [Indexed: 11/14/2022] Open
Abstract
Whether due to simplicity or hypocrisy, the question of access to patient data for biomedical research is widely seen in the public discourse only from the angle of patient privacy. At the same time, the desire to live and to live without disability is of much higher value to the patients. This goal can only be achieved by extracting research insight from patient data in addition to working on model organisms, something that is well understood by many patients. Yet, most biomedical researchers working outside of clinics and hospitals are denied access to patient records when, at the same time, clinicians who guard the patient data are not optimally prepared for the data’s analysis. Medical data collection is a time- and cost-intensive process that is most of all tedious, with few elements of intellectual and emotional satisfaction on its own. In this process, clinicians and bioinformaticians, each group with their own interests, have to join forces with the goal to generate medical data sets both from clinical trials and from routinely collected electronic health records that are, as much as possible, free from errors and obvious inconsistencies. The data cleansing effort as we have learned during curation of Singaporean clinical trial data is not a trivial task. The introduction of omics and sophisticated imaging modalities into clinical practice that are only partially interpreted in terms of diagnosis and therapy with today’s level of knowledge warrant the creation of clinical databases with full patient history. This opens up opportunities for re-analyses and cross-trial studies at future time points with more sophisticated analyses of the same data, the collection of which is very expensive.
Collapse
Affiliation(s)
- Erwin Tantoso
- Bioinformatics Institute (BII), Agency for Science and Technology (ASTAR), 30 Biopolis Street, #07-01, Matrix, Singapore, 138671 Singapore
| | - Wing-Cheong Wong
- Bioinformatics Institute (BII), Agency for Science and Technology (ASTAR), 30 Biopolis Street, #07-01, Matrix, Singapore, 138671 Singapore
| | - Wei Hong Tay
- Bioinformatics Institute (BII), Agency for Science and Technology (ASTAR), 30 Biopolis Street, #07-01, Matrix, Singapore, 138671 Singapore
| | - Joanne Lee
- Bioinformatics Institute (BII), Agency for Science and Technology (ASTAR), 30 Biopolis Street, #07-01, Matrix, Singapore, 138671 Singapore
| | - Swati Sinha
- Bioinformatics Institute (BII), Agency for Science and Technology (ASTAR), 30 Biopolis Street, #07-01, Matrix, Singapore, 138671 Singapore
| | - Birgit Eisenhaber
- Bioinformatics Institute (BII), Agency for Science and Technology (ASTAR), 30 Biopolis Street, #07-01, Matrix, Singapore, 138671 Singapore
| | - Frank Eisenhaber
- Bioinformatics Institute (BII), Agency for Science and Technology (ASTAR), 30 Biopolis Street, #07-01, Matrix, Singapore, 138671 Singapore.,School of Computer Science and Engineering (SCSE), Nanyang Technological University (NTU), 50 Nanyang Drive, Singapore, 637553 Singapore
| |
Collapse
|
6
|
Sinha S, Eisenhaber B, Jensen LJ, Kalbuaji B, Eisenhaber F. Darkness in the Human Gene and Protein Function Space: Widely Modest or Absent Illumination by the Life Science Literature and the Trend for Fewer Protein Function Discoveries Since 2000. Proteomics 2018; 18:e1800093. [PMID: 30265449 PMCID: PMC6282819 DOI: 10.1002/pmic.201800093] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Revised: 09/07/2018] [Indexed: 12/15/2022]
Abstract
The mentioning of gene names in the body of the scientific literature 1901-2017 and their fractional counting is used as a proxy to assess the level of biological function discovery. A literature score of one has been defined as full publication equivalent (FPE), the amount of literature necessary to achieve one publication solely dedicated to a gene. It has been found that less than 5000 human genes have each at least 100 FPEs in the available literature corpus. This group of elite genes (4817 protein-coding genes, 119 non-coding RNAs) attracts the overwhelming majority of the scientific literature about genes. Yet, thousands of proteins have never been mentioned at all, ≈2000 further proteins have not even one FPE of literature and, for ≈4600 additional proteins, the FPE count is below 10. The protein function discovery rate measured as numbers of proteins first mentioned or crossing a threshold of accumulated FPEs in a given year has grown until 2000 but is in decline thereafter. This drop is partially offset by function discoveries for non-coding RNAs. The full human genome sequencing does not boost the function discovery rate. Since 2000, the fastest growing group in the literature is that with at least 500 FPEs per gene.
Collapse
Affiliation(s)
- Swati Sinha
- Bioinformatics Institute (BII)Agency for Science and Technology (A*STAR)Matrix138671Singapore
| | - Birgit Eisenhaber
- Bioinformatics Institute (BII)Agency for Science and Technology (A*STAR)Matrix138671Singapore
| | - Lars Juhl Jensen
- Novo Nordisk Foundation Center for Protein ResearchFaculty of Health and Medical SciencesUniversity of CopenhagenDK-2200 CopenhagenDenmark
| | - Bharata Kalbuaji
- Bioinformatics Institute (BII)Agency for Science and Technology (A*STAR)Matrix138671Singapore
| | - Frank Eisenhaber
- Bioinformatics Institute (BII)Agency for Science and Technology (A*STAR)Matrix138671Singapore
- School of Computer Science and Engineering (SCSE)Nanyang Technological University (NTU)637553Singapore
| |
Collapse
|
7
|
Eisenhaber B, Sinha S, Wong WC, Eisenhaber F. Function of a membrane-embedded domain evolutionarily multiplied in the GPI lipid anchor pathway proteins PIG-B, PIG-M, PIG-U, PIG-W, PIG-V, and PIG-Z. Cell Cycle 2018; 17:874-880. [PMID: 29764287 PMCID: PMC6056205 DOI: 10.1080/15384101.2018.1456294] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Distant homology relationships among proteins with many transmembrane regions (TMs) are difficult to detect as they are clouded by the TMs’ hydrophobic compositional bias and mutational divergence in connecting loops. In the case of several GPI lipid anchor biosynthesis pathway components, the hidden evolutionary signal can be revealed with dissectHMMER, a sequence similarity search tool focusing on fold-critical, high complexity sequence segments. We find that a sequence module with 10 TMs in PIG-W, described as acyl transferase, is homologous to PIG-U, a transamidase subunit without characterized molecular function, and to mannosyltransferases PIG-B, PIG-M, PIG-V and PIG-Z. We conclude that this new, membrane-embedded domain named BindGPILA functions as the unit for recognizing, binding and stabilizing the GPI lipid anchor in a modification-competent form as this appears the only functional aspect shared among all proteins. Thus, PIG-U's likely molecular function is shuttling/presenting the anchor in a productive conformation to the transamidase complex.
Collapse
Affiliation(s)
- Birgit Eisenhaber
- a Bioinformatics Institute, Agency for Science , Technology and Research (A*STAR) , 30 Biopolis Street, #07-01 Matrix, Singapore 138671 , Republic of Singapore
| | - Swati Sinha
- a Bioinformatics Institute, Agency for Science , Technology and Research (A*STAR) , 30 Biopolis Street, #07-01 Matrix, Singapore 138671 , Republic of Singapore
| | - Wing-Cheong Wong
- a Bioinformatics Institute, Agency for Science , Technology and Research (A*STAR) , 30 Biopolis Street, #07-01 Matrix, Singapore 138671 , Republic of Singapore
| | - Frank Eisenhaber
- a Bioinformatics Institute, Agency for Science , Technology and Research (A*STAR) , 30 Biopolis Street, #07-01 Matrix, Singapore 138671 , Republic of Singapore.,b School of Computer Engineering , Nanyang Technological University (NTU) , 50 Nanyang Drive, Singapore 637553 , Republic of Singapore
| |
Collapse
|
8
|
Limviphuvadh V, Tan CS, Konishi F, Jenjaroenpun P, Xiang JS, Kremenska Y, Mu YS, Syn N, Lee SC, Soo RA, Eisenhaber F, Maurer-Stroh S, Yong WP. Discovering novel SNPs that are correlated with patient outcome in a Singaporean cancer patient cohort treated with gemcitabine-based chemotherapy. BMC Cancer 2018; 18:555. [PMID: 29751792 PMCID: PMC5948914 DOI: 10.1186/s12885-018-4471-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Accepted: 05/01/2018] [Indexed: 12/20/2022] Open
Abstract
Background Single Nucleotide Polymorphisms (SNPs) can influence patient outcome such as drug response and toxicity after drug intervention. The purpose of this study is to develop a systematic pathway approach to accurately and efficiently predict novel non-synonymous SNPs (nsSNPs) that could be causative to gemcitabine-based chemotherapy treatment outcome in Singaporean non-small cell lung cancer (NSCLC) patients. Methods Using a pathway approach that incorporates comprehensive protein-protein interaction data to systematically extend the gemcitabine pharmacologic pathway, we identified 77 related nsSNPs, common in the Singaporean population. After that, we used five computational criteria to prioritize the SNPs based on their importance for protein function. We specifically selected and screened six candidate SNPs in a patient cohort with NSCLC treated with gemcitabine-based chemotherapy. Result We performed survival analysis followed by hematologic toxicity analyses and found that three of six candidate SNPs are significantly correlated with the patient outcome (P < 0.05) i.e. ABCG2 Q141K (rs2231142), SLC29A3 S158F (rs780668) and POLR2A N764K (rs2228130). Conclusions Our computational SNP candidate enrichment workflow approach was able to identify several high confidence biomarkers predictive for personalized drug treatment outcome while providing a rationale for a molecular mechanism of the SNP effect. Trial registration NCT00695994. Registered 10 June, 2008 ‘retrospectively registered’. Electronic supplementary material The online version of this article (10.1186/s12885-018-4471-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Vachiranee Limviphuvadh
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01 Matrix, Singapore, 138671, Singapore
| | - Chee Seng Tan
- Department of Haematology-Oncology, National University Health System, 5 Lower Kent Ridge Road, Singapore, 119074, Singapore
| | - Fumikazu Konishi
- Education Academy of Computational Life Sciences, Tokyo Institute of Technology, Tokyo, Japan
| | - Piroon Jenjaroenpun
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01 Matrix, Singapore, 138671, Singapore
| | - Joy Shengnan Xiang
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01 Matrix, Singapore, 138671, Singapore
| | - Yuliya Kremenska
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01 Matrix, Singapore, 138671, Singapore
| | - Yar Soe Mu
- Department of Haematology-Oncology, National University Health System, 5 Lower Kent Ridge Road, Singapore, 119074, Singapore
| | - Nicholas Syn
- Department of Haematology-Oncology, National University Health System, 5 Lower Kent Ridge Road, Singapore, 119074, Singapore.,Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - Soo Chin Lee
- Department of Haematology-Oncology, National University Health System, 5 Lower Kent Ridge Road, Singapore, 119074, Singapore
| | - Ross A Soo
- Department of Haematology-Oncology, National University Health System, 5 Lower Kent Ridge Road, Singapore, 119074, Singapore.,Cancer Science Institute of Singapore, National University of Singapore, Singapore, Singapore
| | - Frank Eisenhaber
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01 Matrix, Singapore, 138671, Singapore.,Department of Biological Sciences, National University of Singapore (NUS), 14 Science Drive 4, Singapore, 117543, Singapore.,School of Computer Engineering (SCE), Nanyang Technological University (NTU), 50 Nanyang Drive, Singapore, 637553, Singapore
| | - Sebastian Maurer-Stroh
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01 Matrix, Singapore, 138671, Singapore.,Department of Biological Sciences, National University of Singapore (NUS), 14 Science Drive 4, Singapore, 117543, Singapore
| | - Wei Peng Yong
- Department of Haematology-Oncology, National University Health System, 5 Lower Kent Ridge Road, Singapore, 119074, Singapore.
| |
Collapse
|
9
|
Abstract
Since the human genome project in 2003, the view of personalized medicine to improve diagnosis and cure diseases at the molecular level became more real. Sequencing the human genome brought some benefits in medicine such as early detection of diseases with a genetic predisposition, treating patients with rare diseases, the design of gene therapy and the understanding of pharmacogenetics in the metabolism of drugs. This review explains the concepts of pharmacogenetics, polymorphisms, mutations, variations, and alleles, and how this information has helped us better understand the metabolism of drugs. Multiple resources are presented to promote reducing the gap between scientists, physicians, and patients in understanding the use and benefits of pharmacogenetics. Some of the most common clinical examples of genetic variants and how pharmacogenetics was used to determine treatment options for patients having these variants were discussed. Finally, we evaluated some of the challenges of implementing pharmacogenetics in a clinical setting and proposed actions to be taken to make pharmacogenetics a standard diagnostic tool in personalized medicine.
Collapse
Affiliation(s)
- J T Oates
- Department of Pharmaceutical Sciences, Biomanufacturing Research Institute and Technology Enterprise (BRITE), College of Arts and Sciences, North Carolina Central University, USA
| | - D Lopez
- Department of Pharmaceutical Sciences, Biomanufacturing Research Institute and Technology Enterprise (BRITE), College of Arts and Sciences, North Carolina Central University, USA
| |
Collapse
|
10
|
Caboche S, Even G, Loywick A, Audebert C, Hot D. MICRA: an automatic pipeline for fast characterization of microbial genomes from high-throughput sequencing data. Genome Biol 2017; 18:233. [PMID: 29258574 PMCID: PMC5738152 DOI: 10.1186/s13059-017-1367-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2017] [Accepted: 11/30/2017] [Indexed: 12/15/2022] Open
Abstract
The increase in available sequence data has advanced the field of microbiology; however, making sense of these data without bioinformatics skills is still problematic. We describe MICRA, an automatic pipeline, available as a web interface, for microbial identification and characterization through reads analysis. MICRA uses iterative mapping against reference genomes to identify genes and variations. Additional modules allow prediction of antibiotic susceptibility and resistance and comparing the results of several samples. MICRA is fast, producing few false-positive annotations and variant calls compared to current methods, making it a tool of great interest for fully exploiting sequencing data.
Collapse
Affiliation(s)
- Ségolène Caboche
- University of Lille, CNRS, Inserm, CHU Lille, Institut Pasteur de Lille, U1019-UMR 8204-CIIL-Centre d'Infection et d'Immunité de Lille, F-59000, Lille, France. .,PEGASE-Biosciences, Institut Pasteur de Lille, 1 Rue du Professeur Calmette, 59019, Lille, France.
| | - Gaël Even
- Genes Diffusion, 3595, Route de Tournai, 59501, Douai, France.,PEGASE-Biosciences, Institut Pasteur de Lille, 1 Rue du Professeur Calmette, 59019, Lille, France
| | - Alexandre Loywick
- Genes Diffusion, 3595, Route de Tournai, 59501, Douai, France.,PEGASE-Biosciences, Institut Pasteur de Lille, 1 Rue du Professeur Calmette, 59019, Lille, France
| | - Christophe Audebert
- Genes Diffusion, 3595, Route de Tournai, 59501, Douai, France.,PEGASE-Biosciences, Institut Pasteur de Lille, 1 Rue du Professeur Calmette, 59019, Lille, France
| | - David Hot
- University of Lille, CNRS, Inserm, CHU Lille, Institut Pasteur de Lille, U1019-UMR 8204-CIIL-Centre d'Infection et d'Immunité de Lille, F-59000, Lille, France.,PEGASE-Biosciences, Institut Pasteur de Lille, 1 Rue du Professeur Calmette, 59019, Lille, France
| |
Collapse
|
11
|
Marakasova ES, Eisenhaber B, Maurer-Stroh S, Eisenhaber F, Baranova A. Prenylation of viral proteins by enzymes of the host: Virus-driven rationale for therapy with statins and FT/GGT1 inhibitors. Bioessays 2017; 39. [DOI: 10.1002/bies.201700014] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
| | - Birgit Eisenhaber
- Bioinformatics Institute; Agency for Science; Technology and Research Singapore
| | - Sebastian Maurer-Stroh
- Bioinformatics Institute; Agency for Science; Technology and Research Singapore
- Department of Biological Sciences; National University Singapore; Singapore
| | - Frank Eisenhaber
- Bioinformatics Institute; Agency for Science; Technology and Research Singapore
- Department of Biological Sciences; National University Singapore; Singapore
- School of Computer Engineering; Nanyang Technological University; Singapore
| | - Ancha Baranova
- School of Systems Biology; George Mason University; Fairfax VA USA
- Research Centre for Medical Genetics; Russian Academy of Medical Sciences; Moscow Russia
| |
Collapse
|
12
|
Baker JA, Wong WC, Eisenhaber B, Warwicker J, Eisenhaber F. Charged residues next to transmembrane regions revisited: "Positive-inside rule" is complemented by the "negative inside depletion/outside enrichment rule". BMC Biol 2017; 15:66. [PMID: 28738801 PMCID: PMC5525207 DOI: 10.1186/s12915-017-0404-4] [Citation(s) in RCA: 57] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2017] [Accepted: 07/07/2017] [Indexed: 11/25/2022] Open
Abstract
Background Transmembrane helices (TMHs) frequently occur amongst protein architectures as means for proteins to attach to or embed into biological membranes. Physical constraints such as the membrane’s hydrophobicity and electrostatic potential apply uniform requirements to TMHs and their flanking regions; consequently, they are mirrored in their sequence patterns (in addition to TMHs being a span of generally hydrophobic residues) on top of variations enforced by the specific protein’s biological functions. Results With statistics derived from a large body of protein sequences, we demonstrate that, in addition to the positive charge preference at the cytoplasmic inside (positive-inside rule), negatively charged residues preferentially occur or are even enriched at the non-cytoplasmic flank or, at least, they are suppressed at the cytoplasmic flank (negative-not-inside/negative-outside (NNI/NO) rule). As negative residues are generally rare within or near TMHs, the statistical significance is sensitive with regard to details of TMH alignment and residue frequency normalisation and also to dataset size; therefore, this trend was obscured in previous work. We observe variations amongst taxa as well as for organelles along the secretory pathway. The effect is most pronounced for TMHs from single-pass transmembrane (bitopic) proteins compared to those with multiple TMHs (polytopic proteins) and especially for the class of simple TMHs that evolved for the sole role as membrane anchors. Conclusions The charged-residue flank bias is only one of the TMH sequence features with a role in the anchorage mechanisms, others apparently being the leucine intra-helix propensity skew towards the cytoplasmic side, tryptophan flanking as well as the cysteine and tyrosine inside preference. These observations will stimulate new prediction methods for TMHs and protein topology from a sequence as well as new engineering designs for artificial membrane proteins. Electronic supplementary material The online version of this article (doi:10.1186/s12915-017-0404-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- James Alexander Baker
- Bioinformatics Institute, Agency for Science Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix, Singapore, 138671, Singapore.,School of Chemistry, Manchester Institute of Biotechnology, 131 Princess Street, Manchester, M1 7DN, UK
| | - Wing-Cheong Wong
- Bioinformatics Institute, Agency for Science Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix, Singapore, 138671, Singapore
| | - Birgit Eisenhaber
- Bioinformatics Institute, Agency for Science Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix, Singapore, 138671, Singapore
| | - Jim Warwicker
- School of Chemistry, Manchester Institute of Biotechnology, 131 Princess Street, Manchester, M1 7DN, UK.
| | - Frank Eisenhaber
- Bioinformatics Institute, Agency for Science Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix, Singapore, 138671, Singapore. .,School of Computer Engineering (SCE), Nanyang Technological University (NTU), 50 Nanyang Drive, Singapore, 637553, Singapore.
| |
Collapse
|
13
|
Yap CK, Eisenhaber B, Eisenhaber F, Wong WC. xHMMER3x2: Utilizing HMMER3's speed and HMMER2's sensitivity and specificity in the glocal alignment mode for improved large-scale protein domain annotation. Biol Direct 2016; 11:63. [PMID: 27894340 PMCID: PMC5126834 DOI: 10.1186/s13062-016-0163-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2016] [Accepted: 10/24/2016] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND While the local-mode HMMER3 is notable for its massive speed improvement, the slower glocal-mode HMMER2 is more exact for domain annotation by enforcing full domain-to-sequence alignments. Since a unit of domain necessarily implies a unit of function, local-mode HMMER3 alone remains insufficient for precise function annotation tasks. In addition, the incomparable E-values for the same domain model by different HMMER builds create difficulty when checking for domain annotation consistency on a large-scale basis. RESULTS In this work, both the speed of HMMER3 and glocal-mode alignment of HMMER2 are combined within the xHMMER3x2 framework for tackling the large-scale domain annotation task. Briefly, HMMER3 is utilized for initial domain detection so that HMMER2 can subsequently perform the glocal-mode, sequence-to-full-domain alignments for the detected HMMER3 hits. An E-value calibration procedure is required to ensure that the search space by HMMER2 is sufficiently replicated by HMMER3. We find that the latter is straightforwardly possible for ~80% of the models in the Pfam domain library (release 29). However in the case of the remaining ~20% of HMMER3 domain models, the respective HMMER2 counterparts are more sensitive. Thus, HMMER3 searches alone are insufficient to ensure sensitivity and a HMMER2-based search needs to be initiated. When tested on the set of UniProt human sequences, xHMMER3x2 can be configured to be between 7× and 201× faster than HMMER2, but with descending domain detection sensitivity from 99.8 to 95.7% with respect to HMMER2 alone; HMMER3's sensitivity was 95.7%. At extremes, xHMMER3x2 is either the slow glocal-mode HMMER2 or the fast HMMER3 with glocal-mode. Finally, the E-values to false-positive rates (FPR) mapping by xHMMER3x2 allows E-values of different model builds to be compared, so that any annotation discrepancies in a large-scale annotation exercise can be flagged for further examination by dissectHMMER. CONCLUSION The xHMMER3x2 workflow allows large-scale domain annotation speed to be drastically improved over HMMER2 without compromising for domain-detection with regard to sensitivity and sequence-to-domain alignment incompleteness. The xHMMER3x2 code and its webserver (for Pfam release 27, 28 and 29) are freely available at http://xhmmer3x2.bii.a-star.edu.sg/ . REVIEWERS Reviewed by Thomas Dandekar, L. Aravind, Oliviero Carugo and Shamil Sunyaev. For the full reviews, please go to the Reviewers' comments section.
Collapse
Affiliation(s)
- Choon-Kong Yap
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore, 138671, Singapore
| | - Birgit Eisenhaber
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore, 138671, Singapore
| | - Frank Eisenhaber
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore, 138671, Singapore. .,School of Computer Engineering (SCE), Nanyang Technological University (NTU), 50 Nanyang Drive, Singapore, 637553, Singapore.
| | - Wing-Cheong Wong
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore, 138671, Singapore.
| |
Collapse
|
14
|
The Recipe for Protein Sequence-Based Function Prediction and Its Implementation in the ANNOTATOR Software Environment. Methods Mol Biol 2016; 1415:477-506. [PMID: 27115649 DOI: 10.1007/978-1-4939-3572-7_25] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/02/2023]
|
15
|
Sirota FL, Maurer-Stroh S, Eisenhaber B, Eisenhaber F. Single-residue posttranslational modification sites at the N-terminus, C-terminus or in-between: To be or not to be exposed for enzyme access. Proteomics 2016; 15:2525-46. [PMID: 26038108 PMCID: PMC4745020 DOI: 10.1002/pmic.201400633] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2014] [Revised: 04/17/2015] [Accepted: 05/29/2015] [Indexed: 11/30/2022]
Abstract
Many protein posttranslational modifications (PTMs) are the result of an enzymatic reaction. The modifying enzyme has to recognize the substrate protein's sequence motif containing the residue(s) to be modified; thus, the enzyme's catalytic cleft engulfs these residue(s) and the respective sequence environment. This residue accessibility condition principally limits the range where enzymatic PTMs can occur in the protein sequence. Non‐globular, flexible, intrinsically disordered segments or large loops/accessible long side chains should be preferred whereas residues buried in the core of structures should be void of what we call canonical, enzyme‐generated PTMs. We investigate whether PTM sites annotated in UniProtKB (with MOD_RES/LIPID keys) are situated within sequence ranges that can be mapped to known 3D structures. We find that N‐ or C‐termini harbor essentially exclusively canonical PTMs. We also find that the overwhelming majority of all other PTMs are also canonical though, later in the protein's life cycle, the PTM sites can become buried due to complex formation. Among the remaining cases, some can be explained (i) with autocatalysis, (ii) with modification before folding or after temporary unfolding, or (iii) as products of interaction with small, diffusible reactants. Others require further research how these PTMs are mechanistically generated in vivo.
Collapse
Affiliation(s)
- Fernanda L Sirota
- Bioinformatics Institute (BII), Agency for Science and Technology (A*STAR), Matrix, Singapore
| | - Sebastian Maurer-Stroh
- Bioinformatics Institute (BII), Agency for Science and Technology (A*STAR), Matrix, Singapore.,School of Biological Sciences (SBS), Nanyang Technological University (NTU), Singapore
| | - Birgit Eisenhaber
- Bioinformatics Institute (BII), Agency for Science and Technology (A*STAR), Matrix, Singapore
| | - Frank Eisenhaber
- Bioinformatics Institute (BII), Agency for Science and Technology (A*STAR), Matrix, Singapore.,Department of Biological Sciences (DBS), National University of Singapore (NUS), Singapore.,School of Computer Engineering (SCE), Nanyang Technological University (NTU), Singapore
| |
Collapse
|
16
|
Sherman WA, Kuchibhatla DB, Limviphuvadh V, Maurer-Stroh S, Eisenhaber B, Eisenhaber F. HPMV: human protein mutation viewer - relating sequence mutations to protein sequence architecture and function changes. J Bioinform Comput Biol 2015; 13:1550028. [PMID: 26503432 DOI: 10.1142/s0219720015500286] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Next-generation sequencing advances are rapidly expanding the number of human mutations to be analyzed for causative roles in genetic disorders. Our Human Protein Mutation Viewer (HPMV) is intended to explore the biomolecular mechanistic significance of non-synonymous human mutations in protein-coding genomic regions. The tool helps to assess whether protein mutations affect the occurrence of sequence-architectural features (globular domains, targeting signals, post-translational modification sites, etc.). As input, HPMV accepts protein mutations - as UniProt accessions with mutations (e.g. HGVS nomenclature), genome coordinates, or FASTA sequences. As output, HPMV provides an interactive cartoon showing the mutations in relation to elements of the sequence architecture. A large variety of protein sequence architectural features were selected for their particular relevance to mutation interpretation. Clicking a sequence feature in the cartoon expands a tree view of additional information including multiple sequence alignments of conserved domains and a simple 3D viewer mapping the mutation to known PDB structures, if available. The cartoon is also correlated with a multiple sequence alignment of similar sequences from other organisms. In cases where a mutation is likely to have a straightforward interpretation (e.g. a point mutation disrupting a well-understood targeting signal), this interpretation is suggested. The interactive cartoon can be downloaded as standalone viewer in Java jar format to be saved and viewed later with only a standard Java runtime environment. The HPMV website is: http://hpmv.bii.a-star.edu.sg/ .
Collapse
Affiliation(s)
- Westley Arthur Sherman
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix, Singapore 138671, Singapore
| | - Durga Bhavani Kuchibhatla
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix, Singapore 138671, Singapore
| | - Vachiranee Limviphuvadh
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix, Singapore 138671, Singapore
| | - Sebastian Maurer-Stroh
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix, Singapore 138671, Singapore
- School of Biological Sciences (SBS), Nanyang Technological University (NTU), 60 Nanyang Drive, Singapore 637551, Singapore
| | - Birgit Eisenhaber
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix, Singapore 138671, Singapore
| | - Frank Eisenhaber
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street #07-01, Matrix, Singapore 138671, Singapore
- Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive 4, Singapore 117597, Singapore
- School of Computer Engineering (SCE), Nanyang Technological University (NTU), 50 Nanyang Drive, Singapore 637553, Singapore
| |
Collapse
|
17
|
Caboche S, Audebert C, Hot D. High-Throughput Sequencing, a VersatileWeapon to Support Genome-Based Diagnosis in Infectious Diseases: Applications to Clinical Bacteriology. Pathogens 2014; 3:258-79. [PMID: 25437800 PMCID: PMC4243446 DOI: 10.3390/pathogens3020258] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2013] [Revised: 02/28/2014] [Accepted: 03/20/2014] [Indexed: 12/19/2022] Open
Abstract
The recent progresses of high-throughput sequencing (HTS) technologies enable easy and cost-reduced access to whole genome sequencing (WGS) or re-sequencing. HTS associated with adapted, automatic and fast bioinformatics solutions for sequencing applications promises an accurate and timely identification and characterization of pathogenic agents. Many studies have demonstrated that data obtained from HTS analysis have allowed genome-based diagnosis, which has been consistent with phenotypic observations. These proofs of concept are probably the first steps toward the future of clinical microbiology. From concept to routine use, many parameters need to be considered to promote HTS as a powerful tool to help physicians and clinicians in microbiological investigations. This review highlights the milestones to be completed toward this purpose.
Collapse
Affiliation(s)
- Ségolène Caboche
- FRE 3642 Molecular and Cellular Medecine, CNRS, Institut Pasteur de Lille and University Lille Nord de France, Lille 59019, France.
| | | | - David Hot
- FRE 3642 Molecular and Cellular Medecine, CNRS, Institut Pasteur de Lille and University Lille Nord de France, Lille 59019, France.
| |
Collapse
|
18
|
EISENHABER FRANK, SUNG WINGKIN, WONG LIMSOON. THE 24TH INTERNATIONAL CONFERENCE ON GENOME INFORMATICS, GIW2013, IN SINGAPORE. J Bioinform Comput Biol 2013. [DOI: 10.1142/s0219720013020034] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Affiliation(s)
- FRANK EISENHABER
- Bioinformatics Institute, Agency for Science, Technology and Research, 30 Biopolis Street #07-01, Matrix, Singapore 138671, Singapore
- Department of Biological Sciences, National University of Singapore, 8 Medical Drive, Singapore 117597, Singapore
- School of Computer Engineering, Nanyang Technological University, 50 Nanyang Drive, Singapore 637553, Singapore
| | - WING-KIN SUNG
- School of Computing, National University of Singapore, 13 Computing Drive, Singapore 117417, Singapore
- Genome Institute of Singapore, 60 Biopolis Street #02-01, Genome, Singapore 138672, Singapore
| | - LIMSOON WONG
- School of Computing, National University of Singapore, 13 Computing Drive, Singapore 117417, Singapore
| |
Collapse
|
19
|
Zhang Y. Welcome to health information science and systems. Health Inf Sci Syst 2013; 1:1. [PMID: 25825653 PMCID: PMC4336118 DOI: 10.1186/2047-2501-1-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2012] [Accepted: 12/04/2012] [Indexed: 11/10/2022] Open
Abstract
Health Information Science and Systems is an exciting, new, multidisciplinary journal that aims to use technologies in computer science to assist in disease diagnoses, treatment, prediction and monitoring through the modeling, design, development, visualization, integration and management of health related information. These computer-science technologies include such as information systems, web technologies, data mining, image processing, user interaction and interface, sensors and wireless networking and are applicable to a wide range of health related information including medical data, biomedical data, bioinformatics data, public health data.
Collapse
Affiliation(s)
- Yanchun Zhang
- Centre for Applied Informatics, College of Engineering and Science, Victoria University, Melbourne, Australia
| |
Collapse
|