1
|
Sidders B, Zhang P, Goodwin K, O'Connor G, Russell DL, Borodovsky A, Armenia J, McEwen R, Linghu B, Bendell JC, Bauer TM, Patel MR, Falchook GS, Merchant M, Pouliot G, Barrett JC, Dry JR, Woessner R, Sachsenmeier K. Adenosine Signaling Is Prognostic for Cancer Outcome and Has Predictive Utility for Immunotherapeutic Response. Clin Cancer Res 2020; 26:2176-2187. [PMID: 31953314 DOI: 10.1158/1078-0432.ccr-19-2183] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2019] [Revised: 11/15/2019] [Accepted: 01/14/2020] [Indexed: 11/16/2022]
Abstract
PURPOSE There are several agents in early clinical trials targeting components of the adenosine pathway including A2AR and CD73. The identification of cancers with a significant adenosine drive is critical to understand the potential for these molecules. However, it is challenging to measure tumor adenosine levels at scale, thus novel, clinically tractable biomarkers are needed. EXPERIMENTAL DESIGN We generated a gene expression signature for the adenosine signaling using regulatory networks derived from the literature and validated this in patients. We applied the signature to large cohorts of disease from The Cancer Genome Atlas (TCGA) and cohorts of immune checkpoint inhibitor-treated patients. RESULTS The signature captures baseline adenosine levels in vivo (r 2 = 0.92, P = 0.018), is reduced after small-molecule inhibition of A2AR in mice (r 2 = -0.62, P = 0.001) and humans (reduction in 5 of 7 patients, 70%), and is abrogated after A2AR knockout. Analysis of TCGA confirms a negative association between adenosine and overall survival (OS, HR = 0.6, P < 2.2e-16) as well as progression-free survival (PFS, HR = 0.77, P = 0.0000006). Further, adenosine signaling is associated with reduced OS (HR = 0.47, P < 2.2e-16) and PFS (HR = 0.65, P = 0.0000002) in CD8+ T-cell-infiltrated tumors. Mutation of TGFβ superfamily members is associated with enhanced adenosine signaling and worse OS (HR = 0.43, P < 2.2e-16). Finally, adenosine signaling is associated with reduced efficacy of anti-PD1 therapy in published cohorts (HR = 0.29, P = 0.00012). CONCLUSIONS These data support the adenosine pathway as a mediator of a successful antitumor immune response, demonstrate the prognostic potential of the signature for immunotherapy, and inform patient selection strategies for adenosine pathway modulators currently in development.
Collapse
Affiliation(s)
- Ben Sidders
- Translational Medicine, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge, United Kingdom.
| | - Pei Zhang
- Translational Medicine, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge, United Kingdom
| | - Kelly Goodwin
- Discovery, Research and Early Development, Oncology R&D, AstraZeneca, Boston, Massachusetts
| | - Greg O'Connor
- Translational Medicine, Research and Early Development, Oncology R&D, AstraZeneca, Boston, Massachusetts
| | - Deanna L Russell
- Translational Medicine, Research and Early Development, Oncology R&D, AstraZeneca, Boston, Massachusetts
| | - Alexandra Borodovsky
- Discovery, Research and Early Development, Oncology R&D, AstraZeneca, Boston, Massachusetts
| | - Joshua Armenia
- Translational Medicine, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge, United Kingdom
| | - Robert McEwen
- Translational Medicine, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge, United Kingdom
| | - Bolan Linghu
- Translational Medicine, Research and Early Development, Oncology R&D, AstraZeneca, Boston, Massachusetts
| | - Johanna C Bendell
- Sarah Cannon Research Institute/Tennessee Oncology, Nashville, Tennessee
| | - Todd M Bauer
- Sarah Cannon Research Institute/Tennessee Oncology, Nashville, Tennessee
| | - Manish R Patel
- Sarah Cannon Research Institute/Florida Cancer Specialists, Sarasota, Florida
| | | | - Melinda Merchant
- Early Clinical Development, Research and Early Development, Oncology R&D, AstraZeneca, Boston, Massachusetts
| | - Gayle Pouliot
- Early Clinical Development, Research and Early Development, Oncology R&D, AstraZeneca, Boston, Massachusetts
| | - J Carl Barrett
- Translational Medicine, Research and Early Development, Oncology R&D, AstraZeneca, Boston, Massachusetts
| | - Jonathan R Dry
- Translational Medicine, Research and Early Development, Oncology R&D, AstraZeneca, Boston, Massachusetts
| | - Rich Woessner
- Discovery, Research and Early Development, Oncology R&D, AstraZeneca, Boston, Massachusetts
| | - Kris Sachsenmeier
- Translational Medicine, Research and Early Development, Oncology R&D, AstraZeneca, Boston, Massachusetts
| |
Collapse
|
2
|
Yang X, Song Z, Wu C, Wang W, Li G, Zhang W, Wu L, Lu K. Constructing a database for the relations between CNV and human genetic diseases via systematic text mining. BMC Bioinformatics 2018; 19:528. [PMID: 30598077 PMCID: PMC6311945 DOI: 10.1186/s12859-018-2526-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND The detection and interpretation of CNVs are of clinical importance in genetic testing. Several databases and web services are already being used by clinical geneticists to interpret the medical relevance of identified CNVs in patients. However, geneticists or physicians would like to obtain the original literature context for more detailed information, especially for rare CNVs that were not included in databases. RESULTS The resulting CNVdigest database includes 440,485 sentences for CNV-disease relationship. A total number of 1582 CNVs and 2425 diseases are involved. Sentences describing CNV-disease correlations are indexed in CNVdigest, with CNV mentions and disease mentions annotated. CONCLUSIONS In this paper, we use a systematic text mining method to construct a database for the relationship between CNVs and diseases. Based on that, we also developed a concise front-end to facilitate the analysis of CNV/disease association, providing a user-friendly web interface for convenient queries. The resulting system is publically available at http://cnv.gtxlab.com /.
Collapse
Affiliation(s)
- Xi Yang
- School of Computer Science, National University of Defense Technology, Changsha, 410073, China
| | - Zhuo Song
- Genetalks Biotech Inc., Beijing, 100176, China
| | - Chengkun Wu
- School of Computer Science, National University of Defense Technology, Changsha, 410073, China. .,Institute for Quantum Information & State Key Laboratory of High Performance Computing, College of Computer, National University of Defense Technology, Changsha, 410073, China.
| | - Wei Wang
- School of Computer Science, National University of Defense Technology, Changsha, 410073, China
| | - Gen Li
- Genetalks Biotech Inc., Beijing, 100176, China
| | - Wei Zhang
- Genetalks Biotech Inc., Beijing, 100176, China
| | - Lingqian Wu
- Center for Medical Genetics, Central South University, 110 Xiangya Road, Changsha, 410078, Hunan, China.
| | - Kai Lu
- School of Computer Science, National University of Defense Technology, Changsha, 410073, China.
| |
Collapse
|
3
|
Network-Based Drug Discovery: Coupling Network Pharmacology with Phenotypic Screening for Neuronal Excitability. J Mol Biol 2018; 430:3005-3015. [DOI: 10.1016/j.jmb.2018.07.016] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2018] [Revised: 07/05/2018] [Accepted: 07/10/2018] [Indexed: 01/02/2023]
|
4
|
Roth A, Subramanian S, Ganapathiraju MK. Towards Extracting Supporting Information About Predicted Protein-Protein Interactions. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1239-1246. [PMID: 26672046 DOI: 10.1109/tcbb.2015.2505278] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
One of the goals of relation extraction is to identify protein-protein interactions (PPIs) in biomedical literature. Current systems are capturing binary relations and also the direction and type of an interaction. Besides assisting in the curation PPIs into databases, there has been little real-world application of these algorithms. We describe UPSITE, a text mining tool for extracting evidence in support of a hypothesized interaction. Given a predicted PPI, UPSITE uses a binary relation detector to check whether a PPI is found in abstracts in PubMed. If it is not found, UPSITE retrieves documents relevant to each of the two proteins separately, and extracts contextual information about biological events surrounding each protein, and calculates semantic similarity of the two proteins to provide evidential support for the predicted PPI. In evaluations, relation extraction achieved an Fscore of 0.88 on the HPRD50 corpus, and semantic similarity measured with angular distance was found to be statistically significant. With the development of PPI prediction algorithms, the burden of interpreting the validity and relevance of novel PPIs is on biologists. We suggest that presenting annotations of the two proteins in a PPI side-by-side and a score that quantifies their similarity lessens this burden to some extent.
Collapse
|
5
|
Kreimeyer K, Foster M, Pandey A, Arya N, Halford G, Jones SF, Forshee R, Walderhaug M, Botsis T. Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review. J Biomed Inform 2017; 73:14-29. [PMID: 28729030 DOI: 10.1016/j.jbi.2017.07.012] [Citation(s) in RCA: 321] [Impact Index Per Article: 40.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Revised: 06/07/2017] [Accepted: 07/14/2017] [Indexed: 12/24/2022]
Abstract
We followed a systematic approach based on the Preferred Reporting Items for Systematic Reviews and Meta-Analyses to identify existing clinical natural language processing (NLP) systems that generate structured information from unstructured free text. Seven literature databases were searched with a query combining the concepts of natural language processing and structured data capture. Two reviewers screened all records for relevance during two screening phases, and information about clinical NLP systems was collected from the final set of papers. A total of 7149 records (after removing duplicates) were retrieved and screened, and 86 were determined to fit the review criteria. These papers contained information about 71 different clinical NLP systems, which were then analyzed. The NLP systems address a wide variety of important clinical and research tasks. Certain tasks are well addressed by the existing systems, while others remain as open challenges that only a small number of systems attempt, such as extraction of temporal information or normalization of concepts to standard terminologies. This review has identified many NLP systems capable of processing clinical free text and generating structured output, and the information collected and evaluated here will be important for prioritizing development of new approaches for clinical NLP.
Collapse
Affiliation(s)
- Kory Kreimeyer
- Office of Biostatistics and Epidemiology, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, United States.
| | - Matthew Foster
- Office of Biostatistics and Epidemiology, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, United States
| | - Abhishek Pandey
- Office of Biostatistics and Epidemiology, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, United States
| | - Nina Arya
- Office of Biostatistics and Epidemiology, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, United States
| | - Gwendolyn Halford
- FDA Library, US Food and Drug Administration, Silver Spring, MD, United States
| | - Sandra F Jones
- Cancer Surveillance Branch, Division of Cancer Prevention and Control, National Center for Chronic Disease Prevention and Health Promotion, Centers for Disease Control and Prevention, Atlanta, GA, United States
| | - Richard Forshee
- Office of Biostatistics and Epidemiology, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, United States
| | - Mark Walderhaug
- Office of Biostatistics and Epidemiology, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, United States
| | - Taxiarchis Botsis
- Office of Biostatistics and Epidemiology, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, United States
| |
Collapse
|
6
|
McArthur AG, Tsang KK. Antimicrobial resistance surveillance in the genomic age. Ann N Y Acad Sci 2016; 1388:78-91. [PMID: 27875856 DOI: 10.1111/nyas.13289] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2016] [Revised: 09/30/2016] [Accepted: 10/05/2016] [Indexed: 12/31/2022]
Abstract
The loss of effective antimicrobials is reducing our ability to protect the global population from infectious disease. However, the field of antibiotic drug discovery and the public health monitoring of antimicrobial resistance (AMR) is beginning to exploit the power of genome and metagenome sequencing. The creation of novel AMR bioinformatics tools and databases and their continued development will advance our understanding of the molecular mechanisms and threat severity of antibiotic resistance, while simultaneously improving our ability to accurately predict and screen for antibiotic resistance genes within environmental, agricultural, and clinical settings. To do so, efforts must be focused toward exploiting the advancements of genome sequencing and information technology. Currently, AMR bioinformatics software and databases reflect different scopes and functions, each with its own strengths and weaknesses. A review of the available tools reveals common approaches and reference data but also reveals gaps in our curated data, models, algorithms, and data-sharing tools that must be addressed to conquer the limitations and areas of unmet need within the AMR research field before DNA sequencing can be fully exploited for AMR surveillance and improved clinical outcomes.
Collapse
Affiliation(s)
- Andrew G McArthur
- M. G. DeGroote Institute for Infectious Disease Research, Department of Biochemistry and Biomedical Sciences, DeGroote School of Medicine, McMaster University, Hamilton, Canada
| | - Kara K Tsang
- M. G. DeGroote Institute for Infectious Disease Research, Department of Biochemistry and Biomedical Sciences, DeGroote School of Medicine, McMaster University, Hamilton, Canada
| |
Collapse
|
7
|
Wu C, Schwartz JM, Brabant G, Peng SL, Nenadic G. Constructing a molecular interaction network for thyroid cancer via large-scale text mining of gene and pathway events. BMC SYSTEMS BIOLOGY 2015; 9 Suppl 6:S5. [PMID: 26679379 PMCID: PMC4674859 DOI: 10.1186/1752-0509-9-s6-s5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Background Biomedical studies need assistance from automated tools and easily accessible data to address the problem of the rapidly accumulating literature. Text-mining tools and curated databases have been developed to address such needs and they can be applied to improve the understanding of molecular pathogenesis of complex diseases like thyroid cancer. Results We have developed a system, PWTEES, which extracts pathway interactions from the literature utilizing an existing event extraction tool (TEES) and pathway named entity recognition (PathNER). We then applied the system on a thyroid cancer corpus and systematically extracted molecular interactions involving either genes or pathways. With the extracted information, we constructed a molecular interaction network taking genes and pathways as nodes. Using curated pathway information and network topological analyses, we highlight key genes and pathways involved in thyroid carcinogenesis. Conclusions Mining events involving genes and pathways from the literature and integrating curated pathway knowledge can help improve the understanding of molecular interactions of complex diseases. The system developed for this study can be applied in studies other than thyroid cancer. The source code is freely available online at https://github.com/chengkun-wu/PWTEES.
Collapse
|
8
|
McArthur AG, Wright GD. Bioinformatics of antimicrobial resistance in the age of molecular epidemiology. Curr Opin Microbiol 2015; 27:45-50. [DOI: 10.1016/j.mib.2015.07.004] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2015] [Accepted: 07/10/2015] [Indexed: 12/30/2022]
|
9
|
Jamieson DG, Moss A, Kennedy M, Jones S, Nenadic G, Robertson DL, Sidders B. The pain interactome: connecting pain-specific protein interactions. Pain 2014; 155:2243-52. [PMID: 24978826 PMCID: PMC4247380 DOI: 10.1016/j.pain.2014.06.020] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2014] [Revised: 06/13/2014] [Accepted: 06/23/2014] [Indexed: 11/29/2022]
Abstract
Understanding the molecular mechanisms associated with disease is a central goal of modern medical research. As such, many thousands of experiments have been published that detail individual molecular events that contribute to a disease. Here we use a semi-automated text mining approach to accurately and exhaustively curate the primary literature for chronic pain states. In so doing, we create a comprehensive network of 1,002 contextualized protein-protein interactions (PPIs) specifically associated with pain. The PPIs form a highly interconnected and coherent structure, and the resulting network provides an alternative to those derived from connecting genes associated with pain using interactions that have not been shown to occur in a painful state. We exploit the contextual data associated with our interactions to analyse subnetworks specific to inflammatory and neuropathic pain, and to various anatomical regions. Here, we identify potential targets for further study and several drug-repurposing opportunities. Finally, the network provides a framework for the interpretation of new data within the field of pain.
Collapse
Affiliation(s)
- Daniel G Jamieson
- Computational and Evolutionary Biology, Faculty of Life Sciences, University of Manchester, Manchester, UK; Computer Science, Faculty of Engineering and Physical Sciences, University of Manchester, Manchester, UK
| | - Andrew Moss
- Neusentis, Pfizer, Worldwide Research & Development, Cambridge, UK
| | - Michael Kennedy
- Neusentis, Pfizer, Worldwide Research & Development, Cambridge, UK
| | - Sherrie Jones
- Cancer Research UK Manchester Institute, University of Manchester, Manchester, UK
| | - Goran Nenadic
- Computer Science, Faculty of Engineering and Physical Sciences, University of Manchester, Manchester, UK; Manchester Institute of Biotechnology, University of Manchester, Manchester, UK
| | - David L Robertson
- Computational and Evolutionary Biology, Faculty of Life Sciences, University of Manchester, Manchester, UK
| | - Ben Sidders
- Neusentis, Pfizer, Worldwide Research & Development, Cambridge, UK.
| |
Collapse
|
10
|
Ono T, Kuhara S. A novel method for gathering and prioritizing disease candidate genes based on construction of a set of disease-related MeSH® terms. BMC Bioinformatics 2014; 15:179. [PMID: 24917541 PMCID: PMC4068192 DOI: 10.1186/1471-2105-15-179] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2013] [Accepted: 06/02/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Understanding the molecular mechanisms involved in disease is critical for the development of more effective and individualized strategies for prevention and treatment. The amount of disease-related literature, including new genetic information on the molecular mechanisms of disease, is rapidly increasing. Extracting beneficial information from literature can be facilitated by computational methods such as the knowledge-discovery approach. Several methods for mining gene-disease relationships using computational methods have been developed, however, there has been a lack of research evaluating specific disease candidate genes. RESULTS We present a novel method for gathering and prioritizing specific disease candidate genes. Our approach involved the construction of a set of Medical Subject Headings (MeSH) terms for the effective retrieval of publications related to a disease candidate gene. Information regarding the relationships between genes and publications was obtained from the gene2pubmed database. The set of genes was prioritized using a "weighted literature score" based on the number of publications and weighted by the number of genes occurring in a publication. Using our method for the disease states of pain and Alzheimer's disease, a total of 1101 pain candidate genes and 2810 Alzheimer's disease candidate genes were gathered and prioritized. The precision was 0.30 and the recall was 0.89 in the case study of pain. The precision was 0.04 and the recall was 0.6 in the case study of Alzheimer's disease. The precision-recall curve indicated that the performance of our method was superior to that of other publicly available tools. CONCLUSIONS Our method, which involved the use of a set of MeSH terms related to disease candidate genes and a novel weighted literature score, improved the accuracy of gathering and prioritizing candidate genes by focusing on a specific disease.
Collapse
Affiliation(s)
| | - Satoru Kuhara
- Department of Genetic Resources Technology, Faculty of Agriculture, Kyushu University, 6-10-1 Hakozaki Higashi-ku, Fukuoka 812-8581, Japan.
| |
Collapse
|