1
|
Selote R, Makhijani R. A knowledge graph approach to drug repurposing for Alzheimer's, Parkinson's and Glioma using drug-disease-gene associations. Comput Biol Chem 2025; 115:108302. [PMID: 39693851 DOI: 10.1016/j.compbiolchem.2024.108302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2024] [Revised: 11/06/2024] [Accepted: 11/26/2024] [Indexed: 12/20/2024]
Abstract
Drug Repurposing gives us facility to find the new uses of previously developed drugs rather than developing new drugs from start. Particularly during pandemic, drug repurposing caught much attention to provide new applications of the previously approved drugs. In our research, we provide a novel method for drug repurposing based on feature learning process from drug-disease-gene network. In our research, we aimed at finding drug candidates which can be repurposed under neurodegenerative diseases and glioma. We collected association data between drugs, diseases and genes from public resources and primarily examined the data related to Alzheimer's, Parkinson's and Glioma diseases. We created a Knowledge Graph using neo4j by integrating all these datasets and applied scalable feature learning algorithm known as node2vec to create node embeddings. These embeddings were later used to predict the unknown associations between disease and their candidate drugs by finding cosine similarity between disease and drug nodes embedding. We obtained a definitive set of candidate drugs for repurposing. These results were validated from the literature and CodReS online tool to rank the candidate drugs. Additionally, we verified the status of candidate drugs from pharmaceutical knowledge databases to confirm their significance.
Collapse
Affiliation(s)
- Ruchira Selote
- Department of Computer Science and Engineering, Indian Institute of Information Technology, Nagpur, India.
| | - Richa Makhijani
- Department of Computer Science and Engineering, Indian Institute of Information Technology, Nagpur, India.
| |
Collapse
|
2
|
Freidel S, Schwarz E. Knowledge graphs in psychiatric research: Potential applications and future perspectives. Acta Psychiatr Scand 2025; 151:180-191. [PMID: 38886846 PMCID: PMC11787922 DOI: 10.1111/acps.13717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 05/15/2024] [Accepted: 06/05/2024] [Indexed: 06/20/2024]
Abstract
BACKGROUND Knowledge graphs (KGs) remain an underutilized tool in the field of psychiatric research. In the broader biomedical field KGs are already a significant tool mainly used as knowledge database or for novel relation detection between biomedical entities. This review aims to outline how KGs would further research in the field of psychiatry in the age of Artificial Intelligence (AI) and Large Language Models (LLMs). METHODS We conducted a thorough literature review across a spectrum of scientific fields ranging from computer science and knowledge engineering to bioinformatics. The literature reviewed was taken from PubMed, Semantic Scholar and Google Scholar searches including terms such as "Psychiatric Knowledge Graphs", "Biomedical Knowledge Graphs", "Knowledge Graph Machine Learning Applications", "Knowledge Graph Applications for Biomedical Sciences". The resulting publications were then assessed and accumulated in this review regarding their possible relevance to future psychiatric applications. RESULTS A multitude of papers and applications of KGs in associated research fields that are yet to be utilized in psychiatric research was found and outlined in this review. We create a thorough recommendation for other computational researchers regarding use-cases of these KG applications in psychiatry. CONCLUSION This review illustrates use-cases of KG-based research applications in biomedicine and beyond that may aid in elucidating the complex biology of psychiatric illness and open new routes for developing innovative interventions. We conclude that there is a wealth of opportunities for KG utilization in psychiatric research across a variety of application areas including biomarker discovery, patient stratification and personalized medicine approaches.
Collapse
Affiliation(s)
- Sebastian Freidel
- Hector Institute for Artificial Intelligence in Psychiatry, Central Institute of Mental Health, Medical Faculty MannheimHeidelberg UniversityMannheimGermany
- Department of Psychiatry and Psychotherapy, Central Institute of Mental Health, Medical Faculty MannheimHeidelberg UniversityMannheimGermany
| | - Emanuel Schwarz
- Hector Institute for Artificial Intelligence in Psychiatry, Central Institute of Mental Health, Medical Faculty MannheimHeidelberg UniversityMannheimGermany
- Department of Psychiatry and Psychotherapy, Central Institute of Mental Health, Medical Faculty MannheimHeidelberg UniversityMannheimGermany
| |
Collapse
|
3
|
Nourani E, Makri EM, Mao X, Pyysalo S, Brunak S, Nastou K, Jensen LJ. LSD600: the first corpus of biomedical abstracts annotated with lifestyle-disease relations. Database (Oxford) 2025; 2025:baae129. [PMID: 39824652 PMCID: PMC11756709 DOI: 10.1093/database/baae129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2024] [Revised: 11/15/2024] [Accepted: 12/09/2024] [Indexed: 01/20/2025]
Abstract
Lifestyle factors (LSFs) are increasingly recognized as instrumental in both the development and control of diseases. Despite their importance, there is a lack of methods to extract relations between LSFs and diseases from the literature, a step necessary to consolidate the currently available knowledge into a structured form. As simple co-occurrence-based relation extraction (RE) approaches are unable to distinguish between the different types of LSF-disease relations, context-aware models such as transformers are required to extract and classify these relations into specific relation types. However, no comprehensive LSF-disease RE system existed, nor a corpus suitable for developing one. We present LSD600 (available at https://zenodo.org/records/13952449), the first corpus specifically designed for LSF-disease RE, comprising 600 abstracts with 1900 relations of eight distinct types between 5027 diseases and 6930 LSF entities. We evaluated LSD600's quality by training a RoBERTa model on the corpus, achieving an F-score of 68.5% for the multilabel RE task on the held-out test set. We further validated LSD600 by using the trained model on the two Nutrition-Disease and FoodDisease datasets, where it achieved F-scores of 70.7% and 80.7%, respectively. Building on these performance results, LSD600 and the RE system trained on it can be valuable resources to fill the existing gap in this area and pave the way for downstream applications. Database URL: https://zenodo.org/records/13952449.
Collapse
Affiliation(s)
- Esmaeil Nourani
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Blegdamsvej 3, Copenhagen 2200, Denmark
- Faculty of Information Technology and Computer Engineering, Azarbaijan Shahid Madani University, Tabriz, Iran
| | - Evangelia-Mantelena Makri
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Blegdamsvej 3, Copenhagen 2200, Denmark
- Department of Nutrition and Dietetics, Harokopio University, Athens 17676, Attiki, Greece
| | - Xiqing Mao
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Blegdamsvej 3, Copenhagen 2200, Denmark
| | - Sampo Pyysalo
- TurkuNLP group, Department of Computing, Faculty of Technology, University of Turku, Turku 20014, Finland
| | - Søren Brunak
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Blegdamsvej 3, Copenhagen 2200, Denmark
| | - Katerina Nastou
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Blegdamsvej 3, Copenhagen 2200, Denmark
| | - Lars Juhl Jensen
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Blegdamsvej 3, Copenhagen 2200, Denmark
| |
Collapse
|
4
|
Youn J, Li F, Simmons G, Kim S, Tagkopoulos I. FoodAtlas: Automated knowledge extraction of food and chemicals from literature. Comput Biol Med 2024; 181:109072. [PMID: 39216404 DOI: 10.1016/j.compbiomed.2024.109072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Revised: 07/16/2024] [Accepted: 08/22/2024] [Indexed: 09/04/2024]
Abstract
Automated generation of knowledge graphs that accurately capture published information can help with knowledge organization and access, which have the potential to accelerate discovery and innovation. Here, we present an integrated pipeline to construct a large-scale knowledge graph using large language models in an active learning setting. We apply our pipeline to the association of raw food, ingredients, and chemicals, a domain that lacks such knowledge resources. By using an iterative active learning approach of 4120 manually curated premise-hypothesis pairs as training data for ten consecutive cycles, the entailment model extracted 230,848 food-chemical composition relationships from 155,260 scientific papers, with 106,082 (46.0 %) of them never been reported in any published database. To augment the knowledge incorporated in the knowledge graph, we further incorporated information from 5 external databases and ontology sources. We then applied a link prediction model to identify putative food-chemical relationships that were not part of the constructed knowledge graph. Validation of the 443 hypotheses generated by the link prediction model resulted in 355 new food-chemical relationships, while results show that the model score correlates well (R2 = 0.70) with the probability of a novel finding. This work demonstrates how automated learning from literature at scale can accelerate discovery and support practical applications through reproducible, evidence-based capture of latent interactions of diverse entities, such as food and chemicals.
Collapse
Affiliation(s)
- Jason Youn
- Department of Computer Science, University of California, Davis, Davis, CA, 95616, USA; Genome Center, University of California, Davis, Davis, CA, 95616, USA; USDA/NSF AI Institute for Next Generation Food Systems, Davis, CA, 95616, USA
| | - Fangzhou Li
- Department of Computer Science, University of California, Davis, Davis, CA, 95616, USA; Genome Center, University of California, Davis, Davis, CA, 95616, USA; USDA/NSF AI Institute for Next Generation Food Systems, Davis, CA, 95616, USA
| | - Gabriel Simmons
- Department of Computer Science, University of California, Davis, Davis, CA, 95616, USA; Genome Center, University of California, Davis, Davis, CA, 95616, USA; USDA/NSF AI Institute for Next Generation Food Systems, Davis, CA, 95616, USA
| | - Shanghyeon Kim
- Genome Center, University of California, Davis, Davis, CA, 95616, USA; USDA/NSF AI Institute for Next Generation Food Systems, Davis, CA, 95616, USA
| | - Ilias Tagkopoulos
- Department of Computer Science, University of California, Davis, Davis, CA, 95616, USA; Genome Center, University of California, Davis, Davis, CA, 95616, USA; USDA/NSF AI Institute for Next Generation Food Systems, Davis, CA, 95616, USA.
| |
Collapse
|
5
|
Li A, Han C, Xing X, Wei Q, Chi Y, Pu F. KGSCS-a smart care system for elderly with geriatric chronic diseases: a knowledge graph approach. BMC Med Inform Decis Mak 2024; 24:73. [PMID: 38475769 DOI: 10.1186/s12911-024-02472-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2023] [Accepted: 02/29/2024] [Indexed: 03/14/2024] Open
Abstract
BACKGROUND The increasing aging population has led to a shortage of geriatric chronic disease caregiver, resulting in inadequate care for elderly people. In this global context, many older people rely on nonprofessional family care. The credibility of existing health websites cannot meet the needs of care. Specialized health knowledge bases such as SNOMED-CT and UMLS are also difficult for nonprofessionals to use. Furthermore, professional caregiver in elderly care institutions also face difficulty caring for multiple elderly people at the same time and working handovers. As a solution, we propose a smart care system for the elderly based on a knowledge graph. METHOD First, we worked with professional caregivers to design a structured questionnaire to collect more than 100 pieces of care-related information for the elderly. Then, in the proposed system, personal information, smart device data, medical knowledge, and nursing knowledge are collected and organized into a dynamic knowledge graph. The system offers report generation, question answering, risk identification and data updating services. To evaluate the effectiveness of the system, we use the expert evaluation method to score the user experience. RESULTS The results of the study showed that compared to existing tools (health websites, archives and expert team consultation), the system achieved a score of 8 or more for basic information, health support and Dietary information. Some secondary evaluation indicators reached 9 and 10 points. This finding suggested that the system is superior to existing tools. We also present a case study to help the reader understand the role of the system. CONCLUSION The smart care system provide personalized care guidelines for nonprofessional caregivers. It also makes the job easier for institutional caregivers. In addition, the system provides great convenience for work handover.
Collapse
Affiliation(s)
- Aihua Li
- School of Management Science and Engineering, Central University of Finance and Economics, Beijing, 100080, China.
| | - Che Han
- School of Management Science and Engineering, Central University of Finance and Economics, Beijing, 100080, China
| | - Xinzhu Xing
- Beijing Academy of Science and Technology, Research Institute for Smart Aging, Beijing, 100050, China
| | - Qinyan Wei
- School of Management Science and Engineering, Central University of Finance and Economics, Beijing, 100080, China
| | - Yuxue Chi
- School of Management Science and Engineering, Central University of Finance and Economics, Beijing, 100080, China
| | - Fan Pu
- Beijing Academy of Science and Technology, Research Institute for Smart Aging, Beijing, 100050, China
| |
Collapse
|
6
|
Kilicoglu H, Ensan F, McInnes B, Wang LL. Semantics-enabled biomedical literature analytics. J Biomed Inform 2024; 150:104588. [PMID: 38244957 PMCID: PMC11771130 DOI: 10.1016/j.jbi.2024.104588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 01/10/2024] [Indexed: 01/22/2024]
Affiliation(s)
- Halil Kilicoglu
- School of Information Sciences, University of Illinois Urbana Champaign, Champaign, IL, USA.
| | - Faezeh Ensan
- Department of Electrical, Computer, and Biomedical Engineering, Toronto Metropolitan University, Toronto, ON, Canada.
| | - Bridget McInnes
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.
| | - Lucy Lu Wang
- Information School, University of Washington, Seattle, WA, USA.
| |
Collapse
|