1
|
Privitera GF, Musso N, Micale G, Bonomo C, Alaimo S, Bivona D, Bonacci PG, Scalia G, Stefani S, Pulvirenti A. CovidTGI: A tool to investigate the temporal genetic instability of SARS-CoV-2 variants. iScience 2025; 28:112315. [PMID: 40264800 PMCID: PMC12013479 DOI: 10.1016/j.isci.2025.112315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2024] [Revised: 12/21/2024] [Accepted: 03/25/2025] [Indexed: 04/24/2025] Open
Abstract
The COVID-19 pandemic has underscored the need for fast and accurate epidemiology, particularly due to the high observed mutation frequency in SARS-CoV-2. This study aims to explore the evolution of SARS-CoV-2 through a global analysis. To facilitate a comparative analysis of temporal mutation data, we developed CovidTGI, a Shiny web application. CovidTGI provides insights into observed mutation frequencies and the temporal relationships among mutations across various clades in different geographical regions. Our tool relies on a database that includes 2 million samples obtained from the National Center for Biotechnology Information (NCBI), along with 500 in-house Sicilian samples collected between May 2021 and June 2022. From this smaller group of samples, we identified key variants that are prevalent within a specific clade. Our tool is designed to study the evolution of SARS-CoV-2, which clearly follows a complex trajectory. This complexity highlights the necessity for sophisticated tools like CovidTGI to understand and track the evolution of this virus.
Collapse
Affiliation(s)
- Grete Francesca Privitera
- Department of Clinical and Experimental Medicine, Bioinformatic Unit, University of Catania, Via Santa Sofia, 95125 Catania, Italy
| | - Nicolò Musso
- Department of Biomedical and Biotechnological Sciences (BIOMETEC), Medical Molecular Microbiology and Antibiotic Resistance Laboratory (MMAR Lab), University of Catania, Via Santa Sofia, 95125 Catania, Italy
| | - Giovanni Micale
- Department of Clinical and Experimental Medicine, Bioinformatic Unit, University of Catania, Via Santa Sofia, 95125 Catania, Italy
| | - Carmelo Bonomo
- Department of Biomedical and Biotechnological Sciences (BIOMETEC), Medical Molecular Microbiology and Antibiotic Resistance Laboratory (MMAR Lab), University of Catania, Via Santa Sofia, 95125 Catania, Italy
| | - Salvatore Alaimo
- Department of Clinical and Experimental Medicine, Bioinformatic Unit, University of Catania, Via Santa Sofia, 95125 Catania, Italy
| | - Dalida Bivona
- Department of Biomedical and Biotechnological Sciences (BIOMETEC), Medical Molecular Microbiology and Antibiotic Resistance Laboratory (MMAR Lab), University of Catania, Via Santa Sofia, 95125 Catania, Italy
| | - Paolo Giuseppe Bonacci
- Department of Biomedical and Biotechnological Sciences (BIOMETEC), Medical Molecular Microbiology and Antibiotic Resistance Laboratory (MMAR Lab), University of Catania, Via Santa Sofia, 95125 Catania, Italy
| | - Guido Scalia
- U.O.C. Laboratory Analysis Unit, A.O.U. ‘Policlinico-Vittorio Emanuele’, University of Catania, Via Santa Sofia, 95125 Catania, Italy
| | - Stefania Stefani
- Department of Biomedical and Biotechnological Sciences (BIOMETEC), Medical Molecular Microbiology and Antibiotic Resistance Laboratory (MMAR Lab), University of Catania, Via Santa Sofia, 95125 Catania, Italy
| | - Alfredo Pulvirenti
- Department of Clinical and Experimental Medicine, Bioinformatic Unit, University of Catania, Via Santa Sofia, 95125 Catania, Italy
| |
Collapse
|
2
|
Theodosiou T, Vrettos K, Baltsavia I, Baltoumas F, Papanikolaou N, Antonakis AΝ, Mossialos D, Ouzounis CA, Promponas VJ, Karaglani M, Chatzaki E, Brandau S, Pavlopoulos GA, Andreakos E, Iliopoulos I. BioTextQuest v2.0: An evolved tool for biomedical literature mining and concept discovery. Comput Struct Biotechnol J 2024; 23:3247-3253. [PMID: 39279874 PMCID: PMC11399685 DOI: 10.1016/j.csbj.2024.08.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Revised: 08/05/2024] [Accepted: 08/15/2024] [Indexed: 09/18/2024] Open
Abstract
The process of navigating through the landscape of biomedical literature and performing searches or combining them with bioinformatics analyses can be daunting, considering the exponential growth of scientific corpora and the plethora of tools designed to mine PubMed(®) and related repositories. Herein, we present BioTextQuest v2.0, a tool for biomedical literature mining. BioTextQuest v2.0 is an open-source online web portal for document clustering based on sets of selected biomedical terms, offering efficient management of information derived from PubMed abstracts. Employing established machine learning algorithms, the tool facilitates document clustering while allowing users to customize the analysis by selecting terms of interest. BioTextQuest v2.0 streamlines the process of uncovering valuable insights from biomedical research articles, serving as an agent that connects the identification of key terms like genes/proteins, diseases, chemicals, Gene Ontology (GO) terms, functions, and others through named entity recognition, and their application in biological research. Instead of manually sifting through articles, researchers can enter their PubMed-like query and receive extracted information in two user-friendly formats, tables and word clouds, simplifying the comprehension of key findings. The latest update of BioTextQuest leverages the EXTRACT named entity recognition tagger, enhancing its ability to pinpoint various biological entities within text. BioTextQuest v2.0 acts as a research assistant, significantly reducing the time and effort required for researchers to identify and present relevant information from the biomedical literature.
Collapse
Affiliation(s)
- Theodosios Theodosiou
- Division of Basic Sciences, University of Crete Medical School, Heraklion 71110, Greece
| | - Konstantinos Vrettos
- Division of Basic Sciences, University of Crete Medical School, Heraklion 71110, Greece
| | - Ismini Baltsavia
- Division of Basic Sciences, University of Crete Medical School, Heraklion 71110, Greece
| | - Fotis Baltoumas
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Athens 16672, Greece
| | - Nikolas Papanikolaou
- Division of Basic Sciences, University of Crete Medical School, Heraklion 71110, Greece
| | - Andreas Ν Antonakis
- Division of Basic Sciences, University of Crete Medical School, Heraklion 71110, Greece
| | - Dimitrios Mossialos
- Department of Biochemistry and Biotechnology, University of Thessaly, 41500 Larissa, Greece
| | - Christos A Ouzounis
- Biological Computation & Computational Biology Group, AIIA Lab, School of Informatics, Aristotle University of Thessalonica, 57001 Thessalonica, Greece
| | - Vasilis J Promponas
- Bioinformatics Research Laboratory, Department of Biological Sciences, University of Cyprus, Nicosia 1678, Cyprus
| | - Makrina Karaglani
- Medical School, Democritus University of Thrace, 68100 Alexandroupolis, Greece
| | - Ekaterini Chatzaki
- Medical School, Democritus University of Thrace, 68100 Alexandroupolis, Greece
| | - Sven Brandau
- Experimental and Translational Research, Department of Otorhinolaryngology, University Hospital Essen, Essen, Germany
| | - Georgios A Pavlopoulos
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Athens 16672, Greece
| | - Evangelos Andreakos
- Center for Immunology and Transplantation, Biomedical Research Foundation Academy of Athens, Athens, Greece
| | - Ioannis Iliopoulos
- Division of Basic Sciences, University of Crete Medical School, Heraklion 71110, Greece
| |
Collapse
|
3
|
Di Maria A, Bellomo L, Billeci F, Cardillo A, Alaimo S, Ferragina P, Ferro A, Pulvirenti A. NetMe 2.0: a web-based platform for extracting and modeling knowledge from biomedical literature as a labeled graph. Bioinformatics 2024; 40:btae194. [PMID: 38597890 PMCID: PMC11074003 DOI: 10.1093/bioinformatics/btae194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 03/29/2024] [Accepted: 04/08/2024] [Indexed: 04/11/2024] Open
Abstract
MOTIVATION The rapid increase of bio-medical literature makes it harder and harder for scientists to keep pace with the discoveries on which they build their studies. Therefore, computational tools have become more widespread, among which network analysis plays a crucial role in several life-science contexts. Nevertheless, building correct and complete networks about some user-defined biomedical topics on top of the available literature is still challenging. RESULTS We introduce NetMe 2.0, a web-based platform that automatically extracts relevant biomedical entities and their relations from a set of input texts-i.e. in the form of full-text or abstract of PubMed Central's papers, free texts, or PDFs uploaded by users-and models them as a BioMedical Knowledge Graph (BKG). NetMe 2.0 also implements an innovative Retrieval Augmented Generation module (Graph-RAG) that works on top of the relationships modeled by the BKG and allows the distilling of well-formed sentences that explain their content. The experimental results show that NetMe 2.0 can infer comprehensive and reliable biological networks with significant Precision-Recall metrics when compared to state-of-the-art approaches. AVAILABILITY AND IMPLEMENTATION https://netme.click/.
Collapse
Affiliation(s)
- Antonio Di Maria
- Department of Clinical and Experimental Medicine, University of Catania, Catania, 95125, Italy
| | | | - Fabrizio Billeci
- Department of Computer Science, University of Catania, Catania, 95125, Italy
| | - Alfio Cardillo
- Department of Computer Science, University of Catania, Catania, 95125, Italy
| | - Salvatore Alaimo
- Department of Clinical and Experimental Medicine, University of Catania, Catania, 95125, Italy
| | - Paolo Ferragina
- Department of Computer Science, University of Pisa, Pisa, 56126 , Italy
| | - Alfredo Ferro
- Department of Clinical and Experimental Medicine, University of Catania, Catania, 95125, Italy
| | - Alfredo Pulvirenti
- Department of Clinical and Experimental Medicine, University of Catania, Catania, 95125, Italy
| |
Collapse
|
4
|
Najafi A, Mugurtay N, Zouzou Y, Demirci E, Demirkiran S, Karadeniz HA, Varol O. First public dataset to study 2023 Turkish general election. Sci Rep 2024; 14:8794. [PMID: 38627434 PMCID: PMC11021468 DOI: 10.1038/s41598-024-58006-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2023] [Accepted: 03/25/2024] [Indexed: 04/19/2024] Open
Abstract
In the context of Turkiye's most recent parliamentary and presidential elections ("seçim" in Turkish), social media has played an important role in shaping public debate. It is of utmost importance to capture social media trends during the 2023 Turkish elections, since it uncovers a great deal of information of election propaganda, political debates, smear campaigns, and election manipulation by domestic and international actors. We provide a comprehensive dataset for social media researchers to study Turkish elections, develop tools to prevent online manipulation, and gather novel information to inform the public. We are committed to continually improving the data collection and updating it regularly leading up to the election. Using the #Secim2023 dataset, researchers can examine the social and communication networks between political actors, track current trends, and investigate emerging threats to election integrity. Our dataset and analysis code available through Harvard Dataverse and Github, respectively.
Collapse
Affiliation(s)
- Ali Najafi
- Faculty of Engineering and Natural Sciences, Sabanci University, 34456, Istanbul, Turkey
| | - Nihat Mugurtay
- Faculty of Engineering and Natural Sciences, Sabanci University, 34456, Istanbul, Turkey
| | - Yasser Zouzou
- Faculty of Engineering and Natural Sciences, Sabanci University, 34456, Istanbul, Turkey
| | - Ege Demirci
- Faculty of Engineering and Natural Sciences, Sabanci University, 34456, Istanbul, Turkey
| | - Serhat Demirkiran
- Faculty of Engineering and Natural Sciences, Sabanci University, 34456, Istanbul, Turkey
| | | | - Onur Varol
- Faculty of Engineering and Natural Sciences, Sabanci University, 34456, Istanbul, Turkey.
- Center of Excellence in Data Analytics, Sabanci University, 34456, Istanbul, Turkey.
| |
Collapse
|
5
|
La Ferlita A, Alaimo S, Nigita G, Distefano R, Beane JD, Tsichlis PN, Ferro A, Croce CM, Pulvirenti A. tRFUniverse: A comprehensive resource for the interactive analyses of tRNA-derived ncRNAs in human cancer. iScience 2024; 27:108810. [PMID: 38303722 PMCID: PMC10831894 DOI: 10.1016/j.isci.2024.108810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 08/02/2023] [Accepted: 01/02/2024] [Indexed: 02/03/2024] Open
Abstract
tRNA-derived ncRNAs are a heterogeneous class of non-coding RNAs recently proposed to be active regulators of gene expression and be involved in many diseases, including cancer. Consequently, several online resources on tRNA-derived ncRNAs have been released. Although interesting, such resources present only basic features and do not adequately exploit the wealth of knowledge available about tRNA-derived ncRNAs. Therefore, we introduce tRFUniverse, a novel online resource for the analysis of tRNA-derived ncRNAs in human cancer. tRFUniverse presents an extensive collection of classes of tRNA-derived ncRNAs analyzed across all the TCGA and TARGET tumor cohorts, NCI-60 cell lines, and biological fluids. Moreover, public AGO CLASH/CLIP-Seq data were analyzed to identify the molecular interactions between tRNA-derived ncRNAs and other transcripts. Importantly, tRFUniverse combines in a single resource a comprehensive set of features that we believe may be helpful to investigate the involvement of tRNA-derived ncRNAs in cancer biology.
Collapse
Affiliation(s)
- Alessandro La Ferlita
- Department of Cancer Biology and Genetics, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Salvatore Alaimo
- Department of Clinical and Experimental Medicine, Knowmics Lab, University of Catania, Catania, Italy
| | - Giovanni Nigita
- Department of Cancer Biology and Genetics, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Rosario Distefano
- Department of Cancer Biology and Genetics, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Joal D. Beane
- Department of Surgery, Division of Surgical Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Philip N. Tsichlis
- Department of Cancer Biology and Genetics, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Alfredo Ferro
- Department of Clinical and Experimental Medicine, Knowmics Lab, University of Catania, Catania, Italy
| | - Carlo M. Croce
- Department of Cancer Biology and Genetics, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Alfredo Pulvirenti
- Department of Clinical and Experimental Medicine, Knowmics Lab, University of Catania, Catania, Italy
| |
Collapse
|
6
|
Seraphin G, Rieger S, Hewison M, Capobianco E, Lisse TS. The impact of vitamin D on cancer: A mini review. J Steroid Biochem Mol Biol 2023; 231:106308. [PMID: 37054849 PMCID: PMC10330295 DOI: 10.1016/j.jsbmb.2023.106308] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Revised: 04/04/2023] [Accepted: 04/05/2023] [Indexed: 04/15/2023]
Abstract
In this review, we summarize the most recent advances in vitamin D cancer research to provide molecular clarity, as well as its translational trajectory across the cancer landscape. Vitamin D is well known for its role in regulating mineral homeostasis; however, vitamin D deficiency has also been linked to the development and progression of a number of cancer types. Recent epigenomic, transcriptomic, and proteomic studies have revealed novel vitamin D-mediated biological mechanisms that regulate cancer cell self-renewal, differentiation, proliferation, transformation, and death. Tumor microenvironmental studies have also revealed dynamic relationships between the immune system and vitamin D's anti-neoplastic properties. These findings help to explain the large number of population-based studies that show clinicopathological correlations between circulating vitamin D levels and risk of cancer development and death. The majority of evidence suggests that low circulating vitamin D levels are associated with an increased risk of cancers, whereas supplementation alone or in combination with other chemo/immunotherapeutic drugs may improve clinical outcomes even further. These promising results still necessitate further research and development into novel approaches that target vitamin D signaling and metabolic systems to improve cancer outcomes.
Collapse
Affiliation(s)
- Gerbenn Seraphin
- University of Miami, Department of Biology, Coral Gables, FL, USA
| | - Sandra Rieger
- University of Miami, Department of Biology, Coral Gables, FL, USA; Sylvester Comprehensive Cancer Center, Miller School of Medicine, University of Miami, Miami, FL, USA
| | - Martin Hewison
- University of Birmingham, Institute of Metabolism and Systems Research, Birmingham, UK
| | | | - Thomas S Lisse
- University of Miami, Department of Biology, Coral Gables, FL, USA; Sylvester Comprehensive Cancer Center, Miller School of Medicine, University of Miami, Miami, FL, USA; iCURA LLC, Malvern, PA, USA.
| |
Collapse
|
7
|
Turker M, Bingol HO. Multi-layer network approach in modeling epidemics in an urban town. THE EUROPEAN PHYSICAL JOURNAL. B 2023; 96:16. [PMID: 36776155 PMCID: PMC9901843 DOI: 10.1140/epjb/s10051-023-00484-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 01/23/2023] [Indexed: 06/18/2023]
Abstract
ABSTRACT The last three years have been an extraordinary time with the COVID-19 pandemic killing millions, affecting and distressing billions of people worldwide. Authorities took various measures such as turning school and work to remote and prohibiting social relations via curfews. In order to mitigate the negative impact of the epidemics, researchers tried to estimate the future of the pandemic for different scenarios, using forecasting techniques and epidemics simulations on networks. Intending to better represent the real-life in an urban town in high resolution, we propose a novel multi-layer network model, where each layer corresponds to a different interaction that occurs daily, such as "household", "work" or "school". Our simulations indicate that locking down "friendship" layer has the highest impact on slowing down epidemics. Hence, our contributions are twofold, first we propose a parametric network generator model; second, we run SIR simulations on it and show the impact of layers.
Collapse
Affiliation(s)
- Meliksah Turker
- Department of Computer Engineering, Bogazici University, Istanbul, 34342 Turkey
| | - Haluk O. Bingol
- Department of Computer Engineering, Bogazici University, Istanbul, 34342 Turkey
| |
Collapse
|
8
|
Di Maria A, Alaimo S, Bellomo L, Billeci F, Ferragina P, Ferro A, Pulvirenti A. BioTAGME: A Comprehensive Platform for Biological Knowledge Network Analysis. Front Genet 2022; 13:855739. [PMID: 35571058 PMCID: PMC9096447 DOI: 10.3389/fgene.2022.855739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2022] [Accepted: 03/24/2022] [Indexed: 02/02/2023] Open
Abstract
The inference of novel knowledge and new hypotheses from the current literature analysis is crucial in making new scientific discoveries. In bio-medicine, given the enormous amount of literature and knowledge bases available, the automatic gain of knowledge concerning relationships among biological elements, in the form of semantically related terms (or entities), is rising novel research challenges and corresponding applications. In this regard, we propose BioTAGME, a system that combines an entity-annotation framework based on Wikipedia corpus (i.e., TAGME tool) with a network-based inference methodology (i.e., DT-Hybrid). This integration aims to create an extensive Knowledge Graph modeling relations among biological terms and phrases extracted from titles and abstracts of papers available in PubMed. The framework consists of a back-end and a front-end. The back-end is entirely implemented in Scala and runs on top of a Spark cluster that distributes the computing effort among several machines. The front-end is released through the Laravel framework, connected with the Neo4j graph database to store the knowledge graph.
Collapse
Affiliation(s)
- Antonio Di Maria
- Department of Clinical and Experimental Medicine, University of Catania, Catania, Italy
| | - Salvatore Alaimo
- Department of Clinical and Experimental Medicine, University of Catania, Catania, Italy
| | | | - Fabrizio Billeci
- Department of Maths and Computer Science, University of Catania, Catania, Italy
| | - Paolo Ferragina
- Department of Computer Science, University of Pisa, Pisa, Italy
| | - Alfredo Ferro
- Department of Clinical and Experimental Medicine, University of Catania, Catania, Italy
| | - Alfredo Pulvirenti
- Department of Clinical and Experimental Medicine, University of Catania, Catania, Italy
- *Correspondence: Alfredo Pulvirenti,
| |
Collapse
|
9
|
Darling: A Web Application for Detecting Disease-Related Biomedical Entity Associations with Literature Mining. Biomolecules 2022; 12:biom12040520. [PMID: 35454109 PMCID: PMC9028073 DOI: 10.3390/biom12040520] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Revised: 03/24/2022] [Accepted: 03/28/2022] [Indexed: 12/15/2022] Open
Abstract
Finding, exploring and filtering frequent sentence-based associations between a disease and a biomedical entity, co-mentioned in disease-related PubMed literature, is a challenge, as the volume of publications increases. Darling is a web application, which utilizes Name Entity Recognition to identify human-related biomedical terms in PubMed articles, mentioned in OMIM, DisGeNET and Human Phenotype Ontology (HPO) disease records, and generates an interactive biomedical entity association network. Nodes in this network represent genes, proteins, chemicals, functions, tissues, diseases, environments and phenotypes. Users can search by identifiers, terms/entities or free text and explore the relevant abstracts in an annotated format.
Collapse
|