1
|
Ciaparrone G, Pirone D, Fiore P, Xin L, Xiao W, Li X, Bardozzo F, Bianco V, Miccio L, Pan F, Memmolo P, Tagliaferri R, Ferraro P. Label-free cell classification in holographic flow cytometry through an unbiased learning strategy. Lab Chip 2024; 24:924-932. [PMID: 38264771 DOI: 10.1039/d3lc00385j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2024]
Abstract
Nowadays, label-free imaging flow cytometry at the single-cell level is considered the stepforward lab-on-a-chip technology to address challenges in clinical diagnostics, biology, life sciences and healthcare. In this framework, digital holography in microscopy promises to be a powerful imaging modality thanks to its multi-refocusing and label-free quantitative phase imaging capabilities, along with the encoding of the highest information content within the imaged samples. Moreover, the recent achievements of new data analysis tools for cell classification based on deep/machine learning, combined with holographic imaging, are urging these systems toward the effective implementation of point of care devices. However, the generalization capabilities of learning-based models may be limited from biases caused by data obtained from other holographic imaging settings and/or different processing approaches. In this paper, we propose a combination of a Mask R-CNN to detect the cells, a convolutional auto-encoder, used to the image feature extraction and operating on unlabelled data, thus overcoming the bias due to data coming from different experimental settings, and a feedforward neural network for single cell classification, that operates on the above extracted features. We demonstrate the proposed approach in the challenging classification task related to the identification of drug-resistant endometrial cancer cells.
Collapse
Affiliation(s)
- Gioele Ciaparrone
- Neurone Lab, Department of Management and Innovation Systems (DISA-MIS), University of Salerno, Fisciano, Italy.
| | - Daniele Pirone
- CNR - Institute of Applied Sciences and Intelligent Systems "Eduardo Caianiello", Pozzuoli, Italy.
| | - Pierpaolo Fiore
- Neurone Lab, Department of Management and Innovation Systems (DISA-MIS), University of Salerno, Fisciano, Italy.
| | - Lu Xin
- Key Laboratory of Precision Opto-Mechatronics Technology of Ministry of Education, School of Instrumentation Science & Optoelectronics Engineering, Beihang University, 100191 Beijing, China.
| | - Wen Xiao
- Key Laboratory of Precision Opto-Mechatronics Technology of Ministry of Education, School of Instrumentation Science & Optoelectronics Engineering, Beihang University, 100191 Beijing, China.
| | - Xiaoping Li
- Department of Obstetrics and Gynecology, Peking University People's Hospital, Beijing 100044, China
| | - Francesco Bardozzo
- Neurone Lab, Department of Management and Innovation Systems (DISA-MIS), University of Salerno, Fisciano, Italy.
- CNR - Institute of Applied Sciences and Intelligent Systems "Eduardo Caianiello", Pozzuoli, Italy.
| | - Vittorio Bianco
- CNR - Institute of Applied Sciences and Intelligent Systems "Eduardo Caianiello", Pozzuoli, Italy.
| | - Lisa Miccio
- CNR - Institute of Applied Sciences and Intelligent Systems "Eduardo Caianiello", Pozzuoli, Italy.
| | - Feng Pan
- Key Laboratory of Precision Opto-Mechatronics Technology of Ministry of Education, School of Instrumentation Science & Optoelectronics Engineering, Beihang University, 100191 Beijing, China.
| | - Pasquale Memmolo
- CNR - Institute of Applied Sciences and Intelligent Systems "Eduardo Caianiello", Pozzuoli, Italy.
| | - Roberto Tagliaferri
- Neurone Lab, Department of Management and Innovation Systems (DISA-MIS), University of Salerno, Fisciano, Italy.
- CNR - Institute of Applied Sciences and Intelligent Systems "Eduardo Caianiello", Pozzuoli, Italy.
| | - Pietro Ferraro
- CNR - Institute of Applied Sciences and Intelligent Systems "Eduardo Caianiello", Pozzuoli, Italy.
| |
Collapse
|
2
|
Gil Zuluaga FH, D’Arminio N, Bardozzo F, Tagliaferri R, Marabotti A. An automated pipeline integrating AlphaFold 2 and MODELLER for protein structure prediction. Comput Struct Biotechnol J 2023; 21:5620-5629. [PMID: 38047234 PMCID: PMC10690423 DOI: 10.1016/j.csbj.2023.10.056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Revised: 10/31/2023] [Accepted: 10/31/2023] [Indexed: 12/05/2023] Open
Abstract
The ability to predict a protein's three-dimensional conformation represents a crucial starting point for investigating evolutionary connections with other members of the corresponding protein family, examining interactions with other proteins, and potentially utilizing this knowledge for the purpose of rational drug design. In this work, we evaluated the feasibility of improving AlphaFold2's three-dimensional protein predictions by developing a novel pipeline (AlphaMod) that incorporates AlphaFold2 with MODELLER, a template-based modeling program. Additionally, our tool can drive a comprehensive quality assessment of the tertiary protein structure by incorporating and comparing a set of different quality assessment tools. The outcomes of selected tools are combined into a composite score (BORDASCORE) that exhibits a meaningful correlation with GDT_TS and facilitates the selection of optimal models in the absence of a reference structure. To validate AlphaMod's results, we conducted evaluations using two distinct datasets summing up to 72 targets, previously used to independently assess AlphaFold2's performance. The generated models underwent evaluation through two methods: i) averaging the GDT_TS scores across all produced structures for a single target sequence, and ii) a pairwise comparison of the best structures generated by AlphaFold2 and AlphaMod. The latter, within the unsupervised setups, shows a rising accuracy of approximately 34% over AlphaFold2. While, when considering the supervised setup, AlphaMod surpasses AlphaFold2 in 18% of the instances. Finally, there is an 11% correspondence in outcomes between the diverse methodologies. Consequently, AlphaMod's best-predicted tertiary structures in several cases exhibited a significant improvement in the accuracy of the predictions with respect to the best models obtained by AlphaFold2. This pipeline paves the way for the integration of additional data and AI-based algorithms to further improve the reliability of the predictions.
Collapse
Affiliation(s)
- Fabio Hernan Gil Zuluaga
- Department of Management & Innovation Systems, University of Salerno, Via Giovanni Paolo II, 132, 84084 Fisciano, SA, Italy
| | - Nancy D’Arminio
- Department of Chemistry and Biology “A. Zambelli”, University of Salerno, Via Giovanni Paolo II, 132, 84084 Fisciano, SA, Italy
| | - Francesco Bardozzo
- Department of Management & Innovation Systems, University of Salerno, Via Giovanni Paolo II, 132, 84084 Fisciano, SA, Italy
| | - Roberto Tagliaferri
- Department of Management & Innovation Systems, University of Salerno, Via Giovanni Paolo II, 132, 84084 Fisciano, SA, Italy
| | - Anna Marabotti
- Department of Chemistry and Biology “A. Zambelli”, University of Salerno, Via Giovanni Paolo II, 132, 84084 Fisciano, SA, Italy
| |
Collapse
|
3
|
Agliata A, Giordano D, Bardozzo F, Bottiglieri S, Facchiano A, Tagliaferri R. Machine Learning as a Support for the Diagnosis of Type 2 Diabetes. Int J Mol Sci 2023; 24:ijms24076775. [PMID: 37047748 PMCID: PMC10095542 DOI: 10.3390/ijms24076775] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 03/31/2023] [Accepted: 04/03/2023] [Indexed: 04/07/2023] Open
Abstract
Diabetes is a chronic, metabolic disease characterized by high blood sugar levels. Among the main types of diabetes, type 2 is the most common. Early diagnosis and treatment can prevent or delay the onset of complications. Previous studies examined the application of machine learning techniques for prediction of the pathology, and here an artificial neural network shows very promising results as a possible valuable aid in the management and prevention of diabetes. Additionally, its superior ability for long-term predictions makes it an ideal choice for this field of study. We utilized machine learning methods to uncover previously undiscovered associations between an individual’s health status and the development of type 2 diabetes, with the goal of accurately predicting its onset or determining the individual’s risk level. Our study employed a binary classifier, trained on scratch, to identify potential nonlinear relationships between the onset of type 2 diabetes and a set of parameters obtained from patient measurements. Three datasets were utilized, i.e., the National Center for Health Statistics’ (NHANES) biennial survey, MIMIC-III and MIMIC-IV. These datasets were then combined to create a single dataset with the same number of individuals with and without type 2 diabetes. Since the dataset was balanced, the primary evaluation metric for the model was accuracy. The outcomes of this study were encouraging, with the model achieving accuracy levels of up to 86% and a ROC AUC value of 0.934. Further investigation is needed to improve the reliability of the model by considering multiple measurements from the same patient over time.
Collapse
Affiliation(s)
- Antonio Agliata
- Dipartimento di Scienze Aziendali, Management and Innovation Systems, Università degli Studi di Salerno, 84084 Fisciano, Italy
- BC Soft, Centro Direzionale, Via Taddeo da Sessa Isola F10, 80143 Napoli, Italy
| | - Deborah Giordano
- National Research Council, Institute of Food Science, Via Roma 64, 83100 Avellino, Italy
| | - Francesco Bardozzo
- Dipartimento di Scienze Aziendali, Management and Innovation Systems, Università degli Studi di Salerno, 84084 Fisciano, Italy
| | | | - Angelo Facchiano
- National Research Council, Institute of Food Science, Via Roma 64, 83100 Avellino, Italy
| | - Roberto Tagliaferri
- Dipartimento di Scienze Aziendali, Management and Innovation Systems, Università degli Studi di Salerno, 84084 Fisciano, Italy
| |
Collapse
|
4
|
Petrozziello A, Troiano L, Serra A, Jordanov I, Storti G, Tagliaferri R, La Rocca M. Deep learning for volatility forecasting in asset management. Soft comput 2022. [DOI: 10.1007/s00500-022-07161-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
AbstractPredicting volatility is a critical activity for taking risk- adjusted decisions in asset trading and allocation. In order to provide effective decision-making support, in this paper we investigate the profitability of a deep Long Short-Term Memory (LSTM) Neural Network for forecasting daily stock market volatility using a panel of 28 assets representative of the Dow Jones Industrial Average index combined with the market factor proxied by the SPY and, separately, a panel of 92 assets belonging to the NASDAQ 100 index. The Dow Jones plus SPY data are from January 2002 to August 2008, while the NASDAQ 100 is from December 2012 to November 2017. If, on the one hand, we expect that this evolutionary behavior can be effectively captured adaptively through the use of Artificial Intelligence (AI) flexible methods, on the other, in this setting, standard parametric approaches could fail to provide optimal predictions. We compared the volatility forecasts generated by the LSTM approach to those obtained through use of widely recognized benchmarks models in this field, in particular, univariate parametric models such as the Realized Generalized Autoregressive Conditionally Heteroskedastic (R-GARCH) and the Glosten–Jagannathan–Runkle Multiplicative Error Models (GJR-MEM). The results demonstrate the superiority of the LSTM over the widely popular R-GARCH and GJR-MEM univariate parametric methods, when forecasting in condition of high volatility, while still producing comparable predictions for more tranquil periods.
Collapse
|
5
|
Biancaniello C, D’Argenio A, Giordano D, Dotolo S, Scafuri B, Marabotti A, d’Acierno A, Tagliaferri R, Facchiano A. Investigating the Effects of Amino Acid Variations in Human Menin. Molecules 2022; 27:molecules27051747. [PMID: 35268848 PMCID: PMC8911756 DOI: 10.3390/molecules27051747] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 03/01/2022] [Accepted: 03/04/2022] [Indexed: 12/14/2022]
Abstract
Human menin is a nuclear protein that participates in many cellular processes, as transcriptional regulation, DNA damage repair, cell signaling, cell division, proliferation, and migration, by interacting with many other proteins. Mutations of the gene encoding menin cause multiple endocrine neoplasia type 1 (MEN1), a rare autosomal dominant disorder associated with tumors of the endocrine glands. In order to characterize the structural and functional effects at protein level of the hundreds of missense variations, we investigated by computational methods the wild-type menin and more than 200 variants, predicting the amino acid variations that change secondary structure, solvent accessibility, salt-bridge and H-bond interactions, protein thermostability, and altering the capability to bind known protein interactors. The structural analyses are freely accessible online by means of a web interface that integrates also a 3D visualization of the structure of the wild-type and variant proteins. The results of the study offer insight into the effects of the amino acid variations in view of a more complete understanding of their pathological role.
Collapse
Affiliation(s)
- Carmen Biancaniello
- Dipartimento di Scienze Aziendali, Management and Innovation Systems, Università degli Studi di Salerno, 84084 Fisciano, Italy; (C.B.); (S.D.)
| | - Antonia D’Argenio
- National Research Council, Institute of Food Science, 83100 Avellino, Italy; (A.D.); (D.G.); (A.d.)
| | - Deborah Giordano
- National Research Council, Institute of Food Science, 83100 Avellino, Italy; (A.D.); (D.G.); (A.d.)
| | - Serena Dotolo
- Dipartimento di Scienze Aziendali, Management and Innovation Systems, Università degli Studi di Salerno, 84084 Fisciano, Italy; (C.B.); (S.D.)
| | - Bernardina Scafuri
- Dipartimento di Chimica e Biologia “A. Zambelli”, Università degli Studi di Salerno, 84084 Fisciano, Italy; (B.S.); (A.M.)
| | - Anna Marabotti
- Dipartimento di Chimica e Biologia “A. Zambelli”, Università degli Studi di Salerno, 84084 Fisciano, Italy; (B.S.); (A.M.)
| | - Antonio d’Acierno
- National Research Council, Institute of Food Science, 83100 Avellino, Italy; (A.D.); (D.G.); (A.d.)
| | - Roberto Tagliaferri
- Dipartimento di Scienze Aziendali, Management and Innovation Systems, Università degli Studi di Salerno, 84084 Fisciano, Italy; (C.B.); (S.D.)
- Correspondence: (R.T.); (A.F.)
| | - Angelo Facchiano
- National Research Council, Institute of Food Science, 83100 Avellino, Italy; (A.D.); (D.G.); (A.d.)
- Correspondence: (R.T.); (A.F.)
| |
Collapse
|
6
|
Ciaparrone G, Chiariglione L, Tagliaferri R. A comparison of deep learning models for end-to-end face-based video retrieval in unconstrained videos. Neural Comput Appl 2022. [DOI: 10.1007/s00521-021-06875-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
AbstractFace-based video retrieval (FBVR) is the task of retrieving videos that containing the same face shown in the query image. In this article, we present the first end-to-end FBVR pipeline that is able to operate on large datasets of unconstrained, multi-shot, multi-person videos. We adapt an existing audiovisual recognition dataset to the task of FBVR and use it to evaluate our proposed pipeline. We compare a number of deep learning models for shot detection, face detection, and face feature extraction as part of our pipeline on a validation dataset made of more than 4000 videos. We obtain 97.25% mean average precision on an independent test set, composed of more than 1000 videos. The pipeline is able to extract features from videos at $$\sim $$
∼
7 times the real-time speed, and it is able to perform a query on thousands of videos in less than 0.5 s.
Collapse
|
7
|
Bardozzo F, Collins T, Forgione A, Hostettler A, Tagliaferri R. StaSiS-Net: a stacked and siamese disparity estimation network for depth reconstruction in modern 3D laparoscopy. Med Image Anal 2022; 77:102380. [DOI: 10.1016/j.media.2022.102380] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2021] [Revised: 01/26/2022] [Accepted: 01/27/2022] [Indexed: 10/19/2022]
|
8
|
Gil Zuluaga FH, Bardozzo F, Rios Patino JI, Tagliaferri R. Blind microscopy image denoising with a deep residual and multiscale encoder/decoder network. Annu Int Conf IEEE Eng Med Biol Soc 2021; 2021:3483-3486. [PMID: 34891990 DOI: 10.1109/embc46164.2021.9630502] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
In computer-aided diagnosis (CAD) focused on microscopy, denoising improves the quality of image analysis. In general, the accuracy of this process may depend both on the experience of the microscopist and on the equipment sensitivity and specificity. A medical image could be corrupted by several perturbations during image acquisition. Nowadays, CAD deep learning applications pre-process images with image denoising models to reinforce learning and prediction. In this work, an innovative and lightweight deep multiscale convolutional encoder-decoder neural network is proposed. Specifically, the encoder uses deterministic mapping to map features into a hidden representation. Then, the latent representation is rebuilt to generate the reconstructed denoised image. Residual learning strategies are used to improve and accelerate the training process using skip connections in bridging across convolutional and deconvolutional layers. The proposed model reaches on average 38.38 of PSNR and 0.98 of SSIM on a test set of 57458 images overcoming state-of-the-art models in the same application domain.Clinical relevance - Encoder-decoder based denoiser enables industry experts to provide more accurate and reliable medical interpretation and diagnosis in a variety of fields, from microscopy to surgery, with the benefit of real-time processing.
Collapse
|
9
|
Bardozzo F, Lió P, Tagliaferri R. Signal metrics analysis of oscillatory patterns in bacterial multi-omic networks. Bioinformatics 2021; 37:1411-1419. [PMID: 33185666 DOI: 10.1093/bioinformatics/btaa966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Revised: 09/25/2020] [Accepted: 11/03/2020] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION One of the branches of Systems Biology is focused on a deep understanding of underlying regulatory networks through the analysis of the biomolecules oscillations and their interplay. Synthetic Biology exploits gene or/and protein regulatory networks towards the design of oscillatory networks for producing useful compounds. Therefore, at different levels of application and for different purposes, the study of biomolecular oscillations can lead to different clues about the mechanisms underlying living cells. It is known that network-level interactions involve more than one type of biomolecule as well as biological processes operating at multiple omic levels. Combining network/pathway-level information with genetic information it is possible to describe well-understood or unknown bacterial mechanisms and organism-specific dynamics. RESULTS Following the methodologies used in signal processing and communication engineering, a methodology is introduced to identify and quantify the extent of multi-omic oscillations. These are due to the process of multi-omic integration and depend on the gene positions on the chromosome. Ad hoc signal metrics are designed to allow further biotechnological explanations and provide important clues about the oscillatory nature of the pathways and their regulatory circuits. Our algorithms designed for the analysis of multi-omic signals are tested and validated on 11 different bacteria for thousands of multi-omic signals perturbed at the network level by different experimental conditions. Information on the order of genes, codon usage, gene expression and protein molecular weight is integrated at three different functional levels. Oscillations show interesting evidence that network-level multi-omic signals present a synchronized response to perturbations and evolutionary relations along taxa. AVAILABILITY AND IMPLEMENTATION The algorithms, the code (in language R), the tool, the pipeline and the whole dataset of multi-omic signal metrics are available at: https://github.com/lodeguns/Multi-omicSignals. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Pietro Lió
- Computer Laboratory, University of Cambridge, Cambridge CB3 0FD, UK
| | | |
Collapse
|
10
|
Dotolo S, Marabotti A, Rachiglio AM, Esposito Abate R, Benedetto M, Ciardiello F, De Luca A, Normanno N, Facchiano A, Tagliaferri R. A multiple network-based bioinformatics pipeline for the study of molecular mechanisms in oncological diseases for personalized medicine. Brief Bioinform 2021; 22:6287337. [PMID: 34050359 PMCID: PMC8574709 DOI: 10.1093/bib/bbab180] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2020] [Revised: 03/17/2021] [Accepted: 04/20/2021] [Indexed: 01/03/2023] Open
Abstract
Motivation Assessment of genetic mutations is an essential element in the modern era of personalized cancer treatment. Our strategy is focused on ‘multiple network analysis’ in which we try to improve cancer diagnostics by using biological networks. Genetic alterations in some important hubs or in driver genes such as BRAF and TP53 play a critical role in regulating many important molecular processes. Most of the studies are focused on the analysis of the effects of single mutations, while tumors often carry mutations of multiple driver genes. The aim of this work is to define an innovative bioinformatics pipeline focused on the design and analysis of networks (such as biomedical and molecular networks), in order to: (1) improve the disease diagnosis; (2) identify the patients that could better respond to a given drug treatment; and (3) predict what are the primary and secondary effects of gene mutations involved in human diseases. Results By using our pipeline based on a multiple network approach, it has been possible to demonstrate and validate what are the joint effects and changes of the molecular profile that occur in patients with metastatic colorectal carcinoma (mCRC) carrying mutations in multiple genes. In this way, we can identify the most suitable drugs for the therapy for the individual patient. This information is useful to improve precision medicine in cancer patients. As an application of our pipeline, the clinically significant case studies of a cohort of mCRC patients with the BRAF V600E-TP53 I195N missense combined mutation were considered. Availability The procedures used in this paper are part of the Cytoscape Core, available at (www.cytoscape.org). Data used here on mCRC patients have been published in [55]. Supplementary Information A supplementary file containing a more detailed discussion of this case study and other cases is available at the journal site as Supplementary Data.
Collapse
Affiliation(s)
- Serena Dotolo
- Dipartimento di Scienze Aziendali, Management & Innovation Systems, Università degli Studi di Salerno, Fisciano (SA), Italy
| | - Anna Marabotti
- Dipartimento di Chimica e Biologia "A. Zambelli", Università degli Studi di Salerno, Fisciano (SA), Italy
| | - Anna Maria Rachiglio
- Cell Biology and Biotherapy Unit, Istituto Nazionale Tumori - IRCCS - Fondazione G. Pascale, Naples, Italy
| | - Riziero Esposito Abate
- Cell Biology and Biotherapy Unit, Istituto Nazionale Tumori -IRCCS - Fondazione G. Pascale, Naples, Italy
| | | | - Fortunato Ciardiello
- Dipartimento di Medicina di Precisione, Università degli Studi della Campania "Luigi Vanvitelli", Naples, Italy
| | - Antonella De Luca
- Cell Biology and Biotherapy Unit, Istituto Nazionale Tumori - IRCCS - Fondazione G. Pascale, Naples, Italy
| | - Nicola Normanno
- Cell Biology and Biotherapy Unit, Istituto Nazionale Tumori - IRCCS - Fondazione G. Pascale, Naples, Italy
| | - Angelo Facchiano
- Institute of Food Sciences, Italian National Research Council (CNR), Avellino, Italy
| | - Roberto Tagliaferri
- Dipartimento di Scienze Aziendali, Management & Innovation Systems, Università degli Studi di Salerno, Fisciano (SA), Italy
| |
Collapse
|
11
|
Dotolo S, Marabotti A, Facchiano A, Tagliaferri R. A review on drug repurposing applicable to COVID-19. Brief Bioinform 2021; 22:726-741. [PMID: 33147623 PMCID: PMC7665348 DOI: 10.1093/bib/bbaa288] [Citation(s) in RCA: 90] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 09/16/2020] [Accepted: 09/30/2020] [Indexed: 12/11/2022] Open
Abstract
Drug repurposing involves the identification of new applications for existing drugs at a lower cost and in a shorter time. There are different computational drug-repurposing strategies and some of these approaches have been applied to the coronavirus disease 2019 (COVID-19) pandemic. Computational drug-repositioning approaches applied to COVID-19 can be broadly categorized into (i) network-based models, (ii) structure-based approaches and (iii) artificial intelligence (AI) approaches. Network-based approaches are divided into two categories: network-based clustering approaches and network-based propagation approaches. Both of them allowed to annotate some important patterns, to identify proteins that are functionally associated with COVID-19 and to discover novel drug–disease or drug–target relationships useful for new therapies. Structure-based approaches allowed to identify small chemical compounds able to bind macromolecular targets to evaluate how a chemical compound can interact with the biological counterpart, trying to find new applications for existing drugs. AI-based networks appear, at the moment, less relevant since they need more data for their application.
Collapse
Affiliation(s)
| | | | | | - Roberto Tagliaferri
- Artificial Intelligence, Statistical Pattern Recognition, Clustering, Biomedical imaging and Bioinformatics
| |
Collapse
|
12
|
|
13
|
Coretto P, Serra A, Tagliaferri R. Robust clustering of noisy high-dimensional gene expression data for patients subtyping. Bioinformatics 2019; 34:4064-4072. [PMID: 29939219 DOI: 10.1093/bioinformatics/bty502] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2018] [Accepted: 06/19/2018] [Indexed: 12/12/2022] Open
Abstract
Motivation One of the most important research areas in personalized medicine is the discovery of disease sub-types with relevance in clinical applications. This is usually accomplished by exploring gene expression data with unsupervised clustering methodologies. Then, with the advent of multiple omics technologies, data integration methodologies have been further developed to obtain better performances in patient separability. However, these methods do not guarantee the survival separability of the patients in different clusters. Results We propose a new methodology that first computes a robust and sparse correlation matrix of the genes, then decomposes it and projects the patient data onto the first m spectral components of the correlation matrix. After that, a robust and adaptive to noise clustering algorithm is applied. The clustering is set up to optimize the separation between survival curves estimated cluster-wise. The method is able to identify clusters that have different omics signatures and also statistically significant differences in survival time. The proposed methodology is tested on five cancer datasets downloaded from The Cancer Genome Atlas repository. The proposed method is compared with the Similarity Network Fusion (SNF) approach, and model based clustering based on Student's t-distribution (TMIX). Our method obtains a better performance in terms of survival separability, even if it uses a single gene expression view compared to the multi-view approach of the SNF method. Finally, a pathway based analysis is accomplished to highlight the biological processes that differentiate the obtained patient groups. Availability and implementation Our R source code is available online at https://github.com/angy89/RobustClusteringPatientSubtyping. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Pietro Coretto
- Department of Economics and Statistics, STATLAB, University of Salerno, Fisciano, SA, Italy
| | - Angela Serra
- Department of Management and Innovation Systems, NeuRoNeLab, University of Salerno, Fisciano, SA, Italy
| | - Roberto Tagliaferri
- Department of Management and Innovation Systems, NeuRoNeLab, University of Salerno, Fisciano, SA, Italy
| |
Collapse
|
14
|
Serra A, Galdi P, Pesce E, Fratello M, Trojsi F, Tedeschi G, Tagliaferri R, Esposito F. Strong-Weak Pruning for Brain Network Identification in Connectome-Wide Neuroimaging: Application to Amyotrophic Lateral Sclerosis Disease Stage Characterization. Int J Neural Syst 2019; 29:1950007. [PMID: 30929575 DOI: 10.1142/s0129065719500072] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Magnetic resonance imaging allows acquiring functional and structural connectivity data from which high-density whole-brain networks can be derived to carry out connectome-wide analyses in normal and clinical populations. Graph theory has been widely applied to investigate the modular structure of brain connections by using centrality measures to identify the "hub" of human connectomes, and community detection methods to delineate subnetworks associated with diverse cognitive and sensorimotor functions. These analyses typically rely on a preprocessing step (pruning) to reduce computational complexity and remove the weakest edges that are most likely affected by experimental noise. However, weak links may contain relevant information about brain connectivity, therefore, the identification of the optimal trade-off between retained and discarded edges is a subject of active research. We introduce a pruning algorithm to identify edges that carry the highest information content. The algorithm selects both strong edges (i.e. edges belonging to shortest paths) and weak edges that are topologically relevant in weakly connected subnetworks. The newly developed "strong-weak" pruning (SWP) algorithm was validated on simulated networks that mimic the structure of human brain networks. It was then applied for the analysis of a real dataset of subjects affected by amyotrophic lateral sclerosis (ALS), both at the early (ALS2) and late (ALS3) stage of the disease, and of healthy control subjects. SWP preprocessing allowed identifying statistically significant differences in the path length of networks between patients and healthy subjects. ALS patients showed a decrease of connectivity between frontal cortex to temporal cortex and parietal cortex and between temporal and occipital cortex. Moreover, degree of centrality measures revealed significantly different hub and centrality scores between patient subgroups. These findings suggest a widespread alteration of network topology in ALS associated with disease progression.
Collapse
Affiliation(s)
- Angela Serra
- *NeuRoNeLab, Department of Management and Innovation Systems, University of Salerno, Fisciano (Sa), 84084, Italy.,†Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Paola Galdi
- *NeuRoNeLab, Department of Management and Innovation Systems, University of Salerno, Fisciano (Sa), 84084, Italy.,‡MRC Centre for Reproductive Health, University of Edinburgh, EH16 4TJ Edinburgh, UK
| | - Emanuele Pesce
- *NeuRoNeLab, Department of Management and Innovation Systems, University of Salerno, Fisciano (Sa), 84084, Italy.,§International Digital Laboratory, WMG, University of Coventry, CV4 7AL, UK
| | - Michele Fratello
- ¶Department of Medical, Surgical, Neurological, Metabolic and Ageing Sciences, Università Degli Studi Della Campania "Luigi Vanvitelli", Napoli, 80138, Italy
| | - Francesca Trojsi
- ¶Department of Medical, Surgical, Neurological, Metabolic and Ageing Sciences, Università Degli Studi Della Campania "Luigi Vanvitelli", Napoli, 80138, Italy
| | - Gioacchino Tedeschi
- ¶Department of Medical, Surgical, Neurological, Metabolic and Ageing Sciences, Università Degli Studi Della Campania "Luigi Vanvitelli", Napoli, 80138, Italy
| | - Roberto Tagliaferri
- *NeuRoNeLab, Department of Management and Innovation Systems, University of Salerno, Fisciano (Sa), 84084, Italy
| | - Fabrizio Esposito
- ∥Department of Medicine, Surgery and Dentistry "Scuola Medica Salernitana", University of Salerno, Baronissi (Sa), 84081, Italy
| |
Collapse
|
15
|
Serra A, Letunic I, Fortino V, Handy RD, Fadeel B, Tagliaferri R, Greco D. INSIdE NANO: a systems biology framework to contextualize the mechanism-of-action of engineered nanomaterials. Sci Rep 2019; 9:179. [PMID: 30655578 PMCID: PMC6336851 DOI: 10.1038/s41598-018-37411-y] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2018] [Accepted: 11/30/2018] [Indexed: 12/14/2022] Open
Abstract
Engineered nanomaterials (ENMs) are widely present in our daily lives. Despite the efforts to characterize their mechanism of action in multiple species, their possible implications in human pathologies are still not fully understood. Here we performed an integrated analysis of the effects of ENMs on human health by contextualizing their transcriptional mechanism-of-action with respect to drugs, chemicals and diseases. We built a network of interactions of over 3,000 biological entities and developed a novel computational tool, INSIdE NANO, to infer new knowledge about ENM behavior. We highlight striking association of metal and metal-oxide nanoparticles and major neurodegenerative disorders. Our novel strategy opens possibilities to achieve fast and accurate read-across evaluation of ENMs and other chemicals based on their biosignatures.
Collapse
Affiliation(s)
- Angela Serra
- NeuRoNe Lab, DISA-MIS, University of Salerno, Salerno, Italy.,Faculty of Medicine and Life Sciences, University of Tampere, Tampere, Finland.,Institute of Biosciences and Medical Technologies, University of Tampere, Tampere, Finland
| | | | - Vittorio Fortino
- Faculty of Medicine and Life Sciences, University of Tampere, Tampere, Finland.,Institute of Biosciences and Medical Technologies, University of Tampere, Tampere, Finland.,Institute of Biotechnology, University of Helsinki, Helsinki, Finland.,Biomedicine Institute, University of Eastern Finland, Kuopio, Finland
| | - Richard D Handy
- School of Biological and Marine Sciences, University of Plymouth, Plymouth, United Kingdom
| | - Bengt Fadeel
- Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
| | | | - Dario Greco
- Faculty of Medicine and Life Sciences, University of Tampere, Tampere, Finland. .,Institute of Biosciences and Medical Technologies, University of Tampere, Tampere, Finland. .,Institute of Biotechnology, University of Helsinki, Helsinki, Finland.
| |
Collapse
|
16
|
Galdi P, Fratello M, Trojsi F, Russo A, Tedeschi G, Tagliaferri R, Esposito F. Stochastic Rank Aggregation for the Identification of Functional Neuromarkers. Neuroinformatics 2019; 17:479-496. [PMID: 30604083 DOI: 10.1007/s12021-018-9412-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The main challenge in analysing functional magnetic resonance imaging (fMRI) data from extended samples of subject (N > 100) is to extract as much relevant information as possible from big amounts of noisy data. When studying neurodegenerative diseases with resting-state fMRI, one of the objectives is to determine regions with abnormal background activity with respect to a healthy brain and this is often attained with comparative statistical models applied to single voxels or brain parcels within one or several functional networks. In this work, we propose a novel approach based on clustering and stochastic rank aggregation to identify parcels that exhibit a coherent behaviour in groups of subjects affected by the same disorder and apply it to default-mode network independent component maps from resting-state fMRI data sets. Brain voxels are partitioned into parcels through k-means clustering, then solutions are enhanced by means of consensus techniques. For each subject, clusters are ranked according to their median value and a stochastic rank aggregation method, TopKLists, is applied to combine the individual rankings within each class of subjects. For comparison, the same approach was tested on an anatomical parcellation. We found parcels for which the rankings were different among control subjects and subjects affected by Parkinson's disease and amyotrophic lateral sclerosis and found evidence in literature for the relevance of top ranked regions in default-mode brain activity. The proposed framework represents a valid method for the identification of functional neuromarkers from resting-state fMRI data, and it might therefore constitute a step forward in the development of fully automated data-driven techniques to support early diagnoses of neurodegenerative diseases.
Collapse
Affiliation(s)
- Paola Galdi
- NeuRoNe Lab, Department of Management and Innovation Systems, University of Salerno, Fisciano, Salerno, Italy
| | - Michele Fratello
- Department of Medical, Surgical, Neurological, Metabolic and Aging Sciences, University of Campania "Luigi Vanvitelli", Naples, Italy
| | - Francesca Trojsi
- Department of Medical, Surgical, Neurological, Metabolic and Aging Sciences, University of Campania "Luigi Vanvitelli", Naples, Italy
| | - Antonio Russo
- Department of Medical, Surgical, Neurological, Metabolic and Aging Sciences, University of Campania "Luigi Vanvitelli", Naples, Italy
| | - Gioacchino Tedeschi
- Department of Medical, Surgical, Neurological, Metabolic and Aging Sciences, University of Campania "Luigi Vanvitelli", Naples, Italy
| | - Roberto Tagliaferri
- NeuRoNe Lab, Department of Management and Innovation Systems, University of Salerno, Fisciano, Salerno, Italy
| | - Fabrizio Esposito
- Department of Medicine, Surgery and Dentistry "Scuola Medica Salernitana", University of Salerno, Via S. Allende, 84081, Baronissi, Salerno, Italy.
| |
Collapse
|
17
|
Serra A, Coretto P, Fratello M, Tagliaferri R, Stegle O. Robust and sparse correlation matrix estimation for the analysis of high-dimensional genomics data. Bioinformatics 2018; 34:625-634. [PMID: 29040390 DOI: 10.1093/bioinformatics/btx642] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2017] [Accepted: 10/10/2017] [Indexed: 11/14/2022] Open
Abstract
Motivation Microarray technology can be used to study the expression of thousands of genes across a number of different experimental conditions, usually hundreds. The underlying principle is that genes sharing similar expression patterns, across different samples, can be part of the same co-expression system, or they may share the same biological functions. Groups of genes are usually identified based on cluster analysis. Clustering methods rely on the similarity matrix between genes. A common choice to measure similarity is to compute the sample correlation matrix. Dimensionality reduction is another popular data analysis task which is also based on covariance/correlation matrix estimates. Unfortunately, covariance/correlation matrix estimation suffers from the intrinsic noise present in high-dimensional data. Sources of noise are: sampling variations, presents of outlying sample units, and the fact that in most cases the number of units is much larger than the number of genes. Results In this paper, we propose a robust correlation matrix estimator that is regularized based on adaptive thresholding. The resulting method jointly tames the effects of the high-dimensionality, and data contamination. Computations are easy to implement and do not require hand tunings. Both simulated and real data are analyzed. A Monte Carlo experiment shows that the proposed method is capable of remarkable performances. Our correlation metric is more robust to outliers compared with the existing alternatives in two gene expression datasets. It is also shown how the regularization allows to automatically detect and filter spurious correlations. The same regularization is also extended to other less robust correlation measures. Finally, we apply the ARACNE algorithm on the SyNTreN gene expression data. Sensitivity and specificity of the reconstructed network is compared with the gold standard. We show that ARACNE performs better when it takes the proposed correlation matrix estimator as input. Availability and implementation The R software is available at https://github.com/angy89/RobustSparseCorrelation. Contact aserra@unisa.it or robtag@unisa.it. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Angela Serra
- NeuRoNeLab, Department of Management and Innovation Systems, University of Salerno, Fisciano (Sa), 84084, Italy
| | - Pietro Coretto
- STATLAB, Department of Economics and Statistics, University of Salerno, Fisciano (Sa), 84084, Italy
| | - Michele Fratello
- Department of Medical, Surgical, Neurological, Metabolic and Ageing Sciences, Second University of Napoli, Piazza Luigi Miraglia, 2 Napoli 80138, Italy
| | - Roberto Tagliaferri
- NeuRoNeLab, Department of Management and Innovation Systems, University of Salerno, Fisciano (Sa), 84084, Italy
| | | |
Collapse
|
18
|
Abstract
BACKGROUND Two important challenges in the analysis of molecular biology information are data (multi-omic information) integration and the detection of patterns across large scale molecular networks and sequences. They are are actually coupled beause the integration of omic information may provide better means to detect multi-omic patterns that could reveal multi-scale or emerging properties at the phenotype levels. RESULTS Here we address the problem of integrating various types of molecular information (a large collection of gene expression and sequence data, codon usage and protein abundances) to analyse the E.coli metabolic response to treatments at the whole network level. Our algorithm, MORA (Multi-omic relations adjacency) is able to detect patterns which may represent metabolic network motifs at pathway and supra pathway levels which could hint at some functional role. We provide a description and insights on the algorithm by testing it on a large database of responses to antibiotics. Along with the algorithm MORA, a novel model for the analysis of oscillating multi-omics has been proposed. Interestingly, the resulting analysis suggests that some motifs reveal recurring oscillating or position variation patterns on multi-omics metabolic networks. Our framework, implemented in R, provides effective and friendly means to design intervention scenarios on real data. By analysing how multi-omics data build up multi-scale phenotypes, the software allows to compare and test metabolic models, design new pathways or redesign existing metabolic pathways and validate in silico metabolic models using nearby species. CONCLUSIONS The integration of multi-omic data reveals that E.coli multi-omic metabolic networks contain position dependent and recurring patterns which could provide clues of long range correlations in the bacterial genome.
Collapse
Affiliation(s)
- Francesco Bardozzo
- NeuRoNe Lab, DISA-MIS, University of Salerno, Via Giovanni Paolo II 132, Salerno, 84084 Fisciano, Italy
| | - Pietro Lió
- Computer Laboratory, Department of Computer Science, University of Cambridge, 15 JJ Thomson Ave, Cambridge, CB3 0FD, UK
| | - Roberto Tagliaferri
- NeuRoNe Lab, DISA-MIS, University of Salerno, Via Giovanni Paolo II 132, Salerno, 84084 Fisciano, Italy.
| |
Collapse
|
19
|
Montecuollo F, Schmid G, Tagliaferri R. E2FM: an encrypted and compressed full-text index for collections of genomic sequences. Bioinformatics 2017; 33:2808-2817. [PMID: 28498928 DOI: 10.1093/bioinformatics/btx313] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2016] [Accepted: 05/10/2017] [Indexed: 01/01/2023] Open
Abstract
Motivation Next Generation Sequencing (NGS) platforms and, more generally, high-throughput technologies are giving rise to an exponential growth in the size of nucleotide sequence databases. Moreover, many emerging applications of nucleotide datasets-as those related to personalized medicine-require the compliance with regulations about the storage and processing of sensitive data. Results We have designed and carefully engineered E 2 FM -index, a new full-text index in minute space which was optimized for compressing and encrypting nucleotide sequence collections in FASTA format and for performing fast pattern-search queries. E 2 FM -index allows to build self-indexes which occupy till to 1/20 of the storage required by the input FASTA file, thus permitting to save about 95% of storage when indexing collections of highly similar sequences; moreover, it can exactly search the built indexes for patterns in times ranging from few milliseconds to a few hundreds milliseconds, depending on pattern length. Availability and implementation Source code is available at https://github.com/montecuollo/E2FM . Contact ferdinando.montecuollo@unicampania.it. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ferdinando Montecuollo
- Centro Reti, Sistemi e Servizi Informatici/CRESSI, Università degli Studi della Campania "Luigi Vanvitelli," Napoli 80133, Italy
| | - Giovannni Schmid
- Istituto di Calcolo e Reti ad Alte Prestazioni/ICAR, Consiglio Nazionale delle Ricerche, Napoli 80131, Italy
| | - Roberto Tagliaferri
- Dipartimento di Scienze Aziendali - Management & Innovation Systems/DISA-MIS, Università di Salerno, Fisciano 84084, Italy
| |
Collapse
|
20
|
Galdi P, Fratello M, Trojsi F, Russo A, Tedeschi G, Tagliaferri R, Esposito F. Consensus-based feature extraction in rs-fMRI data analysis. Soft comput 2017. [DOI: 10.1007/s00500-017-2596-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
21
|
Fratello M, Caiazzo G, Trojsi F, Russo A, Tedeschi G, Tagliaferri R, Esposito F. Multi-View Ensemble Classification of Brain Connectivity Images for Neurodegeneration Type Discrimination. Neuroinformatics 2017; 15:199-213. [PMID: 28210983 PMCID: PMC5443864 DOI: 10.1007/s12021-017-9324-2] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Brain connectivity analyses using voxels as features are not robust enough for single-patient classification because of the inter-subject anatomical and functional variability. To construct more robust features, voxels can be aggregated into clusters that are maximally coherent across subjects. Moreover, combining multi-modal neuroimaging and multi-view data integration techniques allows generating multiple independent connectivity features for the same patient. Structural and functional connectivity features were extracted from multi-modal MRI images with a clustering technique, and used for the multi-view classification of different phenotypes of neurodegeneration by an ensemble learning method (random forest). Two different multi-view models (intermediate and late data integration) were trained on, and tested for the classification of, individual whole-brain default-mode network (DMN) and fractional anisotropy (FA) maps, from 41 amyotrophic lateral sclerosis (ALS) patients, 37 Parkinson's disease (PD) patients and 43 healthy control (HC) subjects. Both multi-view data models exhibited ensemble classification accuracies significantly above chance. In ALS patients, multi-view models exhibited the best performances (intermediate: 82.9%, late: 80.5% correct classification) and were more discriminative than each single-view model. In PD patients and controls, multi-view models' performances were lower (PD: 59.5%, 62.2%; HC: 56.8%, 59.1%) but higher than at least one single-view model. Training the models only on patients, produced more than 85% patients correctly discriminated as ALS or PD type and maximal performances for multi-view models. These results highlight the potentials of mining complementary information from the integration of multiple data views in the classification of connectivity patterns from multi-modal brain images in the study of neurodegenerative diseases.
Collapse
Affiliation(s)
- Michele Fratello
- Department of Medical, Surgical, Neurological, Metabolic and Aging Sciences, Second University of Naples, Naples, Italy
| | - Giuseppina Caiazzo
- Department of Medical, Surgical, Neurological, Metabolic and Aging Sciences, Second University of Naples, Naples, Italy
| | - Francesca Trojsi
- Department of Medical, Surgical, Neurological, Metabolic and Aging Sciences, Second University of Naples, Naples, Italy
| | - Antonio Russo
- Department of Medical, Surgical, Neurological, Metabolic and Aging Sciences, Second University of Naples, Naples, Italy
| | - Gioacchino Tedeschi
- Department of Medical, Surgical, Neurological, Metabolic and Aging Sciences, Second University of Naples, Naples, Italy
| | - Roberto Tagliaferri
- Department of Medicine Surgery and Dentistry Scuola Medica Salernitana, University of Salerno, Baronissi, Salerno, Italy
| | - Fabrizio Esposito
- Department of Medicine, Surgery and Dentistry "Scuola Medica Salernitana", University of Salerno, Via S. Allende, 84081, Baronissi, Salerno, Italy.
| |
Collapse
|
22
|
Fortino V, Tagliaferri R, Greco D. CONDOP: an R package for CONdition-Dependent Operon Predictions. Bioinformatics 2016; 32:3199-3200. [PMID: 27296981 DOI: 10.1093/bioinformatics/btw330] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2015] [Accepted: 03/23/2016] [Indexed: 11/14/2022] Open
Abstract
The use of high-throughput RNA sequencing to predict dynamic operon structures in prokaryotic genomes has recently gained popularity in bioinformatics. We provide the R implementation of a novel method that uses transcriptomic features extracted from RNA-seq transcriptome profiles to develop ensemble classifiers for condition-dependent operon predictions. The CONDOP package provides a deeper insight into RNA-seq data analysis and allows scientists to highlight the operon organization in the context of transcriptional regulation with a few lines of code. AVAILABILITY AND IMPLEMENTATION CONDOP is implemented in R and is freely available at CRAN. CONTACT vittorio.fortino@helsinki.fiSupplementary information: Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Vittorio Fortino
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | | | - Dario Greco
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| |
Collapse
|
23
|
Serra A, Fratello M, Fortino V, Raiconi G, Tagliaferri R, Greco D. MVDA: a multi-view genomic data integration methodology. BMC Bioinformatics 2015; 16:261. [PMID: 26283178 PMCID: PMC4539887 DOI: 10.1186/s12859-015-0680-3] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Accepted: 07/20/2015] [Indexed: 11/18/2022] Open
Abstract
Background Multiple high-throughput molecular profiling by omics technologies can be collected for the same individuals. Combining these data, rather than exploiting them separately, can significantly increase the power of clinically relevant patients subclassifications. Results We propose a multi-view approach in which the information from different data layers (views) is integrated at the levels of the results of each single view clustering iterations. It works by factorizing the membership matrices in a late integration manner. We evaluated the effectiveness and the performance of our method on six multi-view cancer datasets. In all the cases, we found patient sub-classes with statistical significance, identifying novel sub-groups previously not emphasized in literature. Our method performed better as compared to other multi-view clustering algorithms and, unlike other existing methods, it is able to quantify the contribution of single views on the final results. Conclusion Our observations suggest that integration of prior information with genomic features in the subtyping analysis is an effective strategy in identifying disease subgroups. The methodology is implemented in R and the source code is available online at http://neuronelab.unisa.it/a-multi-view-genomic-data-integration-methodology/. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0680-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Angela Serra
- NeuRoNe Lab, Department of Computer Science, University of Salerno, Fisciano, Italy.
| | - Michele Fratello
- Department of Medical, Surgical, Neurological, Metabolic and Ageing Sciences, Second University of Napoli, Napoli, Italy.
| | - Vittorio Fortino
- Unit of Systems Toxicology and Nanosafety Research Centre, Finnish Institute of Occupational Health, FIOH, Helsinki, Finland.
| | - Giancarlo Raiconi
- NeuRoNe Lab, Department of Computer Science, University of Salerno, Fisciano, Italy.
| | - Roberto Tagliaferri
- NeuRoNe Lab, Department of Computer Science, University of Salerno, Fisciano, Italy.
| | - Dario Greco
- Unit of Systems Toxicology and Nanosafety Research Centre, Finnish Institute of Occupational Health, FIOH, Helsinki, Finland.
| |
Collapse
|
24
|
Fratello M, Serra A, Fortino V, Raiconi G, Tagliaferri R, Greco D. A multi-view genomic data simulator. BMC Bioinformatics 2015; 16:151. [PMID: 25962835 PMCID: PMC4448275 DOI: 10.1186/s12859-015-0577-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2014] [Accepted: 04/17/2015] [Indexed: 12/17/2022] Open
Abstract
Background OMICs technologies allow to assay the state of a large number of different features (e.g., mRNA expression, miRNA expression, copy number variation, DNA methylation, etc.) from the same samples. The objective of these experiments is usually to find a reduced set of significant features, which can be used to differentiate the conditions assayed. In terms of development of novel feature selection computational methods, this task is challenging for the lack of fully annotated biological datasets to be used for benchmarking. A possible way to tackle this problem is generating appropriate synthetic datasets, whose composition and behaviour are fully controlled and known a priori. Results Here we propose a novel method centred on the generation of networks of interactions among different biological molecules, especially involved in regulating gene expression. Synthetic datasets are obtained from ordinary differential equations based models with known parameters. Our results show that the generated datasets are well mimicking the behaviour of real data, for popular data analysis methods are able to selectively identify existing interactions. Conclusions The proposed method can be used in conjunction to real biological datasets in the assessment of data mining techniques. The main strength of this method consists in the full control on the simulated data while retaining coherence with the real biological processes. The R package MVBioDataSim is freely available to the scientific community at http://neuronelab.unisa.it/?p=1722. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0577-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Michele Fratello
- Department of Medical, Surgical, Neurological, Metabolic and Ageing Sciences, Second University of Napoli, Napoli, Italy. .,Department of Computer Science, Fisciano, Italy.
| | - Angela Serra
- Department of Computer Science, Fisciano, Italy.
| | - Vittorio Fortino
- Unit of Systems Toxicology and Nanosafety Research Centre, Finnish Institute of Occupational Health, FIOH, Helsinki, Finland.
| | | | | | - Dario Greco
- Unit of Systems Toxicology and Nanosafety Research Centre, Finnish Institute of Occupational Health, FIOH, Helsinki, Finland.
| |
Collapse
|
25
|
Cama A, Verginelli F, Lotti LV, Napolitano F, Morgano A, D’Orazio A, Vacca M, Perconti S, Pepe F, Romani F, Vitullo F, di Lella F, Visone R, Mannelli M, Neumann HPH, Raiconi G, Paties C, Moschetta A, Tagliaferri R, Veronese A, Sanna M, Mariani-Costantini R. Integrative genetic, epigenetic and pathological analysis of paraganglioma reveals complex dysregulation of NOTCH signaling. Acta Neuropathol 2013; 126:575-94. [PMID: 23955600 PMCID: PMC3789891 DOI: 10.1007/s00401-013-1165-y] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2013] [Accepted: 08/02/2013] [Indexed: 02/06/2023]
Abstract
Head and neck paragangliomas, rare neoplasms of the paraganglia composed of nests of neurosecretory and glial cells embedded in vascular stroma, provide a remarkable example of organoid tumor architecture. To identify genes and pathways commonly deregulated in head and neck paraganglioma, we integrated high-density genome-wide copy number variation (CNV) analysis with microRNA and immunomorphological studies. Gene-centric CNV analysis of 24 cases identified a list of 104 genes most significantly targeted by tumor-associated alterations. The "NOTCH signaling pathway" was the most significantly enriched term in the list (P = 0.002 after Bonferroni or Benjamini correction). Expression of the relevant NOTCH pathway proteins in sustentacular (glial), chief (neuroendocrine) and endothelial cells was confirmed by immunohistochemistry in 47 head and neck paraganglioma cases. There were no relationships between level and pattern of NOTCH1/JAG2 protein expression and germline mutation status in the SDH genes, implicated in paraganglioma predisposition, or the presence/absence of immunostaining for SDHB, a surrogate marker of SDH mutations. Interestingly, NOTCH upregulation was observed also in cases with no evidence of CNVs at NOTCH signaling genes, suggesting altered epigenetic modulation of this pathway. To address this issue we performed microarray-based microRNA expression analyses. Notably 5 microRNAs (miR-200a,b,c and miR-34b,c), including those most downregulated in the tumors, correlated to NOTCH signaling and directly targeted NOTCH1 in in vitro experiments using SH-SY5Y neuroblastoma cells. Furthermore, lentiviral transduction of miR-200s and miR-34s in patient-derived primary tympano-jugular paraganglioma cell cultures was associated with NOTCH1 downregulation and increased levels of markers of cell toxicity and cell death. Taken together, our results provide an integrated view of common molecular alterations associated with head and neck paraganglioma and reveal an essential role of NOTCH pathway deregulation in this tumor type.
Collapse
Affiliation(s)
- Alessandro Cama
- Unit of General Pathology, Aging Research Center (Ce.S.I.), G. d’Annunzio University Foundation, Via Colle dell’Ara, 66100 Chieti, Italy
- Department of Pharmacy, G. d’Annunzio University, Via dei Vestini 1, 66100 Chieti, Italy
| | - Fabio Verginelli
- Unit of General Pathology, Aging Research Center (Ce.S.I.), G. d’Annunzio University Foundation, Via Colle dell’Ara, 66100 Chieti, Italy
- Department of Pharmacy, G. d’Annunzio University, Via dei Vestini 1, 66100 Chieti, Italy
| | - Lavinia Vittoria Lotti
- Department of Experimental Medicine, University La Sapienza, Viale Regina Elena 324, 00161 Rome, Italy
| | - Francesco Napolitano
- NeuRoNe Lab, Department of Informatics, University of Salerno, Via Ponte Don Melillo, 84084 Fisciano, Salerno Italy
| | - Annalisa Morgano
- Unit of General Pathology, Aging Research Center (Ce.S.I.), G. d’Annunzio University Foundation, Via Colle dell’Ara, 66100 Chieti, Italy
- Laboratory of Lipid Metabolism and Cancer, Department of Translational Pharmacology, Consorzio Mario Negri Sud, Via Nazionale 8/A, 66030 Santa Maria Imbaro, Chieti Italy
| | - Andria D’Orazio
- Laboratory of Lipid Metabolism and Cancer, Department of Translational Pharmacology, Consorzio Mario Negri Sud, Via Nazionale 8/A, 66030 Santa Maria Imbaro, Chieti Italy
| | - Michele Vacca
- Laboratory of Lipid Metabolism and Cancer, Department of Translational Pharmacology, Consorzio Mario Negri Sud, Via Nazionale 8/A, 66030 Santa Maria Imbaro, Chieti Italy
- IRCCS National Cancer Research Center Giovanni Paolo II, Viale Orazio Flacco 65, 70124 Bari, Italy
| | - Silvia Perconti
- Unit of General Pathology, Aging Research Center (Ce.S.I.), G. d’Annunzio University Foundation, Via Colle dell’Ara, 66100 Chieti, Italy
- Department of Pharmacy, G. d’Annunzio University, Via dei Vestini 1, 66100 Chieti, Italy
| | - Felice Pepe
- Unit of General Pathology, Aging Research Center (Ce.S.I.), G. d’Annunzio University Foundation, Via Colle dell’Ara, 66100 Chieti, Italy
- NeuRoNe Lab, Department of Informatics, University of Salerno, Via Ponte Don Melillo, 84084 Fisciano, Salerno Italy
| | - Federico Romani
- Department of Experimental Medicine, University La Sapienza, Viale Regina Elena 324, 00161 Rome, Italy
| | | | | | - Rosa Visone
- Unit of General Pathology, Aging Research Center (Ce.S.I.), G. d’Annunzio University Foundation, Via Colle dell’Ara, 66100 Chieti, Italy
- Department of Medical, Oral and Biotechnological Sciences, G. d’Annunzio University, Via dei Vestini 1, 66100 Chieti, Italy
| | - Massimo Mannelli
- Department of Experimental and Clinical Biomedical Sciences, University of Florence, Viale Morgagni 50, 50134 Florence, Italy
| | - Hartmut P. H. Neumann
- Section of Preventive Medicine, Department of Nephrology, Albert-Ludwigs-University of Freiburg, Hugstetter Strasse 55, 79106 Freiburg, Germany
| | - Giancarlo Raiconi
- NeuRoNe Lab, Department of Informatics, University of Salerno, Via Ponte Don Melillo, 84084 Fisciano, Salerno Italy
| | - Carlo Paties
- Unit of Anatomic Pathology, Department of Clinical Pathology, Hospital G. da Saliceto, Via Giuseppe Taverna 49, 29100 Piacenza, Italy
| | - Antonio Moschetta
- Laboratory of Lipid Metabolism and Cancer, Department of Translational Pharmacology, Consorzio Mario Negri Sud, Via Nazionale 8/A, 66030 Santa Maria Imbaro, Chieti Italy
- IRCCS National Cancer Research Center Giovanni Paolo II, Viale Orazio Flacco 65, 70124 Bari, Italy
| | - Roberto Tagliaferri
- NeuRoNe Lab, Department of Informatics, University of Salerno, Via Ponte Don Melillo, 84084 Fisciano, Salerno Italy
| | - Angelo Veronese
- Unit of General Pathology, Aging Research Center (Ce.S.I.), G. d’Annunzio University Foundation, Via Colle dell’Ara, 66100 Chieti, Italy
- Department of Medical, Oral and Biotechnological Sciences, G. d’Annunzio University, Via dei Vestini 1, 66100 Chieti, Italy
| | - Mario Sanna
- Gruppo Otologico, Via Emmanueli 42, 29100 Piacenza, Italy
- Department of Medical, Oral and Biotechnological Sciences, G. d’Annunzio University, Via dei Vestini 1, 66100 Chieti, Italy
| | - Renato Mariani-Costantini
- Unit of General Pathology, Aging Research Center (Ce.S.I.), G. d’Annunzio University Foundation, Via Colle dell’Ara, 66100 Chieti, Italy
- Department of Medical, Oral and Biotechnological Sciences, G. d’Annunzio University, Via dei Vestini 1, 66100 Chieti, Italy
| |
Collapse
|
26
|
Napolitano F, Zhao Y, Moreira VM, Tagliaferri R, Kere J, D'Amato M, Greco D. Drug repositioning: a machine-learning approach through data integration. J Cheminform 2013; 5:30. [PMID: 23800010 PMCID: PMC3704944 DOI: 10.1186/1758-2946-5-30] [Citation(s) in RCA: 193] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2013] [Accepted: 06/13/2013] [Indexed: 12/19/2022] Open
Abstract
Existing computational methods for drug repositioning either rely only on the gene expression response of cell lines after treatment, or on drug-to-disease relationships, merging several information levels. However, the noisy nature of the gene expression and the scarcity of genomic data for many diseases are important limitations to such approaches. Here we focused on a drug-centered approach by predicting the therapeutic class of FDA-approved compounds, not considering data concerning the diseases. We propose a novel computational approach to predict drug repositioning based on state-of-the-art machine-learning algorithms. We have integrated multiple layers of information: i) on the distances of the drugs based on how similar are their chemical structures, ii) on how close are their targets within the protein-protein interaction network, and iii) on how correlated are the gene expression patterns after treatment. Our classifier reaches high accuracy levels (78%), allowing us to re-interpret the top misclassifications as re-classifications, after rigorous statistical evaluation. Efficient drug repurposing has the potential to significantly impact the whole field of drug development. The results presented here can significantly accelerate the translation into the clinics of known compounds for novel therapeutic uses.
Collapse
|
27
|
Abstract
Background An incremental, loosely planned development approach is often used in bioinformatic studies when dealing with custom data analysis in a rapidly changing environment. Unfortunately, the lack of a rigorous software structuring can undermine the maintainability, communicability and replicability of the process. To ameliorate this problem we propose the Leaf system, the aim of which is to seamlessly introduce the pipeline formality on top of a dynamical development process with minimum overhead for the programmer, thus providing a simple layer of software structuring. Results Leaf includes a formal language for the definition of pipelines with code that can be transparently inserted into the user’s Python code. Its syntax is designed to visually highlight dependencies in the pipeline structure it defines. While encouraging the developer to think in terms of bioinformatic pipelines, Leaf supports a number of automated features including data and session persistence, consistency checks between steps of the analysis, processing optimization and publication of the analytic protocol in the form of a hypertext. Conclusions Leaf offers a powerful balance between plan-driven and change-driven development environments in the design, management and communication of bioinformatic pipelines. Its unique features make it a valuable alternative to other related tools.
Collapse
Affiliation(s)
- Francesco Napolitano
- Department of Computer Science (DI),University of Salerno, Fisciano (SA) 84084, Italy.
| | | | | |
Collapse
|
28
|
Nosova E, Napolitano F, Amato R, Cocozza S, Miele G, Raiconi G, Tagliaferri R. An improved combinatorial biclustering algorithm. Neural Comput Appl 2013. [DOI: 10.1007/s00521-012-0902-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
29
|
Perconti S, Aceto GM, Verginelli F, Napolitano F, Petrarca C, Bernardini G, Raiconi G, Tagliaferri R, Sabbioni E, Di Gioacchino M, Mariani-Costantini R. Distinctive gene expression profiles in Balb/3T3 cells exposed to low dose cobalt nanoparticles, microparticles and ions: potential nanotoxicological relevance. J BIOL REG HOMEOS AG 2013; 27:443-454. [PMID: 23830394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
Size-dependent characteristics of novel engineered nanomaterials might result in unforeseen biological responses and toxicity. To address this issue, we used cDNA microarray analysis (13443 genes) coupled with bioinformatics and functional gene annotation studies to investigate the transcriptional profiles of Balb/3T3 cells exposed to a low dose (1 μM) of cobalt nanoparticles (CoNP), microparticles (CoMP) and ions (Co2+). CoNP, CoMP and Co2+ affected 124, 91 and 80 genes, respectively. Hierarchical clustering revealed two main gene clusters, one up-regulated, mainly after Co2+, the other down-regulated, mainly after CoNP and CoMP. The significant Gene Ontology (GO) terms included oxygen binding and transport and hemoglobin binding for Co2+, while the GOs of CoMP and CoNP were related to nucleus and intracellular components. Pathway analysis highlighted: i) mitochondrial dysfunction for Co2+, ii) signaling, activation of innate immunity, and apoptosis for CoNP, and iii) cell metabolism, G1/S cell cycle checkpoint regulation and signaling for CoMP. Unlike ions, particles affected toxicologically-relevant pathways implicated in carcinogenesis and inflammation.
Collapse
Affiliation(s)
- S Perconti
- Unit of General Pathology, Aging Research Center, University G. d'Annunzio Foundation, Chieti, Italy
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Martina R, Cioffi I, Galeotti A, Tagliaferri R, Cimino R, Michelotti A, Valletta R, Farella M, Paduano S. Efficacy of the Sander bite-jumping appliance in growing patients with mandibular retrusion: a randomized controlled trial. Orthod Craniofac Res 2013; 16:116-26. [DOI: 10.1111/ocr.12013] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/28/2012] [Indexed: 11/27/2022]
Affiliation(s)
- R. Martina
- Department of Oral Sciences; Section of Orthodontics; University of Naples Federico II; Naples; Italy
| | - I. Cioffi
- Department of Oral Sciences; Section of Orthodontics; University of Naples Federico II; Naples; Italy
| | - A. Galeotti
- Division of Dentistry; Bambino Gesù Hospital; Rome; Italy
| | - R. Tagliaferri
- Department of Oral Sciences; Section of Orthodontics; University of Naples Federico II; Naples; Italy
| | - R. Cimino
- Department of Oral Sciences; Section of Orthodontics; University of Naples Federico II; Naples; Italy
| | - A. Michelotti
- Department of Oral Sciences; Section of Orthodontics; University of Naples Federico II; Naples; Italy
| | - R. Valletta
- Department of Oral Sciences; Section of Orthodontics; University of Naples Federico II; Naples; Italy
| | | | - S. Paduano
- Department of Oral Sciences; Section of Orthodontics; University of Catanzaro Magna Graecia; Catanzaro; Italy
| |
Collapse
|
31
|
Lisboa P, Vellido A, Tagliaferri R, Napolitano F, Ceccarelli M, Martin-Guerrero J, Biganzoli E. Data Mining in Cancer Research [Application Notes. IEEE COMPUT INTELL M 2010. [DOI: 10.1109/mci.2009.935311] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
32
|
Verginelli F, Bishehsari F, Napolitano F, Mahdavinia M, Cama A, Malekzadeh R, Miele G, Raiconi G, Tagliaferri R, Mariani-Costantini R. Transitions at CpG dinucleotides, geographic clustering of TP53 mutations and food availability patterns in colorectal cancer. PLoS One 2009; 4:e6824. [PMID: 19718455 PMCID: PMC2730577 DOI: 10.1371/journal.pone.0006824] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2009] [Accepted: 07/14/2009] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Colorectal cancer is mainly attributed to diet, but the role exerted by foods remains unclear because involved factors are extremely complex. Geography substantially impacts on foods. Correlations between international variation in colorectal cancer-associated mutation patterns and food availabilities could highlight the influence of foods on colorectal mutagenesis. METHODOLOGY To test such hypothesis, we applied techniques based on hierarchical clustering, feature extraction and selection, and statistical pattern recognition to the analysis of 2,572 colorectal cancer-associated TP53 mutations from 12 countries/geographic areas. For food availabilities, we relied on data extracted from the Food Balance Sheets of the Food and Agriculture Organization of the United Nations. Dendrograms for mutation sites, mutation types and food patterns were constructed through Ward's hierarchical clustering algorithm and their stability was assessed evaluating silhouette values. Feature selection used entropy-based measures for similarity between clusterings, combined with principal component analysis by exhaustive and heuristic approaches. CONCLUSION/SIGNIFICANCE Mutations clustered in two major geographic groups, one including only Western countries, the other Asia and parts of Europe. This was determined by variation in the frequency of transitions at CpGs, the most common mutation type. Higher frequencies of transitions at CpGs in the cluster that included only Western countries mainly reflected higher frequencies of mutations at CpG codons 175, 248 and 273, the three major TP53 hotspots. Pearson's correlation scores, computed between the principal components of the datamatrices for mutation types, food availability and mutation sites, demonstrated statistically significant correlations between transitions at CpGs and both mutation sites and availabilities of meat, milk, sweeteners and animal fats, the energy-dense foods at the basis of "Western" diets. This is best explainable by differential exposure to nitrosative DNA damage due to foods that promote metabolic stress and chronic inflammation.
Collapse
Affiliation(s)
- Fabio Verginelli
- Department of Oncology and Neurosciences, “G. d'Annunzio” University, and Center of Excellence on Aging (CeSI), “G. d'Annunzio” University Foundation, Chieti, Italy
| | - Faraz Bishehsari
- Department of Oncology and Neurosciences, “G. d'Annunzio” University, and Center of Excellence on Aging (CeSI), “G. d'Annunzio” University Foundation, Chieti, Italy
- Digestive Disease Research Center (DDRC), Shariati Hospital, University of Tehran, Tehran, Iran
| | - Francesco Napolitano
- Department of Mathematics and Informatics, University of Salerno, Salerno, Italy
| | - Mahboobeh Mahdavinia
- Department of Oncology and Neurosciences, “G. d'Annunzio” University, and Center of Excellence on Aging (CeSI), “G. d'Annunzio” University Foundation, Chieti, Italy
- Digestive Disease Research Center (DDRC), Shariati Hospital, University of Tehran, Tehran, Iran
| | - Alessandro Cama
- Department of Oncology and Neurosciences, “G. d'Annunzio” University, and Center of Excellence on Aging (CeSI), “G. d'Annunzio” University Foundation, Chieti, Italy
| | - Reza Malekzadeh
- Digestive Disease Research Center (DDRC), Shariati Hospital, University of Tehran, Tehran, Iran
| | - Gennaro Miele
- Department of Physical Sciences, University of Naples, Naples, Italy
| | - Giancarlo Raiconi
- Department of Mathematics and Informatics, University of Salerno, Salerno, Italy
| | - Roberto Tagliaferri
- Department of Mathematics and Informatics, University of Salerno, Salerno, Italy
| | - Renato Mariani-Costantini
- Department of Oncology and Neurosciences, “G. d'Annunzio” University, and Center of Excellence on Aging (CeSI), “G. d'Annunzio” University Foundation, Chieti, Italy
| |
Collapse
|
33
|
Affiliation(s)
- Francesco Iorio
- Systems and Synthetic Biology Lab, TeleThon Institute of Genetics and Medicine (TIGEM), Naples, Italy
- Neural and Robotic Networks (NeuRoNe) Lab, Department of Mathematics and Computer Science, University of Salerno, Fisciano, Salerno, Italy
| | - Roberto Tagliaferri
- Neural and Robotic Networks (NeuRoNe) Lab, Department of Mathematics and Computer Science, University of Salerno, Fisciano, Salerno, Italy
| | - Diego di Bernardo
- Systems and Synthetic Biology Lab, TeleThon Institute of Genetics and Medicine (TIGEM), Naples, Italy
- Department of Computer Science and Systems, University “Federico II” of Naples, Naples, Italy
| |
Collapse
|
34
|
|
35
|
Abstract
In this paper, a new learning algorithm for the Simpson's fuzzy min-max neural network is presented. It overcomes some undesired properties of the Simpson's model: specifically, in it there are neither thresholds that bound the dimension of the hyperboxes nor sensitivity parameters. Our new algorithm improves the network performance: in fact, the classification result does not depend on the presentation order of the patterns in the training set, and at each step, the classification error in the training set cannot increase. The new neural model is particularly useful in classification problems as it is shown by comparison with some fuzzy neural nets cited in literature (Simpson's min-max model, fuzzy ARTMAP proposed by Carpenter, Grossberg et al. in 1992, adaptive fuzzy systems as introduced by Wang in his book) and the classical multilayer perceptron neural network with backpropagation learning algorithm. The tests were executed on three different classification problems: the first one with two-dimensional synthetic data, the second one with realistic data generated by a simulator to find anomalies in the cooling system of a blast furnace, and the third one with real data for industrial diagnosis. The experiments were made following some recent evaluation criteria known in literature and by using Microsoft Visual C++ development environment on personal computers.
Collapse
|
36
|
Gesù VD, Tagliaferri R. Preface. Int J Approx Reason 2008. [DOI: 10.1016/j.ijar.2007.04.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
37
|
Staiano A, Tagliaferri R, Pedrycz W. Improving RBF networks performance in regression tasks by means of a supervised fuzzy clustering. Neurocomputing 2006. [DOI: 10.1016/j.neucom.2005.06.014] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
38
|
|
39
|
Amato R, Ciaramella A, Deniskina N, Del Mondo C, di Bernardo D, Donalek C, Longo G, Mangano G, Miele G, Raiconi G, Staiano A, Tagliaferri R. A multi-step approach to time series analysis and gene expression clustering. Bioinformatics 2006; 22:589-96. [PMID: 16397005 DOI: 10.1093/bioinformatics/btk026] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION The huge growth in gene expression data calls for the implementation of automatic tools for data processing and interpretation. RESULTS We present a new and comprehensive machine learning data mining framework consisting in a non-linear PCA neural network for feature extraction, and probabilistic principal surfaces combined with an agglomerative approach based on Negentropy aimed at clustering gene microarray data. The method, which provides a user-friendly visualization interface, can work on noisy data with missing points and represents an automatic procedure to get, with no a priori assumptions, the number of clusters present in the data. Cell-cycle dataset and a detailed analysis confirm the biological nature of the most significant clusters. AVAILABILITY The software described here is a subpackage part of the ASTRONEURAL package and is available upon request from the corresponding author. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- R Amato
- Dipartimento di Scienze Fisiche, University of Naples Federico II, Naples, Italy
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Eleuteri A, Tagliaferri R, Milano L. A novel information geometric approach to variable selection in MLP networks. Neural Netw 2005; 18:1309-18. [PMID: 15990274 DOI: 10.1016/j.neunet.2005.01.008] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2003] [Accepted: 01/04/2005] [Indexed: 11/17/2022]
Abstract
In this paper, a novel information geometric-based variable selection criterion for multi-layer perceptron networks is described. It is based on projections of the Riemannian manifold defined by a multi-layer perceptron network on submanifolds defined by multi-layer perceptron networks with reduced input dimension. We show how the divergence between models can be used as a criterion for an efficient search in the space of networks with different inputs. Then, we show how the posterior probabilities of the models can be evaluated to rank the projected models. Finally, we test our algorithm on synthetic and real data, and compare its performances with other methods reported in literature.
Collapse
Affiliation(s)
- A Eleuteri
- Dipartimento di Scienze Fisiche, Università degli Studi di Napoli Federico II, via Cintia, I-80126 Napoli, Italia.
| | | | | |
Collapse
|
41
|
De Lauro E, De Martino S, Falanga M, Ciaramella A, Tagliaferri R. Complexity of time series associated to dynamical systems inferred from independent component analysis. Phys Rev E Stat Nonlin Soft Matter Phys 2005; 72:046712. [PMID: 16383572 DOI: 10.1103/physreve.72.046712] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2005] [Indexed: 05/05/2023]
Abstract
A not trivial problem for every experimental time series associated to a natural system is to individuate the significant variables to describe the dynamics, i.e., the effective degrees of freedom. The application of independent component analysis (ICA) has provided interesting results in this direction, e.g., in the seismological and atmospheric field. Since all natural phenomena can be represented by dynamical systems, our aim is to check the performance of ICA in this general context to avoid ambiguities when investigating an unknown experimental system. We show many examples, representing linear, nonlinear, and stochastic processes, in which ICA seems to be an efficacious preanalysis able to give information about the complexity of the dynamics.
Collapse
Affiliation(s)
- E De Lauro
- Dipartimento di Fisica, Università degli Studi di Salerno, Via S. Allende, Baronissi (SA), I-84084 Italy.
| | | | | | | | | |
Collapse
|
42
|
Ippolito AM, De Laurentiis M, La Rosa GL, Eleuteri A, Tagliaferri R, De Placido S, Vigneri R, Belfiore A. Neural network analysis for evaluating cancer risk in thyroid nodules with an indeterminate diagnosis at aspiration cytology: identification of a low-risk subgroup. Thyroid 2004; 14:1065-71. [PMID: 15650360 DOI: 10.1089/thy.2004.14.1065] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Thyroid nodules with a predominant follicular structure are often diagnosed as indeterminate at fine-needle aspiration biopsy (FNAB). We studied 453 patients with a thyroid nodule diagnosed as indeterminate at FNAB by using a feed-forward artificial neural network (ANN) analysis to integrate cytologic and clinical data, with the goal of subgrouping patients into a high-risk and in a low-risk category. Three hundred seventy-one patients were used to train the network and 82 patients were used to validate the model. The cytologic smears were blindly reviewed and classified in a high-risk and a low-risk subgroup on the basis of standard criteria. Neural network analysis subdivided the 371 lesions of the first series into a high-risk group (cancer rate of approximately 33% at histology) and a low-risk group (cancer rate of 3%). Only cytologic parameters contributed to this classification. Analysis of the receiver operating characteristic (ROC) curves demonstrated that the ANN model discriminated with higher sensitivity and specificity between benign and malignant nodules compared to standard cytologic criteria (p < 0.001). This value did not show degradation when ANN predictions were applied to the validation series of 82 nodules. In conclusion, neural network analysis of cytologic data may be a useful tool to refine the risk of cancer in patients with lesions diagnosed as indeterminate by FNAB.
Collapse
Affiliation(s)
- Antonio M Ippolito
- Dipartimento di Medicina Interna e di Medicina Specialistica, Cattedra di Endocrinologia, University of Catania, Ospedale Garibaldi, Italy
| | | | | | | | | | | | | | | |
Collapse
|
43
|
Abstract
A feedforward neural network architecture aimed at survival probability estimation is presented which generalizes the standard, usually linear, models described in literature. The network builds an approximation to the survival probability of a system at a given time, conditional on the system features. The resulting model is described in a hierarchical Bayesian framework. Experiments with synthetic and real world data compare the performance of this model with the commonly used standard ones.
Collapse
|
44
|
Tagliaferri R, Longo G, D'Argenio B, Incoronato A. Introduction: Neural networks for analysis of complex scientific data: astronomy and geosciences. Neural Netw 2003. [DOI: 10.1016/s0893-6080(03)00012-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
45
|
Tagliaferri R, Longo G, Milano L, Acernese F, Barone F, Ciaramella A, De Rosa R, Donalek C, Eleuteri A, Raiconi G, Sessa S, Staiano A, Volpicelli A. Neural networks in astronomy. Neural Netw 2003; 16:297-319. [PMID: 12672427 DOI: 10.1016/s0893-6080(03)00028-5] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
In the last decade, the use of neural networks (NN) and of other soft computing methods has begun to spread also in the astronomical community which, due to the required accuracy of the measurements, is usually reluctant to use automatic tools to perform even the most common tasks of data reduction and data mining. The federation of heterogeneous large astronomical databases which is foreseen in the framework of the astrophysical virtual observatory and national virtual observatory projects, is, however, posing unprecedented data mining and visualization problems which will find a rather natural and user friendly answer in artificial intelligence tools based on NNs, fuzzy sets or genetic algorithms. This review is aimed to both astronomers (who often have little knowledge of the methodological background) and computer scientists (who often know little about potentially interesting applications), and therefore will be structured as follows: after giving a short introduction to the subject, we shall summarize the methodological background and focus our attention on some of the most interesting fields of application, namely: object extraction and classification, time series analysis, noise identification, and data mining. Most of the original work described in the paper has been performed in the framework of the AstroNeural collaboration (Napoli-Salerno).
Collapse
Affiliation(s)
- Roberto Tagliaferri
- Departimento di Matematica e Informatica-DMI, Università di Salerno, Baronissi, Italy.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
46
|
Acernese F, Ciaramella A, De Martino S, De Rosa R, Falanga M, Tagliaferri R. Neural networks for blind-source separation of Stromboli explosion quakes. ACTA ACUST UNITED AC 2003; 14:167-75. [PMID: 18237999 DOI: 10.1109/tnn.2002.806649] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- F Acernese
- Dipt. di Sci. Fisiche, Universita di Napoli "Federico II", Italy
| | | | | | | | | | | |
Collapse
|
47
|
|
48
|
Andretta M, Eleuteri A, Fortezza F, Manco D, Mingozzi L, Serra R, Tagliaferri R. Neural Networks for Sulphur Dioxide Ground Level Concentrations Forecasting. Neural Comput Appl 2000. [DOI: 10.1007/s005210070020] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
49
|
Loia V, Sessa S, Staiano A, Tagliaferri R. Merging fuzzy logic, neural networks, and genetic computation in the design of a decision‐support system. INT J INTELL SYST 2000. [DOI: 10.1002/(sici)1098-111x(200007)15:7<575::aid-int1>3.3.co;2-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]
|
50
|
|