1
|
König C, Copado P, Lamarca M, Guendouz W, Fischer R, Schlechte M, Acuña V, Berna F, Gawęda Ł, Vellido A, Nebot À, Angulo C, Ochoa S. Data harmonization for the analysis of personalized treatment of psychosis with metacognitive training. Sci Rep 2025; 15:10159. [PMID: 40128308 PMCID: PMC11933379 DOI: 10.1038/s41598-025-94815-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2024] [Accepted: 03/17/2025] [Indexed: 03/26/2025] Open
Abstract
Personalized medicine is a data-driven approach that aims to adapt patients' diagnostics and therapies to their characteristics and needs. The availability of patients' data is therefore paramount for the personalization of treatments on the basis of predictive models, and even more so in machine learning-based analyses. Data harmonization is an essential part of the process of data curation. This study presents research on data harmonization for the development of a harmonized retrospective database of patients in Metacognitive Training (MCT) treatment for psychotic disorders. This work is part of the European ERAPERMED 2022-292 research project entitled 'Towards a Personalized Medicine Approach to Psychological Treatment of Psychosis' (PERMEPSY), which focuses on the development of a personalized medicine platform for the treatment of psychosis. The study integrates information from 22 studies into a common format to enable a data analytical approach for personalized treatment. The harmonized database comprises information about 698 patients who underwent MCT and includes a wide range of sociodemographic variables and psychological indicators used to assess a patient's mental health state. The characteristics of patients participating in the study are analyzed using descriptive statistics and exploratory data analysis.
Collapse
Affiliation(s)
- Caroline König
- Soft Computing Research Group (SOCO) at Intelligent Data Science and Artificial Intelligence (IDEAI-UPC) Research Centre, Universitat Politècnica de Catalunya (UPC Barcelona Tech), Jordi Girona 1-3, 08034, Barcelona, Spain.
| | - Pedro Copado
- Soft Computing Research Group (SOCO) at Intelligent Data Science and Artificial Intelligence (IDEAI-UPC) Research Centre, Universitat Politècnica de Catalunya (UPC Barcelona Tech), Jordi Girona 1-3, 08034, Barcelona, Spain
| | - Maria Lamarca
- MERITT Group, Institut de Recerca Sant Joan de Déu, Parc Sanitari Sant Joan de Déu, 08830, Sant Boi de Llobregat, Barcelona, Spain
- Consorcio de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain
- Clinical and Health Psychology Department, School of Psychology, Universitat Autònoma de Barcelona, Bellaterra, 08193, Barcelona, Spain
| | - Wafaa Guendouz
- Soft Computing Research Group (SOCO) at Intelligent Data Science and Artificial Intelligence (IDEAI-UPC) Research Centre, Universitat Politècnica de Catalunya (UPC Barcelona Tech), Jordi Girona 1-3, 08034, Barcelona, Spain
| | - Rabea Fischer
- Department of Psychiatry and Psychotherapy, University Medical Center Hamburg-Eppendorf, 20246, Hamburg, Germany
| | - Merle Schlechte
- Department of Psychiatry and Psychotherapy, University Medical Center Hamburg-Eppendorf, 20246, Hamburg, Germany
| | - Vanessa Acuña
- Departamento de Psiquiatría, Escuela de Medicina, Facultad de Medicina, Universidad de Valparaíso, Valparaíso, Chile
| | - Fabrice Berna
- Inserm, University Hospital of Strasbourg, University of Strasbourg, 67091, Strasbourg, France
| | - Łucasz Gawęda
- Experimental Psychopathology Lab, Institute of Psychology, Polish Academy of Sciences, Warsaw, Poland
| | - Alfredo Vellido
- Soft Computing Research Group (SOCO) at Intelligent Data Science and Artificial Intelligence (IDEAI-UPC) Research Centre, Universitat Politècnica de Catalunya (UPC Barcelona Tech), Jordi Girona 1-3, 08034, Barcelona, Spain
| | - Àngela Nebot
- Soft Computing Research Group (SOCO) at Intelligent Data Science and Artificial Intelligence (IDEAI-UPC) Research Centre, Universitat Politècnica de Catalunya (UPC Barcelona Tech), Jordi Girona 1-3, 08034, Barcelona, Spain
| | - Cecilio Angulo
- Knowledge Engineerig Research Group (GREC) at Intelligent Data Science and Artificial Intelligence (IDEAI-UPC) Research Centre, Universitat Politècnica de Catalunya (UPC Barcelona Tech), Jordi Girona 1-3, 08034, Barcelona, Spain
| | - Susana Ochoa
- MERITT Group, Institut de Recerca Sant Joan de Déu, Parc Sanitari Sant Joan de Déu, 08830, Sant Boi de Llobregat, Barcelona, Spain
- Consorcio de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Instituto de Salud Carlos III, Madrid, Spain
| |
Collapse
|
2
|
Turrisi R, Verri A, Barla A. Deep learning-based Alzheimer's disease detection: reproducibility and the effect of modeling choices. Front Comput Neurosci 2024; 18:1360095. [PMID: 39371524 PMCID: PMC11451303 DOI: 10.3389/fncom.2024.1360095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 09/03/2024] [Indexed: 10/08/2024] Open
Abstract
Introduction Machine Learning (ML) has emerged as a promising approach in healthcare, outperforming traditional statistical techniques. However, to establish ML as a reliable tool in clinical practice, adherence to best practices in data handling, and modeling design and assessment is crucial. In this work, we summarize and strictly adhere to such practices to ensure reproducible and reliable ML. Specifically, we focus on Alzheimer's Disease (AD) detection, a challenging problem in healthcare. Additionally, we investigate the impact of modeling choices, including different data augmentation techniques and model complexity, on overall performance. Methods We utilize Magnetic Resonance Imaging (MRI) data from the ADNI corpus to address a binary classification problem using 3D Convolutional Neural Networks (CNNs). Data processing and modeling are specifically tailored to address data scarcity and minimize computational overhead. Within this framework, we train 15 predictive models, considering three different data augmentation strategies and five distinct 3D CNN architectures with varying convolutional layers counts. The augmentation strategies involve affine transformations, such as zoom, shift, and rotation, applied either concurrently or separately. Results The combined effect of data augmentation and model complexity results in up to 10% variation in prediction accuracy. Notably, when affine transformation are applied separately, the model achieves higher accuracy, regardless the chosen architecture. Across all strategies, the model accuracy exhibits a concave behavior as the number of convolutional layers increases, peaking at an intermediate value. The best model reaches excellent performance both on the internal and additional external testing set. Discussions Our work underscores the critical importance of adhering to rigorous experimental practices in the field of ML applied to healthcare. The results clearly demonstrate how data augmentation and model depth-often overlooked factors- can dramatically impact final performance if not thoroughly investigated. This highlights both the necessity of exploring neglected modeling aspects and the need to comprehensively report all modeling choices to ensure reproducibility and facilitate meaningful comparisons across studies.
Collapse
Affiliation(s)
- Rosanna Turrisi
- Department of Informatics, Bioengineering, Robotics and System Engineering (DIBRIS), University of Genoa, Genoa, Italy
- Machine Learning Genoa (MaLGa) Center, University of Genoa, Genoa, Italy
| | - Alessandro Verri
- Department of Informatics, Bioengineering, Robotics and System Engineering (DIBRIS), University of Genoa, Genoa, Italy
- Machine Learning Genoa (MaLGa) Center, University of Genoa, Genoa, Italy
| | - Annalisa Barla
- Department of Informatics, Bioengineering, Robotics and System Engineering (DIBRIS), University of Genoa, Genoa, Italy
- Machine Learning Genoa (MaLGa) Center, University of Genoa, Genoa, Italy
| |
Collapse
|
3
|
Safarlou CW, Jongsma KR, Vermeulen R, Bredenoord AL. The ethical aspects of exposome research: a systematic review. EXPOSOME 2023; 3:osad004. [PMID: 37745046 PMCID: PMC7615114 DOI: 10.1093/exposome/osad004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
In recent years, exposome research has been put forward as the next frontier for the study of human health and disease. Exposome research entails the analysis of the totality of environmental exposures and their corresponding biological responses within the human body. Increasingly, this is operationalized by big-data approaches to map the effects of internal as well as external exposures using smart sensors and multiomics technologies. However, the ethical implications of exposome research are still only rarely discussed in the literature. Therefore, we conducted a systematic review of the academic literature regarding both the exposome and underlying research fields and approaches, to map the ethical aspects that are relevant to exposome research. We identify five ethical themes that are prominent in ethics discussions: the goals of exposome research, its standards, its tools, how it relates to study participants, and the consequences of its products. Furthermore, we provide a number of general principles for how future ethics research can best make use of our comprehensive overview of the ethical aspects of exposome research. Lastly, we highlight three aspects of exposome research that are most in need of ethical reflection: the actionability of its findings, the epidemiological or clinical norms applicable to exposome research, and the meaning and action-implications of bias.
Collapse
Affiliation(s)
- Caspar W. Safarlou
- Department of Global Public Health and Bioethics, Julius Center for
Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The
Netherlands
| | - Karin R. Jongsma
- Department of Global Public Health and Bioethics, Julius Center for
Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The
Netherlands
| | - Roel Vermeulen
- Department of Global Public Health and Bioethics, Julius Center for
Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The
Netherlands
- Department of Population Health Sciences, Utrecht University,
Utrecht, The Netherlands
| | - Annelien L. Bredenoord
- Department of Global Public Health and Bioethics, Julius Center for
Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The
Netherlands
- Erasmus School of Philosophy, Erasmus University Rotterdam,
Rotterdam, The Netherlands
| |
Collapse
|
4
|
Rico-Uribe LA, Morillo-Cuadrado D, Rodríguez-Laso Á, Vorstenbosch E, Weser AJ, Fincias L, Marcon Y, Rodriguez-Mañas L, Haro JM, Ayuso-Mateos JL. Worldwide mapping of initiatives that integrate population cohorts. Front Public Health 2022; 10:964086. [PMID: 36262229 PMCID: PMC9574101 DOI: 10.3389/fpubh.2022.964086] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Accepted: 09/14/2022] [Indexed: 11/29/2022] Open
Affiliation(s)
- Laura Alejandra Rico-Uribe
- CIBERSAM (Network-Based Biomedical Research Consortium, Area of Mental Health), Instituto de Salud Carlos III, Spanish Ministry of Science and Innovation, Madrid, Spain
- School of Health Sciences, Universidad Internacional de La Rioja, Logroño, Spain
| | - Daniel Morillo-Cuadrado
- CIBERSAM (Network-Based Biomedical Research Consortium, Area of Mental Health), Instituto de Salud Carlos III, Spanish Ministry of Science and Innovation, Madrid, Spain
- Instituto de Investigación Sanitaria La Princesa (IIS-LP), Hospital Universitario de La Princesa, Madrid, Spain
- *Correspondence: Daniel Morillo-Cuadrado
| | - Ángel Rodríguez-Laso
- CIBERFES (Network-Based Biomedical Research Consortium, Area of Frailty and Healthy Ageing), Instituto de Salud Carlos III, Spanish Ministry of Science and Innovation, Madrid, Spain
| | - Ellen Vorstenbosch
- CIBERSAM (Network-Based Biomedical Research Consortium, Area of Mental Health), Instituto de Salud Carlos III, Spanish Ministry of Science and Innovation, Madrid, Spain
- Parc Sanitari Sant Joan de Déu, Barcelona, Spain
- Department of Medicine, Universitat de Barcelona, Barcelona, Spain
| | - Andreas J. Weser
- HUNT (The Trøndelag Health Study) Research Centre, Norwegian University of Science and Technology, Trondheim, Norway
| | | | | | - Leocadio Rodriguez-Mañas
- CIBERFES (Network-Based Biomedical Research Consortium, Area of Frailty and Healthy Ageing), Instituto de Salud Carlos III, Spanish Ministry of Science and Innovation, Madrid, Spain
| | - Josep María Haro
- Parc Sanitari Sant Joan de Déu, Barcelona, Spain
- Department of Medicine, Universitat de Barcelona, Barcelona, Spain
| | - José Luis Ayuso-Mateos
- CIBERSAM (Network-Based Biomedical Research Consortium, Area of Mental Health), Instituto de Salud Carlos III, Spanish Ministry of Science and Innovation, Madrid, Spain
- Instituto de Investigación Sanitaria La Princesa (IIS-LP), Hospital Universitario de La Princesa, Madrid, Spain
- Department of Psychiatry, Universidad Autónoma de Madrid, Madrid, Spain
| |
Collapse
|
5
|
Torres Moral T, Sanchez-Niubo A, Monistrol-Mula A, Gerardi C, Banzi R, Garcia P, Demotes-Mainard J, Haro JM. Methods for Stratification and Validation Cohorts: A Scoping Review. J Pers Med 2022; 12:688. [PMID: 35629113 PMCID: PMC9144352 DOI: 10.3390/jpm12050688] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 03/31/2022] [Accepted: 04/15/2022] [Indexed: 12/12/2022] Open
Abstract
Personalized medicine requires large cohorts for patient stratification and validation of patient clustering. However, standards and harmonized practices on the methods and tools to be used for the design and management of cohorts in personalized medicine remain to be defined. This study aims to describe the current state-of-the-art in this area. A scoping review was conducted searching in PubMed, EMBASE, Web of Science, Psycinfo and Cochrane Library for reviews about tools and methods related to cohorts used in personalized medicine. The search focused on cancer, stroke and Alzheimer's disease and was limited to reports in English, French, German, Italian and Spanish published from 2005 to April 2020. The screening process was reported through a PRISMA flowchart. Fifty reviews were included, mostly including information about how data were generated (25/50) and about tools used for data management and analysis (24/50). No direct information was found about the quality of data and the requirements to monitor associated clinical data. A scarcity of information and standards was found in specific areas such as sample size calculation. With this information, comprehensive guidelines could be developed in the future to improve the reproducibility and robustness in the design and management of cohorts in personalized medicine studies.
Collapse
Affiliation(s)
- Teresa Torres Moral
- Research and Development Unit, Parc Sanitari Sant Joan de Déu, 08830 Barcelona, Spain; (T.T.M.); (A.M.-M.); (J.M.H.)
- Melanoma Unit, Dermatology Department, August Pi i Sunyer Biomedical Research Institute (IDIBAPS) and Hospital Clínic, 08036 Barcelona, Spain
- Center for Networked Biomedical Research on Rare Diseases (CIBERER), Carlos III Health Institute, 28029 Madrid, Spain
| | - Albert Sanchez-Niubo
- Research and Development Unit, Parc Sanitari Sant Joan de Déu, 08830 Barcelona, Spain; (T.T.M.); (A.M.-M.); (J.M.H.)
- Center for Networked Biomedical Research on Mental Health (CIBERSAM), Carlos III Health Institute, 28029 Madrid, Spain
- Department of Social Psychology and Quantitative Psychology, University of Barcelona, 08028 Barcelona, Spain
| | - Anna Monistrol-Mula
- Research and Development Unit, Parc Sanitari Sant Joan de Déu, 08830 Barcelona, Spain; (T.T.M.); (A.M.-M.); (J.M.H.)
| | - Chiara Gerardi
- Centre for Health Regulatory Policies, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, 20156 Milan, Italy; (C.G.); (R.B.)
| | - Rita Banzi
- Centre for Health Regulatory Policies, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, 20156 Milan, Italy; (C.G.); (R.B.)
| | - Paula Garcia
- ECRIN, European Clinical Research Infrastructure Network, 75013 Paris, France; (P.G.); (J.D.-M.)
| | - Jacques Demotes-Mainard
- ECRIN, European Clinical Research Infrastructure Network, 75013 Paris, France; (P.G.); (J.D.-M.)
| | - Josep Maria Haro
- Research and Development Unit, Parc Sanitari Sant Joan de Déu, 08830 Barcelona, Spain; (T.T.M.); (A.M.-M.); (J.M.H.)
- Center for Networked Biomedical Research on Mental Health (CIBERSAM), Carlos III Health Institute, 28029 Madrid, Spain
| | | |
Collapse
|
6
|
Pezoulas VC, Exarchos TP, Tzioufas AG, Fotiadis DI. Multiple additive regression trees with hybrid loss for classification tasks across heterogeneous clinical data in distributed environments: a case study. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2021; 2021:1670-1673. [PMID: 34891606 DOI: 10.1109/embc46164.2021.9629912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Multiple additive regression trees (MART) have been widely used in the literature for various classification tasks. However, the overfitting effects of MART across heterogeneous and highly imbalanced big data structures within distributed environments has not yet been investigated. In this work, we utilize distributed MART with hybrid loss to resolve overfitting effects during the training of disease classification models in a case study with 10 heterogeneous and distributed clinical datasets. Lexical and semantic analysis methods were utilized to match heterogeneous terminologies with 80% overlap. Data augmentation was used to resolve class imbalance yielding virtual data with goodness of fit 0.01 and correlation difference 0.02. Our results highlight the favorable performance of the proposed distributed MART on the augmented data with an average increase by 7.3% in the accuracy, 6.8% in sensitivity, 10.4% in specificity, for a specific loss function topology.
Collapse
|
7
|
Wong-Lin K, McClean PL, McCombe N, Kaur D, Sanchez-Bornot JM, Gillespie P, Todd S, Finn DP, Joshi A, Kane J, McGuinness B. Shaping a data-driven era in dementia care pathway through computational neurology approaches. BMC Med 2020; 18:398. [PMID: 33323116 PMCID: PMC7738245 DOI: 10.1186/s12916-020-01841-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Accepted: 11/03/2020] [Indexed: 01/30/2023] Open
Abstract
BACKGROUND Dementia is caused by a variety of neurodegenerative diseases and is associated with a decline in memory and other cognitive abilities, while inflicting an enormous socioeconomic burden. The complexity of dementia and its associated comorbidities presents immense challenges for dementia research and care, particularly in clinical decision-making. MAIN BODY Despite the lack of disease-modifying therapies, there is an increasing and urgent need to make timely and accurate clinical decisions in dementia diagnosis and prognosis to allow appropriate care and treatment. However, the dementia care pathway is currently suboptimal. We propose that through computational approaches, understanding of dementia aetiology could be improved, and dementia assessments could be more standardised, objective and efficient. In particular, we suggest that these will involve appropriate data infrastructure, the use of data-driven computational neurology approaches and the development of practical clinical decision support systems. We also discuss the technical, structural, economic, political and policy-making challenges that accompany such implementations. CONCLUSION The data-driven era for dementia research has arrived with the potential to transform the healthcare system, creating a more efficient, transparent and personalised service for dementia.
Collapse
Affiliation(s)
- KongFatt Wong-Lin
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Magee Campus, Londonderry, Northern Ireland, UK.
| | - Paula L McClean
- Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, Ulster University, Magee Campus, Londonderry, Northern Ireland, UK
| | - Niamh McCombe
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Magee Campus, Londonderry, Northern Ireland, UK
| | - Daman Kaur
- Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, Ulster University, Magee Campus, Londonderry, Northern Ireland, UK
| | - Jose M Sanchez-Bornot
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Magee Campus, Londonderry, Northern Ireland, UK
| | - Paddy Gillespie
- Health Economics and Policy Analysis Centre, Discipline of Economics, National University of Ireland, Galway, Ireland
| | - Stephen Todd
- Altnagelvin Area Hospital, Western Health and Social Care Trust, Londonderry, Northern Ireland, UK
| | - David P Finn
- Pharmacology and Therapeutics, School of Medicine, Galway Neuroscience Centre, National University of Ireland, Galway, Ireland
| | - Alok Joshi
- Intelligent Systems Research Centre, School of Computing, Engineering and Intelligent Systems, Ulster University, Magee Campus, Londonderry, Northern Ireland, UK
| | - Joseph Kane
- School of Medicine, Dentistry and Biomedical Sciences, Institute for Health Sciences, Centre for Public Health, Queen's University Belfast, Belfast, Northern Ireland, UK
| | - Bernadette McGuinness
- School of Medicine, Dentistry and Biomedical Sciences, Institute for Health Sciences, Centre for Public Health, Queen's University Belfast, Belfast, Northern Ireland, UK
| |
Collapse
|
8
|
Pezoulas VC, Kourou KD, Kalatzis F, Exarchos TP, Zampeli E, Gandolfo S, Goules A, Baldini C, Skopouli F, De Vita S, Tzioufas AG, Fotiadis DI. Overcoming the Barriers That Obscure the Interlinking and Analysis of Clinical Data Through Harmonization and Incremental Learning. IEEE OPEN JOURNAL OF ENGINEERING IN MEDICINE AND BIOLOGY 2020; 1:83-90. [PMID: 35402941 PMCID: PMC8940202 DOI: 10.1109/ojemb.2020.2981258] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Revised: 02/23/2020] [Accepted: 03/09/2020] [Indexed: 11/22/2022] Open
Abstract
Goal: To present a framework for data sharing, curation, harmonization and federated data analytics to solve open issues in healthcare, such as, the development of robust disease prediction models. Methods: Data curation is applied to remove data inconsistencies. Lexical and semantic matching methods are used to align the structure of the heterogeneous, curated cohort data along with incremental learning algorithms including class imbalance handling and hyperparameter optimization to enable the development of disease prediction models. Results: The applicability of the framework is demonstrated in a case study of primary Sjögren's Syndrome, yielding harmonized data with increased quality and more than 85% agreement, along with lymphoma prediction models with more than 80% sensitivity and specificity. Conclusions: The framework provides data quality, harmonization and analytics workflows that can enhance the statistical power of heterogeneous clinical data and enables the development of robust models for disease prediction.
Collapse
Affiliation(s)
- Vasileios C Pezoulas
- Unit of Medical Technology and Intelligent Information Systems, Department of Materials Science and EngineeringUniversity of Ioannina GR45110 Ioannina Greece
| | - Konstantina D Kourou
- Unit of Medical Technology and Intelligent Information Systems, Department of Materials Science and EngineeringUniversity of Ioannina GR45110 Ioannina Greece
- Department of Biological Applications and TechnologyUniversity of Ioannina GR45110 Ioannina Greece
| | - Fanis Kalatzis
- Unit of Medical Technology and Intelligent Information Systems, Department of Materials Science and EngineeringUniversity of Ioannina GR45110 Ioannina Greece
| | - Themis P Exarchos
- Department of InformaticsIonian University GR49100 Corfu Greece
- Unit of Medical Technology and Intelligent Information Systems, Department of Materials Science and EngineeringUniversity of Ioannina GR45100 Ioannina Greece
| | - Evi Zampeli
- Institute for Systemic Autoimmune and Neurological Diseases GR11743 Athens Greece
| | - Saviana Gandolfo
- Clinic of Rheumatology, Department of Medical and Biological SciencesUdine University IT33100 Udine Italy
| | - Andreas Goules
- Department of Pathophysiology, School of MedicineUniversity of Athens GR15772 Athens Greece
| | - Chiara Baldini
- Department of Clinical and Experimental MedicineUniversity of Pisa Pisa IT56126 Italy
| | - Fotini Skopouli
- Department of Internal Medicine and Clinical ImmunologyEuroclinic Hospital GR11521 Athens Greece
| | - Salvatore De Vita
- Clinic of Rheumatology, Department of Medical and Biological SciencesUdine University IT33100 Udine Italy
| | - Athanasios G Tzioufas
- Department of Pathophysiology, School of MedicineUniversity of Athens GR15772 Athens Greece
| | - Dimitrios I Fotiadis
- Unit of Medical Technology and Intelligent Information Systems, Department of Materials Science and EngineeringUniversity of Ioannina GR45110 Ioannina Greece
- Department of Biomedical ResearchFORTH-IMBB GR45110 Ioannina Greece
| |
Collapse
|
9
|
De Vita S, Gandolfo S. Predicting lymphoma development in patients with Sjögren's syndrome. Expert Rev Clin Immunol 2019; 15:929-938. [PMID: 31347413 DOI: 10.1080/1744666x.2019.1649596] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Introduction: The issue of predicting lymphoma in primary Sjögren's syndrome (pSS) starts from its clinical and biologic essence, i.e., an autoimmune exocrinopathy with sicca syndrome, inflammation and lymphoproliferation of MALT (mucosa-associated lymphoid tissue) in exocrine glands. Areas covered: The two major predictors to be firstly focused are persistent salivary gland (SG) swelling and cryoglobulinemic vasculitis with related features as purpura and low C4, or the sole serum cryoglobulinemia repeatedly detected. They are pathogenetically linked and reflect a heavier MALT involvement by histopathology, with the expansion of peculiar rheumatoid factor (RF)-positive clones/idiotypes. Other predictors include lymphadenopathy, splenomegaly, neutropenia, lymphopenia, serum beta2-microglobulin, monoclonal immunoglobulins, light chains, and RF. Composite indexes/scores may also predict lymphoma. Expert opinion: Prediction at baseline needs amelioration, and must be repeated in the follow-up. Careful clinical characterization, with harmonization and stratification of large cohorts, is a relevant preliminary step. Validated and new biomarkers are needed in biologic fluids and tissues. SG echography with automatic scoring could represent a future imaging biomarker, still lacking. Scoring MALT involvement in pSS, as an additional tool to evaluate disease activity and possibly to predict lymphoma, is welcomed. All these efforts are now ongoing within the HarmonicSS project and in other research initiatives in pSS.
Collapse
Affiliation(s)
- Salvatore De Vita
- Rheumatology Clinic, Udine University Hospital, Department of Medical Area, University of Udine , Udine , Italy
| | - Saviana Gandolfo
- Rheumatology Clinic, Udine University Hospital, Department of Medical Area, University of Udine , Udine , Italy
| |
Collapse
|
10
|
Pezoulas VC, Kourou KD, Kalatzis F, Exarchos TP, Venetsanopoulou A, Zampeli E, Gandolfo S, Skopouli F, De Vita S, Tzioufas AG, Fotiadis DI. Medical data quality assessment: On the development of an automated framework for medical data curation. Comput Biol Med 2019; 107:270-283. [PMID: 30878889 DOI: 10.1016/j.compbiomed.2019.03.001] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Revised: 03/01/2019] [Accepted: 03/02/2019] [Indexed: 01/25/2023]
Abstract
Data quality assessment has gained attention in the recent years since more and more companies and medical centers are highlighting the importance of an automated framework to effectively manage the quality of their big data. Data cleaning, also known as data curation, lies in the heart of the data quality assessment and is a key aspect prior to the development of any data analytics services. In this work, we present the objectives, functionalities and methodological advances of an automated framework for data curation from a medical perspective. The steps towards the development of a system for data quality assessment are first described along with multidisciplinary data quality measures. A three-layer architecture which realizes these steps is then presented. Emphasis is given on the detection and tracking of inconsistencies, missing values, outliers, and similarities, as well as, on data standardization to finally enable data harmonization. A case study is conducted in order to demonstrate the applicability and reliability of the proposed framework on two well-established cohorts with clinical data related to the primary Sjögren's Syndrome (pSS). Our results confirm the validity of the proposed framework towards the automated and fast identification of outliers, inconsistencies, and highly-correlated and duplicated terms, as well as, the successful matching of more than 85% of the pSS-related medical terms in both cohorts, yielding more accurate, relevant, and consistent clinical data.
Collapse
Affiliation(s)
- Vasileios C Pezoulas
- Unit of Medical Technology and Intelligent Information Systems, Department of Materials Science and Engineering, University of Ioannina, Ioannina, GR45110, Greece
| | - Konstantina D Kourou
- Unit of Medical Technology and Intelligent Information Systems, Department of Materials Science and Engineering, University of Ioannina, Ioannina, GR45110, Greece; Department of Biological Applications and Technology, University of Ioannina, Ioannina, GR45110, Greece
| | - Fanis Kalatzis
- Unit of Medical Technology and Intelligent Information Systems, Department of Materials Science and Engineering, University of Ioannina, Ioannina, GR45110, Greece
| | - Themis P Exarchos
- Unit of Medical Technology and Intelligent Information Systems, Department of Materials Science and Engineering, University of Ioannina, Ioannina, GR45110, Greece; Department of Informatics, Ionian University, Corfu, GR49100, Greece
| | - Aliki Venetsanopoulou
- Department of Pathophysiology, School of Medicine, University of Athens, Athens, GR15772, Greece
| | - Evi Zampeli
- Institute for Systemic Autoimmune and Neurological Diseases, Athens, GR11743, Greece
| | - Saviana Gandolfo
- Clinic of Rheumatology, Department of Medical and Biological Sciences, Udine University, Udine, IT33100, Italy
| | - Fotini Skopouli
- Department of Internal Medicine and Clinical Immunology, Euroclinic Hospital, Athens, GR11521, Greece
| | - Salvatore De Vita
- Clinic of Rheumatology, Department of Medical and Biological Sciences, Udine University, Udine, IT33100, Italy
| | - Athanasios G Tzioufas
- Department of Pathophysiology, School of Medicine, University of Athens, Athens, GR15772, Greece
| | - Dimitrios I Fotiadis
- Unit of Medical Technology and Intelligent Information Systems, Department of Materials Science and Engineering, University of Ioannina, Ioannina, GR45110, Greece; Department of Biomedical Research, FORTH-IMBB, Ioannina, GR45110, Greece.
| |
Collapse
|