1
|
Morabito A, De Simone G, Pastorelli R, Brunelli L, Ferrario M. Algorithms and tools for data-driven omics integration to achieve multilayer biological insights: a narrative review. J Transl Med 2025; 23:425. [PMID: 40211300 PMCID: PMC11987215 DOI: 10.1186/s12967-025-06446-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2025] [Accepted: 03/30/2025] [Indexed: 04/13/2025] Open
Abstract
Systems biology is a holistic approach to biological sciences that combines experimental and computational strategies, aimed at integrating information from different scales of biological processes to unravel pathophysiological mechanisms and behaviours. In this scenario, high-throughput technologies have been playing a major role in providing huge amounts of omics data, whose integration would offer unprecedented possibilities in gaining insights on diseases and identifying potential biomarkers. In the present review, we focus on strategies that have been applied in literature to integrate genomics, transcriptomics, proteomics, and metabolomics in the year range 2018-2024. Integration approaches were divided into three main categories: statistical-based approaches, multivariate methods, and machine learning/artificial intelligence techniques. Among them, statistical approaches (mainly based on correlation) were the ones with a slightly higher prevalence, followed by multivariate approaches, and machine learning techniques. Integrating multiple biological layers has shown great potential in uncovering molecular mechanisms, identifying putative biomarkers, and aid classification, most of the time resulting in better performances when compared to single omics analyses. However, significant challenges remain. The high-throughput nature of omics platforms introduces issues such as variable data quality, missing values, collinearity, and dimensionality. These challenges further increase when combining multiple omics datasets, as the complexity and heterogeneity of the data increase with integration. We report different strategies that have been found in literature to cope with these challenges, but some open issues still remain and should be addressed to disclose the full potential of omics integration.
Collapse
Affiliation(s)
- Aurelia Morabito
- Laboratory of Metabolites and Proteins in Translational Research, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, 20156, Milan, Italy.
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, 20133, Milan, Italy.
| | - Giulia De Simone
- Laboratory of Metabolites and Proteins in Translational Research, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, 20156, Milan, Italy
- Department of Biotechnologies and Biosciences, Università degli Studi Milano Bicocca, 20126, Milan, Italy
| | - Roberta Pastorelli
- Laboratory of Metabolites and Proteins in Translational Research, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, 20156, Milan, Italy
| | - Laura Brunelli
- Laboratory of Metabolites and Proteins in Translational Research, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, 20156, Milan, Italy
| | - Manuela Ferrario
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, 20133, Milan, Italy
| |
Collapse
|
2
|
Liu H, Kou W, Wu YC, Hing Chau P, Chung TWH, Fong DYT. Predicting Childhood and Adolescence Hypertension: Analysis of Predictors Using Machine Learning. Pediatrics 2025:e2024066675. [PMID: 39900096 DOI: 10.1542/peds.2024-066675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Accepted: 11/19/2024] [Indexed: 02/05/2025] Open
Abstract
BACKGROUND There has been a substantial burden of hypertension in children and adolescents. Given the availability of primary prevention strategies, it is important to determine predictors for early identification of children and adolescents at risk of hypertension. This study aims to attempt and validate machine learning (ML) algorithms for accurately predicting blood pressure (BP) status (normal, prehypertension, and hypertension) over 1- and 3-year periods, identifying key predictors without compromising model performance. METHODS We included a population-based cohort of primary 1 to secondary 6 students (typically aged 6 to 18 years) during the academic years of 1995 to 1996 and 2019 to 2020 in Hong Kong. Thirty-six easy-assessed predictors were initially model childhood BP status. Multiple ML algorithms, decision tree, random forest, k-nearest neighbor, eXtreme Gradient Boosting (XGBoost), and multinomial logistic regression (MLR), were used. Model evaluation was performed by various accuracy metrics. The Shapley Additive Explanations (SHAP) was used to identify key features for both predictions. RESULTS A total of 923 301 and 602 179 visit pairs were used for the 1- and 3-year predictions, respectively. XGBoost demonstrated the highest prediction accuracies for 1-year (macro-area under the receiver operating characteristic curve [AUROC] = 0.92, micro-AUROC = 0.91) and 3-year (macro-AUROC = 0.91, micro-AUROC = 0.90) periods. The traditional MLR approach had the lowest accuracies for 1- (macro-AUROC = 0.70, micro-AUROC = 0.68) and 3-year (macro-AUROC = 0.70, micro-AUROC = 0.68) predictions. The SHAP values identified 17 key predictors without the need for direct BP measurements or laboratory tests. CONCLUSION ML prediction models can accurately predict childhood prehypertension and hypertension at 1 and 3 years, independent of BP and laboratory measurements. The identified key predictors may inform areas for personalized prevention in hypertension.
Collapse
Affiliation(s)
- Hengyan Liu
- School of Nursing, The University of Hong Kong, Hong Kong, PR China
| | - Weibin Kou
- Department of Electrical and Electronic Engineering, The University of Hong Kong, Hong Kong, PR China
| | - Yik-Chung Wu
- Department of Electrical and Electronic Engineering, The University of Hong Kong, Hong Kong, PR China
| | - Pui Hing Chau
- School of Nursing, The University of Hong Kong, Hong Kong, PR China
| | | | | |
Collapse
|
3
|
Fowler SL, Behr TS, Turkes E, O'Brien DP, Cauhy PM, Rawlinson I, Edmonds M, Foiani MS, Schaler A, Crowley G, Bez S, Ficulle E, Tsefou E, Fischer R, Geary B, Gaur P, Miller C, D'Acunzo P, Levy E, Duff KE, Ryskeldi-Falcon B. Tau filaments are tethered within brain extracellular vesicles in Alzheimer's disease. Nat Neurosci 2025; 28:40-48. [PMID: 39572740 PMCID: PMC11706778 DOI: 10.1038/s41593-024-01801-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 09/25/2024] [Indexed: 11/27/2024]
Abstract
The abnormal assembly of tau protein in neurons is a pathological hallmark of multiple neurodegenerative diseases, including Alzheimer's disease (AD). Assembled tau associates with extracellular vesicles (EVs) in the central nervous system of individuals with AD, which is linked to its clearance and prion-like propagation. However, the identities of the assembled tau species and EVs, as well as how they associate, are not known. Here, we combined quantitative mass spectrometry, cryo-electron tomography and single-particle cryo-electron microscopy to study brain EVs from individuals with AD. We found tau filaments composed mainly of truncated tau that were enclosed within EVs enriched in endo-lysosomal proteins. We observed multiple filament interactions, including with molecules that tethered filaments to the EV limiting membrane, suggesting selective packaging. Our findings will guide studies into the molecular mechanisms of EV-mediated secretion of assembled tau and inform the targeting of EV-associated tau as potential therapeutic and biomarker strategies for AD.
Collapse
Affiliation(s)
- Stephanie L Fowler
- UK Dementia Research Institute at University College London, London, UK
- Oxford-GSK Institute of Molecular and Computational Medicine, University of Oxford, Oxford, UK
| | - Tiana S Behr
- Medical Research Council Laboratory of Molecular Biology, Cambridge, UK
| | - Emir Turkes
- UK Dementia Research Institute at University College London, London, UK
| | - Darragh P O'Brien
- Target Discovery Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | | | - Isadora Rawlinson
- Medical Research Council Laboratory of Molecular Biology, Cambridge, UK
| | - Marisa Edmonds
- UK Dementia Research Institute at University College London, London, UK
| | - Martha S Foiani
- UK Dementia Research Institute at University College London, London, UK
| | - Ari Schaler
- Taub Institute, Irving Medical Center, Columbia University, New York, NY, USA
| | - Gerard Crowley
- UK Dementia Research Institute at University College London, London, UK
| | - Sumi Bez
- UK Dementia Research Institute at University College London, London, UK
| | - Elena Ficulle
- UK Dementia Research Institute at University College London, London, UK
| | - Eliona Tsefou
- UK Dementia Research Institute at University College London, London, UK
| | - Roman Fischer
- Target Discovery Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Beth Geary
- Medical Research Council Protein Phosphorylation and Ubiquitylation Unit, University of Dundee, Dundee, UK
| | - Pallavi Gaur
- Department of Neurology, Center for Translational and Computational Neuroimmunology, Columbia University Medical Center, New York, NY, USA
| | - Chelsea Miller
- The Center for Genetic and Genomic Medicine, Hackensack University Medical Center, Hackensack, NJ, USA
- Center for Dementia Research, Nathan S. Kline Institute for Psychiatric Research, Orangeburg, NY, USA
| | - Pasquale D'Acunzo
- Center for Dementia Research, Nathan S. Kline Institute for Psychiatric Research, Orangeburg, NY, USA
- Department of Psychiatry, New York University Grossman School of Medicine, New York, NY, USA
| | - Efrat Levy
- Center for Dementia Research, Nathan S. Kline Institute for Psychiatric Research, Orangeburg, NY, USA
- Department of Psychiatry, New York University Grossman School of Medicine, New York, NY, USA
- Department of Biochemistry and Molecular Pharmacology, New York University Grossman School of Medicine, New York, NY, USA
- Neuroscience Institute, New York University Grossman School of Medicine, New York, NY, USA
| | - Karen E Duff
- UK Dementia Research Institute at University College London, London, UK.
- Taub Institute, Irving Medical Center, Columbia University, New York, NY, USA.
| | | |
Collapse
|
4
|
Etourneau L, Fancello L, Wieczorek S, Varoquaux N, Burger T. Penalized likelihood optimization for censored missing value imputation in proteomics. Biostatistics 2024; 26:kxaf006. [PMID: 40120089 DOI: 10.1093/biostatistics/kxaf006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2024] [Revised: 01/31/2025] [Accepted: 02/03/2025] [Indexed: 03/25/2025] Open
Abstract
Label-free bottom-up proteomics using mass spectrometry and liquid chromatography has long been established as one of the most popular high-throughput analysis workflows for proteome characterization. However, it produces data hindered by complex and heterogeneous missing values, which imputation has long remained problematic. To cope with this, we introduce Pirat, an algorithm that harnesses this challenge using an original likelihood maximization strategy. Notably, it models the instrument limit by learning a global censoring mechanism from the data available. Moreover, it estimates the covariance matrix between enzymatic cleavage products (ie peptides or precursor ions), while offering a natural way to integrate complementary transcriptomic information when multi-omic assays are available. Our benchmarking on several datasets covering a variety of experimental designs (number of samples, acquisition mode, missingness patterns, etc.) and using a variety of metrics (differential analysis ground truth or imputation errors) shows that Pirat outperforms all pre-existing imputation methods. Beyond the interest of Pirat as an imputation tool, these results pinpoint the need for a paradigm change in proteomics imputation, as most pre-existing strategies could be boosted by incorporating similar models to account for the instrument censorship or for the correlation structures, either grounded to the analytical pipeline or arising from a multi-omic approach.
Collapse
Affiliation(s)
- Lucas Etourneau
- Univ. Grenoble Alpes, CNRS, CEA, INSERM, BGE UA13, ProFI FR2048, EDyP, Bâtiment 42b, CEA de Grenoble, 17 avenue des Martyrs, 38054 Grenoble Cedex 9, France
- TIMC, Univ. Grenoble Alpes, CNRS, Grenoble INP, Laboratoire TIMC, Rond-Point de la Croix de Vie, 38700 La Tronche, France
| | - Laura Fancello
- Univ. Grenoble Alpes, CNRS, CEA, INSERM, BGE UA13, ProFI FR2048, EDyP, Bâtiment 42b, CEA de Grenoble, 17 avenue des Martyrs, 38054 Grenoble Cedex 9, France
| | - Samuel Wieczorek
- Univ. Grenoble Alpes, CNRS, CEA, INSERM, BGE UA13, ProFI FR2048, EDyP, Bâtiment 42b, CEA de Grenoble, 17 avenue des Martyrs, 38054 Grenoble Cedex 9, France
| | - Nelle Varoquaux
- TIMC, Univ. Grenoble Alpes, CNRS, Grenoble INP, Laboratoire TIMC, Rond-Point de la Croix de Vie, 38700 La Tronche, France
| | - Thomas Burger
- Univ. Grenoble Alpes, CNRS, CEA, INSERM, BGE UA13, ProFI FR2048, EDyP, Bâtiment 42b, CEA de Grenoble, 17 avenue des Martyrs, 38054 Grenoble Cedex 9, France
| |
Collapse
|
5
|
Krummeich J, Nardi L, Caliendo C, Aschauer D, Engelhardt V, Arlt A, Maier J, Bicker F, Kwiatkowski MD, Rolski K, Vincze K, Schneider R, Rumpel S, Gerber S, Schmeisser MJ, Schweiger S. Premature cognitive decline in a mouse model of tuberous sclerosis. Aging Cell 2024; 23:e14318. [PMID: 39192595 PMCID: PMC11634721 DOI: 10.1111/acel.14318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Revised: 07/15/2024] [Accepted: 08/06/2024] [Indexed: 08/29/2024] Open
Abstract
Little is known about the influence of (impaired) neurodevelopment on cognitive aging. We here used a mouse model for tuberous sclerosis (TS) carrying a heterozygous deletion of the Tsc2 gene. Loss of Tsc2 function leads to mTOR hyperactivity in mice and patients. In a longitudinal behavioral analysis, we found premature decline of hippocampus-based cognitive functions together with a significant reduction of immediate early gene (IEG) expression. While we did not detect any morphological changes of hippocampal projections and synaptic contacts, molecular markers of neurodegeneration were increased and the mTOR signaling cascade was downregulated in hippocampal synaptosomes. Injection of IGF2, a molecule that induces mTOR signaling, could fully rescue cognitive impairment and IEG expression in aging Tsc2+/- animals. This data suggests that TS is an exhausting disease that causes erosion of the mTOR pathway over time and IGF2 is a promising avenue for treating age-related degeneration in mTORopathies.
Collapse
Affiliation(s)
- J. Krummeich
- Institute of Human GeneticsUniversity Medical Center of the Johannes Gutenberg University MainzMainzGermany
- Present address:
Bioscientia Institut für Medizinische Diagnostik GmbH HumangenetikIngelheimGermany
| | - L. Nardi
- Institute of AnatomyUniversity Medical Center of the Johannes Gutenberg University MainzMainzGermany
| | - C. Caliendo
- Institute of Human GeneticsUniversity Medical Center of the Johannes Gutenberg University MainzMainzGermany
| | - D. Aschauer
- Institute of PhysiologyUniversity Medical Center of the Johannes Gutenberg University MainzMainzGermany
| | - V. Engelhardt
- Institute of Human GeneticsUniversity Medical Center of the Johannes Gutenberg University MainzMainzGermany
| | - A. Arlt
- Institute of Human GeneticsUniversity Medical Center of the Johannes Gutenberg University MainzMainzGermany
- Present address:
Institute for Genomic Statistics and BioinformaticsUniversity of BonnBonnGermany
| | - J. Maier
- Institute of AnatomyUniversity Medical Center of the Johannes Gutenberg University MainzMainzGermany
| | - F. Bicker
- Institute of AnatomyUniversity Medical Center of the Johannes Gutenberg University MainzMainzGermany
| | | | - K. Rolski
- Department of BiochemistryUniversity of InnsbruckInnsbruckAustria
| | - K. Vincze
- Department of BiochemistryUniversity of InnsbruckInnsbruckAustria
| | - R. Schneider
- Department of BiochemistryUniversity of InnsbruckInnsbruckAustria
| | - S. Rumpel
- Institute of PhysiologyUniversity Medical Center of the Johannes Gutenberg University MainzMainzGermany
| | - S. Gerber
- Institute of Human GeneticsUniversity Medical Center of the Johannes Gutenberg University MainzMainzGermany
| | - M. J. Schmeisser
- Institute of AnatomyUniversity Medical Center of the Johannes Gutenberg University MainzMainzGermany
| | - S. Schweiger
- Institute of Human GeneticsUniversity Medical Center of the Johannes Gutenberg University MainzMainzGermany
- Leibniz Institute of Resilience ResearchMainzGermany
- Institute of Molecular BiologyMainzGermany
| |
Collapse
|
6
|
Blaszkiewicz M, Johnson CP, Willows JW, Gardner ML, Taplin DR, Freitas MA, Townsend KL. The early transition to cold-induced browning in mouse subcutaneous white adipose tissue (scWAT) involves proteins related to nerve remodeling, cytoskeleton, mitochondria, and immune cells. Adipocyte 2024; 13:2428938. [PMID: 39641403 PMCID: PMC11633174 DOI: 10.1080/21623945.2024.2428938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/11/2024] [Revised: 10/10/2024] [Accepted: 11/04/2024] [Indexed: 12/07/2024] Open
Abstract
White adipose tissue (WAT) is a dynamic organ capable of remodelling in response to metabolic state. For example, in response to stimuli such as cold exposure, WAT can develop inducible brown adipocytes ('browning') capable of non-shivering thermogenesis, through concurrent changes to mitochondrial content and function. This is aided by increased neurite outgrowth and angiogenesis across the tissue, providing the needed neurovascular supply for uncoupling protein 1 activation. While several RNA-sequencing studies have been performed in WAT, including newer single cell and single nuclei studies, little work has been done to investigate changes to the adipose proteome, particularly during dynamic periods of tissue remodelling such as cold stimulation. Here, we conducted a comprehensive proteomic analysis of inguinal subcutaneous (sc) WAT during the initial 'browning' period of 24 or 72hrs of cold exposure in mice. We identified four significant pathways impacted by cold stimulation that are involved in tissue remodelling, which included mitochondrial function and metabolism, cytoskeletal remodelling, the immune response, and the nervous system. Taken together, we found that early changes in the proteome of WAT with cold stimulation predicted later structural and functional changes in the tissue that are important for tissue and whole-body remodelling to meet energetic and metabolic needs.
Collapse
Affiliation(s)
| | - Cory P. Johnson
- Graduate School of Biomedical Science and Engineering, University of Maine, Orono, ME, USA
| | - Jake W. Willows
- Department of Neurological Surgery, The Ohio State University, Columbus, OH, USA
| | - Miranda L. Gardner
- Department of Cancer Biology and Genetics, Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
- Ohio State Biochemistry Program, Chemistry and Biochemistry, The Ohio State University, Columbus, OH, USA
| | - Dylan R. Taplin
- School of Biology and Ecology, University of Maine, Orono, ME, USA
| | - Michael A. Freitas
- Department of Cancer Biology and Genetics, Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
- Ohio State Biochemistry Program, Chemistry and Biochemistry, The Ohio State University, Columbus, OH, USA
| | - Kristy L. Townsend
- Department of Neurological Surgery, The Ohio State University, Columbus, OH, USA
- Graduate School of Biomedical Science and Engineering, University of Maine, Orono, ME, USA
- School of Biology and Ecology, University of Maine, Orono, ME, USA
- Department of Chemical and Biomedical Engineering, University of Maine, Orono, ME, USA
| |
Collapse
|
7
|
Firsanov D, Zacher M, Tian X, Sformo TL, Zhao Y, Tombline G, Lu JY, Zheng Z, Perelli L, Gurreri E, Zhang L, Guo J, Korotkov A, Volobaev V, Biashad SA, Zhang Z, Heid J, Maslov A, Sun S, Wu Z, Gigas J, Hillpot E, Martinez J, Lee M, Williams A, Gilman A, Hamilton N, Haseljic E, Patel A, Straight M, Miller N, Ablaeva J, Tam LM, Couderc C, Hoopman M, Moritz R, Fujii S, Hayman DJ, Liu H, Cai Y, Leung AKL, Simons MJP, Zhang Z, Nelson CB, Abegglen LM, Schiffman JD, Gladyshev VN, Modesti M, Genovese G, Vijg J, Seluanov A, Gorbunova V. DNA repair and anti-cancer mechanisms in the long-lived bowhead whale. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.05.07.539748. [PMID: 39574710 PMCID: PMC11580846 DOI: 10.1101/2023.05.07.539748] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]
Abstract
At over 200 years, the maximum lifespan of the bowhead whale exceeds that of all other mammals. The bowhead is also the second-largest animal on Earth, reaching over 80,000 kg1. Despite its very large number of cells and long lifespan, the bowhead is not highly cancer-prone, an incongruity termed Peto's Paradox2. This phenomenon has been explained by the evolution of additional tumor suppressor genes in other larger animals, supported by research on elephants demonstrating expansion of the p53 gene3-5. Here we show that bowhead whale fibroblasts undergo oncogenic transformation after disruption of fewer tumor suppressors than required for human fibroblasts. However, analysis of DNA repair revealed that bowhead cells repair double strand breaks (DSBs) and mismatches with uniquely high efficiency and accuracy compared to other mammals. The protein CIRBP, implicated in protection from genotoxic stress, was present in very high abundance in the bowhead whale relative to other mammals. We show that CIRBP and its downstream protein RPA2, also present at high levels in bowhead cells, increase the efficiency and fidelity of DNA repair in human cells. These results indicate that rather than possessing additional tumor suppressor genes as barriers to oncogenesis, the bowhead whale relies on more accurate and efficient DNA repair to preserve genome integrity. This strategy which does not eliminate damaged cells but repairs them may be critical for the long and cancer-free lifespan of the bowhead whale.
Collapse
Affiliation(s)
- Denis Firsanov
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Max Zacher
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Xiao Tian
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Todd L. Sformo
- Department of Wildlife Management, North Slope Borough, Utqiaġvik (Barrow), AK 99723, USA
| | - Yang Zhao
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Greg Tombline
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - J. Yuyang Lu
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Zhizhong Zheng
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Luigi Perelli
- Department of Genitourinary Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Enrico Gurreri
- Department of Genitourinary Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Li Zhang
- Department of Genitourinary Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Jing Guo
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Anatoly Korotkov
- Department of Biology, University of Rochester, Rochester, NY, USA
| | | | | | - Zhihui Zhang
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Johanna Heid
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | - Alex Maslov
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | - Shixiang Sun
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | - Zhuoer Wu
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Jonathan Gigas
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Eric Hillpot
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - John Martinez
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Minseon Lee
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Alyssa Williams
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Abbey Gilman
- Department of Biology, University of Rochester, Rochester, NY, USA
| | | | - Ena Haseljic
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Avnee Patel
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Maggie Straight
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Nalani Miller
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Julia Ablaeva
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Lok Ming Tam
- Department of Biology, University of Rochester, Rochester, NY, USA
| | - Chloé Couderc
- Department of Biology, University of Rochester, Rochester, NY, USA
| | | | | | - Shingo Fujii
- Cancer Research Center of Marseille, Department of Genome Integrity, CNRS UMR7258, Inserm U1068, Institut Paoli-Calmettes, Aix Marseille Univ, Marseille, France
| | | | - Hongrui Liu
- Department of Biochemistry and Molecular Biology, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD 21205, USA
- Cross-Disciplinary Graduate Program in Biomedical Sciences, School of Medicine, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Yuxuan Cai
- Department of Biochemistry and Molecular Biology, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Anthony K. L. Leung
- Department of Biochemistry and Molecular Biology, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD 21205, USA
- McKusick-Nathans Institute of the Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
- Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
- Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | | | - Zhengdong Zhang
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | - C. Bradley Nelson
- Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
| | - Lisa M. Abegglen
- Department of Pediatrics & Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
- Peel Therapeutics, Inc., Salt Lake City, UT, USA
| | - Joshua D. Schiffman
- Department of Pediatrics & Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
- Peel Therapeutics, Inc., Salt Lake City, UT, USA
| | - Vadim N. Gladyshev
- Division of Genetics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Mauro Modesti
- Cancer Research Center of Marseille, Department of Genome Integrity, CNRS UMR7258, Inserm U1068, Institut Paoli-Calmettes, Aix Marseille Univ, Marseille, France
| | - Giannicola Genovese
- Department of Genitourinary Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Jan Vijg
- Department of Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | - Andrei Seluanov
- Department of Biology, University of Rochester, Rochester, NY, USA
- Department of Medicine, University of Rochester Medical Center, Rochester, NY, USA
| | - Vera Gorbunova
- Department of Biology, University of Rochester, Rochester, NY, USA
- Department of Medicine, University of Rochester Medical Center, Rochester, NY, USA
| |
Collapse
|
8
|
Knox B, Güil-Oumrait N, Basagaña X, Cserbik D, Dadvand P, Foraster M, Galmes T, Gascon M, Dolores Gómez-Roig M, Gómez-Herrera L, Småstuen Haug L, Llurba E, Márquez S, Rivas I, Sunyer J, Thomsen C, Julia Zanini M, Bustamante M, Vrijheid M. Prenatal exposure to per- and polyfluoroalkyl substances, fetoplacental hemodynamics, and fetal growth. ENVIRONMENT INTERNATIONAL 2024; 193:109090. [PMID: 39454342 DOI: 10.1016/j.envint.2024.109090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2024] [Revised: 09/25/2024] [Accepted: 10/18/2024] [Indexed: 10/28/2024]
Abstract
INTRODUCTION The impact of legacy per- and polyfluoroalkyl substances (PFAS) on fetal growth has been well studied, but assessments of next-generation PFAS and PFAS mixtures are sparse and the potential role of fetoplacental hemodynamics has not been studied. We aimed to evaluate associations between prenatal PFAS exposure and fetal growth and fetoplacental hemodynamics. METHODS We included 747 pregnant women from the BiSC birth cohort (Barcelona, Spain (2018-2021)). Twenty-three PFAS were measured at 32 weeks of pregnancy in maternal plasma, of which 13 were present above detectable levels. Fetal growth was measured by ultrasound, as estimated fetal weight at 32 and 37 weeks of gestation, and weight at birth. Doppler ultrasound measurements for uterine (UtA), umbilical (UmA), and middle cerebral artery (MCA) pulsatility indices (PI), as well as the cerebroplacental ratio (CPR - ratio MCA to UmA), were obtained at 32 weeks to assess fetoplacental hemodynamics. We applied linear mixed effects models to assess the association between singular PFAS and longitudinal fetal growth and PI, and Bayesian Weighted Quantile Sum models to evaluate associations between the PFAS mixture and the aforementioned outcomes, controlled for the relevant covariates. RESULTS Single PFAS and the mixture tended to be associated with reduced fetal growth and CPR PI, but few associations reached statistical significance. Legacy PFAS PFOS, PFHpA, and PFDoDa were associated with statistically significant decreases in fetal weight z-score of 0.13 (95%CI (-0.22, -0.04), 0.06 (-0.10, 0.01), and 0.05 (-0.10, 0.00), respectively, per doubling of concentration. The PFAS mixture was associated with a non-statistically significant 0.09 decrease in birth weight z-score (95%CI -0.22, 0.04) per quartile increase. CONCLUSION This study suggests that legacy PFAS may be associated with reduced fetal growth, but associations for next generation PFAS and for the PFAS mixture were less conclusive. Associations between PFAS and fetoplacental hemodynamics warrant further investigation.
Collapse
Affiliation(s)
- Bethany Knox
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Instituto de Salud Carlos III, Madrid, Spain.
| | - Nuria Güil-Oumrait
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Instituto de Salud Carlos III, Madrid, Spain; Department of Environmental Medicine and Public Health, Icahn School of Medicine at Mount Sinai, New York, NY, United States.
| | - Xavier Basagaña
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Instituto de Salud Carlos III, Madrid, Spain.
| | - Dora Cserbik
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Instituto de Salud Carlos III, Madrid, Spain.
| | - Payam Dadvand
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Instituto de Salud Carlos III, Madrid, Spain.
| | - Maria Foraster
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Instituto de Salud Carlos III, Madrid, Spain.
| | - Toni Galmes
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Instituto de Salud Carlos III, Madrid, Spain.
| | - Mireia Gascon
- Unitat de Suport a la Recerca de la Catalunya Central, Fundació Institut Universitari per a la Recerca a l'Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), Manresa, Spain.
| | - Maria Dolores Gómez-Roig
- BCNatal, Fetal Medicine Research Center, Hospital Sant Joan de Déu and Hospital Clínic, University of Barcelona, Barcelona, Spain; Primary Care Interventions to Prevent Maternal and Child Chronic Diseases of Perinatal and Developmental Origin Network (RICORS), RD21/0012/0003, Instituto de Salud Carlos III, Madrid, Spain; Institut de Recerca Sant Joan de Déu, Barcelona, Spain.
| | - Laura Gómez-Herrera
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Instituto de Salud Carlos III, Madrid, Spain.
| | - Line Småstuen Haug
- Norwegian Institute of Public Health (NIPH), Department of Food Safety, Oslo, Norway.
| | - Elisa Llurba
- Department of Obstetrics and Gynaecology. Institut d'Investigació Biomèdica Sant Pau - IIB Sant Pau. Hospital de la Santa Creu i Sant Pau, Barcelona, Spain; Primary Care Interventions to Prevent Maternal and Child Chronic Diseases d Developof Perinatal anmental Origin Network (RICORS), RD21/0012/0001, Instituto de Salud Carlos III, Madrid, Spain.
| | - Sandra Márquez
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Instituto de Salud Carlos III, Madrid, Spain.
| | - Ioar Rivas
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Instituto de Salud Carlos III, Madrid, Spain.
| | - Jordi Sunyer
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Instituto de Salud Carlos III, Madrid, Spain.
| | - Cathrine Thomsen
- Norwegian Institute of Public Health (NIPH), Department of Food Safety, Oslo, Norway.
| | - Maria Julia Zanini
- BCNatal, Fetal Medicine Research Center, Hospital Sant Joan de Déu and Hospital Clínic, University of Barcelona, Barcelona, Spain; Primary Care Interventions to Prevent Maternal and Child Chronic Diseases of Perinatal and Developmental Origin Network (RICORS), RD21/0012/0003, Instituto de Salud Carlos III, Madrid, Spain
| | - Mariona Bustamante
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Instituto de Salud Carlos III, Madrid, Spain.
| | - Martine Vrijheid
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Instituto de Salud Carlos III, Madrid, Spain.
| |
Collapse
|
9
|
Yan Y, Sankar BS, Mirza B, Ng DCM, Pelletier AR, Huang SD, Wang W, Watson K, Wang D, Ping P. Missing Values in Longitudinal Proteome Dynamics Studies: Making a Case for Data Multiple Imputation. J Proteome Res 2024; 23:4151-4162. [PMID: 39189460 PMCID: PMC11385379 DOI: 10.1021/acs.jproteome.4c00263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/28/2024]
Abstract
Temporal proteomics data sets are often confounded by the challenges of missing values. These missing data points, in a time-series context, can lead to fluctuations in measurements or the omission of critical events, thus hindering the ability to fully comprehend the underlying biomedical processes. We introduce a Data Multiple Imputation (DMI) pipeline designed to address this challenge in temporal data set turnover rate quantifications, enabling robust downstream analysis to gain novel discoveries. To demonstrate its utility and generalizability, we applied this pipeline to two use cases: a murine cardiac temporal proteomics data set and a human plasma temporal proteomics data set, both aimed at examining protein turnover rates. This DMI pipeline significantly enhanced the detection of protein turnover rate in both data sets, and furthermore, the imputed data sets captured new representation of proteins, leading to an augmented view of biological pathways, protein complex dynamics, as well as biomarker-disease associations. Importantly, DMI exhibited superior performance in benchmark data sets compared to single imputation methods (DSI). In summary, we have demonstrated that this DMI pipeline is effective at overcoming challenges introduced by missing values in temporal proteome dynamics studies.
Collapse
Affiliation(s)
- Yu Yan
- Departments of Physiology and Medicine, University of California, Los Angeles (UCLA) School of Medicine, Los Angeles, California 90095, United States
- NHLBI Integrated Cardiovascular Data Science Training Program, UCLA, Los Angeles, California 90095, United States
- NIH BRIDGE2AI Center & NHLBI Integrated Cardiovascular Data Science Training Program, UCLA, Suite 1-609, MRL Building, 675 Charles E. Young Drive South, Los Angeles, California 90095, United States
| | - Baradwaj Simha Sankar
- Departments of Physiology and Medicine, University of California, Los Angeles (UCLA) School of Medicine, Los Angeles, California 90095, United States
- NIH BRIDGE2AI Center & NHLBI Integrated Cardiovascular Data Science Training Program, UCLA, Suite 1-609, MRL Building, 675 Charles E. Young Drive South, Los Angeles, California 90095, United States
| | - Bilal Mirza
- Departments of Physiology and Medicine, University of California, Los Angeles (UCLA) School of Medicine, Los Angeles, California 90095, United States
- NHLBI Integrated Cardiovascular Data Science Training Program, UCLA, Los Angeles, California 90095, United States
| | - Dominic C M Ng
- Departments of Physiology and Medicine, University of California, Los Angeles (UCLA) School of Medicine, Los Angeles, California 90095, United States
- NHLBI Integrated Cardiovascular Data Science Training Program, UCLA, Los Angeles, California 90095, United States
- NIH BRIDGE2AI Center & NHLBI Integrated Cardiovascular Data Science Training Program, UCLA, Suite 1-609, MRL Building, 675 Charles E. Young Drive South, Los Angeles, California 90095, United States
| | - Alexander R Pelletier
- NHLBI Integrated Cardiovascular Data Science Training Program, UCLA, Los Angeles, California 90095, United States
- Department of Computer Science and Scalable Analytics Institute, UCLA School of Engineering, Los Angeles, California 90095, United States
| | - Sarah D Huang
- Departments of Physiology and Medicine, University of California, Los Angeles (UCLA) School of Medicine, Los Angeles, California 90095, United States
- NHLBI Integrated Cardiovascular Data Science Training Program, UCLA, Los Angeles, California 90095, United States
| | - Wei Wang
- NHLBI Integrated Cardiovascular Data Science Training Program, UCLA, Los Angeles, California 90095, United States
- Department of Computer Science and Scalable Analytics Institute, UCLA School of Engineering, Los Angeles, California 90095, United States
| | - Karol Watson
- Departments of Physiology and Medicine, University of California, Los Angeles (UCLA) School of Medicine, Los Angeles, California 90095, United States
- NIH BRIDGE2AI Center & NHLBI Integrated Cardiovascular Data Science Training Program, UCLA, Suite 1-609, MRL Building, 675 Charles E. Young Drive South, Los Angeles, California 90095, United States
| | - Ding Wang
- Departments of Physiology and Medicine, University of California, Los Angeles (UCLA) School of Medicine, Los Angeles, California 90095, United States
- NHLBI Integrated Cardiovascular Data Science Training Program, UCLA, Los Angeles, California 90095, United States
- NIH BRIDGE2AI Center & NHLBI Integrated Cardiovascular Data Science Training Program, UCLA, Suite 1-609, MRL Building, 675 Charles E. Young Drive South, Los Angeles, California 90095, United States
| | - Peipei Ping
- Departments of Physiology and Medicine, University of California, Los Angeles (UCLA) School of Medicine, Los Angeles, California 90095, United States
- NHLBI Integrated Cardiovascular Data Science Training Program, UCLA, Los Angeles, California 90095, United States
- NIH BRIDGE2AI Center & NHLBI Integrated Cardiovascular Data Science Training Program, UCLA, Suite 1-609, MRL Building, 675 Charles E. Young Drive South, Los Angeles, California 90095, United States
- Department of Computer Science and Scalable Analytics Institute, UCLA School of Engineering, Los Angeles, California 90095, United States
| |
Collapse
|
10
|
Langan LM, Lovin LM, Taylor RB, Scarlett KR, Kevin Chambliss C, Chatterjee S, Scott JT, Brooks BW. Proteome changes in larval zebrafish (Danio rerio) and fathead minnow (Pimephales promelas) exposed to (±) anatoxin-a. ENVIRONMENT INTERNATIONAL 2024; 185:108514. [PMID: 38394915 DOI: 10.1016/j.envint.2024.108514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 02/16/2024] [Accepted: 02/17/2024] [Indexed: 02/25/2024]
Abstract
Anatoxin-a and its analogues are potent neurotoxins produced by several genera of cyanobacteria. Due in part to its high toxicity and potential presence in drinking water, these toxins pose threats to public health, companion animals and the environment. It primarily exerts toxicity as a cholinergic agonist, with high affinity at neuromuscular junctions, but molecular mechanisms by which it elicits toxicological responses are not fully understood. To advance understanding of this cyanobacteria, proteomic characterization (DIA shotgun proteomics) of two common fish models (zebrafish and fathead minnow) was performed following (±) anatoxin-a exposure. Specifically, proteome changes were identified and quantified in larval fish exposed for 96 h (0.01-3 mg/L (±) anatoxin-a and caffeine (a methodological positive control) with environmentally relevant treatment levels examined based on environmental exposure distributions of surface water data. Proteomic concentration - response relationships revealed 48 and 29 proteins with concentration - response relationships curves for zebrafish and fathead minnow, respectively. In contrast, the highest number of differentially expressed proteins (DEPs) varied between zebrafish (n = 145) and fathead minnow (n = 300), with only fatheads displaying DEPs at all treatment levels. For both species, genes associated with reproduction were significantly downregulated, with pathways analysis that broadly clustered genes into groups associated with DNA repair mechanisms. Importantly, significant differences in proteome response between the species was also observed, consistent with prior observations of differences in response using both behavioral assays and gene expression, adding further support to model specific differences in organismal sensitivity and/or response. When DEPs were read across from humans to zebrafish, disease ontology enrichment identified diseases associated with cognition and muscle weakness consistent with the prior literature. Our observations highlight limited knowledge of how (±) anatoxin-a, a commonly used synthetic racemate surrogate, elicits responses at a molecular level and advances its toxicological understanding.
Collapse
Affiliation(s)
- Laura M Langan
- Department of Environmental Science, Baylor University, Waco, TX 76798, USA; Center for Reservoir and Aquatic Systems Research, Baylor University, Waco, TX 76798, USA; Department of Environmental Health Sciences, University of South Carolina, Columbia, SC 29208, USA.
| | - Lea M Lovin
- Department of Environmental Science, Baylor University, Waco, TX 76798, USA; Center for Reservoir and Aquatic Systems Research, Baylor University, Waco, TX 76798, USA; Department of Wildlife, Fish and Environmental Studies, Swedish University of Agricultural Sciences, Umeå, Sweden
| | - Raegyn B Taylor
- Center for Reservoir and Aquatic Systems Research, Baylor University, Waco, TX 76798, USA; Department of Chemistry, Baylor University, Waco, TX 76798, USA
| | - Kendall R Scarlett
- Department of Environmental Science, Baylor University, Waco, TX 76798, USA; Center for Reservoir and Aquatic Systems Research, Baylor University, Waco, TX 76798, USA
| | - C Kevin Chambliss
- Center for Reservoir and Aquatic Systems Research, Baylor University, Waco, TX 76798, USA; Department of Chemistry, Baylor University, Waco, TX 76798, USA
| | - Saurabh Chatterjee
- Department of Medicine, Department of Environmental and Occupational Health, University of California Irvine, Irvine, CA 92617, USA
| | - J Thad Scott
- Center for Reservoir and Aquatic Systems Research, Baylor University, Waco, TX 76798, USA; Department of Biology, Baylor University, Waco, TX 76798, USA
| | - Bryan W Brooks
- Department of Environmental Science, Baylor University, Waco, TX 76798, USA; Center for Reservoir and Aquatic Systems Research, Baylor University, Waco, TX 76798, USA.
| |
Collapse
|
11
|
Länger ZM, Baur M, Korša A, Eirich J, Lindeza AS, Zanchi C, Finkemeier I, Kurtz J. Differential proteome profiling of bacterial culture supernatants reveals candidates for the induction of oral immune priming in the red flour beetle. Biol Lett 2023; 19:20230322. [PMID: 37909056 PMCID: PMC10618857 DOI: 10.1098/rsbl.2023.0322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 10/12/2023] [Indexed: 11/02/2023] Open
Abstract
Most organisms are host to symbionts and pathogens, which led to the evolution of immune strategies to prevent harm. Whilst the immune defences of vertebrates are classically divided into innate and adaptive, insects lack specialized cells involved in adaptive immunity, but have been shown to exhibit immune priming: the enhanced survival upon infection after a first exposure to the same pathogen or pathogen-derived components. An important piece of the puzzle are the pathogen-associated molecules that induce these immune priming responses. Here, we make use of the model system consisting of the red flour beetle (Tribolium castaneum) and its bacterial pathogen Bacillus thuringiensis, to compare the proteomes of culture supernatants of two closely related B. thuringiensis strains that either induce priming via the oral route, or not. Among the proteins that might be immunostimulatory to T. castaneum, we identify the Cry3Aa toxin, an important plasmid-encoded virulence factor of B. thuringiensis. In further priming-infection assays we test the relevance of Cry-carrying plasmids for immune priming. Our findings provide valuable insights for future studies to perform experiments on the mechanisms and evolution of immune priming.
Collapse
Affiliation(s)
- Zoe Marie Länger
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstraße 1, 48149 Münster, Germany
| | - Moritz Baur
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstraße 1, 48149 Münster, Germany
| | - Ana Korša
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstraße 1, 48149 Münster, Germany
| | - Jürgen Eirich
- Institute of Plant Biology and Biotechnology, University of Münster, Schlossplatz 7, 48149 Münster, Germany
| | - Ana Sofia Lindeza
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstraße 1, 48149 Münster, Germany
| | - Caroline Zanchi
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstraße 1, 48149 Münster, Germany
| | - Iris Finkemeier
- Institute of Plant Biology and Biotechnology, University of Münster, Schlossplatz 7, 48149 Münster, Germany
| | - Joachim Kurtz
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstraße 1, 48149 Münster, Germany
| |
Collapse
|
12
|
Byrne JF, Mongan D, Murphy J, Healy C, Fӧcking M, Cannon M, Cotter DR. Prognostic models predicting transition to psychotic disorder using blood-based biomarkers: a systematic review and critical appraisal. Transl Psychiatry 2023; 13:333. [PMID: 37898606 PMCID: PMC10613280 DOI: 10.1038/s41398-023-02623-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Revised: 09/15/2023] [Accepted: 10/06/2023] [Indexed: 10/30/2023] Open
Abstract
Accumulating evidence suggests individuals with psychotic disorder show abnormalities in metabolic and inflammatory processes. Recently, several studies have employed blood-based predictors in models predicting transition to psychotic disorder in risk-enriched populations. A systematic review of the performance and methodology of prognostic models using blood-based biomarkers in the prediction of psychotic disorder from risk-enriched populations is warranted. Databases (PubMed, EMBASE and PsycINFO) were searched for eligible texts from 1998 to 15/05/2023, which detailed model development or validation studies. The checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies (CHARMS) was used to guide data extraction from eligible texts and the Prediction Model Risk of Bias Assessment Tool (PROBAST) was used to assess the risk of bias and applicability of the studies. A narrative synthesis of the included studies was performed. Seventeen eligible studies were identified: 16 eligible model development studies and one eligible model validation study. A wide range of biomarkers were assessed, including nucleic acids, proteins, metabolites, and lipids. The range of C-index (area under the curve) estimates reported for the models was 0.67-1.00. No studies assessed model calibration. According to PROBAST criteria, all studies were at high risk of bias in the analysis domain. While a wide range of potentially predictive biomarkers were identified in the included studies, most studies did not account for overfitting in model performance estimates, no studies assessed calibration, and all models were at high risk of bias according to PROBAST criteria. External validation of the models is needed to provide more accurate estimates of their performance. Future studies which follow the latest available methodological and reporting guidelines and adopt strategies to accommodate required sample sizes for model development or validation will clarify the value of including blood-based biomarkers in models predicting psychosis.
Collapse
Affiliation(s)
- Jonah F Byrne
- Department of Psychiatry, Royal College of Surgeons in Ireland, Dublin, Ireland.
- SFI FutureNeuro Research Centre, Royal College of Surgeons in Ireland, Dublin, Ireland.
| | - David Mongan
- Department of Psychiatry, Royal College of Surgeons in Ireland, Dublin, Ireland
- Centre for Public Health, Queen's University Belfast, Belfast, United Kingdom
| | - Jennifer Murphy
- Department of Psychiatry, Royal College of Surgeons in Ireland, Dublin, Ireland
| | - Colm Healy
- Department of Psychiatry, Royal College of Surgeons in Ireland, Dublin, Ireland
| | - Melanie Fӧcking
- Department of Psychiatry, Royal College of Surgeons in Ireland, Dublin, Ireland
| | - Mary Cannon
- Department of Psychiatry, Royal College of Surgeons in Ireland, Dublin, Ireland
- SFI FutureNeuro Research Centre, Royal College of Surgeons in Ireland, Dublin, Ireland
| | - David R Cotter
- Department of Psychiatry, Royal College of Surgeons in Ireland, Dublin, Ireland
- SFI FutureNeuro Research Centre, Royal College of Surgeons in Ireland, Dublin, Ireland
| |
Collapse
|
13
|
Anderson JR, Johnson E, Jenkins R, Jacobsen S, Green D, Walters M, Bundgaard L, Hausmans BAC, van den Akker G, Welting TJM, Chabronova A, Kharaz YA, Clarke EJ, James V, Peffers MJ. Multi-Omic Temporal Landscape of Plasma and Synovial Fluid-Derived Extracellular Vesicles Using an Experimental Model of Equine Osteoarthritis. Int J Mol Sci 2023; 24:14888. [PMID: 37834337 PMCID: PMC10573509 DOI: 10.3390/ijms241914888] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 09/26/2023] [Accepted: 09/28/2023] [Indexed: 10/15/2023] Open
Abstract
Extracellular vesicles (EVs) contribute to osteoarthritis pathogenesis through their release into joint tissues and synovial fluid. Synovial fluid-derived EVs have the potential to be direct biomarkers in the causal pathway of disease but also enable understanding of their role in disease progression. Utilizing a temporal model of osteoarthritis, we defined the changes in matched synovial fluid and plasma-derived EV small non-coding RNA and protein cargo using sequencing and mass spectrometry. Data exploration included time series clustering, factor analysis and gene enrichment interrogation. Chondrocyte signalling was analysed using luciferase-based transcription factor activity assays. EV protein cargo appears to be more important during osteoarthritis progression than small non-coding RNAs. Cluster analysis revealed plasma-EVs represented a time-dependent response to osteoarthritis induction associated with supramolecular complexes. Clusters for synovial fluid-derived EVs were associated with initial osteoarthritis response and represented immune/inflammatory pathways. Factor analysis for plasma-derived EVs correlated with day post-induction and were primarily composed of proteins modulating lipid metabolism. Synovial fluid-derived EVs factors represented intermediate filament and supramolecular complexes reflecting tissue repair. There was a significant interaction between time and osteoarthritis for CRE, NFkB, SRE, SRF with a trend for osteoarthritis synovial fluid-derived EVs at later time points to have a more pronounced effect.
Collapse
Affiliation(s)
- James R. Anderson
- Department of Musculoskeletal and Ageing Science, Institute of Life Course and Medical Sciences, University of Liverpool, Liverpool L7 8TX, UK (Y.A.K.)
| | - Emily Johnson
- Computational Biology Facility, Liverpool Shared Research Facilities, Faculty of Health and Life Sciences, University of Liverpool, Liverpool L7 8TX, UK
| | - Rosalind Jenkins
- CDSS Bioanalytical Facility, Liverpool Shared Research Facilities, Department Pharmacology and Therapeutics, University of Liverpool, Liverpool L7 8TX, UK
| | - Stine Jacobsen
- Department of Veterinary Clinical Sciences, University of Copenhagen, Taastrup, DK-1870 Copenhagen, Denmark
| | - Daniel Green
- Department of Musculoskeletal and Ageing Science, Institute of Life Course and Medical Sciences, University of Liverpool, Liverpool L7 8TX, UK (Y.A.K.)
| | - Marie Walters
- Department of Veterinary Clinical Sciences, University of Copenhagen, Taastrup, DK-1870 Copenhagen, Denmark
| | - Louise Bundgaard
- Department of Veterinary Clinical Sciences, University of Copenhagen, Taastrup, DK-1870 Copenhagen, Denmark
| | - Bas A. C. Hausmans
- Laboratory for Experimental Orthopedics, Department of Orthopedic Surgery, Maastricht University, 6229 Maastricht, The Netherlands; (B.A.C.H.)
| | - Guus van den Akker
- Laboratory for Experimental Orthopedics, Department of Orthopedic Surgery, Maastricht University, 6229 Maastricht, The Netherlands; (B.A.C.H.)
| | - Tim J. M. Welting
- Laboratory for Experimental Orthopedics, Department of Orthopedic Surgery, Maastricht University, 6229 Maastricht, The Netherlands; (B.A.C.H.)
| | - Alzbeta Chabronova
- Department of Musculoskeletal and Ageing Science, Institute of Life Course and Medical Sciences, University of Liverpool, Liverpool L7 8TX, UK (Y.A.K.)
| | - Yalda A. Kharaz
- Department of Musculoskeletal and Ageing Science, Institute of Life Course and Medical Sciences, University of Liverpool, Liverpool L7 8TX, UK (Y.A.K.)
| | - Emily J. Clarke
- Department of Musculoskeletal and Ageing Science, Institute of Life Course and Medical Sciences, University of Liverpool, Liverpool L7 8TX, UK (Y.A.K.)
| | - Victoria James
- School of Veterinary Medicine and Science, University of Nottingham, Sutton Bonington, Loughborough, Nottingham LE12 5RD, UK
| | - Mandy J. Peffers
- Department of Musculoskeletal and Ageing Science, Institute of Life Course and Medical Sciences, University of Liverpool, Liverpool L7 8TX, UK (Y.A.K.)
| |
Collapse
|
14
|
Wang J, Yu W, D'Anna R, Przybyla A, Wilson M, Sung M, Bullen J, Hurt E, D'Angelo G, Sidders B, Lai Z, Zhong W. Pan-Cancer Proteomics Analysis to Identify Tumor-Enriched and Highly Expressed Cell Surface Antigens as Potential Targets for Cancer Therapeutics. Mol Cell Proteomics 2023; 22:100626. [PMID: 37517589 PMCID: PMC10494184 DOI: 10.1016/j.mcpro.2023.100626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Revised: 07/23/2023] [Accepted: 07/25/2023] [Indexed: 08/01/2023] Open
Abstract
The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC) provides unique opportunities for cancer target discovery using protein expression. Proteomics data from CPTAC tumor types have been primarily generated using a multiplex tandem mass tag (TMT) approach, which is designed to provide protein quantification relative to reference samples. However, relative protein expression data are suboptimal for prioritization of targets within a tissue type, which requires additional reprocessing of the original proteomics data to derive absolute quantitation estimation. We evaluated the feasibility of using differential protein analysis coupled with intensity-based absolute quantification (iBAQ) to identify tumor-enriched and highly expressed cell surface antigens, employing tandem mass tag (TMT) proteomics data from CPTAC. Absolute quantification derived from TMT proteomics data was highly correlated with that of label-free proteomics data from the CPTAC colon adenocarcinoma cohort, which contains proteomics data measured by both approaches. We validated the TMT-iBAQ approach by comparing the iBAQ value to the receptor density value of HER2 and TROP2 measured by flow cytometry in about 30 selected breast and lung cancer cell lines from the Cancer Cell Line Encyclopedia. Collections of these tumor-enriched and highly expressed cell surface antigens could serve as a valuable resource for the development of cancer therapeutics, including antibody-drug conjugates and immunotherapeutic agents.
Collapse
Affiliation(s)
- Jixin Wang
- Oncology Data Science, AstraZeneca, Gaithersburg, Maryland, USA
| | - Wen Yu
- Data Science and AI, BioPharmaceuticals R&D, AstraZeneca, Gaithersburg, Maryland, USA
| | - Rachel D'Anna
- Oncology Data Science, AstraZeneca, Gaithersburg, Maryland, USA
| | | | - Matt Wilson
- Early TDE Discovery, AstraZeneca, Cambridge, UK
| | | | - John Bullen
- Early TTD Discovery, AstraZeneca, Cambridge, UK
| | - Elaine Hurt
- Early TTD Discovery, AstraZeneca, Cambridge, UK
| | - Gina D'Angelo
- Late Oncology Statistics, Oncology R&D, AstraZeneca, Gaithersburg, Maryland, USA
| | - Ben Sidders
- Oncology Data Science, Oncology R&D, AstraZeneca, Cambridge, UK
| | - Zhongwu Lai
- Oncology Data Science, Oncology R&D, AstraZeneca, Waltham, Massachusetts, USA
| | - Wenyan Zhong
- Oncology Data Science, Oncology R&D, AstraZeneca, New York, New York, USA.
| |
Collapse
|
15
|
Kong W, Wong BJH, Hui HWH, Lim KP, Wang Y, Wong L, Goh WWB. ProJect: a powerful mixed-model missing value imputation method. Brief Bioinform 2023:bbad233. [PMID: 37419612 DOI: 10.1093/bib/bbad233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Revised: 05/24/2023] [Accepted: 06/05/2023] [Indexed: 07/09/2023] Open
Abstract
Missing values (MVs) can adversely impact data analysis and machine-learning model development. We propose a novel mixed-model method for missing value imputation (MVI). This method, ProJect (short for Protein inJection), is a powerful and meaningful improvement over existing MVI methods such as Bayesian principal component analysis (PCA), probabilistic PCA, local least squares and quantile regression imputation of left-censored data. We rigorously tested ProJect on various high-throughput data types, including genomics and mass spectrometry (MS)-based proteomics. Specifically, we utilized renal cancer (RC) data acquired using DIA-SWATH, ovarian cancer (OC) data acquired using DIA-MS, bladder (BladderBatch) and glioblastoma (GBM) microarray gene expression dataset. Our results demonstrate that ProJect consistently performs better than other referenced MVI methods. It achieves the lowest normalized root mean square error (on average, scoring 45.92% less error in RC_C, 27.37% in RC_full, 29.22% in OC, 23.65% in BladderBatch and 20.20% in GBM relative to the closest competing method) and the Procrustes sum of squared error (Procrustes SS) (exhibits 79.71% less error in RC_C, 38.36% in RC full, 18.13% in OC, 74.74% in BladderBatch and 30.79% in GBM compared to the next best method). ProJect also leads with the highest correlation coefficient among all types of MV combinations (0.64% higher in RC_C, 0.24% in RC full, 0.55% in OC, 0.39% in BladderBatch and 0.27% in GBM versus the second-best performing method). ProJect's key strength is its ability to handle different types of MVs commonly found in real-world data. Unlike most MVI methods that are designed to handle only one type of MV, ProJect employs a decision-making algorithm that first determines if an MV is missing at random or missing not at random. It then employs targeted imputation strategies for each MV type, resulting in more accurate and reliable imputation outcomes. An R implementation of ProJect is available at https://github.com/miaomiao6606/ProJect.
Collapse
Affiliation(s)
- Weijia Kong
- School of Biological Sciences, Nanyang Technological University, Singapore
- Department of Computer Science, National University of Singapore, Singapore
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore
| | | | | | - Kai Peng Lim
- School of Biological Sciences, Nanyang Technological University, Singapore
| | - Yulan Wang
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore
| | - Limsoon Wong
- Department of Computer Science, National University of Singapore, Singapore
| | - Wilson Wen Bin Goh
- School of Biological Sciences, Nanyang Technological University, Singapore
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore
- Center for Biomedical Informatics, Nanyang Technological University, Singapore
| |
Collapse
|
16
|
Fowler SL, Behr TS, Turkes E, Cauhy PM, Foiani MS, Schaler A, Crowley G, Bez S, Ficulle E, Tsefou E, O'Brien DP, Fischer R, Geary B, Gaur P, Miller C, D'Acunzo P, Levy E, Duff KE, Ryskeldi-Falcon B. Tau filaments are tethered within brain extracellular vesicles in Alzheimer's disease. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.30.537820. [PMID: 37163117 PMCID: PMC10168373 DOI: 10.1101/2023.04.30.537820] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
The abnormal assembly of tau protein in neurons is the pathological hallmark of multiple neurodegenerative diseases, including Alzheimer's disease (AD). In addition, assembled tau associates with extracellular vesicles (EVs) in the central nervous system of patients with AD, which is linked to its clearance and prion-like propagation between neurons. However, the identities of the assembled tau species and the EVs, as well as how they associate, are not known. Here, we combined quantitative mass spectrometry, cryo-electron tomography and single-particle cryo-electron microscopy to study brain EVs from AD patients. We found filaments of truncated tau enclosed within EVs enriched in endo-lysosomal proteins. We observed multiple filament interactions, including with molecules that tethered filaments to the EV limiting membrane, suggesting selective packaging. Our findings will guide studies into the molecular mechanisms of EV-mediated secretion of assembled tau and inform the targeting of EV-associated tau as potential therapeutic and biomarker strategies for AD.
Collapse
|
17
|
Kong W, Hui HWH, Peng H, Goh WWB. Dealing with missing values in proteomics data. Proteomics 2022; 22:e2200092. [PMID: 36349819 DOI: 10.1002/pmic.202200092] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 09/15/2022] [Accepted: 10/11/2022] [Indexed: 11/10/2022]
Abstract
Proteomics data are often plagued with missingness issues. These missing values (MVs) threaten the integrity of subsequent statistical analyses by reduction of statistical power, introduction of bias, and failure to represent the true sample. Over the years, several categories of missing value imputation (MVI) methods have been developed and adapted for proteomics data. These MVI methods perform their tasks based on different prior assumptions (e.g., data is normally or independently distributed) and operating principles (e.g., the algorithm is built to address random missingness only), resulting in varying levels of performance even when dealing with the same dataset. Thus, to achieve a satisfactory outcome, a suitable MVI method must be selected. To guide decision making on suitable MVI method, we provide a decision chart which facilitates strategic considerations on datasets presenting different characteristics. We also bring attention to other issues that can impact proper MVI such as the presence of confounders (e.g., batch effects) which can influence MVI performance. Thus, these too, should be considered during or before MVI.
Collapse
Affiliation(s)
- Weijia Kong
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore.,School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Harvard Wai Hann Hui
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore.,School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Hui Peng
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore.,School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Wilson Wen Bin Goh
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore.,School of Biological Sciences, Nanyang Technological University, Singapore, Singapore.,Centre for Biomedical Informatics, Nanyang Technological University, Singapore, Singapore
| |
Collapse
|
18
|
Casañas JJ, Montesinos ML. Proteomic characterization of spinal cord synaptoneurosomes from Tg-SOD1/G93A mice supports a role for MNK1 and local translation in the early stages of amyotrophic lateral sclerosis. Mol Cell Neurosci 2022; 123:103792. [PMID: 36372157 DOI: 10.1016/j.mcn.2022.103792] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Revised: 11/03/2022] [Accepted: 11/06/2022] [Indexed: 11/13/2022] Open
Abstract
The isolation of synaptoneurosomes (SNs) represents a useful means to study synaptic events. However, the size and density of synapses varies in different regions of the central nervous system (CNS), and this also depends on the experimental species studied, making it difficult to define a generic protocol for SNs preparation. To characterize synaptic failure in the spinal cord (SC) in the Tg-SOD1/G93A mouse model of amyotrophic lateral sclerosis (ALS), we applied a method we originally designed to isolate cortical and hippocampal SNs to SC tissue. Interestingly, we found that the SC SNs were isolated in a different gradient fraction to the cortical/hippocampal SNs. We compared the relative levels of synaptoneurosomal proteins in wild type (WT) animals, with control (Tg-SOD1) or Tg-SOD1/G93A mice at onset and those that were symptomatic using iTRAQ proteomics. The results obtained suggest that an important regulator of local synaptic translation, MNK1 (MAP kinase interacting serine/threonine kinase 1), might well influence the early stages of ALS.
Collapse
Affiliation(s)
- Juan José Casañas
- Departamento de Fisiología Médica y Biofísica, Universidad de Sevilla, E-41009 Sevilla, Spain; Instituto de Biomedicina de Sevilla, IBIS/Hospital Universitario Virgen del Rocío/CSIC/Universidad de Sevilla, Sevilla, Spain
| | - María Luz Montesinos
- Departamento de Fisiología Médica y Biofísica, Universidad de Sevilla, E-41009 Sevilla, Spain; Instituto de Biomedicina de Sevilla, IBIS/Hospital Universitario Virgen del Rocío/CSIC/Universidad de Sevilla, Sevilla, Spain.
| |
Collapse
|
19
|
Chion M, Carapito C, Bertrand F. Accounting for multiple imputation-induced variability for differential analysis in mass spectrometry-based label-free quantitative proteomics. PLoS Comput Biol 2022; 18:e1010420. [PMID: 36037245 PMCID: PMC9462777 DOI: 10.1371/journal.pcbi.1010420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Revised: 09/09/2022] [Accepted: 07/21/2022] [Indexed: 11/20/2022] Open
Abstract
Imputing missing values is common practice in label-free quantitative proteomics. Imputation aims at replacing a missing value with a user-defined one. However, the imputation itself may not be optimally considered downstream of the imputation process, as imputed datasets are often considered as if they had always been complete. Hence, the uncertainty due to the imputation is not adequately taken into account. We provide a rigorous multiple imputation strategy, leading to a less biased estimation of the parameters' variability thanks to Rubin's rules. The imputation-based peptide's intensities' variance estimator is then moderated using Bayesian hierarchical models. This estimator is finally included in moderated t-test statistics to provide differential analyses results. This workflow can be used both at peptide and protein-level in quantification datasets. Indeed, an aggregation step is included for protein-level results based on peptide-level quantification data. Our methodology, named mi4p, was compared to the state-of-the-art limma workflow implemented in the DAPAR R package, both on simulated and real datasets. We observed a trade-off between sensitivity and specificity, while the overall performance of mi4p outperforms DAPAR in terms of F-Score.
Collapse
Affiliation(s)
- Marie Chion
- Institut de Recherche Mathématique Avancée, UMR 7501, CNRS-Université de Strasbourg, Strasbourg, France
- Laboratoire de Spectrométrie de Masse Bio-Organique, Institut Pluridisciplinaire Hubert Curien, UMR 7178, CNRS-Université de Strasbourg, Strasbourg, France
- Laboratoire Mathématiques appliquées à Paris 5, UMR 8145, CNRS-Université Paris Cité, Paris, France
| | - Christine Carapito
- Laboratoire de Spectrométrie de Masse Bio-Organique, Institut Pluridisciplinaire Hubert Curien, UMR 7178, CNRS-Université de Strasbourg, Strasbourg, France
- Infrastructure Nationale de Protéomique ProFi - FR2048, 67087 Strasbourg, France
| | - Frédéric Bertrand
- Institut de Recherche Mathématique Avancée, UMR 7501, CNRS-Université de Strasbourg, Strasbourg, France
- Laboratoire Informatique et Société Numérique, Université de Technologie de Troyes, Troyes, France
| |
Collapse
|
20
|
Hamid Z, Zimmerman KD, Guillen-Ahlers H, Li C, Nathanielsz P, Cox LA, Olivier M. Assessment of label-free quantification and missing value imputation for proteomics in non-human primates. BMC Genomics 2022; 23:496. [PMID: 35804317 PMCID: PMC9264528 DOI: 10.1186/s12864-022-08723-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Accepted: 06/23/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Reliable and effective label-free quantification (LFQ) analyses are dependent not only on the method of data acquisition in the mass spectrometer, but also on the downstream data processing, including software tools, query database, data normalization and imputation. In non-human primates (NHP), LFQ is challenging because the query databases for NHP are limited since the genomes of these species are not comprehensively annotated. This invariably results in limited discovery of proteins and associated Post Translational Modifications (PTMs) and a higher fraction of missing data points. While identification of fewer proteins and PTMs due to database limitations can negatively impact uncovering important and meaningful biological information, missing data also limits downstream analyses (e.g., multivariate analyses), decreases statistical power, biases statistical inference, and makes biological interpretation of the data more challenging. In this study we attempted to address both issues: first, we used the MetaMorphues proteomics search engine to counter the limits of NHP query databases and maximize the discovery of proteins and associated PTMs, and second, we evaluated different imputation methods for accurate data inference. We used a generic approach for missing data imputation analysis without distinguising the potential source of missing data (either non-assigned m/z or missing values across runs). RESULTS Using the MetaMorpheus proteomics search engine we obtained quantitative data for 1622 proteins and 10,634 peptides including 58 different PTMs (biological, metal and artifacts) across a diverse age range of NHP brain frontal cortex. However, among the 1622 proteins identified, only 293 proteins were quantified across all samples with no missing values, emphasizing the importance of implementing an accurate and statiscaly valid imputation method to fill in missing data. In our imputation analysis we demonstrate that Single Imputation methods that borrow information from correlated proteins such as Generalized Ridge Regression (GRR), Random Forest (RF), local least squares (LLS), and a Bayesian Principal Component Analysis methods (BPCA), are able to estimate missing protein abundance values with great accuracy. CONCLUSIONS Overall, this study offers a detailed comparative analysis of LFQ data generated in NHP and proposes strategies for improved LFQ in NHP proteomics data.
Collapse
Affiliation(s)
- Zeeshan Hamid
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Kip D Zimmerman
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Hector Guillen-Ahlers
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Cun Li
- Southwest National Primate Research Center, San Antonio, TX, USA
- Department of Animal Science, University of Wyoming, Laramie, WY, USA
| | - Peter Nathanielsz
- Southwest National Primate Research Center, San Antonio, TX, USA
- Department of Animal Science, University of Wyoming, Laramie, WY, USA
| | - Laura A Cox
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
- Southwest National Primate Research Center, San Antonio, TX, USA
| | - Michael Olivier
- Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA.
| |
Collapse
|
21
|
HarmonizR enables data harmonization across independent proteomic datasets with appropriate handling of missing values. Nat Commun 2022; 13:3523. [PMID: 35725563 PMCID: PMC9209422 DOI: 10.1038/s41467-022-31007-x] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Accepted: 05/25/2022] [Indexed: 01/01/2023] Open
Abstract
Dataset integration is common practice to overcome limitations in statistically underpowered omics datasets. Proteome datasets display high technical variability and frequent missing values. Sophisticated strategies for batch effect reduction are lacking or rely on error-prone data imputation. Here we introduce HarmonizR, a data harmonization tool with appropriate missing value handling. The method exploits the structure of available data and matrix dissection for minimal data loss, without data imputation. This strategy implements two common batch effect reduction methods—ComBat and limma (removeBatchEffect()). The HarmonizR strategy, evaluated on four exemplarily analyzed datasets with up to 23 batches, demonstrated successful data harmonization for different tissue preservation techniques, LC-MS/MS instrumentation setups, and quantification approaches. Compared to data imputation methods, HarmonizR was more efficient and performed superior regarding the detection of significant proteins. HarmonizR is an efficient tool for missing data tolerant experimental variance reduction and is easily adjustable for individual dataset properties and user preferences. Dataset integration is common practice to overcome limitations in statistically underpowered omics datasets. Here the authors present “HarmonizR”, a tool for missing data tolerant experimental variance reduction in large, integrated but independently generated datasets without data imputation, adjustable for individual dataset modalities, correction algorithm, and user preferences.
Collapse
|
22
|
Holzer E, Rumpf-Kienzl C, Falk S, Dammermann A. A modified TurboID approach identifies tissue-specific centriolar components in C. elegans. PLoS Genet 2022; 18:e1010150. [PMID: 35442950 PMCID: PMC9020716 DOI: 10.1371/journal.pgen.1010150] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 03/15/2022] [Indexed: 01/26/2023] Open
Abstract
Proximity-dependent labeling approaches such as BioID have been a great boon to studies of protein-protein interactions in the context of cytoskeletal structures such as centrosomes which are poorly amenable to traditional biochemical approaches like immunoprecipitation and tandem affinity purification. Yet, these methods have so far not been applied extensively to invertebrate experimental models such as C. elegans given the long labeling times required for the original promiscuous biotin ligase variant BirA*. Here, we show that the recently developed variant TurboID successfully probes the interactomes of both stably associated (SPD-5) and dynamically localized (PLK-1) centrosomal components. We further develop an indirect proximity labeling method employing a GFP nanobody-TurboID fusion, which allows the identification of protein interactors in a tissue-specific manner in the context of the whole animal. Critically, this approach utilizes available endogenous GFP fusions, avoiding the need to generate multiple additional strains for each target protein and the potential complications associated with overexpressing the protein from transgenes. Using this method, we identify homologs of two highly conserved centriolar components, Cep97 and BLD10/Cep135, which are present in various somatic tissues of the worm. Surprisingly, neither protein is expressed in early embryos, likely explaining why these proteins have escaped attention until now. Our work expands the experimental repertoire for C. elegans and opens the door for further studies of tissue-specific variation in centrosome architecture.
Collapse
Affiliation(s)
- Elisabeth Holzer
- Max Perutz Labs, University of Vienna, Vienna Biocenter (VBC), Vienna, Austria
| | | | - Sebastian Falk
- Max Perutz Labs, University of Vienna, Vienna Biocenter (VBC), Vienna, Austria
| | | |
Collapse
|