1
|
Yu J, Chen T, Zhou H, Li S, Wu B, Xiong Y. Metabolomic biomarkers and altered phenylalanine metabolic pathway in preschool children with atopic dermatitis - A pilot study. Indian J Dermatol Venereol Leprol 2024; 0:1-8. [PMID: 39361866 DOI: 10.25259/ijdvl_1125_2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Accepted: 06/12/2024] [Indexed: 10/05/2024]
Abstract
Background Atopic dermatitis (AD) has high prevalence in children. Current AD diagnosis and management focuses only on clinical phenotypes, but do not explore the endophenotypes, which are more important because they are a series of biomarkers linking clinical phenotype and genotype Aims Metabolomics can qualitatively and quantitatively capture real-time dynamic changes in a wide range of small molecule metabolites. This pilot study evaluated metabolomics biomarkers and altered metabolic pathways in preschool children with AD, aiming to explore the underlying molecular mechanisms and signalling pathways of the disease. Methods Blood samples of 23 preschool children with AD and 23 healthy children without AD or any other skin disease were collected. The untargeted metabolomic measurements were performed on a SCIEX-AD ultraperformance liquid chromatography system coupled with an AB SCIEX X500B QTOF system. Characteristics of small molecules in AD children were assessed and their associations with AD clinical index were evaluated. Altered metabolic pathways in AD children were also analysed using a comprehensive metabolomics platform. Results A total of 1,969 metabolites were identified, of which AD children exhibited 377 significantly altered metabolites. Multivariate statistical analysis demonstrated that the AD group and the control group could be clearly separated. Volcano plot analysis illustrated that 144 metabolites were up-regulated and 233 metabolites were down-regulated in AD children. The Severity Scoring of Atopic Dermatitis (SCORAD index) showed a moderate-to-strong association with estrogens, carotenes, leukotrienes, flavonols and keto acids in AD children (|r|=0.440-0.557). Several pathways, including the phenylalanine metabolism, were identified as altered in AD children. Limitations A small group of children was included in the study; the results need to be validated in larger sample sizes. Conclusion Results of this study illustrate potential alterations in metabolites and the phenylalanine metabolic pathway in preschool children with AD. Although this is a pilot study with a limited sample size, it may provide a new perspective for exploring the pathogenesis of AD, and for personalised treatment modalities.
Collapse
Affiliation(s)
- Jia Yu
- Department of Dermatology, Shenzhen Maternity & Child Healthcare Hospital, Shenzhen, China
| | - Ting Chen
- Department of Dermatology, Shenzhen Maternity & Child Healthcare Hospital, Shenzhen, China
| | - He Zhou
- Shenzhen Mass Medical Co. Ltd., Shenzhen, China
| | - Sujun Li
- Translational Medicine Institute of Jiangxi, The First Affiliated Hospital of Nanchang University, Nanchang, China
| | - Bo Wu
- Department of Dermatology, Shenzhen Maternity & Child Healthcare Hospital, Shenzhen, China
| | - Ying Xiong
- Department of Dermatology, Shenzhen Maternity & Child Healthcare Hospital, Shenzhen, China
| |
Collapse
|
2
|
Fields L, Ma M, DeLaney K, Phetsanthad A, Li L. A crustacean neuropeptide spectral library for data-independent acquisition (DIA) mass spectrometry applications. Proteomics 2024; 24:e2300285. [PMID: 38171828 PMCID: PMC11219527 DOI: 10.1002/pmic.202300285] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 11/06/2023] [Accepted: 12/05/2023] [Indexed: 01/05/2024]
Abstract
Neuropeptides have tremendous potential for application in modern medicine, including utility as biomarkers and therapeutics. To overcome the inherent challenges associated with neuropeptide identification and characterization, data-independent acquisition (DIA) is a fitting mass spectrometry (MS) method of choice to achieve sensitive and accurate analysis. It is advantageous for preliminary neuropeptidomic studies to occur in less complex organisms, with crustacean models serving as a popular choice due to their relatively simple nervous system. With spectral libraries serving as a means to interpret DIA-MS output spectra, and Cancer borealis as a model of choice for neuropeptide analysis, we performed the first spectral library mapping of crustacean neuropeptides. Leveraging pre-existing data-dependent acquisition (DDA) spectra, a spectral library was built using PEAKS Online. The library is comprised of 333 unique neuropeptides. The identification results obtained through the use of this spectral library were compared with those achieved through library-free analysis of crustacean brain, pericardial organs (PO), and thoracic ganglia (TG) tissues. A statistically significant increase (Student's t-test, P value < 0.05) in the number of identifications achieved from the TG data was observed in the spectral library results. Furthermore, in each of the tissues, a distinctly different set of identifications was found in the library search compared to the library-free search. This work highlights the necessity for the use of spectral libraries in neuropeptide analysis, illustrating the advantage of spectral libraries for interpreting DIA spectra in a reproducible manner with greater neuropeptidomic depth.
Collapse
Affiliation(s)
- Lauren Fields
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, United States
| | - Min Ma
- School of Pharmacy, University of Wisconsin-Madison, Madison, WI, 53705, United States
| | - Kellen DeLaney
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, United States
| | - Ashley Phetsanthad
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, United States
| | - Lingjun Li
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, United States
- School of Pharmacy, University of Wisconsin-Madison, Madison, WI, 53705, United States
| |
Collapse
|
3
|
Liu K, Ye Y, Li S, Tang H. Accurate de novo peptide sequencing using fully convolutional neural networks. Nat Commun 2023; 14:7974. [PMID: 38042873 PMCID: PMC10693636 DOI: 10.1038/s41467-023-43010-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 10/29/2023] [Indexed: 12/04/2023] Open
Abstract
De novo peptide sequencing, which does not rely on a comprehensive target sequence database, provides us with a way to identify novel peptides from tandem mass spectra. However, current de novo sequencing algorithms suffer from low accuracy and coverage, which hinders their application in proteomics. In this paper, we present PepNet, a fully convolutional neural network for high accuracy de novo peptide sequencing. PepNet takes an MS/MS spectrum (represented as a high-dimensional vector) as input, and outputs the optimal peptide sequence along with its confidence score. The PepNet model is trained using a total of 3 million high-energy collisional dissociation MS/MS spectra from multiple human peptide spectral libraries. Evaluation results show that PepNet significantly outperforms current best-performing de novo sequencing algorithms (e.g. PointNovo and DeepNovo) in both peptide-level accuracy and positional-level accuracy. PepNet can sequence a large fraction of spectra that were not identified by database search engines, and thus could be used as a complementary tool to database search engines for peptide identification in proteomics. In addition, PepNet runs around 3x and 7x faster than PointNovo and DeepNovo on GPUs, respectively, thus being more suitable for the analysis of large-scale proteomics data.
Collapse
Affiliation(s)
- Kaiyuan Liu
- Luddy School of Informatics, Computing, and Engineering, Indiana University, Bloomington, 47408, IN, USA
| | - Yuzhen Ye
- Luddy School of Informatics, Computing, and Engineering, Indiana University, Bloomington, 47408, IN, USA
| | - Sujun Li
- Luddy School of Informatics, Computing, and Engineering, Indiana University, Bloomington, 47408, IN, USA
- Dengding BioAI Co., Ltd., Bloomington, USA
| | - Haixu Tang
- Luddy School of Informatics, Computing, and Engineering, Indiana University, Bloomington, 47408, IN, USA.
| |
Collapse
|
4
|
Geer LY, Lapin J, Slotta DJ, Mak TD, Stein SE. AIomics: Exploring More of the Proteome Using Mass Spectral Libraries Extended by Artificial Intelligence. J Proteome Res 2023; 22:2246-2255. [PMID: 37232537 PMCID: PMC10542943 DOI: 10.1021/acs.jproteome.2c00807] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
The unbounded permutations of biological molecules, including proteins and their constituent peptides, present a dilemma in identifying the components of complex biosamples. Sequence search algorithms used to identify peptide spectra can be expanded to cover larger classes of molecules, including more modifications, isoforms, and atypical cleavage, but at the cost of false positives or false negatives due to the simplified spectra they compute from sequence records. Spectral library searching can help solve this issue by precisely matching experimental spectra to library spectra with excellent sensitivity and specificity. However, compiling spectral libraries that span entire proteomes is pragmatically difficult. Neural networks that predict complete spectra containing a full range of annotated and unannotated ions can be used to replace these simplified spectra with libraries of fully predicted spectra, including modified peptides. Using such a network, we created predicted spectral libraries that were used to rescore matches from a sequence search done over a large search space, including a large number of modifications. Rescoring improved the separation of true and false hits by 82%, yielding an 8% increase in peptide identifications, including a 21% increase in nonspecifically cleaved peptides and a 17% increase in phosphopeptides.
Collapse
Affiliation(s)
- Lewis Y. Geer
- Mass Spectrometry Data Center, National Institute of Standards and Technology, Biomolecular Measurement Division, 100 Bureau Dr., Gaithersburg, Maryland 20899, United States
| | - Joel Lapin
- Department of Physics, Georgetown University, Washington, DC 20057, United States
- Associate, Mass Spectrometry Data Center, National Institute of Standards and Technology, Biomolecular Measurement Division, 100 Bureau Dr., Gaithersburg, Maryland 20899, United States
| | - Douglas J. Slotta
- Mass Spectrometry Data Center, National Institute of Standards and Technology, Biomolecular Measurement Division, 100 Bureau Dr., Gaithersburg, Maryland 20899, United States
| | - Tytus D. Mak
- Mass Spectrometry Data Center, National Institute of Standards and Technology, Biomolecular Measurement Division, 100 Bureau Dr., Gaithersburg, Maryland 20899, United States
| | - Stephen E. Stein
- Mass Spectrometry Data Center, National Institute of Standards and Technology, Biomolecular Measurement Division, 100 Bureau Dr., Gaithersburg, Maryland 20899, United States
| |
Collapse
|
5
|
Nowatzky Y, Benner P, Reinert K, Muth T. Mistle: bringing spectral library predictions to metaproteomics with an efficient search index. Bioinformatics 2023; 39:btad376. [PMID: 37294786 PMCID: PMC10313348 DOI: 10.1093/bioinformatics/btad376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 05/11/2023] [Accepted: 06/08/2023] [Indexed: 06/11/2023] Open
Abstract
MOTIVATION Deep learning has moved to the forefront of tandem mass spectrometry-driven proteomics and authentic prediction for peptide fragmentation is more feasible than ever. Still, at this point spectral prediction is mainly used to validate database search results or for confined search spaces. Fully predicted spectral libraries have not yet been efficiently adapted to large search space problems that often occur in metaproteomics or proteogenomics. RESULTS In this study, we showcase a workflow that uses Prosit for spectral library predictions on two common metaproteomes and implement an indexing and search algorithm, Mistle, to efficiently identify experimental mass spectra within the library. Hence, the workflow emulates a classic protein sequence database search with protein digestion but builds a searchable index from spectral predictions as an in-between step. We compare Mistle to popular search engines, both on a spectral and database search level, and provide evidence that this approach is more accurate than a database search using MSFragger. Mistle outperforms other spectral library search engines in terms of run time and proves to be extremely memory efficient with a 4- to 22-fold decrease in RAM usage. This makes Mistle universally applicable to large search spaces, e.g. covering comprehensive sequence databases of diverse microbiomes. AVAILABILITY AND IMPLEMENTATION Mistle is freely available on GitHub at https://github.com/BAMeScience/Mistle.
Collapse
Affiliation(s)
- Yannek Nowatzky
- Section S.3 eScience, Federal Institute for Materials Research and Testing (BAM), Berlin 12205, Germany
| | - Philipp Benner
- Section S.3 eScience, Federal Institute for Materials Research and Testing (BAM), Berlin 12205, Germany
| | - Knut Reinert
- Department of Mathematics and Computer Science, FU Berlin, Berlin 14195, Germany
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin 14195, Germany
| | - Thilo Muth
- Section S.3 eScience, Federal Institute for Materials Research and Testing (BAM), Berlin 12205, Germany
| |
Collapse
|
6
|
Li S, Zhu J, Lubman DM, Zhou H, Tang H. GlycoSLASH: Concurrent Glycopeptide Identification from Multiple Related LC-MS/MS Data Sets by Using Spectral Clustering and Library Searching. J Proteome Res 2023; 22:1501-1509. [PMID: 36802412 PMCID: PMC10164058 DOI: 10.1021/acs.jproteome.3c00066] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2023]
Abstract
Liquid chromatography coupled with tandem mass spectrometry is commonly adopted in large-scale glycoproteomic studies involving hundreds of disease and control samples. The software for glycopeptide identification in such data (e.g., the commercial software Byonic) analyzes the individual data set and does not exploit the redundant spectra of glycopeptides presented in the related data sets. Herein, we present a novel concurrent approach for glycopeptide identification in multiple related glycoproteomic data sets by using spectral clustering and spectral library searching. The evaluation on two large-scale glycoproteomic data sets showed that the concurrent approach can identify 105%-224% more spectra as glycopeptides compared to the glycopeptide identification on individual data sets using Byonic alone. The improvement of glycopeptide identification also enabled the discovery of several potential biomarkers of protein glycosylations in hepatocellular carcinoma patients.
Collapse
Affiliation(s)
- Sujun Li
- Department of Blood Transfusion, The First Affiliated Hospital of Nanchang University, Nanchang 330000, China.,JiangXi Key Laboratory of Transfusion Medicine, Nanchang 330000, China.,Luddy School of Informatics, Computing and Engineering, Indiana University, Bloomington, Indiana 47408, United States
| | - Jianhui Zhu
- Department of Surgery, University of Michigan, Medical Center, Ann Arbor, Michigan 48109, United States
| | - David M Lubman
- Department of Surgery, University of Michigan, Medical Center, Ann Arbor, Michigan 48109, United States
| | - He Zhou
- Shenzhen Dengding Biopharma Co. Ltd., Shenzhen 518000, China
| | - Haixu Tang
- Luddy School of Informatics, Computing and Engineering, Indiana University, Bloomington, Indiana 47408, United States
| |
Collapse
|
7
|
Bob K, Teschner D, Kemmer T, Gomez-Zepeda D, Tenzer S, Schmidt B, Hildebrandt A. Locality-sensitive hashing enables efficient and scalable signal classification in high-throughput mass spectrometry raw data. BMC Bioinformatics 2022; 23:287. [PMID: 35858828 PMCID: PMC9301846 DOI: 10.1186/s12859-022-04833-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Accepted: 07/08/2022] [Indexed: 11/10/2022] Open
Abstract
Background Mass spectrometry is an important experimental technique in the field of proteomics. However, analysis of certain mass spectrometry data faces a combination of two challenges: first, even a single experiment produces a large amount of multi-dimensional raw data and, second, signals of interest are not single peaks but patterns of peaks that span along the different dimensions. The rapidly growing amount of mass spectrometry data increases the demand for scalable solutions. Furthermore, existing approaches for signal detection usually rely on strong assumptions concerning the signals properties. Results In this study, it is shown that locality-sensitive hashing enables signal classification in mass spectrometry raw data at scale. Through appropriate choice of algorithm parameters it is possible to balance false-positive and false-negative rates. On synthetic data, a superior performance compared to an intensity thresholding approach was achieved. Real data could be strongly reduced without losing relevant information. Our implementation scaled out up to 32 threads and supports acceleration by GPUs. Conclusions Locality-sensitive hashing is a desirable approach for signal classification in mass spectrometry raw data. Availability Generated data and code are available at https://github.com/hildebrandtlab/mzBucket. Raw data is available at https://zenodo.org/record/5036526. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04833-5.
Collapse
Affiliation(s)
- Konstantin Bob
- Institute of Computer Science, Johannes Gutenberg University Mainz, D-55128, Mainz, Germany
| | - David Teschner
- Institute of Computer Science, Johannes Gutenberg University Mainz, D-55128, Mainz, Germany
| | - Thomas Kemmer
- Institute of Computer Science, Johannes Gutenberg University Mainz, D-55128, Mainz, Germany
| | - David Gomez-Zepeda
- Institute for Immunology, University Medical Center of the Johannes Gutenberg University Mainz, D-55128, Mainz, Germany.,Immunoproteomics Unit, Helmholtz-Institute for Translational Oncology (HI-TRON) Mainz, D-55131, Mainz, Germany
| | - Stefan Tenzer
- Institute for Immunology, University Medical Center of the Johannes Gutenberg University Mainz, D-55128, Mainz, Germany.,Immunoproteomics Unit, Helmholtz-Institute for Translational Oncology (HI-TRON) Mainz, D-55131, Mainz, Germany
| | - Bertil Schmidt
- Institute of Computer Science, Johannes Gutenberg University Mainz, D-55128, Mainz, Germany
| | - Andreas Hildebrandt
- Institute of Computer Science, Johannes Gutenberg University Mainz, D-55128, Mainz, Germany.
| |
Collapse
|
8
|
Wen B, Zhang B. Computational Proteomics: Focus on Deep Learning. Proteomics 2020; 20:e2000258. [PMID: 33210458 DOI: 10.1002/pmic.202000258] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2020] [Revised: 10/14/2020] [Indexed: 11/09/2022]
Affiliation(s)
- Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| |
Collapse
|