1
|
Kelman MJ, Renaud JB, McCarron P, Hoogstra S, Chow W, Wang J, Varga E, Patriarca A, Vaya AM, Visintin L, Nguyen T, De Boevre M, De Saeger S, Karanghat V, Vuckovic D, McMullin DR, Dall'Asta C, Ayeni K, Warth B, Huang M, Tittlemier S, Mats L, Cao R, Sulyok M, Xu K, Berthiller F, Kuhn M, Cramer B, Ciasca B, Lattanzio V, De Baere S, Croubels S, DesRochers N, Sura S, Bates J, Wright EJ, Thapa I, Blackwell BA, Zhang K, Wong J, Burns L, Borts DJ, Sumarah MW. International interlaboratory study to normalize liquid chromatography-based mycotoxin retention times through implementation of a retention index system. J Chromatogr A 2025; 1745:465732. [PMID: 39913989 DOI: 10.1016/j.chroma.2025.465732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2024] [Revised: 01/17/2025] [Accepted: 01/27/2025] [Indexed: 02/25/2025]
Abstract
Monitoring for mycotoxins in food or feed matrices is necessary to ensure the safety and security of global food systems. Due to a lack of standardized methods and individual laboratory priorities, most institutions have developed their own methods for mycotoxin determinations. Given the diversity of mycotoxin chemical structures and physicochemical properties, searching databases, and comparing data between institutions is complicated. We previously introduced incorporating a retention index (RI) system into liquid chromatography mass spectrometry (LC-MS) based mycotoxin determinations. To validate this concept, we designed an interlaboratory study where each participating laboratory was sent N-alkylpyridinium-3-sulfonates (NAPS) RI standards, and 36 mycotoxin standards for analysis using their pre-optimized LC-MS methods. Data from 44 analytical methods were submitted from 24 laboratories representing various manufacturer platforms, LC columns, and mobile phase compositions. Mycotoxin retention times (tR) were converted to RI values based on their elution relative to the NAPS standards. Trichothecenes (deoxynivalenol, 3-acetyldeoxynivalenol, 15-acetyldeoxynivalenol) showed tR consistency (± 20-50 RI units, 1-5 % median RI) regardless of mobile phase or type of chromatography column in this study. For the remaining mycotoxins tested, the RI values were strongly impacted by the mobile phase composition and column chemistry. The ability to predict tR was evaluated based on the median RI mycotoxin values and the NAPS tR. These values were corrected using Tanimoto coefficients to investigate whether structurally similar compounds could be used as anchors to further improve accuracy. This study demonstrated the power of employing an RI system for mycotoxin determinations, further enhancing the confidence of identifications.
Collapse
Affiliation(s)
- M J Kelman
- London Research and Development Centre, Agriculture and Agri-Food Canada, 1391 Sandford Street, London, Ontario N5 V 4T3, Canada
| | - J B Renaud
- London Research and Development Centre, Agriculture and Agri-Food Canada, 1391 Sandford Street, London, Ontario N5 V 4T3, Canada
| | - P McCarron
- Metrology, National Research Council Canada, 1411 Oxford Street, Halifax, Nova Scotia, B3H 3Z1, Canada
| | - S Hoogstra
- Agassiz Research and Development Centre, Agriculture and Agri-Food Canada, 6947 Lougheed Hwy., Agassiz, British Columbia V0 M 1A2, Canada
| | - W Chow
- Calgary Laboratory, Canadian Food Inspection Agency, 3650 36th Street NW, Calgary, Alberta T2 L 2L1, Canada
| | - J Wang
- Calgary Laboratory, Canadian Food Inspection Agency, 3650 36th Street NW, Calgary, Alberta T2 L 2L1, Canada
| | - E Varga
- Unit Food Hygiene and Technology, Centre for Food Science and Veterinary Public Health, Clinical Department for Farm Animals and Food System Science, University of Veterinary Medicine Vienna, Veterinärplatz 1, 1210 Vienna, Austria
| | - A Patriarca
- Magan Centre of Applied Mycology, Faculty of Engineering and Applied Sciences, Cranfield University, Cranfield, Bedford MK43 0AL, United Kingdom
| | - A Medina Vaya
- Magan Centre of Applied Mycology, Faculty of Engineering and Applied Sciences, Cranfield University, Cranfield, Bedford MK43 0AL, United Kingdom
| | - L Visintin
- Campus Heymans, Department of Bioanalysis, Centre of Excellence in Mycotoxicology & Public Health, Ghent University, Ottergemsesteenweg, 460, 9000 Ghent, Belgium
| | - T Nguyen
- Campus Heymans, Department of Bioanalysis, Centre of Excellence in Mycotoxicology & Public Health, Ghent University, Ottergemsesteenweg, 460, 9000 Ghent, Belgium
| | - M De Boevre
- Campus Heymans, Department of Bioanalysis, Centre of Excellence in Mycotoxicology & Public Health, Ghent University, Ottergemsesteenweg, 460, 9000 Ghent, Belgium
| | - S De Saeger
- Campus Heymans, Department of Bioanalysis, Centre of Excellence in Mycotoxicology & Public Health, Ghent University, Ottergemsesteenweg, 460, 9000 Ghent, Belgium
| | - V Karanghat
- Department of Chemistry and Biochemistry, Concordia University, 7141 Sherbrooke Street West, Montréal, Québec H4B 1R6, Canada
| | - D Vuckovic
- Department of Chemistry and Biochemistry, Concordia University, 7141 Sherbrooke Street West, Montréal, Québec H4B 1R6, Canada
| | - D R McMullin
- Department of Chemistry, Carleton University, 1125 Colonel By Drive, Ottawa, Ontario K1S 5B6, Canada
| | - C Dall'Asta
- Department of Food and Drug, University of Parma, Parco Area delle Scienze 27/A, 43125 Parma, Italy
| | - K Ayeni
- Department of Food Chemistry and Toxicology, University of Vienna, Währingerstraße 38, A-1090 Vienna, Austria
| | - B Warth
- Department of Food Chemistry and Toxicology, University of Vienna, Währingerstraße 38, A-1090 Vienna, Austria
| | - M Huang
- Grain Research Laboratory, Canadian Grain Commission, 1404-303 Main Street, Winnipeg, Manitoba R3C 3G7, Canada
| | - S Tittlemier
- Grain Research Laboratory, Canadian Grain Commission, 1404-303 Main Street, Winnipeg, Manitoba R3C 3G7, Canada
| | - L Mats
- Guelph Research and Development Center- Agriculture and Agri-Food Canada, 93 Stone Road West, Guelph, Ontario N1 G 5C9, Canada
| | - R Cao
- Guelph Research and Development Center- Agriculture and Agri-Food Canada, 93 Stone Road West, Guelph, Ontario N1 G 5C9, Canada
| | - M Sulyok
- Institute of Bioanalytics and Agro-Metabolomics Department of Agrobiotechnology (IFA-Tulln) University of Natural Resources and Life Sciences, Vienna, Konrad Lorenz Straße 20, 3430 Tulln, Austria
| | - K Xu
- Institute of Bioanalytics and Agro-Metabolomics Department of Agrobiotechnology (IFA-Tulln) University of Natural Resources and Life Sciences, Vienna, Konrad Lorenz Straße 20, 3430 Tulln, Austria
| | - F Berthiller
- Institute of Bioanalytics and Agro-Metabolomics Department of Agrobiotechnology (IFA-Tulln) University of Natural Resources and Life Sciences, Vienna, Konrad Lorenz Straße 20, 3430 Tulln, Austria
| | - M Kuhn
- Institute of Food Chemistry, Universität Münster, Corrensstraße 45, 48149 Muenster, Germany
| | - B Cramer
- Institute of Food Chemistry, Universität Münster, Corrensstraße 45, 48149 Muenster, Germany
| | - B Ciasca
- Institute of Sciences of Food Production, National Research Council, Amendola 122/O 70126 Bari, Italy
| | - V Lattanzio
- Institute of Sciences of Food Production, National Research Council, Amendola 122/O 70126 Bari, Italy
| | - S De Baere
- Laboratory of Pharmacology and Toxicology, Department of Pathobiology, Pharmacology and Zoological Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
| | - S Croubels
- Laboratory of Pharmacology and Toxicology, Department of Pathobiology, Pharmacology and Zoological Medicine, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
| | - N DesRochers
- London Research and Development Centre, Agriculture and Agri-Food Canada, 1391 Sandford Street, London, Ontario N5 V 4T3, Canada
| | - S Sura
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, 101 Rout, Unit 100, Morden, Manitoba R6 M 1Y5, Canada
| | - J Bates
- National Research Council Canada, Metrology, 1200 Montreal Road, Ottawa, Ontario K1A 0R6, Canada
| | - E J Wright
- Metrology, National Research Council Canada, 1411 Oxford Street, Halifax, Nova Scotia, B3H 3Z1, Canada
| | - I Thapa
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, 960 Carling Avenue, Ottawa, Ontario, K1A 0C6, Canada
| | - B A Blackwell
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, 960 Carling Avenue, Ottawa, Ontario, K1A 0C6, Canada
| | - K Zhang
- Center for Food Safety and Applied Nutrition, US Food and Drug Administration, 5001 Campus Drive, College Park, Maryland 20740, USA
| | - J Wong
- Center for Food Safety and Applied Nutrition, US Food and Drug Administration, 5001 Campus Drive, College Park, Maryland 20740, USA
| | - L Burns
- Veterinary Diagnostic Laboratory, Iowa State University, 1850 Christensen Drive, Ames, Iowa 50011-1134, USA
| | - D J Borts
- Veterinary Diagnostic Laboratory, Iowa State University, 1850 Christensen Drive, Ames, Iowa 50011-1134, USA
| | - M W Sumarah
- London Research and Development Centre, Agriculture and Agri-Food Canada, 1391 Sandford Street, London, Ontario N5 V 4T3, Canada.
| |
Collapse
|
2
|
Xie J, Chen S, Zhao L, Dong X. Application of artificial intelligence to quantitative structure-retention relationship calculations in chromatography. J Pharm Anal 2025; 15:101155. [PMID: 39896319 PMCID: PMC11782803 DOI: 10.1016/j.jpha.2024.101155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2024] [Revised: 11/09/2024] [Accepted: 11/20/2024] [Indexed: 02/04/2025] Open
Abstract
Quantitative structure-retention relationship (QSRR) is an important tool in chromatography. QSRR examines the correlation between molecular structures and their retention behaviors during chromatographic separation. This approach involves developing models for predicting the retention time (RT) of analytes, thereby accelerating method development and facilitating compound identification. In addition, QSRR can be used to study compound retention mechanisms and support drug screening efforts. This review provides a comprehensive analysis of QSRR workflows and applications, with a special focus on the role of artificial intelligence-an area not thoroughly explored in previous reviews. Moreover, we discuss current limitations in RT prediction and propose promising solutions. Overall, this review offers a fresh perspective on future QSRR research, encouraging the development of innovative strategies that enable the diverse applications of QSRR models in chromatographic analysis.
Collapse
Affiliation(s)
- Jingru Xie
- School of Medicine, Shanghai University, Shanghai, 200444, China
- Department of Pharmacy, Shanghai Baoshan Luodian Hospital, Baoshan District, Shanghai, 201908, China
- Luodian Clinical Drug Research Center, Institute for Translational Medicine Research, Shanghai University, Shanghai, 200444, China
| | - Si Chen
- School of Medicine, Shanghai University, Shanghai, 200444, China
- Luodian Clinical Drug Research Center, Institute for Translational Medicine Research, Shanghai University, Shanghai, 200444, China
| | - Liang Zhao
- School of Medicine, Shanghai University, Shanghai, 200444, China
- Department of Pharmacy, Shanghai Baoshan Luodian Hospital, Baoshan District, Shanghai, 201908, China
- Luodian Clinical Drug Research Center, Institute for Translational Medicine Research, Shanghai University, Shanghai, 200444, China
| | - Xin Dong
- School of Medicine, Shanghai University, Shanghai, 200444, China
- Luodian Clinical Drug Research Center, Institute for Translational Medicine Research, Shanghai University, Shanghai, 200444, China
- Suzhou Innovation Center of Shanghai University, Suzhou, 215000, Jiangsu, China
| |
Collapse
|
3
|
Beck AG, Fine J, Aggarwal P, Regalado EL, Levorse D, De Jesus Silva J, Sherer EC. Machine learning models and performance dependency on 2D chemical descriptor space for retention time prediction of pharmaceuticals. J Chromatogr A 2024; 1730:465109. [PMID: 38968662 DOI: 10.1016/j.chroma.2024.465109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Revised: 06/17/2024] [Accepted: 06/18/2024] [Indexed: 07/07/2024]
Abstract
The predictive modeling of liquid chromatography methods can be an invaluable asset, potentially saving countless hours of labor while also reducing solvent consumption and waste. Tasks such as physicochemical screening and preliminary method screening systems where large amounts of chromatography data are collected from fast and routine operations are particularly well suited for both leveraging large datasets and benefiting from predictive models. Therefore, the generation of predictive models for retention time is an active area of development. However, for these predictive models to gain acceptance, researchers first must have confidence in model performance and the computational cost of building them should be minimal. In this study, a simple and cost-effective workflow for the development of machine learning models to predict retention time using only Molecular Operating Environment 2D descriptors as input for support vector regression is developed. Furthermore, we investigated the relative performance of models based on molecular descriptor space by utilizing uniform manifold approximation and projection and clustering with Gaussian mixture models to identify chemically distinct clusters. Results outlined herein demonstrate that local models trained on clusters in chemical space perform equivalently when compared to models trained on all data. Through 10-fold cross-validation on a comprehensive set containing 67,950 of our company's proprietary analytes, these models achieved coefficients of determination of 0.84 and 3 % error in terms of retention time. This promising statistical significance is found to translate from cross-validation to prospective prediction on an external test set of pharmaceutically relevant analytes. The observed equivalency of global and local modeling of large datasets is retained with METLIN's SMRT dataset, thereby confirming the wider applicability of the developed machine learning workflows for global models.
Collapse
Affiliation(s)
- Armen G Beck
- Analytical Research & Development, MRL, Merck & Co., Inc., Rahway, NJ 07065, USA
| | - Jonathan Fine
- Analytical Research & Development, MRL, Merck & Co., Inc., Rahway, NJ 07065, USA
| | - Pankaj Aggarwal
- Analytical Research & Development, MRL, Merck & Co., Inc., Rahway, NJ 07065, USA.
| | - Erik L Regalado
- Analytical Research & Development, MRL, Merck & Co., Inc., Rahway, NJ 07065, USA
| | - Dorothy Levorse
- Analytical Research & Development, MRL, Merck & Co., Inc., Rahway, NJ 07065, USA
| | | | - Edward C Sherer
- Analytical Research & Development, MRL, Merck & Co., Inc., Rahway, NJ 07065, USA
| |
Collapse
|
4
|
Singh YR, Shah DB, Maheshwari DG, Shah JS, Shah S. Advances in AI-Driven Retention Prediction for Different Chromatographic Techniques: Unraveling the Complexity. Crit Rev Anal Chem 2023; 54:3559-3569. [PMID: 37672314 DOI: 10.1080/10408347.2023.2254379] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/07/2023]
Abstract
Retention prediction through Artificial intelligence (AI)-based techniques has gained exponential growth due to their abilities to process complex sets of data and ease the crucial task of identification and separation of compounds in most employed chromatographic techniques. Numerous approaches were reported for retention prediction in different chromatographic techniques, and consistent results demonstrated that the accuracy and effectiveness of deep learning models outclassed the linear machine learning models, mainly in liquid and gas chromatography, as ML algorithms use fewer complex data to train and predict information. Support Vector machine-based neural networks were found to be most utilized for the prediction of retention factors of different compounds in thin-layer chromatography. Cheminformatics, chemometrics, and hybrid approaches were also employed for the modeling and were more reliable in retention prediction over conventional models. Quantitative Structure Retention Relationship (QSRR) was also a potential method for predicting retention in different chromatographic techniques and determining the separation method for analytes. These techniques demonstrated the aids of incorporating QSRR with AI-driven techniques acquiring more precise retention predictions. This review aims at recent exploration of different AI-driven approaches employed for retention prediction in different chromatographic techniques, and due to the lack of summarized literature, it also aims at providing a comprehensive literature that will be highly useful for the society of scientists exploring the field of AI in analytical chemistry.
Collapse
Affiliation(s)
- Yash Raj Singh
- Department of Pharmaceutical Quality Assurance, L. J. Institute of Pharmacy, L J University, Ahmedabad, Gujarat, India
| | - Darshil B Shah
- Department of Pharmaceutical Quality Assurance, L. J. Institute of Pharmacy, L J University, Ahmedabad, Gujarat, India
| | - Dilip G Maheshwari
- Department of Pharmaceutical Quality Assurance, L. J. Institute of Pharmacy, L J University, Ahmedabad, Gujarat, India
| | - Jignesh S Shah
- Department of Pharmaceutical Regulatory Affairs, L. J. Institute of Pharmacy, L J University, Ahmedabad, Gujarat, India
| | - Shreeraj Shah
- Department of Pharmaceutical Technology, L. J. Institute of Pharmacy, L J University, Ahmedabad, Gujarat, India
| |
Collapse
|
5
|
Singh YR, Shah DB, Kulkarni M, Patel SR, Maheshwari DG, Shah JS, Shah S. Current trends in chromatographic prediction using artificial intelligence and machine learning. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2023; 15:2785-2797. [PMID: 37264667 DOI: 10.1039/d3ay00362k] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Artificial intelligence (AI) and machine learning (ML) gained tremendous growth and are rapidly becoming popular in various fields of prediction due to their potential abilities, accuracy, and speed. Machine learning algorithms employ historical data to analyze or predict information using patterns or trends. AI and ML were most employed in chromatographic predictions and particularly attractive options for liquid chromatography method development, as they can help achieve desired results faster, more accurately, and more efficiently. This review aims at exploring various AI and ML models employed in the determination of chromatographic characteristics. This review also aims to provide deep insight into reported artificial neural network (ANN) associated techniques which maintained better accuracy and significant possibilities for chromatographic characteristics prediction in liquid chromatography over classical linear models and also emphasizes the integration of a fuzzy system with an ANN, as this integrated study provides more efficient and accurate methods in chromatographic prediction than other linear models. This study also focuses on the retention prediction of a target molecule employing QSRR methodology combined with an ANN, highlighting a more effective technique than the QSRR alone. This approach showed the benefits of combining AI or ML algorithms with the QSRR to obtain more accurate retention predictions, emphasizing the potential of artificial intelligence and machine learning for overcoming adversities in analytical chemistry.
Collapse
Affiliation(s)
- Yash Raj Singh
- Department of Pharmaceutical Quality Assurance, LJ Institute of Pharmacy, LJ University, Ahmedabad, Gujarat, India
| | - Darshil B Shah
- Department of Pharmaceutical Quality Assurance, LJ Institute of Pharmacy, LJ University, Ahmedabad, Gujarat, India
| | - Mangesh Kulkarni
- Department of Pharmaceutical Technology, LJ Institute of Pharmacy, LJ University, Ahmedabad, Gujarat, India
| | - Shreyanshu R Patel
- Department of Pharmaceutical Technology, LJ Institute of Pharmacy, LJ University, Ahmedabad, Gujarat, India
| | - Dilip G Maheshwari
- Department of Pharmaceutical Quality Assurance, LJ Institute of Pharmacy, LJ University, Ahmedabad, Gujarat, India
| | - Jignesh S Shah
- Department of Pharmaceutical Regulatory Affairs, LJ Institute of Pharmacy, LJ University, Ahmedabad, Gujarat, India
| | - Shreeraj Shah
- Department of Pharmaceutical Technology, LJ Institute of Pharmacy, LJ University, Ahmedabad, Gujarat, India
| |
Collapse
|
6
|
Prediction of surface excess adsorption and retention factors in reversed-phase liquid chromatography from molecular dynamics simulations. J Chromatogr A 2022; 1685:463627. [DOI: 10.1016/j.chroma.2022.463627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 10/27/2022] [Accepted: 10/29/2022] [Indexed: 11/06/2022]
|
7
|
Machine Learning-Based Retention Time Prediction of Trimethylsilyl Derivatives of Metabolites. Biomedicines 2022; 10:biomedicines10040879. [PMID: 35453629 PMCID: PMC9024754 DOI: 10.3390/biomedicines10040879] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Revised: 04/04/2022] [Accepted: 04/06/2022] [Indexed: 11/16/2022] Open
Abstract
In gas chromatography–mass spectrometry-based untargeted metabolomics, metabolites are identified by comparing mass spectra and chromatographic retention time with reference databases or standard materials. In that sense, machine learning has been used to predict the retention time of metabolites lacking reference data. However, the retention time prediction of trimethylsilyl derivatives of metabolites, typically analyzed in untargeted metabolomics using gas chromatography, has been poorly explored. Here, we provide a rationalized framework for machine learning-based retention time prediction of trimethylsilyl derivatives of metabolites in gas chromatography. We compared different machine learning paradigms, in addition to exploring the influence of the computational molecular structure representation to train the prediction models: fingerprint class and fingerprint calculation software. Our study challenged predicted retention time when using chemical ionization and electron impact ionization sources in simulated and real cases, demonstrating a good correct identity ranking capability by machine learning, despite observing a limited false identity filtering power in cases where a spectrum or a monoisotopic mass match to multiple candidates. Specifically, machine learning prediction yielded median absolute and relative retention index (relative retention time) errors of 37.1 retention index units and 2%, respectively. In addition, fingerprint class and fingerprint calculation software, as well as the molecular structural similarity between the training and test or real case sets, showed to be critical modulators of the prediction performance. Finally, we leveraged the structural similarity between the training and test or real case set to determine the probability that the prediction error is below a specific threshold. Overall, our study demonstrates that predicted retention time can provide insights into the true structure of unknown metabolites by ranking from the most to the least plausible molecular identity, and sets the guidelines to assess the confidence in metabolite identification using predicted retention time data.
Collapse
|
8
|
Borkar MR, Coutinho E. Amalgamation of comparative protein modeling with quantitative structure-retention relationship for prediction of the chromatographic behavior of peptides. J Chromatogr A 2022; 1669:462967. [DOI: 10.1016/j.chroma.2022.462967] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 03/09/2022] [Accepted: 03/11/2022] [Indexed: 10/18/2022]
|
9
|
Tian Z, Liu F, Li D, Fernie AR, Chen W. Strategies for structure elucidation of small molecules based on LC–MS/MS data from complex biological samples. Comput Struct Biotechnol J 2022; 20:5085-5097. [PMID: 36187931 PMCID: PMC9489805 DOI: 10.1016/j.csbj.2022.09.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Revised: 09/03/2022] [Accepted: 09/03/2022] [Indexed: 11/06/2022] Open
Abstract
LC–MS/MS is a major analytical platform for metabolomics, which has become a recent hotspot in the research fields of life and environmental sciences. By contrast, structure elucidation of small molecules based on LC–MS/MS data remains a major challenge in the chemical and biological interpretation of untargeted metabolomics datasets. In recent years, several strategies for structure elucidation using LC–MS/MS data from complex biological samples have been proposed, these strategies can be simply categorized into two types, one based on structure annotation of mass spectra and for the other on retention time prediction. These strategies have helped many scientists conduct research in metabolite-related fields and are indispensable for the development of future tools. Here, we summarized the characteristics of the current tools and strategies for structure elucidation of small molecules based on LC–MS/MS data, and further discussed the directions and perspectives to improve the power of the tools or strategies for structure elucidation.
Collapse
|
10
|
Yang Q, Ji H, Fan X, Zhang Z, Lu H. Retention time prediction in hydrophilic interaction liquid chromatography with graph neural network and transfer learning. J Chromatogr A 2021; 1656:462536. [PMID: 34563892 DOI: 10.1016/j.chroma.2021.462536] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Revised: 09/02/2021] [Accepted: 09/03/2021] [Indexed: 01/04/2023]
Abstract
The combination of retention time (RT), accurate mass and tandem mass spectra can improve the structural annotation in untargeted metabolomics. However, the incorporation of RT for metabolite identification has received less attention because of the limitation of available RT data, especially for hydrophilic interaction liquid chromatography (HILIC). Here, the Graph Neural Network-based Transfer Learning (GNN-TL) is proposed to train a model for HILIC RTs prediction. The graph neural network was pre-trained using an in silico HILIC RT dataset (pseudo-labeling dataset) with ∼306 K molecules. Then, the weights of dense layers in the pre-trained GNN (pre-GNN) model were fine-tuned by transfer learning using a small number of experimental HILIC RTs from the target chromatographic system. The GNN-TL outperformed the methods in Retip, including the Random Forest (RF), Bayesian-regularized neural network (BRNN), XGBoost, light gradient-boosting machine (LightGBM), and Keras. It achieved the lowest mean absolute error (MAE) of 38.6 s on the test set and 33.4 s on an additional test set. It has the best ability to generalize with a small performance difference between training, test, and additional test sets. Furthermore, the predicted RTs can filter out nearly 60% false positive candidates on average, which is valuable for the identification of compounds complementary to mass spectrometry.
Collapse
Affiliation(s)
- Qiong Yang
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China
| | - Hongchao Ji
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China
| | - Xiaqiong Fan
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China
| | - Zhimin Zhang
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China.
| | - Hongmei Lu
- College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, PR China.
| |
Collapse
|
11
|
Sagandykova G, Buszewski B. Perspectives and recent advances in quantitative structure-retention relationships for high performance liquid chromatography. How far are we? Trends Analyt Chem 2021. [DOI: 10.1016/j.trac.2021.116294] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
12
|
Bride E, Heinisch S, Bonnefille B, Guillemain C, Margoum C. Suspect screening of environmental contaminants by UHPLC-HRMS and transposable Quantitative Structure-Retention Relationship modelling. JOURNAL OF HAZARDOUS MATERIALS 2021; 409:124652. [PMID: 33277075 DOI: 10.1016/j.jhazmat.2020.124652] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Revised: 10/02/2020] [Accepted: 11/20/2020] [Indexed: 06/12/2023]
Abstract
A Quantitative Structure-Retention Relationship (QSRR) model is proposed and aims at increasing the confidence level associated to the identification of organic contaminants by Ultra-High Performance Liquid Chromatography hyphenated to High Resolution Mass Spectrometry (UHPLC-HRMS) in environmental samples under a suspect screening approach. The model was built from a selection of 8 easily accessible physicochemical descriptors, and was validated from a set of 274 organic compounds commonly found in environmental samples. The proposed predictive figure approach is based on the mobile phase composition at solute elution (expressed as % acetonitrile), that has the major advantage of making the model reusable by other laboratories, since the elution composition is independent of both the column geometry and the UHPLC-system. The model quality was assessed and was altered neither by the columns from different lots, nor by the complex matrices of environmental water samples. Then, the solute retention of any organic compound present in water samples is expected to be predicted within ± 14.3% acetonitrile by our model. Solute retention can therefore be used as a supplementary tool for the identification of environmental contaminants by UHPLC-HRMS, in addition to mass spectrometry data already used in the suspect screening approach.
Collapse
Affiliation(s)
- Eloi Bride
- INRAE, UR RiverLy, F-69625 Villeurbanne, France
| | - Sabine Heinisch
- Université de Lyon, Institut des Sciences Analytiques, UMR 5280, CNRS, F-69100 Villeurbanne, France
| | | | | | | |
Collapse
|
13
|
Gritti F. Perspective on the Future Approaches to Predict Retention in Liquid Chromatography. Anal Chem 2021; 93:5653-5664. [PMID: 33797872 DOI: 10.1021/acs.analchem.0c05078] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The demand for rapid column screening, computer-assisted method development and method transfer, and unambiguous compound identification by LC/MS analyses has pushed analysts to adopt experimental protocols and software for the accurate prediction of the retention time in liquid chromatography (LC). This Perspective discusses the classical approaches used to predict retention times in LC over the last three decades and proposes future requirements to increase their accuracy. First, inverse methods for retention prediction are essentially applied during screening and gradient method optimization: a minimum number of experiments or design of experiments (DoE) is run to train and calibrate a model (either purely statistical or based on the principles and fundamentals of liquid chromatography) by a mere fitting process. They do not require the accurate knowledge of the true column hold-up volume V0, system dwell volume Vdwell (in gradient elution), and the retention behavior (k versus the content of strong solvent φ, temperature T, pH, and ionic strength I) of the analytes. Their relative accuracy is often excellent below a few percent. Statistical methods are expected to be the most attractive to handle very complex retention behavior such as in mixed-mode chromatography (MMC). Fundamentally correct retention models accounting for the simultaneous impact of φ, I, pH, and T in MMC are needed for method development based on chromatography principles. Second, direct methods for retention prediction are ideally suited for accurate method transfer from one column/system configuration to another: these quality by design (QbD) methods are based on the fundamentals and principles of solid-liquid adsorption and gradient chromatography. No model calibration is necessary; however, they require universal conventions for the accurate determination of true retention factors (for 1 < k < 30) as a function of the experimental variables (φ, T, pH, and I) and of the true column/system parameters (V0, Vdwell, dispersion volume, σ, and relaxation volume, τ, of the programmed gradient profile at the column inlet and gradient distortion at the column outlet). Finally, when the molecular structure of the analytes is either known or assumed, retention prediction has essentially been made on the basis of statistical approaches such as the linear solvation energy relationships (LSERs) and the quantitative structure retention relationships (QSRRs): their ability to accurately predict the retention remains limited within 10-30%. They have been combined with molecular similarity approaches (where the retention model is calibrated with compounds having structures similar to that of the targeted analytes) and artificial intelligence algorithms to further improve their accuracy below 10%. In this Perspective, it is proposed to adopt a more rigorous and fundamental approach by considering the very details of the solid-liquid adsorption process: Monte Carlo (MC) or molecular dynamics (MD) simulations are promising tools to explain and interpret retention data that are too complex to be described by either empirical or statistical retention models.
Collapse
Affiliation(s)
- Fabrice Gritti
- Waters Corporation, 34 Maple Street, Milford, Massachusetts 01757, United States
| |
Collapse
|
14
|
Taraji M, Haddad PR. Method Optimisation in Hydrophilic-Interaction Liquid Chromatography by Design of Experiments Combined with Quantitative Structure–Retention Relationships. Aust J Chem 2021. [DOI: 10.1071/ch21102] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Accurate prediction of the separation conditions for a set of target analytes with no retention data available is fundamental for routine analytical assays but remains a very challenging task. In this paper, a quality by design (QbD) optimisation workflow capable of discovering the optimal chromatographic conditions for separation of new compounds in hydrophilic-interaction liquid chromatography (HILIC) is introduced. This workflow features the application of quantitative structure−retention relationship (QSRR) methodology in conjunction with design of experiments (DoE) principles and was used to carry out a two-level full factorial DoE optimisation for a mixture of pharmaceutical analytes on zwitterionic, amide, amine, and bare silica HILIC stationary phases, with mobile phases containing varying acetonitrile content, mobile phase pH, and salt concentration. A dual-filtering approach that considers both retention time (tR) and structural similarity was used to identify the optimal set of analytes to train the QSRR in order to maximise prediction accuracy. Highly predictive retention models (average R2 of 0.98) were obtained and statistical analysis of the prediction performance of the QSRR models demonstrated their ability to predict the retention times of new compounds based solely on their molecular structures, with root-mean-square errors of prediction in the range 7.6–11.0 %. Further, the obtained retention data for pharmaceutical test compounds were used to compute their separation selectivity, which was used as input into a DoE optimiser in order to select the optimal separation conditions. Experimental separations performed under the chosen optimal working conditions showed good agreement with the theoretical predictions. To the best of our knowledge, this is the first study of a QbD optimisation workflow assisted with dual-filtering-based retention modelling to facilitate the method development process in HILIC.
Collapse
|
15
|
Haddad PR, Taraji M, Szücs R. Prediction of Analyte Retention Time in Liquid Chromatography. Anal Chem 2020; 93:228-256. [DOI: 10.1021/acs.analchem.0c04190] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Paul R. Haddad
- Australian Centre for Research on Separation Science, School of Natural Sciences, University of Tasmania, Private Bag 75, Hobart, Tasmania, Australia 7001
| | - Maryam Taraji
- Australian Centre for Research on Separation Science, School of Natural Sciences, University of Tasmania, Private Bag 75, Hobart, Tasmania, Australia 7001
- The Australian Wine Research Institute, P.O. Box 197, Adelaide, South Australia 5064, Australia
- Metabolomics Australia, P.O. Box 197, Adelaide, South Australia 5064, Australia
| | - Roman Szücs
- Pfizer R&D UK Limited, Ramsgate Road, Sandwich CT13 9NJ, U.K
- Department of Analytical Chemistry, Faculty of Natural Sciences, Comenius University in Bratislava, Mlynská Dolina CH2, Ilkovičova 6, SK-84215 Bratislava, Slovakia
| |
Collapse
|
16
|
Bonini P, Kind T, Tsugawa H, Barupal DK, Fiehn O. Retip: Retention Time Prediction for Compound Annotation in Untargeted Metabolomics. Anal Chem 2020; 92:7515-7522. [PMID: 32390414 PMCID: PMC8715951 DOI: 10.1021/acs.analchem.9b05765] [Citation(s) in RCA: 128] [Impact Index Per Article: 25.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Unidentified peaks remain a major problem in untargeted metabolomics by LC-MS/MS. Confidence in peak annotations increases by combining MS/MS matching and retention time. We here show how retention times can be predicted from molecular structures. Two large, publicly available data sets were used for model training in machine learning: the Fiehn hydrophilic interaction liquid chromatography data set (HILIC) of 981 primary metabolites and biogenic amines,and the RIKEN plant specialized metabolome annotation (PlaSMA) database of 852 secondary metabolites that uses reversed-phase liquid chromatography (RPLC). Five different machine learning algorithms have been integrated into the Retip R package: the random forest, Bayesian-regularized neural network, XGBoost, light gradient-boosting machine (LightGBM), and Keras algorithms for building the retention time prediction models. A complete workflow for retention time prediction was developed in R. It can be freely downloaded from the GitHub repository (https://www.retip.app). Keras outperformed other machine learning algorithms in the test set with minimum overfitting, verified by small error differences between training, test, and validation sets. Keras yielded a mean absolute error of 0.78 min for HILIC and 0.57 min for RPLC. Retip is integrated into the mass spectrometry software tools MS-DIAL and MS-FINDER, allowing a complete compound annotation workflow. In a test application on mouse blood plasma samples, we found a 68% reduction in the number of candidate structures when searching all isomers in MS-FINDER compound identification software. Retention time prediction increases the identification rate in liquid chromatography and subsequently leads to an improved biological interpretation of metabolomics data.
Collapse
Affiliation(s)
- Paolo Bonini
- NGAlab, La Riera de Gaia, Tarragona 43762, Spain
| | - Tobias Kind
- West Coast Metabolomics Center, UC Davis Genome Center, University of California, Davis, 451 Health Sciences Drive, Davis, California 95616, United States
| | - Hiroshi Tsugawa
- RIKEN Center for Sustainable Resource Science, Yokohama 230-0045, Japan
- RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan
| | - Dinesh Kumar Barupal
- West Coast Metabolomics Center, UC Davis Genome Center, University of California, Davis, 451 Health Sciences Drive, Davis, California 95616, United States
| | - Oliver Fiehn
- West Coast Metabolomics Center, UC Davis Genome Center, University of California, Davis, 451 Health Sciences Drive, Davis, California 95616, United States
| |
Collapse
|
17
|
Serum lipidomic biomarkers for non-small cell lung cancer in nonsmoking female patients. J Pharm Biomed Anal 2020; 185:113220. [PMID: 32145537 DOI: 10.1016/j.jpba.2020.113220] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Revised: 02/26/2020] [Accepted: 02/28/2020] [Indexed: 02/06/2023]
Abstract
Lung cancer (Lca) is one of the malignant tumors with the fastest morbidity and mortality increase and the greatest threat to human health and life. The incidence of non-small cell lung cancer (NSCLC) in the nonsmoking female has increased recently. However, its pathogenesis is still unclear, and there is an urgent need for clinical diagnostic biomarkers, especially for early diagnosis. A nontargeted lipidomic approach based on ultra-high-performance liquid chromatography coupled with quadrupole time-of-flight mass spectrometry (UHPLC-Q-TOF/MS), as well as two machine learning approaches (genetic algorithm and binary logistic regression) was used to screen candidate discriminating lipids and define a combinational lipid biomarker in serum samples to distinguish female patients with NSCLC from healthy controls. Moreover, the candidate biomarkers were verified by using an external validation sample set. Our result revealed that fatty acid (FA) (20:4), FA (22:0) and LPE (20:4) can serve as a combinational biomarker for distinguishing female patients with NSCLC from healthy control with good sensitivity and specificity. Furthermore, this combinational biomarker also showed good performance in distinguishing early-stage NSCLC female patients from a healthy control. We observed that levels of unsaturated fatty acids clearly decreased, while saturated fatty acids and lysophosphatidylethanolamines pronouncedly increased in Lca patients, compared with the healthy controls, which revealed significant disturbance of lipid metabolism in NSCLC females. Our results not only provide hints to the pathological mechanism of NSCLC in nonsmoking female but also supply a combinational lipid biomarker to aid the diagnosis of NSCLC at early-stage.
Collapse
|
18
|
Park SH, De Pra M, Haddad PR, Grosse S, Pohl CA, Steiner F. Localised quantitative structure–retention relationship modelling for rapid method development in reversed-phase high performance liquid chromatography. J Chromatogr A 2020; 1609:460508. [DOI: 10.1016/j.chroma.2019.460508] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Revised: 08/21/2019] [Accepted: 09/02/2019] [Indexed: 10/26/2022]
|
19
|
Skoczylas M, Bocian S, Buszewski B. Quantitative structure – retention relationships of amino acids on the amino acid- and peptide-silica stationary phases for liquid chromatography. J Chromatogr A 2020; 1609:460514. [DOI: 10.1016/j.chroma.2019.460514] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Revised: 08/20/2019] [Accepted: 09/03/2019] [Indexed: 12/21/2022]
|
20
|
The METLIN small molecule dataset for machine learning-based retention time prediction. Nat Commun 2019; 10:5811. [PMID: 31862874 PMCID: PMC6925099 DOI: 10.1038/s41467-019-13680-7] [Citation(s) in RCA: 139] [Impact Index Per Article: 23.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Accepted: 11/13/2019] [Indexed: 01/18/2023] Open
Abstract
Machine learning has been extensively applied in small molecule analysis to predict a wide range of molecular properties and processes including mass spectrometry fragmentation or chromatographic retention time. However, current approaches for retention time prediction lack sufficient accuracy due to limited available experimental data. Here we introduce the METLIN small molecule retention time (SMRT) dataset, an experimentally acquired reverse-phase chromatography retention time dataset covering up to 80,038 small molecules. To demonstrate the utility of this dataset, we deployed a deep learning model for retention time prediction applied to small molecule annotation. Results showed that in 70\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\%$$\end{document}% of the cases, the correct molecular identity was ranked among the top 3 candidates based on their predicted retention time. We anticipate that this dataset will enable the community to apply machine learning or first principles strategies to generate better models for retention time prediction. The use of machine learning for identifying small molecules through their retention time’s predictions has been challenging so far. Here the authors combine a large database of liquid chromatography retention time with a deep learning approach to enable accurate metabolites’s identification.
Collapse
|
21
|
Wen Y, Amos RIJ, Talebi M, Szucs R, Dolan JW, Pohl CA, Haddad PR. Retention prediction using quantitative structure‐retention relationships combined with the hydrophobic subtraction model in reversed‐phase liquid chromatography. Electrophoresis 2019; 40:2415-2419. [DOI: 10.1002/elps.201900022] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2019] [Revised: 03/19/2019] [Accepted: 03/20/2019] [Indexed: 11/07/2022]
Affiliation(s)
- Yabin Wen
- Australian Centre for Research on Separation Science (ACROSS)School of Physical Sciences‐ChemistryUniversity of Tasmania Hobart Australia
| | - Ruth I. J. Amos
- Australian Centre for Research on Separation Science (ACROSS)School of Physical Sciences‐ChemistryUniversity of Tasmania Hobart Australia
| | - Mohammad Talebi
- Australian Centre for Research on Separation Science (ACROSS)School of Physical Sciences‐ChemistryUniversity of Tasmania Hobart Australia
| | - Roman Szucs
- Pfizer Global Research and Development Sandwich UK
| | | | | | - Paul R. Haddad
- Australian Centre for Research on Separation Science (ACROSS)School of Physical Sciences‐ChemistryUniversity of Tasmania Hobart Australia
| |
Collapse
|
22
|
Amos RI, Haddad PR, Szucs R, Dolan JW, Pohl CA. Molecular modeling and prediction accuracy in Quantitative Structure-Retention Relationship calculations for chromatography. Trends Analyt Chem 2018. [DOI: 10.1016/j.trac.2018.05.019] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
23
|
Wen Y, Amos RIJ, Talebi M, Szucs R, Dolan JW, Pohl CA, Haddad PR. Retention Index Prediction Using Quantitative Structure-Retention Relationships for Improving Structure Identification in Nontargeted Metabolomics. Anal Chem 2018; 90:9434-9440. [PMID: 29952550 DOI: 10.1021/acs.analchem.8b02084] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Structure identification in nontargeted metabolomics based on liquid-chromatography coupled to mass spectrometry (LC-MS) remains a significant challenge. Quantitative structure-retention relationship (QSRR) modeling is a technique capable of accelerating the structure identification of metabolites by predicting their retention, allowing false positives to be eliminated during the interpretation of metabolomics data. In this work, 191 compounds were grouped according to molecular weight and a QSRR study was carried out on the 34 resulting groups to eliminate false positives. Partial least squares (PLS) regression combined with a Genetic algorithm (GA) was applied to construct the linear QSRR models based on a variety of VolSurf+ molecular descriptors. A novel dual-filtering approach, which combines Tanimoto similarity (TS) searching as the primary filter and retention index (RI) similarity clustering as the secondary filter, was utilized to select compounds in training sets to derive the QSRR models yielding R2 of 0.8512 and an average root mean square error in prediction (RMSEP) of 8.45%. With a retention index filter expressed as ±2 standard deviations (SD) of the error, representative compounds were predicted with >91% accuracy, and for 53% of the groups (18/34), at least one false positive compound could be eliminated. The proposed strategy can thus narrow down the number of false positives to be assessed in nontargeted metabolomics.
Collapse
Affiliation(s)
- Yabin Wen
- Australian Centre for Research on Separation Science (ACROSS), School of Physical Sciences-Chemistry , University of Tasmania , Private Bag 75 , Hobart , 7001 Tasmania , Australia
| | - Ruth I J Amos
- Australian Centre for Research on Separation Science (ACROSS), School of Physical Sciences-Chemistry , University of Tasmania , Private Bag 75 , Hobart , 7001 Tasmania , Australia
| | - Mohammad Talebi
- Australian Centre for Research on Separation Science (ACROSS), School of Physical Sciences-Chemistry , University of Tasmania , Private Bag 75 , Hobart , 7001 Tasmania , Australia
| | - Roman Szucs
- Pfizer Global Research and Development , Sandwich CT139NJ , U.K
| | - John W Dolan
- LC Resources , McMinnville , Oregon 97128 , United States
| | | | - Paul R Haddad
- Australian Centre for Research on Separation Science (ACROSS), School of Physical Sciences-Chemistry , University of Tasmania , Private Bag 75 , Hobart , 7001 Tasmania , Australia
| |
Collapse
|
24
|
Blaženović I, Kind T, Ji J, Fiehn O. Software Tools and Approaches for Compound Identification of LC-MS/MS Data in Metabolomics. Metabolites 2018; 8:E31. [PMID: 29748461 PMCID: PMC6027441 DOI: 10.3390/metabo8020031] [Citation(s) in RCA: 449] [Impact Index Per Article: 64.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2018] [Revised: 04/26/2018] [Accepted: 05/06/2018] [Indexed: 01/17/2023] Open
Abstract
The annotation of small molecules remains a major challenge in untargeted mass spectrometry-based metabolomics. We here critically discuss structured elucidation approaches and software that are designed to help during the annotation of unknown compounds. Only by elucidating unknown metabolites first is it possible to biologically interpret complex systems, to map compounds to pathways and to create reliable predictive metabolic models for translational and clinical research. These strategies include the construction and quality of tandem mass spectral databases such as the coalition of MassBank repositories and investigations of MS/MS matching confidence. We present in silico fragmentation tools such as MS-FINDER, CFM-ID, MetFrag, ChemDistiller and CSI:FingerID that can annotate compounds from existing structure databases and that have been used in the CASMI (critical assessment of small molecule identification) contests. Furthermore, the use of retention time models from liquid chromatography and the utility of collision cross-section modelling from ion mobility experiments are covered. Workflows and published examples of successfully annotated unknown compounds are included.
Collapse
Affiliation(s)
- Ivana Blaženović
- NIH West Coast Metabolomics Center, UC Davis Genome Center, University of California, Davis, CA 95616, USA.
| | - Tobias Kind
- NIH West Coast Metabolomics Center, UC Davis Genome Center, University of California, Davis, CA 95616, USA.
| | - Jian Ji
- State Key Laboratory of Food Science and Technology, School of Food Science of Jiangnan University, School of Food Science Synergetic Innovation Center of Food Safety and Nutrition, Wuxi 214122, China.
| | - Oliver Fiehn
- NIH West Coast Metabolomics Center, UC Davis Genome Center, University of California, Davis, CA 95616, USA.
- Department of Biochemistry, Faculty of Sciences, King Abdulaziz University, Jeddah 21589, Saudi Arabia.
| |
Collapse
|
25
|
Taraji M, Haddad PR, Amos RI, Talebi M, Szucs R, Dolan JW, Pohl CA. Error measures in quantitative structure-retention relationships studies. J Chromatogr A 2017; 1524:298-302. [DOI: 10.1016/j.chroma.2017.09.050] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Revised: 09/21/2017] [Accepted: 09/22/2017] [Indexed: 01/31/2023]
|
26
|
Amos RIJ, Tyteca E, Talebi M, Haddad PR, Szucs R, Dolan JW, Pohl CA. Benchmarking of Computational Methods for Creation of Retention Models in Quantitative Structure–Retention Relationships Studies. J Chem Inf Model 2017; 57:2754-2762. [DOI: 10.1021/acs.jcim.7b00346] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- Ruth I. J. Amos
- Australian
Centre for Research on Separation Science (ACROSS), School of Physical
Sciences−Chemistry, University of Tasmania, Private Bag
75, Hobart 7001, Australia
| | - Eva Tyteca
- Australian
Centre for Research on Separation Science (ACROSS), School of Physical
Sciences−Chemistry, University of Tasmania, Private Bag
75, Hobart 7001, Australia
- Analytical
Chemistry, AgroBioChem Department, Gembloux Agro-Biotech, University of Liège, 2 Passage des Deportes, Gembloux 5030, Belgium
| | - Mohammad Talebi
- Australian
Centre for Research on Separation Science (ACROSS), School of Physical
Sciences−Chemistry, University of Tasmania, Private Bag
75, Hobart 7001, Australia
| | - Paul R. Haddad
- Australian
Centre for Research on Separation Science (ACROSS), School of Physical
Sciences−Chemistry, University of Tasmania, Private Bag
75, Hobart 7001, Australia
| | - Roman Szucs
- Pfizer Global Research and Development, Ramsgate Road, Sandwich CT13 9ND, U.K
| | - John W. Dolan
- LC Resources, 1795 NW Wallace
Road, McMinnville, Oregon 97128, United States
| | - Christopher A. Pohl
- Thermo Fisher Scientific, 490
Lakeside Drive, Sunnyvale, California 94085, United States
| |
Collapse
|
27
|
Taraji M, Haddad PR, Amos RIJ, Talebi M, Szucs R, Dolan JW, Pohl CA. Chemometric-assisted method development in hydrophilic interaction liquid chromatography: A review. Anal Chim Acta 2017; 1000:20-40. [PMID: 29289311 DOI: 10.1016/j.aca.2017.09.041] [Citation(s) in RCA: 68] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2017] [Revised: 09/22/2017] [Accepted: 09/24/2017] [Indexed: 02/09/2023]
Abstract
With an enormous growth in the application of hydrophilic interaction liquid chromatography (HILIC), there has also been significant progress in HILIC method development. HILIC is a chromatographic method that utilises hydro-organic mobile phases with a high organic content, and a hydrophilic stationary phase. It has been applied predominantly in the determination of small polar compounds. Theoretical studies in computer-aided modelling tools, most importantly the predictive, quantitative structure retention relationship (QSRR) modelling methods, have attracted the attention of researchers and these approaches greatly assist the method development process. This review focuses on the application of computer-aided modelling tools in understanding the retention mechanism, the classification of HILIC stationary phases, prediction of retention times in HILIC systems, optimisation of chromatographic conditions, and description of the interaction effects of the chromatographic factors in HILIC separations. Additionally, what has been achieved in the potential application of QSRR methodology in combination with experimental design philosophy in the optimisation of chromatographic separation conditions in the HILIC method development process is communicated. Developing robust predictive QSRR models will undoubtedly facilitate more application of this chromatographic mode in a broader variety of research areas, significantly minimising cost and time of the experimental work.
Collapse
Affiliation(s)
- Maryam Taraji
- Australian Centre for Research on Separation Science (ACROSS), School of Physical Sciences-Chemistry, University of Tasmania, Private Bag 75, Hobart 7001, Australia
| | - Paul R Haddad
- Australian Centre for Research on Separation Science (ACROSS), School of Physical Sciences-Chemistry, University of Tasmania, Private Bag 75, Hobart 7001, Australia.
| | - Ruth I J Amos
- Australian Centre for Research on Separation Science (ACROSS), School of Physical Sciences-Chemistry, University of Tasmania, Private Bag 75, Hobart 7001, Australia
| | - Mohammad Talebi
- Australian Centre for Research on Separation Science (ACROSS), School of Physical Sciences-Chemistry, University of Tasmania, Private Bag 75, Hobart 7001, Australia
| | - Roman Szucs
- Pfizer Global Research and Development, CT13 9NJ, Sandwich, UK
| | - John W Dolan
- LC Resources, 1795 NW Wallace Rd., McMinnville, OR 97128, USA
| | | |
Collapse
|
28
|
Park SH, Haddad PR, Amos RI, Talebi M, Szucs R, Pohl CA, Dolan JW. Towards a chromatographic similarity index to establish localised Quantitative Structure-Retention Relationships for retention prediction. III Combination of Tanimoto similarity index, log P , and retention factor ratio to identify optimal analyte training sets for ion chromatography. J Chromatogr A 2017; 1520:107-116. [DOI: 10.1016/j.chroma.2017.09.016] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2017] [Revised: 09/02/2017] [Accepted: 09/06/2017] [Indexed: 11/17/2022]
|