Kazantsev K, Toukach P. Remediation of the NMR data of natural glycans.
Int J Biol Macromol 2024;
282:137042. [PMID:
39521218 DOI:
10.1016/j.ijbiomac.2024.137042]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Revised: 09/05/2024] [Accepted: 10/27/2024] [Indexed: 11/16/2024]
Abstract
Primary structure elucidation in glycobiology is strongly affected by published structure-reporting NMR signals, especially on the 13C nucleus. The glycan NMR simulation accuracy and machine learning outcome depend on the quality of the NMR signal assignment in glycan databases. Within our work on improving the data quality in the Carbohydrate Structure Database (CSDB), we have applied a systematic search for inconsistencies in the published NMR data. The search was based on a bulk comparison between the experimental and simulated 13C NMR chemical shifts and manual analysis of the mismatches. On the basis of this analysis, CSDB was remediated by marking and correcting the NMR errors found in 272 structure elucidation reports published over the past 40 years.
Collapse