1
|
Chapman T, Lassmann T. Single-cell data combined with phenotypes improves variant interpretation. BMC Genomics 2025; 26:540. [PMID: 40437370 PMCID: PMC12117811 DOI: 10.1186/s12864-025-11711-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2025] [Accepted: 05/14/2025] [Indexed: 06/01/2025] Open
Abstract
BACKGROUND Whole genome sequencing offers significant potential to improve the diagnosis and treatment of rare diseases by enabling the identification of thousands of rare, potentially pathogenic variants. Existing variant prioritisation tools can be complemented by approaches that incorporate phenotype specificity and provide contextual biological information, such as tissue or cell-type specificity. We hypothesised that integrating single-cell gene expression data into phenotype-specific models would improve the accuracy and interpretability of pathogenic variant prioritisation. METHODS To test this hypothesis, we developed IMPPROVE, a new tool that constructs phenotype-specific ensemble models integrating CADD scores with bulk and single-cell gene expression data. We constructed a total of 1,866 Random Forest models for individual HPO terms, incorporating both bulk and single cell expression data. RESULTS Our phenotype-specific models utilising expression data can better predict pathogenic variants in 90% of the phenotypes (HPO terms) considered. Using single-cell expression data instead of bulk benefited the models, significantly shifting the proportion of pathogenic variants that were correctly identified at a fixed false positive rate ( p < 10 - 30 , using an approximate Wilcoxon signed rank test). We found 57 phenotypes' models exhibited a large performance difference, depending on the dataset used. Further analysis revealed biological links between the pathology and the tissues or cell-types used by these 57 models. CONCLUSIONS Phenotype-specific models that integrate gene expression data with CADD scores show great promise in improving variant prioritisation. In addition to improving diagnostic accuracy, these models offer insights into the underlying biological mechanisms of rare diseases. Enriching existing pathogenicity-related scores with gene expression datasets has the potential to advance personalised medicine through more accurate and interpretable variant prioritisation.
Collapse
Affiliation(s)
- Timothy Chapman
- The Kids Research Institute Australia, 15 Hospital Ave, Nedlands, WA, 6009, Australia
- UWA Centre for Child Health Research, The University of Western Australia, 35 Stirling Hwy, Crawley, Western Autralia, 6009, Australia
| | - Timo Lassmann
- The Kids Research Institute Australia, 15 Hospital Ave, Nedlands, WA, 6009, Australia.
- UWA Centre for Child Health Research, The University of Western Australia, 35 Stirling Hwy, Crawley, Western Autralia, 6009, Australia.
| |
Collapse
|
2
|
Hong Y, Jeong S, Park MJ, Song W, Lee N. Application of Pathomic Features for Differentiating Dysplastic Cells in Patients with Myelodysplastic Syndrome. Bioengineering (Basel) 2024; 11:1230. [PMID: 39768048 PMCID: PMC11673167 DOI: 10.3390/bioengineering11121230] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2024] [Revised: 11/29/2024] [Accepted: 11/29/2024] [Indexed: 01/11/2025] Open
Abstract
Myelodysplastic syndromes (MDSs) are a group of hematologic neoplasms accompanied by dysplasia of bone marrow (BM) hematopoietic cells with cytopenia. Recently, digitalized pathology and pathomics using computerized feature analysis have been actively researched for classifying and predicting prognosis in various tumors of hematopoietic tissues. This study analyzed the pathomic features of hematopoietic cells in BM aspiration smears of patients with MDS according to each hematopoietic cell lineage and dysplasia. We included 24 patients with an MDS and 21 with normal BM. The 12,360 hematopoietic cells utilized were to be classified into seven types: normal erythrocytes, normal granulocytes, normal megakaryocytes, dysplastic erythrocytes, dysplastic granulocytes, dysplastic megakaryocytes, and others. Four hundred seventy-six pathomic features quantifying cell intensity, shape, and texture were extracted from each segmented cell. After comparing the combination of feature selection and machine learning classifier methods using 5-fold cross-validation area under the receiver operating characteristic curve (AUROC), the quadratic discriminant analysis (QDA) with gradient boosting decision tree (AUROC = 0.63) and QDA with eXtreme gradient boosting (XGB) (AUROC = 0.64) showed a high AUROC combination. Through a feature selection process, 30 characteristics were further analyzed. Dysplastic erythrocytes and granulocytes showed lower median values on heatmap analysis compared to that of normal erythrocytes and granulocytes. The data suggest that pathomic features could be applied to cell differentiation in hematologic malignancies. It could be used as a new biomarker with an auxiliary role for more accurate diagnosis. Further studies including prediction survival and prognosis with larger cohort of patients are needed.
Collapse
Affiliation(s)
- Youngtaek Hong
- CONNECT-AI Research Center, Yonsei University College of Medicine, Seoul 03764, Republic of Korea;
| | - Seri Jeong
- Department of Laboratory Medicine, Kangnam Sacred Heart Hospital, Hallym University College of Medicine, Seoul 07440, Republic of Korea; (S.J.); (M.-J.P.); (W.S.)
| | - Min-Jeong Park
- Department of Laboratory Medicine, Kangnam Sacred Heart Hospital, Hallym University College of Medicine, Seoul 07440, Republic of Korea; (S.J.); (M.-J.P.); (W.S.)
| | - Wonkeun Song
- Department of Laboratory Medicine, Kangnam Sacred Heart Hospital, Hallym University College of Medicine, Seoul 07440, Republic of Korea; (S.J.); (M.-J.P.); (W.S.)
| | - Nuri Lee
- Department of Laboratory Medicine, Kangnam Sacred Heart Hospital, Hallym University College of Medicine, Seoul 07440, Republic of Korea; (S.J.); (M.-J.P.); (W.S.)
| |
Collapse
|
3
|
Mukherjee S, Dong W, Schiltz NK, Stange KC, Cullen J, Gerds AT, Carraway HE, Singh A, Advani AS, Sekeres MA, Koroukian SM. Patterns of Diagnostic Evaluation and Determinants of Treatment in Older Patients With Non-transfusion Dependent Myelodysplastic Syndromes. Oncologist 2023; 28:901-910. [PMID: 37120291 PMCID: PMC10546824 DOI: 10.1093/oncolo/oyad114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Accepted: 03/20/2023] [Indexed: 05/01/2023] Open
Abstract
BACKGROUND Older patients with myelodysplastic syndromes (MDS), particularly those with no or one cytopenia and no transfusion dependence, typically have an indolent course. Approximately, half of these receive the recommended diagnostic evaluation (DE) for MDS. We explored factors determining DE in these patients and its impact on subsequent treatment and outcomes. PATIENTS AND METHODS We used 2011-2014 Medicare data to identify patients ≥66 years of age diagnosed with MDS. We used Classification and Regression Tree (CART) analysis to identify combinations of factors associated with DE and its impact on subsequent treatment. Variables examined included demographics, comorbidities, nursing home status, and investigative procedures performed. We conducted a logistic regression analysis to identify correlates associated with receipt of DE and treatment. RESULTS Of 16 851 patients with MDS, 51% underwent DE. patients with MDS with no cytopenia (n = 3908) had the lowest uptake of DE (34.7%). Compared to patients with no cytopenia, those with any cytopenia had nearly 3 times higher odds of receiving DE [adjusted odds ratio (AOR), 2.81: 95% CI, 2.60-3.04] and the odds were higher for men than for women [AOR, 1.39: 95%CI, 1.30-1.48] and for Non-Hispanic Whites [vs. everyone else (AOR, 1.17: 95% CI, 1.06-1.29)]. The CART showed DE as the principal discriminating node, followed by the presence of any cytopenia for receiving MDS treatment. The lowest percentage of treatment was observed in patients without DE, at 14.6%. CONCLUSION In this select older patients with MDS, we identified disparities in accurate diagnosis by demographic and clinical factors. Receipt of DE influenced subsequent treatment but not survival.
Collapse
Affiliation(s)
- Sudipto Mukherjee
- Leukemia Program, Department of Hematology and Medical Oncology, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Weichuan Dong
- Department of Population and Quantitative Health Sciences, Case Western Reserve University School of Medicine, Cleveland, OH, USA
- Population Cancer Analytics Shared Resource, Case Comprehensive Cancer Center, Cleveland, OH, USA
| | - Nicholas K Schiltz
- Frances P. Bolton School of Nursing, Case Western Reserve University, Cleveland, OH, USA
| | - Kurt C Stange
- Department of Population and Quantitative Health Sciences, Case Western Reserve University School of Medicine, Cleveland, OH, USA
| | - Jennifer Cullen
- Department of Population and Quantitative Health Sciences, Case Western Reserve University School of Medicine, Cleveland, OH, USA
| | - Aaron T Gerds
- Leukemia Program, Department of Hematology and Medical Oncology, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Hetty E Carraway
- Leukemia Program, Department of Hematology and Medical Oncology, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Abhay Singh
- Leukemia Program, Department of Hematology and Medical Oncology, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Anjali S Advani
- Leukemia Program, Department of Hematology and Medical Oncology, Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Mikkael A Sekeres
- Division of Hematology, Sylvester Comprehensive Cancer Center, University of Florida, Miami, FL, USA
| | - Siran M Koroukian
- Department of Population and Quantitative Health Sciences, Case Western Reserve University School of Medicine, Cleveland, OH, USA
- Population Cancer Analytics Shared Resource, Case Comprehensive Cancer Center, Cleveland, OH, USA
| |
Collapse
|
4
|
A geno-clinical decision model for the diagnosis of myelodysplastic syndromes. Blood Adv 2021; 5:4361-4369. [PMID: 34592765 PMCID: PMC8579270 DOI: 10.1182/bloodadvances.2021004755] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Accepted: 07/28/2021] [Indexed: 11/28/2022] Open
Abstract
We developed a machine learning–based model to assist in the differential diagnosis of myeloid malignancies. Our work also describes genotype-phenotype correlations in different myeloid malignancies.
The differential diagnosis of myeloid malignancies is challenging and subject to interobserver variability. We used clinical and next-generation sequencing (NGS) data to develop a machine learning model for the diagnosis of myeloid malignancies independent of bone marrow biopsy data based on a 3-institution, international cohort of patients. The model achieves high performance, with model interpretations indicating that it relies on factors similar to those used by clinicians. In addition, we describe associations between NGS findings and clinically important phenotypes and introduce the use of machine learning algorithms to elucidate clinicogenomic relationships.
Collapse
|
5
|
Shallis RM, Xu ML, Podoltsev NA, Curtis SA, Considine BT, Khanna SR, Siddon AJ, Zeidan AM. Be careful of the masquerades: differentiating secondary myelodysplasia from myelodysplastic syndromes in clinical practice. Ann Hematol 2018; 97:2333-2343. [DOI: 10.1007/s00277-018-3474-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2018] [Accepted: 08/06/2018] [Indexed: 12/17/2022]
|
6
|
Sasada K, Yamamoto N, Masuda H, Tanaka Y, Ishihara A, Takamatsu Y, Yatomi Y, Katsuda W, Sato I, Matsui H. Inter-observer variance and the need for standardization in the morphological classification of myelodysplastic syndrome. Leuk Res 2018; 69:54-59. [PMID: 29656215 DOI: 10.1016/j.leukres.2018.04.003] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2018] [Revised: 03/23/2018] [Accepted: 04/03/2018] [Indexed: 01/29/2023]
Abstract
In this era of genome medicine, the sub-classification of myeloid neoplasms, including myelodysplastic syndrome (MDS), is now supported by genetic testing in selected cases. However, as the initial suspicion and primary diagnosis of the disease still largely relies on morphological features and numbers of hematopoietic cells, the establishment of a uniform diagnostic basis, especially for cell morphology, is essential. In this study, we collected nearly 100,000 hematopoietic cell images from 499 peripheral blood smear specimens from patients with MDS and used these to evaluate the standardization of morphological classification by medical technologists. The observers in this study ranged between two to eleven for each image, and the images were classified according to MDS criteria through a web-based system. We found considerable inter-observer variance in the assessment of dysplastic features. Observers did not recognize cytoplasmic hypo-granularity unless almost all granules in neutrophils were absent. Pseudo Pelger-Huët anomalies were also often overlooked, except for cells with a very typical "pince-nez" appearance. Taken together, this study suggests a requirement for further standardization in terms of morphological cell classification, and a need for the development of automatic cell classification-supporting devices for the accurate diagnosis of MDS.
Collapse
Affiliation(s)
- Keiko Sasada
- Department of Laboratory Medicine, Kumamoto University Hospital, Kumamoto University, Japan
| | - Noriko Yamamoto
- Department of Laboratory Medicine, Kumamoto University Hospital, Kumamoto University, Japan
| | - Hiroki Masuda
- Department of Laboratory Medicine, Kumamoto University Hospital, Kumamoto University, Japan
| | - Yoko Tanaka
- Department of Laboratory Medicine, Kumamoto University Hospital, Kumamoto University, Japan
| | - Ayako Ishihara
- Department of Laboratory Medicine, Kumamoto University Hospital, Kumamoto University, Japan
| | - Yasushi Takamatsu
- Department of Medical Oncology, Hematology and Infectious Diseases, Fukuoka University, Japan
| | - Yutaka Yatomi
- Department of Clinical Laboratory Medicine, Graduate School of Medicine, The University of Tokyo, Japan
| | | | - Issei Sato
- Medical Image Analysis Team, Center for Advanced Intelligence Project, Institute of Physical and Chemical Research (RIKEN), Japan
| | - Hirotaka Matsui
- Department of Laboratory Medicine, Kumamoto University Hospital, Kumamoto University, Japan; Medical Image Analysis Team, Center for Advanced Intelligence Project, Institute of Physical and Chemical Research (RIKEN), Japan; Department of Molecular Laboratory Medicine, Faculty of Life Sciences, Kumamoto University, Japan.
| |
Collapse
|