Zhang X, Zhou C, Hu J, Hu J, Ding Y, Chen S, Wang X, Xu L, Gou Z, Zhang S, Shi W. Six-gene prognostic signature for non-alcoholic fatty liver disease susceptibility using machine learning.
Medicine (Baltimore) 2024;
103:e38076. [PMID:
38728481 PMCID:
PMC11081587 DOI:
10.1097/md.0000000000038076]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Accepted: 04/10/2024] [Indexed: 05/12/2024] Open
Abstract
BACKGROUND
nonalcoholic fatty liver disease (NAFLD) is a common liver disease affecting the global population and its impact on human health will continue to increase. Genetic susceptibility is an important factor influencing its onset and progression, and there is a lack of reliable methods to predict the susceptibility of normal populations to NAFLD using appropriate genes.
METHODS
RNA sequencing data relating to nonalcoholic fatty liver disease was analyzed using the "limma" package within the R software. Differentially expressed genes were obtained through preliminary intersection screening. Core genes were analyzed and obtained by establishing and comparing 4 machine learning models, then a prediction model for NAFLD was constructed. The effectiveness of the model was then evaluated, and its applicability and reliability verified. Finally, we conducted further gene correlation analysis, analysis of biological function and analysis of immune infiltration.
RESULTS
By comparing 4 machine learning algorithms, we identified SVM as the optimal model, with the first 6 genes (CD247, S100A9, CSF3R, DIP2C, OXCT 2 and PRAMEF16) as predictive genes. The nomogram was found to have good reliability and effectiveness. Six genes' receiver operating characteristic curves (ROC) suggest an essential role in NAFLD pathogenesis, and they exhibit a high predictive value. Further analysis of immunology demonstrated that these 6 genes were closely connected to various immune cells and pathways.
CONCLUSION
This study has successfully constructed an advanced and reliable prediction model based on 6 diagnostic gene markers to predict the susceptibility of normal populations to NAFLD, while also providing insights for potential targeted therapies.
Collapse