1
|
Yassi M, Shams Davodly E, Hajebi Khaniki S, Kerachian MA. HBCR_DMR: A Hybrid Method Based on Beta-Binomial Bayesian Hierarchical Model and Combination of Ranking Method to Detect Differential Methylation Regions in Bisulfite Sequencing Data. J Pers Med 2024; 14:361. [PMID: 38672987 PMCID: PMC11051304 DOI: 10.3390/jpm14040361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2023] [Revised: 10/20/2023] [Accepted: 01/09/2024] [Indexed: 04/28/2024] Open
Abstract
DNA methylation is a key epigenetic modification involved in gene regulation, contributing to both physiological and pathological conditions. For a more profound comprehension, it is essential to conduct a precise comparison of DNA methylation patterns between sample groups that represent distinct statuses. Analysis of differentially methylated regions (DMRs) using computational approaches can help uncover the precise relationships between these phenomena. This paper describes a hybrid model that combines the beta-binomial Bayesian hierarchical model with a combination of ranking methods known as HBCR_DMR. During the initial phase, we model the actual methylation proportions of the CpG sites (CpGs) within the replicates. This modeling is achieved through beta-binomial distribution, with parameters set by a group mean and a dispersion parameter. During the second stage, we establish the selection of distinguishing CpG sites based on their methylation status, employing multiple ranking techniques. Finally, we combine the ranking lists of differentially methylated CpG sites through a voting system. Our analyses, encompassing simulations and real data, reveal outstanding performance metrics, including a sensitivity of 0.72, specificity of 0.89, and an F1 score of 0.76, yielding an overall accuracy of 0.82 and an AUC of 0.94. These findings underscore HBCR_DMR's robust capacity to distinguish methylated regions, confirming its utility as a valuable tool for DNA methylation analysis.
Collapse
Affiliation(s)
- Maryam Yassi
- Cancer Genetics Research Unit, Reza Radiotherapy and Oncology Center, Mashhad 9184156815, Iran; (M.Y.); (E.S.D.)
- Department of Mathematics and Statistics, University of Otago, Dunedin 9054, New Zealand
- Department of Pathology, Dunedin School of Medicine, University of Otago, Dunedin 9054, New Zealand
| | - Ehsan Shams Davodly
- Cancer Genetics Research Unit, Reza Radiotherapy and Oncology Center, Mashhad 9184156815, Iran; (M.Y.); (E.S.D.)
| | - Saeedeh Hajebi Khaniki
- Student Research Committee, Department of Biostatistics, School of Health, Mashhad University of Medical Sciences, Mashhad 9177948564, Iran;
| | - Mohammad Amin Kerachian
- Cancer Genetics Research Unit, Reza Radiotherapy and Oncology Center, Mashhad 9184156815, Iran; (M.Y.); (E.S.D.)
- Medical Genetics Research Center, Mashhad University of Medical Sciences, Mashhad 9177948564, Iran
- Department of Medical Genetics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad 9177948564, Iran
- Department of Chemistry and Biology, Toronto Metropolitan University, Toronto, ON M5B 2K3, Canada
| |
Collapse
|
2
|
Zheng X, Wu Q, Wu H, Leung KS, Wong MH, Liu X, Cheng L. Evaluating the Consistency of Gene Methylation in Liver Cancer Using Bisulfite Sequencing Data. Front Cell Dev Biol 2021; 9:671302. [PMID: 33996828 PMCID: PMC8116545 DOI: 10.3389/fcell.2021.671302] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Accepted: 04/06/2021] [Indexed: 01/07/2023] Open
Abstract
Bisulfite sequencing is considered as the gold standard approach for measuring DNA methylation, which acts as a pivotal part in regulating a variety of biological processes without changes in DNA sequences. In this study, we introduced the most prevalent methods for processing bisulfite sequencing data and evaluated the consistency of the data acquired from different measurements in liver cancer. Firstly, we introduced three commonly used bisulfite sequencing assays, i.e., reduced-representation bisulfite sequencing (RRBS), whole-genome bisulfite sequencing (WGBS), and targeted bisulfite sequencing (targeted BS). Next, we discussed the principles and compared different methods for alignment, quality assessment, methylation level scoring, and differentially methylated region identification. After that, we screened differential methylated genes in liver cancer through the three bisulfite sequencing assays and evaluated the consistency of their results. Ultimately, we compared bisulfite sequencing to 450 k beadchip and assessed the statistical similarity and functional association of differentially methylated genes (DMGs) among the four assays. Our results demonstrated that the DMGs measured by WGBS, RRBS, targeted BS and 450 k beadchip are consistently hypo-methylated in liver cancer with high functional similarity.
Collapse
Affiliation(s)
- Xubin Zheng
- Department of Critical Medicine, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China.,Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China
| | - Qiong Wu
- Department of Critical Medicine, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China.,School of Biomedical Sciences, The Chinese University of Hong Kong, Hong Kong, China
| | - Haonan Wu
- Department of Critical Medicine, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Kwong-Sak Leung
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China
| | - Man-Hon Wong
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China
| | - Xueyan Liu
- Department of Critical Medicine, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| | - Lixin Cheng
- Department of Critical Medicine, Shenzhen People's Hospital, First Affiliated Hospital of Southern University of Science and Technology, Second Clinical Medicine College of Jinan University, Shenzhen, China
| |
Collapse
|
3
|
Li M, Tang L, Liao Z, Luo J, Wu F, Pan Y, Wang J. A novel scaffolding algorithm based on contig error correction and path extension. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 16:764-773. [PMID: 30040649 DOI: 10.1109/tcbb.2018.2858267] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The sequence assembly process can be divided into three stages: contigs extension, scaffolding, and gap filling. The scaffolding method is an essential step during the process to infer the direction and sequence relationships between the contigs. However, scaffolding still faces the challenges of uneven sequencing depth, genome repetitive regions, and sequencing errors, which often leads to many false relationships between contigs. The performance of scaffolding can be improved by removing potential false conjunctions between contigs. In this study, a novel scaffolding algorithm which is on the basis of path extension Loose-Strict-Loose strategy and contig error correction, called iLSLS. iLSLS helps reduce the false relationships between contigs, and improve the accuracy of subsequent steps. iLSLS utilizes a scoring function, which estimates the correctness of candidate paths by the distribution of paired reads, and try to conduction the extension with the path which is scored the highest. What's more, iLSLS can precisely estimate the gap size. We conduct experiments on two real datasets, and the results show that LSLS strategy is efficient to increase the correctness of scaffolds, and iLSLS performs better than other scaffolding methods.
Collapse
|