1
|
Yan Z, Zhu X, Wang Y, Nie Y, Guan S, Kuo Y, Chang D, Li R, Qiao J, Yan L. scHaplotyper: haplotype construction and visualization for genetic diagnosis using single cell DNA sequencing data. BMC Bioinformatics 2020; 21:41. [PMID: 32007105 PMCID: PMC6995221 DOI: 10.1186/s12859-020-3381-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2019] [Accepted: 01/22/2020] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND Haplotyping reveals chromosome blocks inherited from parents to in vitro fertilized (IVF) embryos in preimplantation genetic diagnosis (PGD), enabling the observation of the transmission of disease alleles between generations. However, the methods of haplotyping that are suitable for single cells are limited because a whole genome amplification (WGA) process is performed before sequencing or genotyping in PGD, and true haplotype profiles of embryos need to be constructed based on genotypes that can contain many WGA artifacts. RESULTS Here, we offer scHaplotyper as a genetic diagnosis tool that reconstructs and visualizes the haplotype profiles of single cells based on the Hidden Markov Model (HMM). scHaplotyper can trace the origin of each haplotype block in the embryo, enabling the detection of carrier status of disease alleles in each embryo. We applied this method in PGD in two families affected with genetic disorders, and the result was the healthy live births of two children in the two families, demonstrating the clinical application of this method. CONCLUSION Next generation sequencing (NGS) of preimplantation embryos enable genetic screening for families with genetic disorders, avoiding the birth of affected babies. With the validation and successful clinical application, we showed that scHaplotyper is a convenient and accurate method to screen out embryos. More patients with genetic disorder will benefit from the genetic diagnosis of embryos. The source code of scHaplotyper is available at GitHub repository: https://github.com/yzqheart/scHaplotyper.
Collapse
Affiliation(s)
- Zhiqiang Yan
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, 100191, China.,Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, 100191, China.,Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, 100191, China.,Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China.,Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, China
| | - Xiaohui Zhu
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, 100191, China.,Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, 100191, China.,Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, 100191, China
| | - Yuqian Wang
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, 100191, China.,Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, 100191, China.,Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, 100191, China
| | - Yanli Nie
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, 100191, China.,Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, 100191, China.,Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, 100191, China
| | - Shuo Guan
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, 100191, China.,Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, 100191, China.,Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, 100191, China
| | - Ying Kuo
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, 100191, China.,Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, 100191, China.,Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, 100191, China
| | - Di Chang
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, 100191, China.,Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, 100191, China.,Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, 100191, China
| | - Rong Li
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, 100191, China.,Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, 100191, China.,Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, 100191, China
| | - Jie Qiao
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, 100191, China.,Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, 100191, China.,Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, 100191, China.,Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China.,Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, China.,Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, 100871, China
| | - Liying Yan
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, 100191, China. .,Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, 100191, China. .,Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, 100191, China.
| |
Collapse
|
2
|
Ma L, Li W, Song Q. Chromosome-Range Whole-Genome High-Throughput Experimental Haplotyping by Single-Chromosome Microdissection. Methods Mol Biol 2017; 1551:161-169. [PMID: 28138846 PMCID: PMC6372095 DOI: 10.1007/978-1-4939-6750-6_9] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Haplotype is fundamental genetic information; it provides essential information for deciphering the functional and etiological roles of genetic variants. As haplotype information is closely related to the functional and etiological impact of genetic variants, it is widely anticipated that haplotype information will be extremely valuable in a wide spectra of applications, including academic research, clinical diagnosis of genetic disease and in the pharmaceutical industry. Haplotyping is essential for LD (linkage disequilibrium) mapping, functional studies on cis-interactions, big data imputation, association studies, population studies, and evolutionary studies. Unfortunately, current sequencing technologies and genotyping arrays do not routinely deliver this information for each individual, but yield only unphased genotypes. Here, we describe a high-throughput and cost-effective experimental protocol to obtain high-resolution chromosomal haplotypes of each individual diploid (including human) genome by the single-chromosome microdissection and sequencing approach.
Collapse
Affiliation(s)
- Li Ma
- 4DGenome Inc, Atlanta, GA, USA. ,Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, 720 Westview Dr SW, Atlanta, GA, 30310, USA.
| | - Wenzhi Li
- Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, 720 Westview Dr SW, Atlanta, GA, 30310, USA.,Center of Big Data and Bioinformatics, First Affiliated Hospital of Medicine School, Xi’an Jiaotong University, Xi’an, Shaanxi, China
| | - Qing Song
- 4DGenome Inc, Atlanta, GA, USA. ,Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, 720 Westview Dr SW, Atlanta, GA, 30310, USA. ,Center of Big Data and Bioinformatics, First Affiliated Hospital of Medicine School, Xi’an Jiaotong University, Xi’an, Shaanxi, China. ,Research Wing D-203, 720 Westview Drive, Atlanta, GA, 30310, USA.
| |
Collapse
|
3
|
Li W, Xu W, He S, Ma L, Song Q. Data supporting the high-accuracy haplotype imputation using unphased genotype data as the references. Data Brief 2016; 8:1412-5. [PMID: 27595130 PMCID: PMC4995474 DOI: 10.1016/j.dib.2016.06.029] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2015] [Revised: 05/10/2016] [Accepted: 06/21/2016] [Indexed: 01/24/2023] Open
Abstract
The data presented in this article is related to the research article entitled "High-accuracy haplotype imputation using unphased genotype data as the references" which reports the unphased genotype data can be used as reference for haplotyping imputation [1]. This article reports different implementation generation pipeline, the results of performance comparison between different implementations (A, B, and C) and between HiFi and three major imputation software tools. Our data showed that the performances of these three implementations are similar on accuracy, in which the accuracy of implementation-B is slightly but consistently higher than A and C. HiFi performed better on haplotype imputation accuracy and three other software performed slightly better on genotype imputation accuracy. These data may provide a strategy for choosing optimal phasing pipeline and software for different studies.
Collapse
Affiliation(s)
- Wenzhi Li
- Center of Big Data and Bioinformatics, First Affiliated Hospital of Medicine School, Xi'an Jiaotong University, Xi'an, Shaanxi, China; Cardiovascular Research Institute, Morehouse School of Medicine, Atlanta, GA, USA
| | - Wei Xu
- Cardiovascular Research Institute, Morehouse School of Medicine, Atlanta, GA, USA
| | | | - Li Ma
- Cardiovascular Research Institute, Morehouse School of Medicine, Atlanta, GA, USA; 4DGenome Inc., Atlanta, GA, USA
| | - Qing Song
- Center of Big Data and Bioinformatics, First Affiliated Hospital of Medicine School, Xi'an Jiaotong University, Xi'an, Shaanxi, China; Cardiovascular Research Institute, Morehouse School of Medicine, Atlanta, GA, USA; 4DGenome Inc., Atlanta, GA, USA
| |
Collapse
|
4
|
Li W, Xu W, Li Q, Ma L, Song Q. References for Haplotype Imputation in the Big Data Era. MOLECULAR BIOLOGY (LOS ANGELES, CALIF.) 2015; 4:143. [PMID: 27274952 PMCID: PMC4888899 DOI: 10.4172/2168-9547.1000143] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Imputation is a powerful in silico approach to fill in those missing values in the big datasets. This process requires a reference panel, which is a collection of big data from which the missing information can be extracted and imputed. Haplotype imputation requires ethnicity-matched references; a mismatched reference panel will significantly reduce the quality of imputation. However, currently existing big datasets cover only a small number of ethnicities, there is a lack of ethnicity-matched references for many ethnic populations in the world, which has hampered the data imputation of haplotypes and its downstream applications. To solve this issue, several approaches have been proposed and explored, including the mixed reference panel, the internal reference panel and genotype-converted reference panel. This review article provides the information and comparison between these approaches. Increasing evidence showed that not just one or two genetic elements dictate the gene activity and functions; instead, cis-interactions of multiple elements dictate gene activity. Cis-interactions require the interacting elements to be on the same chromosome molecule, therefore, haplotype analysis is essential for the investigation of cis-interactions among multiple genetic variants at different loci, and appears to be especially important for studying the common diseases. It will be valuable in a wide spectrum of applications from academic research, to clinical diagnosis, prevention, treatment, and pharmaceutical industry.
Collapse
Affiliation(s)
- Wenzhi Li
- Center of Big Data and Bioinformatics, First Affiliated Hospital of Medicine School, Xi’an Jiaotong University, Xi’an, Shaanxi, China
- Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, Atlanta, Georgia, USA
| | - Wei Xu
- Center of Big Data and Bioinformatics, First Affiliated Hospital of Medicine School, Xi’an Jiaotong University, Xi’an, Shaanxi, China
- Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, Atlanta, Georgia, USA
| | - Qiling Li
- Center of Big Data and Bioinformatics, First Affiliated Hospital of Medicine School, Xi’an Jiaotong University, Xi’an, Shaanxi, China
| | - Li Ma
- Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, Atlanta, Georgia, USA
- 4DGenome Inc, Atlanta, Georgia, USA
| | - Qing Song
- Center of Big Data and Bioinformatics, First Affiliated Hospital of Medicine School, Xi’an Jiaotong University, Xi’an, Shaanxi, China
- Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, Atlanta, Georgia, USA
- 4DGenome Inc, Atlanta, Georgia, USA
| |
Collapse
|
5
|
Bythwood TN, Xu W, Li W, Rao W, Li Q, Xue X, Richards J, Ma L, Song Q. The mirror RNA expression pattern in human tissues. PRECISION MEDICINE 2015; 1:e1036. [PMID: 28280784 PMCID: PMC5340261 DOI: 10.14800/pm.1036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
It has been realized in recent years that non-coding RNAs are playing important roles in genome functions and human diseases. Here we developed a new technology and observed a new pattern of gene expression. We observed that over 72% of RNAs in human genome are expressed in forward-reverse pairs, just like mirror images of each other between forward expression and reverse expression; the overview showed that it cannot be simply described as transcript overlapping, so we designated it as mirror expression. Furthermore, we found that the mirror expression is gene-specific and tissue-specific, and less common in the proximal promoter regions. The size of the shadows varies between different genes, different tissues and different classes. The shadow expression is most significant in the Alu element, it was also observed among L1, Simple Repeats and LTR elements, but rare in other repeats such as low-complexity, LINE/L2, DNA and MIRs. Although there is no evidence yet about the relationship of this mirror pattern and double-strand RNA (dsRNA), this new striking pattern provides a new clue and a new direction to unveil the role of RNAs in the genome functions and diseases.
Collapse
Affiliation(s)
- Tameka N. Bythwood
- Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, Atlanta, Georgia 30310, USA
| | - Wei Xu
- Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, Atlanta, Georgia 30310, USA
- Center for Big Data Science, First Affiliated Hospital of Medical School, Xi’an Jiaotong University, Xi’an, Shaanxi 710061, China
| | - Wenzhi Li
- Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, Atlanta, Georgia 30310, USA
- Center for Big Data Science, First Affiliated Hospital of Medical School, Xi’an Jiaotong University, Xi’an, Shaanxi 710061, China
| | - Weinian Rao
- Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, Atlanta, Georgia 30310, USA
- 4DGenome Inc, Atlanta, Georgia 30033, USA
| | - Qiling Li
- Center for Big Data Science, First Affiliated Hospital of Medical School, Xi’an Jiaotong University, Xi’an, Shaanxi 710061, China
| | - Xue Xue
- Center for Big Data Science, First Affiliated Hospital of Medical School, Xi’an Jiaotong University, Xi’an, Shaanxi 710061, China
| | - Jendai Richards
- Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, Atlanta, Georgia 30310, USA
| | - Li Ma
- Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, Atlanta, Georgia 30310, USA
- 4DGenome Inc, Atlanta, Georgia 30033, USA
- Center for Big Data Science, First Affiliated Hospital of Medical School, Xi’an Jiaotong University, Xi’an, Shaanxi 710061, China
| | - Qing Song
- Cardiovascular Research Institute and Department of Medicine, Morehouse School of Medicine, Atlanta, Georgia 30310, USA
- 4DGenome Inc, Atlanta, Georgia 30033, USA
- Center for Big Data Science, First Affiliated Hospital of Medical School, Xi’an Jiaotong University, Xi’an, Shaanxi 710061, China
| |
Collapse
|