1
|
Xiao Y, Zhu Y, Chen J, Wu M, Wang L, Su L, Feng F, Hou Y. Overexpression of SYNGAP1 suppresses the proliferation of rectal adenocarcinoma via Wnt/β-Catenin signaling pathway. Discov Oncol 2024; 15:135. [PMID: 38679635 PMCID: PMC11056356 DOI: 10.1007/s12672-024-00997-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Accepted: 04/24/2024] [Indexed: 05/01/2024] Open
Abstract
Rectal adenocarcinoma (READ) is a common malignant tumor of the digestive tract. Growing studies have confirmed Ras GTPase-activating proteins are involved in the progression of several tumors. This study aimed to explore the expression and function of Ras GTPase-activating proteins in READ. In this study, we analyzed RNA sequencing data from 165 patients with READ and 789 normal tissue samples, identifying 5603 differentially expressed genes (DEGs), including 2937 upregulated genes and 2666 downregulated genes. Moreover, we also identified two dysregulated genes, RASA4 and SYNGAP1, among six Ras GTPase-activating proteins. High NF1 expression was associated with longer overall survival, while high SYNGAP1 expression showed a trend towards extended overall survival. Further analysis revealed the mutation frequency and copy number variations of Ras GTPase-activating proteins in various cancer samples. Additionally, DNA methylation analysis demonstrated a negative correlation between DNA methylation of Ras GTPase-activating proteins and their expression. Moreover, among Ras GTPase-activating proteins, we focused on SYNGAP1, and experimental validation confirmed that the overexpression of SYNGAP1 in READ significantly suppressed READ cell proliferation and increased apoptosis via regulating the Wnt/β-Catenin signaling pathway. These findings underscored the potential significance of SYNGAP1 in READ and provide new insights for further research and treatment.
Collapse
Affiliation(s)
- Yun Xiao
- Department of Oncology and Hematology, Chongqing Hospital of Traditional Chinese Medicine, Chongqing, China
| | - Ying Zhu
- Department of Oncology and Hematology, Chongqing Hospital of Traditional Chinese Medicine, Chongqing, China
| | - Jiaojiao Chen
- Department of Oncology and Hematology, Chongqing Hospital of Traditional Chinese Medicine, Chongqing, China
| | - Mei Wu
- Department of Oncology and Hematology, Chongqing Hospital of Traditional Chinese Medicine, Chongqing, China
| | - Lan Wang
- Department of Oncology and Hematology, Chongqing Hospital of Traditional Chinese Medicine, Chongqing, China
| | - Li Su
- Department of Oncology and Hematology, Chongqing Hospital of Traditional Chinese Medicine, Chongqing, China
| | - Fei Feng
- Department of Oncology and Hematology, Chongqing Hospital of Traditional Chinese Medicine, Chongqing, China.
| | - Yanli Hou
- Department of Oncology and Hematology, Chongqing Hospital of Traditional Chinese Medicine, Chongqing, China.
| |
Collapse
|
2
|
Yang S, Ning C, Yang C, Li W, Zhang Q, Wang D, Tang H. Identify Candidate Genes Associated with the Weight and Egg Quality Traits in Wenshui Green Shell-Laying Chickens by the Copy Number Variation-Based Genome-Wide Association Study. Vet Sci 2024; 11:76. [PMID: 38393094 PMCID: PMC10892766 DOI: 10.3390/vetsci11020076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 02/03/2024] [Accepted: 02/04/2024] [Indexed: 02/25/2024] Open
Abstract
Copy number variation (CNV), as an essential source of genetic variation, can have an impact on gene expression, genetic diversity, disease susceptibility, and species evolution in animals. To better understand the weight and egg quality traits of chickens, this paper aimed to detect CNVs in Wenshui green shell-laying chickens and conduct a copy number variation regions (CNVRs)-based genome-wide association study (GWAS) to identify variants and candidate genes associated with their weight and egg quality traits to support related breeding efforts. In our paper, we identified 11,035 CNVRs in Wenshui green shell-laying chickens, which collectively spanned a length of 13.1 Mb, representing approximately 1.4% of its autosomal genome. Out of these CNVRs, there were 10,446 loss types, 491 gain types, and 98 mixed types. Notably, two CNVRs showed significant correlations with egg quality, while four CNVRs exhibited significant associations with body weight. These significant CNVRs are located on chromosome 4. Further analysis identified potential candidate genes that influence weight and egg quality traits, including FAM184B, MED28, LAP3, ATOH8, ST3GAL5, LDB2, and SORCS2. In this paper, the CNV map of the Wenshui green shell-laying chicken genome was constructed for the first time through population genotyping. Additionally, CNVRs can be employed as molecular markers to genetically improve chickens' weight and egg quality traits.
Collapse
Affiliation(s)
- Suozhou Yang
- Key Laboratory of Efficient Utilization of Non-Grain Feed Resources (Co-Construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Shandong Agricultural University, 61 Daizong Street, Tai’an 271018, China; (S.Y.); (C.N.); (C.Y.); (W.L.)
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, Shandong Agricultural University, 61 Daizong Street, Tai’an 271018, China;
| | - Chao Ning
- Key Laboratory of Efficient Utilization of Non-Grain Feed Resources (Co-Construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Shandong Agricultural University, 61 Daizong Street, Tai’an 271018, China; (S.Y.); (C.N.); (C.Y.); (W.L.)
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, Shandong Agricultural University, 61 Daizong Street, Tai’an 271018, China;
| | - Cheng Yang
- Key Laboratory of Efficient Utilization of Non-Grain Feed Resources (Co-Construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Shandong Agricultural University, 61 Daizong Street, Tai’an 271018, China; (S.Y.); (C.N.); (C.Y.); (W.L.)
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, Shandong Agricultural University, 61 Daizong Street, Tai’an 271018, China;
| | - Wenqiang Li
- Key Laboratory of Efficient Utilization of Non-Grain Feed Resources (Co-Construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Shandong Agricultural University, 61 Daizong Street, Tai’an 271018, China; (S.Y.); (C.N.); (C.Y.); (W.L.)
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, Shandong Agricultural University, 61 Daizong Street, Tai’an 271018, China;
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, Shandong Agricultural University, 61 Daizong Street, Tai’an 271018, China;
- College of Animal Science and Technology, China Agricultural University, Beijing 100083, China
| | - Dan Wang
- Key Laboratory of Efficient Utilization of Non-Grain Feed Resources (Co-Construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Shandong Agricultural University, 61 Daizong Street, Tai’an 271018, China; (S.Y.); (C.N.); (C.Y.); (W.L.)
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, Shandong Agricultural University, 61 Daizong Street, Tai’an 271018, China;
| | - Hui Tang
- Key Laboratory of Efficient Utilization of Non-Grain Feed Resources (Co-Construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Shandong Agricultural University, 61 Daizong Street, Tai’an 271018, China; (S.Y.); (C.N.); (C.Y.); (W.L.)
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, Shandong Agricultural University, 61 Daizong Street, Tai’an 271018, China;
| |
Collapse
|
3
|
Liu X, Dai H, Li G, Jia R, Meng X, Yu S, Yang L, Hong J. Screening copy number variations in 35 unsolved inherited retinal disease families. Hum Genet 2024; 143:197-210. [PMID: 38282009 PMCID: PMC10881639 DOI: 10.1007/s00439-023-02631-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Accepted: 12/15/2023] [Indexed: 01/30/2024]
Abstract
The purpose of this study was to screen Copy Number Variations (CNVs) in 35 unsolved Inherited Retinal Dystrophy (IRD) families. Initially, next generation sequencing, including a specific Hereditary Eye Disease Enrichment Panel or Whole exome sequencing, was employed to screen (likely) pathogenic Single-nucleotide Variants (SNVs) and small Insertions and Deletions (indels) for these cases. All available SNVs and indels were further validated and co-segregation analyses were performed in available family members by Sanger sequencing. If not, after excluding deep intronic variants, Multiplex ligation-dependent probe amplification (MLPA), quantitative fluorescence PCR (QF-PCR) and Sanger sequencing were employed to screen CNVs. We determined that 18 probands who had heterozygous SNVs/indels or whose parents were not consanguineous but had homozygous SNVs/indels in autosomal recessive IRDs genes had CNVs in another allele of these genes, 11 families had disease-causing hemizygous CNVs in X-linked IRD genes, 6 families had (likely) pathogenic heterozygous CNVs in PRPF31 gene. Of 35 families, 33 different CNVs in 16 IRD-associated genes were detected, with PRPF31, EYS and USH2A the most common disease-causing gene in CNVs. Twenty-six and 7 of them were deletion and duplication CNVs, respectively. Among them, 14 CNVs were first reported in this study. Our research indicates that CNVs contribute a lot to IRDs, and screening of CNVs substantially increases the diagnostic rate of IRD. Our results emphasize that MLPA and QF-PCR are ideal methods to validate CNVs, and the novel CNVs reported herein expand the mutational spectrums of IRDs.
Collapse
Affiliation(s)
- Xiaozhen Liu
- Department of Ophthalmology, Peking University Third Hospital, Beijing, 100191, China
- Beijing Key Laboratory of Restoration of Damaged Ocular Nerve, Peking University Third Hospital, Beijing, 100191, China
| | - Hehua Dai
- Department of Ophthalmology, The First Affiliated Hospital of Shandong First Medical University & Shandong Provincial Qianfoshan Hospital, Jinan, 250014, China
| | - Genlin Li
- Beijing Tongren Eye Center, Beijing Tongren Hospital, Capital Medical University, Beijing Ophthalmology & Visual Sciences Key Lab, Beijing, 100730, China
| | - Ruixuan Jia
- Department of Ophthalmology, Peking University Third Hospital, Beijing, 100191, China
- Beijing Key Laboratory of Restoration of Damaged Ocular Nerve, Peking University Third Hospital, Beijing, 100191, China
| | - Xiang Meng
- Department of Ophthalmology, Peking University Third Hospital, Beijing, 100191, China
- Beijing Key Laboratory of Restoration of Damaged Ocular Nerve, Peking University Third Hospital, Beijing, 100191, China
| | - Shicheng Yu
- Department of Ophthalmology, Peking University Third Hospital, Beijing, 100191, China
- Beijing Key Laboratory of Restoration of Damaged Ocular Nerve, Peking University Third Hospital, Beijing, 100191, China
| | - Liping Yang
- Department of Ophthalmology, Peking University Third Hospital, Beijing, 100191, China.
- Beijing Key Laboratory of Restoration of Damaged Ocular Nerve, Peking University Third Hospital, Beijing, 100191, China.
| | - Jing Hong
- Department of Ophthalmology, Peking University Third Hospital, Beijing, 100191, China.
- Beijing Key Laboratory of Restoration of Damaged Ocular Nerve, Peking University Third Hospital, Beijing, 100191, China.
| |
Collapse
|
4
|
Zech M, Winkelmann J. Next-generation sequencing and bioinformatics in rare movement disorders. Nat Rev Neurol 2024; 20:114-126. [PMID: 38172289 DOI: 10.1038/s41582-023-00909-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/29/2023] [Indexed: 01/05/2024]
Abstract
The ability to sequence entire exomes and genomes has revolutionized molecular testing in rare movement disorders, and genomic sequencing is becoming an integral part of routine diagnostic workflows for these heterogeneous conditions. However, interpretation of the extensive genomic variant information that is being generated presents substantial challenges. In this Perspective, we outline multidimensional strategies for genetic diagnosis in patients with rare movement disorders. We examine bioinformatics tools and computational metrics that have been developed to facilitate accurate prioritization of disease-causing variants. Additionally, we highlight community-driven data-sharing and case-matchmaking platforms, which are designed to foster the discovery of new genotype-phenotype relationships. Finally, we consider how multiomic data integration might optimize diagnostic success by combining genomic, epigenetic, transcriptomic and/or proteomic profiling to enable a more holistic evaluation of variant effects. Together, the approaches that we discuss offer pathways to the improved understanding of the genetic basis of rare movement disorders.
Collapse
Affiliation(s)
- Michael Zech
- Institute of Human Genetics, School of Medicine, Technical University of Munich, Munich, Germany
- Institute of Neurogenomics, Helmholtz Zentrum München, Munich, Germany
- Institute for Advanced Study, Technical University of Munich, Garching, Germany
| | - Juliane Winkelmann
- Institute of Human Genetics, School of Medicine, Technical University of Munich, Munich, Germany.
- Institute of Neurogenomics, Helmholtz Zentrum München, Munich, Germany.
- Munich Cluster for Systems Neurology, SyNergy, Munich, Germany.
- DZPG, Deutsches Zentrum für Psychische Gesundheit, Munich, Germany.
| |
Collapse
|
5
|
Guo F, Liu R, Pan Y, Collins C, Bean L, Ma Z, Mathur A, Da Silva C, Nallamilli B, Guruju N, Chen-Deutsch X, Yousaf R, Chin E, Balciuniene J, Hegde M. Evidence from 2100 index cases supports genome sequencing as a first-tier genetic test. Genet Med 2024; 26:100995. [PMID: 37838930 DOI: 10.1016/j.gim.2023.100995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 10/04/2023] [Accepted: 10/06/2023] [Indexed: 10/16/2023] Open
Abstract
PURPOSE Genome sequencing (GS) is one of the most comprehensive assays that interrogate single-nucleotide variants, copy number variants, mitochondrial variants, repeat expansions, and structural variants in a single assay. Despite the clear technical superiority, the full clinical utility of GS has yet to be determined. METHODS We systematically evaluated 2100 clinical GS index cases performed in our laboratory to explore the diagnostic yield of GS as first-tier and as follow-up testing. RESULTS The overall diagnostic yield was 28% (585/2100). The diagnostic yield for GS as the first-tier test was 26% (294/1146). Among cases with prior non-diagnostic genetic tests, GS provided a diagnosis for 27% (247/910) of cases, including 56 cases with prior exome sequencing (ES). Although re-analysis of previous ES might have resolved the diagnosis in 29 cases, diagnoses for 27 cases would have been missed because of the technical inferiority of ES. Moreover, GS further disclosed additional genetic etiology in 3 out of 44 cases with existing partial diagnosis. CONCLUSION We present the largest-to-date GS data set of a clinically heterogeneous cohort from a single clinical laboratory. Our data demonstrate that GS should be considered as the first-tier genetic test that has the potential to shorten the diagnostic odyssey.
Collapse
Affiliation(s)
- Fen Guo
- Revvity Omics, Pittsburgh, PA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
6
|
Oketch DJA, Giulietti M, Piva F. Copy Number Variations in Pancreatic Cancer: From Biological Significance to Clinical Utility. Int J Mol Sci 2023; 25:391. [PMID: 38203561 PMCID: PMC10779192 DOI: 10.3390/ijms25010391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2023] [Revised: 12/20/2023] [Accepted: 12/24/2023] [Indexed: 01/12/2024] Open
Abstract
Pancreatic ductal adenocarcinoma (PDAC) is the most common type of pancreatic cancer, characterized by high tumor heterogeneity and a poor prognosis. Inter- and intra-tumoral heterogeneity in PDAC is a major obstacle to effective PDAC treatment; therefore, it is highly desirable to explore the tumor heterogeneity and underlying mechanisms for the improvement of PDAC prognosis. Gene copy number variations (CNVs) are increasingly recognized as a common and heritable source of inter-individual variation in genomic sequence. In this review, we outline the origin, main characteristics, and pathological aspects of CNVs. We then describe the occurrence of CNVs in PDAC, including those that have been clearly shown to have a pathogenic role, and further highlight some key examples of their involvement in tumor development and progression. The ability to efficiently identify and analyze CNVs in tumor samples is important to support translational research and foster precision oncology, as copy number variants can be utilized to guide clinical decisions. We provide insights into understanding the CNV landscapes and the role of both somatic and germline CNVs in PDAC, which could lead to significant advances in diagnosis, prognosis, and treatment. Although there has been significant progress in this field, understanding the full contribution of CNVs to the genetic basis of PDAC will require further research, with more accurate CNV assays such as single-cell techniques and larger cohorts than have been performed to date.
Collapse
Affiliation(s)
| | - Matteo Giulietti
- Department of Specialistic Clinical and Odontostomatological Sciences, Polytechnic University of Marche, 60131 Ancona, Italy
| | - Francesco Piva
- Department of Specialistic Clinical and Odontostomatological Sciences, Polytechnic University of Marche, 60131 Ancona, Italy
| |
Collapse
|
7
|
Lee S, Kim J, Ohn JH. Exploring quantitative traits-associated copy number deletions through reanalysis of UK10K consortium whole genome sequencing cohorts. BMC Genomics 2023; 24:787. [PMID: 38110883 PMCID: PMC10729411 DOI: 10.1186/s12864-023-09903-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Accepted: 12/13/2023] [Indexed: 12/20/2023] Open
Abstract
OBJECTIVES We performed comprehensive association analyses of common high-confidence gnomAD-reported copy number deletions (CNDs) with 60 quantitative traits from UK10K consortium WGS data. METHODS The study made use of data generated by the UK10K Consortium. UK10K consortium WGS data consist of TwinsUK (n = 1754, middle-aged females) and ALSPAC (n = 1867, birth to adolescence) cohorts. UK10K consortium called 18,739 CNDs (hg19) with GenomeSTRiP software. After filtering out variants with minor allele frequency < 0.05 or HWE P < 1.0 × 10- 6, 1222 (TwinsUK) and 1211 (ALSPAC) CNDs remained for association analyses with 60 normalized quantitative traits. RESULTS We identified 23 genome-wide significant associations at 13 loci, among which 2 associations reached experiment-wide significance. We found that two common deletions in chromosome 4, located between WDR1 and ZNF518B (23.3 kb, dbVar ID:nssv15888957, 4:10211262-10,234,569 and 9.8 kb, dbVar ID:nssv15888975, 4:10392422-10,402,191), were associated with uric acid levels (P = 5.23 × 10- 11 and 2.29 × 10- 8, respectively). We also discovered a novel deletion spanning chromosome 18 (823 bp, dbVar ID: nssv15841628, 8:74347187-74,348,010) associated with low HDL cholesterol levels (P = 4.15 × 10- 7). Additionally, we observed two red blood cell traits-associated loci with genome-wide significance, a 13.2 kb deletion in 7q22.1 (nssv15922542) and a 3.7 kb deletion in 12q24.12 (nssv15813226), both of which were located in regions previously reported to be associated with red blood cell traits. Two deletions in 11q11 (nssv15803200 and nssv15802240), where clusters of multiple olfactory receptor genes exist, and a deletion (nssv15929560) upstream to DOCK5 were associated with childhood obesity. Finally, when defining Trait-Associated copy number Deletions (TADs) as CNDs with phenotype associations at sub-threshold significance (P < 10- 3), we identified 157 (97.5%) out of 161 TADs in non-coding regions, with a mean size of 4 kb (range: 209 - 47,942 bp). CONCLUSION We conducted a reanalysis of the UK10K Whole Genome Sequencing cohort, which led to the identification of multiple high confidence copy number deletions associated with quantitative traits. These deletions have standard dbVar IDs and replicate previous findings, as well as reveal novel loci that require further replication studies.
Collapse
Affiliation(s)
- Sejoon Lee
- Precision Medicine Center, Future Innovation Research Division, Seoul National University Bundang Hospital, 173-82, Gumi-ro, Bundang-gu, Seongnam, Gyeonggi-do, 13620, South Korea
- Department of Pathology, Seoul National University Bundang Hospital, 173-82, Gumi-ro, Bundang-gu, Seongnam, Gyeonggi-do, 13620, South Korea
| | - Jinho Kim
- Precision Medicine Center, Future Innovation Research Division, Seoul National University Bundang Hospital, 173-82, Gumi-ro, Bundang-gu, Seongnam, Gyeonggi-do, 13620, South Korea
- Department of Laboratory Medicine, Seoul National University Bundang Hospital, 173-82, Gumi-ro, Bundang-gu, Seongnam, Gyeonggi-do, 13620, South Korea
| | - Jung Hun Ohn
- Precision Medicine Center, Future Innovation Research Division, Seoul National University Bundang Hospital, 173-82, Gumi-ro, Bundang-gu, Seongnam, Gyeonggi-do, 13620, South Korea.
- Department of Internal Medicine, Seoul National University Bundang Hospital, 173-82, Gumi-ro, Bundang-gu, Seongnam, Gyeonggi-do, 13620, South Korea.
- Department of Internal Medicine, College of Medicine, Seoul National University, 103, Daehak-ro, Jongno-gu, Seoul, 03080, South Korea.
| |
Collapse
|
8
|
Meng X, Wang M, Luo M, Sun L, Yan Q, Liu Y. Systematic evaluation of multiple NGS platforms for structural variants detection. J Biol Chem 2023; 299:105436. [PMID: 37944616 PMCID: PMC10724692 DOI: 10.1016/j.jbc.2023.105436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 10/29/2023] [Accepted: 10/31/2023] [Indexed: 11/12/2023] Open
Abstract
Structural variations (SV) are critical genome changes affecting human diseases. Although many hybridization-based methods exist, evaluating SVs through next-generation sequencing (NGS) data is still necessary for broader research exploration. Here, we comprehensively compared the performance of 16 SV callers and multiple NGS platforms using NA12878 whole genome sequencing (WGS) datasets. The results indicated that several SV callers performed well relatively, such as Manta, GRIDSS, LUMPY, TARDIS, FermiKit, and Wham. Meanwhile, all NGS platforms have a similar performance using a single software. Additionally, we found that the source of undetected SVs was mostly from long reads datasets, therefore, the more appropriate strategy for accurate SV detection will be an integration of long and shorter reads in the future. At present, in the period of NGS as a mainstream method in bioinformatics, our study would provide helpful and comprehensive guidelines for specific categories of SV research.
Collapse
Affiliation(s)
- Xuan Meng
- School of Medicine, Southern University of Science and Technology, Shenzhen, China
| | - Miao Wang
- Research Cooperation Department, GeneMind Biosciences Company Limited, Shenzhen, China
| | - Mingjie Luo
- Research Cooperation Department, GeneMind Biosciences Company Limited, Shenzhen, China
| | - Lei Sun
- Research Cooperation Department, GeneMind Biosciences Company Limited, Shenzhen, China
| | - Qin Yan
- Research Cooperation Department, GeneMind Biosciences Company Limited, Shenzhen, China
| | - Yongfeng Liu
- Research Cooperation Department, GeneMind Biosciences Company Limited, Shenzhen, China.
| |
Collapse
|
9
|
Yang P, Wang G, Jiang S, Chen M, Zeng J, Pang Q, Du D, Zhou M. Comparative analysis of genome-wide copy number variations between Tibetan sheep and White Suffolk sheep. Anim Biotechnol 2023; 34:986-993. [PMID: 34865600 DOI: 10.1080/10495398.2021.2007937] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
The DNA copy number variations (CNVs) are widely involved in affecting various kinds of biological functions, such as environmental adaptation. Tibetan sheep and White Suffolk sheep are two representative indigenous and exotic breeds raised in Sichuan, China, and both of them have many contrasting biological characteristics. In this study, we employed high-throughput sequencing approach to investigate genome-wide CNVs between the two sheep breeds. A total of 11,135 CNV regions (CNVRs) consisting of 6,488 deletions and 4,647 duplications were detected, whose length ranged from 1,599 bp to 0.56 Mb with the mean of 4,658 bp. There were 281 CNVRs segregated between Tibetan sheep and White Suffolk sheep, and 18 of them have been fixed within both breeds. Functional analyses of candidate genes within the segregating CNVRs revealed the thyroid hormone signaling pathway and CTNNB1 gene that would be responsible for differential biological characteristics of breeds, such as energy metabolism, seasonal reproduction, and litter size. Furthermore, the segregating CNVRs identified in this study were overlapped with many known quantitative trait loci that are associated with growth, testis weight, and reproductive seasonality. In conclusion, these results help us better understanding differential biological characteristics between Tibetan sheep and White Suffolk sheep.
Collapse
Affiliation(s)
- Pinggui Yang
- Institute of Plateau Animals, Sichuan Academy of Grassland Sciences, Chengdu, China
| | - Gaofu Wang
- Chongqing Academy of Animal Sciences, Chongqing, China
| | - Shihai Jiang
- Institute of Plateau Animals, Sichuan Academy of Grassland Sciences, Chengdu, China
| | - Minghua Chen
- Institute of Plateau Animals, Sichuan Academy of Grassland Sciences, Chengdu, China
| | - Jie Zeng
- Institute of Plateau Animals, Sichuan Academy of Grassland Sciences, Chengdu, China
| | - Qian Pang
- Institute of Plateau Animals, Sichuan Academy of Grassland Sciences, Chengdu, China
| | - Dan Du
- Institute of Plateau Animals, Sichuan Academy of Grassland Sciences, Chengdu, China
| | - Mingliang Zhou
- Institute of Plateau Animals, Sichuan Academy of Grassland Sciences, Chengdu, China
| |
Collapse
|
10
|
Ahmad SF, Chandrababu Shailaja C, Vaishnav S, Kumar A, Gaur GK, Janga SC, Ahmad SM, Malla WA, Dutt T. Read-depth based approach on whole genome resequencing data reveals important insights into the copy number variation (CNV) map of major global buffalo breeds. BMC Genomics 2023; 24:616. [PMID: 37845620 PMCID: PMC10580622 DOI: 10.1186/s12864-023-09720-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 10/05/2023] [Indexed: 10/18/2023] Open
Abstract
BACKGROUND Elucidating genome-wide structural variants including copy number variations (CNVs) have gained increased significance in recent times owing to their contribution to genetic diversity and association with important pathophysiological states. The present study aimed to elucidate the high-resolution CNV map of six different global buffalo breeds using whole genome resequencing data at two coverages (10X and 30X). Post-quality control, the sequence reads were aligned to the latest draft release of the Bubaline genome. The genome-wide CNVs were elucidated using a read-depth approach in CNVnator with different bin sizes. Adjacent CNVs were concatenated into copy number variation regions (CNVRs) in different breeds and their genomic coverage was elucidated. RESULTS Overall, the average size of CNVR was lower at 30X coverage, providing finer details. Most of the CNVRs were either deletion or duplication type while the occurrence of mixed events was lesser in number on a comparative basis in all breeds. The average CNVR size was lower at 30X coverage (0.201 Mb) as compared to 10X (0.013 Mb) with the finest variants in Banni buffaloes. The maximum number of CNVs was observed in Murrah (2627) and Pandharpuri (25,688) at 10X and 30X coverages, respectively. Whereas the minimum number of CNVs were scored in Surti at both coverages (2092 and 17,373). On the other hand, the highest and lowest number of CNVRs were scored in Jaffarabadi (833 and 10,179 events) and Surti (783 and 7553 events) at both coverages. Deletion events overnumbered duplications in all breeds at both coverages. Gene profiling of common overlapped genes and longest CNVRs provided important insights into the evolutionary history of these breeds and indicate the genomic regions under selection in respective breeds. CONCLUSION The present study is the first of its kind to elucidate the high-resolution CNV map in major buffalo populations using a read-depth approach on whole genome resequencing data. The results revealed important insights into the divergence of major global buffalo breeds along the evolutionary timescale.
Collapse
Affiliation(s)
- Sheikh Firdous Ahmad
- Division of Animal Genetics, ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly, Uttar Pradesh, 243122, India.
| | - Celus Chandrababu Shailaja
- Division of Animal Genetics, ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly, Uttar Pradesh, 243122, India
| | - Sakshi Vaishnav
- Division of Animal Genetics, ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly, Uttar Pradesh, 243122, India
| | - Amit Kumar
- Division of Animal Genetics, ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly, Uttar Pradesh, 243122, India
| | - Gyanendra Kumar Gaur
- Division of Animal Genetics, ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly, Uttar Pradesh, 243122, India
| | - Sarath Chandra Janga
- Luddy School of Informatics, Computing & Engineering, Indiana University Indianapolis (IUI), Indianapolis, 46202, USA
| | - Syed Mudasir Ahmad
- Division of Animal Biotechnology, Faculty of Veterinary Sciences and AH, Sher-e-Kashmir University of Agricultural Sciences and Technology, Srinagar, Jammu & Kashmir, 190006, India.
| | - Waseem Akram Malla
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly, Uttar Pradesh, 243122, India
| | - Triveni Dutt
- Division of Animal Genetics, ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly, Uttar Pradesh, 243122, India
| |
Collapse
|
11
|
Thorburn DMJ, Sagonas K, Binzer-Panchal M, Chain FJJ, Feulner PGD, Bornberg-Bauer E, Reusch TBH, Samonte-Padilla IE, Milinski M, Lenz TL, Eizaguirre C. Origin matters: Using a local reference genome improves measures in population genomics. Mol Ecol Resour 2023; 23:1706-1723. [PMID: 37489282 DOI: 10.1111/1755-0998.13838] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 05/10/2023] [Accepted: 06/02/2023] [Indexed: 07/26/2023]
Abstract
Genome sequencing enables answering fundamental questions about the genetic basis of adaptation, population structure and epigenetic mechanisms. Yet, we usually need a suitable reference genome for mapping population-level resequencing data. In some model systems, multiple reference genomes are available, giving the challenging task of determining which reference genome best suits the data. Here, we compared the use of two different reference genomes for the three-spined stickleback (Gasterosteus aculeatus), one novel genome derived from a European gynogenetic individual and the published reference genome of a North American individual. Specifically, we investigated the impact of using a local reference versus one generated from a distinct lineage on several common population genomics analyses. Through mapping genome resequencing data of 60 sticklebacks from across Europe and North America, we demonstrate that genetic distance among samples and the reference genomes impacts downstream analyses. Using a local reference genome increased mapping efficiency and genotyping accuracy, effectively retaining more and better data. Despite comparable distributions of the metrics generated across the genome using SNP data (i.e. π, Tajima's D and FST ), window-based statistics using different references resulted in different outlier genes and enriched gene functions. A marker-based analysis of DNA methylation distributions had a comparably high overlap in outlier genes and functions, yet with distinct differences depending on the reference genome. Overall, our results highlight how using a local reference genome decreases reference bias to increase confidence in downstream analyses of the data. Such results have significant implications in all reference-genome-based population genomic analyses.
Collapse
Affiliation(s)
- Doko-Miles J Thorburn
- School of Biological and Chemical Sciences, Queen Mary University of London, London, UK
- Department of Life Sciences, Imperial College London, London, UK
| | - Kostas Sagonas
- School of Biological and Chemical Sciences, Queen Mary University of London, London, UK
- Department of Zoology, School of Biology, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Mahesh Binzer-Panchal
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, National Bioinformatics Infrastructure Sweden (NBIS), Uppsala University, Uppsala, Sweden
| | - Frederic J J Chain
- Department of Biological Sciences, University of Massachusetts Lowell, Lowell, Massachusetts, USA
| | - Philine G D Feulner
- Department of Fish Ecology and Evolution, Centre of Ecology, Evolution and Biogeochemistry, EAWAG Swiss Federal Institute of Aquatic Science and Technology, Kastanienbaum, Switzerland
- Division of Aquatic Ecology and Evolution, Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
| | - Erich Bornberg-Bauer
- Evolutionary Bioinformatics, Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Thorsten B H Reusch
- Marine Evolutionary Ecology, GEOMAR Helmholtz Centre for Ocean Research, Kiel, Germany
| | - Irene E Samonte-Padilla
- Department of Evolutionary Ecology, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Manfred Milinski
- Department of Evolutionary Ecology, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Tobias L Lenz
- Research Group for Evolutionary Immunogenomics, Max Planck Institute for Evolutionary Biology, Plön, Germany
- Research Unit for Evolutionary Immunogenomics, Department of Biology, University of Hamburg, Hamburg, Germany
| | - Christophe Eizaguirre
- School of Biological and Chemical Sciences, Queen Mary University of London, London, UK
| |
Collapse
|
12
|
Bonini KE, Thomas-Wilson A, Marathe PN, Sebastin M, Odgis JA, Biase MD, Kelly NR, Ramos MA, Insel BJ, Scarimbolo L, Rehman AU, Guha S, Okur V, Abhyankar A, Phadke S, Nava C, Gallagher KM, Elkhoury L, Edelmann L, Zinberg RE, Abul-Husn NS, Diaz GA, Greally JM, Suckiel SA, Horowitz CR, Kenny EE, Wasserstein M, Gelb BD, Jobanputra V. Identification of copy number variants with genome sequencing: Clinical experiences from the NYCKidSeq program. Clin Genet 2023; 104:210-225. [PMID: 37334874 PMCID: PMC10505482 DOI: 10.1111/cge.14365] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Revised: 04/28/2023] [Accepted: 05/15/2023] [Indexed: 06/21/2023]
Abstract
Copy number variations (CNVs) play a significant role in human disease. While chromosomal microarray has traditionally been the first-tier test for CNV detection, use of genome sequencing (GS) is increasing. We report the frequency of CNVs detected with GS in a diverse pediatric cohort from the NYCKidSeq program and highlight specific examples of its clinical impact. A total of 1052 children (0-21 years) with neurodevelopmental, cardiac, and/or immunodeficiency phenotypes received GS. Phenotype-driven analysis was used, resulting in 183 (17.4%) participants with a diagnostic result. CNVs accounted for 20.2% of participants with a diagnostic result (37/183) and ranged from 0.5 kb to 16 Mb. Of participants with a diagnostic result (n = 183) and phenotypes in more than one category, 5/17 (29.4%) were solved by a CNV finding, suggesting a high prevalence of diagnostic CNVs in participants with complex phenotypes. Thirteen participants with a diagnostic CNV (35.1%) had previously uninformative genetic testing, of which nine included a chromosomal microarray. This study demonstrates the benefits of GS for reliable detection of CNVs in a pediatric cohort with variable phenotypes.
Collapse
Affiliation(s)
- Katherine E. Bonini
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY
| | | | - Priya N. Marathe
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Monisha Sebastin
- Department of Pediatrics, Division of Pediatric Genetic Medicine, Children’s Hospital at Montefiore/Montefiore Medical Center/Albert Einstein College of Medicine, Bronx, NY
| | - Jacqueline A. Odgis
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Miranda Di Biase
- Department of Pediatrics, Division of Pediatric Genetic Medicine, Children’s Hospital at Montefiore/Montefiore Medical Center/Albert Einstein College of Medicine, Bronx, NY
| | - Nicole R. Kelly
- Department of Pediatrics, Division of Pediatric Genetic Medicine, Children’s Hospital at Montefiore/Montefiore Medical Center/Albert Einstein College of Medicine, Bronx, NY
| | - Michelle A. Ramos
- Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY
- Institute for Health Equity Research, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Beverly J. Insel
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Laura Scarimbolo
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY
| | | | - Saurav Guha
- Molecular Diagnostics, New York Genome Center, New York, NY
| | - Volkan Okur
- Molecular Diagnostics, New York Genome Center, New York, NY
| | | | - Shruti Phadke
- Molecular Diagnostics, New York Genome Center, New York, NY
| | - Caroline Nava
- Molecular Diagnostics, New York Genome Center, New York, NY
| | - Katie M. Gallagher
- Department of Pediatrics, Division of Pediatric Genetic Medicine, Children’s Hospital at Montefiore/Montefiore Medical Center/Albert Einstein College of Medicine, Bronx, NY
| | | | | | - Randi E. Zinberg
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY
- Department of Obstetrics, Gynecology and Reproductive Science, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Noura S. Abul-Husn
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY
- Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY
| | - George A. Diaz
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY
- Department of Pediatrics, Icahn School of Medicine at Mount Sinai, New York, NY
| | - John M. Greally
- Department of Pediatrics, Division of Pediatric Genetic Medicine, Children’s Hospital at Montefiore/Montefiore Medical Center/Albert Einstein College of Medicine, Bronx, NY
| | - Sabrina A. Suckiel
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY
- Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Carol R. Horowitz
- Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY
- Institute for Health Equity Research, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Eimear E. Kenny
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY
- Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Melissa Wasserstein
- Department of Pediatrics, Division of Pediatric Genetic Medicine, Children’s Hospital at Montefiore/Montefiore Medical Center/Albert Einstein College of Medicine, Bronx, NY
| | - Bruce D. Gelb
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY
- Department of Pediatrics, Icahn School of Medicine at Mount Sinai, New York, NY
- Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Vaidehi Jobanputra
- Molecular Diagnostics, New York Genome Center, New York, NY
- Department of Pathology and Cell Biology, Columbia University Medical Center, New York, NY
| |
Collapse
|
13
|
Kosugi S, Kamatani Y, Harada K, Tomizuka K, Momozawa Y, Morisaki T, Terao C. Detection of trait-associated structural variations using short-read sequencing. CELL GENOMICS 2023; 3:100328. [PMID: 37388916 PMCID: PMC10300613 DOI: 10.1016/j.xgen.2023.100328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Revised: 02/17/2023] [Accepted: 04/25/2023] [Indexed: 07/01/2023]
Abstract
Genomic structural variation (SV) affects genetic and phenotypic characteristics in diverse organisms, but the lack of reliable methods to detect SV has hindered genetic analysis. We developed a computational algorithm (MOPline) that includes missing call recovery combined with high-confidence SV call selection and genotyping using short-read whole-genome sequencing (WGS) data. Using 3,672 high-coverage WGS datasets, MOPline stably detected ∼16,000 SVs per individual, which is over ∼1.7-3.3-fold higher than previous large-scale projects while exhibiting a comparable level of statistical quality metrics. We imputed SVs from 181,622 Japanese individuals for 42 diseases and 60 quantitative traits. A genome-wide association study with the imputed SVs revealed 41 top-ranked or nearly top-ranked genome-wide significant SVs, including 8 exonic SVs with 5 novel associations and enriched mobile element insertions. This study demonstrates that short-read WGS data can be used to identify rare and common SVs associated with a variety of traits.
Collapse
Affiliation(s)
- Shunichi Kosugi
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
- Clinical Research Center, Shizuoka General Hospital, Shizuoka, Japan
| | - Yoichiro Kamatani
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5, Kashiwanoha, Kashiwa-shi, Chiba 277-8562, Japan
| | - Katsutoshi Harada
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | - Kohei Tomizuka
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | - Yukihide Momozawa
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa 230-0045, Japan
| | - Takayuki Morisaki
- Division of Molecular Pathology, Institute of Medical Science, The University of Tokyo, 4-6-1, Shirokane-dai, Minato-ku, Tokyo 108-8639, Japan
| | | | - Chikashi Terao
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
- Clinical Research Center, Shizuoka General Hospital, Shizuoka, Japan
- The Department of Applied Genetics, The School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka, Japan
| |
Collapse
|
14
|
Spealman P, De T, Chuong JN, Gresham D. Best Practices in Microbial Experimental Evolution: Using Reporters and Long-Read Sequencing to Identify Copy Number Variation in Experimental Evolution. J Mol Evol 2023; 91:356-368. [PMID: 37012421 PMCID: PMC10275804 DOI: 10.1007/s00239-023-10102-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 02/21/2023] [Indexed: 04/05/2023]
Abstract
Copy number variants (CNVs), comprising gene amplifications and deletions, are a pervasive class of heritable variation. CNVs play a key role in rapid adaptation in both natural, and experimental, evolution. However, despite the advent of new DNA sequencing technologies, detection and quantification of CNVs in heterogeneous populations has remained challenging. Here, we summarize recent advances in the use of CNV reporters that provide a facile means of quantifying de novo CNVs at a specific locus in the genome, and nanopore sequencing, for resolving the often complex structures of CNVs. We provide guidance for the engineering and analysis of CNV reporters and practical guidelines for single-cell analysis of CNVs using flow cytometry. We summarize recent advances in nanopore sequencing, discuss the utility of this technology, and provide guidance for the bioinformatic analysis of these data to define the molecular structure of CNVs. The combination of reporter systems for tracking and isolating CNV lineages and long-read DNA sequencing for characterizing CNV structures enables unprecedented resolution of the mechanisms by which CNVs are generated and their evolutionary dynamics.
Collapse
Affiliation(s)
- Pieter Spealman
- Department of Biology, New York University, New York, NY, 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA
| | - Titir De
- Department of Biology, New York University, New York, NY, 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA
| | - Julie N Chuong
- Department of Biology, New York University, New York, NY, 10003, USA
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA
| | - David Gresham
- Department of Biology, New York University, New York, NY, 10003, USA.
- Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA.
| |
Collapse
|
15
|
Ye B, Tang X, Liao S, Ding K. A comparison of algorithms for identifying copy number variants in family-based whole-exome sequencing data and its implications in inheritance pattern analysis. Gene 2023; 861:147237. [PMID: 36731620 DOI: 10.1016/j.gene.2023.147237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Revised: 12/27/2022] [Accepted: 01/26/2023] [Indexed: 01/31/2023]
Abstract
There remain challenges in accurately identifying constitutional or germline copy number variants (gCNVs) based on whole-exome sequencing data that have implications for genetic diagnosis for 'rare undiagnosed disease' in the clinical setting. Although multiple algorithms have been proposed, a systematic comparison of these algorithms for calling gCNVs and analyzing inherited pattern have yet to be fully conducted. Therefore, we empirically compared seven exome-based algorithms, including XHMM, CLAMMS, CODEX2, ExomeDepth, DECoN, CN.MOPS, and GATK gCNV, for calling gCNVs in 151 individuals from 44 pedigrees, together with the gold standard of genotyping-derived gCNVs in the same cohort for the performance assessment. These algorithms demonstrated varied powers in identifying gCNVs, although the distribution of gCNVs size was similar. The number of shared gCNVs across these algorithms was limited (e.g., only four gCNVs shared among seven algorithms); however, several algorithms showed varying degrees of consistency (e.g., 1,843 gCNVs shared between DECoN and ExomeDepth). CLAMMS and CODEX2 outperformed the remaining algorithms according to a relatively higher F-score (i.e., 0.145 and 0.152, respectively). In addition, these algorithms exhibited different Mendelian inconsistencies of gCNVs and significant challenges remained in inheritance pattern analysis. In conclusion, selecting good algorithms may have important implications in gCNVs-based inheritance pattern analysis for family-based studies.
Collapse
Affiliation(s)
- Bo Ye
- Department of Bioinformatics, School of Basic Medicine, Chongqing Medical University, Chongqing 400016, PR China
| | - Xia Tang
- State Key Laboratory of Genetic Engineering and Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, PR China
| | - Shixiu Liao
- Medical Genetic Institute of Henan Province, Henan Provincial People's Hospital, Henan Key Laboratory of Genetic Diseases and Functional Genomics, Henan Provincial People's Hospital of Henan University, People's Hospital of Zhengzhou University, Zhengzhou, Henan Province 450003, PR China.
| | - Keyue Ding
- Medical Genetic Institute of Henan Province, Henan Provincial People's Hospital, Henan Key Laboratory of Genetic Diseases and Functional Genomics, Henan Provincial People's Hospital of Henan University, People's Hospital of Zhengzhou University, Zhengzhou, Henan Province 450003, PR China; Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN 55905, United States.
| |
Collapse
|
16
|
Gudkov M, Thibaut L, Khushi M, Blue GM, Winlaw DS, Dunwoodie SL, Giannoulatou E. ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data. BMC Bioinformatics 2023; 24:49. [PMID: 36792982 PMCID: PMC9930243 DOI: 10.1186/s12859-023-05154-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Accepted: 01/19/2023] [Indexed: 02/17/2023] Open
Abstract
BACKGROUND A wide range of tools are available for the detection of copy number variants (CNVs) from whole-genome sequencing (WGS) data. However, none of them focus on clinically-relevant CNVs, such as those that are associated with known genetic syndromes. Such variants are often large in size, typically 1-5 Mb, but currently available CNV callers have been developed and benchmarked for the discovery of smaller variants. Thus, the ability of these programs to detect tens of real syndromic CNVs remains largely unknown. RESULTS Here we present ConanVarvar, a tool which implements a complete workflow for the targeted analysis of large germline CNVs from WGS data. ConanVarvar comes with an intuitive R Shiny graphical user interface and annotates identified variants with information about 56 associated syndromic conditions. We benchmarked ConanVarvar and four other programs on a dataset containing real and simulated syndromic CNVs larger than 1 Mb. In comparison to other tools, ConanVarvar reports 10-30 times less false-positive variants without compromising sensitivity and is quicker to run, especially on large batches of samples. CONCLUSIONS ConanVarvar is a useful instrument for primary analysis in disease sequencing studies, where large CNVs could be the cause of disease.
Collapse
Affiliation(s)
- Mikhail Gudkov
- grid.1057.30000 0000 9472 3971Victor Chang Cardiac Research Institute, Sydney, NSW 2010 Australia ,grid.1013.30000 0004 1936 834XSchool of Biomedical Engineering, The University of Sydney, Sydney, NSW 2006 Australia ,grid.1005.40000 0004 4902 0432St Vincent’s Clinical Campus, School of Clinical Medicine, Faculty of Medicine and Health, UNSW Sydney, Sydney, NSW 2010 Australia
| | - Loïc Thibaut
- grid.1057.30000 0000 9472 3971Victor Chang Cardiac Research Institute, Sydney, NSW 2010 Australia ,grid.1005.40000 0004 4902 0432School of Mathematics and Statistics, UNSW Sydney, Sydney, NSW 2052 Australia
| | - Matloob Khushi
- grid.1013.30000 0004 1936 834XSchool of Computer Science, The University of Sydney, Sydney, NSW 2006 Australia
| | - Gillian M. Blue
- grid.1013.30000 0004 1936 834XSydney Medical School, The University of Sydney, Sydney, NSW 2006 Australia ,grid.413973.b0000 0000 9690 854XHeart Centre for Children, The Children’s Hospital at Westmead, Sydney, NSW 2145 Australia
| | - David S. Winlaw
- grid.1013.30000 0004 1936 834XSydney Medical School, The University of Sydney, Sydney, NSW 2006 Australia ,grid.413973.b0000 0000 9690 854XHeart Centre for Children, The Children’s Hospital at Westmead, Sydney, NSW 2145 Australia
| | - Sally L. Dunwoodie
- grid.1057.30000 0000 9472 3971Victor Chang Cardiac Research Institute, Sydney, NSW 2010 Australia ,grid.1005.40000 0004 4902 0432St Vincent’s Clinical Campus, School of Clinical Medicine, Faculty of Medicine and Health, UNSW Sydney, Sydney, NSW 2010 Australia ,grid.1005.40000 0004 4902 0432School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, NSW 2052 Australia
| | - Eleni Giannoulatou
- Victor Chang Cardiac Research Institute, Sydney, NSW, 2010, Australia. .,St Vincent's Clinical Campus, School of Clinical Medicine, Faculty of Medicine and Health, UNSW Sydney, Sydney, NSW, 2010, Australia.
| |
Collapse
|
17
|
Braga LG, Chud TCS, Watanabe RN, Savegnago RP, Sena TM, do Carmo AS, Machado MA, Panetto JCDC, da Silva MVGB, Munari DP. Identification of copy number variations in the genome of Dairy Gir cattle. PLoS One 2023; 18:e0284085. [PMID: 37036840 PMCID: PMC10085049 DOI: 10.1371/journal.pone.0284085] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Accepted: 03/23/2023] [Indexed: 04/11/2023] Open
Abstract
Studying structural variants that can control complex traits is relevant for dairy cattle production, especially for animals that are tolerant to breeding conditions in the tropics, such as the Dairy Gir cattle. This study identified and characterized high confidence copy number variation regions (CNVR) in the Gir breed genome. A total of 38 animals were whole-genome sequenced, and 566 individuals were genotyped with a high-density SNP panel, among which 36 animals had both sequencing and SNP genotyping data available. Two sets of high confidence CNVR were established: one based on common CNV identified in the studied population (CNVR_POP), and another with CNV identified in sires with both sequence and SNP genotyping data available (CNVR_ANI). We found 10 CNVR_POP and 45 CNVR_ANI, which covered 1.05 Mb and 4.4 Mb of the bovine genome, respectively. Merging these CNV sets for functional analysis resulted in 48 unique high confidence CNVR. The overlapping genes were previously related to embryonic mortality, environmental adaptation, evolutionary process, immune response, longevity, mammary gland, resistance to gastrointestinal parasites, and stimuli recognition, among others. Our results contribute to a better understanding of the Gir breed genome. Moreover, the CNV identified in this study can potentially affect genes related to complex traits, such as production, health, and reproduction.
Collapse
Affiliation(s)
- Larissa G Braga
- Departamento de Engenharia e Ciências Exatas, Universidade Estadual Paulista, Jaboticabal, São Paulo, Brazil
| | - Tatiane C S Chud
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, Ontario, Canada
| | - Rafael N Watanabe
- Departamento de Engenharia e Ciências Exatas, Universidade Estadual Paulista, Jaboticabal, São Paulo, Brazil
| | - Rodrigo P Savegnago
- Department of Animal Science, Michigan State University, East Lansing, Michigan, United States of America
| | - Thomaz M Sena
- Departamento de Engenharia e Ciências Exatas, Universidade Estadual Paulista, Jaboticabal, São Paulo, Brazil
| | - Adriana S do Carmo
- Departamento de Zootecnia, Universidade Federal de Goiás, Goiânia, Goiás, Brazil
| | | | | | | | - Danísio P Munari
- Departamento de Engenharia e Ciências Exatas, Universidade Estadual Paulista, Jaboticabal, São Paulo, Brazil
| |
Collapse
|
18
|
Cone Sullivan JK, Gleadall N, Lane WJ. Blood Group Genotyping. Clin Lab Med 2022; 42:645-668. [PMID: 36368788 DOI: 10.1016/j.cll.2022.09.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Jensyn K Cone Sullivan
- Department of Pathology, The Neely Cell Therapy Center, Tufts Medical Center, 800 Washington Street, #826, Boston, MA 02111, USA; Tufts University School of Medicine, Boston, MA, USA
| | - Nicholas Gleadall
- Department of Haematology, University of Cambridge, University of Cambridge Biomedical Campus, Long Road, Cambridge, CB2 0PT, UK
| | - William J Lane
- Department of Pathology, Brigham and Women's Hospital, Hale Building for Transformative Medicine, Room 8002L, 60 Fenwood Road, Boston, MA 02115, USA; Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
19
|
Testard Q, Vanhoye X, Yauy K, Naud ME, Vieville G, Rousseau F, Dauriat B, Marquet V, Bourthoumieu S, Geneviève D, Gatinois V, Wells C, Willems M, Coubes C, Pinson L, Dard R, Tessier A, Hervé B, Vialard F, Harzallah I, Touraine R, Cogné B, Deb W, Besnard T, Pichon O, Laudier B, Mesnard L, Doreille A, Busa T, Missirian C, Satre V, Coutton C, Celse T, Harbuz R, Raymond L, Taly JF, Thevenon J. Exome sequencing as a first-tier test for copy number variant detection: retrospective evaluation and prospective screening in 2418 cases. J Med Genet 2022; 59:1234-1240. [PMID: 36137615 DOI: 10.1136/jmg-2022-108439] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 08/10/2022] [Indexed: 01/12/2023]
Abstract
BACKGROUND Despite the availability of whole exome (WES) and genome sequencing (WGS), chromosomal microarray (CMA) remains the first-line diagnostic test in most rare disorders diagnostic workup, looking for copy number variations (CNVs), with a diagnostic yield of 10%-20%. The question of the equivalence of CMA and WES in CNV calling is an organisational and economic question, especially when ordering a WGS after a negative CMA and/or WES. METHODS This study measures the equivalence between CMA and GATK4 exome sequencing depth of coverage method in detecting coding CNVs on a retrospective cohort of 615 unrelated individuals. A prospective detection of WES-CNV on a cohort of 2418 unrelated individuals, including the 615 individuals from the validation cohort, was performed. RESULTS On the retrospective validation cohort, every CNV detectable by the method (ie, a CNV with at least one exon not in a dark zone) was accurately called (64/64 events). In the prospective cohort, 32 diagnoses were performed among the 2418 individuals with CNVs ranging from 704 bp to aneuploidy. An incidental finding was reported. The overall increase in diagnostic yield was of 1.7%, varying from 1.2% in individuals with multiple congenital anomalies to 1.9% in individuals with chronic kidney failure. CONCLUSION Combining single-nucleotide variant (SNV) and CNV detection increases the suitability of exome sequencing as a first-tier diagnostic test for suspected rare Mendelian disorders. Before considering the prescription of a WGS after a negative WES, a careful reanalysis with updated CNV calling and SNV annotation should be considered.
Collapse
Affiliation(s)
- Quentin Testard
- Service de Génétique, Eurofins Biomnis, Lyon, France.,Service de Génétique et Procréation, CHU Grenoble Alpes, Grenoble, France.,CNRS UMR 5309, INSERM, U1209, Université Grenoble Alpes, Institute for Advanced Bioscience, Grenoble, France
| | | | - Kevin Yauy
- CNRS UMR 5309, INSERM, U1209, Université Grenoble Alpes, Institute for Advanced Bioscience, Grenoble, France.,SeqOne Genomics, Montpellier, France
| | | | - Gaelle Vieville
- Service de Génétique et Procréation, CHU Grenoble Alpes, Grenoble, France
| | | | - Benjamin Dauriat
- Service de Cytogénétique, Génétique Médicale et Biologie de la Reproduction, CHU Limoges, Limoges, France
| | - Valentine Marquet
- Service de Cytogénétique, Génétique Médicale et Biologie de la Reproduction, CHU Limoges, Limoges, France
| | - Sylvie Bourthoumieu
- Service de Cytogénétique, Génétique Médicale et Biologie de la Reproduction, CHU Limoges, Limoges, France
| | - David Geneviève
- Département de Génétique Médicale, Maladies Rares et Médecine Personnalisée, CHU Montpellier, Montpellier, France.,Unité INSERM U1183, University Montpellier 1, Montpellier, France
| | - Vincent Gatinois
- Département de Génétique Médicale, Maladies Rares et Médecine Personnalisée, CHU Montpellier, Montpellier, France
| | - Constance Wells
- Département de Génétique Médicale, Maladies Rares et Médecine Personnalisée, CHU Montpellier, Montpellier, France
| | - Marjolaine Willems
- Département de Génétique Médicale, Maladies Rares et Médecine Personnalisée, CHU Montpellier, Montpellier, France
| | - Christine Coubes
- Département de Génétique Médicale, Maladies Rares et Médecine Personnalisée, CHU Montpellier, Montpellier, France
| | - Lucile Pinson
- Département de Génétique Médicale, Maladies Rares et Médecine Personnalisée, CHU Montpellier, Montpellier, France
| | - Rodolphe Dard
- Département de Génétique, CHI Poissy-Saint-Germain-en-Laye, Saint-Germain-en-Laye, France
| | - Aude Tessier
- Département de Génétique, CHI Poissy-Saint-Germain-en-Laye, Saint-Germain-en-Laye, France
| | - Bérénice Hervé
- Département de Génétique, CHI Poissy-Saint-Germain-en-Laye, Saint-Germain-en-Laye, France
| | - François Vialard
- Département de Génétique, CHI Poissy-Saint-Germain-en-Laye, Saint-Germain-en-Laye, France
| | - Ines Harzallah
- Service de génétique clinique, chromosomique et moléculaire, CHU Saint-Étienne, Saint-Etienne, France
| | - Renaud Touraine
- Service de génétique clinique, chromosomique et moléculaire, CHU Saint-Étienne, Saint-Etienne, France
| | - Benjamin Cogné
- Service de Génétique Médicale, CHU Nantes, Nantes, France
| | - Wallid Deb
- Service de Génétique Médicale, CHU Nantes, Nantes, France
| | - Thomas Besnard
- Service de Génétique Médicale, CHU Nantes, Nantes, France
| | - Olivier Pichon
- Service de Génétique Médicale, CHU Nantes, Nantes, France
| | - Béatrice Laudier
- Laboratoire d'Immunologie et Neurogénétique Expérimentales et Moléculaires INEM UMR7355, CHR d'Orléans, Orléans, France
| | - Laurent Mesnard
- Sorbonne Université, Urgences Néphrologiques et Transplantation Rénale, APHP, Hôpital Tenon, Paris, France
| | - Alice Doreille
- Sorbonne Université, Urgences Néphrologiques et Transplantation Rénale, APHP, Hôpital Tenon, Paris, France
| | - Tiffany Busa
- Département de génétique médicale, AP HM, Hôpital de la Timone Enfant, Marseille, France
| | - Chantal Missirian
- Département de génétique médicale, AP HM, Hôpital de la Timone Enfant, Marseille, France
| | - Véronique Satre
- Service de Génétique et Procréation, CHU Grenoble Alpes, Grenoble, France.,CNRS UMR 5309, INSERM, U1209, Université Grenoble Alpes, Institute for Advanced Bioscience, Grenoble, France
| | - Charles Coutton
- Service de Génétique et Procréation, CHU Grenoble Alpes, Grenoble, France.,CNRS UMR 5309, INSERM, U1209, Université Grenoble Alpes, Institute for Advanced Bioscience, Grenoble, France
| | - Tristan Celse
- Service de Génétique et Procréation, CHU Grenoble Alpes, Grenoble, France
| | - Radu Harbuz
- Service de Génétique et Procréation, CHU Grenoble Alpes, Grenoble, France
| | - Laure Raymond
- Service de Génétique, Eurofins Biomnis, Lyon, France
| | | | - Julien Thevenon
- Service de Génétique et Procréation, CHU Grenoble Alpes, Grenoble, France .,CNRS UMR 5309, INSERM, U1209, Université Grenoble Alpes, Institute for Advanced Bioscience, Grenoble, France
| |
Collapse
|
20
|
Samlali K, Thornbury M, Venter A. Community-led risk analysis of direct-to-consumer whole-genome sequencing. Biochem Cell Biol 2022; 100:499-509. [PMID: 35939839 DOI: 10.1139/bcb-2021-0506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Direct-to-consumer (DTC) genetic testing is cheaper and more accessible than ever before; however, the intention to combine, reuse, and resell this genetic information as powerful data sets is generally hidden from the consumer. This financial gain is creating a competitive DTC market, reducing the price of whole-genome sequencing (WGS) to under 300 USD. Entering this transition from single-nucleotide polymorphism-based DTC testing to WGS DTC testing, individuals looking for access to their whole-genomic information face new privacy and security risks. Differences between WGS and other methods of consumer genetic tests are left unexplored by regulation, leading to the application of legal data anonymization methods on whole-genome data, and questionable consent methods. Large representative genomic data sets are important for research and improve the standard of medicine and personalized care. However, these data can also be used by market players, law enforcement, and governments for surveillance, population analyses, marketing purposes, and discrimination. Here, we present a summary of the state of WGS DTC genetic testing and its current regulation, through a community-based lens to expose dual-use risks in consumer-facing biotechnologies.
Collapse
Affiliation(s)
- Kenza Samlali
- BricoBio Community Biology Lab, Montréal, QC, Canada.,Centre for Applied Synthetic Biology, Concordia University, Montréal, QC, Canada.,Department of Electrical and Computer Engineering, Concordia University, Montréal, QC, Canada
| | - Mackenzie Thornbury
- BricoBio Community Biology Lab, Montréal, QC, Canada.,Centre for Applied Synthetic Biology, Concordia University, Montréal, QC, Canada.,Department of Biology, Concordia University, Montréal, QC, Canada
| | - Andrei Venter
- BricoBio Community Biology Lab, Montréal, QC, Canada
| |
Collapse
|
21
|
van der Laan L, Rooney K, Trooster TM, Mannens MM, Sadikovic B, Henneman P. DNA methylation episignatures: insight into copy number variation. Epigenomics 2022; 14:1373-1388. [PMID: 36537268 DOI: 10.2217/epi-2022-0287] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
In this review we discuss epigenetic disorders that result from aberrations in genes linked to epigenetic regulation. We describe current testing methods for the detection of copy number variants (CNVs) in Mendelian disorders, dosage sensitivity, reciprocal phenotypes and the challenges of test selection and overlapping clinical features in genetic diagnosis. We discuss aberrations of DNA methylation and propose a role for episignatures as a novel clinical testing method in CNV disorders. Finally, we postulate that episignature mapping in CNV disorders may provide novel insights into the molecular mechanisms of disease and unlock key findings of the genome-wide impact on disease gene networks.
Collapse
Affiliation(s)
- Liselot van der Laan
- Department of Human Genetics, Amsterdam Reproduction & Development Research Institute, Amsterdam University Medical Centers, Amsterdam, 1105 AZ, The Netherlands
| | - Kathleen Rooney
- Department of Pathology & Laboratory Medicine, Western University, London, Ontario, N5A 3K7, Canada.,Verspeeten Clinical Genome Centre, London Health Science Centre, London, Ontario, N6A 5W9, Canada
| | - Tessa Ma Trooster
- Department of Human Genetics, Amsterdam Reproduction & Development Research Institute, Amsterdam University Medical Centers, Amsterdam, 1105 AZ, The Netherlands
| | - Marcel Mam Mannens
- Department of Human Genetics, Amsterdam Reproduction & Development Research Institute, Amsterdam University Medical Centers, Amsterdam, 1105 AZ, The Netherlands
| | - Bekim Sadikovic
- Department of Pathology & Laboratory Medicine, Western University, London, Ontario, N5A 3K7, Canada.,Verspeeten Clinical Genome Centre, London Health Science Centre, London, Ontario, N6A 5W9, Canada
| | - Peter Henneman
- Department of Human Genetics, Amsterdam Reproduction & Development Research Institute, Amsterdam University Medical Centers, Amsterdam, 1105 AZ, The Netherlands
| |
Collapse
|
22
|
Lin YC, Chang YH, Chiu FPC, Akiyama M, Hsu CK. Application of nanopore sequencing in identifying null mutations and intragenic copy number variations (CNVs) in FLG. J Dermatol Sci 2022; 108:48-50. [DOI: 10.1016/j.jdermsci.2022.09.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 08/24/2022] [Accepted: 09/11/2022] [Indexed: 12/13/2022]
|
23
|
Mouka A, Arkoun B, Moison P, Drévillon L, Jarray R, Brisset S, Mayeur A, Bouligand J, Boland-Auge A, Deleuze JF, Yates F, Lemonnier T, Callier P, Duffourd Y, Nitschke P, Ollivier E, Bourdin A, De Vos J, Livera G, Tachdjian G, Maouche-Chrétien L, Tosca L. iPSCs derived from infertile men carrying complex genetic abnormalities can generate primordial germ-like cells. Sci Rep 2022; 12:14302. [PMID: 35995809 PMCID: PMC9395518 DOI: 10.1038/s41598-022-17337-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 07/25/2022] [Indexed: 11/29/2022] Open
Abstract
Despite increasing insight into the genetics of infertility, the developmental disease processes remain unclear due to the lack of adequate experimental models. The advent of induced pluripotent stem cell (iPSC) technology has provided a unique tool for in vitro disease modeling enabling major advances in our understanding of developmental disease processes. We report the full characterization of complex genetic abnormalities in two infertile patients with either azoospermia or XX male syndrome and we identify genes of potential interest implicated in their infertility. Using the erythroblasts of both patients, we generated primed iPSCs and converted them into a naive-like pluripotent state. Naive-iPSCs were then differentiated into primordial germ-like cells (PGC-LCs). The expression of early PGC marker genes SOX17, CD-38, NANOS3, c-KIT, TFAP2C, and D2-40, confirmed progression towards the early germline stage. Our results demonstrate that iPSCs from two infertile patients with significant genetic abnormalities are capable of efficient production of PGCs. Such in vitro model of infertility will certainly help identifying causative factors leading to early germ cells development failure and provide a valuable tool to explore novel therapeutic strategies.
Collapse
Affiliation(s)
- Aurélie Mouka
- AP-HP, Université Paris-Saclay-Hôpital Antoine Béclère, Service d'Histologie, Embryologie et Cytogénétique, 92140, Clamart, France.,Faculté de Médecine, Université Paris-Saclay, 94270, Le Kremlin-Bicêtre, France
| | - Brahim Arkoun
- Inserm U1287, Laboratoire Cellules Souches Hématopoïétiques et Hémopathies Myeloïdes, Université Paris-Saclay, Gustave Roussy Cancer Campus, 94800, Villejuif, France.,Laboratoire de Développement des Gonades, UMRE008 Stabilité Génétique Cellules Souches et Radiations, Commissariat à l'Energie Atomique et Aux Énergies Alternatives, Institut de Biologie François Jacob, 92265, Fontenay-aux-Roses, France.,Université de Paris, Paris, France.,Université Paris-Saclay, 91400, Orsay, France
| | - Pauline Moison
- Laboratoire de Développement des Gonades, UMRE008 Stabilité Génétique Cellules Souches et Radiations, Commissariat à l'Energie Atomique et Aux Énergies Alternatives, Institut de Biologie François Jacob, 92265, Fontenay-aux-Roses, France.,Université de Paris, Paris, France.,Université Paris-Saclay, 91400, Orsay, France
| | - Loïc Drévillon
- AP-HP Sorbonne Université-La Pitié Salpêtrière, SiRIC Curamus, 75013, Paris, France
| | - Rafika Jarray
- Sup'Biotech/ Laboratoire CEA-IBFJ-SEPIA, 92265, Fontenay-aux-Roses, France
| | - Sophie Brisset
- AP-HP, Université Paris-Saclay-Hôpital Antoine Béclère, Service d'Histologie, Embryologie et Cytogénétique, 92140, Clamart, France.,Faculté de Médecine, Université Paris-Saclay, 94270, Le Kremlin-Bicêtre, France
| | - Anne Mayeur
- AP-HP, Université Paris-Saclay - Hôpital Antoine Béclère, Biologie de la Reproduction, 92140, Clamart, France
| | - Jérôme Bouligand
- INSERM UMR_S U1185, Faculté de Médecine Paris-Saclay, Université Paris-Saclay, Le Kremlin Bicêtre, France.,Service de Génétique Moléculaire, Pharmacogénétique et Hormonologie, Hôpitaux Universitaires Paris Sud, AH-HP, CHU Bicêtre, Paris, France
| | - Anne Boland-Auge
- Centre National de Recherche en Génomique Humaine, Université Paris-Saclay, CEA, 91057, Evry, France
| | - Jean-François Deleuze
- Centre National de Recherche en Génomique Humaine, Université Paris-Saclay, CEA, 91057, Evry, France
| | - Frank Yates
- Sup'Biotech/ Laboratoire CEA-IBFJ-SEPIA, 92265, Fontenay-aux-Roses, France
| | - Thomas Lemonnier
- Sup'Biotech/ Laboratoire CEA-IBFJ-SEPIA, 92265, Fontenay-aux-Roses, France
| | - Patrick Callier
- Département de Génétique Humaine, Hôpital Universitaire de Dijon, Dijon, France
| | - Yannis Duffourd
- Inserm UMR 1231 GAD, Faculté des Sciences de la Santé, Université de Bourgogne et de Franche-Comté, Dijon, France
| | - Patrick Nitschke
- Plateforme Bio-Informatique, IMAGINE Institute, Université Paris Descartes, Paris, France
| | - Emmanuelle Ollivier
- Plateforme Bio-Informatique, IMAGINE Institute, Université Paris Descartes, Paris, France
| | - Arnaud Bourdin
- PhyMedExp, Université Montpellier, INSERM, CHU Montpellier, Montpellier, France
| | - John De Vos
- IRMB, Université Montpellier, INSERM, CHU Montpellier, Montpellier, France
| | - Gabriel Livera
- Laboratoire de Développement des Gonades, UMRE008 Stabilité Génétique Cellules Souches et Radiations, Commissariat à l'Energie Atomique et Aux Énergies Alternatives, Institut de Biologie François Jacob, 92265, Fontenay-aux-Roses, France.,Université de Paris, Paris, France.,Université Paris-Saclay, 91400, Orsay, France
| | - Gérard Tachdjian
- AP-HP, Université Paris-Saclay-Hôpital Antoine Béclère, Service d'Histologie, Embryologie et Cytogénétique, 92140, Clamart, France.,Faculté de Médecine, Université Paris-Saclay, 94270, Le Kremlin-Bicêtre, France.,Laboratoire de Développement des Gonades, UMRE008 Stabilité Génétique Cellules Souches et Radiations, Commissariat à l'Energie Atomique et Aux Énergies Alternatives, Institut de Biologie François Jacob, 92265, Fontenay-aux-Roses, France
| | - Leïla Maouche-Chrétien
- Laboratoire des Mécanismes Moléculaires et Cellulaires des Maladies Hématologiques et leurs Implications Thérapeutiques; INSERM U 1163, Institut IMAGINE, Paris, France. .,Division des Thérapies Innovantes, CEA, Institut de Biologie François Jacob, 92260, Fontenay-aux-Roses, France.
| | - Lucie Tosca
- AP-HP, Université Paris-Saclay-Hôpital Antoine Béclère, Service d'Histologie, Embryologie et Cytogénétique, 92140, Clamart, France.,Faculté de Médecine, Université Paris-Saclay, 94270, Le Kremlin-Bicêtre, France.,Laboratoire de Développement des Gonades, UMRE008 Stabilité Génétique Cellules Souches et Radiations, Commissariat à l'Energie Atomique et Aux Énergies Alternatives, Institut de Biologie François Jacob, 92265, Fontenay-aux-Roses, France
| |
Collapse
|
24
|
Lambrescu I, Popa A, Manole E, Ceafalan LC, Gaina G. Application of Droplet Digital PCR Technology in Muscular Dystrophies Research. Int J Mol Sci 2022; 23:ijms23094802. [PMID: 35563191 PMCID: PMC9099497 DOI: 10.3390/ijms23094802] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Revised: 04/19/2022] [Accepted: 04/25/2022] [Indexed: 11/25/2022] Open
Abstract
Although they are considered rare disorders, muscular dystrophies have a strong impact on people’s health. Increased disease severity with age, frequently accompanied by the loss of ability to walk in some people, and the lack of treatment, have directed the researchers towards the development of more effective therapeutic strategies aimed to improve the quality of life and life expectancy, slow down the progression, and delay the onset or convert a severe phenotype into a milder one. Improved understanding of the complex pathology of these diseases together with the tremendous advances in molecular biology technologies has led to personalized therapeutic procedures. Different approaches that are currently under extensive investigation require more efficient, sensitive, and less invasive methods. Due to its remarkable analytical sensitivity, droplet digital PCR has become a promising tool for accurate measurement of biomarkers that monitor disease progression and quantification of various therapeutic efficiency and can be considered a tool for non-invasive prenatal diagnosis and newborn screening. Here, we summarize the recent applications of droplet digital PCR in muscular dystrophy research and discuss the factors that should be considered to get the best performance with this technology.
Collapse
Affiliation(s)
- Ioana Lambrescu
- Laboratory of Cell Biology, Neuroscience and Experimental Myology, Victor Babes National Institute of Pathology, 050096 Bucharest, Romania; (I.L.); (A.P.); (E.M.); (L.C.C.)
- Department of Cell Biology and Histology, Carol Davila University of Medicine and Pharmacy, 050474 Bucharest, Romania
| | - Alexandra Popa
- Laboratory of Cell Biology, Neuroscience and Experimental Myology, Victor Babes National Institute of Pathology, 050096 Bucharest, Romania; (I.L.); (A.P.); (E.M.); (L.C.C.)
- Department of Animal Production and Public Health, University of Agronomic Sciences and Veterinary Medicine of Bucharest, 050097 Bucharest, Romania
| | - Emilia Manole
- Laboratory of Cell Biology, Neuroscience and Experimental Myology, Victor Babes National Institute of Pathology, 050096 Bucharest, Romania; (I.L.); (A.P.); (E.M.); (L.C.C.)
- Pathology Department, Colentina Clinical Hospital, 020125 Bucharest, Romania
| | - Laura Cristina Ceafalan
- Laboratory of Cell Biology, Neuroscience and Experimental Myology, Victor Babes National Institute of Pathology, 050096 Bucharest, Romania; (I.L.); (A.P.); (E.M.); (L.C.C.)
- Department of Cell Biology and Histology, Carol Davila University of Medicine and Pharmacy, 050474 Bucharest, Romania
| | - Gisela Gaina
- Laboratory of Cell Biology, Neuroscience and Experimental Myology, Victor Babes National Institute of Pathology, 050096 Bucharest, Romania; (I.L.); (A.P.); (E.M.); (L.C.C.)
- Correspondence: ; Tel.: +40-21-319-2732
| |
Collapse
|
25
|
Palma-Vera SE, Reyer H, Langhammer M, Reinsch N, Derezanin L, Fickel J, Qanbari S, Weitzel JM, Franzenburg S, Hemmrich-Stanisak G, Schoen J. Genomic characterization of the world's longest selection experiment in mouse reveals the complexity of polygenic traits. BMC Biol 2022; 20:52. [PMID: 35189878 PMCID: PMC8862358 DOI: 10.1186/s12915-022-01248-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Accepted: 02/07/2022] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Long-term selection experiments are a powerful tool to understand the genetic background of complex traits. The longest of such experiments has been conducted in the Research Institute for Farm Animal Biology (FBN), generating extreme mouse lines with increased fertility, body mass, protein mass and endurance. For >140 generations, these lines have been maintained alongside an unselected control line, representing a valuable resource for understanding the genetic basis of polygenic traits. However, their history and genomes have not been reported in a comprehensive manner yet. Therefore, the aim of this study is to provide a summary of the breeding history and phenotypic traits of these lines along with their genomic characteristics. We further attempt to decipher the effects of the observed line-specific patterns of genetic variation on each of the selected traits. RESULTS Over the course of >140 generations, selection on the control line has given rise to two extremely fertile lines (>20 pups per litter each), two giant growth lines (one lean, one obese) and one long-distance running line. Whole genome sequencing analysis on 25 animals per line revealed line-specific patterns of genetic variation among lines, as well as high levels of homozygosity within lines. This high degree of distinctiveness results from the combined effects of long-term continuous selection, genetic drift, population bottleneck and isolation. Detection of line-specific patterns of genetic differentiation and structural variation revealed multiple candidate genes behind the improvement of the selected traits. CONCLUSIONS The genomes of the Dummerstorf trait-selected mouse lines display distinct patterns of genomic variation harbouring multiple trait-relevant genes. Low levels of within-line genetic diversity indicate that many of the beneficial alleles have arrived to fixation alongside with neutral alleles. This study represents the first step in deciphering the influence of selection and neutral evolutionary forces on the genomes of these extreme mouse lines and depicts the genetic complexity underlying polygenic traits.
Collapse
Affiliation(s)
- Sergio E Palma-Vera
- Institute of Reproductive Biology, Research Institute for Farm Animal Biology (FBN), Dummerstorf, Germany.
| | - Henry Reyer
- Institute of Genome Biology, Research Institute for Farm Animal Biology (FBN), Dummerstorf, Germany
| | - Martina Langhammer
- Institute of Genetics and Biometry, Research Institute for Farm Animal Biology (FBN), Dummerstorf, Germany
| | - Norbert Reinsch
- Institute of Genetics and Biometry, Research Institute for Farm Animal Biology (FBN), Dummerstorf, Germany
| | - Lorena Derezanin
- Institute of Reproductive Biology, Research Institute for Farm Animal Biology (FBN), Dummerstorf, Germany
- Department of Evolutionary Genetics, Research Institute for Zoo and Wildlife Research (IZW), Berlin, Germany
| | - Joerns Fickel
- Department of Evolutionary Genetics, Research Institute for Zoo and Wildlife Research (IZW), Berlin, Germany
- University of Potsdam, Institute for Biochemistry and Biology, Potsdam, Germany
| | - Saber Qanbari
- Institute of Genetics and Biometry, Research Institute for Farm Animal Biology (FBN), Dummerstorf, Germany
| | - Joachim M Weitzel
- Institute of Reproductive Biology, Research Institute for Farm Animal Biology (FBN), Dummerstorf, Germany
| | | | | | - Jennifer Schoen
- Institute of Reproductive Biology, Research Institute for Farm Animal Biology (FBN), Dummerstorf, Germany
- Department of Reproduction Biology, Research Institute for Zoo and Wildlife Research (IZW), Berlin, Germany
| |
Collapse
|
26
|
Zhou J, Liu L, Reynolds E, Huang X, Garrick D, Shi Y. Discovering Copy Number Variation in Dual-Purpose XinJiang Brown Cattle. Front Genet 2022; 12:747431. [PMID: 35222511 PMCID: PMC8873982 DOI: 10.3389/fgene.2021.747431] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Accepted: 11/01/2021] [Indexed: 12/02/2022] Open
Abstract
Copy number variants (CNVs), which are a class of structural variant, can be important in relating genomic variation to phenotype. The primary aims of this study were to discover the common CNV regions (CNVRs) in the dual-purpose XinJiang-Brown cattle population and to detect differences between CNVs inferred using the ARS-UCD 1.2 (ARS) or the UMD 3.1 (UMD) genome assemblies based on the 150K SNP (Single Nucleotide Polymorphisms) Chip. PennCNV and CNVPartition methods were applied to calculate the deviation of the standardized signal intensity of SNPs markers to detect CNV status. Following the discovery of CNVs, we used the R package HandyCNV to generate and visualize CNVRs, compare CNVs and CNVRs between genome assemblies, and identify consensus genes using annotation resources. We identified 38 consensus CNVRs using the ARS assembly with 1.95% whole genome coverage, and 33 consensus CNVRs using the UMD assembly with 1.46% whole genome coverage using PennCNV and CNVPartition. We identified 37 genes that intersected 13 common CNVs (>5% frequency), these included functionally interesting genes such as GBP4 for which an increased copy number has been negatively associated with cattle stature, and the BoLA gene family which has been linked to the immune response and adaption of cattle. The ARS map file of the GGP Bovine 150K Bead Chip maps the genomic position of more SNPs with increased accuracy compared to the UMD map file. Comparison of the CNVRs identified between the two reference assemblies suggests the newly released ARS reference assembly is better for CNV detection. In spite of this, different CNV detection methods can complement each other to generate a larger number of CNVRs than using a single approach and can highlight more genes of interest.
Collapse
Affiliation(s)
- Jinghang Zhou
- School of Agriculture, Ningxia University, Yinchuan, China
- AL Rae Centre for Genetics and Breeding, Massey University, Hamilton, New Zealand
| | - Liyuan Liu
- School of Agriculture, Ningxia University, Yinchuan, China
- AL Rae Centre for Genetics and Breeding, Massey University, Hamilton, New Zealand
| | - Edwardo Reynolds
- AL Rae Centre for Genetics and Breeding, Massey University, Hamilton, New Zealand
| | - Xixia Huang
- College of Animal Science, Xinjiang Agricultural University, Urumqi, China
| | - Dorian Garrick
- AL Rae Centre for Genetics and Breeding, Massey University, Hamilton, New Zealand
- *Correspondence: Yuangang Shi, ; Dorian Garrick, mailto:
| | - Yuangang Shi
- School of Agriculture, Ningxia University, Yinchuan, China
- *Correspondence: Yuangang Shi, ; Dorian Garrick, mailto:
| |
Collapse
|
27
|
Tisserant E, Vitobello A, Callegarin D, Verdez S, Bruel AL, Aho Glele LS, Sorlin A, Viora-Dupont E, Konyukh M, Marle N, Nambot S, Moutton S, Racine C, Garde A, Delanne J, Tran-Mau-Them F, Philippe C, Kuentz P, Poulleau M, Payet M, Poe C, Thauvin-Robinet C, Faivre L, Mosca-Boidron AL, Thevenon J, Duffourd Y, Callier P. Copy number variants calling from WES data through eXome hidden Markov model (XHMM) identifies additional 2.5% pathogenic genomic imbalances smaller than 30 kb undetected by array-CGH. Ann Hum Genet 2022; 86:171-180. [PMID: 35141892 DOI: 10.1111/ahg.12459] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Revised: 12/14/2021] [Accepted: 01/11/2022] [Indexed: 12/14/2022]
Abstract
It has been estimated that Copy Number Variants (CNVs) account for 10%-20% of patients affected by Developmental Disorder (DD)/Intellectual Disability (ID). Although array comparative genomic hybridization (array-CGH) represents the gold-standard for the detection of genomic imbalances, common Agilent array-CGH 4 × 180 kb arrays fail to detect CNVs smaller than 30 kb. Whole Exome sequencing (WES) is becoming the reference application for the detection of gene variants and makes it possible also to infer genomic imbalances at single exon resolution. However, the contribution of small CNVs in DD/ID is still underinvestigated. We made use of the eXome Hidden Markov Model (XHMM) software, a tool utilized by the ExAC consortium, to detect CNVs from whole exome sequencing data, in a cohort of 200 unsolved DD/DI patients after array-CGH and WES-based single nucleotide/indel variant analyses. In five out of 200 patients (2.5%), we identified pathogenic CNV(s) smaller than 30 kb, ranging from one to six exons. They included two heterozygous deletions in TCF4 and STXBP1 and three homozygous deletions in PPT1, CLCN2, and PIGN. After reverse phenotyping, all variants were reported as causative. This study shows the interest in applying sequencing-based CNV detection, from available WES data, to reduce the diagnostic odyssey of additional patients unsolved DD/DI patients and compare the CNV-detection yield of Agilent array-CGH 4 × 180kb versus whole exome sequencing.
Collapse
Affiliation(s)
- Emilie Tisserant
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France
| | - Antonio Vitobello
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France.,Molecular and chromosomal genetics laboratory, Biology Transfer Platform, Dijon University Hospital, Dijon, France
| | - Davide Callegarin
- Molecular and chromosomal genetics laboratory, Biology Transfer Platform, Dijon University Hospital, Dijon, France
| | - Simon Verdez
- Molecular and chromosomal genetics laboratory, Biology Transfer Platform, Dijon University Hospital, Dijon, France
| | - Ange-Line Bruel
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France
| | | | - Arthur Sorlin
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France.,Molecular and chromosomal genetics laboratory, Biology Transfer Platform, Dijon University Hospital, Dijon, France
| | - Eleonore Viora-Dupont
- Molecular and chromosomal genetics laboratory, Biology Transfer Platform, Dijon University Hospital, Dijon, France
| | - Marina Konyukh
- Molecular and chromosomal genetics laboratory, Biology Transfer Platform, Dijon University Hospital, Dijon, France
| | - Nathalie Marle
- Molecular and chromosomal genetics laboratory, Biology Transfer Platform, Dijon University Hospital, Dijon, France
| | - Sophie Nambot
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France.,Hospital Hygiene and Epidemiology Unit, Dijon University Hospital, Dijon Cedex, France
| | - Sébastien Moutton
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France.,Molecular and chromosomal genetics laboratory, Biology Transfer Platform, Dijon University Hospital, Dijon, France.,Reference Center for Intellectual Disorders, Dijon University Hospital, Dijon, France
| | - Caroline Racine
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France.,Molecular and chromosomal genetics laboratory, Biology Transfer Platform, Dijon University Hospital, Dijon, France.,Genetics Department and Reference Center for Developmental Disorders and Malformative Syndromes for East France, FHU TRANSLAD, Dijon University Hospital, Dijon, France
| | - Aurore Garde
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France.,Molecular and chromosomal genetics laboratory, Biology Transfer Platform, Dijon University Hospital, Dijon, France
| | - Julian Delanne
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France.,Genetics Department and Reference Center for Developmental Disorders and Malformative Syndromes for East France, FHU TRANSLAD, Dijon University Hospital, Dijon, France
| | - Frédéric Tran-Mau-Them
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France
| | - Christophe Philippe
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France.,Molecular and chromosomal genetics laboratory, Biology Transfer Platform, Dijon University Hospital, Dijon, France
| | - Paul Kuentz
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France
| | - Marlène Poulleau
- Molecular and chromosomal genetics laboratory, Biology Transfer Platform, Dijon University Hospital, Dijon, France
| | - Muriel Payet
- Molecular and chromosomal genetics laboratory, Biology Transfer Platform, Dijon University Hospital, Dijon, France
| | - Charlotte Poe
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France
| | - Christel Thauvin-Robinet
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France.,Genetics Department and Reference Center for Developmental Disorders and Malformative Syndromes for East France, FHU TRANSLAD, Dijon University Hospital, Dijon, France.,Reference Center for Intellectual Disorders, Dijon University Hospital, Dijon, France
| | - Laurence Faivre
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France.,Genetics Department and Reference Center for Developmental Disorders and Malformative Syndromes for East France, FHU TRANSLAD, Dijon University Hospital, Dijon, France.,Reference Center for Intellectual Disorders, Dijon University Hospital, Dijon, France
| | - Anne-Laure Mosca-Boidron
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France.,Molecular and chromosomal genetics laboratory, Biology Transfer Platform, Dijon University Hospital, Dijon, France
| | - Julien Thevenon
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France.,Genetics Department and Reference Center for Developmental Disorders and Malformative Syndromes for East France, FHU TRANSLAD, Dijon University Hospital, Dijon, France
| | - Yannis Duffourd
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France
| | - Patrick Callier
- Inserm UMR 1231 GAD, Faculty of Health Sciences, University of Burgundy and Franche-Comté, Dijon, France.,Molecular and chromosomal genetics laboratory, Biology Transfer Platform, Dijon University Hospital, Dijon, France
| |
Collapse
|
28
|
Chen X, Lin Y, Qu Q, Ning B, Chen H, Liao B, Li X. Analyzing Association between Expression Quantitative Trait and CNV for Breast Cancer Based on Gene Interaction Network Clustering and Group Sparse Learning. Curr Bioinform 2022. [DOI: 10.2174/1574893617666220207095117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Aims:
The occurrence and development of tumor is accompanied by the change of pathogenic gene expression. Tumor cells avoid the damage of immune cells by regulating the expression of immune related genes.
Background:
Tracing the causes of gene expression variation is helpful to understand tumor evolution and metastasis.
Objective:
Current gene expression variation explanation methods are confronted with several main challenges: low explanation power, insufficient prediction accuracy, and lack of biological meaning.
Method:
In this study, we propose a novel method to analyze the mRNA expression variations of breast cancers risk genes. Firstly, we collected some high-confidence risk genes related to breast cancer and then designed a rank-based method to preprocess the breast cancers copy number variation (CNV) and mRNA data. Secondly, to elevate the biological meaning and narrow down the combinatorial space, we introduced a prior gene interaction network and applied a network clustering algorithm to generate high density subnetworks. Lastly, to describe the interlinked structure within and between subnetworks and target genes mRNA expression, we proposed a group sparse learning model to identify CNVs for pathogenic genes expression variations.
Result:
The performance of the proposed method is evaluated by both significantly improved predication accuracy and biological meaning of pathway enrichment analysis.
Conclusion:
The experimental results show that our method has practical significance
Collapse
Affiliation(s)
- Xia Chen
- College of Computer Science and Electronic Engineering, Hunan University, Changsha,
Hunan, China
- School of Basic Education, Changsha Aeronautical Vocational and Technical College,
Changsha, Hunan, China
| | - Yexiong Lin
- College of Computer Science and Electronic Engineering, Hunan University, Changsha,
Hunan, China
| | - Qiang Qu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha,
Hunan, China
| | - Bin Ning
- College of Computer Science and Electronic Engineering, Hunan University, Changsha,
Hunan, China
| | - Haowen Chen
- College of Computer Science and Electronic Engineering, Hunan University, Changsha,
Hunan, China
| | - Bo Liao
- Ministry of Education, Hainan Normal University, Haikou, China
| | - Xiong Li
- School of Software, East China Jiaotong University, Nanchang, 330013, China
| |
Collapse
|
29
|
Combining callers improves the detection of copy number variants from whole-genome sequencing. Eur J Hum Genet 2022; 30:178-186. [PMID: 34744167 PMCID: PMC8821561 DOI: 10.1038/s41431-021-00983-x] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2020] [Revised: 09/23/2021] [Accepted: 10/04/2021] [Indexed: 01/03/2023] Open
Abstract
Copy Number Variants (CNVs) are deletions, duplications or insertions larger than 50 base pairs. They account for a large percentage of the normal genome variation and play major roles in human pathology. While array-based approaches have long been used to detect them in clinical practice, whole-genome sequencing (WGS) bears the promise to allow concomitant exploration of CNVs and smaller variants. However, accurately calling CNVs from WGS remains a difficult computational task, for which a consensus is still lacking. In this paper, we explore practical calling options to reach the best compromise between sensitivity and sensibility. We show that callers based on different signal (paired-end reads, split reads, coverage depth) yield complementary results. We suggest approaches combining four selected callers (Manta, Delly, ERDS, CNVnator) and a regenotyping tool (SV2), and show that this is applicable in everyday practice in terms of computation time and further interpretation. We demonstrate the superiority of these approaches over array-based Comparative Genomic Hybridization (aCGH), specifically regarding the lack of resolution in breakpoint definition and the detection of potentially relevant CNVs. Finally, we confirm our results on the NA12878 benchmark genome, as well as one clinically validated sample. In conclusion, we suggest that WGS constitutes a timely and economically valid alternative to the combination of aCGH and whole-exome sequencing.
Collapse
|
30
|
Identification of Copy Number Alterations from Next-Generation Sequencing Data. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2022; 1361:55-74. [DOI: 10.1007/978-3-030-91836-1_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
31
|
Ghieh F, Barbotin AL, Swierkowski-Blanchard N, Leroy C, Fortemps J, Gerault C, Hue C, Mambu Mambueni H, Jaillard S, Albert M, Bailly M, Izard V, Molina-Gomes D, Marcelli F, Prasivoravong J, Serazin V, Dieudonne MN, Delcroix M, Garchon HJ, Louboutin A, Mandon-Pepin B, Ferlicot S, Vialard F. OUP accepted manuscript. Hum Reprod 2022; 37:1334-1350. [PMID: 35413094 PMCID: PMC9156845 DOI: 10.1093/humrep/deac057] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Revised: 03/07/2022] [Indexed: 11/13/2022] Open
Affiliation(s)
- F Ghieh
- Université Paris-Saclay, UVSQ, INRAE, BREED, Jouy-en-Josas, France
- École Nationale Vétérinaire d’Alfort, BREED, Maisons-Alfort, France
| | - A L Barbotin
- Institut de Biologie de la Reproduction-Spermiologie-CECOS, Hôpital Jeanne de Flandre, Centre Hospitalier et Universitaire, Lille, France
| | - N Swierkowski-Blanchard
- Université Paris-Saclay, UVSQ, INRAE, BREED, Jouy-en-Josas, France
- École Nationale Vétérinaire d’Alfort, BREED, Maisons-Alfort, France
- Département de Gynécologie Obstétrique, CHI de Poissy/Saint-Germain-en-Laye, Poissy, France
| | - C Leroy
- Institut de Biologie de la Reproduction-Spermiologie-CECOS, Hôpital Jeanne de Flandre, Centre Hospitalier et Universitaire, Lille, France
| | - J Fortemps
- Service d’Anatomie Pathologique, CHI de Poissy/Saint-Germain-en-Laye, Saint-Germain-en-Laye, France
| | - C Gerault
- Département de Génétique, Laboratoire de Biologie Médicale, CHI de Poissy/Saint-Germain-en-Laye, Poissy, France
| | - C Hue
- Department of Biotechnology and Health, UVSQ, Université Paris-Saclay, Inserm UMR 1173, Montigny-le-Bretonneux, France
| | - H Mambu Mambueni
- Department of Biotechnology and Health, UVSQ, Université Paris-Saclay, Inserm UMR 1173, Montigny-le-Bretonneux, France
| | - S Jaillard
- Service de Cytogénétique, CHU Rennes, Rennes, France
- INSERM, EHESP, IRSET—UMR_S 1085, Université Rennes 1, Rennes, France
| | - M Albert
- Université Paris-Saclay, UVSQ, INRAE, BREED, Jouy-en-Josas, France
- École Nationale Vétérinaire d’Alfort, BREED, Maisons-Alfort, France
| | - M Bailly
- Département de Gynécologie Obstétrique, CHI de Poissy/Saint-Germain-en-Laye, Poissy, France
| | - V Izard
- Service d’Urologie, AP-HP, Université Paris-Saclay, Hôpital de Bicêtre, Le Kremlin-Bicêtre, France
| | - D Molina-Gomes
- Département de Génétique, Laboratoire de Biologie Médicale, CHI de Poissy/Saint-Germain-en-Laye, Poissy, France
| | - F Marcelli
- Institut de Biologie de la Reproduction-Spermiologie-CECOS, Hôpital Jeanne de Flandre, Centre Hospitalier et Universitaire, Lille, France
| | - J Prasivoravong
- Institut de Biologie de la Reproduction-Spermiologie-CECOS, Hôpital Jeanne de Flandre, Centre Hospitalier et Universitaire, Lille, France
| | - V Serazin
- Université Paris-Saclay, UVSQ, INRAE, BREED, Jouy-en-Josas, France
- École Nationale Vétérinaire d’Alfort, BREED, Maisons-Alfort, France
- Département de Génétique, Laboratoire de Biologie Médicale, CHI de Poissy/Saint-Germain-en-Laye, Poissy, France
| | - M N Dieudonne
- Université Paris-Saclay, UVSQ, INRAE, BREED, Jouy-en-Josas, France
- École Nationale Vétérinaire d’Alfort, BREED, Maisons-Alfort, France
| | - M Delcroix
- Département de Génétique, Laboratoire de Biologie Médicale, CHI de Poissy/Saint-Germain-en-Laye, Poissy, France
| | - H J Garchon
- Department of Biotechnology and Health, UVSQ, Université Paris-Saclay, Inserm UMR 1173, Montigny-le-Bretonneux, France
| | - A Louboutin
- Service d’Anatomie Pathologique, CHI de Poissy/Saint-Germain-en-Laye, Saint-Germain-en-Laye, France
| | - B Mandon-Pepin
- Université Paris-Saclay, UVSQ, INRAE, BREED, Jouy-en-Josas, France
- École Nationale Vétérinaire d’Alfort, BREED, Maisons-Alfort, France
| | - S Ferlicot
- Service d’Anatomie Pathologique, AP-HP, Université Paris-Saclay, Hôpital de Bicêtre, Le Kremlin-Bicêtre, France
| | - F Vialard
- Correspondence address. Tel: +33-139-274-700; E-mail:
| |
Collapse
|
32
|
Wang T, Sun J, Zhang X, Wang WJ, Zhou Q. CNV-P: a machine-learning framework for predicting high confident copy number variations. PeerJ 2021; 9:e12564. [PMID: 34917425 PMCID: PMC8645205 DOI: 10.7717/peerj.12564] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Accepted: 11/08/2021] [Indexed: 12/27/2022] Open
Abstract
Background Copy-number variants (CNVs) have been recognized as one of the major causes of genetic disorders. Reliable detection of CNVs from genome sequencing data has been a strong demand for disease research. However, current software for detecting CNVs has high false-positive rates, which needs further improvement. Methods Here, we proposed a novel and post-processing approach for CNVs prediction (CNV-P), a machine-learning framework that could efficiently remove false-positive fragments from results of CNVs detecting tools. A series of CNVs signals such as read depth (RD), split reads (SR) and read pair (RP) around the putative CNV fragments were defined as features to train a classifier. Results The prediction results on several real biological datasets showed that our models could accurately classify the CNVs at over 90% precision rate and 85% recall rate, which greatly improves the performance of state-of-the-art algorithms. Furthermore, our results indicate that CNV-P is robust to different sizes of CNVs and the platforms of sequencing. Conclusions Our framework for classifying high-confident CNVs could improve both basic research and clinical diagnosis of genetic diseases.
Collapse
Affiliation(s)
| | - Jinghua Sun
- BGI-Shenzhen, Shenzhen, China.,College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Xiuqing Zhang
- BGI-Shenzhen, Shenzhen, China.,College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China.,Guangdong Enterprise Key Laboratory of Human Disease Genomics, Beishan Industrial Zone, Shenzhen, China
| | | | | |
Collapse
|
33
|
Gabrielaite M, Torp MH, Rasmussen MS, Andreu-Sánchez S, Vieira FG, Pedersen CB, Kinalis S, Madsen MB, Kodama M, Demircan GS, Simonyan A, Yde CW, Olsen LR, Marvig RL, Østrup O, Rossing M, Nielsen FC, Winther O, Bagger FO. A Comparison of Tools for Copy-Number Variation Detection in Germline Whole Exome and Whole Genome Sequencing Data. Cancers (Basel) 2021; 13:cancers13246283. [PMID: 34944901 PMCID: PMC8699073 DOI: 10.3390/cancers13246283] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Revised: 12/03/2021] [Accepted: 12/08/2021] [Indexed: 12/28/2022] Open
Abstract
Copy-number variations (CNVs) have important clinical implications for several diseases and cancers. Relevant CNVs are hard to detect because common structural variations define large parts of the human genome. CNV calling from short-read sequencing would allow single protocol full genomic profiling. We reviewed 50 popular CNV calling tools and included 11 tools for benchmarking in a reference cohort encompassing 39 whole genome sequencing (WGS) samples paired current clinical standard-SNP-array based CNV calling. Additionally, for nine samples we also performed whole exome sequencing (WES), to address the effect of sequencing protocol on CNV calling. Furthermore, we included Gold Standard reference sample NA12878, and tested 12 samples with CNVs confirmed by multiplex ligation-dependent probe amplification (MLPA). Tool performance varied greatly in the number of called CNVs and bias for CNV lengths. Some tools had near-perfect recall of CNVs from arrays for some samples, but poor precision. Several tools had better performance for NA12878, which could be a result of overfitting. We suggest combining the best tools also based on different methodologies: GATK gCNV, Lumpy, DELLY, and cn.MOPS. Reducing the total number of called variants could potentially be assisted by the use of background panels for filtering of frequently called variants.
Collapse
Affiliation(s)
- Migle Gabrielaite
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
| | - Mathias Husted Torp
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
| | - Malthe Sebro Rasmussen
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
| | - Sergio Andreu-Sánchez
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
| | - Filipe Garrett Vieira
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
| | - Christina Bligaard Pedersen
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
- Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Ørsteds Pl. 345C, 2800 Kgs. Lyngby, Denmark
| | - Savvas Kinalis
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
| | - Majbritt Busk Madsen
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
| | - Miyako Kodama
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
| | - Gül Sude Demircan
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
| | - Arman Simonyan
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
| | - Christina Westmose Yde
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
| | - Lars Rønn Olsen
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
- Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Ørsteds Pl. 345C, 2800 Kgs. Lyngby, Denmark
| | - Rasmus L. Marvig
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
| | - Olga Østrup
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
| | - Maria Rossing
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
- Department of Clinical Medicine, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Finn Cilius Nielsen
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
| | - Ole Winther
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
- Bioinformatics Centre, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen, Denmark
- Section for Cognitive Systems, Department of Applied Mathematics and Computer Science, Technical University of Denmark, Matematiktorvet 303B, 2800 Kgs. Lyngby, Denmark
| | - Frederik Otzen Bagger
- Center for Genomic Medicine, Rigshospitalet, University of Copenhagen, Blegdamsvej 9, 2100 Copenhagen, Denmark; (M.G.); (M.H.T.); (M.S.R.); (S.A.-S.); (F.G.V.); (C.B.P.); (S.K.); (M.B.M.); (M.K.); (G.S.D.); (A.S.); (C.W.Y.); (L.R.O.); (R.L.M.); (O.Ø.); (M.R.); (F.C.N.); (O.W.)
- Department of Biomedicine, UKBB Universitats-Kinderspital Basel, 4031 Basel, Switzerland
- Swiss Institute of Bioinformatics, Hebelstrasse 20, 4031 Basel, Switzerland
- Correspondence:
| |
Collapse
|
34
|
Comprehensive characterization of copy number variation (CNV) called from array, long- and short-read data. BMC Genomics 2021; 22:826. [PMID: 34789167 PMCID: PMC8596897 DOI: 10.1186/s12864-021-08082-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 10/13/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND SNP arrays, short- and long-read genome sequencing are genome-wide high-throughput technologies that may be used to assay copy number variants (CNVs) in a personal genome. Each of these technologies comes with its own limitations and biases, many of which are well-known, but not all of them are thoroughly quantified. RESULTS We assembled an ensemble of public datasets of published CNV calls and raw data for the well-studied Genome in a Bottle individual NA12878. This assembly represents a variety of methods and pipelines used for CNV calling from array, short- and long-read technologies. We then performed cross-technology comparisons regarding their ability to call CNVs. Different from other studies, we refrained from using the golden standard. Instead, we attempted to validate the CNV calls by the raw data of each technology. CONCLUSIONS Our study confirms that long-read platforms enable recalling CNVs in genomic regions inaccessible to arrays or short reads. We also found that the reproducibility of a CNV by different pipelines within each technology is strongly linked to other CNV evidence measures. Importantly, the three technologies show distinct public database frequency profiles, which differ depending on what technology the database was built on.
Collapse
|
35
|
Chiliński M, Sengupta K, Plewczynski D. From DNA human sequence to the chromatin higher order organisation and its biological meaning: Using biomolecular interaction networks to understand the influence of structural variation on spatial genome organisation and its functional effect. Semin Cell Dev Biol 2021; 121:171-185. [PMID: 34429265 DOI: 10.1016/j.semcdb.2021.08.007] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Revised: 08/06/2021] [Accepted: 08/12/2021] [Indexed: 12/30/2022]
Abstract
The three-dimensional structure of the human genome has been proven to have a significant functional impact on gene expression. The high-order spatial chromatin is organised first by looping mediated by multiple protein factors, and then it is further formed into larger structures of topologically associated domains (TADs) or chromatin contact domains (CCDs), followed by A/B compartments and finally the chromosomal territories (CTs). The genetic variation observed in human population influences the multi-scale structures, posing a question regarding the functional impact of structural variants reflected by the variability of the genes expression patterns. The current methods of evaluating the functional effect include eQTLs analysis which uses statistical testing of influence of variants on spatially close genes. Rarely, non-coding DNA sequence changes are evaluated by their impact on the biomolecular interaction network (BIN) reflecting the cellular interactome that can be analysed by the classical graph-theoretic algorithms. Therefore, in the second part of the review, we introduce the concept of BIN, i.e. a meta-network model of the complete molecular interactome developed by integrating various biological networks. The BIN meta-network model includes DNA-protein binding by the plethora of protein factors as well as chromatin interactions, therefore allowing connection of genomics with the downstream biomolecular processes present in a cell. As an illustration, we scrutinise the chromatin interactions mediated by the CTCF protein detected in a ChIA-PET experiment in the human lymphoblastoid cell line GM12878. In the corresponding BIN meta-network the DNA spatial proximity is represented as a graph model, combined with the Proteins-Interaction Network (PIN) of human proteome using the Gene Association Network (GAN). Furthermore, we enriched the BIN with the signalling and metabolic pathways and Gene Ontology (GO) terms to assert its functional context. Finally, we mapped the Single Nucleotide Polymorphisms (SNPs) from the GWAS studies and identified the chromatin mutational hot-spots associated with a significant enrichment of SNPs related to autoimmune diseases. Afterwards, we mapped Structural Variants (SVs) from healthy individuals of 1000 Genomes Project and identified an interesting example of the missing protein complex associated with protein Q6GYQ0 due to a deletion on chromosome 14. Such an analysis using the meta-network BIN model is therefore helpful in evaluating the influence of genetic variation on spatial organisation of the genome and its functional effect in a cell.
Collapse
Affiliation(s)
- Mateusz Chiliński
- Laboratory of Bioinformatics and Computational Genomics, Faculty of Mathematics and Information Science, Warsaw University of Technology, Koszykowa 75, 00-662 Warsaw, Poland; Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Banacha 2c, 02-097 Warsaw, Poland
| | - Kaustav Sengupta
- Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Banacha 2c, 02-097 Warsaw, Poland
| | - Dariusz Plewczynski
- Laboratory of Bioinformatics and Computational Genomics, Faculty of Mathematics and Information Science, Warsaw University of Technology, Koszykowa 75, 00-662 Warsaw, Poland; Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Banacha 2c, 02-097 Warsaw, Poland.
| |
Collapse
|
36
|
Karim A, Tang CSM, Tam PKH. The Emerging Genetic Landscape of Hirschsprung Disease and Its Potential Clinical Applications. Front Pediatr 2021; 9:638093. [PMID: 34422713 PMCID: PMC8374333 DOI: 10.3389/fped.2021.638093] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/05/2020] [Accepted: 07/02/2021] [Indexed: 12/25/2022] Open
Abstract
Hirschsprung disease (HSCR) is the leading cause of neonatal functional intestinal obstruction. It is a rare congenital disease with an incidence of one in 3,500-5,000 live births. HSCR is characterized by the absence of enteric ganglia in the distal colon, plausibly due to genetic defects perturbing the normal migration, proliferation, differentiation, and/or survival of the enteric neural crest cells as well as impaired interaction with the enteric progenitor cell niche. Early linkage analyses in Mendelian and syndromic forms of HSCR uncovered variants with large effects in major HSCR genes including RET, EDNRB, and their interacting partners in the same biological pathways. With the advances in genome-wide genotyping and next-generation sequencing technologies, there has been a remarkable progress in understanding of the genetic basis of HSCR in the past few years, with common and rare variants with small to moderate effects being uncovered. The discovery of new HSCR genes such as neuregulin and BACE2 as well as the deeper understanding of the roles and mechanisms of known HSCR genes provided solid evidence that many HSCR cases are in the form of complex polygenic/oligogenic disorder where rare variants act in the sensitized background of HSCR-associated common variants. This review summarizes the roadmap of genetic discoveries of HSCR from the earlier family-based linkage analyses to the recent population-based genome-wide analyses coupled with functional genomics, and how these discoveries facilitated our understanding of the genetic architecture of this complex disease and provide the foundation of clinical translation for precision and stratified medicine.
Collapse
Affiliation(s)
- Anwarul Karim
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Clara Sze-Man Tang
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
- Li Dak-Sum Research Center, The University of Hong Kong—Karolinska Institute Collaboration in Regenerative Medicine, Hong Kong, China
| | - Paul Kwong-Hang Tam
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
- Li Dak-Sum Research Center, The University of Hong Kong—Karolinska Institute Collaboration in Regenerative Medicine, Hong Kong, China
| |
Collapse
|
37
|
Ren Y, Lian Y, Yan Z, Zhai F, Yang M, Zhu X, Wang Y, Nie Y, Guan S, Kuo Y, Huang J, Shi X, Jia J, Qiao J, Yan L. Clinical application of an NGS-based method in the preimplantation genetic testing for Duchenne muscular dystrophy. J Assist Reprod Genet 2021; 38:1979-1986. [PMID: 33719023 PMCID: PMC8417207 DOI: 10.1007/s10815-021-02126-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Accepted: 02/22/2021] [Indexed: 01/16/2023] Open
Abstract
PURPOSE To determine whether next-generation sequencing (NGS) could be used to directly detect different mutations of Duchenne muscular dystrophy (DMD) during preimplantation genetic testing (PGT). METHODS From Sep. 2016 to Aug. 2018, a total of six couples participated in this study. Four cases carried DMD exon deletions and two carried exon duplications. Trophectoderm cells were biopsied at day 5 or 6 and NGS was used in the genetic testing of the biopsied cells after whole-genome amplification. We developed a new method-DIRected Embryonic Cell Testing of Exon Deletion/Duplication (DIRECTED) to directly detect the single-gene mutation by NGS. Linage analysis based on single-nucleotide polymorphism (SNP) was used to validate the results from DIRECTED. RESULTS In the four deletion cases, DIRECTED was used to detect DMD exon deletion in 16 biopsied embryos. All DIRECTED results were consistent with linkage analysis, indicating this method was reliable in detecting deletions around 1 Mb. In the two cases carrying exon duplications, no blastocyst was obtained for biopsy. Nonetheless, preliminary experiment results suggested that DIRECTED could also be used for direct detection of exon duplications in embryos. CONCLUSIONS Exon deletions or duplications in DMD of preimplantation embryos could be detected directly by NGS-based methods during PGT.
Collapse
Affiliation(s)
- Yixin Ren
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, No. 49 Hua Yuan Bei Road, Hai Dian District, Beijing, 100191, China
- National Clinical Center for Obstetrics and Gynecology, Beijing, China
- Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, China
| | - Ying Lian
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, No. 49 Hua Yuan Bei Road, Hai Dian District, Beijing, 100191, China
- National Clinical Center for Obstetrics and Gynecology, Beijing, China
- Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, China
| | - Zhiqiang Yan
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, No. 49 Hua Yuan Bei Road, Hai Dian District, Beijing, 100191, China
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Fan Zhai
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, No. 49 Hua Yuan Bei Road, Hai Dian District, Beijing, 100191, China
- Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, China
- Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, China
| | - Ming Yang
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, No. 49 Hua Yuan Bei Road, Hai Dian District, Beijing, 100191, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Xiaohui Zhu
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, No. 49 Hua Yuan Bei Road, Hai Dian District, Beijing, 100191, China
- National Clinical Center for Obstetrics and Gynecology, Beijing, China
- Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, China
| | - Yuqian Wang
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, No. 49 Hua Yuan Bei Road, Hai Dian District, Beijing, 100191, China
- National Clinical Center for Obstetrics and Gynecology, Beijing, China
- Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, China
| | - Yanli Nie
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, No. 49 Hua Yuan Bei Road, Hai Dian District, Beijing, 100191, China
- Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, China
- Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, China
| | - Shuo Guan
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, No. 49 Hua Yuan Bei Road, Hai Dian District, Beijing, 100191, China
- Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, China
- Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, China
| | - Ying Kuo
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, No. 49 Hua Yuan Bei Road, Hai Dian District, Beijing, 100191, China
- National Clinical Center for Obstetrics and Gynecology, Beijing, China
- Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, China
| | - Jin Huang
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, No. 49 Hua Yuan Bei Road, Hai Dian District, Beijing, 100191, China
- Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, China
- Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, China
| | - Xiaodan Shi
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, No. 49 Hua Yuan Bei Road, Hai Dian District, Beijing, 100191, China
- Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, China
- Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, China
| | - Jialin Jia
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, No. 49 Hua Yuan Bei Road, Hai Dian District, Beijing, 100191, China
- Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, China
- Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, China
| | - Jie Qiao
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, No. 49 Hua Yuan Bei Road, Hai Dian District, Beijing, 100191, China
- National Clinical Center for Obstetrics and Gynecology, Beijing, China
- Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
- Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproduction, Beijing, China
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, China
- Research Units of Comprehensive Diagnosis and Treatment of Oocyte Maturation Arrest, Beijing, China
| | - Liying Yan
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, No. 49 Hua Yuan Bei Road, Hai Dian District, Beijing, 100191, China.
- National Clinical Center for Obstetrics and Gynecology, Beijing, China.
- Key Laboratory of Assisted Reproduction, Ministry of Education, Beijing, China.
| |
Collapse
|
38
|
Upadhyay M, Derks MFL, Andersson G, Medugorac I, Groenen MAM, Crooijmans RPMA. Introgression contributes to distribution of structural variations in cattle. Genomics 2021; 113:3092-3102. [PMID: 34242710 DOI: 10.1016/j.ygeno.2021.07.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2021] [Revised: 06/24/2021] [Accepted: 07/03/2021] [Indexed: 11/19/2022]
Abstract
Structural variations (SVs) are an important source of phenotypic diversity in cattle. Here, 72 whole genome sequences representing taurine and zebu cattle were used to identify SVs. Applying multiple approaches, 16,738 SVs were identified. A comparison against the Database of Genomic Variants archives revealed that 1575 SVs were novel in our data. A novel duplication covering the entire GALNT15 gene, was observed only in N'Dama. A duplication, which was previously reported only in zebu and associated with navel length, was also observed in N'Dama. Investigation of a novel deletion located upstream of CAST13 gene and identified only in Italian cattle and zebu, revealed its introgressed origin in the former. Overall, our data highlights how the SVs distribution in cattle is also shaped by forces such as demographical differences and gene flow. The cattle SVs of this study and its meta-data can be visualized on an interactive genome browser at https://tinyurl.com/svCowArs.
Collapse
Affiliation(s)
- Maulik Upadhyay
- Animal Breeding and Genomics, Wageningen University & Research, Droevendaalsesteeg 1, 6708PB, Wageningen, The Netherlands; Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, 75007 Uppsala, Sweden; Population Genomics Group, Department of Veterinary Sciences, Ludwig-Maximilians-University Munich, 80539 Munich, Germany.
| | - Martijn F L Derks
- Animal Breeding and Genomics, Wageningen University & Research, Droevendaalsesteeg 1, 6708PB, Wageningen, The Netherlands.
| | - Göran Andersson
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, 75007 Uppsala, Sweden.
| | - Ivica Medugorac
- Population Genomics Group, Department of Veterinary Sciences, Ludwig-Maximilians-University Munich, 80539 Munich, Germany.
| | - Martien A M Groenen
- Animal Breeding and Genomics, Wageningen University & Research, Droevendaalsesteeg 1, 6708PB, Wageningen, The Netherlands.
| | - Richard P M A Crooijmans
- Animal Breeding and Genomics, Wageningen University & Research, Droevendaalsesteeg 1, 6708PB, Wageningen, The Netherlands.
| |
Collapse
|
39
|
Nandolo W, Mészáros G, Wurzinger M, Banda LJ, Gondwe TN, Mulindwa HA, Nakimbugwe HN, Clark EL, Woodward-Greene MJ, Liu M, Liu GE, Van Tassell CP, Rosen BD, Sölkner J. Detection of copy number variants in African goats using whole genome sequence data. BMC Genomics 2021; 22:398. [PMID: 34051743 PMCID: PMC8164248 DOI: 10.1186/s12864-021-07703-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2020] [Accepted: 05/11/2021] [Indexed: 12/21/2022] Open
Abstract
Background Copy number variations (CNV) are a significant source of variation in the genome and are therefore essential to the understanding of genetic characterization. The aim of this study was to develop a fine-scaled copy number variation map for African goats. We used sequence data from multiple breeds and from multiple African countries. Results A total of 253,553 CNV (244,876 deletions and 8677 duplications) were identified, corresponding to an overall average of 1393 CNV per animal. The mean CNV length was 3.3 kb, with a median of 1.3 kb. There was substantial differentiation between the populations for some CNV, suggestive of the effect of population-specific selective pressures. A total of 6231 global CNV regions (CNVR) were found across all animals, representing 59.2 Mb (2.4%) of the goat genome. About 1.6% of the CNVR were present in all 34 breeds and 28.7% were present in all 5 geographical areas across Africa, where animals had been sampled. The CNVR had genes that were highly enriched in important biological functions, molecular functions, and cellular components including retrograde endocannabinoid signaling, glutamatergic synapse and circadian entrainment. Conclusions This study presents the first fine CNV map of African goat based on WGS data and adds to the growing body of knowledge on the genetic characterization of goats. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07703-1.
Collapse
Affiliation(s)
- Wilson Nandolo
- University of Natural Resources and Life Sciences, Vienna, Austria.,Lilongwe University of Agriculture and Natural Resources, Lilongwe, Malawi
| | - Gábor Mészáros
- University of Natural Resources and Life Sciences, Vienna, Austria
| | - Maria Wurzinger
- University of Natural Resources and Life Sciences, Vienna, Austria
| | - Liveness J Banda
- Lilongwe University of Agriculture and Natural Resources, Lilongwe, Malawi
| | - Timothy N Gondwe
- Lilongwe University of Agriculture and Natural Resources, Lilongwe, Malawi
| | | | | | - Emily L Clark
- The Roslin Institute, University of Edinburgh, Edinburgh, Scotland, UK
| | - M Jennifer Woodward-Greene
- Animal Genomics and Improvement Laboratory, USDA-ARS, Beltsville, MD, USA.,National Agricultural Library, USDA-ARS, Beltsville, MD, USA
| | - Mei Liu
- Animal Genomics and Improvement Laboratory, USDA-ARS, Beltsville, MD, USA
| | | | - George E Liu
- Animal Genomics and Improvement Laboratory, USDA-ARS, Beltsville, MD, USA
| | | | - Benjamin D Rosen
- Animal Genomics and Improvement Laboratory, USDA-ARS, Beltsville, MD, USA.
| | - Johann Sölkner
- University of Natural Resources and Life Sciences, Vienna, Austria
| |
Collapse
|
40
|
Smolander J, Khan S, Singaravelu K, Kauko L, Lund RJ, Laiho A, Elo LL. Evaluation of tools for identifying large copy number variations from ultra-low-coverage whole-genome sequencing data. BMC Genomics 2021; 22:357. [PMID: 34000988 PMCID: PMC8130438 DOI: 10.1186/s12864-021-07686-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Accepted: 05/07/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Detection of copy number variations (CNVs) from high-throughput next-generation whole-genome sequencing (WGS) data has become a widely used research method during the recent years. However, only a little is known about the applicability of the developed algorithms to ultra-low-coverage (0.0005-0.8×) data that is used in various research and clinical applications, such as digital karyotyping and single-cell CNV detection. RESULT Here, the performance of six popular read-depth based CNV detection algorithms (BIC-seq2, Canvas, CNVnator, FREEC, HMMcopy, and QDNAseq) was studied using ultra-low-coverage WGS data. Real-world array- and karyotyping kit-based validation were used as a benchmark in the evaluation. Additionally, ultra-low-coverage WGS data was simulated to investigate the ability of the algorithms to identify CNVs in the sex chromosomes and the theoretical minimum coverage at which these tools can accurately function. Our results suggest that while all the methods were able to detect large CNVs, many methods were susceptible to producing false positives when smaller CNVs (< 2 Mbp) were detected. There was also significant variability in their ability to identify CNVs in the sex chromosomes. Overall, BIC-seq2 was found to be the best method in terms of statistical performance. However, its significant drawback was by far the slowest runtime among the methods (> 3 h) compared with FREEC (~ 3 min), which we considered the second-best method. CONCLUSIONS Our comparative analysis demonstrates that CNV detection from ultra-low-coverage WGS data can be a highly accurate method for the detection of large copy number variations when their length is in millions of base pairs. These findings facilitate applications that utilize ultra-low-coverage CNV detection.
Collapse
Affiliation(s)
- Johannes Smolander
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520, Turku, Finland
| | - Sofia Khan
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520, Turku, Finland
| | - Kalaimathy Singaravelu
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520, Turku, Finland
| | - Leni Kauko
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520, Turku, Finland
| | - Riikka J Lund
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520, Turku, Finland
| | - Asta Laiho
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520, Turku, Finland
| | - Laura L Elo
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520, Turku, Finland.
- Institute of Biomedicine, University of Turku, 20520, Turku, Finland.
| |
Collapse
|
41
|
Moreno-Cabrera JM, Del Valle J, Castellanos E, Feliubadaló L, Pineda M, Serra E, Capellá G, Lázaro C, Gel B. CNVfilteR: an R/bioconductor package to identify false positives produced by germline NGS CNV detection tools. Bioinformatics 2021; 37:4227-4229. [PMID: 33983414 PMCID: PMC9502136 DOI: 10.1093/bioinformatics/btab356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2020] [Revised: 03/06/2021] [Accepted: 05/12/2021] [Indexed: 11/14/2022] Open
Abstract
Germline copy-number variants (CNVs) are relevant mutations for multiple genetics fields, such as the study of hereditary diseases. However, available benchmarks show that all next-generation sequencing (NGS) CNV calling tools produce false positives. We developed CNVfilteR, an R package that uses the single nucleotide variant calls usually obtained in germline NGS pipelines to identify those false positives. The package can detect both false deletions and false duplications. We evaluated CNVfilteR performance on callsets generated by 13 CNV calling tools on 3 whole-genome sequencing and 541 panel samples, showing a decrease of up to 44.8% in false positives and consistent F1-score increase. Using CNVfilteR to detect false-positive calls can improve the overall performance of existing CNV calling pipelines. AVAILABILITY CNVfilteR is released under Artistic-2.0 License. Source code and documentation are freely available at Bioconductor (http://www.bioconductor.org/packages/CNVfilteR). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- José Marcos Moreno-Cabrera
- Hereditary Cancer Group, Program for Predictive and Personalized Medicine of Cancer, Germans Trias i Pujol Research Institute (PMPPC-IGTP), Campus, Ruti Badalona Barcelona, Can Spain.,Hereditary Cancer Program, Joint Program on Hereditary Cancer, Catalan Institute of Oncology, Institut d'Investigació Biomèdica de Bellvitge-IDIBELL, L'Hospitalet de Llobregat, Barcelona, Spain.,Instituto de Salud Carlos III, Centro de Investigación Biomédica en Red Cáncer (CIBERONC), Madrid, Spain
| | - Jesús Del Valle
- Hereditary Cancer Program, Joint Program on Hereditary Cancer, Catalan Institute of Oncology, Institut d'Investigació Biomèdica de Bellvitge-IDIBELL, L'Hospitalet de Llobregat, Barcelona, Spain.,Instituto de Salud Carlos III, Centro de Investigación Biomédica en Red Cáncer (CIBERONC), Madrid, Spain
| | - Elisabeth Castellanos
- Hereditary Cancer Group, Program for Predictive and Personalized Medicine of Cancer, Germans Trias i Pujol Research Institute (PMPPC-IGTP), Campus, Ruti Badalona Barcelona, Can Spain.,Clinical Genomics Unit, Clinical Genetics Service, Northern Metropolitan Clinical Laboratory, Germans Trias i Pujol University Hospital (HUGTiP), Ruti, Campus Badalona Barcelona, Can Spain
| | - Lidia Feliubadaló
- Hereditary Cancer Program, Joint Program on Hereditary Cancer, Catalan Institute of Oncology, Institut d'Investigació Biomèdica de Bellvitge-IDIBELL, L'Hospitalet de Llobregat, Barcelona, Spain.,Instituto de Salud Carlos III, Centro de Investigación Biomédica en Red Cáncer (CIBERONC), Madrid, Spain
| | - Marta Pineda
- Hereditary Cancer Program, Joint Program on Hereditary Cancer, Catalan Institute of Oncology, Institut d'Investigació Biomèdica de Bellvitge-IDIBELL, L'Hospitalet de Llobregat, Barcelona, Spain.,Instituto de Salud Carlos III, Centro de Investigación Biomédica en Red Cáncer (CIBERONC), Madrid, Spain
| | - Eduard Serra
- Hereditary Cancer Group, Program for Predictive and Personalized Medicine of Cancer, Germans Trias i Pujol Research Institute (PMPPC-IGTP), Campus, Ruti Badalona Barcelona, Can Spain.,Instituto de Salud Carlos III, Centro de Investigación Biomédica en Red Cáncer (CIBERONC), Madrid, Spain
| | - Gabriel Capellá
- Hereditary Cancer Program, Joint Program on Hereditary Cancer, Catalan Institute of Oncology, Institut d'Investigació Biomèdica de Bellvitge-IDIBELL, L'Hospitalet de Llobregat, Barcelona, Spain.,Instituto de Salud Carlos III, Centro de Investigación Biomédica en Red Cáncer (CIBERONC), Madrid, Spain
| | - Conxi Lázaro
- Hereditary Cancer Program, Joint Program on Hereditary Cancer, Catalan Institute of Oncology, Institut d'Investigació Biomèdica de Bellvitge-IDIBELL, L'Hospitalet de Llobregat, Barcelona, Spain.,Instituto de Salud Carlos III, Centro de Investigación Biomédica en Red Cáncer (CIBERONC), Madrid, Spain
| | - Bernat Gel
- Hereditary Cancer Group, Program for Predictive and Personalized Medicine of Cancer, Germans Trias i Pujol Research Institute (PMPPC-IGTP), Campus, Ruti Badalona Barcelona, Can Spain
| |
Collapse
|
42
|
Liu S, Huckaby AC, Brown AC, Moore CC, Burbulis I, McConnell MJ, Güler JL. Single-cell sequencing of the small and AT-skewed genome of malaria parasites. Genome Med 2021; 13:75. [PMID: 33947449 PMCID: PMC8094492 DOI: 10.1186/s13073-021-00889-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2020] [Accepted: 04/17/2021] [Indexed: 12/23/2022] Open
Abstract
Single-cell genomics is a rapidly advancing field; however, most techniques are designed for mammalian cells. We present a single-cell sequencing pipeline for an intracellular parasite, Plasmodium falciparum, with a small genome of extreme base content. Through optimization of a quasi-linear amplification method, we target the parasite genome over contaminants and generate coverage levels allowing detection of minor genetic variants. This work, as well as efforts that build on these findings, will enable detection of parasite heterogeneity contributing to P. falciparum adaptation. Furthermore, this study provides a framework for optimizing single-cell amplification and variant analysis in challenging genomes.
Collapse
Affiliation(s)
- Shiwei Liu
- Department of Biology, University of Virginia, Charlottesville, VA, USA
| | - Adam C Huckaby
- Department of Biology, University of Virginia, Charlottesville, VA, USA
| | - Audrey C Brown
- Department of Biology, University of Virginia, Charlottesville, VA, USA
| | - Christopher C Moore
- Division of Infectious Diseases and International Health, University of Virginia, Charlottesville, VA, USA
| | - Ian Burbulis
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA, USA
- Escuela de Medicina, Universidad San Sebastian, Puerto Montt, Chile
| | - Michael J McConnell
- Department of Biochemistry and Molecular Genetics, University of Virginia School of Medicine, Charlottesville, VA, USA
- Department of Neuroscience, University of Virginia School of Medicine, Charlottesville, VA, USA
- Current address: Lieber Institute for Brain Development, Baltimore, MD, USA
| | - Jennifer L Güler
- Department of Biology, University of Virginia, Charlottesville, VA, USA.
- Division of Infectious Diseases and International Health, University of Virginia, Charlottesville, VA, USA.
| |
Collapse
|
43
|
Butty AM, Chud TCS, Cardoso DF, Lopes LSF, Miglior F, Schenkel FS, Cánovas A, Häfliger IM, Drögemüller C, Stothard P, Malchiodi F, Baes CF. Genome-wide association study between copy number variants and hoof health traits in Holstein dairy cattle. J Dairy Sci 2021; 104:8050-8061. [PMID: 33896633 DOI: 10.3168/jds.2020-19879] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2020] [Accepted: 01/31/2021] [Indexed: 01/06/2023]
Abstract
Genome-wide association studies based on SNP have been completed for multiple traits in dairy cattle; however, copy number variants (CNV) could add genomic information that has yet to be harnessed. The objectives of this study were to identify CNV in genotyped Holstein animals and assess their association with hoof health traits using deregressed estimated breeding values as pseudophenotypes. A total of 23,256 CNV comprising 1,645 genomic regions were identified in 5,845 animals. Fourteen genomic regions harboring structural variations, including 9 deletions and 5 duplications, were associated with at least 1 of the studied hoof health traits. This group of traits included digital dermatitis, interdigital dermatitis, heel horn erosion, sole ulcer, white line lesion, sole hemorrhage, and interdigital hyperplasia; no regions were associated with toe ulcer. Twenty candidate genes overlapped with the regions associated with these traits including SCART1, NRXN2, KIF26A, GPHN, and OR7A17. In this study, an effect on infectious hoof lesions could be attributed to the PRAME (Preferentially Expressed Antigen in Melanoma) gene. Almost all genes detected in association with noninfectious hoof lesions could be linked to known metabolic disorders. The knowledge obtained considering information of associated CNV to the traits of interest in this study could improve the accuracy of estimated breeding values. This may further increase the genetic gain for these traits in the Canadian Holstein population, thus reducing the involuntary animal losses due to lameness.
Collapse
Affiliation(s)
- Adrien M Butty
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario N1G 2W1, Canada
| | - Tatiane C S Chud
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario N1G 2W1, Canada
| | - Diercles F Cardoso
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario N1G 2W1, Canada
| | - Lucas S F Lopes
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario N1G 2W1, Canada
| | - Filippo Miglior
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario N1G 2W1, Canada
| | - Flavio S Schenkel
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario N1G 2W1, Canada
| | - Angela Cánovas
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario N1G 2W1, Canada
| | - Irene M Häfliger
- Vetsuisse Faculty, Institute of Genetics, University of Bern, Bern 3012, Switzerland
| | - Cord Drögemüller
- Vetsuisse Faculty, Institute of Genetics, University of Bern, Bern 3012, Switzerland
| | - Paul Stothard
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton T6G 2R3, Canada
| | - Francesca Malchiodi
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario N1G 2W1, Canada; The Semex Alliance, Guelph, Ontario N1H 6J2, Canada
| | - Christine F Baes
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, Ontario N1G 2W1, Canada; Vetsuisse Faculty, Institute of Genetics, University of Bern, Bern 3012, Switzerland.
| |
Collapse
|
44
|
Next Generation Sequencing Technology in the Clinic and Its Challenges. Cancers (Basel) 2021; 13:cancers13081751. [PMID: 33916923 PMCID: PMC8067551 DOI: 10.3390/cancers13081751] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Revised: 03/30/2021] [Accepted: 04/05/2021] [Indexed: 12/12/2022] Open
Abstract
Simple Summary Precise identification and annotation of mutations are of utmost importance in clinical oncology. Insights of the DNA sequence can provide meaningful knowledge to unravel the underlying genetics of disease. Hence, tailoring of personalized medicine often relies on specific genomic alteration for treatment efficacy. The aim of this review is to highlight that sequencing harbors much more than just four nucleotides. Moreover, the gradual transition from first to second generation sequencing technologies has led to awareness for choosing the most appropriate bioinformatic analytic tools based on the aim, quality and demand for a specific purpose. Thus, the same raw data can lead to various results reflecting the intrinsic features of different datamining pipelines. Abstract Data analysis has become a crucial aspect in clinical oncology to interpret output from next-generation sequencing-based testing. NGS being able to resolve billions of sequencing reactions in a few days has consequently increased the demand for tools to handle and analyze such large data sets. Many tools have been developed since the advent of NGS, featuring their own peculiarities. Increased awareness when interpreting alterations in the genome is therefore of utmost importance, as the same data using different tools can provide diverse outcomes. Hence, it is crucial to evaluate and validate bioinformatic pipelines in clinical settings. Moreover, personalized medicine implies treatment targeting efficacy of biological drugs for specific genomic alterations. Here, we focused on different sequencing technologies, features underlying the genome complexity, and bioinformatic tools that can impact the final annotation. Additionally, we discuss the clinical demand and design for implementing NGS.
Collapse
|
45
|
Cayuela H, Dorant Y, Mérot C, Laporte M, Normandeau E, Gagnon-Harvey S, Clément M, Sirois P, Bernatchez L. Thermal adaptation rather than demographic history drives genetic structure inferred by copy number variants in a marine fish. Mol Ecol 2021; 30:1624-1641. [PMID: 33565147 DOI: 10.1111/mec.15835] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Revised: 01/15/2021] [Accepted: 02/01/2021] [Indexed: 12/22/2022]
Abstract
Increasing evidence shows that structural variants represent an overlooked aspect of genetic variation with consequential evolutionary roles. Among those, copy number variants (CNVs), including duplicated genomic regions and transposable elements (TEs), may contribute to local adaptation and/or reproductive isolation among divergent populations. Those mechanisms suppose that CNVs could be used to infer neutral and/or adaptive population genetic structure, whose study has been restricted to microsatellites, mitochondrial DNA and Amplified fragment length polymorphism markers in the past and more recently the use of single nucleotide polymorphisms (SNPs). Taking advantage of recent developments allowing CNV analysis from RAD-seq data, we investigated how variation in fitness-related traits, local environmental conditions and demographic history are associated with CNVs, and how subsequent copy number variation drives population genetic structure in a marine fish, the capelin (Mallotus villosus). We collected 1538 DNA samples from 35 sampling sites in the north Atlantic Ocean and identified 6620 putative CNVs. We found associations between CNVs and the gonadosomatic index, suggesting that six duplicated regions could affect female fitness by modulating oocyte production. We also detected 105 CNV candidates associated with water temperature, among which 20% corresponded to genomic regions located within the sequence of protein-coding genes, suggesting local adaptation to cold water by means of gene sequence amplification. We also identified 175 CNVs associated with the divergence of three previously defined parapatric glacial lineages, of which 24% were located within protein-coding genes, making those loci potential candidates for reproductive isolation. Lastly, our analyses unveiled a hierarchical, complex CNV population structure determined by temperature and local geography, which was in stark contrast to that inferred based on SNPs in a previous study. Our findings underline the complementarity of those two types of genomic variation in population genomics studies.
Collapse
Affiliation(s)
- Hugo Cayuela
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada.,Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
| | - Yann Dorant
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
| | - Claire Mérot
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
| | - Martin Laporte
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
| | - Eric Normandeau
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
| | - Stéphane Gagnon-Harvey
- Département des sciences fondamentales, Université du Québec à Chicoutimi, Chicoutimi, QC, Canada
| | - Marie Clément
- Center for Fisheries Ecosystems Research, Fisheries and Marine Institute of Memorial, University of Newfoundland, St. John's, NL, Canada.,Labrador Institute of Memorial University of Newfoundland, Happy Valley-Goose Bay, NL, Canada
| | - Pascal Sirois
- Département des sciences fondamentales, Université du Québec à Chicoutimi, Chicoutimi, QC, Canada
| | - Louis Bernatchez
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
| |
Collapse
|
46
|
Minoche AE, Lundie B, Peters GB, Ohnesorg T, Pinese M, Thomas DM, Zankl A, Roscioli T, Schonrock N, Kummerfeld S, Burnett L, Dinger ME, Cowley MJ. ClinSV: clinical grade structural and copy number variant detection from whole genome sequencing data. Genome Med 2021; 13:32. [PMID: 33632298 PMCID: PMC7908648 DOI: 10.1186/s13073-021-00841-x] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Accepted: 02/02/2021] [Indexed: 01/09/2023] Open
Abstract
Whole genome sequencing (WGS) has the potential to outperform clinical microarrays for the detection of structural variants (SV) including copy number variants (CNVs), but has been challenged by high false positive rates. Here we present ClinSV, a WGS based SV integration, annotation, prioritization, and visualization framework, which identified 99.8% of simulated pathogenic ClinVar CNVs > 10 kb and 11/11 pathogenic variants from matched microarrays. The false positive rate was low (1.5-4.5%) and reproducibility high (95-99%). In clinical practice, ClinSV identified reportable variants in 22 of 485 patients (4.7%) of which 35-63% were not detectable by current clinical microarray designs. ClinSV is available at https://github.com/KCCG/ClinSV .
Collapse
Affiliation(s)
- Andre E Minoche
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, 370 Victoria Street, Darlinghurst, NSW, Australia.
- St Vincent's Clinical School, UNSW, Sydney, NSW, Australia.
| | - Ben Lundie
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, 370 Victoria Street, Darlinghurst, NSW, Australia
| | - Greg B Peters
- Sydney Genome Diagnostics, The Children's Hospital at Westmead, Hawkesbury Road & Hainsworth Street, Westmead, NSW, Australia
| | - Thomas Ohnesorg
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, 370 Victoria Street, Darlinghurst, NSW, Australia
- Genome.One, Darlinghurst, NSW, Australia
| | - Mark Pinese
- Children's Cancer Institute, University of New South Wales, Randwick, Sydney, NSW, Australia
- School of Women's and Children's Health, UNSW, Sydney, NSW, Australia
| | - David M Thomas
- St Vincent's Clinical School, UNSW, Sydney, NSW, Australia
- The Kinghorn Cancer Centre and Cancer Division, Garvan Institute of Medical Research, 370 Victoria Street, Darlinghurst, NSW, Australia
| | - Andreas Zankl
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, 370 Victoria Street, Darlinghurst, NSW, Australia
- Department of Clinical Genetics, The Children's Hospital at Westmead, Hawkesbury Road, Westmead, NSW, Australia
- Sydney Medical School, The University of Sydney, Camperdown, NSW, Australia
| | - Tony Roscioli
- NSW Health Pathology Randwick, Sydney, NSW, Australia
- Centre for Clinical Genetics, Sydney Children's Hospital, Randwick, NSW, Australia
- Prince of Wales Clinical School, University of New South Wales, Sydney, NSW, Australia
- Neuroscience Research Australia, University of New South Wales, Randwick, Sydney, NSW, Australia
| | - Nicole Schonrock
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, 370 Victoria Street, Darlinghurst, NSW, Australia
- Genome.One, Darlinghurst, NSW, Australia
| | - Sarah Kummerfeld
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, 370 Victoria Street, Darlinghurst, NSW, Australia
- St Vincent's Clinical School, UNSW, Sydney, NSW, Australia
| | - Leslie Burnett
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, 370 Victoria Street, Darlinghurst, NSW, Australia
- St Vincent's Clinical School, UNSW, Sydney, NSW, Australia
- Genome.One, Darlinghurst, NSW, Australia
- Sydney Medical School, The University of Sydney, Camperdown, NSW, Australia
| | - Marcel E Dinger
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, 370 Victoria Street, Darlinghurst, NSW, Australia
- School of Biotechnology and Biomolecular Sciences, UNSW, Sydney, NSW, Australia
| | - Mark J Cowley
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, 370 Victoria Street, Darlinghurst, NSW, Australia.
- St Vincent's Clinical School, UNSW, Sydney, NSW, Australia.
- Children's Cancer Institute, University of New South Wales, Randwick, Sydney, NSW, Australia.
- School of Women's and Children's Health, UNSW, Sydney, NSW, Australia.
| |
Collapse
|
47
|
Rao J, Peng L, Liang X, Jiang H, Geng C, Zhao X, Liu X, Fan G, Chen F, Mu F. Performance of copy number variants detection based on whole-genome sequencing by DNBSEQ platforms. BMC Bioinformatics 2020; 21:518. [PMID: 33176676 PMCID: PMC7659224 DOI: 10.1186/s12859-020-03859-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2020] [Accepted: 11/03/2020] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND DNBSEQ™ platforms are new massively parallel sequencing (MPS) platforms that use DNA nanoball technology. Use of data generated from DNBSEQ™ platforms to detect single nucleotide variants (SNVs) and small insertions and deletions (indels) has proven to be quite effective, while the feasibility of copy number variants (CNVs) detection is unclear. RESULTS Here, we first benchmarked different CNV detection tools based on Illumina whole-genome sequencing (WGS) data of NA12878 and then assessed these tools in CNV detection based on DNBSEQ™ sequencing data from the same sample. When the same tool was used, the CNVs detected based on DNBSEQ™ and Illumina data were similar in quantity, length and distribution, while great differences existed within results from different tools and even based on data from a single platform. We further estimated the CNV detection power based on available CNV benchmarks of NA12878 and found similar precision and sensitivity between the DNBSEQ™ and Illumina platforms. We also found higher precision of CNVs shorter than 1 kbp based on DNBSEQ™ platforms than those based on Illumina platforms by using Pindel, DELLY and LUMPY. We carefully compared these two available benchmarks and found a large proportion of specific CNVs between them. Thus, we constructed a more complete CNV benchmark of NA12878 containing 3512 CNV regions. CONCLUSIONS We assessed and benchmarked CNV detections based on WGS with DNBSEQ™ platforms and provide guidelines for future studies.
Collapse
Affiliation(s)
- Junhua Rao
- MGI, BGI-Shenzhen, Shenzhen, 518083, China
| | | | | | - Hui Jiang
- MGI, BGI-Shenzhen, Shenzhen, 518083, China
| | | | - Xia Zhao
- MGI, BGI-Shenzhen, Shenzhen, 518083, China
| | - Xin Liu
- BGI-Shenzhen, Shenzhen, 518083, China.,BGI-Qingdao, BGI-Shenzhen, Qingdao, 266555, Shandong, China.,IGDB-BGI Joint Center for Omics, BGI-Shenzhen, Shenzhen, 518083, China.,State Key Laboratory of Agricultural Genomics, BGI-Shenzhen, Shenzhen, 518083, China
| | - Guangyi Fan
- BGI-Qingdao, BGI-Shenzhen, Qingdao, 266555, Shandong, China.,State Key Laboratory of Agricultural Genomics, BGI-Shenzhen, Shenzhen, 518083, China
| | - Fang Chen
- MGI, BGI-Shenzhen, Shenzhen, 518083, China. .,BGI-Shenzhen, Shenzhen, 518083, China. .,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China.
| | - Feng Mu
- MGI, BGI-Shenzhen, Shenzhen, 518083, China. .,MGI-Wuhan, BGI-Shenzhen, Wuhan, 430074, China.
| |
Collapse
|
48
|
Zhuang X, Ye R, So MT, Lam WY, Karim A, Yu M, Ngo ND, Cherny SS, Tam PKH, Garcia-Barcelo MM, Tang CSM, Sham PC. A random forest-based framework for genotyping and accuracy assessment of copy number variations. NAR Genom Bioinform 2020; 2:lqaa071. [PMID: 33575619 PMCID: PMC7671382 DOI: 10.1093/nargab/lqaa071] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2020] [Revised: 08/18/2020] [Accepted: 08/26/2020] [Indexed: 12/24/2022] Open
Abstract
Detection of copy number variations (CNVs) is essential for uncovering genetic factors underlying human diseases. However, CNV detection by current methods is prone to error, and precisely identifying CNVs from paired-end whole genome sequencing (WGS) data is still challenging. Here, we present a framework, CNV-JACG, for Judging the Accuracy of CNVs and Genotyping using paired-end WGS data. CNV-JACG is based on a random forest model trained on 21 distinctive features characterizing the CNV region and its breakpoints. Using the data from the 1000 Genomes Project, Genome in a Bottle Consortium, the Human Genome Structural Variation Consortium and in-house technical replicates, we show that CNV-JACG has superior sensitivity over the latest genotyping method, SV2, particularly for the small CNVs (≤1 kb). We also demonstrate that CNV-JACG outperforms SV2 in terms of Mendelian inconsistency in trios and concordance between technical replicates. Our study suggests that CNV-JACG would be a useful tool in assessing the accuracy of CNVs to meet the ever-growing needs for uncovering the missing heritability linked to CNVs.
Collapse
Affiliation(s)
- Xuehan Zhuang
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Rui Ye
- Department of Psychiatry, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Man-Ting So
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Wai-Yee Lam
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Anwarul Karim
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Michelle Yu
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Ngoc Diem Ngo
- National Hospital of Pediatrics, Ha Noi 100000, Vietnam
| | - Stacey S Cherny
- Department of Psychiatry, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Paul Kwong-Hang Tam
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | | | - Clara Sze-Man Tang
- Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Pak Chung Sham
- Department of Psychiatry, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| |
Collapse
|
49
|
Yang L. A Practical Guide for Structural Variation Detection in the Human Genome. CURRENT PROTOCOLS IN HUMAN GENETICS 2020; 107:e103. [PMID: 32813322 PMCID: PMC7738216 DOI: 10.1002/cphg.103] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Profiling genetic variants-including single nucleotide variants, small insertions and deletions, copy number variations, and structural variations (SVs)-from both healthy individuals and individuals with disease is a key component of genetic and biomedical research. SVs are large-scale changes in the genome and involve breakage and rejoining of DNA fragments. They may affect thousands to millions of nucleotides and can lead to loss, gain, and reshuffling of genes and regulatory elements. SVs are known to impact gene expression and potentially result in altered phenotypes and diseases. Therefore, identifying SVs from the human genomes is particularly important. In this review, I describe advantages and disadvantages of the available high-throughput assays for the discovery of SVs, which are the most challenging genetic alterations to detect. A practical guide is offered to suggest the most suitable strategies for discovering different types of SVs including common germline, rare, somatic, and complex variants. I also discuss factors to be considered, such as cost and performance, for different strategies when designing experiments. Last, I present several approaches to identify potential SV artifacts caused by samples, experimental procedures, and computational analysis. © 2020 Wiley Periodicals LLC.
Collapse
Affiliation(s)
- Lixing Yang
- Ben May Department for Cancer Research, Department of Human Genetics, University of Chicago, Chicago, Illinois
| |
Collapse
|
50
|
Bai H, He Y, Ding Y, Chu Q, Lian L, Heifetz EM, Yang N, Cheng HH, Zhang H, Chen J, Song J. Genome-wide characterization of copy number variations in the host genome in genetic resistance to Marek's disease using next generation sequencing. BMC Genet 2020; 21:77. [PMID: 32677890 PMCID: PMC7364486 DOI: 10.1186/s12863-020-00884-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Accepted: 07/05/2020] [Indexed: 11/13/2022] Open
Abstract
Background Marek’s disease (MD) is a highly neoplastic disease primarily affecting chickens, and remains as a chronic infectious disease that threatens the poultry industry. Copy number variation (CNV) has been examined in many species and is recognized as a major source of genetic variation that directly contributes to phenotypic variation such as resistance to infectious diseases. Two highly inbred chicken lines, 63 (MD-resistant) and 72 (MD-susceptible), as well as their F1 generation and six recombinant congenic strains (RCSs) with varied susceptibility to MD, are considered as ideal models to identify the complex mechanisms of genetic and molecular resistance to MD. Results In the present study, to unravel the potential genetic mechanisms underlying resistance to MD, we performed a genome-wide CNV detection using next generation sequencing on the inbred chicken lines with the assistance of CNVnator. As a result, a total of 1649 CNV regions (CNVRs) were successfully identified after merging all the nine datasets, of which 90 CNVRs were overlapped across all the chicken lines. Within these shared regions, 1360 harbored genes were identified. In addition, 55 and 44 CNVRs with 62 and 57 harbored genes were specifically identified in line 63 and 72, respectively. Bioinformatics analysis showed that the nearby genes were significantly enriched in 36 GO terms and 6 KEGG pathways including JAK/STAT signaling pathway. Ten CNVRs (nine deletions and one duplication) involved in 10 disease-related genes were selected for validation by using quantitative real-time PCR (qPCR), all of which were successfully confirmed. Finally, qPCR was also used to validate two deletion events in line 72 that were definitely normal in line 63. One high-confidence gene, IRF2 was identified as the most promising candidate gene underlying resistance and susceptibility to MD in view of its function and overlaps with data from previous study. Conclusions Our findings provide valuable insights for understanding the genetic mechanism of resistance to MD and the identified gene and pathway could be considered as the subject of further functional characterization.
Collapse
Affiliation(s)
- Hao Bai
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, the Ministry of Education of China, Institutes of Agricultural Science and Technology Development, Yangzhou University, Yangzhou, 225009, China.,Department of Animal & Avian Sciences, University of Maryland, College Park, MD, 20742, USA.,Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Yanghua He
- Department of Animal & Avian Sciences, University of Maryland, College Park, MD, 20742, USA.,Department of Human Nutrition, Food and Animal Sciences, University of Hawaii at Manoa, Honolulu, HI, 96822, USA
| | - Yi Ding
- Department of Animal & Avian Sciences, University of Maryland, College Park, MD, 20742, USA
| | - Qin Chu
- Department of Animal & Avian Sciences, University of Maryland, College Park, MD, 20742, USA.,Institute of Animal Husbandry and Veterinary Medicine, Beijing Academy of Agriculture and Forestry Sciences, Beijing, 100097, China
| | - Ling Lian
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Eliyahu M Heifetz
- Faculty of Health Sciences, Jerusalem College of Technology, 9116001, Jerusalem, Israel
| | - Ning Yang
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Hans H Cheng
- USDA, Agricultural Research Service, Avian Disease and Oncology Laboratory, East Lansing, MI, 48823, USA
| | - Huanmin Zhang
- USDA, Agricultural Research Service, Avian Disease and Oncology Laboratory, East Lansing, MI, 48823, USA
| | - Jilan Chen
- Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Jiuzhou Song
- Department of Animal & Avian Sciences, University of Maryland, College Park, MD, 20742, USA.
| |
Collapse
|