Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Lai WR, Johnson MD, Kucherlapati R, Park PJ. Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data. Bioinformatics 2005;21:3763-70. [PMID: 16081473 PMCID: PMC2819184 DOI: 10.1093/bioinformatics/bti611] [Citation(s) in RCA: 297] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

For:	Lai WR, Johnson MD, Kucherlapati R, Park PJ. Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data. Bioinformatics 2005;21:3763-70. [PMID: 16081473 PMCID: PMC2819184 DOI: 10.1093/bioinformatics/bti611] [Citation(s) in RCA: 297] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Number

Cited by Other Article(s)

Liu X, Duan J, Gong D. MSigSeg: An R package for multiple signals segmentation. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2025;265:108744. [PMID: 40199111 DOI: 10.1016/j.cmpb.2025.108744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2024] [Revised: 03/07/2025] [Accepted: 03/26/2025] [Indexed: 04/10/2025]

Li H, Li S, Zhao Z, Kong L, Fu X, Zhu J, Feng J, Tang W, Wu D, Kong X. Noninvasive prenatal diagnosis (NIPD) of non-syndromic hearing loss (NSHL) for singleton and twin pregnancies in the first trimester. Orphanet J Rare Dis 2025;20:40. [PMID: 39871362 PMCID: PMC11773923 DOI: 10.1186/s13023-025-03558-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Accepted: 01/17/2025] [Indexed: 01/29/2025] Open

Abstract

BACKGROUND

Noninvasive prenatal diagnosis (NIPD) has been proven feasible for non-syndromic hearing loss (NSHL) in singleton pregnancies. However, previous research is limited to the second trimester and the application in twin pregnancies is blank. Here we provide a novel algorithmic approach to assess singleton and twin pregnancies in the first trimester.

METHODS

A 324.614 kb capture panel was designed to selectively enrich target regions. Parental haplotypes were constructed by target sequencing of blood samples from the parents and the proband. Then single nucleotide polymorphisms (SNP) within target regions were classified into four and six categories in singleton and twin pregnancy, respectively. Combining relative haplotype dosage change (RHDO) and the Bayes factor (BF), fetal fraction (FF) and fetal genotype were deduced in singleton and twin pregnancies. The pregnant women's NIPD results were validated by invasive prenatal diagnosis and Sanger sequencing.

RESULTS

Sixteen women with singleton pregnancies and one woman with a twin pregnancy were recruited. Among the 16 singleton pregnancies, NIPD was successfully applied in 15 families and the coincidence rate with invasive prenatal diagnosis was 100% (15/15). Only one family NIPD result is "no call" because the imbalance distribution of SNP sites makes it difficult to estimate recombination events. Most (13/15) of pregnant women were diagnosed in the first trimester and the earliest gestation week was the 7th week. The twin pregnancy was a dichorionic diamniotic twin (DCDA). NIPD confirmed one fetus is affected, and another is a carrier with c.299_300delAT of GJB2 gene.

CONCLUSION

This study represents the pioneering evidence in the field, demonstrating the feasibility of NIPD for NSHL in twin pregnancies. Moreover, it provides a novel and advanced diagnostic approach for families at high risk of NSHL during pregnancy, offering earlier detection, enhanced safety, and improved accuracy.

Collapse

Wang J, Zhu QW, Cui AM, Lin MS, Lou HQ. Application of Genetic Origin Analysis of Copy Number Variations in Non-Invasive Prenatal Testing. Prenat Diagn 2025;45:44-56. [PMID: 39425690 DOI: 10.1002/pd.6688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Revised: 09/24/2024] [Accepted: 10/03/2024] [Indexed: 10/21/2024]

Abstract

OBJECTIVE

This study aimed to assess the application of origin analysis of copy number variations (CNVs) in non-invasive prenatal testing (NIPT) and provide a basis for expanding the clinical application of NIPT.

METHOD

We enrolled 35,317 patients who underwent NIPT between January 2019 and March 2023. Genome sequencing of copy number variation (CNV-Seq) analysis was performed using the CNV calling pipeline to identify subchromosomal abnormalities in maternal plasma. Genetic origin was determined by comparing the chimaerism ratio of CNV and the concentration of cell-free foetal DNA (cffDNA). All pregnant women with a high risk of CNV, as indicated by the NIPT, were informed of their genetic origins. Amniocentesis was recommended for detecting the CNVs in foetal chromosomes, and pregnancy outcomes were tracked.

RESULTS

A total of 109 pregnancies showed clinically significant positive results for CNV after NIPT, including 65 cases of maternal/foetal (M/F)-CNVs and 44 cases of F-CNVs. The occurrence of M/F-CNVs was independent of age, screening (serological or ultrasound) indications for abnormalities, and mode of pregnancy. The incidence of pathogenic/likely pathogenic (P/LP)-F-CNVs was high in cases where serological screening indicated intermediate, high-risk, or abnormal US findings (p < 0.05). In the M/F-CNV group, most of the P/LP-CNVs were small fragments with low penetrance; 55 (84.62%) were less than 5 Mb in size, and nine (13.85%) were between 5 and 10 Mb. In the F-CNV group, foetal P/LP-CNV was detected in 36 of 42 cases undergoing prenatal diagnosis, and no significant bias was noted in the size distribution of P/LP-F-CNV fragments. The prenatal diagnostic rate and positive predictive value in the F-CNV group were 95.45% and 85.71%, respectively, which were significantly different from those in the M/F group (26.15% and 52.95%), respectively (p < 0.05).

CONCLUSIONS

Genetic origin analysis of CNV can effectively improve adherence to prenatal diagnosis in pregnant women and the accuracy of prenatal diagnosis.

Collapse

Peripolli E, Stafuzza NB, Machado MA, do Carmo Panetto JC, do Egito AA, Baldi F, da Silva MVGB. Assessment of copy number variants in three Brazilian locally adapted cattle breeds using whole-genome re-sequencing data. Anim Genet 2023;54:254-270. [PMID: 36740987 DOI: 10.1111/age.13298] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 12/13/2021] [Accepted: 01/13/2023] [Indexed: 02/07/2023]

WAVECNV: A New Approach for Detecting Copy Number Variation by Wavelet Clustering. MATHEMATICS 2022. [DOI: 10.3390/math10122151] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Banerjee S. Horseshoe shrinkage methods for Bayesian fusion estimation. Comput Stat Data Anal 2022. [DOI: 10.1016/j.csda.2022.107450] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]

Jia S, Shi L. Efficient change-points detection for genomic sequences via cumulative segmented regression. Bioinformatics 2022;38:311-317. [PMID: 34601562 DOI: 10.1093/bioinformatics/btab685] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Revised: 07/08/2021] [Accepted: 09/28/2021] [Indexed: 02/03/2023] Open

Chan NH, Ng WL, Yau CY, Yu H. Optimal change-point estimation in time series. Ann Stat 2021. [DOI: 10.1214/20-aos2039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Wei YC, Huang GH. CONY: A Bayesian procedure for detecting copy number variations from sequencing read depths. Sci Rep 2020;10:10493. [PMID: 32591545 PMCID: PMC7319969 DOI: 10.1038/s41598-020-64353-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Accepted: 04/15/2020] [Indexed: 12/26/2022] Open

Fang X, Li J, Siegmund D. Segmentation and estimation of change-point models: False positive control and confidence regions. Ann Stat 2020. [DOI: 10.1214/19-aos1861] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Alshawaqfeh M, Al Kawam A, Serpedin E, Datta A. Robust Recurrent CNV Detection in the Presence of Inter-Subject Variability. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020;17:1056-1067. [PMID: 30387737 DOI: 10.1109/tcbb.2018.2878560] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

Wang S, Lee S, Chu C, Jain D, Kerpedjiev P, Nelson GM, Walsh JM, Alver BH, Park PJ. HiNT: a computational method for detecting copy number variations and translocations from Hi-C data. Genome Biol 2020;21:73. [PMID: 32293513 PMCID: PMC7087379 DOI: 10.1186/s13059-020-01986-5] [Citation(s) in RCA: 55] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2019] [Accepted: 03/05/2020] [Indexed: 12/25/2022] Open

Cheng D, He Z, Schwartzman A. Multiple testing of local extrema for detection of change points. Electron J Stat 2020. [DOI: 10.1214/20-ejs1751] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Wang X, Lebarbier E, Aubert J, Robin S. Variational Inference for Coupled Hidden Markov Models Applied to the Joint Detection of Copy Number Variations. Int J Biostat 2019;15:/j/ijb.ahead-of-print/ijb-2018-0023/ijb-2018-0023.xml. [PMID: 30779702 DOI: 10.1515/ijb-2018-0023] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2018] [Accepted: 11/21/2018] [Indexed: 02/04/2023]

Li H, Guo Q, Munk A. Multiscale change-point segmentation: beyond step functions. Electron J Stat 2019. [DOI: 10.1214/19-ejs1608] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Collilieux X, Lebarbier E, Robin S. A factor model approach for the joint segmentation with between‐series correlation. Scand Stat Theory Appl 2018. [DOI: 10.1111/sjos.12368] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]

Nguyen N, Vo A, Sun H, Huang H. Heavy-Tailed Noise Suppression and Derivative Wavelet Scalogram for Detecting DNA Copy Number Aberrations. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018;15:1625-1635. [PMID: 28692986 DOI: 10.1109/tcbb.2017.2723884] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Liu J, Zhou Y, Liu S, Song X, Yang XZ, Fan Y, Chen W, Akdemir ZC, Yan Z, Zuo Y, Du R, Liu Z, Yuan B, Zhao S, Liu G, Chen Y, Zhao Y, Lin M, Zhu Q, Niu Y, Liu P, Ikegawa S, Song YQ, Posey JE, Qiu G, Zhang F, Wu Z, Lupski JR, Wu N. The coexistence of copy number variations (CNVs) and single nucleotide polymorphisms (SNPs) at a locus can result in distorted calculations of the significance in associating SNPs to disease. Hum Genet 2018;137:553-567. [PMID: 30019117 DOI: 10.1007/s00439-018-1910-3] [Citation(s) in RCA: 51] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2018] [Accepted: 07/07/2018] [Indexed: 01/25/2023]

Abstract

With the recent advance in genome-wide association studies (GWAS), disease-associated single nucleotide polymorphisms (SNPs) and copy number variants (CNVs) have been extensively reported. Accordingly, the issue of incorrect identification of recombination events that can induce the distortion of multi-allelic or hemizygous variants has received more attention. However, the potential distorted calculation bias or significance of a detected association in a GWAS due to the coexistence of CNVs and SNPs in the same genomic region may remain under-recognized. Here we performed the association study within a congenital scoliosis (CS) cohort whose genetic etiology was recently elucidated as a compound inheritance model, including mostly one rare variant deletion CNV null allele and one common variant non-coding hypomorphic haplotype of the TBX6 gene. We demonstrated that the existence of a deletion in TBX6 led to an overestimation of the contribution of the SNPs on the hypomorphic allele. Furthermore, we generalized a model to explain the calculation bias, or distorted significance calculation for an association study, that can be 'induced' by CNVs at a locus. Meanwhile, overlapping between the disease-associated SNPs from published GWAS and common CNVs (overlap 10%) and pathogenic/likely pathogenic CNVs (overlap 99.69%) was significantly higher than the random distribution (p < 1 × 10^-6 and p = 0.034, respectively), indicating that such co-existence of CNV and SNV alleles might generally influence data interpretation and potential outcomes of a GWAS. We also verified and assessed the influence of colocalizing CNVs to the detection sensitivity of disease-associated SNP variant alleles in another adolescent idiopathic scoliosis (AIS) genome-wide association study. We proposed that detecting co-existent CNVs when evaluating the association signals between SNPs and disease traits could improve genetic model analyses and better integrate GWAS with robust Mendelian principles.

Collapse

Affiliation(s)

Jiaqi Liu Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Department of Breast Surgical Oncology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100021, China
Yangzhong Zhou Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Department of Internal Medicine, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, 100730, China
Sen Liu Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
Xiaofei Song Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
Xin-Zhuang Yang Department of Central Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, 100730, China
Yanhui Fan School of Biomedical Sciences, The University of Hong Kong, Hong Kong, China
Weisheng Chen Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
Zeynep Coban Akdemir Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
Zihui Yan Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
Yuzhi Zuo Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
Renqian Du Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
Zhenlei Liu Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Department of Neurosurgery, Xuanwu Hospital, Capital Medical University, Beijing, 100053, China
Bo Yuan Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
Sen Zhao Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
Gang Liu Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
Yixin Chen Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
Yanxue Zhao Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
Mao Lin Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
Qiankun Zhu Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China
Yuchen Niu Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China.,Department of Central Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, 100730, China
Pengfei Liu Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
Shiro Ikegawa Laboratory of Bone and Joint Diseases, Center for Integrative Medical Sciences, RIKEN, Tokyo, 108-8639, Japan
You-Qiang Song School of Biomedical Sciences, The University of Hong Kong, Hong Kong, China
Jennifer E Posey Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
Guixing Qiu Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China.,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China

Feng Zhang Obstetrics and Gynecology Hospital, Institute of Reproduction and Development, Fudan University, Shanghai, 200433, China.,Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, 200433, China
Zhihong Wu Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China.,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China.,Department of Central Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, 100730, China
James R Lupski Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Pediatrics, Baylor College of Medicine, Houston, TX, 77030, USA.,Texas Children's Hospital, Houston, TX, 77030, USA
Nan Wu Department of Orthopedic Surgery, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, No.1 Shuaifuyuan, Beijing, 100730, China. .,Beijing Key Laboratory for Genetic Research of Skeletal Deformity, Beijing, 100730, China. .,Medical Research Center of Orthopedics, Chinese Academy of Medical Sciences, Beijing, 100730, China.

Collapse

Montoril MH, Pinheiro A, Vidakovic B. Wavelet‐based estimators for mixture regression. Scand Stat Theory Appl 2018. [DOI: 10.1111/sjos.12344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Girimurugan SB, Liu Y, Lung PY, Vera DL, Dennis JH, Bass HW, Zhang J. iSeg: an efficient algorithm for segmentation of genomic and epigenomic data. BMC Bioinformatics 2018;19:131. [PMID: 29642840 PMCID: PMC5896135 DOI: 10.1186/s12859-018-2140-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2017] [Accepted: 03/26/2018] [Indexed: 11/16/2022] Open

Abstract

Background

Identification of functional elements of a genome often requires dividing a sequence of measurements along a genome into segments where adjacent segments have different properties, such as different mean values. Despite dozens of algorithms developed to address this problem in genomics research, methods with improved accuracy and speed are still needed to effectively tackle both existing and emerging genomic and epigenomic segmentation problems.

Results

We designed an efficient algorithm, called iSeg, for segmentation of genomic and epigenomic profiles. iSeg first utilizes dynamic programming to identify candidate segments and test for significance. It then uses a novel data structure based on two coupled balanced binary trees to detect overlapping significant segments and update them simultaneously during searching and refinement stages. Refinement and merging of significant segments are performed at the end to generate the final set of segments. By using an objective function based on the p-values of the segments, the algorithm can serve as a general computational framework to be combined with different assumptions on the distributions of the data. As a general segmentation method, it can segment different types of genomic and epigenomic data, such as DNA copy number variation, nucleosome occupancy, nuclease sensitivity, and differential nuclease sensitivity data. Using simple t-tests to compute p-values across multiple datasets of different types, we evaluate iSeg using both simulated and experimental datasets and show that it performs satisfactorily when compared with some other popular methods, which often employ more sophisticated statistical models. Implemented in C++, iSeg is also very computationally efficient, well suited for large numbers of input profiles and data with very long sequences.

Conclusions

We have developed an efficient general-purpose segmentation tool and showed that it had comparable or more accurate results than many of the most popular segment-calling algorithms used in contemporary genomic data analysis. iSeg is capable of analyzing datasets that have both positive and negative values. Tunable parameters allow users to readily adjust the statistical stringency to best match the biological nature of individual datasets, including widely or sparsely mapped genomic datasets or those with non-normal distributions.

Electronic supplementary material

The online version of this article (10.1186/s12859-018-2140-3) contains supplementary material, which is available to authorized users.

Collapse

Garreau D, Arlot S. Consistent change-point detection with kernels. Electron J Stat 2018. [DOI: 10.1214/18-ejs1513] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Antunes de Lemos MV, Berton MP, Ferreira de Camargo GM, Peripolli E, de Oliveira Silva RM, Ferreira Olivieri B, Cesar AS, Pereira ASC, de Albuquerque LG, de Oliveira HN, Tonhati H, Baldi F. Copy number variation regions in Nellore cattle: Evidences of environment adaptation. Livest Sci 2018. [DOI: 10.1016/j.livsci.2017.11.008] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Fan Z, Mackey L. Empirical Bayesian analysis of simultaneous changepoints in multiple data sequences. Ann Appl Stat 2017. [DOI: 10.1214/17-aoas1075] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Van De Wiel MA, Van Wieringen WN. CGHregions: Dimension Reduction for Array CGH Data with Minimal Information Loss. Cancer Inform 2017. [DOI: 10.1177/117693510700300031] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

Chen H, Jiang Y, Maxwell KN, Nathanson KL, Zhang N. ALLELE-SPECIFIC COPY NUMBER ESTIMATION BY WHOLE EXOME SEQUENCING. Ann Appl Stat 2017;11:1169-1192. [PMID: 28989557 DOI: 10.1214/17-aoas1043] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Delatola EI, Lebarbier E, Mary-Huard T, Radvanyi F, Robin S, Wong J. SegCorr a statistical procedure for the detection of genomic regions of correlated expression. BMC Bioinformatics 2017;18:333. [PMID: 28697800 PMCID: PMC5504623 DOI: 10.1186/s12859-017-1742-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2016] [Accepted: 06/26/2017] [Indexed: 01/27/2023] Open

SLMSuite: a suite of algorithms for segmenting genomic profiles. BMC Bioinformatics 2017;18:321. [PMID: 28659129 PMCID: PMC5490196 DOI: 10.1186/s12859-017-1734-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2016] [Accepted: 06/20/2017] [Indexed: 11/10/2022] Open

Chakar S, Lebarbier E, Lévy-Leduc C, Robin S. A robust approach for estimating change-points in the mean of an $\operatorname{AR}(1)$ process. BERNOULLI 2017. [DOI: 10.3150/15-bej782] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Gao Y, Jiang J, Yang S, Hou Y, Liu GE, Zhang S, Zhang Q, Sun D. CNV discovery for milk composition traits in dairy cattle using whole genome resequencing. BMC Genomics 2017;18:265. [PMID: 28356085 PMCID: PMC5371188 DOI: 10.1186/s12864-017-3636-3] [Citation(s) in RCA: 65] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2016] [Accepted: 03/17/2017] [Indexed: 01/08/2023] Open

Abstract

Background

Copy number variations (CNVs) are important and widely distributed in the genome. CNV detection opens a new avenue for exploring genes associated with complex traits in humans, animals and plants. Herein, we present a genome-wide assessment of CNVs that are potentially associated with milk composition traits in dairy cattle.

Results

In this study, CNVs were detected based on whole genome re-sequencing data of eight Holstein bulls from four half- and/or full-sib families, with extremely high and low estimated breeding values (EBVs) of milk protein percentage and fat percentage. The range of coverage depth per individual was 8.2–11.9×. Using CNVnator, we identified a total of 14,821 CNVs, including 5025 duplications and 9796 deletions. Among them, 487 differential CNV regions (CNVRs) comprising ~8.23 Mb of the cattle genome were observed between the high and low groups. Annotation of these differential CNVRs were performed based on the cattle genome reference assembly (UMD3.1) and totally 235 functional genes were found within the CNVRs. By Gene Ontology and KEGG pathway analyses, we found that genes were significantly enriched for specific biological functions related to protein and lipid metabolism, insulin/IGF pathway-protein kinase B signaling cascade, prolactin signaling pathway and AMPK signaling pathways. These genes included INS, IGF2, FOXO3, TH, SCD5, GALNT18, GALNT16, ART3, SNCA and WNT7A, implying their potential association with milk protein and fat traits. In addition, 95 CNVRs were overlapped with 75 known QTLs that are associated with milk protein and fat traits of dairy cattle (Cattle QTLdb).

Conclusions

In conclusion, based on NGS of 8 Holstein bulls with extremely high and low EBVs for milk PP and FP, we identified a total of 14,821 CNVs, 487 differential CNVRs between groups, and 10 genes, which were suggested as promising candidate genes for milk protein and fat traits.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-017-3636-3) contains supplementary material, which is available to authorized users.

Collapse

Lim HK, Lee J, Cheon S. Stochastic approximation Monte Carlo EM for change-point analysis. J STAT COMPUT SIM 2017. [DOI: 10.1080/00949655.2016.1192630] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Cleynen A, Lebarbier E. Model selection for the segmentation of multiparameter exponential family distributions. Electron J Stat 2017. [DOI: 10.1214/17-ejs1246] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Bertin K, Collilieux X, Lebarbier E, Meza C. Semi-parametric segmentation of multiple series using a DP-Lasso strategy. J STAT COMPUT SIM 2016. [DOI: 10.1080/00949655.2016.1260726] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]

Kaveh F, Baumbusch LO, Nebdal D, Børresen-Dale AL, Lingjærde OC, Edvardsen H, Kristensen VN, Solvang HK. A systematic comparison of copy number alterations in four types of female cancer. BMC Cancer 2016;16:913. [PMID: 27876019 PMCID: PMC5120489 DOI: 10.1186/s12885-016-2899-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2016] [Accepted: 10/30/2016] [Indexed: 01/06/2023] Open

Abstract

Background

Detection and localization of genomic alterations and breakpoints are crucial in cancer research. The purpose of this study was to investigate, in a methodological and biological perspective, different female, hormone-dependent cancers to identify common and diverse DNA aberrations, genes, and pathways.

Methods

In this work, we analyzed tissue samples from patients with breast (n = 112), ovarian (n = 74), endometrial (n = 84), or cervical (n = 76) cancer. To identify genomic aberrations, the Circular Binary Segmentation (CBS) and Piecewise Constant Fitting (PCF) algorithms were used and segmentation thresholds optimized. The Genomic Identification of Significant Targets in Cancer (GISTIC) algorithm was applied to the segmented data to identify significantly altered regions and the associated genes were analyzed by Ingenuity Pathway Analysis (IPA) to detect over-represented pathways and functions within the identified gene sets.

Results and Discussion

Analyses of high-resolution copy number alterations in four different female cancer types are presented. For appropriately adjusted segmentation parameters the two segmentation algorithms CBS and PCF performed similarly. We identified one region at 8q24.3 with focal aberrations that was altered at significant frequency across all four cancer types. Considering both, broad regions and focal peaks, three additional regions with gains at significant frequency were revealed at 1p21.1, 8p22, and 13q21.33, respectively. Several of these events involve known cancer-related genes, like PPP2R2A, PSCA, PTP4A3, and PTK2. In the female reproductive system (ovarian, endometrial, and cervix [OEC]), we discovered three common events: copy number gains at 5p15.33 and 15q11.2, further a copy number loss at 8p21.2. Interestingly, as many as 75% of the aberrations (75% amplifications and 86% deletions) identified by GISTIC were specific for just one cancer type and represented distinct molecular pathways.

Conclusions

Our results disclose that some prominent copy number changes are shared in the four examined female, hormone-dependent cancer whereas others are definitive to specific cancer types.

Electronic supplementary material

The online version of this article (doi:10.1186/s12885-016-2899-4) contains supplementary material, which is available to authorized users.

Collapse

CNARA: reliability assessment for genomic copy number profiles. BMC Genomics 2016;17:799. [PMID: 27733115 PMCID: PMC5062840 DOI: 10.1186/s12864-016-3074-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2016] [Accepted: 09/07/2016] [Indexed: 01/22/2023] Open

Abstract

Background

DNA copy number profiles from microarray and sequencing experiments sometimes contain wave artefacts which may be introduced during sample preparation and cannot be removed completely by existing preprocessing methods. Besides, large derivative log ratio spread (DLRS) of the probes correlating with poor DNA quality is sometimes observed in genome screening experiments and may lead to unreliable copy number profiles. Depending on the extent of these artefacts and the resulting misidentification of copy number alterations/variations (CNA/CNV), it may be desirable to exclude such samples from analyses or to adapt the downstream data analysis strategy accordingly.

Results

Here, we propose a method to distinguish reliable genomic copy number profiles from those containing heavy wave artefacts and/or large DLRS. We define four features that adequately summarize the copy number profiles for reliability assessment, and train a classifier on a dataset of 1522 copy number profiles from various microarray platforms. The method can be applied to predict the reliability of copy number profiles irrespective of the underlying microarray platform and may be adapted for those sequencing platforms from which copy number estimates could be computed as a piecewise constant signal. Further details can be found at https://github.com/baudisgroup/CNARA.

Conclusions

We have developed a method for the assessment of genomic copy number profiling data, and suggest to apply the method in addition to and after other state-of-the-art noise correction and quality control procedures. CNARA could be instrumental in improving the assessment of data used for genomic data mining experiments and support the reliable functional attribution of copy number aberrations especially in cancer research.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-016-3074-7) contains supplementary material, which is available to authorized users.

Collapse

Fast Bayesian Inference of Copy Number Variants using Hidden Markov Models with Wavelet Compression. PLoS Comput Biol 2016;12:e1004871. [PMID: 27177143 PMCID: PMC4866742 DOI: 10.1371/journal.pcbi.1004871] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2015] [Accepted: 03/14/2016] [Indexed: 11/22/2022] Open

Huang MC, Chuang TP, Chen CH, Wu JY, Chen YT, Li LH, Yang HC. An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays. BMC Genomics 2016;17:266. [PMID: 27029637 PMCID: PMC4815280 DOI: 10.1186/s12864-016-2478-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2015] [Accepted: 02/16/2016] [Indexed: 12/19/2022] Open

Abstract

BACKGROUND

Affymetrix Axiom single nucleotide polymorphism (SNP) arrays provide a cost-effective, high-density, and high-throughput genotyping solution for population-optimized analyses. However, no public software is available for the integrated genomic analysis of hybridization intensities and genotypes for this new-generation population-optimized genotyping platform.

RESULTS

A set of statistical methods was developed for an integrated analysis of allele frequency (AF), allelic imbalance (AI), loss of heterozygosity (LOH), long contiguous stretch of homozygosity (LCSH), and copy number variation or alteration (CNV/CNA) on the basis of SNP probe hybridization intensities and genotypes. This study analyzed 3,236 samples that were genotyped using different SNP platforms. The proposed AF adjustment method considerably increased the accuracy of AF estimation. The proposed quick circular binary segmentation algorithm for segmenting copy number reduced the computation time of the original segmentation method by 30-67 %. The proposed CNV/CNA detection, which integrates AI and LOH/LCSH detection, had a promising true positive rate and well-controlled false positive rate in simulation studies. Moreover, our real-time quantitative polymerase chain reaction experiments successfully validated the CNVs/CNAs that were identified in the Axiom data analyses using the proposed methods; some of the validated CNVs/CNAs were not detected in the Affymetrix Array 6.0 data analysis using the Affymetrix Genotyping Console. All the analysis functions are packaged into the ALICE (AF/LOH/LCSH/AI/CNV/CNA Enterprise) software.

CONCLUSIONS

ALICE and the used genomic reference databases, which can be downloaded from http://hcyang.stat.sinica.edu.tw/software/ALICE.html , are useful resources for analyzing genomic data from the Axiom and other SNP arrays.

Collapse

Abunimer AN, Salazar J, Noursi DP, Abu-Asab MS. A Systems Biology Interpretation of Array Comparative Genomic Hybridization (aCGH) Data through Phylogenetics. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2016;20:169-79. [PMID: 26983023 PMCID: PMC4799695 DOI: 10.1089/omi.2015.0184] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Abstract

Array Comparative Genomic Hybridization (aCGH) is a rapid screening technique to detect gene deletions and duplications, providing an overview of chromosomal aberrations throughout the entire genome of a tumor, without the need for cell culturing. However, the heterogeneity of aCGH data obfuscates existing methods of data analysis. Analysis of aCGH data from a systems biology perspective or in the context of total aberrations is largely absent in the published literature. We present here a novel alternative to the functional analysis of aCGH data using the phylogenetic paradigm that is well-suited to high dimensional datasets of heterogeneous nature, but has not been widely adapted to aCGH data. Maximum parsimony phylogenetic analysis sorts out genetic data through the simplest presentation of the data on a cladogram, a graphical evolutionary tree, thus providing a powerful and efficient method for aCGH data analysis. For example, the cladogram models the multiphasic changes in the cancer genome and identifies shared early mutations in the disease progression, providing a simple yet powerful means of aCGH data interpretation. As such, applying maximum parsimony phylogenetic analysis to aCGH results allows for the differentiation between drivers and passenger genes aberrations in cancer specimens. In addition to offering a novel methodology to analyze aCGH results, we present here a crucial software suite that we wrote to carry out the analysis. In a broader context, we wish to underscore that phylogenetic analysis of aCGH data is a non-parametric method that circumvents the pitfalls and frustrations of standard analytical techniques that rely on parametric statistics. Organizing the data in a cladogram as explained in this research article provides insights into the disease common aberrations, as well as the disease subtypes and their shared aberrations (the synapomorphies) of each subtype. Hence, we report the method and make the software suite publicly and freely available at http://software.phylomcs.com so that researchers can test alternative and innovative approaches to the analysis of aCGH data.

Collapse

Gao X. Penalized weighted low-rank approximation for robust recovery of recurrent copy number variations. BMC Bioinformatics 2015;16:407. [PMID: 26652207 PMCID: PMC4676147 DOI: 10.1186/s12859-015-0835-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2015] [Accepted: 11/23/2015] [Indexed: 11/10/2022] Open

Mohammadi M, Hodtani GA, Yassi M. A robust Correntropy-based method for analyzing multisample aCGH data. Genomics 2015;106:257-64. [DOI: 10.1016/j.ygeno.2015.07.008] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2015] [Revised: 07/14/2015] [Accepted: 07/20/2015] [Indexed: 11/16/2022]

Chan HP, Walther G. Optimal detection of multi-sample aligned sparse signals. Ann Stat 2015. [DOI: 10.1214/15-aos1328] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Anjum S, Morganella S, D'Angelo F, Iavarone A, Ceccarelli M. VEGAWES: variational segmentation on whole exome sequencing for copy number detection. BMC Bioinformatics 2015;16:315. [PMID: 26416038 PMCID: PMC4587906 DOI: 10.1186/s12859-015-0748-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2015] [Accepted: 09/16/2015] [Indexed: 11/10/2022] Open

Masecchia S, Coco S, Barla A, Verri A, Tonini GP. Genome instability model of metastatic neuroblastoma tumorigenesis by a dictionary learning algorithm. BMC Med Genomics 2015;8:57. [PMID: 26358114 PMCID: PMC4566396 DOI: 10.1186/s12920-015-0132-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2015] [Accepted: 08/28/2015] [Indexed: 12/21/2022] Open

Abstract

Background

Metastatic neuroblastoma (NB) occurs in pediatric patients as stage 4S or stage 4 and it is characterized by heterogeneous clinical behavior associated with diverse genotypes. Tumors of stage 4 contain several structural copy number aberrations (CNAs) rarely found in stage 4S. To date, the NB tumorigenesis is not still elucidated, although it is evident that genomic instability plays a critical role in the genesis of the tumor. Here we propose a mathematical approach to decipher genomic data and we provide a new model of NB metastatic tumorigenesis.

Method

We elucidate NB tumorigenesis using Enhanced Fused Lasso Latent Feature Model (E-FLLat) modeling the array comparative chromosome hybridization (aCGH) data of 190 metastatic NBs (63 stage 4S and 127 stage 4). This model for aCGH segmentation, based on the minimization of functional dictionary learning (DL), combines several penalties tailored to the specificities of aCGH data. In DL, the original signal is approximated by a linear weighted combination of atoms: the elements of the learned dictionary.

Results

The hierarchical structures for stage 4S shows at the first level of the oncogenetic tree several whole chromosome gains except to the unbalanced gains of 17q, 2p and 2q. Conversely, the high CNA complexity found in stage 4 tumors, requires two different trees. Both stage 4 oncogenetic trees are marked diverged, up to five sublevels and the 17q gain is the most common event at the first level (2/3 nodes). Moreover the 11q deletion, one of the major unfavorable marker of disease progression, occurs before 3p loss indicating that critical chromosome aberrations appear at early stages of tumorigenesis. Finally, we also observed a significant (p = 0.025) association between patient age and chromosome loss in stage 4 cases.

Conclusion

These results led us to propose a genome instability progressive model in which NB cells initiate with a DNA synthesis uncoupled from cell division, that leads to stage 4S tumors, primarily characterized by numerical aberrations, or stage 4 tumors with high levels of genome instability resulting in complex chromosome rearrangements associated with high tumor aggressiveness and rapid disease progression.

Electronic supplementary material

The online version of this article (doi:10.1186/s12920-015-0132-y) contains supplementary material, which is available to authorized users.

Collapse

Arsuaga J, Borrman T, Cavalcante R, Gonzalez G, Park C. Identification of Copy Number Aberrations in Breast Cancer Subtypes Using Persistence Topology. MICROARRAYS (BASEL, SWITZERLAND) 2015;4:339-69. [PMID: 27600228 PMCID: PMC4996377 DOI: 10.3390/microarrays4030339] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/09/2015] [Accepted: 08/03/2015] [Indexed: 01/01/2023]

Abstract

DNA copy number aberrations (CNAs) are of biological and medical interest because they help identify regulatory mechanisms underlying tumor initiation and evolution. Identification of tumor-driving CNAs (driver CNAs) however remains a challenging task, because they are frequently hidden by CNAs that are the product of random events that take place during tumor evolution. Experimental detection of CNAs is commonly accomplished through array comparative genomic hybridization (aCGH) assays followed by supervised and/or unsupervised statistical methods that combine the segmented profiles of all patients to identify driver CNAs. Here, we extend a previously-presented supervised algorithm for the identification of CNAs that is based on a topological representation of the data. Our method associates a two-dimensional (2D) point cloud with each aCGH profile and generates a sequence of simplicial complexes, mathematical objects that generalize the concept of a graph. This representation of the data permits segmenting the data at different resolutions and identifying CNAs by interrogating the topological properties of these simplicial complexes. We tested our approach on a published dataset with the goal of identifying specific breast cancer CNAs associated with specific molecular subtypes. Identification of CNAs associated with each subtype was performed by analyzing each subtype separately from the others and by taking the rest of the subtypes as the control. Our results found a new amplification in 11q at the location of the progesterone receptor in the Luminal A subtype. Aberrations in the Luminal B subtype were found only upon removal of the basal-like subtype from the control set. Under those conditions, all regions found in the original publication, except for 17q, were confirmed; all aberrations, except those in chromosome arms 8q and 12q were confirmed in the basal-like subtype. These two chromosome arms, however, were detected only upon removal of three patients with exceedingly large copy number values. More importantly, we detected 10 and 21 additional regions in the Luminal B and basal-like subtypes, respectively. Most of the additional regions were either validated on an independent dataset and/or using GISTIC. Furthermore, we found three new CNAs in the basal-like subtype: a combination of gains and losses in 1p, a gain in 2p and a loss in 14q. Based on these results, we suggest that topological approaches that incorporate multiresolution analyses and that interrogate topological properties of the data can help in the identification of copy number changes in cancer.

Collapse

Yokoyama T, Miura F, Araki H, Okamura K, Ito T. Changepoint detection in base-resolution methylome data reveals a robust signature of methylated domain landscape. BMC Genomics 2015;16:594. [PMID: 26265481 PMCID: PMC4534107 DOI: 10.1186/s12864-015-1809-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2014] [Accepted: 08/03/2015] [Indexed: 01/08/2023] Open

Nam JY, Kim NKD, Kim SC, Joung JG, Xi R, Lee S, Park PJ, Park WY. Evaluation of somatic copy number estimation tools for whole-exome sequencing data. Brief Bioinform 2015. [PMID: 26210357 DOI: 10.1093/bib/bbv055] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Zhou L, Palais RA, Paxton CN, Geiersbach KB, Wittwer CT. Copy Number Assessment by Competitive PCR with Limiting Deoxynucleotide Triphosphates and High-Resolution Melting. Clin Chem 2015;61:724-33. [DOI: 10.1373/clinchem.2014.236208] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2014] [Accepted: 02/02/2015] [Indexed: 11/06/2022]

Abstract Abstract BACKGROUND DNA copy number variation is associated with genetic disorders and cancer. Available methods to discern variation in copy number are typically costly, slow, require specialized equipment, and/or lack precision. METHODS Multiplex PCR with different primer pairs and limiting deoxynucleotide triphosphates (dNTPs) (3–12 μmol/L) were used for relative quantification and copy number assessment. Small PCR products (50–121 bp) were designed with 1 melting domain, well-separated Tms, minimal internal sequence variation, and no common homologs. PCR products were displayed as melting curves on derivative plots and normalized to the reference peak. Different copy numbers of each target clustered together and were grouped by unbiased hierarchical clustering. RESULTS Duplex PCR of a reference gene and a target gene was used to detect copy number variation in chromosomes X, Y, 13, 18, 21, epidermal growth factor receptor (EGFR), survival of motor neuron 1, telomeric (SMN1), and survival of motor neuron 2, centromeric (SMN2). Triplex PCR was used for X and Y and CFTR exons 2 and 3. Blinded studies of 50 potential trisomic samples (13, 18, 21, or normal) and 50 samples with potential sex chromosome abnormalities were concordant to karyotyping, except for 2 samples that were originally mosaics that displayed a single karyotype after growth. Large cystic fibrosis transmembrane conductance regulator (ATP-binding cassette sub-family C, member 7) (CFTR) deletions, EGFR amplifications, and SMN1 and SMN2 copy number assessments were also demonstrated. Under ideal conditions, copy number changes of 1.11-fold or lower could be discerned with CVs of about 1%. CONCLUSIONS Relative quantification by restricting the dNTP concentration with melting curve display is a simple and precise way to assess targeted copy number variation. Collapse

Hybrid algorithms for multiple change-point detection in biological sequences. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2015;823:41-61. [PMID: 25381101 DOI: 10.1007/978-3-319-10984-8_3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]

Priyadarshana WJRM, Sofronov G. Multiple Break-Points Detection in Array CGH Data via the Cross-Entropy Method. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015;12:487-498. [PMID: 26357234 DOI: 10.1109/tcbb.2014.2361639] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Zhao C, Tynan J, Ehrich M, Hannum G, McCullough R, Saldivar JS, Oeth P, van den Boom D, Deciu C. Detection of fetal subchromosomal abnormalities by sequencing circulating cell-free DNA from maternal plasma. Clin Chem 2015;61:608-16. [PMID: 25710461 DOI: 10.1373/clinchem.2014.233312] [Citation(s) in RCA: 118] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Du C, Kao CLM, Kou SC. Stepwise Signal Extraction via Marginal Likelihood. J Am Stat Assoc 2015;111:314-330. [PMID: 27212739 PMCID: PMC4874345 DOI: 10.1080/01621459.2015.1006365] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2013] [Revised: 12/01/2014] [Indexed: 10/24/2022]