1
|
Jiang YF, Wang S, Wang CL, Xu RH, Wang WW, Jiang Y, Wang MS, Jiang L, Dai LH, Wang JR, Chu XH, Zeng YQ, Fang LZ, Wu DD, Zhang Q, Ding XD. Pangenome obtained by long-read sequencing of 11 genomes reveal hidden functional structural variants in pigs. iScience 2023; 26:106119. [PMID: 36852268 PMCID: PMC9958381 DOI: 10.1016/j.isci.2023.106119] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 12/21/2022] [Accepted: 01/30/2023] [Indexed: 02/05/2023] Open
Abstract
Long-read sequencing (LRS) facilitates both the genome assembly and the discovery of structural variants (SVs). Here, we built a graph-based pig pangenome by incorporating 11 LRS genomes with an average of 94.01% BUSCO completeness score, revealing 206-Mb novel sequences. We discovered 183,352 nonredundant SVs (63% novel), representing 12.12% of the reference genome. By genotyping SVs in an additional 196 short-read sequencing samples, we identified thousands of population stratified SVs. Particularly, we detected 7,568 Tibetan specific SVs, some of which demonstrate significant population differentiation between Tibetan and low-altitude pigs, which might be associated with the high-altitude hypoxia adaptation in Tibetan pigs. Further integrating functional genomic data, the most promising candidate genes within the SVs that might contribute to the high-altitude hypoxia adaptation were discovered. Overall, our study generates a benchmark pangenome resource for illustrating the important roles of SVs in adaptive evolution, domestication, and genetic improvement of agronomic traits in pigs.
Collapse
Affiliation(s)
- Yi-Fan Jiang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Sheng Wang
- State Key Laboratory of Genetic Resources and Evolution, Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
| | - Chong-Long Wang
- Key Laboratory of Pig Molecular Quantitative Genetics of Anhui Academy of Agricultural Sciences, Anhui Provincial Key Laboratory of Livestock and Poultry Product Safety Engineering, Institute of Animal Husbandry and Veterinary Medicine, Anhui Academy of Agricultural Sciences, Hefei 230031, China
| | - Ru-Hai Xu
- Key Laboratory of Animal Genetics and Breeding of Zhejiang Province, Institute of Animal Husbandry and Veterinary Science, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Wen-Wen Wang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Taian 271001, China
| | - Yao Jiang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
- Key Laboratory of Pig Molecular Quantitative Genetics of Anhui Academy of Agricultural Sciences, Anhui Provincial Key Laboratory of Livestock and Poultry Product Safety Engineering, Institute of Animal Husbandry and Veterinary Medicine, Anhui Academy of Agricultural Sciences, Hefei 230031, China
| | - Ming-Shan Wang
- State Key Laboratory of Genetic Resources and Evolution, Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
| | - Li Jiang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Li-He Dai
- Key Laboratory of Animal Genetics and Breeding of Zhejiang Province, Institute of Animal Husbandry and Veterinary Science, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Jie-Ru Wang
- Key Laboratory of Pig Molecular Quantitative Genetics of Anhui Academy of Agricultural Sciences, Anhui Provincial Key Laboratory of Livestock and Poultry Product Safety Engineering, Institute of Animal Husbandry and Veterinary Medicine, Anhui Academy of Agricultural Sciences, Hefei 230031, China
| | - Xiao-Hong Chu
- Key Laboratory of Animal Genetics and Breeding of Zhejiang Province, Institute of Animal Husbandry and Veterinary Science, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Yong-Qing Zeng
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Taian 271001, China
| | - Ling-Zhao Fang
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, 8000, Denmark
| | - Dong-Dong Wu
- State Key Laboratory of Genetic Resources and Evolution, Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Taian 271001, China
| | - Xiang-Dong Ding
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| |
Collapse
|
2
|
Chen H, Xue J, Zhang Z, Zhang G, Xu X, Li H, Zhang R, Ullah N, Chen L, Amanullah, Zang Z, Lai S, He X, Li W, Guan M, Li J, Chen L, Deng C. High-speed rail model reveals the gene tandem amplification mediated by short repeated sequence in eukaryote. Sci Rep 2022; 12:2289. [PMID: 35145182 PMCID: PMC8831618 DOI: 10.1038/s41598-022-06250-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Accepted: 01/24/2022] [Indexed: 02/08/2023] Open
Abstract
The occurrence of gene duplication/amplification (GDA) provide potential material for adaptive evolution with environmental stress. Several molecular models have been proposed to explain GDA, recombination via short stretches of sequence similarity plays a crucial role. By screening genomes for such events, we propose a “SRS (short repeated sequence) *N + unit + SRS*N” amplified unit under USCE (unequal sister-chromatid exchange) for tandem amplification mediated by SRS with different repeat numbers in eukaryotes. The amplified units identified from 2131 well-organized amplification events that generate multi gene/element copy amplified with subsequent adaptive evolution in the respective species. Genomic data we analyzed showed dynamic changes among related species or subspecies or plants from different ecotypes/strains. This study clarifies the characteristics of variable copy number SRS on both sides of amplified unit under USCE mechanism, to explain well-organized gene tandem amplification under environmental stress mediated by SRS in all eukaryotes.
Collapse
Affiliation(s)
- Haidi Chen
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Rd., Nanjing, 210023, China
| | - Jingwen Xue
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Rd., Nanjing, 210023, China
| | - Zhenghou Zhang
- The Fourth Affiliated Hospital of China Medical University, Shenyang, 110032, China
| | - Geyu Zhang
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Rd., Nanjing, 210023, China
| | - Xinyuan Xu
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Rd., Nanjing, 210023, China
| | - He Li
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Rd., Nanjing, 210023, China
| | - Ruxue Zhang
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Rd., Nanjing, 210023, China
| | - Najeeb Ullah
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Rd., Nanjing, 210023, China
| | - Lvxing Chen
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Rd., Nanjing, 210023, China
| | - Amanullah
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Rd., Nanjing, 210023, China
| | - Zhuqing Zang
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Rd., Nanjing, 210023, China
| | - Shanshan Lai
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Rd., Nanjing, 210023, China
| | - Ximiao He
- Department of Physiology, School of Basic Medicine, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, Hubei, China.,Center for Genomics and Proteomics Research, School of Basic Medicine, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, Hubei, China.,Hubei Key Laboratory of Drug Target Research and Pharmacodynamic Evaluation, Huazhong University of Science and Technology, Wuhan, 430030, Hubei, China
| | - Wei Li
- Department of Dermatovenereology, Institutes for Systems Genetics, Rare Disease Center, West China Hospital, Sichuan University, No. 37 Guo Xue Xiang Street, Chengdu, 610041, Sichuan, China
| | - Miao Guan
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Rd., Nanjing, 210023, China.
| | - Jingyi Li
- M.D. Department of Dermatology and Venereology, West China Hospital of Sichuan University, No. 37 Guo Xue Lane, Chengdu, 610041, China.
| | - Liangbiao Chen
- Key Laboratory of Exploration and Utilization of Aquatic Genetic Resources (Ministry of Education), Institute of Experimental Pathology, Shanghai Ocean University, Shanghai, 201306, China.
| | - Cheng Deng
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, 1 Wenyuan Rd., Nanjing, 210023, China.
| |
Collapse
|
3
|
Suzuki Y, Myers EW, Morishita S. Rapid and ongoing evolution of repetitive sequence structures in human centromeres. SCIENCE ADVANCES 2020; 6:6/50/eabd9230. [PMID: 33310858 PMCID: PMC7732198 DOI: 10.1126/sciadv.abd9230] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Accepted: 10/30/2020] [Indexed: 06/12/2023]
Abstract
Our understanding of centromere sequence variation across human populations is limited by its extremely long nested repeat structures called higher-order repeats that are challenging to sequence. Here, we analyzed chromosomes 11, 17, and X using long-read sequencing data for 36 individuals from diverse populations including a Han Chinese trio and 21 Japanese. We revealed substantial structural diversity with many previously unidentified variant higher-order repeats specific to individuals characterizing rapid, haplotype-specific evolution of human centromeric arrays, while frequent single-nucleotide variants are largely conserved. We found a characteristic pattern shared among prevalent variants in human and chimpanzee. Our findings pave the way for studying sequence evolution in human and primate centromeres.
Collapse
Affiliation(s)
- Yuta Suzuki
- The University of Tokyo, Graduate School of Frontier Sciences, Department of Computational Biology and Medical Sciences, Kashiwa, Chiba 277-8568, Japan.
| | - Eugene W Myers
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
| | - Shinichi Morishita
- The University of Tokyo, Graduate School of Frontier Sciences, Department of Computational Biology and Medical Sciences, Kashiwa, Chiba 277-8568, Japan.
| |
Collapse
|
4
|
Miga KH. Centromeric Satellite DNAs: Hidden Sequence Variation in the Human Population. Genes (Basel) 2019; 10:E352. [PMID: 31072070 PMCID: PMC6562703 DOI: 10.3390/genes10050352] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2019] [Revised: 05/03/2019] [Accepted: 05/03/2019] [Indexed: 12/30/2022] Open
Abstract
The central goal of medical genomics is to understand the inherited basis of sequence variation that underlies human physiology, evolution, and disease. Functional association studies currently ignore millions of bases that span each centromeric region and acrocentric short arm. These regions are enriched in long arrays of tandem repeats, or satellite DNAs, that are known to vary extensively in copy number and repeat structure in the human population. Satellite sequence variation in the human genome is often so large that it is detected cytogenetically, yet due to the lack of a reference assembly and informatics tools to measure this variability, contemporary high-resolution disease association studies are unable to detect causal variants in these regions. Nevertheless, recently uncovered associations between satellite DNA variation and human disease support that these regions present a substantial and biologically important fraction of human sequence variation. Therefore, there is a pressing and unmet need to detect and incorporate this uncharacterized sequence variation into broad studies of human evolution and medical genomics. Here I discuss the current knowledge of satellite DNA variation in the human genome, focusing on centromeric satellites and their potential implications for disease.
Collapse
Affiliation(s)
- Karen H Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California, CA 95064, USA.
| |
Collapse
|
5
|
McNulty SM, Sullivan BA. Alpha satellite DNA biology: finding function in the recesses of the genome. Chromosome Res 2018; 26:115-138. [PMID: 29974361 DOI: 10.1007/s10577-018-9582-3] [Citation(s) in RCA: 88] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2018] [Accepted: 06/14/2018] [Indexed: 02/05/2023]
Abstract
Repetitive DNA, formerly referred to by the misnomer "junk DNA," comprises a majority of the human genome. One class of this DNA, alpha satellite, comprises up to 10% of the genome. Alpha satellite is enriched at all human centromere regions and is competent for de novo centromere assembly. Because of the highly repetitive nature of alpha satellite, it has been difficult to achieve genome assemblies at centromeres using traditional next-generation sequencing approaches, and thus, centromeres represent gaps in the current human genome assembly. Moreover, alpha satellite DNA is transcribed into repetitive noncoding RNA and contributes to a large portion of the transcriptome. Recent efforts to characterize these transcripts and their function have uncovered pivotal roles for satellite RNA in genome stability, including silencing "selfish" DNA elements and recruiting centromere and kinetochore proteins. This review will describe the genomic and epigenetic features of alpha satellite DNA, discuss recent findings of noncoding transcripts produced from distinct alpha satellite arrays, and address current progress in the functional understanding of this oft-neglected repetitive sequence. We will discuss unique challenges of studying human satellite DNAs and RNAs and point toward new technologies that will continue to advance our understanding of this largely untapped portion of the genome.
Collapse
Affiliation(s)
- Shannon M McNulty
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC, 27710, USA
| | - Beth A Sullivan
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC, 27710, USA. .,Division of Human Genetics, Duke University Medical Center, Durham, NC, 27710, USA.
| |
Collapse
|
6
|
Satović E, Vojvoda Zeljko T, Luchetti A, Mantovani B, Plohl M. Adjacent sequences disclose potential for intra-genomic dispersal of satellite DNA repeats and suggest a complex network with transposable elements. BMC Genomics 2016; 17:997. [PMID: 27919246 PMCID: PMC5139131 DOI: 10.1186/s12864-016-3347-1] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Accepted: 11/25/2016] [Indexed: 11/14/2022] Open
Abstract
Background Satellite DNA (satDNA) sequences are typically arranged as arrays of tandemly repeated monomers. Due to the similarity among monomers, their organizational pattern and abundance, satDNAs are hardly accessible to structural and functional studies and still represent the most obscure genome component. Although many satDNA arrays of diverse length and even single monomers exist in the genome, surprisingly little is known about transition from satDNAs to other sequences. Studying satDNA monomers at junctions and identifying DNA sequences adjacent to them can help to understand the processes that (re)distribute satDNAs and significance that evolution of these sequence elements might have in creating the genomic landscape. Results We explored sets of randomly selected satDNA-harboring genomic fragments in four mollusc species to examine satDNA transition sites, and the nature of adjacent sequences. All examined junctions are characterized by abrupt transitions from satDNAs to other sequences. Among them, junctions of only one examined satDNA mapped non-randomly (within the palindrome), indicating that well-defined sequence feature is not a necessary prerequisite in the junction formation. In the studied sample, satDNA flanking sequences can be roughly classified into two groups. The first group is composed of anonymous DNA sequences which occasionally include short segments of transposable elements (TEs) as well as segments of other satDNA sequences. In the second group, satDNA repeats and the array flanking sequences are identified as parts of TEs of the Helitron superfamily. There, some array flanking regions hold fragmented satDNA monomers alternating with anonymous sequences of comparable length as missing monomer parts, suggesting a process of sequence reorganization by a mechanism able to excise short monomer parts and replace them with unrelated sequences. Conclusions The observed architecture of satDNA transition sites can be explained as a result of insertion and/or recombination events involving short arrays of satDNA monomers and TEs, in combination with hypothetical transposition-related ability of satDNA monomers to be shuffled independently in the genome. We conclude that satDNAs and TEs can form a complex network of sequences which essentially share the propagation mechanisms and in synergy shape the genome. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3347-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Eva Satović
- Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| | | | - Andrea Luchetti
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali-Università di Bologna, Bologna, Italy
| | - Barbara Mantovani
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali-Università di Bologna, Bologna, Italy
| | - Miroslav Plohl
- Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia.
| |
Collapse
|
7
|
Aldrup-MacDonald ME, Kuo ME, Sullivan LL, Chew K, Sullivan BA. Genomic variation within alpha satellite DNA influences centromere location on human chromosomes with metastable epialleles. Genome Res 2016; 26:1301-1311. [PMID: 27510565 PMCID: PMC5052062 DOI: 10.1101/gr.206706.116] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2016] [Accepted: 08/08/2016] [Indexed: 01/27/2023]
Abstract
Alpha satellite is a tandemly organized type of repetitive DNA that comprises 5% of the genome and is found at all human centromeres. A defined number of 171-bp monomers are organized into chromosome-specific higher-order repeats (HORs) that are reiterated thousands of times. At least half of all human chromosomes have two or more distinct HOR alpha satellite arrays within their centromere regions. We previously showed that the two alpha satellite arrays of Homo sapiens Chromosome 17 (HSA17), D17Z1 and D17Z1-B, behave as centromeric epialleles, that is, the centromere, defined by chromatin containing the centromeric histone variant CENPA and recruitment of other centromere proteins, can form at either D17Z1 or D17Z1-B. Some individuals in the human population are functional heterozygotes in that D17Z1 is the active centromere on one homolog and D17Z1-B is active on the other. In this study, we aimed to understand the molecular basis for how centromere location is determined on HSA17. Specifically, we focused on D17Z1 genomic variation as a driver of epiallele formation. We found that D17Z1 arrays that are predominantly composed of HOR size and sequence variants were functionally less competent. They either recruited decreased amounts of the centromere-specific histone variant CENPA and the HSA17 was mitotically unstable, or alternatively, the centromere was assembled at D17Z1-B and the HSA17 was stable. Our study demonstrates that genomic variation within highly repetitive, noncoding DNA of human centromere regions has a pronounced impact on genome stability and basic chromosomal function.
Collapse
Affiliation(s)
- Megan E Aldrup-MacDonald
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina 27710, USA
| | - Molly E Kuo
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina 27710, USA
| | - Lori L Sullivan
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina 27710, USA
| | - Kimberline Chew
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina 27710, USA
| | - Beth A Sullivan
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina 27710, USA; Division of Human Genetics, Duke University Medical Center, Durham, North Carolina 27710, USA
| |
Collapse
|
8
|
Murata C, Kuroki Y, Imoto I, Kuroiwa A. Ancestral Y-linked genes were maintained by translocation to the X and Y chromosomes fused to an autosomal pair in the Okinawa spiny rat Tokudaia muenninki. Chromosome Res 2016; 24:407-19. [DOI: 10.1007/s10577-016-9531-y] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2016] [Accepted: 06/14/2016] [Indexed: 11/29/2022]
|
9
|
Sevim V, Bashir A, Chin CS, Miga KH. Alpha-CENTAURI: assessing novel centromeric repeat sequence variation with long read sequencing. Bioinformatics 2016; 32:1921-1924. [PMID: 27153570 PMCID: PMC4920115 DOI: 10.1093/bioinformatics/btw101] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2015] [Accepted: 02/17/2016] [Indexed: 11/13/2022] Open
Abstract
Motivation: Long arrays of near-identical tandem repeats are a common feature of centromeric and subtelomeric regions in complex genomes. These sequences present a source of repeat structure diversity that is commonly ignored by standard genomic tools. Unlike reads shorter than the underlying repeat structure that rely on indirect inference methods, e.g. assembly, long reads allow direct inference of satellite higher order repeat structure. To automate characterization of local centromeric tandem repeat sequence variation we have designed Alpha-CENTAURI (ALPHA satellite CENTromeric AUtomated Repeat Identification), that takes advantage of Pacific Bioscience long-reads from whole-genome sequencing datasets. By operating on reads prior to assembly, our approach provides a more comprehensive set of repeat-structure variants and is not impacted by rearrangements or sequence underrepresentation due to misassembly. Results: We demonstrate the utility of Alpha-CENTAURI in characterizing repeat structure for alpha satellite containing reads in the hydatidiform mole (CHM1, haploid-like) genome. The pipeline is designed to report local repeat organization summaries for each read, thereby monitoring rearrangements in repeat units, shifts in repeat orientation and sites of array transition into non-satellite DNA, typically defined by transposable element insertion. We validate the method by showing consistency with existing centromere high order repeat references. Alpha-CENTAURI can, in principle, run on any sequence data, offering a method to generate a sequence repeat resolution that could be readily performed using consensus sequences available for other satellite families in genomes without high-quality reference assemblies. Availability and implementation: Documentation and source code for Alpha-CENTAURI are freely available at http://github.com/volkansevim/alpha-CENTAURI. Contact:ali.bashir@mssm.edu Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Volkan Sevim
- Pacific Biosciences, Inc., Menlo Park, CA 94025, USA
| | - Ali Bashir
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1425 Madison Avenue, New York, NY 10029, USA.,Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, 1425 Madison Avenue, New York, NY 10029, USA
| | | | - Karen H Miga
- Center for Biomolecular Science and Engineering, University of California, Santa Cruz, CA 95064, USA
| |
Collapse
|
10
|
Zahn J, Kaplan MH, Fischer S, Dai M, Meng F, Saha AK, Cervantes P, Chan SM, Dube D, Omenn GS, Markovitz DM, Contreras-Galindo R. Expansion of a novel endogenous retrovirus throughout the pericentromeres of modern humans. Genome Biol 2015; 16:74. [PMID: 25886262 PMCID: PMC4425911 DOI: 10.1186/s13059-015-0641-1] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2015] [Accepted: 03/23/2015] [Indexed: 01/20/2023] Open
Abstract
BACKGROUND Approximately 8% of the human genome consists of sequences of retroviral origin, a result of ancestral infections of the germ line over millions of years of evolution. The most recent of these infections is attributed to members of the human endogenous retrovirus type-K (HERV-K) (HML-2) family. We recently reported that a previously undetected, large group of HERV-K (HML-2) proviruses, which are descendants of the ancestral K111 infection, are spread throughout human centromeres. RESULTS Studying the genomes of certain cell lines and the DNA of healthy individuals that seemingly lack K111, we discover new HERV-K (HML-2) members hidden in pericentromeres of several human chromosomes. All are related through a common ancestor, termed K222, which is a virus that infected the germ line approximately 25 million years ago. K222 exists as a single copy in the genomes of baboons and high order primates, but not New World monkeys, suggesting that progenitor K222 infected the primate germ line after the split between New and Old World monkeys. K222 exists in modern humans at multiple loci spread across the pericentromeres of nine chromosomes, indicating it was amplified during the evolution of modern humans. CONCLUSIONS Copying of K222 may have occurred through recombination of the pericentromeres of different chromosomes during human evolution. Evidence of recombination between K111 and K222 suggests that these retroviral sequences have been templates for frequent cross-over events during the process of centromere recombination in humans.
Collapse
Affiliation(s)
- Joseph Zahn
- Department of Internal Medicine, Division of Infectious Diseases and Programs in Immunology, Cancer Biology, and Cellular and Molecular Biology, University of Michigan, Ann Arbor, MI, 48109, USA.
| | - Mark H Kaplan
- Department of Internal Medicine, Division of Infectious Diseases and Programs in Immunology, Cancer Biology, and Cellular and Molecular Biology, University of Michigan, Ann Arbor, MI, 48109, USA.
| | - Sabrina Fischer
- Department of Internal Medicine, Division of Infectious Diseases and Programs in Immunology, Cancer Biology, and Cellular and Molecular Biology, University of Michigan, Ann Arbor, MI, 48109, USA.
| | - Manhong Dai
- Molecular and Behavioral Neuroscience Institute, University of Michigan, Ann Arbor, MI, 48109, USA.
| | - Fan Meng
- Molecular and Behavioral Neuroscience Institute, University of Michigan, Ann Arbor, MI, 48109, USA.
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, 48109, USA.
| | - Anjan Kumar Saha
- Department of Internal Medicine, Division of Infectious Diseases and Programs in Immunology, Cancer Biology, and Cellular and Molecular Biology, University of Michigan, Ann Arbor, MI, 48109, USA.
| | - Patrick Cervantes
- Department of Internal Medicine, Division of Infectious Diseases and Programs in Immunology, Cancer Biology, and Cellular and Molecular Biology, University of Michigan, Ann Arbor, MI, 48109, USA.
| | - Susana M Chan
- Department of Internal Medicine, Division of Infectious Diseases and Programs in Immunology, Cancer Biology, and Cellular and Molecular Biology, University of Michigan, Ann Arbor, MI, 48109, USA.
| | - Derek Dube
- Department of Internal Medicine, Division of Infectious Diseases and Programs in Immunology, Cancer Biology, and Cellular and Molecular Biology, University of Michigan, Ann Arbor, MI, 48109, USA.
| | - Gilbert S Omenn
- Departments of Computational Medicine and Bioinformatics, Internal Medicine, and Human Genetics, and School of Public Health, University of Michigan, Ann Arbor, MI, 48109, USA.
| | - David M Markovitz
- Department of Internal Medicine, Division of Infectious Diseases and Programs in Immunology, Cancer Biology, and Cellular and Molecular Biology, University of Michigan, Ann Arbor, MI, 48109, USA.
- Department of Internal Medicine, Division of Infectious Diseases, University of Michigan, Ann Arbor, MI, 48109-5640, USA.
| | - Rafael Contreras-Galindo
- Department of Internal Medicine, Division of Infectious Diseases and Programs in Immunology, Cancer Biology, and Cellular and Molecular Biology, University of Michigan, Ann Arbor, MI, 48109, USA.
- Department of Internal Medicine, Division of Infectious Diseases, University of Michigan, Ann Arbor, MI, 48109-5640, USA.
| |
Collapse
|
11
|
Henikoff JG, Thakur J, Kasinathan S, Henikoff S. A unique chromatin complex occupies young α-satellite arrays of human centromeres. SCIENCE ADVANCES 2015; 1:e1400234. [PMID: 25927077 PMCID: PMC4410388 DOI: 10.1126/sciadv.1400234] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
The intractability of homogeneous α-satellite arrays has impeded understanding of human centromeres. Artificial centromeres are produced from higher-order repeats (HORs) present at centromere edges, although the exact sequences and chromatin conformations of centromere cores remain unknown. We use high-resolution chromatin immunoprecipitation (ChIP) of centromere components followed by clustering of sequence data as an unbiased approach to identify functional centromere sequences. We find that specific dimeric α-satellite units shared by multiple individuals dominate functional human centromeres. We identify two recently homogenized α-satellite dimers that are occupied by precisely positioned CENP-A (cenH3) nucleosomes with two ~100-base pair (bp) DNA wraps in tandem separated by a CENP-B/CENP-C-containing linker, whereas pericentromeric HORs show diffuse positioning. Precise positioning is largely maintained, whereas abundance decreases exponentially with divergence, which suggests that young α-satellite dimers with paired ~100-bp particles mediate evolution of functional human centromeres. Our unbiased strategy for identifying functional centromeric sequences should be generally applicable to tandem repeat arrays that dominate the centromeres of most eukaryotes.
Collapse
Affiliation(s)
- Jorja G. Henikoff
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Jitendra Thakur
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
- Howard Hughes Medical Institute, Seattle, WA 98109, USA
| | - Sivakanthan Kasinathan
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
- Medical Scientist Training Program, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Steven Henikoff
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
- Howard Hughes Medical Institute, Seattle, WA 98109, USA
- Corresponding author. E-mail:
| |
Collapse
|
12
|
Scott KC, Sullivan BA. Neocentromeres: a place for everything and everything in its place. Trends Genet 2013; 30:66-74. [PMID: 24342629 DOI: 10.1016/j.tig.2013.11.003] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Revised: 11/15/2013] [Accepted: 11/19/2013] [Indexed: 01/07/2023]
Abstract
Centromeres are essential for chromosome inheritance and genome stability. Centromeric proteins, including the centromeric histone centromere protein A (CENP-A), define the site of centromeric chromatin and kinetochore assembly. In many organisms, centromeres are located in or near regions of repetitive DNA. However, some atypical centromeres spontaneously form on unique sequences. These neocentromeres, or new centromeres, were first identified in humans, but have since been described in other organisms. Neocentromeres are functionally and structurally similar to endogenous centromeres, but lack the added complication of underlying repetitive sequences. Here, we discuss recent studies in chicken and fungal systems where genomic engineering can promote neocentromere formation. These studies reveal key genomic and epigenetic factors that support de novo centromere formation in eukaryotes.
Collapse
Affiliation(s)
- Kristin C Scott
- Institute for Genome Sciences & Policy, Duke University, DUMC 3382, Durham, NC 27708, USA; Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC 27710, USA; Division of Human Genetics, Duke University Medical Center, Durham, NC 27710, USA.
| | - Beth A Sullivan
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC 27710, USA; Division of Human Genetics, Duke University Medical Center, Durham, NC 27710, USA.
| |
Collapse
|
13
|
Meštrović N, Pavlek M, Car A, Castagnone-Sereno P, Abad P, Plohl M. Conserved DNA Motifs, Including the CENP-B Box-like, Are Possible Promoters of Satellite DNA Array Rearrangements in Nematodes. PLoS One 2013; 8:e67328. [PMID: 23826269 PMCID: PMC3694981 DOI: 10.1371/journal.pone.0067328] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2013] [Accepted: 05/17/2013] [Indexed: 12/27/2022] Open
Abstract
Tandemly arrayed non-coding sequences or satellite DNAs (satDNAs) are rapidly evolving segments of eukaryotic genomes, including the centromere, and may raise a genetic barrier that leads to speciation. However, determinants and mechanisms of satDNA sequence dynamics are only partially understood. Sequence analyses of a library of five satDNAs common to the root-knot nematodes Meloidogyne chitwoodi and M. fallax together with a satDNA, which is specific for M. chitwoodi only revealed low sequence identity (32-64%) among them. However, despite sequence differences, two conserved motifs were recovered. One of them turned out to be highly similar to the CENP-B box of human alpha satDNA, identical in 10-12 out of 17 nucleotides. In addition, organization of nematode satDNAs was comparable to that found in alpha satDNA of human and primates, characterized by monomers concurrently arranged in simple and higher-order repeat (HOR) arrays. In contrast to alpha satDNA, phylogenetic clustering of nematode satDNA monomers extracted either from simple or from HOR array indicated frequent shuffling between these two organizational forms. Comparison of homogeneous simple arrays and complex HORs composed of different satDNAs, enabled, for the first time, the identification of conserved motifs as obligatory components of monomer junctions. This observation highlights the role of short motifs in rearrangements, even among highly divergent sequences. Two mechanisms are proposed to be involved in this process, i.e., putative transposition-related cut-and-paste insertions and/or illegitimate recombination. Possibility for involvement of the nematode CENP-B box-like sequence in the transposition-related mechanism and together with previously established similarity of the human CENP-B protein and pogo-like transposases implicate a novel role of the CENP-B box and related sequence motifs in addition to the known function in centromere protein binding.
Collapse
Affiliation(s)
- Nevenka Meštrović
- Department of Molecular Biology, Rudjer Bošković Institute, Zagreb, Croatia
| | | | | | | | | | | |
Collapse
|
14
|
Tsai WL, Forbes JG, Wang K. Engineering of an elastic scaffolding polyprotein based on an SH3-binding intrinsically disordered titin PEVK module. Protein Expr Purif 2012; 85:187-99. [PMID: 22910563 PMCID: PMC3463739 DOI: 10.1016/j.pep.2012.08.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2012] [Revised: 07/24/2012] [Accepted: 08/03/2012] [Indexed: 01/21/2023]
Abstract
Titin is a large elastic protein found in muscle that maintains the elasticity and structural integrity of the sarcomere. The PEVK region of titin is intrinsically disordered, highly elastic and serves as a hub to bind signaling proteins. Systematic investigation of the structure and affinity profile of the PEVK region will provide important information about the functions of titin. Since PEVK is highly heterogeneous due to extensive differential splicing from more than one hundred exons, we engineered and expressed polyproteins that consist of a defined number of identical single exon modules. These customized polyproteins reduce heterogeneity, amplify interactions of less dominant modules, and most importantly, provide tags for atomic force microscopy and allow more readily interpretable data from single-molecule techniques. Expression and purification of recombinant polyprotein with repeat regions presented many technical challenges: recombination events in tandem repeats of identical DNA sequences exacerbated by high GC content, toxicity of polymer plasmid and expressed protein to the bacteria; early truncation of proteins expressed with different numbers of modules; and extreme sensitivity to proteolysis. We have investigated a number of in vitro and in vivo bacterial and yeast expression systems, as well as baculoviral systems as potential solutions to these problems. We successfully expressed and purified in gram quantities a polyprotein derived from human titin exon 172 using Pichia pastoris yeast. This study provides valuable insights into the technical challenges regarding the engineering and purification of a tandem repeat sequence of an intrinsically disordered biopolymer.
Collapse
Affiliation(s)
- Wanxia Li Tsai
- Muscle Proteomics and Nanotechnology Section, Laboratory of Muscle Biology, National Institute of Arthritis and Musculoskeletal and Skin Diseases, NIH/DHHS, Bethesda, MD 20892-8024, USA.
| | | | | |
Collapse
|
15
|
Evtushenko EV, Elisafenko EA, Vershinin AV. The relationship between two tandem repeat families in rye heterochromatin. Mol Biol 2010. [DOI: 10.1134/s0026893310010012] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
16
|
|
17
|
Takasuka TE, Cioffi A, Stein A. Sequence information encoded in DNA that may influence long-range chromatin structure correlates with human chromosome functions. PLoS One 2008; 3:e2643. [PMID: 18612465 PMCID: PMC2440353 DOI: 10.1371/journal.pone.0002643] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2008] [Accepted: 06/11/2008] [Indexed: 11/18/2022] Open
Abstract
Little is known about the possible function of the bulk of the human genome. We have recently shown that long-range regular oscillation in the motif non-T, A/T, G (VWG) existing at ten-nucleotide multiples influences large-scale nucleosome array formation. In this work, we have determined the locations of all 100 kb regions that are predicted to form distinctive chromatin structures throughout each human chromosome (except Y). Using these data, we found that a significantly greater fraction of 300 kb sequences lacked annotated transcripts in genomic DNA regions > or = 300 kb that contained nearly continuous chromatin organizing signals than in control regions. We also found a relationship between the meiotic recombination frequency and the presence of strong VWG chromatin organizing signals. Large (> or = 300 kb) genomic DNA regions having low average recombination frequency are enriched in chromatin organizing signals. As additional controls, we show using chromosome 1 that the VWG motif signals are not enriched in randomly selected DNA regions having the mean size of the recombination coldspots, and that non-VWG motif sets do not generate signals that are enriched in recombination coldspots. We also show that tandemly repeated alpha satellite DNA contains strong VWG signals for the formation of distinctive nucleosome arrays, consistent with the low recombination activity of centromeres. Our correlations cannot be explained simply by variations in the GC content. Our findings suggest that a specific set of periodic DNA motifs encoded in genomic DNA, which provide signals for chromatin organization, influence human chromosome function.
Collapse
Affiliation(s)
- Taichi E. Takasuka
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, United States of America
| | - Alfred Cioffi
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, United States of America
| | - Arnold Stein
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, United States of America
- * E-mail:
| |
Collapse
|
18
|
Abstract
Centromeres are special structures of eukaryotic chromosomes that hold sister chromatid together and ensure proper chromosome segregation during cell division. Centromeres consist of repeated sequences, which have hindered the study of centromere mitotic recombination and its consequences for centromeric function. We use a chromosome orientation fluorescence in situ hybridization technique to visualize and quantify recombination events at mouse centromeres. We show that centromere mitotic recombination occurs in normal cells to a higher frequency than telomere recombination and to a much higher frequency than chromosome-arm recombination. Furthermore, we show that centromere mitotic recombination is increased in cells lacking the Dnmt3a and Dnmt3b DNA methyltransferases, suggesting that the epigenetic state of centromeric heterochromatin controls recombination events at these regions. Increased centromere recombination in Dnmt3a,3b-deficient cells is accompanied by changes in the length of centromere repeats, suggesting that prevention of illicit centromere recombination is important to maintain centromere integrity in the mouse.
Collapse
Affiliation(s)
- Isabel Jaco
- Telomeres and Telomerase Group, Molecular Oncology Program, Spanish National Cancer Centre, 28029 Madrid, Spain
| | | | | | | |
Collapse
|
19
|
Abstract
Two novel repetitive sequence families were isolated from Turritis glabra (2n = 2x = 12). These two repeat families are similar to those of centromeric repeats in Arabidopsis thaliana, are co-localized on one chromosome pair, and differ by about 20% from each other. Phylogenetic analysis revealed that the two repeat families of T. glabra are more similar to each other than to the centromeric repeat families of other Arabidopsis and related species. The relationships of satellite sequences reflected the species phylogeny, indicating that the replacement of satellite sequences has occurred in each species lineage independently, and shared variants could not have existed for a long time between species.
Collapse
Affiliation(s)
- Akira Kawabe
- Laboratory of Plant Genetics, Graduate School of Agriculture, Kyoto University, Kyoto, Japan.
| | | |
Collapse
|
20
|
Mravinac B, Plohl M. Satellite DNA junctions identify the potential origin of new repetitive elements in the beetle Tribolium madens. Gene 2007; 394:45-52. [PMID: 17379457 DOI: 10.1016/j.gene.2007.01.019] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2006] [Revised: 01/24/2007] [Accepted: 01/26/2007] [Indexed: 11/25/2022]
Abstract
Two related satellite DNA families (satellite I and satellite II) with complex higher-order repeat (HOR) monomers represent major DNA components equilocated in the pericentromeric heterochromatin of all Tribolium madens chromosomes. Fragments obtained upon genomic DNA restriction revealed two subfamilies of satellite II monomers, and also identified regions of transition between satellite I and satellite II sequences. The two subfamilies differ not only in diagnostic nucleotides, but also in flipped orientation of constituent subunits. Hybrid genomic fragments comprise directly linked satellite I and satellite II monomers that cannot be distinguished from randomly cloned monomers of corresponding families. An exception is the most proximal satellite I monomer in the hybrid fragment named TMADhinf, which shows sequence divergence typical for repeats evolving at array ends, in zones of low homogenization efficiency. This pattern points to the extensive rearrangement processes generating abrupt transitions between satellite arrays combined with array maintenance by unequal crossover. Switching points between adjacent satellites as well as the edges of flipped subunits are localized within a short sequence segment, indicating a preferential site of recombination within satellite subunits. Multiple copies of TMADhinf junction fragment support the hypothesis that sites of evolutionary origin of novel satellite repeat (sub)families can be localized at array ends, in regions of enhanced sequence divergence.
Collapse
Affiliation(s)
- Brankica Mravinac
- Department of Molecular Biology, Ruder Bosković Institute, Bijenicka 54, HR-10002 Zagreb, Croatia
| | | |
Collapse
|
21
|
Kolas NK, Svetlanov A, Lenzi ML, Macaluso FP, Lipkin SM, Liskay RM, Greally J, Edelmann W, Cohen PE. Localization of MMR proteins on meiotic chromosomes in mice indicates distinct functions during prophase I. ACTA ACUST UNITED AC 2005; 171:447-58. [PMID: 16260499 PMCID: PMC2171243 DOI: 10.1083/jcb.200506170] [Citation(s) in RCA: 100] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Mammalian MutL homologues function in DNA mismatch repair (MMR) after replication errors and in meiotic recombination. Both functions are initiated by a heterodimer of MutS homologues specific to either MMR (MSH2-MSH3 or MSH2-MSH6) or crossing over (MSH4-MSH5). Mutations of three of the four MutL homologues (Mlh1, Mlh3, and Pms2) result in meiotic defects. We show herein that two distinct complexes involving MLH3 are formed during murine meiosis. The first is a stable association between MLH3 and MLH1 and is involved in promoting crossing over in conjunction with MSH4-MSH5. The second complex involves MLH3 together with MSH2-MSH3 and localizes to repetitive sequences at centromeres and the Y chromosome. This complex is up-regulated in Pms2-/- males, but not females, providing an explanation for the sexual dimorphism seen in Pms2-/- mice. The association of MLH3 with repetitive DNA sequences is coincident with MSH2-MSH3 and is decreased in Msh2-/- and Msh3-/- mice, suggesting a novel role for the MMR family in the maintenance of repeat unit integrity during mammalian meiosis.
Collapse
Affiliation(s)
- Nadine K Kolas
- Department of Molecular Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Kawabe A, Nasuda S. Structure and genomic organization of centromeric repeats in Arabidopsis species. Mol Genet Genomics 2004; 272:593-602. [PMID: 15586291 DOI: 10.1007/s00438-004-1081-x] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2004] [Accepted: 10/05/2004] [Indexed: 10/26/2022]
Abstract
Centromeric repetitive sequences were isolated from Arabidopsis halleri ssp. gemmifera and A. lyrata ssp. kawasakiana. Two novel repeat families isolated from A. gemmifera were designated pAge1 and pAge2. These repeats are 180 bp in length and are organized in a head-to-tail manner. They are similar to the pAL1 repeats of A. thaliana and the pAa units of A. arenosa. Both A. gemmifera and A. kawasakiana possess the pAa, pAge1 and pAge2 repeat families. Sequence comparisons of different centromeric repeats revealed that these families share a highly conserved region of approximately 50 bp. Within each of the four repeat families, two or three regions showed low levels of sequence variation. The average difference in nucleotide sequence was approximately 10% within families and 30% between families, which resulted in clear distinctions between families upon phylogenetic analysis. FISH analysis revealed that the localization patterns for the pAa, pAge1 and pAge2 families were chromosome specific in A. gemmifera and A. kawasakiana. In one pair of chromosomes in A. gemmifera, and three pairs of chromosomes in A. kawasakiana, two repeat families were present. The presence of three families of centromeric repeats in A. gemmifera and A. kawasakiana indicates that the first step toward homogenization of centromeric repeats occurred at the chromosome level.
Collapse
Affiliation(s)
- A Kawabe
- Laboratory of Plant Genetics, Graduate School of Agriculture, Kyoto University, 606-8502, Kyoto, Japan
| | | |
Collapse
|
23
|
Paar V, Pavin N, Rosandic M, Gluncic M, Basar I, Pezer R, Zinic SD. ColorHOR--novel graphical algorithm for fast scan of alpha satellite higher-order repeats and HOR annotation for GenBank sequence of human genome. Bioinformatics 2004; 21:846-52. [PMID: 15509609 DOI: 10.1093/bioinformatics/bti072] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION GenBank data are at present lacking alpha satellite higher-order repeat (HOR) annotation. Furthermore, exact HOR consensus lengths have not been reported so far. Given the fast growth of sequence databases in the centromeric region, it is of increasing interest to have efficient tools for computational identification and analysis of HORs from known sequences. RESULTS We develop a graphical user interface method, ColorHOR, for fast computational identification of HORs in a given genomic sequence, without requiring a priori information on the composition of the genomic sequence. ColorHOR is based on an extension of the key-string algorithm and provides a color representation of the order and orientation of HORs. For the key string, we use a robust 6 bp string from a consensus alpha satellite and its representative nature is tested. ColorHOR algorithm provides a direct visual identification of HORs (direct and/or reverse complement). In more detail, we first illustrate the ColorHOR results for human chromosome 1. Using ColorHOR we determine for the first time the HOR annotation of the GenBank sequence of the whole human genome. In addition to some HORs, corresponding to those determined previously biochemically, we find new HORs in chromosomes 4, 8, 9, 10, 11 and 19. For the first time, we determine exact consensus lengths of HORs in 10 chromosomes. We propose that the HOR assignment obtained by using ColorHOR be included into the GenBank database.
Collapse
Affiliation(s)
- Vladimir Paar
- Faculty of Science, University of Zagreb Bijenicka 32, 10000 Zagreb, Croatia.
| | | | | | | | | | | | | |
Collapse
|
24
|
Kato M. Evaluation of intra- and interspecific divergence of satellite DNA sequences by nucleotide frequency calculation and pairwise sequence comparison. Biol Proced Online 2003; 5:63-68. [PMID: 12734555 PMCID: PMC152575 DOI: 10.1251/bpo47] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2002] [Revised: 01/19/2003] [Accepted: 02/06/2003] [Indexed: 11/24/2022] Open
Abstract
Satellite DNA sequences are known to be highly variable and to have been subjected to concerted evolution that homogenizes member sequences within species. We have analyzed the mode of evolution of satellite DNA sequences in four fishes from the genus Diplodus by calculating the nucleotide frequency of the sequence array and the phylogenetic distances between member sequences. Calculation of nucleotide frequency and pairwise sequence comparison enabled us to characterize the divergence among member sequences in this satellite DNA family. The results suggest that the evolutionary rate of satellite DNA in D. bellottii is about two-fold greater than the average of the other three fishes, and that the sequence homogenization event occurred in D. puntazzo more recently than in the others. The procedures described here are effective to characterize mode of evolution of satellite DNA.
Collapse
Affiliation(s)
- Mikio Kato
- Department of Life Sciences, Osaka Prefecture University. 1-1 Gakuencho, Sakai 599-8531. Japan
| |
Collapse
|
25
|
Grimes BR, Rhoades AA, Willard HF. Alpha-satellite DNA and vector composition influence rates of human artificial chromosome formation. Mol Ther 2002; 5:798-805. [PMID: 12027565 DOI: 10.1006/mthe.2002.0612] [Citation(s) in RCA: 84] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Human artificial chromosomes (HACs) have been proposed as a new class of potential gene transfer and gene therapy vector. HACs can be formed when bacterial cloning vectors containing alpha-satellite DNA are transfected into cultured human cells. We have compared the HAC-forming potential of different sequences to identify features critical to the efficiency of the process. Chromosome 17 or 21 alpha-satellite arrays are highly competent HAC-forming substrates in this assay. In contrast, a Y-chromosome-derived alpha-satellite sequence is inefficient, suggesting that centromere specification is at least partly dependent on DNA sequence. The length of the input array is also an important determinant, as reduction of the chromosome-17-based array from 80 kb to 35 kb reduced the frequency of HAC formation. In addition to the alpha-satellite component, vector composition also influenced HAC formation rates, size, and copy number. The data presented here have a significant impact on the design of future HAC vectors that have potential to be developed for therapeutic applications and as tools for investigating human chromosome structure and function.
Collapse
MESH Headings
- Cell Line
- Centromere/physiology
- Chromosomes, Artificial, Human/genetics
- Chromosomes, Artificial, Human/physiology
- Chromosomes, Human, Pair 21
- Chromosomes, Human, Y
- Cytogenetic Analysis
- DNA, Satellite/genetics
- Genetic Vectors
- Humans
- In Situ Hybridization, Fluorescence
- Kinetochores/physiology
- Mitosis
Collapse
Affiliation(s)
- Brenda R Grimes
- Department of Genetics, Case Western Reserve University School of Medicine, and Center for Human Genetics and Research Institute, University Hospitals of Cleveland, Cleveland, Ohio 44106, USA
| | | | | |
Collapse
|
26
|
Mashkova TD, Oparina NY, Lacroix MH, Fedorova LI, G Tumeneva I, Zinovieva OL, Kisselev LL. Structural rearrangements and insertions of dispersed elements in pericentromeric alpha satellites occur preferably at kinkable DNA sites. J Mol Biol 2001; 305:33-48. [PMID: 11114245 DOI: 10.1006/jmbi.2000.4270] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Centromeric region of human chromosome 21 comprises two long alphoid DNA arrays: the well homogenized and CENP-B box-rich alpha21-I and the alpha21-II, containing a set of less homogenized and CENP-B box-poor subfamilies located closer to the short arm of the chromosome. Continuous alphoid fragment of 100 monomers bordering the non-satellite sequences in human chromosome 21 was mapped to the pericentromeric short arm region by fluorescence in situ hybridization (alpha21-II locus). The alphoid sequence contained several rearrangements including five large deletions within monomers and insertions of three truncated L1 elements. No binding sites for centromeric protein CENP-B were found. We analyzed sequences with alphoid/non-alphoid junctions selectively screened from current databases and revealed various rearrangements disrupting the regular tandem alphoid structure, namely, deletions, duplications, inversions, expansions of short oligonucleotide motifs and insertions of different dispersed elements. The detailed analysis of more than 1100 alphoid monomers from junction regions showed that the vast majority of structural alterations and joinings with non-alphoid DNAs occur in alpha satellite families lacking CENP-B boxes. Most analyzed events were found in sequences located toward the edges of the centromeric alphoid arrays. Different dispersed elements were inserted into alphoid DNA at kinkable dinucleotides (TG, CA or TA) situated between pyrimidine/purine tracks. DNA rearrangements resulting from different processes such as recombination and replication occur at kinkable DNA sites alike insertions but irrespectively of the occurrence of pyrimidine/purine tracks. It seems that kinkable dinucleotides TG, CA and TA are part of recognition signals for many proteins involved in recombination, replication, and insertional events. Alphoid DNA is a good model for studying these processes.
Collapse
MESH Headings
- Alu Elements/genetics
- Autoantigens
- Base Sequence
- Binding Sites
- Centromere/chemistry
- Centromere/genetics
- Centromere/metabolism
- Centromere Protein B
- Chromosomal Proteins, Non-Histone/metabolism
- Chromosome Deletion
- Chromosome Inversion
- Chromosomes, Human, Pair 21/chemistry
- Chromosomes, Human, Pair 21/genetics
- Chromosomes, Human, Pair 21/metabolism
- Computational Biology
- Crossing Over, Genetic/genetics
- DNA Replication/genetics
- DNA, Satellite/chemistry
- DNA, Satellite/genetics
- DNA, Satellite/metabolism
- DNA-Binding Proteins
- Databases as Topic
- Dinucleotide Repeats/genetics
- Humans
- In Situ Hybridization, Fluorescence
- Lymphocytes
- Mutagenesis, Insertional/genetics
- Mutation/genetics
- Nucleic Acid Conformation
- Polymerase Chain Reaction
- Recombination, Genetic/genetics
Collapse
Affiliation(s)
- T D Mashkova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilov Str., Moscow, 117984, Russia.
| | | | | | | | | | | | | |
Collapse
|
27
|
Demirtas H, Candemir Z, Cücer N, Imamoglu N, Dönmez H, Bökesoy I. Essay on the nucleoli survey by the alpha- and beta-satellite DNA probes of the acrocentric chromosomes in mitogen-stimulated human lymphocytes. ANNALES DE GENETIQUE 2000; 43:61-8. [PMID: 10998446 DOI: 10.1016/s0003-3995(00)01014-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
The two constitutive heterochromatin (alpha- and beta-satellite DNA) probes of human acrocentric chromosomes were assayed separately to label the nucleoli in the phytohemagglutinin (PHA)-stimulated human lymphocytes. Fluorescent in situ hybridisation (FISH) results have shown that: a) whole (100%) signal-nucleoli overlapping was obtained with both heterochromatin probes in maximally activated nuclei (MANs); b) partial overlapping was observed in non-activated or slightly activated nuclei; c) random signal-nucleolus overlapping (background level) was found to be approximately 6% by the NOR-irrelevant euchromatic probe (D5S23); d) Yq-nucleolus association in the MANs was found to be approximately 97% without the subtraction of the background level. We concluded that: a) acrocentric alpha- or beta-satellite DNA probes may be used as nucleolar markers only in the MANs and not in slightly activated or non-activated nuclei; b) the distances between rDNA loci and alpha-/beta-satellite DNA on human acrocentrics are short enough to permit their observation on the same nucleolus.
Collapse
Affiliation(s)
- H Demirtas
- Medical Biology and Genetics Department, Medical Faculty, Erciyes University, 38039, Kayseri, Turkey.
| | | | | | | | | | | |
Collapse
|
28
|
Kato M, Kato A, Shimizu N. A method for evaluating phylogenetic relationship of alpha-satellite DNA suprachromosomal family by nucleotide frequency calculation. Mol Phylogenet Evol 1999; 13:329-35. [PMID: 10603261 DOI: 10.1006/mpev.1999.0665] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The sequence similarity among chromosome-specific alpha-satellite DNA was quantitatively evaluated by a novel procedure: nucleotide frequency calculation. Tandem-arrayed repetitive DNA segments were aligned with unit length repeat, and the nucleotide frequency at each position was used to estimate the phylogenetic distance between repetitive DNA segments. The calculations for human and chimpanzee X chromosome alpha-satellites showed that the results were consistent with the known relationships of primates, indicating that the nucleotide frequency calculation worked effectively to estimate the distances between satellite arrays. Human chromosome-specific alpha-satellites had been grouped into three suprachromosomal families (I, II, and III), and in the current work the nucleotide frequency analysis has defined the quantitative distances between the chromosome-specific alpha-satellite DNA.
Collapse
Affiliation(s)
- M Kato
- College of the Integrated Arts and Sciences, Osaka Prefecture University, 1-1 Gakuencho, Sakai, 599-8531, Japan.
| | | | | |
Collapse
|
29
|
Abstract
Mammalian artificial chromosomes (MACs) hold the promise of providing autonomous vectors for gene therapy in dividing cells. They would not require insertion into the genome and could include sufficient genomic sequences that surround the therapeutic gene to ensure proper tissue-specific and temporal regulation. Several groups have reported successful formation of MACs in human cells using transfection strategies that included alpha satellite DNA, the primary DNA found at normal human centromeres. These results, although extremely encouraging, have limitations such as unpredictable chromosome formation and success thus far in only one transformed human cell line. Examination of other cells where alpha satellite DNA has integrated into ectopic chromosomal locations, as well as naturally occurring dicentric and neocentromere-containing cell lines, suggests that alpha satellite DNA may not be necessary or sufficient for centromere formation. Overall, these results suggest that epigenetic modifications of centromeric DNA are required for efficient centromere formation. Models for this centromere-specific epigenetic modification include a specialized chromatin structure and differential replication timing of centromeric DNA. Thus, further investigation of these centromere-specific epigenetic modifications may suggest strategies for increasing the efficiency of generating human artificial chromosomes for use as gene therapy vectors.
Collapse
Affiliation(s)
- P E Warburton
- Department of Human Genetics, Mount Sinai School of Medicine, 1425 Madison Avenue, New York, New York, 10029, USA
| |
Collapse
|
30
|
Puechberty J, Laurent AM, Gimenez S, Billault A, Brun-Laurent ME, Calenda A, Marçais B, Prades C, Ioannou P, Yurov Y, Roizès G. Genetic and physical analyses of the centromeric and pericentromeric regions of human chromosome 5: recombination across 5cen. Genomics 1999; 56:274-87. [PMID: 10087194 DOI: 10.1006/geno.1999.5742] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Human centromeres are poorly understood at both the genetic and the physical level. In this paper, we have been able to distinguish the alphoid centromeric sequences of chromosome 5 from those of chromosome 19. This result was obtained by pulsed-field gel electrophoresis after cutting genomic DNA with restriction endonucleases NcoI (chromosome 5) and BamHI (chromosome 19). We could thus define a highly polymorphic marker, representing length variations of the D5Z1 domain located at the q arm boundary of the chromosome 5 centromere. The centromeric region of chromosome 5 was then analyzed in full detail. We established an approximately 4.6-Mb physical map of the whole region with five rare-cutting enzymes by using nonchimeric YACs, two of which were shown to contain the very ends of 5cen on both sides. The p-arm side of 5cen was shown to contain an alphoid subset (D5Z12) different from those described thus far. Two genes and several putative cDNAs could be precisely located close to the centromere. Several L1 elements were shown to be present within alpha satellites at the boundary between alphoid and nonalphoid sequences on both sides of 5cen. They were used to define STSs that could serve as physical anchor points at the junction of 5cen with the p and q arms. Some STSs were placed on a radiation hybrid map. One was polymorphic and could therefore be used as a second centromeric genetic marker at the p arm boundary of 5cen. We could thus estimate recombination rates within and around the centromeric region of chromosome 5. Recombination is highly reduced within 5cen, with zero recombinants in 58 meioses being detected between the two markers located at the two extremities of the centromere. In its immediate vicinity, 5cen indeed exerts a direct negative effect on meiotic recombination within the proximal chromosomal DNA. This effect is, however, less important than expected and is polarized, as different rates are observed on both arms if one compares the 0 cM/Mb of the p proximal first 5.5 Mb and the 0.64 cM/Mb of the q proximal first 5 Mb to the sex-average 1.02 cM/Mb found throughout the entire chromosome 5. Rates then become close to the average when one goes further within the arms. Finally, most recombinants (21/22), irrespective of the arm, are of female origin, thus showing that recombination around 5cen is essentially occurring in the female lineage.
Collapse
MESH Headings
- Blotting, Southern
- Centromere/genetics
- Chromosomes, Artificial, Yeast
- Chromosomes, Human, Pair 5/genetics
- Contig Mapping
- Electrophoresis, Gel, Pulsed-Field
- Humans
- Models, Genetic
- Molecular Sequence Data
- Nucleic Acid Hybridization
- Oligonucleotide Probes
- Pedigree
- Physical Chromosome Mapping
- Recombination, Genetic
- Restriction Mapping
- Sequence Analysis, DNA
- Sequence Tagged Sites
- Temperature
Collapse
Affiliation(s)
- J Puechberty
- Institut de Génétique Humaine, CNRS UPR 1142, Institut de Biologie, 4 Boulevard Henri IV, Montpellier Cedex, 34060, France
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Craig JM, Earnshaw WC, Vagnarelli P. Mammalian centromeres: DNA sequence, protein composition, and role in cell cycle progression. Exp Cell Res 1999; 246:249-62. [PMID: 9925740 DOI: 10.1006/excr.1998.4278] [Citation(s) in RCA: 62] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The centromere is a specialized region of the eukaryotic chromosome that is responsible for directing chromosome movements in mitosis and for coordinating the progression of mitotic events at the crucial transition between metaphase and anaphase. In this review, we will focus on recent advances in the understanding of centromere composition at the protein and DNA level and of the role of centromeres in sister-chromatid cohesion and mitotic checkpoint control.
Collapse
Affiliation(s)
- J M Craig
- Institute of Cell and Molecular Biology, University of Edinburgh, Edinburgh, EH9 3JR, Scotland, United Kingdom
| | | | | |
Collapse
|
32
|
Mashkova T, Oparina N, Alexandrov I, Zinovieva O, Marusina A, Yurov Y, Lacroix MH, Kisselev L. Unequal cross-over is involved in human alpha satellite DNA rearrangements on a border of the satellite domain. FEBS Lett 1998; 441:451-7. [PMID: 9891989 DOI: 10.1016/s0014-5793(98)01600-7] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
It can be invoked from the theory of tandem repeat homogenization that DNA on a satellite/non-satellite border may carry sequence marks of molecular processes basic to satellite evolution. We have sequenced a continuous 17-kb alpha satellite fragment bordering the non-satellite in human chromosome 21, which is devoid of higher-order repeated structure, contains multiple rearrangements, and exhibits higher divergence of monomers towards the border, indicating the lack of efficient homogenization. Remarkably, monomers have been found with mutually supplementary deletions matching each other as reciprocal products of unequal recombination, which provide evidence for unequal cross-over as a mechanism generating deletions in satellite DNA.
Collapse
Affiliation(s)
- T Mashkova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow.
| | | | | | | | | | | | | | | |
Collapse
|
33
|
Alves G, Seuánez HN, Fanning T. A clade of New World primates with distinctive alphoid satellite DNAs. Mol Phylogenet Evol 1998; 9:220-4. [PMID: 9562981 DOI: 10.1006/mpev.1997.0462] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Centromerically located alphoid satellite DNAs are present in all primates. They typically consist of arrays of a 340-bp monomeric unit that is composed of related, but diverged, 170-bp subunits. A unique monomeric unit has recently been described: the alphoid satellite monomers of the neotropical primate Chiropotes satanas (bearded saki) are typically 539 bp in length. In addition, a number of smaller satellite sequences are present in this species. Analysis of two primates closely related to Chiropotes, Pithecia irrorata (saki) and Cacajao melanocephalus (uakari), show that they also contain unique alphoid satellites that are different from those of Chiropotes and different from one another. Southern blot and sequence analyses suggest that an alphoid satellite rearrangement(s) occurred early in the history of the tribe Pitheciini (Chiropotes, Pithecia, Cacajao) and that rearrangements are continuing to occur in this group of primates.
Collapse
Affiliation(s)
- G Alves
- Genetics Section, Instituto Nacional do Cancer, Rio de Janeiro, Brazil
| | | | | |
Collapse
|
34
|
Vafa O, Sullivan KF. Chromatin containing CENP-A and alpha-satellite DNA is a major component of the inner kinetochore plate. Curr Biol 1997; 7:897-900. [PMID: 9382804 DOI: 10.1016/s0960-9822(06)00381-2] [Citation(s) in RCA: 128] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The pathway of molecular interactions leading to kinetochore assembly on mammalian chromosomes is unknown. Kinetochores could be specified by structural features of centromeric satellite DNA [1-3] or by specific DNA sequences, analogous to budding yeast centromeres, interspersed in centromeric satellite DNA arrays [4,5]. Alternatively, kinetochores could be epigenetic structures that replicate without strict dependence on DNA sequence [6-8]. We purified kinetochore-associated chromatin from human chromosomes by immunoprecipitation of CENP-A, a centromere-specific histone H3 homologue located in the inner plate of the kinetochore [6,9,10]. Hybridization and DNA sequence analyses of cloned kinetochore DNA fragments revealed alpha-satellite as the predominant sequence associated with CENP-A. A major site of micrococcal nuclease digestion was identified by mapping the termini of alpha-satellite clones, suggesting that the inner kinetochore plate contains phased arrays of CENP-A-alpha-satellite nucleosomes. These experiments demonstrate for the first time that complex satellite DNA is a structural component of the kinetochore. Further, because complex satellite DNA is evolutionarily unconserved, these results suggest that molecular recognition events necessary for kinetochore formation take place at the level of DNA conformation or epigenetic mechanisms rather than DNA sequence per se.
Collapse
Affiliation(s)
- O Vafa
- Department of Cell Biology, The Scripps Research Institute, La Jolla, California 92037, USA
| | | |
Collapse
|
35
|
Gangloff S, Zou H, Rothstein R. Gene conversion plays the major role in controlling the stability of large tandem repeats in yeast. EMBO J 1996; 15:1715-25. [PMID: 8612596 PMCID: PMC450084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
The genomic stability of the rDNA tandem array in yeast is tightly controlled to allow sequence homogenization and at the same time prevent deleterious rearrangements. In our study, we show that gene conversion, and not unequal sister chromatid exchange, is the predominant recombination mechanism regulating the expansion and contraction of the rDNA array. Furthermore, we found that RAD52, which is essential for gene conversion, is required for marker duplication stimulated in the absence of the two yeast type I topoisomerases. Our results have implications for the mechanisms regulating genomic stability of repetitive sequence families found in all eukaryotes.
Collapse
Affiliation(s)
- S Gangloff
- Department of Genetics and Development, College of Physicians and Surgeons, Columbia University, New York, NY 10032, USA
| | | | | |
Collapse
|
36
|
Abstract
The centromere, recognized cytologically as the primary constriction, is essential for chromosomal attachment to the spindle and for proper segregation of mitotic and meiotic chromosomes. Considerable progress has been made in identifying both DNA and protein components of the centromere and kinetochore complex in mammalian chromosomes, including definition of specific motor proteins with demonstrable functions in chromosome movement. Searches for possible environmental influences on chromosome disjunction might logically be based on known components of the segregation apparatus, both intrinsic and extrinsic to the chromosomes themselves. This article reviews available information on both DNA and protein components of the centromere of mammalian, particularly human, chromosomes and summarizes our current understanding of their role(s) in facilitating normal chromosome behavior in mitosis and meiosis.
Collapse
Affiliation(s)
- B A Sullivan
- Department of Genetics, Case Western Reserve, University School of Medicine, Cleveland, Ohio 44106-4955, USA
| | | | | |
Collapse
|
37
|
Keeling PJ, Doolittle WF. Concerted evolution in protists: recent homogenization of a polyubiquitin gene in Trichomonas vaginalis. J Mol Evol 1995; 41:556-62. [PMID: 7490769 DOI: 10.1007/bf00175813] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Ubiquitin is a 76-amino-acid protein with a remarkably high degree of conservation between all known sequences. Ubiquitin genes are almost always multicopy in eukaryotes, and often are found as polyubiquitin genes--fused tandem repeats which are coexpressed. Seventeen ubiquitin sequences from the amitochondrial protist Trichomonas vaginalis have been examined here, including an 11-repeat fragment of a polyubiquitin gene. These sequences reveal a number of interesting features that are not seen in other eukaryotes. The predicted amino acid sequences lack several universally conserved residues, and individual units do not always encode identical peptides as is usually the case. On the nucleotide level, these repeats are in general highly variable, but one region in the polyubiquitin is extremely homogeneous, with seven repeats absolutely identical. Such extended stretches of homogeneity have never been observed in ubiquitin genes and since substitutions are common in other coding units, it is likely that these repeats are the product of a very recent homogenization or amplification.
Collapse
Affiliation(s)
- P J Keeling
- Canadian Institute for Advanced Research, Department of Biochemistry, Dalhousie University, Halifax, Nova Scotia, Canada
| | | |
Collapse
|
38
|
Charlesworth B, Sniegowski P, Stephan W. The evolutionary dynamics of repetitive DNA in eukaryotes. Nature 1994; 371:215-20. [PMID: 8078581 DOI: 10.1038/371215a0] [Citation(s) in RCA: 1030] [Impact Index Per Article: 33.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Repetitive DNA sequences form a large portion of the genomes of eukaryotes. The 'selfish DNA' hypothesis proposes that they are maintained by their ability to replicate within the genome. The behaviour of repetitive sequences can result in mutations that cause genetic diseases, and confer significant fitness losses on the organism. Features of the organization of repetitive sequences in eukaryotic genomes, and their distribution in natural populations, reflect the evolutionary forces acting on selfish DNA.
Collapse
Affiliation(s)
- B Charlesworth
- Department of Ecology and Evolution, University of Chicago, Illinois 60637
| | | | | |
Collapse
|
39
|
Abstract
The alpha satellite DNA of Old World (catarrhine) primates usually consists of similar, but not identical, ca. 170 bp sequences repeated tandemly hundreds to thousands of times. The 170 bp monomeric repeats are components of higher-order repeats, many of which are chromosome specific. Alpha satellites are found exclusively in centromeric regions where they appear to play a role in centromere function. We have found that alpha satellite DNA in neotropical (New World; platyrrhine) primates is very similar to its Old World counterpart: it consists of divergent ca. 170 bp subsequences that are arranged in tandem arrays with a ca. 340 bp periodicity. New and Old World alpha satellites share about 64% sequence identity overall, and contain several short sequence motifs that appear to be highly conserved. One exception to the tandemly arrayed 340 bp motif has been found: the major alpha satellite array in Chiropotes satanas (black bearded saki) has a 539 bp repeat unit that consists of a 338 bp dimer together with a duplication of 33 bp of the first monomeric unit and 168 bp of the second monomeric unit.
Collapse
Affiliation(s)
- G Alves
- Genetics Section, National Cancer Institute, Rio de Janeiro, Brazil
| | | | | |
Collapse
|