1
|
Sullivan LL, Sullivan BA. Genomic and functional variation of human centromeres. Exp Cell Res 2020; 389:111896. [PMID: 32035947 DOI: 10.1016/j.yexcr.2020.111896] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Revised: 01/29/2020] [Accepted: 02/05/2020] [Indexed: 10/25/2022]
Abstract
Centromeres are central to chromosome segregation and genome stability, and thus their molecular foundations are important for understanding their function and the ways in which they go awry. Human centromeres typically form at large megabase-sized arrays of alpha satellite DNA for which there is little genomic understanding due to its repetitive nature. Consequently, it has been difficult to achieve genome assemblies at centromeres using traditional next generation sequencing approaches, so that centromeres represent gaps in the current human genome assembly. The role of alpha satellite DNA has been debated since centromeres can form, albeit rarely, on non-alpha satellite DNA. Conversely, the simple presence of alpha satellite DNA is not sufficient for centromere function since chromosomes with multiple alpha satellite arrays only exhibit a single location of centromere assembly. Here, we discuss the organization of human centromeres as well as genomic and functional variation in human centromere location, and current understanding of the genomic and epigenetic mechanisms that underlie centromere flexibility in humans.
Collapse
Affiliation(s)
| | - Beth A Sullivan
- Department of Molecular Genetics and Microbiology, USA; Division of Human Genetics, Duke University School of Medicine, Durham, NC, 27710, USA.
| |
Collapse
|
2
|
Contreras-Galindo R, Fischer S, Saha AK, Lundy JD, Cervantes PW, Mourad M, Wang C, Qian B, Dai M, Meng F, Chinnaiyan A, Omenn GS, Kaplan MH, Markovitz DM. Rapid molecular assays to study human centromere genomics. Genome Res 2017; 27:2040-2049. [PMID: 29141960 PMCID: PMC5741061 DOI: 10.1101/gr.219709.116] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2016] [Accepted: 10/27/2017] [Indexed: 01/16/2023]
Abstract
The centromere is the structural unit responsible for the faithful segregation of chromosomes. Although regulation of centromeric function by epigenetic factors has been well-studied, the contributions of the underlying DNA sequences have been much less well defined, and existing methodologies for studying centromere genomics in biology are laborious. We have identified specific markers in the centromere of 23 of the 24 human chromosomes that allow for rapid PCR assays capable of capturing the genomic landscape of human centromeres at a given time. Use of this genetic strategy can also delineate which specific centromere arrays in each chromosome drive the recruitment of epigenetic modulators. We further show that, surprisingly, loss and rearrangement of DNA in centromere 21 is associated with trisomy 21. This new approach can thus be used to rapidly take a snapshot of the genetics and epigenetics of each specific human centromere in nondisjunction disorders and other biological settings.
Collapse
Affiliation(s)
| | - Sabrina Fischer
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA.,Laboratory of Molecular Virology, Centro de Investigaciones Nucleares, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay 11400
| | - Anjan K Saha
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA.,Medical Scientist Training Program, University of Michigan, Ann Arbor, Michigan 48109, USA.,Program in Cancer Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - John D Lundy
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Patrick W Cervantes
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Mohamad Mourad
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Claire Wang
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Brian Qian
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Manhong Dai
- Molecular and Behavioral Neuroscience Institute, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Fan Meng
- Molecular and Behavioral Neuroscience Institute, University of Michigan, Ann Arbor, Michigan 48109, USA.,Department of Psychiatry, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Arul Chinnaiyan
- Michigan Center for Translational Pathology and Comprehensive Cancer Center, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA.,Howard Hughes Medical Institute, Chevy Chase, Maryland 20815, USA
| | - Gilbert S Omenn
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA.,Department of Human Genetics.,Departments of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Mark H Kaplan
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - David M Markovitz
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan 48109, USA.,Program in Cancer Biology, University of Michigan, Ann Arbor, Michigan 48109, USA.,Program in Immunology, University of Michigan, Ann Arbor, Michigan 48109, USA.,Program in Cellular and Molecular Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
| |
Collapse
|
3
|
Rosandić M, Glunčić M, Paar V, Basar I. The role of alphoid higher order repeats (HORs) in the centromere folding. J Theor Biol 2008; 254:555-60. [DOI: 10.1016/j.jtbi.2008.06.012] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2007] [Revised: 05/13/2008] [Accepted: 06/06/2008] [Indexed: 10/21/2022]
|
4
|
Paar V, Basar I, Rosandić M, Glunčić M. Consensus higher order repeats and frequency of string distributions in human genome. Curr Genomics 2007; 8:93-111. [PMID: 18660848 PMCID: PMC2435359 DOI: 10.2174/138920207780368169] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2007] [Revised: 01/26/2007] [Accepted: 01/30/2007] [Indexed: 02/01/2023] Open
Abstract
Key string algorithm (KSA) could be viewed as robust computational generalization of restriction enzyme method. KSA enables robust and effective identification and structural analyzes of any given genomic sequences, like in the case of NCBI assembly for human genome. We have developed a method, using total frequency distribution of all r-bp key strings in dependence on the fragment length l, to determine the exact size of all repeats within the given genomic sequence, both of monomeric and HOR type. Subsequently, for particular fragment lengths equal to each of these repeat sizes we compute the partial frequency distribution of r-bp key strings; the key string with highest frequency is a dominant key string, optimal for segmentation of a given genomic sequence into repeat units. We illustrate how a wide class of 3-bp key strings leads to a key-string-dependent periodic cell which enables a simple identification and consensus length determinations of HORs, or any other highly convergent repeat of monomeric or HOR type, both tandem or dispersed. We illustrated KSA application for HORs in human genome and determined consensus HORs in the Build 35.1 assembly. In the next step we compute suprachromosomal family classification and CENP-B box / pJalpha distributions for HORs. In the case of less convergent repeats, like for example monomeric alpha satellite (20-40% divergence), we searched for optimal compact key string using frequency method and developed a concept of composite key string (GAAAC--CTTTG) or flexible relaxation (28 bp key string) which provides both monomeric alpha satellites as well as alpha monomer segmentation of internal HOR structure. This method is convenient also for study of R-strand (direct) / S-strand (reverse complement) alpha monomer alternations. Using KSA we identified 16 alternating regions of R-strand and S-strand monomers in one contig in choromosome 7. Use of CENP-B box and/or pJalpha motif as key string is suitable both for identification of HORs and monomeric pattern as well as for studies of CENP-B box / pJalpha distribution. As an example of application of KSA to sequences outside of HOR regions we present our finding of a tandem with highly convergent 3434-bp Long monomer in chromosome 5 (divergence less then 0.3%).
Collapse
Affiliation(s)
- Vladimir Paar
- Faculty of Science, University of Zagreb, Bijenička 32, 10000 Zagreb, Croatia
| | - Ivan Basar
- Faculty of Science, University of Zagreb, Bijenička 32, 10000 Zagreb, Croatia
| | - Marija Rosandić
- Department of Internal Medicine,
University Hospital Rebro, Kišpatićeva 12, 10000 Zagreb, Croatia
| | - Matko Glunčić
- Faculty of Science, University of Zagreb, Bijenička 32, 10000 Zagreb, Croatia
| |
Collapse
|
5
|
Rosandić M, Paar V, Basar I, Gluncić M, Pavin N, Pilas I. CENP-B box and pJalpha sequence distribution in human alpha satellite higher-order repeats (HOR). Chromosome Res 2006; 14:735-53. [PMID: 17115329 DOI: 10.1007/s10577-006-1078-x] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2005] [Accepted: 06/03/2006] [Indexed: 01/13/2023]
Abstract
Using our Key String Algorithm (KSA) to analyze Build 35.1 assembly we determined consensus alpha satellite higher-order repeats (HOR) and consensus distributions of CENP-B box and pJalpha motif in human chromosomes 1, 4, 5, 7, 8, 10, 11, 17, 19, and X. We determined new suprachromosomal family (SF) assignments: SF5 for 13mer (2211 bp), SF5 for 13mer (2214 bp), SF2 for 11mer (1869 bp), SF1 for 18mer (3058 bp), SF3 for 12mer (2047 bp), SF3 for 14mer (2379 bp), and SF5 for 17mer (2896 bp) in chromosomes 4, 5, 8, 10, 11, 17, and 19, respectively. In chromosome 5 we identified SF5 13mer without any CENP-B box and pJalpha motif, highly homologous (96%) to 13mer in chromosome 19. Additionally, in chromosome 19 we identified new SF5 17mer with one CENP-B box and pJalpha motif, aligned to 13mer by deleting four monomers. In chromosome 11 we identified SF3 12mer, homologous to 12mer in chromosome X. In chromosome 10 we identified new SF1 18mer with eight CENP-B boxes in every other monomer (except one). In chromosome 4 we identified new SF5 13mer with CENP-B box in three consecutive monomers. We found four exceptions to the rule that CENP-B box belongs to type B and pJalpha motif to type A monomers.
Collapse
Affiliation(s)
- Marija Rosandić
- Department of Internal Medicine, University Hospital Rebro, University of Zagreb, 10000, Zagreb, Croatia
| | | | | | | | | | | |
Collapse
|
6
|
Kazakov AE, Shepelev VA, Tumeneva IG, Alexandrov AA, Yurov YB, Alexandrov IA. Interspersed repeats are found predominantly in the “old” α satellite families. Genomics 2003; 82:619-27. [PMID: 14611803 DOI: 10.1016/s0888-7543(03)00182-4] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
The biased distribution of dispersed repeat insertions in various types of primate specific alpha satellites (AS) is being discussed in the literature in relation to the modes of AS evolution and their possible roles in maintenance and disruption of functional centromeres. However, such a bias has not been properly documented on a genome-wide scale so far. In this work, using a representative sample of about 100 insertions we show that the "old" AS contains at least 10 times more dispersed repeats than the "new" one. In the new arrays insertions accumulate mostly in poorly homogenized areas, presumably in the edges, and in the old AS, throughout the whole array length. Dating of L1 insertions in the old AS revealed that their massive accumulation started at or after the time when the new AS emerged and expanded in the genome and the centromere function had shifted to the new AS arrays.
Collapse
Affiliation(s)
- Alexei E Kazakov
- Mental Health Research Center, Russian Academy of Medical Sciences, Zagorodnoe sh.2, Moscow 113152, Russia
| | | | | | | | | | | |
Collapse
|
7
|
Mashkova TD, Oparina NY, Lacroix MH, Fedorova LI, G Tumeneva I, Zinovieva OL, Kisselev LL. Structural rearrangements and insertions of dispersed elements in pericentromeric alpha satellites occur preferably at kinkable DNA sites. J Mol Biol 2001; 305:33-48. [PMID: 11114245 DOI: 10.1006/jmbi.2000.4270] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Centromeric region of human chromosome 21 comprises two long alphoid DNA arrays: the well homogenized and CENP-B box-rich alpha21-I and the alpha21-II, containing a set of less homogenized and CENP-B box-poor subfamilies located closer to the short arm of the chromosome. Continuous alphoid fragment of 100 monomers bordering the non-satellite sequences in human chromosome 21 was mapped to the pericentromeric short arm region by fluorescence in situ hybridization (alpha21-II locus). The alphoid sequence contained several rearrangements including five large deletions within monomers and insertions of three truncated L1 elements. No binding sites for centromeric protein CENP-B were found. We analyzed sequences with alphoid/non-alphoid junctions selectively screened from current databases and revealed various rearrangements disrupting the regular tandem alphoid structure, namely, deletions, duplications, inversions, expansions of short oligonucleotide motifs and insertions of different dispersed elements. The detailed analysis of more than 1100 alphoid monomers from junction regions showed that the vast majority of structural alterations and joinings with non-alphoid DNAs occur in alpha satellite families lacking CENP-B boxes. Most analyzed events were found in sequences located toward the edges of the centromeric alphoid arrays. Different dispersed elements were inserted into alphoid DNA at kinkable dinucleotides (TG, CA or TA) situated between pyrimidine/purine tracks. DNA rearrangements resulting from different processes such as recombination and replication occur at kinkable DNA sites alike insertions but irrespectively of the occurrence of pyrimidine/purine tracks. It seems that kinkable dinucleotides TG, CA and TA are part of recognition signals for many proteins involved in recombination, replication, and insertional events. Alphoid DNA is a good model for studying these processes.
Collapse
MESH Headings
- Alu Elements/genetics
- Autoantigens
- Base Sequence
- Binding Sites
- Centromere/chemistry
- Centromere/genetics
- Centromere/metabolism
- Centromere Protein B
- Chromosomal Proteins, Non-Histone/metabolism
- Chromosome Deletion
- Chromosome Inversion
- Chromosomes, Human, Pair 21/chemistry
- Chromosomes, Human, Pair 21/genetics
- Chromosomes, Human, Pair 21/metabolism
- Computational Biology
- Crossing Over, Genetic/genetics
- DNA Replication/genetics
- DNA, Satellite/chemistry
- DNA, Satellite/genetics
- DNA, Satellite/metabolism
- DNA-Binding Proteins
- Databases as Topic
- Dinucleotide Repeats/genetics
- Humans
- In Situ Hybridization, Fluorescence
- Lymphocytes
- Mutagenesis, Insertional/genetics
- Mutation/genetics
- Nucleic Acid Conformation
- Polymerase Chain Reaction
- Recombination, Genetic/genetics
Collapse
Affiliation(s)
- T D Mashkova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilov Str., Moscow, 117984, Russia.
| | | | | | | | | | | | | |
Collapse
|
8
|
Mashkova T, Oparina N, Alexandrov I, Zinovieva O, Marusina A, Yurov Y, Lacroix MH, Kisselev L. Unequal cross-over is involved in human alpha satellite DNA rearrangements on a border of the satellite domain. FEBS Lett 1998; 441:451-7. [PMID: 9891989 DOI: 10.1016/s0014-5793(98)01600-7] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
It can be invoked from the theory of tandem repeat homogenization that DNA on a satellite/non-satellite border may carry sequence marks of molecular processes basic to satellite evolution. We have sequenced a continuous 17-kb alpha satellite fragment bordering the non-satellite in human chromosome 21, which is devoid of higher-order repeated structure, contains multiple rearrangements, and exhibits higher divergence of monomers towards the border, indicating the lack of efficient homogenization. Remarkably, monomers have been found with mutually supplementary deletions matching each other as reciprocal products of unequal recombination, which provide evidence for unequal cross-over as a mechanism generating deletions in satellite DNA.
Collapse
Affiliation(s)
- T Mashkova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow.
| | | | | | | | | | | | | | | |
Collapse
|
9
|
Yurov YB, Soloviev IV, Vorsanova SG, Marcais B, Roizes G, Lewis R. High resolution multicolor fluorescence in situ hybridization using cyanine and fluorescein dyes: rapid chromosome identification by directly fluorescently labeled alphoid DNA probes. Hum Genet 1996; 97:390-8. [PMID: 8786090 DOI: 10.1007/bf02185780] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
We tested DNA probes directly labeled by fluorescently labeled nucleotides (Cy3-dCTP, Cy5-dCTP, FluorX-dCTP) for high resolution uni- and multicolor detection of human chromosomes and analysis of centromeric DNA organization by in situ hybridization. Alpha-satellite DNA probes specific to chromosomes 1, 2, 3, 4 + 9, 5 + 19, 6, 7, 8, 10, 11, 13 + 21, 14 + 22, 15, 16, 17, 18, 20, 22, X and Y were suitable for the accurate identification of human chromosomes in metaphase and interphase cells. Cy3-labeled probes had several advantages: (1) a high level of fluorescence (5-10 times more compared with fluorescein-labeled probes); (2) a low level of fluorescence in solution, allowing the detection of target chromosomes in situ during hybridization without the washing of slides; and (3) high resistance to photobleaching during prolonged (1-2 h) exposure to strong light, thus allowing the use of a high energy mercury lamp or a long integration time during image acquisition in digital imaging microscopy for the determination of weak signals. For di- and multicolor fluorescence in situ hybridization (FISH), we successfully used different combinations of directly fluorophorated probes with preservation of images by conventional microscopy or by digital imaging microscopy. FluorX and Cy3 dyes allowed the use of cosmid probes for mapping in a one-step hybridization experiment. Cyanine-labeled fluorophorated DNA probes offer additional possibilities for rapid chromosome detection during a simple 15-min FISH procedure, and can be recommended for basic research and clinical studies, utilizing FISH.
Collapse
Affiliation(s)
- Y B Yurov
- National Research Centre of Mental Health, Russian Academy of Medical Sciences, Moscow, Russia
| | | | | | | | | | | |
Collapse
|
10
|
Volobouev V, Vogt N, Viegas-Péquignot E, Malfoy B, Dutrillaux B. Characterization and chromosomal location of two repeated DNAs in three Gerbillus species. Chromosoma 1995; 104:252-9. [PMID: 8565701 DOI: 10.1007/bf00352256] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Two tandemly repeated DNA sequences of Gerbillus nigeriae (Rodentia) (GN1 and GN2) were isolated and characterized. Both share a 36bp repeated unit, which includes a 20bp motif also found in primate alphoid and other repeated DNAs. The localization of GN1 and GN2 sequences on metaphase chromosomes of three Gerbillus species, G. nigeriae, G. aureus and G. nanus, was studied by fluorescence in situ hybridization (FISH). In the G. nigeriae and G. aureus karyotypes, which were shown to possess large amounts of heterochromatin and to have undergone multiple rearrangements during evolution, both GN1 and GN2 sequences were observed at various chromosomal sites: centromeric, telomeric and intercalary. In contrast, the karyotypically stable G. nanus, which does not possess large amounts of heterochromatin and seems to be a more ancestral species, possesses only GN1 sequences, localized in the juxtacentromeric regions.
Collapse
Affiliation(s)
- V Volobouev
- Institut Curie-CNRS UMR 147, 26, rue d Ulm, F-75231 Paris Cedex 05, France
| | | | | | | | | |
Collapse
|