1
|
Schiabor Barrett KM, Masnick M, Hatchell KE, Savatt JM, Banet N, Buchanan A, Willard HF. Clinical validation of genomic functional screen data: analysis of observed BRCA1 variants in an unselected population cohort. Human Genetics and Genomics Advances 2022; 3:100086. [PMID: 35128484 PMCID: PMC8804171 DOI: 10.1016/j.xhgg.2022.100086] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Accepted: 01/06/2022] [Indexed: 12/02/2022] Open
Abstract
Functional assessment of genomic variants provides a promising approach to systematically examine the potential pathogenicity of variants independent of associated clinical data. However, making such conclusions requires validation with appropriate clinical findings. To this end, here, we use variant calls from exome data and BRCA1-related cancer diagnoses from electronic health records to demonstrate an association between published laboratory-based functional designations of BRCA1 variants and BRCA1-related cancer diagnoses in an unselected cohort of patient-participants. These findings validate and support further exploration of functional assay data to better understand the pathogenicity of rare variants. This information may be valuable in the context of healthy population genomic screening, where many rare, potentially pathogenic variants may not have sufficient associated clinical data to inform their interpretation directly.
Collapse
|
2
|
Buchanan AH, Lester Kirchner H, Schwartz MLB, Kelly MA, Schmidlen T, Jones LK, Hallquist MLG, Rocha H, Betts M, Schwiter R, Butry L, Lazzeri AL, Frisbie LR, Rahm AK, Hao J, Willard HF, Martin CL, Ledbetter DH, Williams MS, Sturm AC. Clinical outcomes of a genomic screening program for actionable genetic conditions. Genet Med 2020; 22:1874-1882. [PMID: 32601386 PMCID: PMC7605431 DOI: 10.1038/s41436-020-0876-4] [Citation(s) in RCA: 69] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Revised: 06/12/2020] [Accepted: 06/12/2020] [Indexed: 01/07/2023] Open
Abstract
Purpose Three genetic conditions—hereditary breast and ovarian cancer syndrome, Lynch syndrome, and familial hypercholesterolemia—have tier 1 evidence for interventions that reduce morbidity and mortality, prompting proposals to screen unselected populations for these conditions. We examined the impact of genomic screening on risk management and early detection in an unselected population. Methods Observational study of electronic health records (EHR) among individuals in whom a pathogenic/likely pathogenic variant in a tier 1 gene was discovered through Geisinger’s MyCode project. EHR of all eligible participants was evaluated for a prior genetic diagnosis and, among participants without such a diagnosis, relevant personal/family history, postdisclosure clinical diagnoses, and postdisclosure risk management. Results Eighty-seven percent of participants (305/351) did not have a prior genetic diagnosis of their tier 1 result. Of these, 65% had EHR evidence of relevant personal and/or family history of disease. Of 255 individuals eligible to have risk management, 70% (n = 179) had a recommended risk management procedure after results disclosure. Thirteen percent of participants (41/305) received a relevant clinical diagnosis after results disclosure. Conclusion Genomic screening programs can identify previously unrecognized individuals at increased risk of cancer and heart disease and facilitate risk management and early cancer detection.
Collapse
Affiliation(s)
| | - H Lester Kirchner
- Department of Population Health Sciences, Geisinger, Danville, PA, USA
| | | | | | - Tara Schmidlen
- Genomic Medicine Institute, Geisinger, Danville, PA, USA
| | - Laney K Jones
- Genomic Medicine Institute, Geisinger, Danville, PA, USA
| | | | - Heather Rocha
- Genomic Medicine Institute, Geisinger, Danville, PA, USA
| | - Megan Betts
- Genomic Medicine Institute, Geisinger, Danville, PA, USA
| | | | - Loren Butry
- Genomic Medicine Institute, Geisinger, Danville, PA, USA
| | | | | | | | - Jing Hao
- Genomic Medicine Institute, Geisinger, Danville, PA, USA.,Department of Population Health Sciences, Geisinger, Danville, PA, USA
| | - Huntington F Willard
- Genomic Medicine Institute, Geisinger, Danville, PA, USA.,Genome Medical, Durham, NC, USA
| | - Christa L Martin
- Genomic Medicine Institute, Geisinger, Danville, PA, USA.,Autism and Developmental Medicine Institute, Geisinger, Lewisburg, PA, USA
| | - David H Ledbetter
- Genomic Medicine Institute, Geisinger, Danville, PA, USA.,Autism and Developmental Medicine Institute, Geisinger, Lewisburg, PA, USA
| | | | - Amy C Sturm
- Genomic Medicine Institute, Geisinger, Danville, PA, USA
| |
Collapse
|
3
|
Williams MS, Buchanan AH, Davis FD, Faucett WA, Hallquist MLG, Leader JB, Martin CL, McCormick CZ, Meyer MN, Murray MF, Rahm AK, Schwartz MLB, Sturm AC, Wagner JK, Williams JL, Willard HF, Ledbetter DH. Patient-Centered Precision Health In A Learning Health Care System: Geisinger's Genomic Medicine Experience. Health Aff (Millwood) 2019; 37:757-764. [PMID: 29733722 DOI: 10.1377/hlthaff.2017.1557] [Citation(s) in RCA: 62] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Health care delivery is increasingly influenced by the emerging concepts of precision health and the learning health care system. Although not synonymous with precision health, genomics is a key enabler of individualized care. Delivering patient-centered, genomics-informed care based on individual-level data in the current national landscape of health care delivery is a daunting challenge. Problems to overcome include data generation, analysis, storage, and transfer; knowledge management and representation for patients and providers at the point of care; process management; and outcomes definition, collection, and analysis. Development, testing, and implementation of a genomics-informed program requires multidisciplinary collaboration and building the concepts of precision health into a multilevel implementation framework. Using the principles of a learning health care system provides a promising solution. This article describes the implementation of population-based genomic medicine in an integrated learning health care system-a working example of a precision health program.
Collapse
Affiliation(s)
- Marc S Williams
- Marc S. Williams ( ) is director of the Genomic Medicine Institute, Geisinger, in Danville, Pennsylvania
| | - Adam H Buchanan
- Adam H. Buchanan is an assistant professor at the Genomic Medicine Institute, Geisinger
| | - F Daniel Davis
- F. Daniel Davis is director of the Center for Bioethics and Healthcare Policy, Geisinger
| | - W Andrew Faucett
- W. Andrew Faucett is a professor at the Genomic Medicine Institute, Geisinger
| | - Miranda L G Hallquist
- Miranda L. G. Hallquist is a genetic counselor at the Genomic Medicine Institute, Geisinger
| | - Joseph B Leader
- Joseph B. Leader is director of the Phenomic Analytics and Clinical Data Core, Geisinger
| | - Christa L Martin
- Christa L. Martin is director of the Autism and Developmental Medicine Institute, Geisinger
| | - Cara Z McCormick
- Cara Z. McCormick is a senior assistant at the Genomic Medicine Institute, Geisinger
| | - Michelle N Meyer
- Michelle N. Meyer is associate director for research ethics at the Center for Translational Bioethics and Health Care Policy, Geisinger
| | - Michael F Murray
- Michael F. Murray was a physician in the Genomic Medicine Institute, Geisinger, at the time this work was completed. He is now at the Yale School of Medicine
| | - Alanna K Rahm
- Alanna K. Rahm is an assistant professor at the Genomic Medicine Institute, Geisinger
| | - Marci L B Schwartz
- Marci L. B. Schwartz is a genetic counselor at the Genomic Medicine Institute, Geisinger
| | - Amy C Sturm
- Amy C. Sturm is a professor at the Genomic Medicine Institute, Geisinger
| | - Jennifer K Wagner
- Jennifer K. Wagner is associate director of bioethics research, Center for Translational Bioethics and Health Care Policy, Geisinger
| | - Janet L Williams
- Janet L. Williams is director of research genetic counselors, Genomic Medicine Institute, Geisinger
| | - Huntington F Willard
- Huntington F. Willard is director of the National Precision Health Institute, Geisinger
| | - David H Ledbetter
- David H. Ledbetter is executive vice president and chief scientific officer, Geisinger
| |
Collapse
|
4
|
Manickam K, Buchanan AH, Schwartz MLB, Hallquist MLG, Williams JL, Rahm AK, Rocha H, Savatt JM, Evans AE, Butry LM, Lazzeri AL, Lindbuchler DM, Flansburg CN, Leeming R, Vogel VG, Lebo MS, Mason-Suares HM, Hoskinson DC, Abul-Husn NS, Dewey FE, Overton JD, Reid JG, Baras A, Willard HF, McCormick CZ, Krishnamurthy SB, Hartzel DN, Kost KA, Lavage DR, Sturm AC, Frisbie LR, Person TN, Metpally RP, Giovanni MA, Lowry LE, Leader JB, Ritchie MD, Carey DJ, Justice AE, Kirchner HL, Faucett WA, Williams MS, Ledbetter DH, Murray MF. Exome Sequencing-Based Screening for BRCA1/2 Expected Pathogenic Variants Among Adult Biobank Participants. JAMA Netw Open 2018; 1:e182140. [PMID: 30646163 PMCID: PMC6324494 DOI: 10.1001/jamanetworkopen.2018.2140] [Citation(s) in RCA: 140] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
IMPORTANCE Detection of disease-associated variants in the BRCA1 and BRCA2 (BRCA1/2) genes allows for cancer prevention and early diagnosis in high-risk individuals. OBJECTIVES To identify pathogenic and likely pathogenic (P/LP) BRCA1/2 variants in an unselected research cohort, and to characterize the features associated with P/LP variants. DESIGN, SETTING, AND PARTICIPANTS This is a cross-sectional study of adult volunteers (n = 50 726) who underwent exome sequencing at a single health care system (Geisinger Health System, Danville, Pennsylvania) from January 1, 2014, to March 1, 2016. Participants are part of the DiscovEHR cohort and were identified through the Geisinger MyCode Community Health Initiative. They consented to a research protocol that included sequencing and return of actionable test results. Clinical data from electronic health records and clinical visits were correlated with variants. Comparisons were made between those with (cases) and those without (controls) P/LP variants in BRCA1/2. MAIN OUTCOMES Prevalence of P/LP BRCA1/2 variants in cohort, proportion of variant carriers not previously ascertained through clinical testing, and personal and family history of relevant cancers among BRCA1/2 variant carriers and noncarriers. RESULTS Of the 50 726 health system patients who underwent exome sequencing, 50 459 (99.5%) had no expected pathogenic BRCA1/2 variants and 267 (0.5%) were BRCA1/2 carriers. Of the 267 cases (148 [55.4%] were women and 119 [44.6%] were men with a mean [range] age of 58.9 [23-90] years), 183 (68.5%) received clinically confirmed results in their electronic health record. Among the 267 participants with P/LP BRCA1/2 variants, 219 (82.0%) had no prior clinical testing, 95 (35.6%) had BRCA1 variants, and 172 (64.4%) had BRCA2 variants. Syndromic cancer diagnoses were present in 11 (47.8%) of the 23 deceased BRCA1/2 carriers and in 56 (20.9%) of all 267 BRCA1/2 carriers. Among women, 31 (20.9%) of 148 variant carriers had a personal history of breast cancer, compared with 1554 (5.2%) of 29 880 noncarriers (odds ratio [OR], 5.95; 95% CI, 3.88-9.13; P < .001). Ovarian cancer history was present in 15 (10.1%) of 148 variant carriers and in 195 (0.6%) of 29 880 variant noncarriers (OR, 18.30; 95% CI, 10.48-31.4; P < .001). Among 89 BRCA1/2 carriers without prior testing but with comprehensive personal and family history data, 44 (49.4%) did not meet published guidelines for clinical testing. CONCLUSIONS AND RELEVANCE This study found that compared with previous clinical care, exome sequencing-based screening identified 5 times as many individuals with P/LP BRCA1/2 variants. These findings suggest that genomic screening may identify BRCA1/2-associated cancer risk that might otherwise remain undetected within health care systems and may provide opportunities to reduce morbidity and mortality in patients.
Collapse
Affiliation(s)
- Kandamurugu Manickam
- Molecular and Human Genetics Department, Nationwide Children’s Hospital, Columbus, Ohio
- Genomic Medicine Institute, Geisinger, Danville, Pennsylvania
| | | | | | | | | | | | - Heather Rocha
- Genomic Medicine Institute, Geisinger, Danville, Pennsylvania
| | | | - Alyson E. Evans
- Genomic Medicine Institute, Geisinger, Danville, Pennsylvania
| | - Loren M. Butry
- Genomic Medicine Institute, Geisinger, Danville, Pennsylvania
| | | | | | | | | | - Victor G. Vogel
- Genomic Medicine Institute, Geisinger, Danville, Pennsylvania
| | - Matthew S. Lebo
- Laboratory for Molecular Medicine, Partners HealthCare, Cambridge, Massachusetts
| | | | - Derick C. Hoskinson
- Laboratory for Molecular Medicine, Partners HealthCare, Cambridge, Massachusetts
| | | | | | | | | | - Aris Baras
- Regeneron Genetics Center, Tarrytown, New York
| | | | | | | | | | - Korey A. Kost
- Genomic Medicine Institute, Geisinger, Danville, Pennsylvania
| | | | - Amy C. Sturm
- Genomic Medicine Institute, Geisinger, Danville, Pennsylvania
| | | | - T. Nate Person
- Genomic Medicine Institute, Geisinger, Danville, Pennsylvania
| | | | | | - Lacy E. Lowry
- Genomic Medicine Institute, Geisinger, Danville, Pennsylvania
| | | | - Marylyn D. Ritchie
- Genomic Medicine Institute, Geisinger, Danville, Pennsylvania
- Center for Translational Bioinformatics, University of Pennsylvania, Philadelphia
| | - David J. Carey
- Genomic Medicine Institute, Geisinger, Danville, Pennsylvania
| | - Anne E. Justice
- Genomic Medicine Institute, Geisinger, Danville, Pennsylvania
| | | | | | | | | | - Michael F. Murray
- Genomic Medicine Institute, Geisinger, Danville, Pennsylvania
- Department of Genetics, Yale School of Medicine, New Haven, Connecticut
| |
Collapse
|
5
|
Jain M, Olsen HE, Turner DJ, Stoddart D, Bulazel KV, Paten B, Haussler D, Willard HF, Akeson M, Miga KH. Linear assembly of a human centromere on the Y chromosome. Nat Biotechnol 2018; 36:321-323. [PMID: 29553574 PMCID: PMC5886786 DOI: 10.1038/nbt.4109] [Citation(s) in RCA: 167] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2017] [Accepted: 02/22/2018] [Indexed: 01/21/2023]
Abstract
The human genome reference sequence remains incomplete owing to the challenge of assembling long tracts of near-identical tandem repeats in centromeres. We implemented a nanopore sequencing strategy to generate high-quality reads that span hundreds of kilobases of highly repetitive DNA in a human Y chromosome centromere. Combining these data with short-read variant validation, we assembled and characterized the centromeric region of a human Y chromosome.
Collapse
Affiliation(s)
- Miten Jain
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California, USA
| | - Hugh E Olsen
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California, USA
| | | | | | - Kira V Bulazel
- Duke Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California, USA
| | - David Haussler
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California, USA
| | - Huntington F Willard
- Duke Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, USA.,Geisinger National, Bethesda, Maryland, USA
| | - Mark Akeson
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California, USA
| | - Karen H Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California, USA.,Duke Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, USA
| |
Collapse
|
6
|
Altemose N, Miga KH, Maggioni M, Willard HF. Genomic characterization of large heterochromatic gaps in the human genome assembly. PLoS Comput Biol 2014; 10:e1003628. [PMID: 24831296 PMCID: PMC4022460 DOI: 10.1371/journal.pcbi.1003628] [Citation(s) in RCA: 81] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2013] [Accepted: 03/26/2014] [Indexed: 01/24/2023] Open
Abstract
The largest gaps in the human genome assembly correspond to multi-megabase heterochromatic regions composed primarily of two related families of tandem repeats, Human Satellites 2 and 3 (HSat2,3). The abundance of repetitive DNA in these regions challenges standard mapping and assembly algorithms, and as a result, the sequence composition and potential biological functions of these regions remain largely unexplored. Furthermore, existing genomic tools designed to predict consensus-based descriptions of repeat families cannot be readily applied to complex satellite repeats such as HSat2,3, which lack a consistent repeat unit reference sequence. Here we present an alignment-free method to characterize complex satellites using whole-genome shotgun read datasets. Utilizing this approach, we classify HSat2,3 sequences into fourteen subfamilies and predict their chromosomal distributions, resulting in a comprehensive satellite reference database to further enable genomic studies of heterochromatic regions. We also identify 1.3 Mb of non-repetitive sequence interspersed with HSat2,3 across 17 unmapped assembly scaffolds, including eight annotated gene predictions. Finally, we apply our satellite reference database to high-throughput sequence data from 396 males to estimate array size variation of the predominant HSat3 array on the Y chromosome, confirming that satellite array sizes can vary between individuals over an order of magnitude (7 to 98 Mb) and further demonstrating that array sizes are distributed differently within distinct Y haplogroups. In summary, we present a novel framework for generating initial reference databases for unassembled genomic regions enriched with complex satellite DNA, and we further demonstrate the utility of these reference databases for studying patterns of sequence variation within human populations. At least 5–10% of the human genome remains unassembled, unmapped, and poorly characterized. The reference assembly annotates these missing regions as multi-megabase heterochromatic gaps, found primarily near centromeres and on the short arms of the acrocentric chromosomes. This missing fraction of the genome consists predominantly of long arrays of near-identical tandem repeats called satellite DNA. Due to the repetitive nature of satellite DNA, sequence assembly algorithms cannot uniquely align overlapping sequence reads, and thus satellite-rich domains have been omitted from the reference assembly and from most genome-wide studies of variation and function. Existing methods for analyzing some satellite DNAs cannot be easily extended to a large portion of satellites whose repeat structures are complex and largely uncharacterized, such as Human Satellites 2 and 3 (HSat2,3). Here we characterize HSat2,3 using a novel approach that does not depend on having a well-defined repeat structure. By classifying genome-wide HSat2,3 sequences into subfamilies and localizing them to chromosomes, we have generated an initial HSat2,3 genomic reference, which serves as a critical foundation for future studies of variation and function in these regions. This approach should be generally applicable to other classes of satellite DNA, in both the human genome and other complex genomes.
Collapse
Affiliation(s)
- Nicolas Altemose
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America
| | - Karen H. Miga
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America
- * E-mail:
| | - Mauro Maggioni
- Department of Mathematics, Duke University, Durham, North Carolina, United States of America
| | - Huntington F. Willard
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America
| |
Collapse
|
7
|
Brownstein CA, Beggs AH, Homer N, Merriman B, Yu TW, Flannery KC, DeChene ET, Towne MC, Savage SK, Price EN, Holm IA, Luquette LJ, Lyon E, Majzoub J, Neupert P, McCallie D, Szolovits P, Willard HF, Mendelsohn NJ, Temme R, Finkel RS, Yum SW, Medne L, Sunyaev SR, Adzhubey I, Cassa CA, de Bakker PIW, Duzkale H, Dworzyński P, Fairbrother W, Francioli L, Funke BH, Giovanni MA, Handsaker RE, Lage K, Lebo MS, Lek M, Leshchiner I, MacArthur DG, McLaughlin HM, Murray MF, Pers TH, Polak PP, Raychaudhuri S, Rehm HL, Soemedi R, Stitziel NO, Vestecka S, Supper J, Gugenmus C, Klocke B, Hahn A, Schubach M, Menzel M, Biskup S, Freisinger P, Deng M, Braun M, Perner S, Smith RJH, Andorf JL, Huang J, Ryckman K, Sheffield VC, Stone EM, Bair T, Black-Ziegelbein EA, Braun TA, Darbro B, DeLuca AP, Kolbe DL, Scheetz TE, Shearer AE, Sompallae R, Wang K, Bassuk AG, Edens E, Mathews K, Moore SA, Shchelochkov OA, Trapane P, Bossler A, Campbell CA, Heusel JW, Kwitek A, Maga T, Panzer K, Wassink T, Van Daele D, Azaiez H, Booth K, Meyer N, Segal MM, Williams MS, Tromp G, White P, Corsmeier D, Fitzgerald-Butt S, Herman G, Lamb-Thrush D, McBride KL, Newsom D, Pierson CR, Rakowsky AT, Maver A, Lovrečić L, Palandačić A, Peterlin B, Torkamani A, Wedell A, Huss M, Alexeyenko A, Lindvall JM, Magnusson M, Nilsson D, Stranneheim H, Taylan F, Gilissen C, Hoischen A, van Bon B, Yntema H, Nelen M, Zhang W, Sager J, Zhang L, Blair K, Kural D, Cariaso M, Lennon GG, Javed A, Agrawal S, Ng PC, Sandhu KS, Krishna S, Veeramachaneni V, Isakov O, Halperin E, Friedman E, Shomron N, Glusman G, Roach JC, Caballero J, Cox HC, Mauldin D, Ament SA, Rowen L, Richards DR, San Lucas FA, Gonzalez-Garay ML, Caskey CT, Bai Y, Huang Y, Fang F, Zhang Y, Wang Z, Barrera J, Garcia-Lobo JM, González-Lamuño D, Llorca J, Rodriguez MC, Varela I, Reese MG, De La Vega FM, Kiruluta E, Cargill M, Hart RK, Sorenson JM, Lyon GJ, Stevenson DA, Bray BE, Moore BM, Eilbeck K, Yandell M, Zhao H, Hou L, Chen X, Yan X, Chen M, Li C, Yang C, Gunel M, Li P, Kong Y, Alexander AC, Albertyn ZI, Boycott KM, Bulman DE, Gordon PMK, Innes AM, Knoppers BM, Majewski J, Marshall CR, Parboosingh JS, Sawyer SL, Samuels ME, Schwartzentruber J, Kohane IS, Margulies DM. An international effort towards developing standards for best practices in analysis, interpretation and reporting of clinical genome sequencing results in the CLARITY Challenge. Genome Biol 2014; 15:R53. [PMID: 24667040 PMCID: PMC4073084 DOI: 10.1186/gb-2014-15-3-r53] [Citation(s) in RCA: 90] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2013] [Accepted: 03/25/2014] [Indexed: 12/30/2022] Open
Abstract
Background There is tremendous potential for genome sequencing to improve clinical diagnosis and care once it becomes routinely accessible, but this will require formalizing research methods into clinical best practices in the areas of sequence data generation, analysis, interpretation and reporting. The CLARITY Challenge was designed to spur convergence in methods for diagnosing genetic disease starting from clinical case history and genome sequencing data. DNA samples were obtained from three families with heritable genetic disorders and genomic sequence data were donated by sequencing platform vendors. The challenge was to analyze and interpret these data with the goals of identifying disease-causing variants and reporting the findings in a clinically useful format. Participating contestant groups were solicited broadly, and an independent panel of judges evaluated their performance. Results A total of 30 international groups were engaged. The entries reveal a general convergence of practices on most elements of the analysis and interpretation process. However, even given this commonality of approach, only two groups identified the consensus candidate variants in all disease cases, demonstrating a need for consistent fine-tuning of the generally accepted methods. There was greater diversity of the final clinical report content and in the patient consenting process, demonstrating that these areas require additional exploration and standardization. Conclusions The CLARITY Challenge provides a comprehensive assessment of current practices for using genome sequencing to diagnose and report genetic diseases. There is remarkable convergence in bioinformatic techniques, but medical interpretation and reporting are areas that require further development by many groups.
Collapse
|
8
|
Haga SB, Barry WT, Mills R, Svetkey L, Suchindran S, Willard HF, Ginsburg GS. Impact of delivery models on understanding genomic risk for type 2 diabetes. Public Health Genomics 2014; 17:95-104. [PMID: 24577154 DOI: 10.1159/000358413] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2013] [Accepted: 12/19/2013] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Genetic information, typically communicated in-person by genetic counselors, can be challenging to comprehend; delivery of this information online--as is becoming more common--has the potential of increasing these challenges. METHODS To address the impact of the mode of delivery of genomic risk information, 300 individuals were recruited from the general public and randomized to receive genomic risk information for type 2 diabetes mellitus in-person from a board-certified genetic counselor or online through the testing company's website. RESULTS Participants were asked to indicate their genomic risk and overall lifetime risk as reported on their test report as well as to interpret their genomic risk (increased, decreased, or same as population). For each question, 59% of participants correctly indicated their risk. Participants who received their results in-person were more likely than those who reviewed their results on-line to correctly interpret their genomic risk (72 vs. 47%, p = 0.0002) and report their actual genomic risk (69 vs. 49%, p = 0.002). CONCLUSIONS The delivery of personal genomic risk through a trained health professional resulted in significantly higher comprehension. Therefore, if the online delivery of genomic test results is to become more widespread, further evaluation of this method of communication may be needed to ensure the effective presentation of results to promote comprehension.
Collapse
Affiliation(s)
- S B Haga
- Duke Institute for Genome Sciences and Policy, Duke University, Durham, N.C., USA
| | | | | | | | | | | | | |
Collapse
|
9
|
Abstract
The human genome sequence remains incomplete, with multimegabase-sized gaps representing the endogenous centromeres and other heterochromatic regions. Available sequence-based studies within these sites in the genome have demonstrated a role in centromere function and chromosome pairing, necessary to ensure proper chromosome segregation during cell division. A common genomic feature of these regions is the enrichment of long arrays of near-identical tandem repeats, known as satellite DNAs, which offer a limited number of variant sites to differentiate individual repeat copies across millions of bases. This substantial sequence homogeneity challenges available assembly strategies and, as a result, centromeric regions are omitted from ongoing genomic studies. To address this problem, we utilize monomer sequence and ordering information obtained from whole-genome shotgun reads to model two haploid human satellite arrays on chromosomes X and Y, resulting in an initial characterization of 3.83 Mb of centromeric DNA within an individual genome. To further expand the utility of each centromeric reference sequence model, we evaluate sites within the arrays for short-read mappability and chromosome specificity. Because satellite DNAs evolve in a concerted manner, we use these centromeric assemblies to assess the extent of sequence variation among 366 individuals from distinct human populations. We thus identify two satellite array variants in both X and Y centromeres, as determined by array length and sequence composition. This study provides an initial sequence characterization of a regional centromere and establishes a foundation to extend genomic characterization to these sites as well as to other repeat-rich regions within complex genomes.
Collapse
Affiliation(s)
- Karen H Miga
- Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina 27708, USA
| | | | | | | | | | | |
Collapse
|
10
|
Haga SB, Barry WT, Mills R, Ginsburg GS, Svetkey L, Sullivan J, Willard HF. Public knowledge of and attitudes toward genetics and genetic testing. Genet Test Mol Biomarkers 2013; 17:327-35. [PMID: 23406207 PMCID: PMC3609633 DOI: 10.1089/gtmb.2012.0350] [Citation(s) in RCA: 160] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Variable health literacy and genetic knowledge may pose significant challenges to engaging the general public in personal genomics, specifically with respect to promoting risk comprehension and healthy behaviors. METHODS We are conducting a multistage study of individual responses to genomic risk information for Type 2 diabetes mellitus. A total of 300 individuals were recruited from the general public in Durham, North Carolina: 60% self-identified as White; 70% female; and 65% have a college degree. As part of the baseline survey, we assessed genetic knowledge and attitudes toward genetic testing. RESULTS Scores of factual knowledge of genetics ranged from 50% to 100% (average=84%), with significant differences in relation to racial groups, the education level, and age. Scores were significantly higher on questions pertaining to the inheritance and causes of disease (mean score 90%) compared to scientific questions (mean score 77.4%). Scores on the knowledge survey were significantly higher than scores from European populations. Participants' perceived knowledge of the social consequences of genetic testing was significantly lower than their perceived knowledge of the medical uses of testing. More than half agreed with the statement that testing may affect a person's ability to obtain health insurance (51.3%) and 16% were worried about the consequences of testing for chances of finding a job. CONCLUSIONS Despite the relatively high educational status and genetic knowledge of the study population, we find an imbalance of knowledge between scientific and medical concepts related to genetics as well as between the medical applications and societal consequences of testing, suggesting that more effort is needed to present the benefits, risks, and limitations of genetic testing, particularly, at the social and personal levels, to ensure informed decision making.
Collapse
Affiliation(s)
- Susanne B Haga
- Institute for Genome Sciences and Policy, Duke University, Durham, NC 27708, USA.
| | | | | | | | | | | | | |
Collapse
|
11
|
Abstract
Background Combinations of histone variants and modifications, conceptually representing a histone code, have been proposed to play a significant role in gene regulation and developmental processes in complex organisms. While various mechanisms have been implicated in establishing and maintaining epigenetic patterns at specific locations in the genome, they are generally believed to be independent of primary DNA sequence on a more global scale. Results To address this systematically in the case of the human genome, we have analyzed primary DNA sequences underlying patterns of 19 different methylated histones in human primary T-cells and patterns of three methylated histones across additional human cell lines. We report strong sequence biases associated with most of these histone marks genome-wide in each cell type. Furthermore, the sequence characteristics for such association are distinct for different groups of histone marks. Conclusions These findings provide evidence of an influence of genomic sequence on patterns of histone modification associated with gene expression and chromatin programming, and they suggest that the mechanisms responsible for global histone modifications may interpret genomic sequence in various ways.
Collapse
Affiliation(s)
- Zhong Wang
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, Durham, NC 27708, USA
| | | |
Collapse
|
12
|
Abstract
Background Centromeres are sites of chromosomal spindle attachment during mitosis and meiosis. While the sequence basis for centromere identity remains a subject of considerable debate, one approach is to examine the genomic organization at these active sites that are correlated with epigenetic marks of centromere function. Results We have developed an approach to characterize both satellite and non-satellite centromeric sequences that are missing from current assemblies in complex genomes, using the dog genome as an example. Combining this genomic reference with an epigenetic dataset corresponding to sequences associated with the histone H3 variant centromere protein A (CENP-A), we identify active satellite sequence domains that appear to be both functionally and spatially distinct within the overall definition of satellite families. Conclusions These findings establish a genomic and epigenetic foundation for exploring the functional role of centromeric sequences in the previously sequenced dog genome and provide a model for similar studies within the context of less-characterized genomes.
Collapse
Affiliation(s)
- Karen E Hayden
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, Durham, NC, USA.
| | | |
Collapse
|
13
|
Reddy TE, Gertz J, Pauli F, Kucera KS, Varley KE, Newberry KM, Marinov GK, Mortazavi A, Williams BA, Song L, Crawford GE, Wold B, Willard HF, Myers RM. Effects of sequence variation on differential allelic transcription factor occupancy and gene expression. Genome Res 2012; 22:860-9. [PMID: 22300769 PMCID: PMC3337432 DOI: 10.1101/gr.131201.111] [Citation(s) in RCA: 118] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2011] [Accepted: 02/01/2012] [Indexed: 01/01/2023]
Abstract
A complex interplay between transcription factors (TFs) and the genome regulates transcription. However, connecting variation in genome sequence with variation in TF binding and gene expression is challenging due to environmental differences between individuals and cell types. To address this problem, we measured genome-wide differential allelic occupancy of 24 TFs and EP300 in a human lymphoblastoid cell line GM12878. Overall, 5% of human TF binding sites have an allelic imbalance in occupancy. At many sites, TFs clustered in TF-binding hubs on the same homolog in especially open chromatin. While genetic variation in core TF binding motifs generally resulted in large allelic differences in TF occupancy, most allelic differences in occupancy were subtle and associated with disruption of weak or noncanonical motifs. We also measured genome-wide differential allelic expression of genes with and without heterozygous exonic variants in the same cells. We found that genes with differential allelic expression were overall less expressed both in GM12878 cells and in unrelated human cell lines. Comparing TF occupancy with expression, we found strong association between allelic occupancy and expression within 100 bp of transcription start sites (TSSs), and weak association up to 100 kb from TSSs. Sites of differential allelic occupancy were significantly enriched for variants associated with disease, particularly autoimmune disease, suggesting that allelic differences in TF occupancy give functional insights into intergenic variants associated with disease. Our results have the potential to increase the power and interpretability of association studies by targeting functional intergenic variants in addition to protein coding sequences.
Collapse
Affiliation(s)
- Timothy E. Reddy
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
- Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina 27708, USA
| | - Jason Gertz
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Florencia Pauli
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Katerina S. Kucera
- Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina 27708, USA
| | | | | | - Georgi K. Marinov
- Department of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Ali Mortazavi
- Department of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Brian A. Williams
- Department of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Lingyun Song
- Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina 27708, USA
| | - Gregory E. Crawford
- Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina 27708, USA
| | - Barbara Wold
- Department of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Huntington F. Willard
- Duke Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina 27708, USA
| | - Richard M. Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| |
Collapse
|
14
|
Anderson WA, Amasino RM, Ares M, Banerjee U, Bartel B, Corces VG, Drennan CL, Elgin SCR, Epstein IR, Fanning E, Guillette LJ, Handelsman J, Hatfull GF, Hoy RR, Kelley D, Leinwand LA, Losick R, Lu Y, Lynn DG, Neuhauser C, O'Dowd DK, Olivera T, Pevzner P, Richards-Kortum RR, Rine J, Sah RL, Strobel SA, Walker GC, Walt DR, Warner IM, Wessler S, Willard HF, Zare RN. Competencies: a cure for pre-med curriculum. Science 2011; 334:760-1. [PMID: 22076362 DOI: 10.1126/science.334.6057.760-b] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
15
|
Gertz J, Varley KE, Reddy TE, Bowling KM, Pauli F, Parker SL, Kucera KS, Willard HF, Myers RM. Analysis of DNA methylation in a three-generation family reveals widespread genetic influence on epigenetic regulation. PLoS Genet 2011; 7:e1002228. [PMID: 21852959 PMCID: PMC3154961 DOI: 10.1371/journal.pgen.1002228] [Citation(s) in RCA: 236] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2011] [Accepted: 06/21/2011] [Indexed: 12/17/2022] Open
Abstract
The methylation of cytosines in CpG dinucleotides is essential for cellular differentiation and the progression of many cancers, and it plays an important role in gametic imprinting. To assess variation and inheritance of genome-wide patterns of DNA methylation simultaneously in humans, we applied reduced representation bisulfite sequencing (RRBS) to somatic DNA from six members of a three-generation family. We observed that 8.1% of heterozygous SNPs are associated with differential methylation in cis, which provides a robust signature for Mendelian transmission and relatedness. The vast majority of differential methylation between homologous chromosomes (>92%) occurs on a particular haplotype as opposed to being associated with the gender of the parent of origin, indicating that genotype affects DNA methylation of far more loci than does gametic imprinting. We found that 75% of genotype-dependent differential methylation events in the family are also seen in unrelated individuals and that overall genotype can explain 80% of the variation in DNA methylation. These events are under-represented in CpG islands, enriched in intergenic regions, and located in regions of low evolutionary conservation. Even though they are generally not in functionally constrained regions, 22% (twice as many as expected by chance) of genes harboring genotype-dependent DNA methylation exhibited allele-specific gene expression as measured by RNA-seq of a lymphoblastoid cell line, indicating that some of these events are associated with gene expression differences. Overall, our results demonstrate that the influence of genotype on patterns of DNA methylation is widespread in the genome and greatly exceeds the influence of imprinting on genome-wide methylation patterns. DNA methylation is a dynamic epigenetic mark that is essential for mammalian organismal development. DNA methylation levels can be influenced by environment, a chromosome's parental origin, and genome sequence. In this study, we evaluated the impact that DNA sequence has on DNA methylation by analyzing methylation levels in a three-generation family as well as unrelated individuals. By following DNA methylation patterns through the family along with nearby SNPs, we found that allelic differences between chromosomes play a much larger role in determining DNA methylation than the parental origin of the chromosome, indicating that DNA sequence has a larger impact on DNA methylation than gametic imprinting. We also found that allelic differences in DNA methylation found in the family can also be observed in unrelated individuals. In fact, the majority of variation in DNA methylation can be explained by genotype. Our results emphasize the importance of genome sequence in setting patterns of DNA methylation and indicate that genotype will need to be taken into account when assessing DNA methylation in the context of disease.
Collapse
Affiliation(s)
- Jason Gertz
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama, United States of America
| | - Katherine E. Varley
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama, United States of America
| | - Timothy E. Reddy
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama, United States of America
| | - Kevin M. Bowling
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama, United States of America
| | - Florencia Pauli
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama, United States of America
| | - Stephanie L. Parker
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama, United States of America
| | - Katerina S. Kucera
- Duke Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, United States of America
| | - Huntington F. Willard
- Duke Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, United States of America
| | - Richard M. Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama, United States of America
- * E-mail:
| |
Collapse
|
16
|
Lee HR, Hayden KE, Willard HF. Organization and molecular evolution of CENP-A--associated satellite DNA families in a basal primate genome. Genome Biol Evol 2011; 3:1136-49. [PMID: 21828373 PMCID: PMC3194837 DOI: 10.1093/gbe/evr083] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Centromeric regions in many complex eukaryotic species contain highly repetitive satellite DNAs. Despite the diversity of centromeric DNA sequences among species, the functional centromeres in all species studied to date are marked by CENP-A, a centromere-specific histone H3 variant. Although it is well established that families of multimeric higher-order alpha satellite are conserved at the centromeres of human and great ape chromosomes and that diverged monomeric alpha satellite is found in old and new world monkey genomes, little is known about the organization, function, and evolution of centromeric sequences in more distant primates, including lemurs. Aye-Aye (Daubentonia madagascariensis) is a basal primate and is located at a key position in the evolutionary tree to study centromeric satellite transitions in primate genomes. Using the approach of chromatin immunoprecipitation with antibodies directed to CENP-A, we have identified two satellite families, Daubentonia madagascariensis Aye-Aye 1 (DMA1) and Daubentonia madagascariensis Aye-Aye 2 (DMA2), related to each other but unrelated in sequence to alpha satellite or any other previously described primate or mammalian satellite DNA families. Here, we describe the initial genomic and phylogenetic organization of DMA1 and DMA2 and present evidence of higher-order repeats in Aye-Aye centromeric domains, providing an opportunity to study the emergence of chromosome-specific modes of satellite DNA evolution in primate genomes.
Collapse
Affiliation(s)
- Hye-Ran Lee
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, USA
| | | | | |
Collapse
|
17
|
Kucera KS, Reddy TE, Pauli F, Gertz J, Logan JE, Myers RM, Willard HF. Allele-specific distribution of RNA polymerase II on female X chromosomes. Hum Mol Genet 2011; 20:3964-73. [PMID: 21791549 DOI: 10.1093/hmg/ddr315] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
While the distribution of RNA polymerase II (PolII) in a variety of complex genomes is correlated with gene expression, the presence of PolII at a gene does not necessarily indicate active expression. Various patterns of PolII binding have been described genome wide; however, whether or not PolII binds at transcriptionally inactive sites remains uncertain. The two X chromosomes in female cells in mammals present an opportunity to examine each of the two alleles of a given locus in both active and inactive states, depending on which X chromosome is silenced by X chromosome inactivation. Here, we investigated PolII occupancy and expression of the associated genes across the active (Xa) and inactive (Xi) X chromosomes in human female cells to elucidate the relationship of gene expression and PolII binding. We find that, while PolII in the pseudoautosomal region occupies both chromosomes at similar levels, it is significantly biased toward the Xa throughout the rest of the chromosome. The general paucity of PolII on the Xi notwithstanding, detectable (albeit significantly reduced) binding can be observed, especially on the evolutionarily younger short arm of the X. PolII levels at genes that escape inactivation correlate with the levels of their expression; however, additional PolII sites can be found at apparently silenced regions, suggesting the possibility of a subset of genes on the Xi that are poised for expression. Consistent with this hypothesis, we show that a high proportion of genes associated with PolII-accessible sites, while silenced in GM12878, are expressed in other female cell lines.
Collapse
Affiliation(s)
- Katerina S Kucera
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, CIEMAS 2376, 101 Science Drive, Durham, 27708 NC, USA
| | | | | | | | | | | | | |
Collapse
|
18
|
Horvath JE, Sheedy CB, Merrett SL, Diallo AB, Swofford DL, NISC Comparative Sequencing Program, Green ED, Willard HF. Comparative analysis of the primate X-inactivation center region and reconstruction of the ancestral primate XIST locus. Genome Res 2011; 21:850-62. [PMID: 21518738 DOI: 10.1101/gr.111849.110] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
Here we provide a detailed comparative analysis across the candidate X-Inactivation Center (XIC) region and the XIST locus in the genomes of six primates and three mammalian outgroup species. Since lemurs and other strepsirrhine primates represent the sister lineage to all other primates, this analysis focuses on lemurs to reconstruct the ancestral primate sequences and to gain insight into the evolution of this region and the genes within it. This comparative evolutionary genomics approach reveals significant expansion in genomic size across the XIC region in higher primates, with minimal size alterations across the XIST locus itself. Reconstructed primate ancestral XIC sequences show that the most dramatic changes during the past 80 million years occurred between the ancestral primate and the lineage leading to Old World monkeys. In contrast, the XIST locus compared between human and the primate ancestor does not indicate any dramatic changes to exons or XIST-specific repeats; rather, evolution of this locus reflects small incremental changes in overall sequence identity and short repeat insertions. While this comparative analysis reinforces that the region around XIST has been subject to significant genomic change, even among primates, our data suggest that evolution of the XIST sequences themselves represents only small lineage-specific changes across the past 80 million years.
Collapse
Affiliation(s)
- Julie E Horvath
- Duke Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina 27708, USA.
| | | | | | | | | | | | | | | |
Collapse
|
19
|
McDaniell R, Lee BK, Song L, Liu Z, Boyle AP, Erdos MR, Scott LJ, Morken MA, Kucera KS, Battenhouse A, Keefe D, Collins FS, Willard HF, Lieb JD, Furey TS, Crawford GE, Iyer VR, Birney E. Heritable individual-specific and allele-specific chromatin signatures in humans. Science 2010; 328:235-9. [PMID: 20299549 DOI: 10.1126/science.1184655] [Citation(s) in RCA: 240] [Impact Index Per Article: 17.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
The extent to which variation in chromatin structure and transcription factor binding may influence gene expression, and thus underlie or contribute to variation in phenotype, is unknown. To address this question, we cataloged both individual-to-individual variation and differences between homologous chromosomes within the same individual (allele-specific variation) in chromatin structure and transcription factor binding in lymphoblastoid cells derived from individuals of geographically diverse ancestry. Ten percent of active chromatin sites were individual-specific; a similar proportion were allele-specific. Both individual-specific and allele-specific sites were commonly transmitted from parent to child, which suggests that they are heritable features of the human genome. Our study shows that heritable chromatin status and transcription factor binding differ as a result of genetic variation and may underlie phenotypic variation in humans.
Collapse
Affiliation(s)
- Ryan McDaniell
- Center for Systems and Synthetic Biology, Institute for Cellular and Molecular Biology, Section of Molecular Genetics and Microbiology, University of Texas, Austin, TX 78712, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Abstract
The last decade has witnessed a steady embrace of genomic and personalized medicine by senior government officials, industry leadership, health care providers, and the public. Genomic medicine, which is the use of information from genomes and their derivatives (RNA, proteins, and metabolites) to guide medical decision making-is a key component of personalized medicine, which is a rapidly advancing field of health care that is informed by each person's unique clinical, genetic, genomic, and environmental information. As medicine begins to embrace genomic tools that enable more precise prediction and treatment disease, which include "whole genome" interrogation of sequence variation, transcription, proteins, and metabolites, the fundamentals of genomic and personalized medicine will require the development, standardization, and integration of several important tools into health systems and clinical workflows. These tools include health risk assessment, family health history, and clinical decision support for complex risk and predictive information. Together with genomic information, these tools will enable a paradigm shift to a comprehensive approach that will identify individual risks and guide clinical management and decision making, all of which form the basis for a more informed and effective approach to patient care. DNA-based risk assessment for common complex disease, molecular signatures for cancer diagnosis and prognosis, and genome-guided therapy and dose selection are just among the few important examples for which genome information has already enabled personalized health care along the continuum from health to disease. In addition, information from individual genomes, which is a fast-moving area of technological development, is spawning a social and information revolution among consumers that will undoubtedly affect health care decision making. Although these and other scientific findings are making their way from the genome to the clinic, the full application of genomic and personalized medicine in health care will require dramatic changes in regulatory and reimbursement policies as well as legislative protections for privacy for system-wide adoption. Thus, there are challenges from both a scientific and a policy perspective to personalized health care; however, they will be confronted and solved with the certainty that the science behind genomic medicine is sound and the practice of medicine that it informs is evidence based.
Collapse
Affiliation(s)
- Geoffrey S Ginsburg
- Duke Institute for Genome Sciences & Policy, Duke University, Durham, NC 27708, USA.
| | | |
Collapse
|
21
|
Abstract
Characterizing how genomic sequence interacts with trans-acting regulatory factors to implement a program of gene expression in eukaryotic organisms is critical to understanding genome function. One means by which patterns of gene expression are achieved is through the differential packaging of DNA into distinct types of chromatin. While chromatin state exerts a major influence on gene expression, the extent to which cis-acting DNA sequences contribute to the specification of chromatin state remains incompletely understood. To address this, we have used a fission yeast sequence element (L5), known to be sufficient to nucleate heterochromatin, to establish de novo heterochromatin domains in the Schizosaccharomyces pombe genome. The resulting heterochromatin domains were queried for the presence of H3K9 di-methylation and Swi6p, both hallmarks of heterochromatin, and for levels of gene expression. We describe a major effect of genomic sequences in determining the size and extent of such de novo heterochromatin domains. Heterochromatin spreading is antagonized by the presence of genes, in a manner that can occur independent of strength of transcription. Increasing the dosage of Swi6p results in increased heterochromatin proximal to the L5 element, but does not result in an expansion of the heterochromatin domain, suggesting that in this context genomic effects are dominant over trans effects. Finally, we show that the ratio of Swi6p to H3K9 di-methylation is sequence-dependent and correlates with the extent of gene repression. Taken together, these data demonstrate that the sequence content of a genomic region plays a significant role in shaping its response to encroaching heterochromatin and suggest a role of DNA sequence in specifying chromatin state. Epigenetic packaging of DNA sequence into chromatin is a major force in shaping the function of complex genomes. Different types of chromatin have distinct effects on gene expression, and thus chromatin state imparts distinct features on the associated genomic DNA. Our study focuses on the transition between two opposing chromatin states: euchromatin, which generally correlates with gene expression, and heterochromatin, which is typically refractive to gene expression. While heterochromatin is capable of spreading into euchromatic domains, the parameters that influence such spreading are unknown. We established heterochromatin at ectopic sites in the genome and evaluated whether specific DNA sequences affected the extent of heterochromatin spreading and the transition between heterochromatin and euchromatin. We found that the nature of the genomic DNA neighboring the heterochromatic sequence dramatically affected the extent of heterochromatin spreading. In particular, the presence of genes antagonized the spread of heterochromatin, whereas neutral sequence elements were incorporated into the domain. This study demonstrates that genome sequence and chromatin identity are inextricably linked; features of both interact to determine the structural and functional fate of underlying DNA sequences.
Collapse
Affiliation(s)
- Bayly S. Wheeler
- Duke Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, United States of America
| | - Jared A. Blau
- Duke Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, United States of America
| | - Huntington F. Willard
- Duke Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, United States of America
| | - Kristin C. Scott
- Duke Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, United States of America
- * E-mail:
| |
Collapse
|
22
|
Kawamoto K, Lobach DF, Willard HF, Ginsburg GS. A national clinical decision support infrastructure to enable the widespread and consistent practice of genomic and personalized medicine. BMC Med Inform Decis Mak 2009; 9:17. [PMID: 19309514 PMCID: PMC2666673 DOI: 10.1186/1472-6947-9-17] [Citation(s) in RCA: 84] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2008] [Accepted: 03/23/2009] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND In recent years, the completion of the Human Genome Project and other rapid advances in genomics have led to increasing anticipation of an era of genomic and personalized medicine, in which an individual's health is optimized through the use of all available patient data, including data on the individual's genome and its downstream products. Genomic and personalized medicine could transform healthcare systems and catalyze significant reductions in morbidity, mortality, and overall healthcare costs. DISCUSSION Critical to the achievement of more efficient and effective healthcare enabled by genomics is the establishment of a robust, nationwide clinical decision support infrastructure that assists clinicians in their use of genomic assays to guide disease prevention, diagnosis, and therapy. Requisite components of this infrastructure include the standardized representation of genomic and non-genomic patient data across health information systems; centrally managed repositories of computer-processable medical knowledge; and standardized approaches for applying these knowledge resources against patient data to generate and deliver patient-specific care recommendations. Here, we provide recommendations for establishing a national decision support infrastructure for genomic and personalized medicine that fulfills these needs, leverages existing resources, and is aligned with the Roadmap for National Action on Clinical Decision Support commissioned by the U.S. Office of the National Coordinator for Health Information Technology. Critical to the establishment of this infrastructure will be strong leadership and substantial funding from the federal government. SUMMARY A national clinical decision support infrastructure will be required for reaping the full benefits of genomic and personalized medicine. Essential components of this infrastructure include standards for data representation; centrally managed knowledge repositories; and standardized approaches for leveraging these knowledge repositories to generate patient-specific care recommendations at the point of care.
Collapse
Affiliation(s)
- Kensaku Kawamoto
- Division of Clinical Informatics, Department of Community and Family Medicine, Box 104007, Duke University Medical Center, Durham, North Carolina 27710, USA.
| | | | | | | |
Collapse
|
23
|
Tuckson RV, Willard HF. A letter response to Chaufan's “How much can a large population study on genes, environments, their interactions and common diseases contribute to the health of the American people?” (65:8, 2007, 1730–1741). Soc Sci Med 2008; 67:673-4; author reply 675-83. [DOI: 10.1016/j.socscimed.2008.04.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2008] [Indexed: 10/22/2022]
|
24
|
|
25
|
Horvath JE, Weisrock DW, Embry SL, Fiorentino I, Balhoff JP, Kappeler P, Wray GA, Willard HF, Yoder AD. Development and application of a phylogenomic toolkit: resolving the evolutionary history of Madagascar's lemurs. Genes Dev 2008; 18:489-99. [PMID: 18245770 PMCID: PMC2259113 DOI: 10.1101/gr.7265208] [Citation(s) in RCA: 118] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2007] [Accepted: 12/19/2007] [Indexed: 12/29/2022]
Abstract
Lemurs and the other strepsirrhine primates are of great interest to the primate genomics community due to their phylogenetic placement as the sister lineage to all other primates. Previous attempts to resolve the phylogeny of lemurs employed limited mitochondrial or small nuclear data sets, with many relationships poorly supported or entirely unresolved. We used genomic resources to develop 11 novel markers from nine chromosomes, representing approximately 9 kb of nuclear sequence data. In combination with previously published nuclear and mitochondrial loci, this yields a data set of more than 16 kb and adds approximately 275 kb of DNA sequence to current databases. Our phylogenetic analyses confirm hypotheses of lemuriform monophyly and provide robust resolution of the phylogenetic relationships among the five lemuriform families. We verify that the genus Daubentonia is the sister lineage to all other lemurs. The Cheirogaleidae and Lepilemuridae are sister taxa and together form the sister lineage to the Indriidae; this clade is the sister lineage to the Lemuridae. Divergence time estimates indicate that lemurs are an ancient group, with their initial diversification occurring around the Cretaceous-Tertiary boundary. Given the power of this data set to resolve branches in a notoriously problematic area of primate phylogeny, we anticipate that our phylogenomic toolkit will be of value to other studies of primate phylogeny and diversification. Moreover, the methods applied will be broadly applicable to other taxonomic groups where phylogenetic relationships have been notoriously difficult to resolve.
Collapse
Affiliation(s)
- Julie E Horvath
- Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina 27708, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
26
|
Abstract
Heterochromatin formation involves the nucleation and spreading of structural and epigenetic features along the chromatin fiber. Chromatin barriers and associated proteins counteract the spreading of heterochromatin, thereby restricting it to specific regions of the genome. We have performed gene expression studies and chromatin immunoprecipitation on strains in which native centromere sequences have been mutated to study the mechanism by which a tRNAAlanine gene barrier (cen1 tDNAAla) blocks the spread of pericentromeric heterochromatin at the centromere of chromosome 1 (cen1) in the fission yeast, Schizosaccharomyces pombe. Within the centromere, barrier activity is a general property of tDNAs and, unlike previously characterized barriers, requires the association of both transcription factor IIIC and RNA Polymerase III. Although the cen1 tDNAAla gene is actively transcribed, barrier activity is independent of transcriptional orientation. These findings provide experimental evidence for the involvement of a fully assembled RNA polymerase III transcription complex in defining independent structural and functional domains at a eukaryotic centromere.
Collapse
Affiliation(s)
- Kristin C Scott
- Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, United States of America.
| | | | | |
Collapse
|
27
|
|
28
|
Horvath JE, Willard HF. Primate comparative genomics: lemur biology and evolution. Trends Genet 2007; 23:173-82. [PMID: 17331617 DOI: 10.1016/j.tig.2007.02.007] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2006] [Revised: 01/08/2007] [Accepted: 02/12/2007] [Indexed: 12/23/2022]
Abstract
Comparative genome sequencing projects are providing insight into aspects of genome biology that raise new questions and challenge existing paradigms. Placement in the phylogenetic tree can often be a major determinant of which organism to choose for study. Lemurs hold a key position at the base of the primate evolutionary tree and will be highly informative for the genomics community by offering comparisons of primate-specific characteristics and processes. Combining research in chromosome evolution, genome evolution and behavior with lemur comparative genomic sequencing will offer insights into many levels of primate evolution. We discuss the current state of lemur cytogenetic and phylogenetic analyses, and suggest how focusing more genomic efforts on lemurs will be beneficial to understanding human and primate evolution, as well as disease, and will contribute to conservation efforts.
Collapse
Affiliation(s)
- Julie E Horvath
- Duke Institute for Genome Sciences & Policy, Duke University, Durham, NC 27708, USA.
| | | |
Collapse
|
29
|
Basu J, Willard HF, Stromberg G. Human Artificial Chromosome Assembly by Transposon‐Based Retrofitting of Genomic BACs with Synthetic Alpha‐Satellite Arrays. ACTA ACUST UNITED AC 2007; Chapter 5:Unit 5.18. [DOI: 10.1002/0471142905.hg0518s52] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Joydeep Basu
- Duke Institute for Genome Sciences and Policy Durham North Carolina
| | | | | |
Collapse
|
30
|
Abstract
Many achievements in the genome sciences have been facilitated by policies that have prioritized genome research, secured funding and raised public and health-professional awareness. Such policies should address ethical, legal and social concerns, and are as important to the scientific and commercial development of the field as the science itself. On occasion, policy issues take precedence over science, particularly when impasses are encountered or when public health or money is at stake. Here we discuss the spectrum of current issues and debates in genome policy, and how to actively engage all affected stakeholders to promote effective policy making.
Collapse
Affiliation(s)
- Susanne B Haga
- Institute for Genome Sciences & Policy, Duke University, 101 Science Drive, Box 3382, Durham, North Carolina 27708, USA.
| | | |
Collapse
|
31
|
|
32
|
Abstract
Human artificial chromosomes demonstrate promise as a novel class of nonintegrative gene therapy vectors. The authors outline current developments in human artificial chromosome technology and examine their potential for clinical application.
Collapse
Affiliation(s)
- Joydeep Basu
- Institute for Genome Sciences & Policy, Duke University, 101 Science Drive, Durham, NC 27708, USA.
| | | |
Collapse
|
33
|
Amos-Landgraf JM, Cottle A, Plenge RM, Friez M, Schwartz CE, Longshore J, Willard HF. X chromosome-inactivation patterns of 1,005 phenotypically unaffected females. Am J Hum Genet 2006; 79:493-9. [PMID: 16909387 PMCID: PMC1559535 DOI: 10.1086/507565] [Citation(s) in RCA: 224] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2005] [Accepted: 07/05/2006] [Indexed: 12/12/2022] Open
Abstract
X-chromosome inactivation is widely believed to be random in early female development and to result in a mosaic distribution of cells, approximately half with the paternally derived X chromosome inactive and half with the maternally derived X chromosome inactive. Significant departures from such a random pattern are hallmarks of a variety of clinical states, including being carriers for severe X-linked diseases or X-chromosome cytogenetic abnormalities. To evaluate the significance of skewed patterns of X inactivation, we examined patterns of X inactivation in a population of >1,000 phenotypically unaffected females. The data demonstrate that only a very small proportion of unaffected females show significantly skewed inactivation, especially during the neonatal period. By comparison with this data set, the degree of skewed inactivation in a given individual can now be quantified and evaluated for its potential clinical significance.
Collapse
|
34
|
Chadwick LH, Pertz LM, Broman KW, Bartolomei MS, Willard HF. Genetic control of X chromosome inactivation in mice: definition of the Xce candidate interval. Genetics 2006; 173:2103-10. [PMID: 16582439 PMCID: PMC1569705 DOI: 10.1534/genetics.105.054882] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2005] [Accepted: 03/30/2006] [Indexed: 12/29/2022] Open
Abstract
In early mammalian development, one of the two X chromosomes is silenced in each female cell as a result of X chromosome inactivation, the mammalian dosage compensation mechanism. In the mouse epiblast, the choice of which chromosome is inactivated is essentially random, but can be biased by alleles at the X-linked X controlling element (Xce). Although this locus was first described nearly four decades ago, the identity and precise genomic localization of Xce remains elusive. Within the X inactivation center region of the X chromosome, previous linkage disequilibrium studies comparing strains of known Xce genotypes have suggested that Xce is physically distinct from Xist, although this has not yet been established by genetic mapping or progeny testing. In this report, we used quantitative trait locus (QTL) mapping strategies to define the minimal Xce candidate interval. Subsequent analysis of recombinant chromosomes allowed for the establishment of a maximum 1.85-Mb candidate region for the Xce locus. Finally, we use QTL approaches in an effort to identify additional modifiers of the X chromosome choice, as we have previously demonstrated that choice in Xce heterozygous females is significantly influenced by genetic variation present on autosomes (Chadwick and Willard 2005). We did not identify any autosomal loci with significant associations and thus show conclusively that Xce is the only major locus to influence X inactivation patterns in the crosses analyzed. This study provides a foundation for future analyses into the genetic control of X chromosome inactivation and defines a 1.85-Mb interval encompassing all the major elements of the Xce locus.
Collapse
|
35
|
Wang Z, Willard HF, Mukherjee S, Furey TS. Evidence of influence of genomic DNA sequence on human X chromosome inactivation. PLoS Comput Biol 2006; 2:e113. [PMID: 16948528 PMCID: PMC1557588 DOI: 10.1371/journal.pcbi.0020113] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2006] [Accepted: 07/17/2006] [Indexed: 01/01/2023] Open
Abstract
A significant number of human X-linked genes escape X chromosome inactivation and are thus expressed from both the active and inactive X chromosomes. The basis for escape from inactivation and the potential role of the X chromosome primary DNA sequence in determining a gene's X inactivation status is unclear. Using a combination of the X chromosome sequence and a comprehensive X inactivation profile of more than 600 genes, two independent yet complementary approaches were used to systematically investigate the relationship between X inactivation and DNA sequence features. First, statistical analyses revealed that a number of repeat features, including long interspersed nuclear element (LINE) and mammalian-wide interspersed repeat repetitive elements, are significantly enriched in regions surrounding transcription start sites of genes that are subject to inactivation, while Alu repetitive elements and short motifs containing ACG/CGT are significantly enriched in those that escape inactivation. Second, linear support vector machine classifiers constructed using primary DNA sequence features were used to correctly predict the X inactivation status for >80% of all X-linked genes. We further identified a small set of features that are important for accurate classification, among which LINE-1 and LINE-2 content show the greatest individual discriminatory power. Finally, as few as 12 features can be used for accurate support vector machine classification. Taken together, these results suggest that features of the underlying primary DNA sequence of the human X chromosome may influence the spreading and/or maintenance of X inactivation.
Collapse
Affiliation(s)
- Zhong Wang
- Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, United States of America
| | - Huntington F Willard
- Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, United States of America
| | - Sayan Mukherjee
- Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, United States of America
| | - Terrence S Furey
- Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
36
|
Abstract
Variation in the composition of chromatin has been proposed to generate a 'histone code' that epigenetically regulates gene expression in a variety of eukaryotic systems. As a result of the process of X chromosome inactivation, chromatinon the mammalian inactive X chromosome (Xi) is marked by several modifications, including histone hypoacetylation, trimethylation of lysine 9 on histone H3 (H3TrimK9) and substitution of core histone H2A with the histone variant MacroH2A. H3TrimK9 is a well-studied marker for heterochromatin in many organisms, but the distribution and function of MacroH2A are less clear. Cytologically, the Xi in human cells comprises alternating and largely non-overlapping approximately 10-15 Mb domains marked by MacroH2A and H3TrimK9. To examine the genomic deposition of MacroH2A, H3TrimK9 and acetylated histone H4 modifications on the Xi at higher resolution, we used chromatin immunoprecipitation in combination with a SNP-based assay to distinguish the Xi and active X (Xa) in a diploid female cell line and to determine quantitatively the relative enrichment of these histone code elements on the Xi relative to the Xa. Although we found a majority of sites were enriched for either MacroH2A or H3TrimK9 in a manner consistent with the cytological appearance of the Xi, a range of different histone code types were detected at different sites along the X. These findings suggest that the nature of the heterochromatin histone code associated with X inactivation may be more heterogeneous than previously thought and imply that gene silencing can be achieved by a variety of different epigenetic mechanisms whose genomic, evolutionary or developmental basis is now amenable to investigation.
Collapse
Affiliation(s)
- Cory M Valley
- Institute for Genome Sciences & Policy, Duke University, Durham, NC 27708, USA
| | | | | | | |
Collapse
|
37
|
Valley CM, Willard HF. Genomic and epigenomic approaches to the study of X chromosome inactivation. Curr Opin Genet Dev 2006; 16:240-5. [PMID: 16647845 DOI: 10.1016/j.gde.2006.04.008] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2006] [Accepted: 04/18/2006] [Indexed: 10/24/2022]
Abstract
X chromosome inactivation represents a compelling example of chromosome-wide, long-range epigenetic gene-silencing in mammals. The cis- and trans-acting factors that establish and maintain the patterns and levels of gene expression from the active and inactive X chromosomes remain incompletely understood; however, the availability of the complete genomic sequence of the human X chromosome, together with complementary approaches that explore the computational biology, epigenetic modifications and gene expression-profiling along the chromosome, suggests that the features of the X chromosome that are responsible for its unique forms of gene regulation are increasingly amenable to experimental analysis.
Collapse
Affiliation(s)
- Cory M Valley
- Institute for Genome Sciences & Policy, Duke University, 101 Science Drive, CIEMAS 2376, Durham, NC 27708, USA
| | | |
Collapse
|
38
|
Scott KC, Merrett SL, Willard HF. A heterochromatin barrier partitions the fission yeast centromere into discrete chromatin domains. Curr Biol 2006; 16:119-29. [PMID: 16431364 DOI: 10.1016/j.cub.2005.11.065] [Citation(s) in RCA: 162] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2005] [Revised: 10/18/2005] [Accepted: 11/23/2005] [Indexed: 01/21/2023]
Abstract
BACKGROUND Centromeres are cis-acting chromosomal domains that direct kinetochore formation, enabling faithful chromosome segregation. Centromeric regions of higher eukaryotes are structurally complex, consisting of various epigenetically modified chromatin types including specialized chromatin at the kinetochore itself, pericentromeric heterochromatin, and flanking euchromatin. Although the features necessary for the establishment and maintenance of discrete chromatin domains remain poorly understood, two models have been proposed based either on the passive convergence of competing activities involved in individual domain formation or, alternatively, on the action of specific genomic sequences and associated proteins to actively block the propagation of one chromatin type into another. RESULTS Functional analysis of centromeric sequences located at the intersection of Schizosaccharomyces pombe central core chromatin and outer repeat heterochromatin identified a chromatin barrier that contains a transfer RNA (tRNA) gene. Deletion or modification of the barrier sequences result in the propagation of pericentromeric heterochromatin beyond its normal boundary. The tRNA gene is transcriptionally active, and barrier activity requires sequences necessary for RNA polymerase III transcription. Moreover, absence of the barrier results in abnormal meiotic chromosome segregation. CONCLUSIONS The identification of DNA sequences with chromatin barrier activity at the fission yeast centromere provides a model for establishment of centromeric chromatin domains in higher eukaryotes.
Collapse
Affiliation(s)
- Kristin C Scott
- Institute for Genome Sciences and Policy, Duke University, 101 Science Drive, Durham, North Carolina 27708, USA
| | | | | |
Collapse
|
39
|
Abstract
Advances in genome technology and other fruits of the Human Genome Project are playing a growing role in the delivery of health care. With the development of new technologies and opportunities for large-scale analysis of the genome, transcriptome, proteome and metabolome, the genome sciences are poised to have a profound impact on clinical medicine. Cancer prognostics will be among the first major test cases for a genomic medicine paradigm, given that all cancer is caused by genomic instability, and microarrays allow assessment of patients' entire expressed genomes. Analysis of breast cancer patients' expression patterns can already be highly correlated with recurrence risks. By integrating clinical data with gene expression profiles, imaging, metabolomic profiles and proteomic data, the prospect for developing truly individualized care becomes ever more real. Notwithstanding these promises, daunting challenges remain for genomic medicine. Success will require planning robust prospective trials, analysing health care economic and outcome data, assuaging insurance and privacy concerns, developing health delivery models that are commercially viable and scaling up to meet the needs of the whole population.
Collapse
Affiliation(s)
- Huntington F Willard
- Institute for Genome Sciences & Policy, Duke University, CIEMAS 2376, 101 Science Drive, Durham, NC 27708, USA.
| | | | | |
Collapse
|
40
|
Abstract
Alpha-satellite is a family of tandemly repeated sequences found at all normal human centromeres. In addition to its significance for understanding centromere function, alpha-satellite is also a model for concerted evolution, as alpha-satellite repeats are more similar within a species than between species. There are two types of alpha-satellite in the human genome; while both are made up of approximately 171-bp monomers, they can be distinguished by whether monomers are arranged in extremely homogeneous higher-order, multimeric repeat units or exist as more divergent monomeric alpha-satellite that lacks any multimeric periodicity. In this study, as a model to examine the genomic and evolutionary relationships between these two types, we have focused on the chromosome 17 centromeric region that has reached both higher-order and monomeric alpha-satellite in the human genome assembly. Monomeric and higher-order alpha-satellites on chromosome 17 are phylogenetically distinct, consistent with a model in which higher-order evolved independently of monomeric alpha-satellite. Comparative analysis between human chromosome 17 and the orthologous chimpanzee chromosome indicates that monomeric alpha-satellite is evolving at approximately the same rate as the adjacent non-alpha-satellite DNA. However, higher-order alpha-satellite is less conserved, suggesting different evolutionary rates for the two types of alpha-satellite.
Collapse
Affiliation(s)
- M Katharine Rudd
- Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina 27708, USA
| | | | | |
Collapse
|
41
|
Higgins AW, Gustashaw KM, Willard HF. Engineered human dicentric chromosomes show centromere plasticity. Chromosome Res 2005; 13:745-62. [PMID: 16331407 DOI: 10.1007/s10577-005-1009-2] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2005] [Revised: 09/13/2005] [Indexed: 11/26/2022]
Abstract
The centromere is essential for the faithful distribution of a cell's genetic material to subsequent generations. Despite intense scrutiny, the precise genetic and epigenetic basis for centromere function is still unknown. Here, we have used engineered dicentric human chromosomes to investigate mammalian centromere structure and function. We describe three classes of dicentric chromosomes isolated in different cell lines: functionally monocentric chromosomes, in which one of the two genetically identical centromeres is consistently inactivated; functionally dicentric chromosomes, in which both centromeres are consistently active; and dicentric chromosomes heterogeneous with respect to centromere activity. A study of serial single cell clones from heterogeneous cell lines revealed that while centromere activity is usually clonal, the centromere state (i.e. functionally monocentric or dicentric) in some lines can switch within a growing population of cells. Because pulsed field gel analysis indicated that the DNA at the centromeres of these chromosomes did not change detectably, this switching of the centromere state is most likely due to epigenetic changes. Inactivation of one of the two active centromeres in a functionally dicentric chromosome was observed in a percentage of cells after treatment with Trichostatin A, an inhibitor of histone deacetylation. This study provides evidence that the activity of human centromeres, while largely stable, can be subject to dynamic change, most likely due to epigenetic modification.
Collapse
Affiliation(s)
- Anne W Higgins
- Department of Genetics, Case Western Reserve University School of Medicine, Cleveland, OH 44106, USA
| | | | | |
Collapse
|
42
|
Chadwick LH, Willard HF. Genetic and parent-of-origin influences on X chromosome choice in Xce heterozygous mice. Mamm Genome 2005; 16:691-9. [PMID: 16245026 DOI: 10.1007/s00335-005-0059-2] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2005] [Accepted: 06/01/2005] [Indexed: 10/25/2022]
Abstract
X chromosome inactivation is unique among dosage compensation mechanisms in that the two X chromosomes in females are treated differently within the same cell; one X chromosome is stably silenced while the other remains active. It is widely believed that, when X inactivation is initiated, each cell makes a random choice of which X chromosome will be silenced. In mice, only one genetic locus, the X-linked X controlling element (X ce), is known to influence this choice, because animals that are heterozygous at X ce have X-inactivation patterns that differ markedly from a mean of 0.50. To document other genetic and epigenetic influences on choice, we have performed a population-based study of the effect of X ce genotype on X-inactivation patterns. In B 6 CAST F(1) females (X ce(b)/X ce(c)), the X-inactivation pattern followed a symmetric distribution with a mean of 0.29 (SD=0.08). Surprisingly, however, in a population of X ce(b)/X ce(c) heterozygous B 6 CAST F(2) females, we observed significant differences in both the mean (p=0.004) and variance (p=0.004) of the X-inactivation patterns. This finding is incompatible with a single-locus model and suggests that additional genetic factors also influence X chromosome choice. We show that both parent-of-origin and naturally occurring genetic variation at autosomal loci contribute to these differences. Taken together, these data reveal further genetic complexity in this epigenetic control pathway.
Collapse
Affiliation(s)
- Lisa Helbling Chadwick
- Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina 27708, USA
| | | |
Collapse
|
43
|
Abstract
Non-integrating gene-delivery platforms demonstrate promise as potentially ideal gene-therapy vector systems. Although several approaches are under development, there is little consensus as to what constitutes a true 'artificial' versus an 'engineered' human chromosome. Recent progress must be evaluated in light of significant technical challenges that remain before such vectors achieve clinical utility. Here, we examine the principal classes of non-integrating vectors, ranging from episomes to engineered mini-chromosomes to true human artificial chromosomes. We compare their potential as practical gene-transfer platforms and summarize recent advances towards eventual applications in gene therapy. Although chromosome-engineering technology has advanced considerably within recent years, difficulties in establishing composition of matter and effective vector delivery currently prevent artificial or engineered chromosomes being accepted as viable gene-delivery platforms.
Collapse
Affiliation(s)
- Joydeep Basu
- Institute for Genome Sciences and Policy, Duke University, Durham, NC 27708, USA.
| | | |
Collapse
|
44
|
Schueler MG, Dunn JM, Bird CP, Ross MT, Viggiano L, Rocchi M, Willard HF, Green ED. Progressive proximal expansion of the primate X chromosome centromere. Proc Natl Acad Sci U S A 2005; 102:10563-8. [PMID: 16030148 PMCID: PMC1180780 DOI: 10.1073/pnas.0503346102] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Previous studies of the pericentromeric region of the human X chromosome short arm (Xp) revealed an age gradient from ancient DNA that contains expressed genes to recent human-specific DNA at the functional centromere. We analyzed the finished sequence of this human genomic region to investigate its evolutionary history. Phylogenetic analysis of >1,500 alpha-satellite monomers from the region revealed the presence of five physical domains, each containing monomers from a distinct phylogenetic clade. The most distal domain contains long interspersed nucleotide element repeats that were active >35 million years ago, whereas the four proximal domains contain more recently active long interspersed nucleotide element repeats. An out-of-register, unequal recombination (i.e., crossover) detected at the edge of the X chromosome-specific alpha-satellite array (DXZ1) may reflect the most recent of a series of punctuating events during evolution that resulted in a proximal physical expansion of the X centromere. The first 18 kb of this array has 97-99% pairwise identity among all 2-kb repeat units. To perform more detailed evolutionary comparisons, we sequenced the junction between the ancient DNA of Xp and the primate-specific alpha satellite in chimpanzee, gorilla, orangutan, vervet, macaque, and baboon. The striking conservation found in all cases supports the ancestral nature of the alpha satellite at this location. These studies demonstrate that the primate X centromere appears to have evolved through repeated expansion events occurring within the central, active region of centromeric DNA, with the newly added sequences then conferring centromere function.
Collapse
Affiliation(s)
- Mary G Schueler
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | | | | | | | | | | | | | | |
Collapse
|
45
|
Basu J, Compitello G, Stromberg G, Willard HF, Van Bokkelen G. Efficient assembly of de novo human artificial chromosomes from large genomic loci. BMC Biotechnol 2005; 5:21. [PMID: 15998466 PMCID: PMC1182356 DOI: 10.1186/1472-6750-5-21] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2005] [Accepted: 07/05/2005] [Indexed: 01/20/2023] Open
Abstract
Background Human Artificial Chromosomes (HACs) are potentially useful vectors for gene transfer studies and for functional annotation of the genome because of their suitability for cloning, manipulating and transferring large segments of the genome. However, development of HACs for the transfer of large genomic loci into mammalian cells has been limited by difficulties in manipulating high-molecular weight DNA, as well as by the low overall frequencies of de novo HAC formation. Indeed, to date, only a small number of large (>100 kb) genomic loci have been reported to be successfully packaged into de novo HACs. Results We have developed novel methodologies to enable efficient assembly of HAC vectors containing any genomic locus of interest. We report here the creation of a novel, bimolecular system based on bacterial artificial chromosomes (BACs) for the construction of HACs incorporating any defined genomic region. We have utilized this vector system to rapidly design, construct and validate multiple de novo HACs containing large (100–200 kb) genomic loci including therapeutically significant genes for human growth hormone (HGH), polycystic kidney disease (PKD1) and ß-globin. We report significant differences in the ability of different genomic loci to support de novo HAC formation, suggesting possible effects of cis-acting genomic elements. Finally, as a proof of principle, we have observed sustained ß-globin gene expression from HACs incorporating the entire 200 kb ß-globin genomic locus for over 90 days in the absence of selection. Conclusion Taken together, these results are significant for the development of HAC vector technology, as they enable high-throughput assembly and functional validation of HACs containing any large genomic locus. We have evaluated the impact of different genomic loci on the frequency of HAC formation and identified segments of genomic DNA that appear to facilitate de novo HAC formation. These genomic loci may be useful for identifying discrete functional elements that may be incorporated into future generations of HAC vectors.
Collapse
MESH Headings
- Biotechnology/methods
- Cell Line
- Chromosomes, Artificial, Bacterial/genetics
- Chromosomes, Artificial, Human/genetics
- Cloning, Molecular
- DNA
- DNA, Satellite
- Fibroblasts/cytology
- Gene Transfer Techniques
- Genetic Techniques
- Genetic Vectors
- Genome
- Globins/genetics
- Human Growth Hormone/genetics
- Humans
- In Situ Hybridization, Fluorescence
- Microscopy, Fluorescence
- Models, Genetic
- Polycystic Kidney Diseases/genetics
- RNA, Messenger/metabolism
- Reverse Transcriptase Polymerase Chain Reaction
- Transfection
Collapse
Affiliation(s)
- Joydeep Basu
- Institute for Genome Sciences & Policy, Duke University, Durham, NC 27708, USA
- Athersys Inc., 3201 Carnegie Avenue, Cleveland, OH 44115, USA
| | | | | | | | | |
Collapse
|
46
|
Ross MT, Grafham DV, Coffey AJ, Scherer S, McLay K, Muzny D, Platzer M, Howell GR, Burrows C, Bird CP, Frankish A, Lovell FL, Howe KL, Ashurst JL, Fulton RS, Sudbrak R, Wen G, Jones MC, Hurles ME, Andrews TD, Scott CE, Searle S, Ramser J, Whittaker A, Deadman R, Carter NP, Hunt SE, Chen R, Cree A, Gunaratne P, Havlak P, Hodgson A, Metzker ML, Richards S, Scott G, Steffen D, Sodergren E, Wheeler DA, Worley KC, Ainscough R, Ambrose KD, Ansari-Lari MA, Aradhya S, Ashwell RIS, Babbage AK, Bagguley CL, Ballabio A, Banerjee R, Barker GE, Barlow KF, Barrett IP, Bates KN, Beare DM, Beasley H, Beasley O, Beck A, Bethel G, Blechschmidt K, Brady N, Bray-Allen S, Bridgeman AM, Brown AJ, Brown MJ, Bonnin D, Bruford EA, Buhay C, Burch P, Burford D, Burgess J, Burrill W, Burton J, Bye JM, Carder C, Carrel L, Chako J, Chapman JC, Chavez D, Chen E, Chen G, Chen Y, Chen Z, Chinault C, Ciccodicola A, Clark SY, Clarke G, Clee CM, Clegg S, Clerc-Blankenburg K, Clifford K, Cobley V, Cole CG, Conquer JS, Corby N, Connor RE, David R, Davies J, Davis C, Davis J, Delgado O, Deshazo D, Dhami P, Ding Y, Dinh H, Dodsworth S, Draper H, Dugan-Rocha S, Dunham A, Dunn M, Durbin KJ, Dutta I, Eades T, Ellwood M, Emery-Cohen A, Errington H, Evans KL, Faulkner L, Francis F, Frankland J, Fraser AE, Galgoczy P, Gilbert J, Gill R, Glöckner G, Gregory SG, Gribble S, Griffiths C, Grocock R, Gu Y, Gwilliam R, Hamilton C, Hart EA, Hawes A, Heath PD, Heitmann K, Hennig S, Hernandez J, Hinzmann B, Ho S, Hoffs M, Howden PJ, Huckle EJ, Hume J, Hunt PJ, Hunt AR, Isherwood J, Jacob L, Johnson D, Jones S, de Jong PJ, Joseph SS, Keenan S, Kelly S, Kershaw JK, Khan Z, Kioschis P, Klages S, Knights AJ, Kosiura A, Kovar-Smith C, Laird GK, Langford C, Lawlor S, Leversha M, Lewis L, Liu W, Lloyd C, Lloyd DM, Loulseged H, Loveland JE, Lovell JD, Lozado R, Lu J, Lyne R, Ma J, Maheshwari M, Matthews LH, McDowall J, McLaren S, McMurray A, Meidl P, Meitinger T, Milne S, Miner G, Mistry SL, Morgan M, Morris S, Müller I, Mullikin JC, Nguyen N, Nordsiek G, Nyakatura G, O'Dell CN, Okwuonu G, Palmer S, Pandian R, Parker D, Parrish J, Pasternak S, Patel D, Pearce AV, Pearson DM, Pelan SE, Perez L, Porter KM, Ramsey Y, Reichwald K, Rhodes S, Ridler KA, Schlessinger D, Schueler MG, Sehra HK, Shaw-Smith C, Shen H, Sheridan EM, Shownkeen R, Skuce CD, Smith ML, Sotheran EC, Steingruber HE, Steward CA, Storey R, Swann RM, Swarbreck D, Tabor PE, Taudien S, Taylor T, Teague B, Thomas K, Thorpe A, Timms K, Tracey A, Trevanion S, Tromans AC, d'Urso M, Verduzco D, Villasana D, Waldron L, Wall M, Wang Q, Warren J, Warry GL, Wei X, West A, Whitehead SL, Whiteley MN, Wilkinson JE, Willey DL, Williams G, Williams L, Williamson A, Williamson H, Wilming L, Woodmansey RL, Wray PW, Yen J, Zhang J, Zhou J, Zoghbi H, Zorilla S, Buck D, Reinhardt R, Poustka A, Rosenthal A, Lehrach H, Meindl A, Minx PJ, Hillier LW, Willard HF, Wilson RK, Waterston RH, Rice CM, Vaudin M, Coulson A, Nelson DL, Weinstock G, Sulston JE, Durbin R, Hubbard T, Gibbs RA, Beck S, Rogers J, Bentley DR. The DNA sequence of the human X chromosome. Nature 2005; 434:325-37. [PMID: 15772651 PMCID: PMC2665286 DOI: 10.1038/nature03440] [Citation(s) in RCA: 738] [Impact Index Per Article: 38.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2005] [Accepted: 02/07/2005] [Indexed: 01/19/2023]
Abstract
The human X chromosome has a unique biology that was shaped by its evolution as the sex chromosome shared by males and females. We have determined 99.3% of the euchromatic sequence of the X chromosome. Our analysis illustrates the autosomal origin of the mammalian sex chromosomes, the stepwise process that led to the progressive loss of recombination between X and Y, and the extent of subsequent degradation of the Y chromosome. LINE1 repeat elements cover one-third of the X chromosome, with a distribution that is consistent with their proposed role as way stations in the process of X-chromosome inactivation. We found 1,098 genes in the sequence, of which 99 encode proteins expressed in testis and in various tumour types. A disproportionately high number of mendelian diseases are documented for the X chromosome. Of this number, 168 have been explained by mutations in 113 X-linked genes, which in many cases were characterized with the aid of the DNA sequence.
Collapse
MESH Headings
- Animals
- Antigens, Neoplasm/genetics
- Centromere/genetics
- Chromosomes, Human, X/genetics
- Chromosomes, Human, Y/genetics
- Contig Mapping
- Crossing Over, Genetic/genetics
- Dosage Compensation, Genetic
- Evolution, Molecular
- Female
- Genetic Linkage/genetics
- Genetics, Medical
- Genomics
- Humans
- Male
- Polymorphism, Single Nucleotide/genetics
- RNA/genetics
- Repetitive Sequences, Nucleic Acid/genetics
- Sequence Analysis, DNA
- Sequence Homology, Nucleic Acid
- Testis/metabolism
Collapse
Affiliation(s)
- Mark T Ross
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Carrel L, Willard HF. X-inactivation profile reveals extensive variability in X-linked gene expression in females. Nature 2005; 434:400-4. [PMID: 15772666 DOI: 10.1038/nature03479] [Citation(s) in RCA: 1449] [Impact Index Per Article: 76.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2005] [Accepted: 02/17/2005] [Indexed: 02/06/2023]
Abstract
In female mammals, most genes on one X chromosome are silenced as a result of X-chromosome inactivation. However, some genes escape X-inactivation and are expressed from both the active and inactive X chromosome. Such genes are potential contributors to sexually dimorphic traits, to phenotypic variability among females heterozygous for X-linked conditions, and to clinical abnormalities in patients with abnormal X chromosomes. Here, we present a comprehensive X-inactivation profile of the human X chromosome, representing an estimated 95% of assayable genes in fibroblast-based test systems. In total, about 15% of X-linked genes escape inactivation to some degree, and the proportion of genes escaping inactivation differs dramatically between different regions of the X chromosome, reflecting the evolutionary history of the sex chromosomes. An additional 10% of X-linked genes show variable patterns of inactivation and are expressed to different extents from some inactive X chromosomes. This suggests a remarkable and previously unsuspected degree of expression heterogeneity among females.
Collapse
MESH Headings
- Alleles
- Chromosomes, Human, X/genetics
- Chromosomes, Human, Y/genetics
- CpG Islands/genetics
- Dosage Compensation, Genetic
- Female
- Fibroblasts
- Gene Expression Profiling
- Gene Expression Regulation/genetics
- Heterozygote
- Humans
- Male
- Phenotype
- RNA, Long Noncoding
- RNA, Messenger/analysis
- RNA, Messenger/genetics
- RNA, Untranslated/genetics
- Sex Characteristics
- Transcription, Genetic/genetics
Collapse
Affiliation(s)
- Laura Carrel
- Department of Biochemistry and Molecular Biology, Pennsylvania State University College of Medicine, Hershey, Pennsylvania 17033, USA.
| | | |
Collapse
|
48
|
Angrist M, Willard HF. The genome for the masses. Nat Genet 2005. [DOI: 10.1038/ng0205-114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
49
|
Basu J, Stromberg G, Compitello G, Willard HF, Van Bokkelen G. Rapid creation of BAC-based human artificial chromosome vectors by transposition with synthetic alpha-satellite arrays. Nucleic Acids Res 2005; 33:587-96. [PMID: 15673719 PMCID: PMC548352 DOI: 10.1093/nar/gki207] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
Efficient construction of BAC-based human artificial chromosomes (HACs) requires optimization of each key functional unit as well as development of techniques for the rapid and reliable manipulation of high-molecular weight BAC vectors. Here, we have created synthetic chromosome 17-derived alpha-satellite arrays, based on the 16-monomer repeat length typical of natural D17Z1 arrays, in which the consensus CENP-B box elements are either completely absent (0/16 monomers) or increased in density (16/16 monomers) compared to D17Z1 alpha-satellite (5/16 monomers). Using these vectors, we show that the presence of CENP-B box elements is a requirement for efficient de novo centromere formation and that increasing the density of CENP-B box elements may enhance the efficiency of de novo centromere formation. Furthermore, we have developed a novel, high-throughput methodology that permits the rapid conversion of any genomic BAC target into a HAC vector by transposon-mediated modification with synthetic alpha-satellite arrays and other key functional units. Taken together, these approaches offer the potential to significantly advance the utility of BAC-based HACs for functional annotation of the genome and for applications in gene transfer.
Collapse
Affiliation(s)
- Joydeep Basu
- Institute for Genome Sciences and Policy, Duke University CIEMAS Room 2379, 101 Science Drive, Durham, NC 27708, USA.
| | | | | | | | | |
Collapse
|
50
|
Chadwick BP, Willard HF. Multiple spatially distinct types of facultative heterochromatin on the human inactive X chromosome. Proc Natl Acad Sci U S A 2004; 101:17450-5. [PMID: 15574503 PMCID: PMC534659 DOI: 10.1073/pnas.0408021101] [Citation(s) in RCA: 176] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2004] [Indexed: 11/18/2022] Open
Abstract
Heterochromatin is defined classically by condensation throughout the cell cycle, replication in late S phase and gene inactivity. Facultative heterochromatin is of particular interest, because its formation is developmentally regulated as a result of cellular differentiation. The most extensive example of facultative heterochromatin is the mammalian inactive X chromosome (Xi). A variety of histone variants and covalent histone modifications have been implicated in defining the organization of the Xi heterochromatic state, and the features of Xi heterochromatin have been widely interpreted as reflecting a redundant system of gene silencing. However, here we demonstrate that the human Xi is packaged into at least two nonoverlapping heterochromatin types, each characterized by specific Xi features: one defined by the presence of Xi-specific transcript RNA, the histone variant macroH2A, and histone H3 trimethylated at lysine 27 and the other defined by H3 trimethylated at lysine 9, heterochromatin protein 1, and histone H4 trimethylated at lysine 20. Furthermore, regions of the Xi packaged in different heterochromatin types are characterized by different patterns of replication in late S phase. The arrangement of facultative heterochromatin into spatially and temporally distinct domains has implications for both the establishment and maintenance of the Xi and adds a previously unsuspected degree of epigenetic complexity.
Collapse
Affiliation(s)
- Brian P Chadwick
- Institute for Genome Sciences and Policy, Department of Molecular Genetics and Microbiology, Duke University, Durham, NC 27708, USA
| | | |
Collapse
|