1
|
Abstract
During the course of evolution the human brain has increased in size and complexity, ultimately these differences are the result of changes at the genetic level. Identifying and characterizing molecular evolution requires an understanding of both the genetic underpinning of the system as well as the comparative genetic tools to identify signatures of selection. This chapter aims to describe our current understanding of the genetics of human brain evolution. Primarily this is the story of the evolution of the human brain since our last common ape ancestor, but where relevant we will also discuss changes that are unique to the primate brain (compared to other mammals) or various other lineages in the evolution of humans more generally. It will focus on genetic changes that both directly affected the development and function of the brain as well as those that have indirectly influenced brain evolution through both prenatal and postnatal environment. This review is not meant to be exhaustive, but rather to begin to construct a general framework for understanding the full array of data being generated.
Collapse
Affiliation(s)
- Eric J Vallender
- University of Mississippi Medical Center, Jackson, MS, United States; Tulane National Primate Research Center, Covington, LA, United States.
| |
Collapse
|
2
|
DiStefano JK, Kingsley CB. Identification of Disease Susceptibility Alleles in the Next Generation Sequencing Era. Methods Mol Biol 2018; 1706:3-16. [PMID: 29423790 DOI: 10.1007/978-1-4939-7471-9_1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
The development of next generation sequencing (NGS) technologies has transformed the study of human genetic variation. In less than a decade, NGS has facilitated the discovery of causal mutations in both rare, monogenic diseases and common, heterogeneous disorders, leading to unprecedented improvements in disease diagnosis and treatment strategies. Given the rapid evolution of NGS platforms, it is now possible to analyze whole genomes and exomes quickly and affordably. Further, emerging NGS applications, such as single-cell sequencing, have the power to address specific issues like somatic variation, which is yielding new insights into the role of somatic mutations in cancer and late-onset diseases. Despite limitations associated with current iterations of NGS technologies, the impact of this approach on identifying disease-causing variants has been significant. This chapter provides an overview of several NGS platforms and applications and discusses how these technologies can be used in concert with experimental and computational strategies to identify variants with a causative effect on disease development and progression.
Collapse
Affiliation(s)
- Johanna K DiStefano
- Translational Genomics Research Institute, 445 N 5th Street, Phoenix, AZ, 85004, USA.
| | | |
Collapse
|
3
|
The common marmoset genome provides insight into primate biology and evolution. Nat Genet 2014; 46:850-7. [PMID: 25038751 PMCID: PMC4138798 DOI: 10.1038/ng.3042] [Citation(s) in RCA: 170] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2013] [Accepted: 06/27/2014] [Indexed: 02/06/2023]
Abstract
We report the whole-genome sequence of the common marmoset (Callithrix jacchus). The 2.26-Gb genome of a female marmoset was assembled using Sanger read data (6×) and a whole-genome shotgun strategy. A first analysis has permitted comparison with the genomes of apes and Old World monkeys and the identification of specific features that might contribute to the unique biology of this diminutive primate, including genetic changes that may influence body size, frequent twinning and chimerism. We observed positive selection in growth hormone/insulin-like growth factor genes (growth pathways), respiratory complex I genes (metabolic pathways), and genes encoding immunobiological factors and proteases (reproductive and immunity pathways). In addition, both protein-coding and microRNA genes related to reproduction exhibited evidence of rapid sequence evolution. This genome sequence for a New World monkey enables increased power for comparative analyses among available primate genomes and facilitates biomedical research application.
Collapse
|
4
|
del Rosario RCH, Rayan NA, Prabhakar S. Noncoding origins of anthropoid traits and a new null model of transposon functionalization. Genome Res 2014; 24:1469-84. [PMID: 25043600 PMCID: PMC4158753 DOI: 10.1101/gr.168963.113] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Little is known about novel genetic elements that drove the emergence of anthropoid primates. We exploited the sequencing of the marmoset genome to identify 23,849 anthropoid-specific constrained (ASC) regions and confirmed their robust functional signatures. Of the ASC base pairs, 99.7% were noncoding, suggesting that novel anthropoid functional elements were overwhelmingly cis-regulatory. ASCs were highly enriched in loci associated with fetal brain development, motor coordination, neurotransmission, and vision, thus providing a large set of candidate elements for exploring the molecular basis of hallmark primate traits. We validated ASC192 as a primate-specific enhancer in proliferative zones of the developing brain. Unexpectedly, transposable elements (TEs) contributed to >56% of ASCs, and almost all TE families showed functional potential similar to that of nonrepetitive DNA. Three L1PA repeat-derived ASCs displayed coherent eye-enhancer function, thus demonstrating that the "gene-battery" model of TE functionalization applies to enhancers in vivo. Our study provides fundamental insights into genome evolution and the origins of anthropoid phenotypes and supports an elegantly simple new null model of TE exaptation.
Collapse
Affiliation(s)
- Ricardo C H del Rosario
- Computational and Systems Biology, Genome Institute of Singapore, #02-01 Genome, Singapore 138672
| | - Nirmala Arul Rayan
- Computational and Systems Biology, Genome Institute of Singapore, #02-01 Genome, Singapore 138672
| | - Shyam Prabhakar
- Computational and Systems Biology, Genome Institute of Singapore, #02-01 Genome, Singapore 138672
| |
Collapse
|
5
|
Podsiadło A, Wrzesień M, Paja W, Rudnicki W, Wilczyński B. Active enhancer positions can be accurately predicted from chromatin marks and collective sequence motif data. BMC SYSTEMS BIOLOGY 2013; 7 Suppl 6:S16. [PMID: 24565409 PMCID: PMC4029456 DOI: 10.1186/1752-0509-7-s6-s16] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
BACKGROUND Transcriptional regulation in multi-cellular organisms is a complex process involving multiple modular regulatory elements for each gene. Building whole-genome models of transcriptional networks requires mapping all relevant enhancers and then linking them to target genes. Previous methods of enhancer identification based either on sequence information or on epigenetic marks have different limitations stemming from incompleteness of each of these datasets taken separately. RESULTS In this work we present a new approach for discovery of regulatory elements based on the combination of sequence motifs and epigenetic marks measured with ChIP-Seq. Our method uses supervised learning approaches to train a model describing the dependence of enhancer activity on sequence features and histone marks. Our results indicate that using combination of features provides superior results to previous approaches based on either one of the datasets. While histone modifications remain the dominant feature for accurate predictions, the models based on sequence motifs have advantages in their general applicability to different tissues. Additionally, we assess the relevance of different sequence motifs in prediction accuracy showing that even tissue-specific enhancer activity depends on multiple motifs. CONCLUSIONS Based on our results, we conclude that it is worthwhile to include sequence motif data into computational approaches to active enhancer prediction and also that classifiers trained on a specific set of enhancers can generalize with significant accuracy beyond the training set.
Collapse
Affiliation(s)
- Agnieszka Podsiadło
- Institute of Informatics, University of Warsaw, Banacha 2, 02-097 Warsaw, Poland
| | - Mariusz Wrzesień
- University of Information Technology and Management in Rzeszów, Sucharskiego 2, 35-225 Rzeszów, Poland
| | - Wiesław Paja
- University of Information Technology and Management in Rzeszów, Sucharskiego 2, 35-225 Rzeszów, Poland
| | - Witold Rudnicki
- Interdisciplinary Centre for Mathematical and Computational Modelling, University of Warsaw, Pawińskiego 5A, 02-106 Warsaw, Poland
| | - Bartek Wilczyński
- Institute of Informatics, University of Warsaw, Banacha 2, 02-097 Warsaw, Poland
| |
Collapse
|
6
|
Enard W. Functional primate genomics—leveraging the medical potential. J Mol Med (Berl) 2012; 90:471-80. [DOI: 10.1007/s00109-012-0901-4] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2012] [Revised: 04/04/2012] [Accepted: 04/05/2012] [Indexed: 10/28/2022]
|
7
|
MLL fusion proteins preferentially regulate a subset of wild-type MLL target genes in the leukemic genome. Blood 2011; 117:6895-905. [PMID: 21518926 DOI: 10.1182/blood-2010-12-324699] [Citation(s) in RCA: 101] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
MLL encodes a histone methyltransferase that is critical in maintaining gene expression during embryonic development and hematopoiesis. 11q23 translocations result in the formation of chimeric MLL fusion proteins that act as potent drivers of acute leukemia. However, it remains unclear what portion of the leukemic genome is under the direct control of MLL fusions. By comparing patient-derived leukemic cell lines, we find that MLL fusion-bound genes are a small subset of that recognized by wild-type MLL. In an inducible MLL-ENL model, MLL fusion protein binding and changes in H3K79 methylation are limited to a specific portion of the genome, whereas wild-type MLL distributes to a much larger set of gene loci. Surprisingly, among 223 MLL-ENL-bound genes, only 12 demonstrate a significant increase in mRNA expression on induction of the fusion protein. In addition to Hoxa9 and Meis1, this includes Eya1 and Six1, which comprise a heterodimeric transcription factor important in several developmental pathways. We show that Eya1 has the capacity to immortalize hematopoietic progenitor cells in vitro and collaborates with Six1 in hematopoietic transformation assays. Altogether, our data suggest that MLL fusions contribute to the development of acute leukemia through direct activation of a small set of target genes.
Collapse
|
8
|
Kingsley CB. Identification of causal sequence variants of disease in the next generation sequencing era. Methods Mol Biol 2011; 700:37-46. [PMID: 21204025 DOI: 10.1007/978-1-61737-954-3_3] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Over the last decade, genetic studies have identified numerous associations between single nucleotide polymorphism (SNP) alleles in the human genome and important human diseases. Unfortunately, extending these initial associative findings to identification of the true causal variants that underlie disease susceptibility is usually not a straightforward task. Causal variant identification typically involves searching through sizable regions of genomic DNA in the vicinity of disease-associated SNPs for sequence variants in functional elements including protein coding, regulatory, and structural sequences. Prioritization of these searches is greatly aided by knowledge of the location of functional sequences in the human genome. This chapter briefly reviews several of the common approaches used to functionally annotate the human genome and discusses how this information can be used in concert with the emerging technology of next generation high-throughput sequencing to identify causal variants of human disease.
Collapse
Affiliation(s)
- Christopher B Kingsley
- Diabetes, Cardiovascular, and Metabolic Diseases Division, Translational Genomics Research Institute, Phoenix, AZ, USA.
| |
Collapse
|
9
|
Affiliation(s)
- Keyue Ding
- From the Division of Cardiovascular Diseases, Mayo Clinic, Rochester, Minn
| | - Iftikhar J. Kullo
- From the Division of Cardiovascular Diseases, Mayo Clinic, Rochester, Minn
| |
Collapse
|
10
|
Babin PJ, Gibbons GF. The evolution of plasma cholesterol: direct utility or a "spandrel" of hepatic lipid metabolism? Prog Lipid Res 2008; 48:73-91. [PMID: 19049814 DOI: 10.1016/j.plipres.2008.11.002] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2008] [Revised: 11/06/2008] [Accepted: 11/07/2008] [Indexed: 02/07/2023]
Abstract
Fats provide a concentrated source of energy for multicellular organisms. The efficient transport of fats through aqueous biological environments raises issues concerning effective delivery to target tissues. Furthermore, the utilization of fatty acids presents a high risk of cytotoxicity. Improving the efficiency of fat transport while simultaneously minimizing the cytotoxic risk confers distinct selective advantages. In humans, most of the plasma cholesterol is associated with low-density lipoprotein (LDL), a metabolic by-product of very-low-density lipoprotein (VLDL), which originates in the liver. However, the functions of VLDL are not clear. This paper reviews the evidence that LDL arose as a by-product during the natural selection of VLDL. The latter, in turn, evolved as a means of improving the efficiency of diet-derived fatty acid storage and utilization, as well as neutralizing the potential cytotoxicity of fatty acids while conserving their advantages as a concentrated energy source. The evolutionary biology of lipid transport processes has provided a fascinating insight into how and why these VLDL functions emerged during animal evolution. As causes of historical origin must be separated from current utilities, our spandrel-LDL theory proposes that LDL is a spandrel of VLDL selection, which appeared non-adaptively and may later have become crucial for vertebrate fitness.
Collapse
Affiliation(s)
- Patrick J Babin
- Université Bordeaux 1, Génomique et Physiologie des Poissons, UMR NuAGe, 33405 Talence, France
| | | |
Collapse
|
11
|
De S, Lopez-Bigas N, Teichmann SA. Patterns of evolutionary constraints on genes in humans. BMC Evol Biol 2008; 8:275. [PMID: 18840274 PMCID: PMC2587479 DOI: 10.1186/1471-2148-8-275] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2008] [Accepted: 10/07/2008] [Indexed: 12/27/2022] Open
Abstract
Background Different regions in a genome evolve at different rates depending on structural and functional constraints. Some genomic regions are highly conserved during metazoan evolution, while other regions may evolve rapidly, either in all species or in a lineage-specific manner. A strong or even moderate change in constraints in functional regions, for example in coding regions, can have significant evolutionary consequences. Results Here we discuss a novel framework, 'BaseDiver', to classify groups of genes in humans based on the patterns of evolutionary constraints on polymorphic positions in their coding regions. Comparing the nucleotide-level divergence among mammals with the extent of deviation from the ancestral base in the human lineage, we identify patterns of evolutionary pressure on nonsynonymous base-positions in groups of genes belonging to the same functional category. Focussing on groups of genes in functional categories, we find that transcription factors contain a significant excess of nonsynonymous base-positions that are conserved in other mammals but changed in human, while immunity related genes harbour mutations at base-positions that evolve rapidly in all mammals including humans due to strong preference for advantageous alleles. Genes involved in olfaction also evolve rapidly in all mammals, and in humans this appears to be due to weak negative selection. Conclusion While recent studies have identified genes under positive selection in humans, our approach identifies evolutionary constraints on Gene Ontology groups identifying changes in humans relative to some of the other mammals.
Collapse
Affiliation(s)
- Subhajyoti De
- MRC Laboratory of Molecular Biology, Hills Road, Cambridge, CB2 2QH, UK.
| | | | | |
Collapse
|
12
|
Molecular population genetics of PCSK9: a signature of recent positive selection. Pharmacogenet Genomics 2008; 18:169-79. [PMID: 18300938 DOI: 10.1097/fpc.0b013e3282f44d99] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
OBJECTIVE Proprotein convertase subtilisin-like kexin type 9 (PCSK9) is a newly discovered serine protease that plays a key role in regulating plasma low density lipoprotein (LDL) cholesterol levels. Both rare mutations and common variants in the coding regions of PCSK9 affect LDL cholesterol levels and coronary heart disease risk, as well as response to lipid-lowering therapy. METHODS The authors characterized the patterns of variation at the PCSK9 locus in African-Americans and European-Americans using resequenced data from the SeattleSNPs database (pga.gs.washington.edu). The authors performed a test of population differentiation and the long-range haplotype test to detect signatures of recent position selection on PCSK9. RESULTS A significantly high FST (a measure of population differentiation) between African-Americans and European-Americans was noted for single nucleotide polymorphism (SNP) rs505151 (FST=0.309). The long-range haplotype test was suggestive of non-neutral evolution of two SNPs, rs505151 and rs562556 that are associated with elevated LDL cholesterol levels ('gain-of-function' mutations). The modes of selection were different between African-Americans and European-Americans. The authors observed signals of recent positive selection on the ancestral allele of nonsynonymous SNP rs505151 (E670G, P=0.0227 and P=0.0001 in theoretical and empirical distribution, respectively) and the derived allele of nonsynonymous SNP rs562556 (I474V, P=0.0227 and 0.0001) in African-Americans, whereas in European-Americans the ancestral allele of SNP rs562556 (P=0.1320 and 0.0370) appeared to be under positive selection. CONCLUSION The authors' findings suggest that evolutionary dynamics may underlie the 'gain-of-function' mutations in PCSK9 that are associated with higher LDL cholesterol levels.
Collapse
|
13
|
Approaches to comparative sequence analysis: towards a functional view of vertebrate genomes. Nat Rev Genet 2008; 9:303-13. [PMID: 18347593 DOI: 10.1038/nrg2185] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
The comparison of genomic sequences is now a common approach to identifying and characterizing functional regions in vertebrate genomes. However, for theoretical reasons and because of practical issues, the generation of these data sets is non-trivial and can have many pitfalls. We are currently seeing an explosion of comparative sequence data, the benefits and limitations of which need to be disseminated to the scientific community. This Review provides a critical overview of the different types of sequence data that are available for analysis and of contemporary comparative sequence analysis methods, highlighting both their strengths and limitations. Approaches to determining the biological significance of constrained sequence are also explored.
Collapse
|
14
|
Brudno M, Poliakov A, Minovitsky S, Ratnere I, Dubchak I. Multiple whole genome alignments and novel biomedical applications at the VISTA portal. Nucleic Acids Res 2007; 35:W669-74. [PMID: 17488840 PMCID: PMC1933192 DOI: 10.1093/nar/gkm279] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The VISTA portal for comparative genomics is designed to give biomedical scientists a unified set of tools to lead them from the raw DNA sequences through the alignment and annotation to the visualization of the results. The VISTA portal also hosts the alignments of a number of genomes computed by our group, allowing users to study the regions of their interest without having to manually download the individual sequences. Here we describe various algorithmic and functional improvements implemented in the VISTA portal over the last 2 years. The VISTA Portal is accessible at http://genome.lbl.gov/vista.
Collapse
Affiliation(s)
- Michael Brudno
- Department of Computer Science, Banting and Best Department of Medical Research, University of Toronto, 6 King's College Road, Toronto, ON, Canada M5S 3G4, Genomics Division, MS 84-171, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA and US Department of Energy Joint Genome Institute, Walnut Creek, CA 94598, USA
| | - Alexander Poliakov
- Department of Computer Science, Banting and Best Department of Medical Research, University of Toronto, 6 King's College Road, Toronto, ON, Canada M5S 3G4, Genomics Division, MS 84-171, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA and US Department of Energy Joint Genome Institute, Walnut Creek, CA 94598, USA
| | - Simon Minovitsky
- Department of Computer Science, Banting and Best Department of Medical Research, University of Toronto, 6 King's College Road, Toronto, ON, Canada M5S 3G4, Genomics Division, MS 84-171, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA and US Department of Energy Joint Genome Institute, Walnut Creek, CA 94598, USA
| | - Igor Ratnere
- Department of Computer Science, Banting and Best Department of Medical Research, University of Toronto, 6 King's College Road, Toronto, ON, Canada M5S 3G4, Genomics Division, MS 84-171, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA and US Department of Energy Joint Genome Institute, Walnut Creek, CA 94598, USA
| | - Inna Dubchak
- Department of Computer Science, Banting and Best Department of Medical Research, University of Toronto, 6 King's College Road, Toronto, ON, Canada M5S 3G4, Genomics Division, MS 84-171, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA and US Department of Energy Joint Genome Institute, Walnut Creek, CA 94598, USA
- *To whom correspondence should be addressed. +1-510-495-2419+1-510-486-5614
| |
Collapse
|
15
|
Wang QF, Prabhakar S, Chanan S, Cheng JF, Rubin EM, Boffelli D. Detection of weakly conserved ancestral mammalian regulatory sequences by primate comparisons. Genome Biol 2007; 8:R1. [PMID: 17201929 PMCID: PMC1839124 DOI: 10.1186/gb-2007-8-1-r1] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2006] [Revised: 11/15/2006] [Accepted: 01/03/2007] [Indexed: 02/18/2023] Open
Abstract
BACKGROUND Genomic comparisons between human and distant, non-primate mammals are commonly used to identify cis-regulatory elements based on constrained sequence evolution. However, these methods fail to detect functional elements that are too weakly conserved among mammals to distinguish them from non-functional DNA. RESULTS To evaluate a strategy for large scale genome annotation that is complementary to the commonly used distal species comparisons, we explored the potential of deep intra-primate sequence comparisons. We sequenced the orthologs of 558 kb of human genomic sequence, covering multiple loci involved in cholesterol homeostasis, in 6 non-human primates. Our analysis identified six non-coding DNA elements displaying significant conservation among primates but undetectable in more distant comparisons. In vitro and in vivo tests revealed that at least three of these six elements have regulatory function. Notably, the mouse orthologs of these three functional human sequences had regulatory activity despite their lack of significant sequence conservation, indicating that they are ancestral mammalian cis-regulatory elements. These regulatory elements could be detected even in a smaller set of three primate species including human, rhesus and marmoset. CONCLUSION We have demonstrated that intra-primate sequence comparisons can be used to identify functional modules in large genomic regions, including cis-regulatory elements that are not detectable through comparison with non-mammalian genomes. With the available human and rhesus genomes and that of marmoset, which is being actively sequenced, this strategy can be extended to the whole genome in the near future.
Collapse
Affiliation(s)
- Qian-fei Wang
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720 USA
- US Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA
- Current address: Section of Hematology/Oncology, Department of Medicine, University of Chicago, 5814 S. Ellis Avenue, Chicago, IL 60637, USA
| | - Shyam Prabhakar
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720 USA
- US Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA
| | - Sumita Chanan
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720 USA
| | - Jan-Fang Cheng
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720 USA
- US Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA
| | - Edward M Rubin
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720 USA
- US Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA
| | - Dario Boffelli
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720 USA
- US Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA
| |
Collapse
|