1
|
Zhimulev I, Vatolina T, Levitsky V, Tsukanov A. Developmental and Housekeeping Genes: Two Types of Genetic Organization in the Drosophila Genome. Int J Mol Sci 2024; 25:4068. [PMID: 38612878 PMCID: PMC11012173 DOI: 10.3390/ijms25074068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 04/01/2024] [Accepted: 04/02/2024] [Indexed: 04/14/2024] Open
Abstract
We developed a procedure for locating genes on Drosophila melanogaster polytene chromosomes and described three types of chromosome structures (gray bands, black bands, and interbands), which differed markedly in morphological and genetic properties. This was reached through the use of our original methods of molecular and genetic analysis, electron microscopy, and bioinformatics data processing. Analysis of the genome-wide distribution of these properties led us to a bioinformatics model of the Drosophila genome organization, in which the genome was divided into two groups of genes. One was constituted by 65, in which the genome was divided into two groups, 62 genes that are expressed in most cell types during life cycle and perform basic cellular functions (the so-called "housekeeping genes"). The other one was made up of 3162 genes that are expressed only at particular stages of development ("developmental genes"). These two groups of genes are so different that we may state that the genome has two types of genetic organization. Different are the timings of their expression, chromatin packaging levels, the composition of activating and deactivating proteins, the sizes of these genes, the lengths of their introns, the organization of the promoter regions of the genes, the locations of origin recognition complexes (ORCs), and DNA replication timings.
Collapse
Affiliation(s)
- Igor Zhimulev
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Science, 630090 Novosibirsk, Russia;
| | - Tatyana Vatolina
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Science, 630090 Novosibirsk, Russia;
| | - Victor Levitsky
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Science, 630090 Novosibirsk, Russia; (V.L.); (A.T.)
| | - Anton Tsukanov
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Science, 630090 Novosibirsk, Russia; (V.L.); (A.T.)
| |
Collapse
|
2
|
Kimura A, Go AC, Markow T, Ranz JM. Evidence of Nonrandom Patterns of Functional Chromosome Organization in Danaus plexippus. Genome Biol Evol 2024; 16:evae054. [PMID: 38488057 PMCID: PMC10972686 DOI: 10.1093/gbe/evae054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/13/2024] [Indexed: 05/01/2024] Open
Abstract
Our understanding on the interplay between gene functionality and gene arrangement at different chromosome scales relies on a few Diptera and the honeybee, species with quality reference genome assemblies, accurate gene annotations, and abundant transcriptome data. Using recently generated 'omic resources in the monarch butterfly Danaus plexippus, a species with many more and smaller chromosomes relative to Drosophila species and the honeybee, we examined the organization of genes preferentially expressed at broadly defined developmental stages (larva, pupa, adult males, and adult females) at both fine and whole-chromosome scales. We found that developmental stage-regulated genes do not form more clusters, but do form larger clusters, than expected by chance, a pattern consistent across the gene categories examined. Notably, out of the 30 chromosomes in the monarch genome, 12 of them, plus the fraction of the chromosome Z that corresponds to the ancestral Z in other Lepidoptera, were found enriched for developmental stage-regulated genes. These two levels of nonrandom gene organization are not independent as enriched chromosomes for developmental stage-regulated genes tend to harbor disproportionately large clusters of these genes. Further, although paralogous genes were overrepresented in gene clusters, their presence is not enough to explain two-thirds of the documented cases of whole-chromosome enrichment. The composition of the largest clusters often included paralogs from more than one multigene family as well as unrelated single-copy genes. Our results reveal intriguing patterns at the whole-chromosome scale in D. plexippus while shedding light on the interplay between gene expression and chromosome organization beyond Diptera and Hymenoptera.
Collapse
Affiliation(s)
- Ashlyn Kimura
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA 92647, USA
| | - Alwyn C Go
- Department of Biology, University of Winnipeg, Winnipeg, MB R3B 2E9, Canada
| | - Therese Markow
- Unidad de Genómica Avanzada (Langebio), CINVESTAV, Irapuato, GTO 36824, México
- Section of Cell and Developmental Biology, Division of Biological Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - José M Ranz
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, CA 92647, USA
| |
Collapse
|
3
|
Piya AA, DeGiorgio M, Assis R. Predicting gene expression divergence between single-copy orthologs in two species. Genome Biol Evol 2023; 15:evad078. [PMID: 37170892 PMCID: PMC10220509 DOI: 10.1093/gbe/evad078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 04/21/2023] [Accepted: 05/02/2023] [Indexed: 05/13/2023] Open
Abstract
Predicting gene expression divergence is integral to understanding the emergence of new biological functions and associated traits. Whereas several sophisticated methods have been developed for this task, their applications are either limited to duplicate genes or require expression data from more than two species. Thus, here we present PiXi, the first machine learning framework for predicting gene expression divergence between single-copy orthologs in two species. PiXi models gene expression evolution as an Ornstein-Uhlenbeck process, and overlays this model with multi-layer neural network, random forest, and support vector machine architectures for making predictions. It outputs the predicted class "conserved" or "diverged" for each pair of orthologs, as well as their predicted expression optima in the two species. We show that PiXi has high power and accuracy in predicting gene expression divergence between single-copy orthologs, as well as high accuracy and precision in estimating their expression optima in the two species, across a wide range of evolutionary scenarios, with the globally best performance achieved by a multi-layer neural network. Moreover, application of our best performing PiXi predictor to empirical gene expression data from single-copy orthologs residing at different loci in two species of Drosophila reveals that approximately 23% underwent expression divergence after positional relocation. Further analysis shows that several of these "diverged" genes are involved in the electron transport chain of the mitochondrial membrane, suggesting that new chromatin environments may impact energy production in Drosophila. Thus, by providing a toolkit for predicting gene expression divergence between single-copy orthologs in two species, PiXi can shed light on the origins of novel phenotypes across diverse biological processes and study systems.
Collapse
Affiliation(s)
- Antara Anika Piya
- Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FloridaUSA
| | - Michael DeGiorgio
- Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FloridaUSA
| | - Raquel Assis
- Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FloridaUSA
- Institute for Human Health and Disease Intervention, Florida Atlantic University, Boca Raton, FloridaUSA
| |
Collapse
|
4
|
Shi T, Huneau C, Zhang Y, Li Y, Chen J, Salse J, Wang Q. The slow-evolving Acorus tatarinowii genome sheds light on ancestral monocot evolution. Nat Plants 2022; 8:764-777. [PMID: 35835857 PMCID: PMC9300462 DOI: 10.1038/s41477-022-01187-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Accepted: 05/30/2022] [Indexed: 05/03/2023]
Abstract
Monocots are one of the most diverse groups of flowering plants, and tracing the evolution of their ancestral genome into modern species is essential for understanding their evolutionary success. Here, we report a high-quality assembly of the Acorus tatarinowii genome, a species that diverged early from all the other monocots. Genome-wide comparisons with a range of representative monocots characterized Acorus as a slowly evolved genome with one whole-genome duplication. Our inference of the ancestral monocot karyotypes provides new insights into the chromosomal evolutionary history assigned to modern species and reveals the probable molecular functions and processes related to the early adaptation of monocots to wetland or aquatic habitats (that is, low levels of inorganic phosphate, parallel leaf venation and ephemeral primary roots). The evolution of ancestral gene order in monocots is constrained by gene structural and functional features. The newly obtained Acorus genome offers crucial evidence for delineating the origin and diversification of monocots, including grasses.
Collapse
Affiliation(s)
- Tao Shi
- CAS Key Laboratory of Aquatic Botany and Watershed Ecology, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
| | - Cécile Huneau
- UCA, INRAE, UMR 1095 GDEC (Genetics, Diversity & Ecophysiology of Cereals), Clermont-Ferrand, France
| | - Yue Zhang
- CAS Key Laboratory of Aquatic Botany and Watershed Ecology, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yan Li
- CAS Key Laboratory of Aquatic Botany and Watershed Ecology, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jinming Chen
- CAS Key Laboratory of Aquatic Botany and Watershed Ecology, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China.
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China.
| | - Jérôme Salse
- UCA, INRAE, UMR 1095 GDEC (Genetics, Diversity & Ecophysiology of Cereals), Clermont-Ferrand, France.
| | - Qingfeng Wang
- CAS Key Laboratory of Aquatic Botany and Watershed Ecology, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China.
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China.
- Sino-African Joint Research Center, Chinese Academy of Sciences, Wuhan, China.
| |
Collapse
|
5
|
Zinani OQH, Keseroğlu K, Özbudak EM. Regulatory mechanisms ensuring coordinated expression of functionally related genes. Trends Genet 2022; 38:73-81. [PMID: 34376301 PMCID: PMC8678166 DOI: 10.1016/j.tig.2021.07.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 07/12/2021] [Accepted: 07/14/2021] [Indexed: 01/03/2023]
Abstract
Coordinated spatiotemporal expression of large sets of genes is required for the development and homeostasis of organisms. To achieve this goal, organisms use myriad strategies where they form operons, utilize bidirectional promoters, cluster genes, share enhancers among genes by DNA looping, and form topologically associated domains and transcriptional condensates. Coexpression achieved by these different strategies is hypothesized to have functional importance in minimizing gene expression variability, establishing dosage balance to ensure stoichiometry of protein complexes, and minimizing accumulation of toxic intermediate metabolites. By combining gene-editing tools with computational modeling, recent studies tested the advantages of adjacent genes located in pairs and clusters. We propose that with the advancement of gene editing, single-cell sequencing, and imaging tools, one could readily test the functional importance of different coexpression strategies in a variety of biological processes.
Collapse
Affiliation(s)
- Oriana Q H Zinani
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA; Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA
| | - Kemal Keseroğlu
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA
| | - Ertuğrul M Özbudak
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229, USA; Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA.
| |
Collapse
|
6
|
Ovens K, Eames BF, McQuillan I. Comparative Analyses of Gene Co-expression Networks: Implementations and Applications in the Study of Evolution. Front Genet 2021; 12:695399. [PMID: 34484293 PMCID: PMC8414652 DOI: 10.3389/fgene.2021.695399] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 07/19/2021] [Indexed: 11/13/2022] Open
Abstract
Similarities and differences in the associations of biological entities among species can provide us with a better understanding of evolutionary relationships. Often the evolution of new phenotypes results from changes to interactions in pre-existing biological networks and comparing networks across species can identify evidence of conservation or adaptation. Gene co-expression networks (GCNs), constructed from high-throughput gene expression data, can be used to understand evolution and the rise of new phenotypes. The increasing abundance of gene expression data makes GCNs a valuable tool for the study of evolution in non-model organisms. In this paper, we cover motivations for why comparing these networks across species can be valuable for the study of evolution. We also review techniques for comparing GCNs in the context of evolution, including local and global methods of graph alignment. While some protein-protein interaction (PPI) bioinformatic methods can be used to compare co-expression networks, they often disregard highly relevant properties, including the existence of continuous and negative values for edge weights. Also, the lack of comparative datasets in non-model organisms has hindered the study of evolution using PPI networks. We also discuss limitations and challenges associated with cross-species comparison using GCNs, and provide suggestions for utilizing co-expression network alignments as an indispensable tool for evolutionary studies going forward.
Collapse
Affiliation(s)
- Katie Ovens
- Augmented Intelligence & Precision Health Laboratory (AIPHL), Research Institute of the McGill University Health Centre, Montreal, QC, Canada
| | - B. Frank Eames
- Department of Anatomy, Physiology, & Pharmacology, University of Saskatchewan, Saskatoon, SK, Canada
| | - Ian McQuillan
- Department of Computer Science, University of Saskatchewan, Saskatoon, SK, Canada
| |
Collapse
|
7
|
Abstract
Genes are not randomly distributed on eukaryotic chromosomes. Some neighboring genes show order conservation among species, while some neighboring genes separate during evolution even though their neighborhoods are conserved in some species. Here, I investigated whether after-separation gene repositioning is under natural selection for evolutionary conserved gene neighborhoods compared with nonconserved neighborhoods. After separation, genes with conserved neighborhoods show low-expression divergence between the after-separation species and the before-separation species. After genes separate from their conserved gene neighbors, their after-separation gene neighbors tend to show coexpression and coprotein complex with their before-separation gene neighbors. These results indicate evolutionary constraints on the selection of neighboring genes after evolutionary conserved gene neighborhoods separate.
Collapse
Affiliation(s)
- Zhiming Dai
- School of Data and Computer Science, Sun Yat-Sen University, Guangzhou, China.,Guangdong Province Key Laboratory of Big Data Analysis and Processing, Sun Yat-Sen University, Guangzhou, China
| |
Collapse
|
8
|
Mossman JA, Biancani LM, Rand DM. Mitochondrial genomic variation drives differential nuclear gene expression in discrete regions of Drosophila gene and protein interaction networks. BMC Genomics 2019; 20:691. [PMID: 31477008 PMCID: PMC6719383 DOI: 10.1186/s12864-019-6061-y] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2019] [Accepted: 08/26/2019] [Indexed: 12/12/2022] Open
Abstract
Background Mitochondria perform many key roles in their eukaryotic hosts, from integrating signaling pathways through to modulating whole organism phenotypes. The > 1 billion years of nuclear and mitochondrial gene co-evolution has necessitated coordinated expression of gene products from both genomes that maintain mitochondrial, and more generally, eukaryotic cellular function. How mitochondrial DNA (mtDNA) variation modifies host fitness has proved a challenging question but has profound implications for evolutionary and medical genetics. In Drosophila, we have previously shown that recently diverged mtDNA haplotypes within-species can have more impact on organismal phenotypes than older, deeply diverged haplotypes from different species. Here, we tested the effects of mtDNA haplotype variation on gene expression in Drosophila under standardized conditions. Using the Drosophila Genetic Reference Panel (DGRP), we constructed a panel of mitonuclear genotypes that consists of factorial variation in nuclear and mtDNA genomes, with mtDNAs originating in D. melanogaster (2x haplotypes) and D. simulans (2x haplotypes). Results We show that mtDNA haplotype variation unequivocally alters nuclear gene expression in both females and males, and mitonuclear interactions are pervasive modifying factors for gene expression. There was appreciable overlap between the sexes for mtDNA-sensitive genes, and considerable transcriptional variation attributed to particular mtDNA contrasts. These genes are generally found in low-connectivity gene co-expression networks, occur in gene clusters along chromosomes, are often flanked by non-coding RNA, and are under-represented among housekeeping genes. Finally, we identify the giant (gt) transcription factor motif as a putative regulatory sequence associated with mtDNA-sensitive genes. Conclusions There are predictive conditions for nuclear genes that are influenced by mtDNA variation. Electronic supplementary material The online version of this article (10.1186/s12864-019-6061-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jim A Mossman
- Department of Ecology and Evolutionary Biology, Box G, Brown University, Providence, RI, 02912, USA.
| | - Leann M Biancani
- Department of Ecology and Evolutionary Biology, Box G, Brown University, Providence, RI, 02912, USA.,Present Address: Department of Biology, University of Maryland, College Park, MD, 20742, USA
| | - David M Rand
- Department of Ecology and Evolutionary Biology, Box G, Brown University, Providence, RI, 02912, USA.
| |
Collapse
|
9
|
Abstract
Motivation Genome rearrangements drastically change gene order along great stretches of a chromosome. There has been initial evidence that these apparently non-local events in the 1D sense may have breakpoints that are close in the 3D sense. We harness the power of the Double Cut and Join model of genome rearrangement, along with Hi-C chromosome conformation capture data to test this hypothesis between human and mouse. Results We devise novel statistical tests that show that indeed, rearrangement scenarios that transform the human into the mouse gene order are enriched for pairs of breakpoints that have frequent chromosome interactions. This is observed for both intra-chromosomal breakpoint pairs, as well as for inter-chromosomal pairs. For intra-chromosomal rearrangements, the enrichment exists from close (<20 Mb) to very distant (100 Mb) pairs. Further, the pattern exists across multiple cell lines in Hi-C data produced by different laboratories and at different stages of the cell cycle. We show that similarities in the contact frequencies between these many experiments contribute to the enrichment. We conclude that either (i) rearrangements usually involve breakpoints that are spatially close or (ii) there is selection against rearrangements that act on spatially distant breakpoints. Availability and implementation Our pipeline is freely available at https://bitbucket.org/thekswenson/locality. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Krister M Swenson
- Laboratoire d'Informatique, de Robotique, et de Microelectronique de Montpellier (LIRMM), Université Montpellier, Montpellier, France.,Centre Nationale de la Recherche Scientifique (CNRS), France
| | | |
Collapse
|
10
|
Lian S, Liu T, Zhang Z, Yuan H, Wang L, Cheng L. Neighboring genes are closely related to whole genome duplications after their separation. Interdiscip Sci 2019; 11:655-667. [PMID: 30877640 DOI: 10.1007/s12539-019-00321-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2018] [Revised: 01/25/2019] [Accepted: 01/27/2019] [Indexed: 11/26/2022]
Abstract
BACKGROUND The gene order in a eukaryotic genome is not random. Some neighboring genes show specific similarities, while others become separated during evolution. Whole genome duplication events (WGDs) have been recognized as an important evolutionary force. The potential relationship between the separation of neighboring genes and WGDs needs to be investigated. In this study, we investigated whether there is a potential relationship between separated neighboring gene pairs and WGDs, and the mechanism by which neighboring genes are separated. Additionally, we studied whether neighboring genes tend to show intrachromosomal colocalization after their neighborhood was disrupted and the factors facilitating the intrachromosomal colocalization of separated neighboring genes. RESULTS The separation of neighboring gene pairs is closely related to whole genome duplication events. Furthermore, we found that there is a double linear relationship between separated neighboring genes, total genes, and WGDs. The process of separation of neighboring genes caused by WGDs is also not random but abides by the double linear model. Separated neighboring gene pairs tend to show intrachromosomal colocalization. The conservativism of separated neighboring genes and histone modification facilitate the intrachromosomal colocalization of neighboring genes after their separation. CONCLUSIONS These results provide new insight into the understanding of evolutionary roles of locations and the relationship of neighboring gene pairs with whole genome duplications. Furthermore, understanding the proposed mechanism for intrachromosomal colocalization of separated genes benefits our knowledge of chromosomal interactions in the nucleus.
Collapse
Affiliation(s)
- Shuaibin Lian
- College of Physics and Electronic Engineering, Xinyang Normal University, Xinyang, China
| | - Tianliang Liu
- College of Physics and Electronic Engineering, Xinyang Normal University, Xinyang, China
| | - Zaibao Zhang
- College of Life Sciences, Xinyang Normal University, Xinyang, China
| | - Hongyu Yuan
- College of Life Sciences, Xinyang Normal University, Xinyang, China
| | - Lei Wang
- College of Life Sciences, Xinyang Normal University, Xinyang, China
| | - Lin Cheng
- College of Life Sciences, Xinyang Normal University, Xinyang, China.
| |
Collapse
|
11
|
Abstract
One of the major properties of genes is their expression pattern. Notably, genes are often classified as tissue specific or housekeeping. This property is of interest to molecular evolution as an explanatory factor of, e.g. evolutionary rate, as well as a functional feature which may in itself evolve. While many different methods of measuring tissue specificity have been proposed and used for such studies, there has been no comparison or benchmarking of these methods to our knowledge, and little justification of their use. In this study, we compare nine measures of tissue specificity. Most methods were established for ESTs and microarrays, and several were later adapted to RNA-seq. We analyse their capacity to distinguish gene categories, their robustness to the choice and number of tissues used and their capture of evolutionary conservation signal.
Collapse
Affiliation(s)
- Nadezda Kryuchkova-Mostacci
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Marc Robinson-Rechavi
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
12
|
Abstract
Genes are not randomly distributed in the genome. In humans, 10% of protein-coding genes are transcribed from bidirectional promoters and many more are organised in larger clusters. Intriguingly, neighbouring genes are frequently coexpressed but rarely functionally related. Here we show that coexpression of bidirectional gene pairs, and closeby genes in general, is buffered at the protein level. Taking into account the 3D architecture of the genome, we find that co-regulation of spatially close, functionally unrelated genes is pervasive at the transcriptome level, but does not extend to the proteome. We present evidence that non-functional mRNA coexpression in human cells arises from stochastic chromatin fluctuations and direct regulatory interference between spatially close genes. Protein-level buffering likely reflects a lack of coordination of post-transcriptional regulation of functionally unrelated genes. Grouping human genes together along the genome sequence, or through long-range chromosome folding, is associated with reduced expression noise. Our results support the hypothesis that the selection for noise reduction is a major driver of the evolution of genome organisation.
Collapse
Affiliation(s)
- Georg Kustatscher
- Wellcome Trust Centre for Cell Biology, University of Edinburgh, Edinburgh, UK
| | - Piotr Grabowski
- Chair of Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, Berlin, Germany
| | - Juri Rappsilber
- Wellcome Trust Centre for Cell Biology, University of Edinburgh, Edinburgh, UK .,Chair of Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, Berlin, Germany
| |
Collapse
|
13
|
Hug CB, Grimaldi AG, Kruse K, Vaquerizas JM. Chromatin Architecture Emerges during Zygotic Genome Activation Independent of Transcription. Cell 2017; 169:216-228.e19. [PMID: 28388407 DOI: 10.1016/j.cell.2017.03.024] [Citation(s) in RCA: 294] [Impact Index Per Article: 42.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2016] [Revised: 02/21/2017] [Accepted: 03/16/2017] [Indexed: 01/18/2023]
Abstract
Chromatin architecture is fundamental in regulating gene expression. To investigate when spatial genome organization is first established during development, we examined chromatin conformation during Drosophila embryogenesis and observed the emergence of chromatin architecture within a tight time window that coincides with the onset of transcription activation in the zygote. Prior to zygotic genome activation, the genome is mostly unstructured. Early expressed genes serve as nucleation sites for topologically associating domain (TAD) boundaries. Activation of gene expression coincides with the establishment of TADs throughout the genome and co-localization of housekeeping gene clusters, which remain stable in subsequent stages of development. However, the appearance of TAD boundaries is independent of transcription and requires the transcription factor Zelda for locus-specific TAD boundary insulation. These results offer insight into when spatial organization of the genome emerges and identify a key factor that helps trigger this architecture.
Collapse
Affiliation(s)
- Clemens B Hug
- Max Planck Institute for Molecular Biomedicine, Roentgenstrasse 20, 48149 Muenster, Germany
| | - Alexis G Grimaldi
- Max Planck Institute for Molecular Biomedicine, Roentgenstrasse 20, 48149 Muenster, Germany
| | - Kai Kruse
- Max Planck Institute for Molecular Biomedicine, Roentgenstrasse 20, 48149 Muenster, Germany
| | - Juan M Vaquerizas
- Max Planck Institute for Molecular Biomedicine, Roentgenstrasse 20, 48149 Muenster, Germany.
| |
Collapse
|
14
|
Gandara ACP, Torres A, Bahia AC, Oliveira PL, Schama R. Evolutionary origin and function of NOX4-art, an arthropod specific NADPH oxidase. BMC Evol Biol 2017; 17:92. [PMID: 28356077 PMCID: PMC5372347 DOI: 10.1186/s12862-017-0940-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2016] [Accepted: 03/16/2017] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND NADPH oxidases (NOX) are ROS producing enzymes that perform essential roles in cell physiology, including cell signaling and antimicrobial defense. This gene family is present in most eukaryotes, suggesting a common ancestor. To date, only a limited number of phylogenetic studies of metazoan NOXes have been performed, with few arthropod genes. In arthropods, only NOX5 and DUOX genes have been found and a gene called NOXm was found in mosquitoes but its origin and function has not been examined. In this study, we analyzed the evolution of this gene family in arthropods. A thorough search of genomes and transcriptomes was performed enabling us to browse most branches of arthropod phylogeny. RESULTS We have found that the subfamilies NOX5 and DUOX are present in all arthropod groups. We also show that a NOX gene, closely related to NOX4 and previously found only in mosquitoes (NOXm), can also be found in other taxonomic groups, leading us to rename it as NOX4-art. Although the accessory protein p22-phox, essential for NOX1-4 activation, was not found in any of the arthropods studied, NOX4-art of Aedes aegypti encodes an active protein that produces H2O2. Although NOX4-art has been lost in a number of arthropod lineages, it has all the domains and many signature residues and motifs necessary for ROS production and, when silenced, H2O2 production is considerably diminished in A. aegypti cells. CONCLUSIONS Combining all bioinformatic analyses and laboratory work we have reached interesting conclusions regarding arthropod NOX gene family evolution. NOX5 and DUOX are present in all arthropod lineages but it seems that a NOX2-like gene was lost in the ancestral lineage leading to Ecdysozoa. The NOX4-art gene originated from a NOX4-like ancestor and is functional. Although no p22-phox was observed in arthropods, there was no evidence of neo-functionalization and this gene probably produces H2O2 as in other metazoan NOX4 genes. Although functional and present in the genomes of many species, NOX4-art was lost in a number of arthropod lineages.
Collapse
Affiliation(s)
- Ana Caroline Paiva Gandara
- Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil.
| | - André Torres
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil
| | - Ana Cristina Bahia
- Instituto de Biofísica, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Pedro L Oliveira
- Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil.,Instituto Nacional de Ciência e Tecnologia em Entomologia Molecular - INCT-EM, Rio de Janeiro, Brazil
| | - Renata Schama
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil. .,Instituto Nacional de Ciência e Tecnologia em Entomologia Molecular - INCT-EM, Rio de Janeiro, Brazil.
| |
Collapse
|
15
|
Tong P, Monahan J, Prendergast JGD. Shared regulatory sites are abundant in the human genome and shed light on genome evolution and disease pleiotropy. PLoS Genet 2017; 13:e1006673. [PMID: 28282383 PMCID: PMC5365138 DOI: 10.1371/journal.pgen.1006673] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Revised: 03/24/2017] [Accepted: 03/07/2017] [Indexed: 12/16/2022] Open
Abstract
Large-scale gene expression datasets are providing an increasing understanding of the location of cis-eQTLs in the human genome and their role in disease. However, little is currently known regarding the extent of regulatory site-sharing between genes. This is despite it having potentially wide-ranging implications, from the determination of the way in which genetic variants may shape multiple phenotypes to the understanding of the evolution of human gene order. By first identifying the location of non-redundant cis-eQTLs, we show that regulatory site-sharing is a relatively common phenomenon in the human genome, with over 10% of non-redundant regulatory variants linked to the expression of multiple nearby genes. We show that these shared, local regulatory sites are linked to high levels of chromatin looping between the regulatory sites and their associated genes. In addition, these co-regulated gene modules are found to be strongly conserved across mammalian species, suggesting that shared regulatory sites have played an important role in shaping human gene order. The association of these shared cis-eQTLs with multiple genes means they also appear to be unusually important in understanding the genetics of human phenotypes and pleiotropy, with shared regulatory sites more often linked to multiple human phenotypes than other regulatory variants. This study shows that regulatory site-sharing is likely an underappreciated aspect of gene regulation and has important implications for the understanding of various biological phenomena, including how the two and three dimensional structures of the genome have been shaped and the potential causes of disease pleiotropy outside coding regions.
Collapse
Affiliation(s)
- Pin Tong
- Wellcome Trust Centre for Cell Biology and Institute of Cell Biology, School of Biological Sciences, The University of Edinburgh, Max Born Crescent, Edinburgh, Scotland, United Kingdom
| | - Jack Monahan
- The European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
| | - James G. D. Prendergast
- The Roslin Institute, The University of Edinburgh, Easter Bush, Midlothian, Scotland, United Kingdom
- * E-mail:
| |
Collapse
|
16
|
Abstract
Nested genes are the most common form of protein-coding overlap in eukaryotic genomes. Previous studies have shown that nested genes accumulate rapidly over evolutionary time, typically via the insertion of short young duplicate genes into long introns. However, the evolutionary relationship between nested genes remains unclear. Here, I compare RNA-seq expression profiles of nested, proximal intra-chromosomal, intermediate intra-chromosomal, distant intra-chromosomal, and inter-chromosomal gene pairs in two Drosophila species. I find that expression profiles of nested genes are more divergent than those of any other class of genes, supporting the hypothesis that concurrent expression of nested genes is deleterious due to transcriptional interference. Further analysis reveals that expression profiles of derived nested genes are more divergent than those of their ancestral un-nested orthologs, which are more divergent than those of un-nested genes with similar genomic features. Thus, gene expression divergence between nested genes is likely caused by selection against nesting of genes with insufficiently divergent expression profiles, as well as by continued expression divergence after nesting. Moreover, expression divergence and sequence evolutionary rates are elevated in young nested genes and reduced in old nested genes, indicating that a burst of rapid evolution occurs after nesting. Together, these findings suggest that similarity between expression profiles of nested genes is deleterious due to transcriptional interference, and that natural selection addresses this problem both by eradicating highly deleterious nestings and by enabling rapid expression divergence of surviving nested genes, thereby quickly limiting or abolishing transcriptional interference.
Collapse
Affiliation(s)
- Raquel Assis
- Department of Biology, Pennsylvania State University, University Park
| |
Collapse
|
17
|
Abstract
Genes encoded close to one another on the chromosome are often coexpressed, by a mechanism and regulatory logic that remain poorly understood. We surveyed the yeast genome for tandem gene pairs oriented tail-to-head at which expression antisense to the upstream gene was conserved across species. The intergenic region at most such tandem pairs is a bidirectional promoter, shared by the downstream gene mRNA and the upstream antisense transcript. Genomic analyses of these intergenic loci revealed distinctive patterns of transcription factor regulation. Mutation of a given transcription factor verified its role as a regulator in trans of tandem gene pair loci, including the proximally initiating upstream antisense transcript and downstream mRNA and the distally initiating upstream mRNA. To investigate cis-regulatory activity at such a locus, we focused on the stress-induced NAD(P)H dehydratase YKL151C and its downstream neighbor, the metabolic enzyme GPM1. Previous work has implicated the region between these genes in regulation of GPM1 expression; our mutation experiments established its function in rich medium as a repressor in cis of the distally initiating YKL151C sense RNA, and an activator of the proximally initiating YKL151C antisense RNA. Wild-type expression of all three transcripts required the transcription factor Gcr2. Thus, at this locus, the intergenic region serves as a focal point of regulatory input, driving antisense expression and mediating the coordinated regulation of YKL151C and GPM1. Together, our findings implicate transcription factors in the joint control of neighboring genes specialized to opposing conditions and the antisense transcripts expressed between them.
Collapse
Affiliation(s)
- Yulia Mostovoy
- Department of Molecular and Cell Biology, University of California, Berkeley, California Present address: Cardiovascular Research Institute, University of California, San Francisco, CA
| | - Alexander Thiemicke
- Department of Molecular and Cell Biology, University of California, Berkeley, California Program in Molecular Medicine, Friedrich-Schiller-Universität, Jena, Germany Present address: Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, TN
| | - Tiffany Y Hsu
- Department of Molecular and Cell Biology, University of California, Berkeley, California Present address: Graduate Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA
| | - Rachel B Brem
- Department of Molecular and Cell Biology, University of California, Berkeley, California Present address: Buck Institute for Research on Aging, Novato, CA
| |
Collapse
|
18
|
Khoroshko VA, Levitsky VG, Zykova TY, Antonenko OV, Belyaeva ES, Zhimulev IF. Chromatin Heterogeneity and Distribution of Regulatory Elements in the Late-Replicating Intercalary Heterochromatin Domains of Drosophila melanogaster Chromosomes. PLoS One 2016; 11:e0157147. [PMID: 27300486 PMCID: PMC4907538 DOI: 10.1371/journal.pone.0157147] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2016] [Accepted: 05/25/2016] [Indexed: 12/28/2022] Open
Abstract
Late-replicating domains (intercalary heterochromatin) in the Drosophila genome display a number of features suggesting their organization is quite unique. Typically, they are quite large and encompass clusters of functionally unrelated tissue-specific genes. They correspond to the topologically associating domains and conserved microsynteny blocks. Our study aims at exploring further details of molecular organization of intercalary heterochromatin and has uncovered surprising heterogeneity of chromatin composition in these regions. Using the 4HMM model developed in our group earlier, intercalary heterochromatin regions were found to host chromatin fragments with a particular epigenetic profile. Aquamarine chromatin fragments (spanning 0.67% of late-replicating regions) are characterized as a class of sequences that appear heterogeneous in terms of their decompactization. These fragments are enriched with enhancer sequences and binding sites for insulator proteins. They likely mark the chromatin state that is related to the binding of cis-regulatory proteins. Malachite chromatin fragments (11% of late-replicating regions) appear to function as universal transitional regions between two contrasting chromatin states. Namely, they invariably delimit intercalary heterochromatin regions from the adjacent active chromatin of interbands. Malachite fragments also flank aquamarine fragments embedded in the repressed chromatin of late-replicating regions. Significant enrichment of insulator proteins CP190, SU(HW), and MOD2.2 was observed in malachite chromatin. Neither aquamarine nor malachite chromatin types appear to correlate with the positions of highly conserved non-coding elements (HCNE) that are typically replete in intercalary heterochromatin. Malachite chromatin found on the flanks of intercalary heterochromatin regions tends to replicate earlier than the malachite chromatin embedded in intercalary heterochromatin. In other words, there exists a gradient of replication progressing from the flanks of intercalary heterochromatin regions center-wise. The peculiar organization and features of replication in large late-replicating regions are discussed as possible factors shaping the evolutionary stability of intercalary heterochromatin.
Collapse
Affiliation(s)
| | - Viktor G. Levitsky
- Novosibirsk State University, Novosibirsk, Russia
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - Tatyana Yu. Zykova
- Institute of Molecular and Cellular Biology, SB RAS, Novosibirsk, Russia
| | | | - Elena S. Belyaeva
- Institute of Molecular and Cellular Biology, SB RAS, Novosibirsk, Russia
| | - Igor F. Zhimulev
- Institute of Molecular and Cellular Biology, SB RAS, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| |
Collapse
|
19
|
Liao BY, Weng MP. Functionalities of expressed messenger RNAs revealed from mutant phenotypes. Wiley Interdiscip Rev RNA 2016; 7:416-27. [PMID: 26748449 DOI: 10.1002/wrna.1329] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2015] [Revised: 11/23/2015] [Accepted: 12/02/2015] [Indexed: 12/11/2022]
Abstract
Total messenger RNAs mRNAs that are produced from a given gene under a certain set of conditions include both functional and nonfunctional transcripts. The high prevalence of nonfunctional mRNAs that have been detected in cells has raised questions regarding the functional implications of mRNA expression patterns and divergences. Phenotypes that result from the mutagenesis of protein-coding genes have provided the most straightforward descriptions of gene functions, and such data obtained from model organisms have facilitated investigations of the functionalities of expressed mRNAs. Mutant phenotype data from mouse tissues have revealed various attributes of functional mRNAs, including tissue-specificity, strength of expression, and evolutionary conservation. In addition, the role that mRNA expression evolution plays in driving morphological evolution has been revealed from studies designed to exploit morphological and physiological phenotypes of mouse mutants. Investigations into yeast essential genes (defined by an absence of colony growth after gene deletion) have further described gene regulatory strategies that reduce protein expression noise by mediating the rates of transcription and translation. In addition to the functional significance of expressed mRNAs as described in the abovementioned findings, the functionalities of other type of RNAs (i.e., noncoding RNAs) remain to be characterized with systematic mutations and phenotyping of the DNA regions that encode these RNA molecules. WIREs RNA 2016, 7:416-427. doi: 10.1002/wrna.1329 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Ben-Yang Liao
- Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Miaoli County, Taiwan, Republic of China
| | - Meng-Pin Weng
- Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Miaoli County, Taiwan, Republic of China
| |
Collapse
|
20
|
Diament A, Tuller T. Three-dimensional Genomic Organization of Genes’ Function in Eukaryotes. Evol Biol 2016. [DOI: 10.1007/978-3-319-41324-2_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
21
|
Wang D. DLGP: A database for lineage-conserved and lineage-specific gene pairs in animal and plant genomes. Biochem Biophys Res Commun 2015; 469:542-5. [PMID: 26697753 DOI: 10.1016/j.bbrc.2015.12.039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2015] [Accepted: 12/10/2015] [Indexed: 10/22/2022]
Abstract
The conservation of gene organization in the genome with lineage-specificity is an invaluable resource to decipher their potential functionality with diverse selective constraints, especially in higher animals and plants. Gene pairs appear to be the minimal structure for such kind of gene clusters that tend to reside in their preferred locations, representing the distinctive genomic characteristics in single species or a given lineage. Despite gene families having been investigated in a widespread manner, the definition of gene pair families in various taxa still lacks adequate attention. To address this issue, we report DLGP (http://lcgbase.big.ac.cn/DLGP/) that stores the pre-calculated lineage-based gene pairs in currently available 134 animal and plant genomes and inspect them under the same analytical framework, bringing out a set of innovational features. First, the taxonomy or lineage has been classified into four levels such as Kingdom, Phylum, Class and Order. It adopts all-to-all comparison strategy to identify the possible conserved gene pairs in all species for each gene pair in certain species and reckon those that are conserved in over a significant proportion of species in a given lineage (e.g. Primates, Diptera or Poales) as the lineage-conserved gene pairs. Furthermore, it predicts the lineage-specific gene pairs by retaining the above-mentioned lineage-conserved gene pairs that are not conserved in any other lineages. Second, it carries out pairwise comparison for the gene pairs between two compared species and creates the table including all the conserved gene pairs and the image elucidating the conservation degree of gene pairs in chromosomal level. Third, it supplies gene order browser to extend gene pairs to gene clusters, allowing users to view the evolution dynamics in the gene context in an intuitive manner. This database will be able to facilitate the particular comparison between animals and plants, between vertebrates and arthropods, and between monocots and eudicots, accounting for the significant contribution of gene pairs to speciation and diversification in specific lineages.
Collapse
Affiliation(s)
- Dapeng Wang
- Stem Cell Laboratory, UCL Cancer Institute, University College London, London WC1E 6BT, UK; CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, PR China.
| |
Collapse
|
22
|
Pazos Obregón F, Papalardo C, Castro S, Guerberoff G, Cantera R. Putative synaptic genes defined from a Drosophila whole body developmental transcriptome by a machine learning approach. BMC Genomics 2015; 16:694. [PMID: 26370122 PMCID: PMC4570697 DOI: 10.1186/s12864-015-1888-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2015] [Accepted: 09/01/2015] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND Assembly and function of neuronal synapses require the coordinated expression of a yet undetermined set of genes. Although roughly a thousand genes are expected to be important for this function in Drosophila melanogaster, just a few hundreds of them are known so far. RESULTS In this work we trained three learning algorithms to predict a "synaptic function" for genes of Drosophila using data from a whole-body developmental transcriptome published by others. Using statistical and biological criteria to analyze and combine the predictions, we obtained a gene catalogue that is highly enriched in genes of relevance for Drosophila synapse assembly and function but still not recognized as such. CONCLUSIONS The utility of our approach is that it reduces the number of genes to be tested through hypothesis-driven experimentation.
Collapse
Affiliation(s)
- Flavio Pazos Obregón
- Departamento de Biología del Neurodesarrollo, Instituto de Investigaciones Biológicas Clemente Estable, Avenida Italia 3318, PC 11600, Montevideo, Uruguay.
| | - Cecilia Papalardo
- Instituto de Matemática y Estadística "Prof. Ing. Rafael Laguardia", Facultad de Ingeniería, Universidad de la República, Montevideo, Uruguay.
| | - Sebastián Castro
- Instituto de Matemática y Estadística "Prof. Ing. Rafael Laguardia", Facultad de Ingeniería, Universidad de la República, Montevideo, Uruguay.
| | - Gustavo Guerberoff
- Instituto de Matemática y Estadística "Prof. Ing. Rafael Laguardia", Facultad de Ingeniería, Universidad de la República, Montevideo, Uruguay.
| | - Rafael Cantera
- Departamento de Biología del Neurodesarrollo, Instituto de Investigaciones Biológicas Clemente Estable, Avenida Italia 3318, PC 11600, Montevideo, Uruguay.
- Zoology Department, Stockholm University, Stockholm, Sweden.
| |
Collapse
|
23
|
Andrews T, Honti F, Pfundt R, de Leeuw N, Hehir-Kwa J, Vulto-van Silfhout A, de Vries B, Webber C. The clustering of functionally related genes contributes to CNV-mediated disease. Genome Res 2015; 25:802-13. [PMID: 25887030 DOI: 10.1101/gr.184325.114] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2014] [Accepted: 04/13/2015] [Indexed: 12/20/2022]
Abstract
Clusters of functionally related genes can be disrupted by a single copy number variant (CNV). We demonstrate that the simultaneous disruption of multiple functionally related genes is a frequent and significant characteristic of de novo CNVs in patients with developmental disorders (P = 1 × 10−3). Using three different functional networks, we identified unexpectedly large numbers of functionally related genes within de novo CNVs from two large independent cohorts of individuals with developmental disorders. The presence of multiple functionally related genes was a significant predictor of a CNV's pathogenicity when compared to CNVs from apparently healthy individuals and a better predictor than the presence of known disease or haploinsufficient genes for larger CNVs. The functionally related genes found in the de novo CNVs belonged to 70% of all clusters of functionally related genes found across the genome. De novo CNVs were more likely to affect functional clusters and affect them to a greater extent than benign CNVs (P = 6 × 10−4). Furthermore, such clusters of functionally related genes are phenotypically informative: Different patients possessing CNVs that affect the same cluster of functionally related genes exhibit more similar phenotypes than expected (P < 0.05). The spanning of multiple functionally similar genes by single CNVs contributes substantially to how these variants exert their pathogenic effects.
Collapse
|
24
|
Gossmann TI, Santure AW, Sheldon BC, Slate J, Zeng K. Highly variable recombinational landscape modulates efficacy of natural selection in birds. Genome Biol Evol 2015; 6:2061-75. [PMID: 25062920 PMCID: PMC4231635 DOI: 10.1093/gbe/evu157] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Determining the rate of protein evolution and identifying the causes of its variation across the genome are powerful ways to understand forces that are important for genome evolution. By using a multitissue transcriptome data set from great tit (Parus major), we analyzed patterns of molecular evolution between two passerine birds, great tit and zebra finch (Taeniopygia guttata), using the chicken genome (Gallus gallus) as an outgroup. We investigated whether a special feature of avian genomes, the highly variable recombinational landscape, modulates the efficacy of natural selection through the effects of Hill-Robertson interference, which predicts that selection should be more effective in removing deleterious mutations and incorporating beneficial mutations in high-recombination regions than in low-recombination regions. In agreement with these predictions, genes located in low-recombination regions tend to have a high proportion of neutrally evolving sites and relaxed selective constraint on sites subject to purifying selection, whereas genes that show strong support for past episodes of positive selection appear disproportionally in high-recombination regions. There is also evidence that genes located in high-recombination regions tend to have higher gene expression specificity than those located in low-recombination regions. Furthermore, more compact genes (i.e., those with fewer/shorter introns or shorter proteins) evolve faster than less compact ones. In sum, our results demonstrate that transcriptome sequencing is a powerful method to answer fundamental questions about genome evolution in nonmodel organisms.
Collapse
Affiliation(s)
- Toni I Gossmann
- Department of Animal and Plant Sciences, University of Sheffield, United Kingdom
| | - Anna W Santure
- Department of Animal and Plant Sciences, University of Sheffield, United KingdomSchool of Biological Sciences, University of Auckland, New Zealand
| | - Ben C Sheldon
- Edward Grey Institute, Department of Zoology, University of Oxford, United Kingdom
| | - Jon Slate
- Department of Animal and Plant Sciences, University of Sheffield, United Kingdom
| | - Kai Zeng
- Department of Animal and Plant Sciences, University of Sheffield, United Kingdom
| |
Collapse
|
25
|
Abstract
When considering the evolution of a gene’s expression profile, we commonly assume that this is unaffected by its genomic neighborhood. This is, however, in contrast to what we know about the lack of autonomy between neighboring genes in gene expression profiles in extant taxa. Indeed, in all eukaryotic genomes genes of similar expression-profile tend to cluster, reflecting chromatin level dynamics. Does it follow that if a gene increases expression in a particular lineage then the genomic neighbors will also increase in their expression or is gene expression evolution autonomous? To address this here we consider evolution of human gene expression since the human-chimp common ancestor, allowing for both variation in estimation of current expression level and error in Bayesian estimation of the ancestral state. We find that in all tissues and both sexes, the change in gene expression of a focal gene on average predicts the change in gene expression of neighbors. The effect is highly pronounced in the immediate vicinity (<100 kb) but extends much further. Sex-specific expression change is also genomically clustered. As genes increasing their expression in humans tend to avoid nuclear lamina domains and be enriched for the gene activator 5-hydroxymethylcytosine, we conclude that, most probably owing to chromatin level control of gene expression, a change in gene expression of one gene likely affects the expression evolution of neighbors, what we term expression piggybacking, an analog of hitchhiking.
Collapse
Affiliation(s)
- Avazeh T Ghanbarian
- Department of Biology and Biochemisty, University of Bath, Bath, United Kingdom
| | - Laurence D Hurst
- Department of Biology and Biochemisty, University of Bath, Bath, United Kingdom
| |
Collapse
|
26
|
Abstract
Animal olfactory systems have a critical role for the survival and reproduction of individuals. In insects, the odorant-binding proteins (OBPs) are encoded by a moderately sized gene family, and mediate the first steps of the olfactory processing. Most OBPs are organized in clusters of a few paralogs, which are conserved over time. Currently, the biological mechanism explaining the close physical proximity among OBPs is not yet established. Here, we conducted a comprehensive study aiming to gain insights into the mechanisms underlying the OBP genomic organization. We found that the OBP clusters are embedded within large conserved arrangements. These organizations also include other non-OBP genes, which often encode proteins integral to plasma membrane. Moreover, the conservation degree of such large clusters is related to the following: 1) the promoter architecture of the confined genes, 2) a characteristic transcriptional environment, and 3) the chromatin conformation of the chromosomal region. Our results suggest that chromatin domains may restrict the location of OBP genes to regions having the appropriate transcriptional environment, leading to the OBP cluster structure. However, the appropriate transcriptional environment for OBP and the other neighbor genes is not dominated by reduced levels of expression noise. Indeed, the stochastic fluctuations in the OBP transcript abundance may have a critical role in the combinatorial nature of the olfactory coding process.
Collapse
Affiliation(s)
- Pablo Librado
- Departament de Genètica and Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
| | | |
Collapse
|
27
|
Abstract
In Drosophila, chromosomes have been extensively reorganized during evolution, with most rearrangements affecting the gene order in chromosomal elements but not their gene content. The level of reorganization and the evidence for breakpoint reuse vary both between and within elements. The subito gene stands out as a gene involved in multiple rearrangements both because of its active single-gene transposition and because it is the nearest gene to diverse rearrangements breakpoints. Indeed, subito has undergone three single-gene transpositions and it is the nearest gene to the breakpoints of other single-gene transpositions and of two chromosomal inversions. Given that subito is involved in meiosis and therefore active in the female germ line, the high number of nearby fixed breakages might be related among others to the presumed high accessibility of the subito region to the machinery associated with double-strand breaks repair. A second important contributor would be the reduced and simple regulatory region of subito, which would imply that a fraction of the rearrangements originating from subito nearby breakages would have not affected either its pattern or timing of expression and would have, thus, not resulted in reduced fitness.
Collapse
Affiliation(s)
- Eva Puerma
- Departament de Genètica, Facultat de Biologia, i Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
| | - Dorcas J Orengo
- Departament de Genètica, Facultat de Biologia, i Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
| | - Montserrat Aguadé
- Departament de Genètica, Facultat de Biologia, i Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
| |
Collapse
|
28
|
Zhimulev IF, Zykova TY, Goncharov FP, Khoroshko VA, Demakova OV, Semeshin VF, Pokholkova GV, Boldyreva LV, Demidova DS, Babenko VN, Demakov SA, Belyaeva ES. Genetic organization of interphase chromosome bands and interbands in Drosophila melanogaster. PLoS One 2014; 9:e101631. [PMID: 25072930 PMCID: PMC4114487 DOI: 10.1371/journal.pone.0101631] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2014] [Accepted: 06/09/2014] [Indexed: 12/18/2022] Open
Abstract
Drosophila melanogaster polytene chromosomes display specific banding pattern; the underlying genetic organization of this pattern has remained elusive for many years. In the present paper, we analyze 32 cytology-mapped polytene chromosome interbands. We estimated molecular locations of these interbands, described their molecular and genetic organization and demonstrate that polytene chromosome interbands contain the 5' ends of housekeeping genes. As a rule, interbands display preferential "head-to-head" orientation of genes. They are enriched for "broad" class promoters characteristic of housekeeping genes and associate with open chromatin proteins and Origin Recognition Complex (ORC) components. In two regions, 10A and 100B, coding sequences of genes whose 5'-ends reside in interbands map to constantly loosely compacted, early-replicating, so-called "grey" bands. Comparison of expression patterns of genes mapping to late-replicating dense bands vs genes whose promoter regions map to interbands shows that the former are generally tissue-specific, whereas the latter are represented by ubiquitously active genes. Analysis of RNA-seq data (modENCODE-FlyBase) indicates that transcripts from interband-mapping genes are present in most tissues and cell lines studied, across most developmental stages and upon various treatment conditions. We developed a special algorithm to computationally process protein localization data generated by the modENCODE project and show that Drosophila genome has about 5700 sites that demonstrate all the features shared by the interbands cytologically mapped to date.
Collapse
Affiliation(s)
- Igor F. Zhimulev
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
- * E-mail:
| | - Tatyana Yu. Zykova
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Fyodor P. Goncharov
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Varvara A. Khoroshko
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Olga V. Demakova
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Valeriy F. Semeshin
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Galina V. Pokholkova
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Lidiya V. Boldyreva
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Darya S. Demidova
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| | - Vladimir N. Babenko
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Sergey A. Demakov
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Elena S. Belyaeva
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| |
Collapse
|
29
|
Makunin IV, Kolesnikova TD, Andreyenkova NG. Underreplicated regions in Drosophila melanogaster are enriched with fast-evolving genes and highly conserved noncoding sequences. Genome Biol Evol 2014; 6:2050-60. [PMID: 25062918 PMCID: PMC4159006 DOI: 10.1093/gbe/evu156] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Many late replicating regions are underreplicated in polytene chromosomes of Drosophila melanogaster. These regions contain silenced chromatin and overlap long syntenic blocks of conserved gene order in drosophilids. In this report we show that in D. melanogaster the underreplicated regions are enriched with fast-evolving genes lacking homologs in distant species such as mosquito or human, indicating that the phylogenetic conservation of genes correlates with replication timing and chromatin status. Drosophila genes without human homologs located in the underreplicated regions have higher nonsynonymous substitution rate and tend to encode shorter proteins when compared with those in the adjacent regions. At the same time, the underreplicated regions are enriched with ultraconserved elements and highly conserved noncoding sequences, especially in introns of very long genes indicating the presence of an extensive regulatory network that may be responsible for the conservation of gene order in these regions. The regions have a modest preference for long noncoding RNAs but are depleted for small nucleolar RNAs, microRNAs, and transfer RNAs. Our results demonstrate that the underreplicated regions have a specific genic composition and distinct pattern of evolution.
Collapse
Affiliation(s)
- Igor V Makunin
- Research Computing Centre, The University of Queensland, St Lucia, Queensland, AustraliaInstitute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Tatyana D Kolesnikova
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, RussiaNovosibirsk State University, Russia
| | - Natalya G Andreyenkova
- Institute of Molecular and Cellular Biology of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| |
Collapse
|
30
|
Abstract
The order of genes on eukaryotic chromosomes is nonrandom. Some neighboring genes show order conservation among species, while some neighboring genes separate during evolution. Here, we investigated whether neighboring genes show interactions after their separation. We found that neighboring gene pairs tend to show interchromosomal colocalization (i.e., nuclear colocalization) in the species in which they separate. These nuclear colocalized separated neighboring gene pairs 1) show neighborhood conservation in more species, 2) tend to be regulated by the same transcription factor, and 3) tend to be regulated by the same histone modification. These results suggest a mechanism by which neighboring genes could retain nuclear proximity after their separation.
Collapse
Affiliation(s)
- Zhiming Dai
- Department of Electronics and Communication Engineering, School of Information Science and Technology, Sun Yat-Sen University, Guangzhou, China
| | | | | |
Collapse
|
31
|
Andreyenkova NG, Kolesnikova TD, Makunin IV, Pokholkova GV, Boldyreva LV, Zykova TY, Zhimulev IF, Belyaeva ES. Late replication domains are evolutionary conserved in the Drosophila genome. PLoS One 2013; 8:e83319. [PMID: 24391753 PMCID: PMC3877026 DOI: 10.1371/journal.pone.0083319] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2013] [Accepted: 11/01/2013] [Indexed: 12/20/2022] Open
Abstract
Drosophila chromosomes are organized into distinct domains differing in their predominant chromatin composition, replication timing and evolutionary conservation. We show on a genome-wide level that genes whose order has remained unaltered across 9 Drosophila species display late replication timing and frequently map to the regions of repressive chromatin. This observation is consistent with the existence of extensive domains of repressive chromatin that replicate extremely late and have conserved gene order in the Drosophila genome. We suggest that such repressive chromatin domains correspond to a handful of regions that complete replication at the very end of S phase. We further demonstrate that the order of genes in these regions is rarely altered in evolution. Substantial proportion of such regions significantly coincide with large synteny blocks. This indicates that there are evolutionary mechanisms maintaining the integrity of these late-replicating chromatin domains. The synteny blocks corresponding to the extremely late-replicating regions in the D. melanogaster genome consistently display two-fold lower gene density across different Drosophila species.
Collapse
Affiliation(s)
- Natalya G. Andreyenkova
- Institute of Molecular and Cellular Biology, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Tatyana D. Kolesnikova
- Institute of Molecular and Cellular Biology, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Igor V. Makunin
- Institute of Molecular and Cellular Biology, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
- Research Computing Centre, The University of Queensland, Brisbane, St Lucia, QLD, Australia
| | - Galina V. Pokholkova
- Institute of Molecular and Cellular Biology, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Lidiya V. Boldyreva
- Institute of Molecular and Cellular Biology, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Tatyana Yu. Zykova
- Institute of Molecular and Cellular Biology, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Igor F. Zhimulev
- Institute of Molecular and Cellular Biology, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
- * E-mail:
| | - Elena S. Belyaeva
- Institute of Molecular and Cellular Biology, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| |
Collapse
|
32
|
Abstract
Background It is generally accepted that gene order in eukaryotes is nonrandom, with adjacent genes often sharing expression patterns across tissues, and that this organization may be important for gene regulation. Here we describe a novel method, based on an explicit probability model instead of correlation analysis, for identifying coordinately expressed gene clusters (‘coexpression segments’), apply it to Drosophila melanogaster, and look for epigenetic associations using publicly available data. Results We find that two-thirds of Drosophila genes fall into multigenic coexpression segments, and that such segments are of two main types, housekeeping and tissue-restricted. Consistent with correlation-based studies, we find that adjacent genes within the same segment tend to be physically closer to each other than to the adjacent genes in different segments, and that tissue-restricted segments are enriched for testis-expressed genes. Our segmentation pattern correlates with Hi-C based physical interaction domains, but segments are generally much smaller than domains. Intersegment regions (including those which do not correspond to physical domain boundaries) are enriched for insulator binding sites. Conclusions We describe a novel approach for identifying coexpression clusters that does not require arbitrary cutoff values or heuristics, and find that coexpression of adjacent genes is widespread in the Drosophila genome. Coexpression segments appear to reflect a level of regulatory organization related to, but below that of physical interaction domains, and depending in part on insulator binding.
Collapse
Affiliation(s)
- Alan F Rubin
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.
| | | |
Collapse
|
33
|
Adrian AB, Comeron JM. The Drosophila early ovarian transcriptome provides insight to the molecular causes of recombination rate variation across genomes. BMC Genomics 2013; 14:794. [PMID: 24228734 PMCID: PMC3840681 DOI: 10.1186/1471-2164-14-794] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2013] [Accepted: 11/06/2013] [Indexed: 11/10/2022] Open
Abstract
Background Evidence in yeast indicates that gene expression is correlated with recombination activity and double-strand break (DSB) formation in some hotspots. Studies of nucleosome occupancy in yeast and mice also suggest that open chromatin influences the formation of DSBs. In Drosophila melanogaster, high-resolution recombination maps show an excess of DSBs within annotated transcripts relative to intergenic sequences. The impact of active transcription on recombination landscapes, however, remains unexplored in a multicellular organism. We then investigated the transcription profile during early meiosis in D. melanogaster females to obtain a glimpse at the relevant transcriptional dynamics during DSB formation, and test the specific hypothesis that DSBs preferentially target transcriptionally active genomic regions. Results Our study of transcript profiles of early- and late-meiosis using mRNA-seq revealed, 1) significant differences in gene expression, 2) new genes and exons, 3) parent-of-origin effects on transcription in early-meiosis stages, and 4) a nonrandom genomic distribution of transcribed genes. Importantly, genomic regions that are more actively transcribed during early meiosis show higher rates of recombination, and we ruled out DSB preference for genic regions that are not transcribed. Conclusions Our results provide evidence in a multicellular organism that transcription during the initial phases of meiosis increases the likelihood of DSB and give insight into the molecular determinants of recombination rate variation across the D. melanogaster genome. We propose that a model where variation in gene expression plays a role altering the recombination landscape across the genome could provide a molecular, heritable and plastic mechanism to observed patterns of recombination variation, from the high level of intra-specific variation to the known influence of environmental factors and stress conditions.
Collapse
Affiliation(s)
| | - Josep M Comeron
- Department of Biology, University of Iowa, Iowa City, Iowa, USA.
| |
Collapse
|
34
|
Schauer T, Schwalie PC, Handley A, Margulies CE, Flicek P, Ladurner AG. CAST-ChIP maps cell-type-specific chromatin states in the Drosophila central nervous system. Cell Rep 2013; 5:271-82. [PMID: 24095734 DOI: 10.1016/j.celrep.2013.09.001] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2013] [Revised: 07/23/2013] [Accepted: 08/26/2013] [Indexed: 01/12/2023] Open
Abstract
Chromatin organization and gene activity are responsive to developmental and environmental cues. Although many genes are transcribed throughout development and across cell types, much of gene regulation is highly cell-type specific. To readily track chromatin features at the resolution of cell types within complex tissues, we developed and validated chromatin affinity purification from specific cell types by chromatin immunoprecipitation (CAST-ChIP), a broadly applicable biochemical procedure. RNA polymerase II (Pol II) CAST-ChIP identifies ~1,500 neuronal and glia-specific genes in differentiated cells within the adult Drosophila brain. In contrast, the histone H2A.Z is distributed similarly across cell types and throughout development, marking cell-type-invariant Pol II-bound regions. Our study identifies H2A.Z as an active chromatin signature that is refractory to changes across cell fates. Thus, CAST-ChIP powerfully identifies cell-type-specific as well as cell-type-invariant chromatin states, enabling the systematic dissection of chromatin structure and gene regulation within complex tissues such as the brain.
Collapse
Affiliation(s)
- Tamás Schauer
- Department of Physiological Chemistry, Butenandt Institute and LMU Biomedical Center, Ludwig Maximilians University of Munich, Butenandtstrasse 5, 81377 Munich, Germany; European Molecular Biology Laboratory International PhD Program, Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | | | | | | | | | | |
Collapse
|
35
|
Abstract
An intriguing association between the spatial layout of chromosomes within nuclei and the evolution of chromosome gene order was recently uncovered. Chromosome regions with conserved gene order in the Drosophila genus are larger if they interact with the inner side of the nuclear envelope in D. melanogaster somatic cells. This observation opens a new door to understand the evolution of chromosomes in the light of the dynamics of the spatial layout of chromosomes and the way double-strand breaks are repaired in D. melanogaster germ lines. Chromosome regions at the nuclear periphery in somatic cell nuclei relocate to more internal locations of male germ line cell nuclei, which might prefer a gene order-preserving mechanism to repair double-strand breaks. Conversely, chromosome regions at the nuclear periphery in somatic cells keep their location in female germ line cell nuclei, which might be inaccessible for cellular machinery that causes gene order-disrupting chromosome rearrangements. Thus, the gene order stability for genome regions at the periphery of somatic cell nuclei might result from the active repair of double-strand breaks using conservative mechanisms in male germ line cells, and the passive inaccessibility for gene order-disrupting factors at the periphery of nuclei of female germ line cells. In the present article, I find evidence consistent with a DNA break repair-based differential contribution of both D. melanogaster germ lines to the stability/disruption of gene order. The importance of germ line differences for the layout of chromosomes and DNA break repair strategies with regard to other genomic patterns is briefly discussed.
Collapse
|
36
|
Abstract
The systems genetics is an emerging discipline that integrates high-throughput expression profiling technology and systems biology approaches for revealing the molecular mechanism of complex traits, and will improve our understanding of gene functions in the biochemical pathway and genetic interactions between biological molecules. With the rapid advances of microarray analysis technologies, bioinformatics is extensively used in the studies of gene functions, SNP-SNP genetic interactions, LD block-block interactions, miRNA-mRNA interactions, DNA-protein interactions, protein-protein interactions, and functional mapping for LD blocks. Based on bioinformatics panel, which can integrate "-omics" datasets to extract systems knowledge and useful information for explaining the molecular mechanism of complex traits, systems genetics is all about to enhance our understanding of biological processes. Systems biology has provided systems level recognition of various biological phenomena, and constructed the scientific background for the development of systems genetics. In addition, the next-generation sequencing technology and post-genome wide association studies empower the discovery of new gene and rare variants. The integration of different strategies will help to propose novel hypothesis and perfect the theoretical framework of systems genetics, which will make contribution to the future development of systems genetics, and open up a whole new area of genetics.
Collapse
|
37
|
Lemay DG, Martin WF, Hinrichs AS, Rijnkels M, German JB, Korf I, Pollard KS. G-NEST: a gene neighborhood scoring tool to identify co-conserved, co-expressed genes. BMC Bioinformatics 2012; 13:253. [PMID: 23020263 PMCID: PMC3575404 DOI: 10.1186/1471-2105-13-253] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2012] [Accepted: 09/23/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In previous studies, gene neighborhoods-spatial clusters of co-expressed genes in the genome-have been defined using arbitrary rules such as requiring adjacency, a minimum number of genes, a fixed window size, or a minimum expression level. In the current study, we developed a Gene Neighborhood Scoring Tool (G-NEST) which combines genomic location, gene expression, and evolutionary sequence conservation data to score putative gene neighborhoods across all possible window sizes simultaneously. RESULTS Using G-NEST on atlases of mouse and human tissue expression data, we found that large neighborhoods of ten or more genes are extremely rare in mammalian genomes. When they do occur, neighborhoods are typically composed of families of related genes. Both the highest scoring and the largest neighborhoods in mammalian genomes are formed by tandem gene duplication. Mammalian gene neighborhoods contain highly and variably expressed genes. Co-localized noisy gene pairs exhibit lower evolutionary conservation of their adjacent genome locations, suggesting that their shared transcriptional background may be disadvantageous. Genes that are essential to mammalian survival and reproduction are less likely to occur in neighborhoods, although neighborhoods are enriched with genes that function in mitosis. We also found that gene orientation and protein-protein interactions are partially responsible for maintenance of gene neighborhoods. CONCLUSIONS Our experiments using G-NEST confirm that tandem gene duplication is the primary driver of non-random gene order in mammalian genomes. Non-essentiality, co-functionality, gene orientation, and protein-protein interactions are additional forces that maintain gene neighborhoods, especially those formed by tandem duplicates. We expect G-NEST to be useful for other applications such as the identification of core regulatory modules, common transcriptional backgrounds, and chromatin domains. The software is available at http://docpollard.org/software.html.
Collapse
Affiliation(s)
- Danielle G Lemay
- Genome Center, University of California Davis, 451 Health Science Dr, Davis, CA, 95616, United States of America.
| | | | | | | | | | | | | |
Collapse
|
38
|
Fasoli M, Dal Santo S, Zenoni S, Tornielli GB, Farina L, Zamboni A, Porceddu A, Venturini L, Bicego M, Murino V, Ferrarini A, Delledonne M, Pezzotti M. The grapevine expression atlas reveals a deep transcriptome shift driving the entire plant into a maturation program. Plant Cell 2012; 24:3489-505. [PMID: 22948079 PMCID: PMC3480284 DOI: 10.1105/tpc.112.100230] [Citation(s) in RCA: 147] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2012] [Revised: 07/06/2012] [Accepted: 08/20/2012] [Indexed: 05/18/2023]
Abstract
We developed a genome-wide transcriptomic atlas of grapevine (Vitis vinifera) based on 54 samples representing green and woody tissues and organs at different developmental stages as well as specialized tissues such as pollen and senescent leaves. Together, these samples expressed ∼91% of the predicted grapevine genes. Pollen and senescent leaves had unique transcriptomes reflecting their specialized functions and physiological status. However, microarray and RNA-seq analysis grouped all the other samples into two major classes based on maturity rather than organ identity, namely, the vegetative/green and mature/woody categories. This division represents a fundamental transcriptomic reprogramming during the maturation process and was highlighted by three statistical approaches identifying the transcriptional relationships among samples (correlation analysis), putative biomarkers (O2PLS-DA approach), and sets of strongly and consistently expressed genes that define groups (topics) of similar samples (biclustering analysis). Gene coexpression analysis indicated that the mature/woody developmental program results from the reiterative coactivation of pathways that are largely inactive in vegetative/green tissues, often involving the coregulation of clusters of neighboring genes and global regulation based on codon preference. This global transcriptomic reprogramming during maturation has not been observed in herbaceous annual species and may be a defining characteristic of perennial woody plants.
Collapse
Affiliation(s)
- Marianna Fasoli
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
| | - Silvia Dal Santo
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
| | - Sara Zenoni
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
| | | | - Lorenzo Farina
- Dipartimento di Informatica e Sistemistica Antonio Ruberti, Università degli Studi di Roma La Sapienza, 00185 Rome, Italy
| | - Anita Zamboni
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
| | - Andrea Porceddu
- Dipartimento di Scienze Agronomiche e Genetica Vegetale Agraria, Università degli Studi di Sassari, 07100 Sassari, Italy
| | - Luca Venturini
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
| | - Manuele Bicego
- Dipartimento di Informatica, Università degli Studi di Verona, 37134 Verona, Italy
| | | | - Alberto Ferrarini
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
| | - Massimo Delledonne
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
| | - Mario Pezzotti
- Dipartimento di Biotecnologie, Università degli Studi di Verona, 37134 Verona, Italy
- Address correspondence to
| |
Collapse
|
39
|
Yang J, Ramos E, Corces VG. The BEAF-32 insulator coordinates genome organization and function during the evolution of Drosophila species. Genome Res 2012; 22:2199-207. [PMID: 22895281 DOI: 10.1101/gr.142125.112] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Understanding the relationship between genome organization and expression is central to understanding genome function. Closely apposed genes in a head-to-head orientation share the same upstream region and are likely to be coregulated. Here we identify the Drosophila BEAF-32 insulator as a cis regulatory element separating close head-to-head genes with different transcription regulation modes. We then compare the binding landscapes of the BEAF-32 insulator protein in four different Drosophila genomes and highlight the evolutionarily conserved presence of this protein between close adjacent genes. We find that changes in binding of BEAF-32 to sites in the genome of different Drosophila species correlate with alterations in genome organization caused by DNA rearrangements or genome size expansion. The cross-talk between BEAF-32 genomic distribution and genome organization contributes to new gene-expression profiles, which in turn translate into specific and distinct phenotypes. The results suggest a mechanism for the establishment of differences in transcription patterns during evolution.
Collapse
|
40
|
Irimia M, Tena JJ, Alexis MS, Fernandez-Miñan A, Maeso I, Bogdanovic O, de la Calle-Mustienes E, Roy SW, Gómez-Skarmeta JL, Fraser HB. Extensive conservation of ancient microsynteny across metazoans due to cis-regulatory constraints. Genome Res 2012; 22:2356-67. [PMID: 22722344 PMCID: PMC3514665 DOI: 10.1101/gr.139725.112] [Citation(s) in RCA: 98] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
The order of genes in eukaryotic genomes has generally been assumed to be neutral, since gene order is largely scrambled over evolutionary time. Only a handful of exceptional examples are known, typically involving deeply conserved clusters of tandemly duplicated genes (e.g., Hox genes and histones). Here we report the first systematic survey of microsynteny conservation across metazoans, utilizing 17 genome sequences. We identified nearly 600 pairs of unrelated genes that have remained tightly physically linked in diverse lineages across over 600 million years of evolution. Integrating sequence conservation, gene expression data, gene function, epigenetic marks, and other genomic features, we provide extensive evidence that many conserved ancient linkages involve (1) the coordinated transcription of neighboring genes, or (2) genomic regulatory blocks (GRBs) in which transcriptional enhancers controlling developmental genes are contained within nearby bystander genes. In addition, we generated ChIP-seq data for key histone modifications in zebrafish embryos, which provided further evidence of putative GRBs in embryonic development. Finally, using chromosome conformation capture (3C) assays and stable transgenic experiments, we demonstrate that enhancers within bystander genes drive the expression of genes such as Otx and Islet, critical regulators of central nervous system development across bilaterians. These results suggest that ancient genomic functional associations are far more common than previously thought—involving ∼12% of the ancestral bilaterian genome—and that cis-regulatory constraints are crucial in determining metazoan genome architecture.
Collapse
Affiliation(s)
- Manuel Irimia
- Department of Biology, Stanford University, Stanford, California 94305, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Soshnev AA, He B, Baxley RM, Jiang N, Hart CM, Tan K, Geyer PK. Genome-wide studies of the multi-zinc finger Drosophila Suppressor of Hairy-wing protein in the ovary. Nucleic Acids Res 2012; 40:5415-31. [PMID: 22406832 PMCID: PMC3384341 DOI: 10.1093/nar/gks225] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
The Drosophila Suppressor of Hairy-wing [Su(Hw)] protein is a globally expressed, multi-zinc finger (ZnF) DNA-binding protein. Su(Hw) forms a classic insulator when bound to the gypsy retrotransposon and is essential for female germline development. These functions are genetically separable, as exemplified by Su(Hw)(f) that carries a defective ZnF10, causing a loss of insulator but not germline function. Here, we completed the first genome-wide analysis of Su(Hw)-binding sites (SBSs) in the ovary, showing that tissue-specific binding is not responsible for the restricted developmental requirements for Su(Hw). Mapping of ovary Su(Hw)(f) SBSs revealed that female fertility requires binding to only one third of the wild-type sites. We demonstrate that Su(Hw)(f) retention correlates with binding site affinity and partnership with Modifier of (mdg4) 67.2 protein. Finally, we identify clusters of co-regulated ovary genes flanked by Su(Hw)(f) bound sites and show that loss of Su(Hw) has limited effects on transcription of these genes. These data imply that the fertility function of Su(Hw) may not depend upon the demarcation of transcriptional domains. Our studies establish a framework for understanding the germline Su(Hw) function and provide insights into how chromatin occupancy is achieved by multi-ZnF proteins, the most common transcription factor class in metazoans.
Collapse
Affiliation(s)
- Alexey A Soshnev
- Interdisciplinary Program in Molecular and Cellular Biology, University of Iowa, Iowa City, IA 52242, USA
| | | | | | | | | | | | | |
Collapse
|
42
|
Liao BY, Chang AY. Mammalian genes preferentially co-retained in radiation hybrid panels tend to avoid coexpression. PLoS One 2012; 7:e32284. [PMID: 22384204 DOI: 10.1371/journal.pone.0032284] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2011] [Accepted: 01/24/2012] [Indexed: 11/24/2022] Open
Abstract
Coexpression has been frequently used to explore modules of functionally related genes in eukaryotic genomes. However, we found that genetically interacting mammalian genes identified through radiation hybrid (RH) genotypes tend not to be coexpressed across tissues. This pattern remained unchanged after controlling for potential confounding factors, including chromosomal linkage, chromosomal distance, and gene duplication. Because >99.9% of the genetically interacting genes were identified according to the higher co-retention frequencies, our observation implies that coexpression is not necessarily an indication of the need for the co-presence of two genes in the genome, which is a prerequisite for cofunctionality of their coding proteins in the cell. Therefore, coexpression information must be applied cautiously to the exploration of the functional relatedness of genes in a genome.
Collapse
|
43
|
Shah N, Dorer DR, Moriyama EN, Christensen AC. Evolution of a large, conserved, and syntenic gene family in insects. G3 (Bethesda) 2012; 2:313-9. [PMID: 22384409 DOI: 10.1534/g3.111.001412] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/13/2011] [Accepted: 12/16/2011] [Indexed: 11/18/2022]
Abstract
The Osiris gene family, first described in Drosophila melanogaster, is clustered in the genomes of all Drosophila species sequenced to date. In D. melanogaster, it explains the enigmatic phenomenon of the triplo-lethal and haploinsufficient locus Tpl. The synteny of Osiris genes in flies is well conserved, and it is one of the largest syntenic blocks in the Drosophila group. By examining the genome sequences of other insects in a wide range of taxonomic orders, we show here that the gene family is well-conserved and syntenic not only in the diptera but across the holometabolous and hemimetabolous insects. Osiris gene homologs have also been found in the expressed sequence tag sequences of various other insects but are absent from all groups that are not insects, including crustacea and arachnids. It is clear that the gene family evolved by gene duplication and neofunctionalization very soon after the divergence of the insects from other arthropods but before the divergence of the insects from one another and that the sequences and synteny have been maintained by selection ever since.
Collapse
|
44
|
Webster MT, Hurst LD. Direct and indirect consequences of meiotic recombination: implications for genome evolution. Trends Genet 2011; 28:101-9. [PMID: 22154475 DOI: 10.1016/j.tig.2011.11.002] [Citation(s) in RCA: 83] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2011] [Revised: 11/08/2011] [Accepted: 11/09/2011] [Indexed: 12/23/2022]
Abstract
There is considerable variation within eukaryotic genomes in the local rate of crossing over. Why is this and what effect does it have on genome evolution? On the genome scale, it is known that by shuffling alleles, recombination increases the efficacy of selection. By contrast, the extent to which differences in the recombination rate modulate the efficacy of selection between genomic regions is unclear. Recombination also has direct consequences on the origin and fate of mutations: biased gene conversion and other forms of meiotic drive promote the fixation of mutations in a similar way to selection, and recombination itself may be mutagenic. Consideration of both the direct and indirect effects of recombination is necessary to understand why its rate is so variable and for correct interpretation of patterns of genome evolution.
Collapse
Affiliation(s)
- Matthew T Webster
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden.
| | | |
Collapse
|
45
|
Feller C, Prestel M, Hartmann H, Straub T, Söding J, Becker PB. The MOF-containing NSL complex associates globally with housekeeping genes, but activates only a defined subset. Nucleic Acids Res 2011; 40:1509-22. [PMID: 22039099 PMCID: PMC3287193 DOI: 10.1093/nar/gkr869] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The MOF (males absent on the first)-containing NSL (non-specific lethal) complex binds to a subset of active promoters in Drosophila melanogaster and is thought to contribute to proper gene expression. The determinants that target NSL to specific promoters and the circumstances in which the complex engages in regulating transcription are currently unknown. Here, we show that the NSL complex primarily targets active promoters and in particular housekeeping genes, at which it colocalizes with the chromatin remodeler NURF (nucleosome remodeling factor) and the histone methyltransferase Trithorax. However, only a subset of housekeeping genes associated with NSL are actually activated by it. Our analyses reveal that these NSL-activated promoters are depleted of certain insulator binding proteins and are enriched for the core promoter motif ‘Ohler 5’. Based on these results, it is possible to predict whether the NSL complex is likely to regulate a particular promoter. We conclude that the regulatory capacity of the NSL complex is highly context-dependent. Activation by the NSL complex requires a particular promoter architecture defined by combinations of chromatin regulators and core promoter motifs.
Collapse
Affiliation(s)
- Christian Feller
- Adolf-Butenandt-Institute and Center for Integrated Protein Science of the Ludwig-Maximilians-University, Schillerstraße 44, 80336 München, Germany
| | | | | | | | | | | |
Collapse
|
46
|
|