1
|
Farberov L, Weissglas-Volkov D, Shapira G, Zoabi Y, Schiff C, Kloeckener-Gruissem B, Neidhardt J, Shomron N. mRNA splicing is modulated by intronic microRNAs. iScience 2023; 26:107723. [PMID: 37692287 PMCID: PMC10492213 DOI: 10.1016/j.isci.2023.107723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 08/06/2023] [Accepted: 08/22/2023] [Indexed: 09/12/2023] Open
Abstract
Splicing of transcripts is catalyzed by the spliceosome, a mega-complex consisting of hundreds of proteins and five snRNAs, which employs direct interactions. When U1 snRNA forms high-affinity binding, namely more than eight base pairs, with the 5'SS, the result is usually a suppressing effect on the splicing activity. This likely occurs due to the inefficient unwinding of U1/5'SS base-pairing or other regulatory obstructions. Here, we show in vitro and in patient-derived cell lines that pre-microRNAs can modulate the splicing reaction by interacting with U1 snRNA. This leads to reduced binding affinity to the 5'SS, and hence promotes the inclusion of exons containing 5'SS, despite sequence-based high affinity to U1. Application of the mechanism resulted in correction of the splicing defect in the disease-causing VCAN gene from an individual with Wagner syndrome. This pre-miRNA/U1 interaction can regulate the expression of alternatively spliced exons, thus extending the scope of mechanisms regulating splicing.
Collapse
Affiliation(s)
- Luba Farberov
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Daphna Weissglas-Volkov
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Edmond J. Safra Center for Bioinformatics, Tel Aviv University, Tel Aviv, Israel
| | - Guy Shapira
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Edmond J. Safra Center for Bioinformatics, Tel Aviv University, Tel Aviv, Israel
| | - Yazeed Zoabi
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Edmond J. Safra Center for Bioinformatics, Tel Aviv University, Tel Aviv, Israel
| | - Chen Schiff
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Barbara Kloeckener-Gruissem
- Institute of Medical Molecular Genetics, University of Zurich, Zurich, Switzerland
- Department of Biology, ETHZ, Zurich, Switzerland
| | - John Neidhardt
- Human Genetics, Faculty of Medicine and Health Sciences, University of Oldenburg, Germany
- Research Center Neurosensory Science, University Oldenburg, Germany
| | - Noam Shomron
- Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Edmond J. Safra Center for Bioinformatics, Tel Aviv University, Tel Aviv, Israel
- Sagol School of Neuroscience, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
2
|
Taylor TB, Shepherd MJ, Jackson RW, Silby MW. Natural selection on crosstalk between gene regulatory networks facilitates bacterial adaptation to novel environments. Curr Opin Microbiol 2022; 67:102140. [DOI: 10.1016/j.mib.2022.02.002] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Revised: 02/01/2022] [Accepted: 02/02/2022] [Indexed: 02/04/2023]
|
3
|
Denisov SV, Bazykin GA, Sutormin R, Favorov AV, Mironov AA, Gelfand MS, Kondrashov AS. Weak negative and positive selection and the drift load at splice sites. Genome Biol Evol 2014; 6:1437-47. [PMID: 24966225 PMCID: PMC4079205 DOI: 10.1093/gbe/evu100] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/05/2014] [Indexed: 11/30/2022] Open
Abstract
Splice sites (SSs) are short sequences that are crucial for proper mRNA splicing in eukaryotic cells, and therefore can be expected to be shaped by strong selection. Nevertheless, in mammals and in other intron-rich organisms, many of the SSs often involve nonconsensus (Nc), rather than consensus (Cn), nucleotides, and beyond the two critical nucleotides, the SSs are not perfectly conserved between species. Here, we compare the SS sequences between primates, and between Drosophila fruit flies, to reveal the pattern of selection acting at SSs. Cn-to-Nc substitutions are less frequent, and Nc-to-Cn substitutions are more frequent, than neutrally expected, indicating, respectively, negative and positive selection. This selection is relatively weak (1 < |4Nes| < 4), and has a similar efficiency in primates and in Drosophila. Within some nucleotide positions, the positive selection in favor of Nc-to-Cn substitutions is weaker than the negative selection maintaining already established Cn nucleotides; this difference is due to site-specific negative selection favoring current Nc nucleotides. In general, however, the strength of negative selection protecting the Cn alleles is similar in magnitude to the strength of positive selection favoring replacement of Nc alleles, as expected under the simple nearly neutral turnover. In summary, although a fraction of the Nc nucleotides within SSs is maintained by selection, the abundance of deleterious nucleotides in this class suggests a substantial genome-wide drift load.
Collapse
Affiliation(s)
- Stepan V Denisov
- A.A. Kharkevich Insitute for Information Transmission Problems RAS, Moscow, Russia
| | - Georgii A Bazykin
- A.A. Kharkevich Insitute for Information Transmission Problems RAS, Moscow, RussiaFaculty of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Moscow, Russia
| | - Roman Sutormin
- Faculty of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Moscow, Russia
| | - Alexander V Favorov
- Division of Oncology Biostatistics, The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, MDLaboratory of System Biology and Computational Genetics, Department of Computational System Biology, N.I. Vavilov Institute of General Genetics, Moscow, RussiaLaboratory of Bioinformatics, State Research Institute of Genetics and Selection of Industrial Microorganism (GosNIIGenetika), Moscow, Russia
| | - Andrey A Mironov
- A.A. Kharkevich Insitute for Information Transmission Problems RAS, Moscow, RussiaFaculty of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Moscow, Russia
| | - Mikhail S Gelfand
- A.A. Kharkevich Insitute for Information Transmission Problems RAS, Moscow, RussiaFaculty of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Moscow, Russia
| | - Alexey S Kondrashov
- Faculty of Bioengineering and Bioinformatics, M.V. Lomonosov Moscow State University, Moscow, RussiaLife Sciences Institute and Department of Ecology and Evolutionary Biology, University of Michigan
| |
Collapse
|
4
|
Cornish JP, Sanchez-Alberola N, O'Neill PK, O'Keefe R, Gheba J, Erill I. Characterization of the SOS meta-regulon in the human gut microbiome. ACTA ACUST UNITED AC 2014; 30:1193-7. [PMID: 24407225 PMCID: PMC3998124 DOI: 10.1093/bioinformatics/btt753] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
MOTIVATION Data from metagenomics projects remain largely untapped for the analysis of transcriptional regulatory networks. Here, we provide proof-of-concept that metagenomic data can be effectively leveraged to analyze regulatory networks by characterizing the SOS meta-regulon in the human gut microbiome. RESULTS We combine well-established in silico and in vitro techniques to mine the human gut microbiome data and determine the relative composition of the SOS network in a natural setting. Our analysis highlights the importance of translesion synthesis as a primary function of the SOS response. We predict the association of this network with three novel protein clusters involved in cell wall biogenesis, chromosome partitioning and restriction modification, and we confirm binding of the SOS response transcriptional repressor to sites in the promoter of a cell wall biogenesis enzyme, a phage integrase and a death-on-curing protein. We discuss the implications of these findings and the potential for this approach for metagenome analysis.
Collapse
Affiliation(s)
- Joseph P Cornish
- Department of Biological Sciences, University of Maryland Baltimore County (UMBC), Baltimore, MD 21250, USA
| | | | | | | | | | | |
Collapse
|
5
|
Abstract
We tested whether functionally important sites in bacterial, yeast, and animal promoters are more conserved than their neighbors. We found that substitutions are predominantly seen in less important sites and that those that occurred tended to have less impact on gene expression than possible alternatives. These results suggest that purifying selection operates on promoter sequences.
Collapse
|
6
|
Bazykin GA, Kochetov AV. Alternative translation start sites are conserved in eukaryotic genomes. Nucleic Acids Res 2010; 39:567-77. [PMID: 20864444 PMCID: PMC3025576 DOI: 10.1093/nar/gkq806] [Citation(s) in RCA: 81] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Alternative start AUG codons within a single transcript can contribute to diversity of the proteome; however, their functional significance remains controversial. Here, we provide comparative genomics evidence that alternative start codons are under negative selection in vertebrates, insects and yeast. In genes where the annotated start codon (sAUG) resides within the suboptimal nucleotide context, the downstream in-frame AUG codons (dAUG) among the first ∼30 codon sites are significantly more conserved between species than in genes where the sAUG resides within the optimal context. Proteomics data show that this difference is not an annotation artifact and that dAUGs are in fact under selection as alternative start sites. The key optimal, and sometimes suboptimal, context-determining nucleotides of both the sAUG and dAUGs are conserved. Selection for secondary start sites is stronger in genes with the weak primary start site. Genes with multiple conserved start sites are enriched for transcription factors, and tend to have longer 5'UTRs and higher degree of alternative splicing. Together, these results imply that the use of alternative start sites by means of leaky mRNA scanning is a functional mechanism under selection for increased efficiency of translation and/or for translation of different N-terminal protein variants.
Collapse
Affiliation(s)
- Georgii A Bazykin
- Institute for Information Transmission Problems of the Russian Academy of Sciences (Kharkevich Institute), Moscow, Russia.
| | | |
Collapse
|
7
|
Swamy KBS, Cho CY, Chiang S, Tsai ZTY, Tsai HK. Impact of DNA-binding position variants on yeast gene expression. Nucleic Acids Res 2010; 37:6991-7001. [PMID: 19767613 PMCID: PMC2790881 DOI: 10.1093/nar/gkp743] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Transcription factors (TFs) regulate gene expression by binding to specific binding sites (TFBSs) in gene promoters. TFBS motifs may contain one or more variable positions. Although the prevailing assumption is that nucleotide variants at such positions are functionally equivalent, there is increasing evidence that such variants play a role in regulation of gene expression. In this article, we propose a method for studying the relationship between the expression of target genes and nucleotide variants in TFBS motifs at a genome-wide scale in Saccharomyces cerevisiae, especially the combinatorial effects of variants at two positions. Our analysis shows that nucleotide variations in more than one-third of variable positions and in 20% of dependent position pairs are highly correlated to gene expression. We define such positions as 'functional'. However, some positions are only functional as dependent pairs, but not individually. In addition, a significant proportion of the functional positions have been well conserved across all yeast-related species studied. We also find that some positions require the presence of co-occurring TFs, while others maintain their functionality in the absence of a co-occurring TF. Our analysis supports the importance of nucleotide variants at variable positions of TFBSs in gene regulation.
Collapse
Affiliation(s)
- Krishna B S Swamy
- Institute of Information Science, National Yang-Ming University, Taiwan
| | | | | | | | | |
Collapse
|
8
|
Kulakovskiy IV, Favorov AV, Makeev VJ. Motif discovery and motif finding from genome-mapped DNase footprint data. ACTA ACUST UNITED AC 2009; 25:2318-25. [PMID: 19605419 DOI: 10.1093/bioinformatics/btp434] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Footprint data is an important source of information on transcription factor recognition motifs. However, a footprinting fragment can contain no sequences similar to known protein recognition sites. Inspection of genome fragments nearby can help to identify missing site positions. RESULTS Genome fragments containing footprints were supplied to a pipeline that constructed a position weight matrix (PWM) for different motif lengths and selected the optimal PWM. Fragments were aligned with the SeSiMCMC sampler and a new heuristic algorithm, Bigfoot. Footprints with missing hits were found for approximately 50% of factors. Adding only 2 bp on both sides of a footprinting fragment recovered most hits. We automatically constructed motifs for 41 Drosophila factors. New motifs can recognize footprints with a greater sensitivity at the same false positive rate than existing models. Also we discuss possible overfitting of constructed motifs. AVAILABILITY Software and the collection of regulatory motifs are freely available at http://line.imb.ac.ru/DMMPMM.
Collapse
Affiliation(s)
- Ivan V Kulakovskiy
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia.
| | | | | |
Collapse
|
9
|
Santos CL, Tavares F, Thioulouse J, Normand P. A phylogenomic analysis of bacterial helix-turn-helix transcription factors. FEMS Microbiol Rev 2008; 33:411-29. [PMID: 19076237 DOI: 10.1111/j.1574-6976.2008.00154.x] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
Perception by each individual organism of its environment's parameters is a key factor for survival. In a constantly changing environment, the ability to assess nutrient sources and potentially stressful situations constitutes the main basis for ecological adaptability. Transcription regulators are key decision-making proteins that mediate the communication between environmental conditions and DNA transcription through a multifaceted network. The parallel study of these regulators across microbial organisms adapted to contrasting biotopes constitutes an unexplored approach to understand the evolution of genome plasticity and cell function. We present here a reassessment of bacterial helix-turn-helix regulator diversity in different organisms from a multidisciplinary perspective, on the interface that links metabolism, ecology and phylogeny, further sustained by a statistically based approach. The present revision brought to light evidence of patterns among families of regulators, suggesting that multiple selective forces modulate the number and kind of regulators present in a given genome. Besides being an important step towards understanding the adaptive traits that influence the microbial responses to the varying environment on the very first and most prevalent line of reaction, the transcription of DNA, this approach is a promising tool to extract biological trends from genomic databases.
Collapse
Affiliation(s)
- Catarina L Santos
- Instituto de Biologia Molecular e Celular, Universidade do Porto, Porto, Portugal.
| | | | | | | |
Collapse
|
10
|
Dufour YS, Landick R, Donohue TJ. Organization and evolution of the biological response to singlet oxygen stress. J Mol Biol 2008; 383:713-30. [PMID: 18723027 DOI: 10.1016/j.jmb.2008.08.017] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2008] [Revised: 08/05/2008] [Accepted: 08/07/2008] [Indexed: 10/21/2022]
Abstract
The appearance of atmospheric oxygen from photosynthetic activity led to the evolution of aerobic respiration and responses to the resulting reactive oxygen species. In Rhodobacter sphaeroides, a photosynthetic alpha-proteobacterium, a transcriptional response to the reactive oxygen species singlet oxygen ((1)O(2)) is controlled by the group IV sigma factor sigma(E) and the anti-sigma factor ChrR. In this study, we integrated various large datasets to identify genes within the (1)O(2) stress response that contain sigma(E)-dependent promoters both within R. sphaeroides and across the bacterial phylogeny. Transcript pattern clustering and a sigma(E)-binding sequence model were used to predict candidate promoters that respond to (1)O(2) stress in R. sphaeroides. These candidate promoters were experimentally validated to nine R. sphaeroides sigma(E)-dependent promoters that control the transcription of 15 (1)O(2)-activated genes. Knowledge of the R. sphaeroides response to (1)O(2) and its regulator sigma(E)-ChrR was combined with large-scale phylogenetic and sequence analyses to predict the existence of a core set of approximately eight conserved sigma(E)-dependent genes in alpha-proteobacteria and gamma-proteobacteria. The bacteria predicted to contain this conserved response to (1)O(2) include photosynthetic species, as well as free-living and symbiotic/pathogenic nonphotosynthetic species. Our analysis also predicts that the response to (1)O(2) evolved within the time frame of the accumulation of atmospheric molecular oxygen on this planet.
Collapse
Affiliation(s)
- Yann S Dufour
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI 53706, USA
| | | | | |
Collapse
|
11
|
Perera A, Vallverdu M, Claria F, Soria JM, Caminal P. DNA binding site characterization by means of Rényi entropy measures on nucleotide transitions. IEEE Trans Nanobioscience 2008; 7:133-41. [PMID: 18556261 DOI: 10.1109/tnb.2008.2000744] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
In this work, parametric information-theory measures for the characterization of binding sites in DNA are extended with the use of transitional probabilities on the sequence. We propose the use of parametric uncertainty measures such as Rényi entropies obtained from the transition probabilities for the study of the binding sites, in addition to nucleotide frequency-based Rényi measures. Results are reported in this work comparing transition frequencies (i.e., dinucleotides) and base frequencies for Shannon and parametric Rényi entropies for a number of binding sites found in E. Coli, lambda and T7 organisms. We observe that the information provided by both approaches is not redundant. Furthermore, under the presence of noise in the binding site matrix we observe overall improved robustness of nucleotide transition-based algorithms when compared with nucleotide frequency-based method.
Collapse
Affiliation(s)
- A Perera
- Centre for Biomedical Engineering Research, Technical University of Catalonia, Barcelona, Spain.
| | | | | | | | | |
Collapse
|
12
|
Bais AS, Grossmann S, Vingron M. Incorporating evolution of transcription factor binding sites into annotated alignments. J Biosci 2007; 32:841-50. [PMID: 17914226 DOI: 10.1007/s12038-007-0084-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Identifying transcription factor binding sites (TFBSs) is essential to elucidate putative regulatory mechanisms. A common strategy is to combine cross-species conservation with single sequence TFBS annotation to yield "conserved TFBSs". Most current methods in this field adopt a multi-step approach that segregates the two aspects. Again, it is widely accepted that the evolutionary dynamics of binding sites differ from those of the surrounding sequence. Hence, it is desirable to have an approach that explicitly takes this factor into account. Although a plethora of approaches have been proposed for the prediction of conserved TFBSs, very few explicitly model TFBS evolutionary properties, while additionally being multi-step. Recently, we introduced a novel approach to simultaneously align and annotate conserved TFBSs in a pair of sequences. Building upon the standard Smith-Waterman algorithm for local alignments, SimAnn introduces additional states for profiles to output extended alignments or annotated alignments. That is, alignments with parts annotated as gaplessly aligned TFBSs (pair-profile hits)are generated. Moreover,the pair- profile related parameters are derived in a sound statistical framework. In this article, we extend this approach to explicitly incorporate evolution of binding sites in the SimAnn framework. We demonstrate the extension in the theoretical derivations through two position-specific evolutionary models, previously used for modelling TFBS evolution. In a simulated setting, we provide a proof of concept that the approach works given the underlying assumptions,as compared to the original work. Finally, using a real dataset of experimentally verified binding sites in human-mouse sequence pairs,we compare the new approach (eSimAnn) to an existing multi-step tool that also considers TFBS evolution. Although it is widely accepted that binding sites evolve differently from the surrounding sequences, most comparative TFBS identification methods do not explicitly consider this.Additionally, prediction of conserved binding sites is carried out in a multi-step approach that segregates alignment from TFBS annotation. In this paper, we demonstrate how the simultaneous alignment and annotation approach of SimAnn can be further extended to incorporate TFBS evolutionary relationships. We study how alignments and binding site predictions interplay at varying evolutionary distances and for various profile qualities.
Collapse
Affiliation(s)
- Abha S Bais
- Max Planck Institute for Molecular Genetics, Berlin, Germany.
| | | | | |
Collapse
|
13
|
Gorbunov KY, Lyubetsky VA. Reconstruction of ancestral regulatory signals along a transcription factor tree. Mol Biol 2007. [DOI: 10.1134/s0026893307050172] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
14
|
Enikeeva FN, Kotelnikova EA, Gelfand MS, Makeev VJ. A model of evolution with constant selective pressure for regulatory DNA sites. BMC Evol Biol 2007; 7:125. [PMID: 17662135 PMCID: PMC1978210 DOI: 10.1186/1471-2148-7-125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2007] [Accepted: 07/27/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Molecular evolution is usually described assuming a neutral or weakly non-neutral substitution model. Recently, new data have become available on evolution of sequence regions under a selective pressure, e.g. transcription factor binding sites. To reconstruct the evolutionary history of such sequences, one needs evolutionary models that take into account a substantial constant selective pressure. RESULTS We present a simple evolutionary model with a single preferred (consensus) nucleotide and the neutral substitution model adopted for all other nucleotides. This evolutionary model has a rate matrix in which all substitutions that do not involve the consensus nucleotide occur with the same rate. The model has two time scales for achieving a stationary distribution; in the general case only one of the two rate parameters can be evaluated from the stationary distribution. In the middle-time zone, a counterintuitive behavior was observed for some parameter values, with a probability of conservation for a non-consensus nucleotide greater than that for the consensus nucleotide. Such an effect can be observed only in the case of weak preference for the consensus nucleotide, when the probability to observe the consensus nucleotide in the stationary distribution is less than 1/2. If the substitution rate is represented as a product of mutation and fixation, only the fixation can be calculated from the stationary distribution. The exhibited conservation of non-consensus nucleotides does not take place if the elements of mutation matrix are identical, and can be related to the reduced mutation rate between the non-consensus nucleotides. This bias can have no effect on the stationary distribution of nucleotide frequencies calculated over the ensemble of multiple alignments, e.g. transcription factor binding sites upstream of different sets of co-regulated orthologous genes. CONCLUSION The derived model can be used as a null model when analyzing the evolution of orthologous transcription factor binding sites. In particular, our findings show that a nucleotide preferred at some position of a multiple alignment of binding sites for some transcription factor in the same genome is not necessarily the most conserved nucleotide in an alignment of orthologous sites from different species. However, this effect can take place only in the case of a mutation matrix whose elements are not identical.
Collapse
Affiliation(s)
- Farida N Enikeeva
- Institute for Information Transmission Problems (the Kharkevich Institute) of RAS, Bolshoi Karetny pereulok, 19, GSP-4, Moscow, 127994, Russia
| | - Ekaterina A Kotelnikova
- State Research Institute of Genetics and Selection of Industrial Microorganisms, 1st Dorozhnyj proezd, 1, Moscow, 113535, Russia
- Ariadne Genomics Inc. 9700 Great Seneca Highway, Suite 113, Rockville, MD 20850, USA
| | - Mikhail S Gelfand
- Institute for Information Transmission Problems (the Kharkevich Institute) of RAS, Bolshoi Karetny pereulok, 19, GSP-4, Moscow, 127994, Russia
- Faculty of Bioengineering and Bioinformatics, Moscow State University, Vorobyevy Gory 1-73, Moscow, 119992, Russia
| | - Vsevolod J Makeev
- State Research Institute of Genetics and Selection of Industrial Microorganisms, 1st Dorozhnyj proezd, 1, Moscow, 113535, Russia
- Engelgardt Institute of Molecular Biology of RAS, Vavilova 32, Moscow, 119991, Russia
| |
Collapse
|
15
|
Gelfand MS. Evolution of transcriptional regulatory networks in microbial genomes. Curr Opin Struct Biol 2006; 16:420-9. [PMID: 16650982 DOI: 10.1016/j.sbi.2006.04.001] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2006] [Revised: 02/08/2006] [Accepted: 04/18/2006] [Indexed: 12/23/2022]
Abstract
Advances in sequencing and generating high-throughput expression data have created a situation in which it is possible to integrate comparative analysis with genome-wide studies of the structure and function of regulatory systems in model organisms. Recent studies have focused on topological properties and the evolution of regulatory networks. This problem can be addressed on several levels: evolution of binding sites upstream of orthologous or duplicated genes; co-evolution of transcription factors and the DNA motifs that they recognize; expansion, contraction and replacement of regulatory systems; the relationship between co-regulation and co-expression; and, finally, construction of evolutionary models that generate networks with realistic properties. This should eventually lead to the creation of a theory of regulatory evolution with a similar level of detail and understanding to the theory of molecular evolution of protein and DNA sequences.
Collapse
Affiliation(s)
- Mikhail S Gelfand
- Institute for Information Transmission Problems, RAS, Bolshoi Karetny pereulok 19, Moscow, 127994, Russia.
| |
Collapse
|
16
|
Boeva V, Regnier M, Papatsenko D, Makeev V. Short fuzzy tandem repeats in genomic sequences, identification, and possible role in regulation of gene expression. Bioinformatics 2006; 22:676-84. [PMID: 16403795 DOI: 10.1093/bioinformatics/btk032] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
MOTIVATION Genomic sequences are highly redundant and contain many types of repetitive DNA. Fuzzy tandem repeats (FTRs) are of particular interest. They are found in regulatory regions of eukaryotic genes and are reported to interact with transcription factors. However, accurate assessment of FTR occurrences in different genome segments requires specific algorithm for efficient FTR identification and classification. RESULTS We have obtained formulas for P-values of FTR occurrence and developed an FTR identification algorithm implemented in TandemSWAN software. Using TandemSWAN we compared the structure and the occurrence of FTRs with short period length (up to 24 bp) in coding and non-coding regions including UTRs, heterochromatic, intergenic and enhancer sequences of Drosophila melanogaster and Drosophila pseudoobscura. Tandems with period three and its multiples were found in coding segments, whereas FTRs with periods multiple of six are overrepresented in all non-coding segment. Periods equal to 5-7 and 11-14 were characteristic of the enhancer regions and other non-coding regions close to genes. AVAILABILITY TandemSWAN web page, stand-alone version and documentation can be found at http://bioinform.genetika.ru/projects/swan/www/ SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Valentina Boeva
- Department of Bioengineering and Bioinformatics, Moscow State University Moscow, Russia.
| | | | | | | |
Collapse
|