1
|
Brasher TC, de Bortoli L, Toledo-Silva G, Zacchi FL, Razzera G. In silico Structural Features of the CgNR5A: CgDAX Complex and Its Role in Regulating Gene Expression of CYP Target Genes in Crassostrea gigas. CHEMOSPHERE 2024; 361:142443. [PMID: 38815811 DOI: 10.1016/j.chemosphere.2024.142443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 05/20/2024] [Accepted: 05/24/2024] [Indexed: 06/01/2024]
Abstract
Contamination of aquatic environments has been steadily increasing due to human activities. The Pacific oyster Crassostrea gigas has been used as a key species in studies assessing the impacts of contaminants on human health and the aquatic biome. In this context, cytochrome P450 (CYPs) play a crucial role in xenobiotic metabolism. In vertebrates many of these CYPs are regulated by nuclear receptors (NRs) and little is known about the NRs role in C. gigas. Particularly, the CgNR5A represents a homolog of SF1 and LRH-1 found in vertebrates. Members of this group can regulate genes of CYPs involved in lipid/steroid metabolism, with their activity regulated by other NR, called as DAX-1, generating a NR complex on DNA response elements (REs). As C. gigas does not exhibit steroid biosynthesis pathways, CgNR5A may play other physiological roles. To clarify this issue, we conducted an in silico investigation of the interaction between CgNR5A and DNA to identify potential C. gigas CYP target genes. Using molecular docking and dynamics simulations of the CgNR5A on DNA molecules, we identified a monomeric interaction with extended REs. This RE was found in the promoter region of 30 CYP genes and also the NR CgDAX. When the upstream regulatory region was analyzed, CYP2C39, CYP3A11, CYP4C21, CYP7A1, CYP17A1, and CYP27C1 were mapped as the main genes regulated by CgNR5A. These identified CYPs belong to families known for their involvement in xenobiotic and lipid/steroid metabolism. Furthermore, we reconstructed a trimeric complex, previously proposed for vertebrates, with CgNR5A:CgDAX and subjected it to molecular dynamics simulations analysis. Heterotrimeric complex remained stable during the simulations, suggesting that CgDAX may modulate CgNR5A transcriptional activity. This study provides insights into the potential physiological processes involving these NRs in the regulation of CYPs associated with xenobiotic and steroid/lipid metabolism.
Collapse
Affiliation(s)
- Theo Cardozo Brasher
- Programa de Pós-Graduação em Bioquímica, Departamento de Bioquímica, Universidade Federal de Santa Catarina, Florianópolis, SC, 88040-900, Brazil; Laboratório de Biomarcadores de Contaminação Aquática e Imunoquímica, Universidade Federal de Santa Catarina, Florianópolis, SC, 88040-900, Brazil
| | - Leonardo de Bortoli
- Programa de Pós-Graduação em Bioquímica, Departamento de Bioquímica, Universidade Federal de Santa Catarina, Florianópolis, SC, 88040-900, Brazil; Laboratório de Biomarcadores de Contaminação Aquática e Imunoquímica, Universidade Federal de Santa Catarina, Florianópolis, SC, 88040-900, Brazil; Laboratório de Genômica, Universidade Federal de Santa Catarina, Florianópolis, SC, 88040-900, Brazil
| | - Guilherme Toledo-Silva
- Laboratório de Genômica, Universidade Federal de Santa Catarina, Florianópolis, SC, 88040-900, Brazil
| | - Flávia Lucena Zacchi
- Laboratório de Moluscos Marinhos, Universidade Federal de Santa Catarina, Florianópolis, SC, 88061-600, Brazil
| | - Guilherme Razzera
- Programa de Pós-Graduação em Bioquímica, Departamento de Bioquímica, Universidade Federal de Santa Catarina, Florianópolis, SC, 88040-900, Brazil; Laboratório de Biomarcadores de Contaminação Aquática e Imunoquímica, Universidade Federal de Santa Catarina, Florianópolis, SC, 88040-900, Brazil.
| |
Collapse
|
2
|
Tsukanov AV, Mironova VV, Levitsky VG. Motif models proposing independent and interdependent impacts of nucleotides are related to high and low affinity transcription factor binding sites in Arabidopsis. FRONTIERS IN PLANT SCIENCE 2022; 13:938545. [PMID: 35968123 PMCID: PMC9373801 DOI: 10.3389/fpls.2022.938545] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2022] [Accepted: 07/05/2022] [Indexed: 05/15/2023]
Abstract
Position weight matrix (PWM) is the traditional motif model representing the transcription factor (TF) binding sites. It proposes that the positions contribute independently to TFs binding affinity, although this hypothesis does not fit the data perfectly. This explains why PWM hits are missing in a substantial fraction of ChIP-seq peaks. To study various modes of the direct binding of plant TFs, we compiled the benchmark collection of 111 ChIP-seq datasets for Arabidopsis thaliana, and applied the traditional PWM, and two alternative motif models BaMM and SiteGA, proposing the dependencies of the positions. The variation in the stringency of the recognition thresholds for the models proposed that the hits of PWM, BaMM, and SiteGA models are associated with the sites of high/medium, any, and low affinity, respectively. At the medium recognition threshold, about 60% of ChIP-seq peaks contain PWM hits consisting of conserved core consensuses, while BaMM and SiteGA provide hits for an additional 15% of peaks in which a weaker core consensus is compensated through intra-motif dependencies. The presence/absence of these dependencies in the motifs of alternative/traditional models was confirmed by the dependency logo DepLogo visualizing the position-wise partitioning of the alignments of predicted sites. We exemplify the detailed analysis of ChIP-seq profiles for plant TFs CCA1, MYC2, and SEP3. Gene ontology (GO) enrichment analysis revealed that among the three motif models, the SiteGA had the highest portions of genes with the significantly enriched GO terms among all predicted genes. We showed that both alternative motif models provide for traditional PWM greater extensions in predicted sites for TFs MYC2/SEP3 with condition/tissue specific functions, compared to those for TF CCA1 with housekeeping functions. Overall, the combined application of standard and alternative motif models is beneficial to detect various modes of the direct TF-DNA interactions in the maximal portion of ChIP-seq loci.
Collapse
Affiliation(s)
- Anton V. Tsukanov
- Department of Systems Biology, Institute of Cytology and Genetics, Novosibirsk, Russia
| | - Victoria V. Mironova
- Department of Systems Biology, Institute of Cytology and Genetics, Novosibirsk, Russia
- Department of Plant Systems Physiology, Radboud Institute for Biological and Environmental Sciences (RIBES), Radboud University, Nijmegen, Netherlands
| | - Victor G. Levitsky
- Department of Systems Biology, Institute of Cytology and Genetics, Novosibirsk, Russia
- Department of Natural Science, Novosibirsk State University, Novosibirsk, Russia
- *Correspondence: Victor G. Levitsky
| |
Collapse
|
3
|
Tsukanov AV, Levitsky VG, Merkulova TI. Application of alternative de novo motif recognition models for analysis of structural heterogeneity of transcription factor binding sites: a case study of FOXA2 binding sites. Vavilovskii Zhurnal Genet Selektsii 2021; 25:7. [PMID: 34547062 PMCID: PMC8408018 DOI: 10.18699/vj21.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2020] [Revised: 01/10/2021] [Accepted: 01/12/2021] [Indexed: 11/24/2022] Open
Abstract
The most popular model for the search of ChIP-seq data for transcription factor binding sites (TFBS)
is the positional weight matrix (PWM). However, this model does not take into account dependencies between
nucleotide occurrences in different site positions. Currently, two recently proposed models, BaMM and InMoDe,
can do as much. However, application of these models was usually limited only to comparing their recognition
accuracies with that of PWMs, while none of the analyses of the co-prediction and relative positioning of hits of different models in peaks has yet been performed. To close this gap, we propose the pipeline called MultiDeNA. This
pipeline includes stages of model training, assessing their recognition accuracy, scanning ChIP-seq peaks and their
classification based on scan results. We applied our pipeline to 22 ChIP-seq datasets of TF FOXA2 and considered
PWM, dinucleotide PWM (diPWM), BaMM and InMoDe models. The combination of these four models allowed a
significant increase in the fraction of recognized peaks compared to that for the sole PWM model: the increase was
26.3 %. The BaMM model provided the main contribution to the recognition of sites. Although the major fraction of
predicted peaks contained TFBS of different models with coincided positions, the medians of the fraction of peaks
containing the predictions of sole models were 1.08, 0.49, 4.15 and 1.73 % for PWM, diPWM, BaMM and InMoDe,
respectively. Thus, FOXA2 BSs were not fully described by only a sole model, which indicates theirs heterogeneity.
We assume that the BaMM model is the most successful in describing the structure of the FOXA2 BS in ChIP-seq
datasets under study.
Collapse
Affiliation(s)
- A V Tsukanov
- Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - V G Levitsky
- Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia Novosibirsk State University, Novosibirsk, Russia
| | - T I Merkulova
- Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia Novosibirsk State University, Novosibirsk, Russia
| |
Collapse
|
4
|
Degtyareva AO, Antontseva EV, Merkulova TI. Regulatory SNPs: Altered Transcription Factor Binding Sites Implicated in Complex Traits and Diseases. Int J Mol Sci 2021; 22:6454. [PMID: 34208629 PMCID: PMC8235176 DOI: 10.3390/ijms22126454] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Revised: 06/15/2021] [Accepted: 06/15/2021] [Indexed: 12/19/2022] Open
Abstract
The vast majority of the genetic variants (mainly SNPs) associated with various human traits and diseases map to a noncoding part of the genome and are enriched in its regulatory compartment, suggesting that many causal variants may affect gene expression. The leading mechanism of action of these SNPs consists in the alterations in the transcription factor binding via creation or disruption of transcription factor binding sites (TFBSs) or some change in the affinity of these regulatory proteins to their cognate sites. In this review, we first focus on the history of the discovery of regulatory SNPs (rSNPs) and systematized description of the existing methodical approaches to their study. Then, we brief the recent comprehensive examples of rSNPs studied from the discovery of the changes in the TFBS sequence as a result of a nucleotide substitution to identification of its effect on the target gene expression and, eventually, to phenotype. We also describe state-of-the-art genome-wide approaches to identification of regulatory variants, including both making molecular sense of genome-wide association studies (GWAS) and the alternative approaches the primary goal of which is to determine the functionality of genetic variants. Among these approaches, special attention is paid to expression quantitative trait loci (eQTLs) analysis and the search for allele-specific events in RNA-seq (ASE events) as well as in ChIP-seq, DNase-seq, and ATAC-seq (ASB events) data.
Collapse
Affiliation(s)
- Arina O. Degtyareva
- Department of Molecular Genetic, Institute of Cytology and Genetics, 630090 Novosibirsk, Russia; (A.O.D.); (E.V.A.)
| | - Elena V. Antontseva
- Department of Molecular Genetic, Institute of Cytology and Genetics, 630090 Novosibirsk, Russia; (A.O.D.); (E.V.A.)
| | - Tatiana I. Merkulova
- Department of Molecular Genetic, Institute of Cytology and Genetics, 630090 Novosibirsk, Russia; (A.O.D.); (E.V.A.)
- Department of Natural Sciences, Novosibirsk State University, 630090 Novosibirsk, Russia
| |
Collapse
|
5
|
Ignatieva EV, Igoshin AV, Yudin NS. A database of human genes and a gene network involved in response to tick-borne encephalitis virus infection. BMC Evol Biol 2017; 17:259. [PMID: 29297316 PMCID: PMC5751789 DOI: 10.1186/s12862-017-1107-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2023] Open
Abstract
BACKGROUND Tick-borne encephalitis is caused by the neurotropic, positive-sense RNA virus, tick-borne encephalitis virus (TBEV). TBEV infection can lead to a variety of clinical manifestations ranging from slight fever to severe neurological illness. Very little is known about genetic factors predisposing to severe forms of disease caused by TBEV. The aims of the study were to compile a catalog of human genes involved in response to TBEV infection and to rank genes from the catalog based on the number of neighbors in the network of pairwise interactions involving these genes and TBEV RNA or proteins. RESULTS Based on manual review and curation of scientific publications a catalog comprising 140 human genes involved in response to TBEV infection was developed. To provide access to data on all genes, the TBEVhostDB web resource ( http://icg.nsc.ru/TBEVHostDB/ ) was created. We reconstructed a network formed by pairwise interactions between TBEV virion itself, viral RNA and viral proteins and 140 genes/proteins from TBEVHostDB. Genes were ranked according to the number of interactions in the network. Two genes/proteins (CCR5 and IFNAR1) that had maximal number of interactions were revealed. It was found that the subnetworks formed by CCR5 and IFNAR1 and their neighbors were a fragments of two key pathways functioning during the course of tick-borne encephalitis: (1) the attenuation of interferon-I signaling pathway by the TBEV NS5 protein that targeted peptidase D; (2) proinflammation and tissue damage pathway triggered by chemokine receptor CCR5 interacting with CD4, CCL3, CCL4, CCL2. Among nine genes associated with severe forms of TBEV infection, three genes/proteins (CCR5, IL10, ARID1B) were found to have protein-protein interactions within the network, and two genes/proteins (IFNL3 and the IL10, that was just mentioned) were up- or down-regulated in response to TBEV infection. Based on this finding, potential mechanisms for participation of CCR5, IL10, ARID1B, and IFNL3 in the host response to TBEV infection were suggested. CONCLUSIONS A database comprising 140 human genes involved in response to TBEV infection was compiled and the TBEVHostDB web resource, providing access to all genes was created. This is the first effort of integrating and unifying data on genetic factors that may predispose to severe forms of diseases caused by TBEV. The TBEVHostDB could potentially be used for assessment of risk factors for severe forms of tick-borne encephalitis and for the design of personalized pharmacological strategies for the treatment of TBEV infection.
Collapse
Affiliation(s)
- Elena V Ignatieva
- Laboratory of Evolutionary Bioinformatics and Theoretical Genetics, The Federal Research Center Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia. .,Center for Brain Neurobiology and Neurogenetics, The Federal Research Center Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia. .,Novosibirsk State University, Novosibirsk, 630090, Russia.
| | - Alexander V Igoshin
- Laboratory of Infectious Disease Genomics, The Federal Research Center Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia
| | - Nikolay S Yudin
- Laboratory of Infectious Disease Genomics, The Federal Research Center Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia.,Novosibirsk State University, Novosibirsk, 630090, Russia
| |
Collapse
|
6
|
Yudin NS, Larkin DM, Ignatieva EV. A compendium and functional characterization of mammalian genes involved in adaptation to Arctic or Antarctic environments. BMC Genet 2017; 18:111. [PMID: 29297313 PMCID: PMC5751660 DOI: 10.1186/s12863-017-0580-9] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Background Many mammals are well adapted to surviving in extremely cold environments. These species have likely accumulated genetic changes that help them efficiently cope with low temperatures. It is not known whether the same genes related to cold adaptation in one species would be under selection in another species. The aims of this study therefore were: to create a compendium of mammalian genes related to adaptations to a low temperature environment; to identify genes related to cold tolerance that have been subjected to independent positive selection in several species; to determine promising candidate genes/pathways/organs for further empirical research on cold adaptation in mammals. Results After a search for publications containing keywords: “whole genome”, “transcriptome or exome sequencing data”, and “genome-wide genotyping array data” authors looked for information related to genetic signatures ascribable to positive selection in Arctic or Antarctic mammalian species. Publications related to Human, Arctic fox, Yakut horse, Mammoth, Polar bear, and Minke whale were chosen. The compendium of genes that potentially underwent positive selection in >1 of these six species consisted of 416 genes. Twelve of them showed traces of positive selection in three species. Gene ontology term enrichment analysis of 416 genes from the compendium has revealed 13 terms relevant to the scope of this study. We found that enriched terms were relevant to three major groups: terms associated with collagen proteins and the extracellular matrix; terms associated with the anatomy and physiology of cilium; terms associated with docking. We further revealed that genes from compendium were over-represented in the lists of genes expressed in the lung and liver. Conclusions A compendium combining mammalian genes involved in adaptation to cold environment was designed, based on the intersection of positively selected genes from six Arctic and Antarctic species. The compendium contained 416 genes that have been positively selected in at least two species. However, we did not reveal any positively selected genes that would be related to cold adaptation in all species from our list. But, our work points to several strong candidate genes involved in mechanisms and biochemical pathways related to cold adaptation response in different species. Electronic supplementary material The online version of this article (10.1186/s12863-017-0580-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Nikolay S Yudin
- The Federal Research Center Institute of Cytology and Genetics, The Siberian Branch of the Russian Academy of Sciences, 630090, Novosibirsk, Russia. .,Novosibirsk State University, 630090, Novosibirsk, Russia.
| | - Denis M Larkin
- The Federal Research Center Institute of Cytology and Genetics, The Siberian Branch of the Russian Academy of Sciences, 630090, Novosibirsk, Russia.,The Royal Veterinary College, University of London, London, NW1 0TU, UK
| | - Elena V Ignatieva
- The Federal Research Center Institute of Cytology and Genetics, The Siberian Branch of the Russian Academy of Sciences, 630090, Novosibirsk, Russia.,Novosibirsk State University, 630090, Novosibirsk, Russia
| |
Collapse
|
7
|
Ignatieva EV, Afonnikov DA, Saik OV, Rogaev EI, Kolchanov NA. A compendium of human genes regulating feeding behavior and body weight, its functional characterization and identification of GWAS genes involved in brain-specific PPI network. BMC Genet 2016; 17:158. [PMID: 28105929 PMCID: PMC5249002 DOI: 10.1186/s12863-016-0466-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Background Obesity is heritable. It predisposes to many diseases. The objectives of this study were to create a compendium of genes relevant to feeding behavior (FB) and/or body weight (BW) regulation; to construct and to analyze networks formed by associations between genes/proteins; and to identify the most significant genes, biological processes/pathways, and tissues/organs involved in BW regulation. Results The compendium of genes controlling FB or BW includes 578 human genes. Candidate genes were identified from various sources, including previously published original research and review articles, GWAS meta-analyses, and OMIM (Online Mendelian Inheritance in Man). All genes were ranked according to knowledge about their biological role in body weight regulation and classified according to expression patterns or functional characteristics. Substantial and overrepresented numbers of genes from the compendium encoded cell surface receptors, signaling molecules (hormones, neuropeptides, cytokines), transcription factors, signal transduction proteins, cilium and BBSome components, and lipid binding proteins or were present in the brain-specific list of tissue-enriched genes identified with TSEA tool. We identified 27 pathways from KEGG, REACTOME and BIOCARTA whose genes were overrepresented in the compendium. Networks formed by physical interactions or homological relationships between proteins or interactions between proteins involved in biochemical/signaling pathways were reconstructed and analyzed. Subnetworks and clusters identified by the MCODE tool included genes/proteins associated with cilium morphogenesis, signal transduction proteins (particularly, G protein–coupled receptors, kinases or proteins involved in response to insulin stimulus) and transcription regulation (particularly nuclear receptors). We ranked GWAS genes according to the number of neighbors in three networks and revealed 22 GWAS genes involved in the brain-specific PPI network. On the base of the most reliable PPIs functioning in the brain tissue, new regulatory schemes interpreting relevance to BW regulation are proposed for three GWAS genes (ETV5, LRP1B, and NDUFS3). Conclusions A compendium comprising 578 human genes controlling FB or BW was designed, and the most significant functional groups of genes, biological processes/pathways, and tissues/organs involved in BW regulation were revealed. We ranked genes from the GWAS meta-analysis set according to the number and quality of associations in the networks and then according to their involvement in the brain-specific PPI network and proposed new regulatory schemes involving three GWAS genes (ETV5, LRP1B, and NDUFS3) in BW regulation. The compendium is expected to be useful for pathology risk estimation and for design of new pharmacological approaches in the treatment of human obesity. Electronic supplementary material The online version of this article (doi:10.1186/s12863-016-0466-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Elena V Ignatieva
- Center for Brain Neurobiology and Neurogenetics, The Federal Research Center Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia. .,Novosibirsk State University, Novosibirsk, 630090, Russia. .,Laboratory of Evolutionary Bioinformatics and Theoretical Genetics, The Federal Research Center Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia.
| | - Dmitry A Afonnikov
- Center for Brain Neurobiology and Neurogenetics, The Federal Research Center Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia.,Novosibirsk State University, Novosibirsk, 630090, Russia.,Laboratory of Evolutionary Bioinformatics and Theoretical Genetics, The Federal Research Center Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia
| | - Olga V Saik
- Center for Brain Neurobiology and Neurogenetics, The Federal Research Center Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia
| | - Evgeny I Rogaev
- Center for Brain Neurobiology and Neurogenetics, The Federal Research Center Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia.,BNRI, Department of Psychiatry, University of Massachusetts Medical School, Worcester, MA, 15604, USA
| | - Nikolay A Kolchanov
- Novosibirsk State University, Novosibirsk, 630090, Russia.,Department of Systems Biology, The Federal Research Center Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia
| |
Collapse
|