1
|
Association of a Specific OsCULLIN3c Haplotype with Salt Stress Responses in Local Thai Rice. Int J Mol Sci 2024; 25:1040. [PMID: 38256116 PMCID: PMC10815816 DOI: 10.3390/ijms25021040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 01/06/2024] [Accepted: 01/10/2024] [Indexed: 01/24/2024] Open
Abstract
We previously found that OsCUL3c is involved in the salt stress response. However, there are no definitive reports on the diversity of OsCUL3c in local Thai rice. In this study, we showed that the CUL3 group was clearly separated from the other CUL groups; next, we focused on OsCUL3c, the third CUL3 of the CUL3 family in rice, which is absent in Arabidopsis. A total of 111 SNPs and 28 indels over the OsCUL3c region, representing 79 haplotypes (haps), were found. Haplotyping revealed that group I (hap A and hap C) and group II (hap B1 and hap D) were different mutated variants, which showed their association with phenotypes under salt stress. These results were supported by cis-regulatory elements (CREs) and transcription factor binding sites (TFBSs) analyses. We found that LTR, MYC, [AP2; ERF], and NF-YB, which are related to salt stress, drought stress, and the response to abscisic acid (ABA), have distinct positions and numbers in the haplotypes of group I and group II. An RNA Seq analysis of the two predominant haplotypes from each group showed that the OsCUL3c expression of the group I representative was upregulated and that of group II was downregulated, which was confirmed by RT-qPCR. Promoter changes might affect the transcriptional responses to salt stress, leading to different regulatory mechanisms for the expression of different haplotypes. We speculate that OsCUL3c influences the regulation of salt-related responses, and haplotype variations play a role in this regulation.
Collapse
|
2
|
Enhanced Transcriptional Strength of HIV-1 Subtype C Minimizes Gene Expression Noise and Confers Stability to the Viral Latent State. J Virol 2023; 97:e0137622. [PMID: 36533949 PMCID: PMC9888270 DOI: 10.1128/jvi.01376-22] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Stochastic fluctuations in gene expression emanating from the HIV-1 long terminal repeat (LTR), amplified by the Tat positive feedback circuit, determine the choice between viral infection fates: active transcription (ON) or transcriptional silence (OFF). The emergence of several transcription factor binding site (TFBS) variant strains in HIV-1 subtype C (HIV-1C), especially those containing the duplication of the NF-κB motif, mandates the evaluation of the effect of enhanced transcriptional strength on gene expression noise and its influence on viral fate selection switch. Using a panel of subgenomic LTR-variant strains containing different copy numbers of the NF-κB motif (ranging from 0 to 4), we used flow cytometry, mRNA quantification, and pharmacological perturbations to demonstrate an inverse correlation between promoter strength and gene expression noise in Jurkat T cells and primary CD4+ T cells. The inverse correlation is consistent in clonal cell populations at constant intracellular concentrations of Tat and when NF-κB levels were regulated pharmacologically. Further, we show that strong LTRs containing at least two copies of the NF-κB motif in the enhancer establish a more stable latent state and demonstrate more rapid latency reversal than weak LTRs containing fewer motifs. We also demonstrate a cooperative binding of NF-κB to the motif cluster in HIV-1C LTRs containing two, three, or four NF-κB motifs (Hill coefficient [H] = 2.61, 3.56, and 3.75, respectively). The present work alludes to a possible evolution of the HIV-1C LTR toward gaining transcriptional strength associated with attenuated gene expression noise with implications for viral latency. IMPORTANCE Over the past two consecutive decades, HIV-1 subtype C (HIV-1C) has been undergoing directional evolution toward augmenting the transcriptional strength of the long terminal repeat (LTR) by adding more copies of the existing transcription factor binding site (TFBS) by sequence duplication. Additionally, the duplicated elements are genetically diverse, suggesting broader-range signal receptivity by variant LTRs. The HIV-1 promoter is inherently noisy, and the stochastic fluctuations in gene expression of variant LTRs may influence the active transcription (ON)/transcriptional silence (OFF) latency decisions. The evolving NF-κB motif variations of HIV-1C offer a powerful opportunity to examine how the transcriptional strength of the LTR might influence gene expression noise. Our work here shows that the augmented transcriptional strength of the HIV-1C LTR leads to concomitantly reduced gene expression noise, consequently leading to stabler latency maintenance and rapid latency reversal. The present work offers a novel lead toward appreciating the molecular mechanisms governing HIV-1 latency.
Collapse
|
3
|
GUS Reporter-Aided Promoter Deletion Analysis of A. thaliana POLYAMINE OXIDASE 3. Int J Mol Sci 2023; 24:ijms24032317. [PMID: 36768644 PMCID: PMC9916862 DOI: 10.3390/ijms24032317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2022] [Revised: 01/18/2023] [Accepted: 01/19/2023] [Indexed: 01/26/2023] Open
Abstract
Polyamine oxidases (PAOs) have been correlated with numerous physiological and developmental processes, as well as responses to biotic and abiotic stress conditions. Their transcriptional regulation is driven by signals generated by various developmental and environmental cues, including phytohormones. However, the inductive mechanism(s) of the corresponding genes remains elusive. Out of the five previously characterized Arabidopsis PAO genes, none of their regulatory sequences have been analyzed to date. In this study, a GUS reporter-aided promoter deletion approach was used to investigate the transcriptional regulation of AtPAO3 during normal growth and development as well as under various inductive environments. AtPAO3 contains an upstream open reading frame (uORF) and a short inter-cistronic sequence, while the integrity of both appears to be crucial for the proper regulation of gene expression. The full-length promoter contains several cis-acting elements that regulate the tissue-specific expression of AtPAO3 during normal growth and development. Furthermore, a number of TFBS that are involved in gene induction under various abiotic stress conditions display an additive effect on gene expression. Taken together, our data indicate that the transcription of AtPAO3 is regulated by multiple environmental factors, which probably work alongside hormonal signals and shed light on the fine-tuning mechanisms of PAO regulation.
Collapse
|
4
|
SEMplMe: a tool for integrating DNA methylation effects in transcription factor binding affinity predictions. BMC Bioinformatics 2022; 23:317. [PMID: 35927613 PMCID: PMC9351228 DOI: 10.1186/s12859-022-04865-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Accepted: 07/28/2022] [Indexed: 12/02/2022] Open
Abstract
MOTIVATION Aberrant DNA methylation in transcription factor binding sites has been shown to lead to anomalous gene regulation that is strongly associated with human disease. However, the majority of methylation-sensitive positions within transcription factor binding sites remain unknown. Here we introduce SEMplMe, a computational tool to generate predictions of the effect of methylation on transcription factor binding strength in every position within a transcription factor's motif. RESULTS SEMplMe uses ChIP-seq and whole genome bisulfite sequencing to predict effects of methylation within binding sites. SEMplMe validates known methylation sensitive and insensitive positions within a binding motif, identifies cell type specific transcription factor binding driven by methylation, and outperforms SELEX-based predictions for CTCF. These predictions can be used to identify aberrant sites of DNA methylation contributing to human disease. AVAILABILITY AND IMPLEMENTATION SEMplMe is available from https://github.com/Boyle-Lab/SEMplMe .
Collapse
|
5
|
Genome-wide identification of StU-box gene family and assessment of their expression in developmental stages of Solanum tuberosum. JOURNAL OF GENETIC ENGINEERING AND BIOTECHNOLOGY 2022; 20:25. [PMID: 35147812 PMCID: PMC8837765 DOI: 10.1186/s43141-022-00306-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 01/19/2022] [Indexed: 12/16/2022]
Abstract
Background The Plant U-box (PUB), ubiquitin ligase gene, has a highly conserved domain in potato. However, little information is available about U-box genes in potato (Solanum tuberosum). In this study, 62 U-box genes were detected in the potato genome using bioinformatics methods. Further, motif analysis, gene structure, gene expression, TFBS, and synteny analysis were performed on the U-box genes. Results Based on in silico analysis, most of StU-boxs included a U-box domain; however, some of them lacked harbored domain the ARM, Pkinase_Tyr, and other domains. Based on their phylogenetic relationships, the StU-box family members were categorized into four classes. Analysis of transcription factor binding sites (TFBS) in the promoter region of StU-box genes revealed that StU-box genes had the highest and the lowest number of TFBS in MYB and CSD, respectively. Moreover, based on in silico and gene expression data, variable frequencies of TFBS in StU-box genes could indicate that these genes control different developmental stages and are involved in complex regulatory mechanisms. The number of exons in U-box genes ranged from one to sixteen. For most U-box genes, the exon–intron compositions and conserved motifs composition in most proteins in each group were similar. The intron–exon patterns and the composition of conserved motifs validated the U-box genes phylogenetic classification. Based on the results of genome distribution, StU-box genes were distributed unevenly on the 12 S. tuberosum chromosomes. The results showed that gene duplication may possess a significant role in genome expansion of S. tuberosum. Furthermore, genome evolution of S. tuberosum was surveyed using identification of orthologous and paralogous. We identified 40 orthologous gene pairs between S. tuberosum with Solanum lycopersicum, Oryza sativa, Triticum aestivum, Gossypium hirsutum, Zea maize, Coriaria mytifolia, and Arabidopsis thaliana as well as eight duplicated genes (paralogous) in S. tuberosum. StU-box 51 gene is one of the important gene among other StU-boxes in S. tuberosum under drought stress which was expressed in tuber and leaf under drought stress. Furthermore, StU-box 51 gene has the highest expression levels in four tissue-specific (stem, root, leaf, and tuber) in potato as well as it had the highest number of TFBS in promoter region. Based on our results, StU-box 51 can introduce to researcher to utilize in breeding program and genetic engineering in potato. Conclusions The results of this survey will be useful for further investigation of the probable role and molecular mechanisms of U-box genes in response to different stresses. Supplementary Information The online version contains supplementary material available at 10.1186/s43141-022-00306-7.
Collapse
|
6
|
NoAS-DS: Neural optimal architecture search for detection of diverse DNA signals. Neural Netw 2021; 147:63-71. [PMID: 34979461 DOI: 10.1016/j.neunet.2021.12.009] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Revised: 11/22/2021] [Accepted: 12/17/2021] [Indexed: 01/05/2023]
Abstract
Neural network architectures are high-performing variable models that can solve many learning tasks. Designing architectures manually require substantial time and also prior knowledge and expertise to develop a high-accuracy model. Most of the architecture search methods are developed over the task of image classification resulting in the building of complex architectures intended for large data inputs such as images. Motivated by the applications of DNA computing in Neural Architecture Search (NAS), we propose NoAS-DS which is specifically built for the architecture search of sequence-based classification tasks. Furthermore, NoAS-DS is applied to the task of predicting binding sites. Unlike other methods that implement only Convolution layers, NoAS-DS, specifically combines Convolution and LSTM layers that helps in the process of automatic architecture building. This hybrid approach helped in achieving high accuracy results on TFBS and RBP datasets which outperformed other models in TF-DNA binding prediction tasks. The best architectures generated by the proposed model can be applied to other DNA datasets of similar nature using transfer learning technique that demonstrates its generalization capability. This greatly reduces the effort required to build new architectures for other prediction tasks.
Collapse
|
7
|
Characterization of promoters in archaeal genomes based on DNA structural parameters. Microbiologyopen 2021; 10:e1230. [PMID: 34713600 PMCID: PMC8553660 DOI: 10.1002/mbo3.1230] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Revised: 07/27/2021] [Accepted: 07/29/2021] [Indexed: 11/10/2022] Open
Abstract
The transcription machinery of archaea can be roughly classified as a simplified version of eukaryotic organisms. The basal transcription factor machinery binds to the TATA box found around 28 nucleotides upstream of the transcription start site; however, some transcription units lack a clear TATA box and still have TBP/TFB binding over them. This apparent absence of conserved sequences could be a consequence of sequence divergence associated with the upstream region, operon, and gene organization. Furthermore, earlier studies have found that a structural analysis gains more information compared with a simple sequence inspection. In this work, we evaluated and coded 3630 archaeal promoter sequences of three organisms, Haloferax volcanii, Thermococcus kodakarensis, and Sulfolobus solfataricus into DNA duplex stability, enthalpy, curvature, and bendability parameters. We also split our dataset into conserved TATA and degenerated TATA promoters to identify differences among these two classes of promoters. The structural analysis reveals variations in archaeal promoter architecture, that is, a distinctive signal is observed in the TFB, TBP, and TFE binding sites independently of these being TATA-conserved or TATA-degenerated. In addition, the promoter encountering method was validated with upstream regions of 13 other archaea, suggesting that there might be promoter sequences among them. Therefore, we suggest a novel method for locating promoters within the genome of archaea based on DNA energetic/structural features.
Collapse
|
8
|
Caprine Arthritis Encephalitis Virus Is Associated with Renal Lesions. Viruses 2021; 13:v13061051. [PMID: 34206110 PMCID: PMC8230173 DOI: 10.3390/v13061051] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Revised: 05/27/2021] [Accepted: 05/29/2021] [Indexed: 11/16/2022] Open
Abstract
Caprine arthritis encephalitis virus (CAEV) is a monocyte/macrophage-tropic lentivirus that primarily infects goats resulting in a well-recognized set of chronic inflammatory syndromes focused on the joint synovium, tissues of the central nervous system, pulmonary interstitium and mammary gland. Clinically affected animals generally manifest with one or more of these classic CAEV-associated tissue lesions; however, CAEV-associated renal inflammation in goats has not been reported in the peer-reviewed literature. Here we describe six goats with chronic, multisystemic CAEV infections in conjunction with CAEV-associated renal lesions. One of the animals had CAEV antigen-associated thrombotic arteritis resulting in infarction of both the kidney and heart. These goats had microscopic evidence of inflammatory renal injury (interstitial nephritis) with detectable renal immunolabeling for CAEV antigen in three of six animals and amplifiable proviral sequences consistent with CAEV in all six animals. Cardiac lesions (vascular, myocardial or endocardial) were also identified in four of six animals. Within the viral promoter (U3) region, known transcription factor binding sites (TFBSs) were generally conserved, although one viral isolate had a duplication of the U3 A region encoding a second gamma-activated site (GAS). Despite the TFBS conservation, the isolates demonstrated a degree of phylogenetic diversity. At present, the clinical consequence of CAEV-associated renal injury is not clear.
Collapse
|
9
|
Amino Acid and Carbohydrate Metabolism Are Coordinated to Maintain Energetic Balance during Drought in Sugarcane. Int J Mol Sci 2020; 21:ijms21239124. [PMID: 33266228 PMCID: PMC7729667 DOI: 10.3390/ijms21239124] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Revised: 11/12/2020] [Accepted: 11/13/2020] [Indexed: 01/10/2023] Open
Abstract
The ability to expand crop plantations without irrigation is a major goal to increase agriculture sustainability. To achieve this end, we need to understand the mechanisms that govern plant growth responses under drought conditions. In this study, we combined physiological, transcriptomic, and genomic data to provide a comprehensive picture of drought and recovery responses in the leaves and roots of sugarcane. Transcriptomic profiling using oligoarrays and RNA-seq identified 2898 (out of 21,902) and 46,062 (out of 373,869) transcripts as differentially expressed, respectively. Co-expression analysis revealed modules enriched in photosynthesis, small molecule metabolism, alpha-amino acid metabolism, trehalose biosynthesis, serine family amino acid metabolism, and carbohydrate transport. Together, our findings reveal that carbohydrate metabolism is coordinated with the degradation of amino acids to provide carbon skeletons to the tricarboxylic acid cycle. This coordination may help to maintain energetic balance during drought stress adaptation, facilitating recovery after the stress is alleviated. Our results shed light on candidate regulatory elements and pave the way to biotechnology strategies towards the development of drought-tolerant sugarcane plants.
Collapse
|
10
|
Homeologs of Brassica SOC1, a central regulator of flowering time, are differentially regulated due to partitioning of evolutionarily conserved transcription factor binding sites in promoters. Mol Phylogenet Evol 2020; 147:106777. [PMID: 32126279 DOI: 10.1016/j.ympev.2020.106777] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Revised: 02/03/2020] [Accepted: 02/26/2020] [Indexed: 01/06/2023]
Abstract
Evolution of Brassica genome post-polyploidization reveals asymmetrical genome fractionation and copy number variation. Herein, we describe the impact of promoter divergence among SUPPRESSOR OF OVEREXPRESSION OF CONSTANS1 (SOC1) homeologs on expression and function in Brassica spp. SOC1, a regulated floral pathway integrator, is conserved as 3 redundant homeologs in diploid Brassicas. Even with high sequence identity within coding regions (92.8-100%), the spatio-temporal expression patterns of 9 SOC1 homologs in B. juncea and B. nigra indicates regulatory divergence. While LF and MF2 SOC1 homeologs are upregulated during floral transition, MF1 is barely expressed. Also, MF2 homeolog levels do not decline post-flowering, unlike LF. To investigate the underlying source of divergence, we analyzed the sequence and phylogeny of all reported (22) and isolated (21) upstream regions of Brassica SOC1. Full length upstream regions (4712-19189 bp) reveal 5 ubiquitously conserved ancestral Blocks, harboring binding sites of 18 TFs (TFBSs) characterized in Arabidopsis thaliana. The orthologs of these TFBSs are differentially conserved among Brassica SOC1 homeologs, imparting expression divergence. No crucial TFBSs are exclusively lost from LF_SOC1 promoter, while MF1_SOC1 has lost NF-Y binding site crucial for SOC1 activation by CONSTANS. MF2_SOC1 homeologs have lost important TFBSs (SEP3, AP1 and SMZ), responsible for SOC1 repression post-flowering. BjuAALF_SOC1 promoter (proximal 2 kb) shows ubiquitous reporter expression in B. juncea cv. Varuna transgenics, while BjuAAMF1_SOC1 promoter shows absence of reporter expression, validating the impact of TFBS divergence. Conservation of the original primary protein sequence is discovered in B. rapa homeologs (46) of 18 TFs. Co-regulation pattern of these TFs appeared similar for B. rapa LF and MF2 SOC1 homeologs; MF1 shows significant variation. Strong regulatory association is recorded for AP1, AP2, SEP3, FLC and CONSTANS/NF-Y, highlighting their importance in homeolog-specific SOC1 regulation. Correlation of B. juncea AP1, AP2 and FLC expression with SOC1 homeologs also complies with the TFBS differences. We thus conclude that redundant SOC1 loci contribute differentially to cumulative expression of SOC1 due to divergent selection of ancestral TFBSs.
Collapse
|
11
|
The Plant NF-Y DNA Matrix In Vitro and In Vivo. PLANTS 2019; 8:plants8100406. [PMID: 31658622 PMCID: PMC6843132 DOI: 10.3390/plants8100406] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Revised: 09/24/2019] [Accepted: 10/08/2019] [Indexed: 01/17/2023]
Abstract
Nuclear Factor Y (NF-Y) is an evolutionarily conserved trimer formed by a Histone-Fold Domain (HFD) heterodimeric module shared by core histones, and the sequence-specific NF-YA subunit. In plants, the genes encoding each of the three subunits have expanded in number, giving rise to hundreds of potential trimers. While in mammals NF-Y binds a well-characterized motif, with a defined matrix centered on the CCAAT box, the specificity of the plant trimers has yet to be determined. Here we report that Arabidopsis thaliana NF-Y trimeric complexes, containing two different NF-YA subunits, bind DNA in vitro with similar affinities. We assayed precisely sequence-specificity by saturation mutagenesis, and analyzed genomic DNA sites bound in vivo by selected HFDs. The plant NF-Y CCAAT matrix is different in nucleotides flanking CCAAT with respect to the mammalian matrix, in vitro and in vivo. Our data point to flexible DNA-binding rules by plant NF-Ys, serving the scope of adapting to a diverse audience of genomic motifs.
Collapse
|
12
|
Promoter conservation in HDACs points to functional implications. BMC Genomics 2019; 20:613. [PMID: 31351464 PMCID: PMC6660948 DOI: 10.1186/s12864-019-5973-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 07/12/2019] [Indexed: 01/05/2023] Open
Abstract
Background Histone deacetylases (HDACs) are the proteins responsible for removing the acetyl group from lysine residues of core histones in chromosomes, a crucial component of gene regulation. Eleven known HDACs exist in humans and most other vertebrates. While the basic function of HDACs has been well characterized and new discoveries are still being made, the transcriptional regulation of their corresponding genes is still poorly understood. Results Here, we conducted a computational analysis of the eleven HDAC promoter sequences in 25 vertebrate species to determine whether transcription factor binding sites (TFBSs) are conserved in HDAC evolution, and if so, whether they provide useful information about HDAC expression and function. Furthermore, we used tissue-specific information of transcription factors to investigate the potential expression patterns of HDACs in different human tissues based on their transcription factor binding sites. We found that the TFBS profiles of most of the HDACs were well conserved in closely related species for all HDAC promoters except HDAC7 and HDAC10. HDAC5 had particularly strong conservation across over half of the species studied, with nearly identical profiles in the primate species. Our comparisons of TFBSs with the tissue specific gene expression profiles of their corresponding TFs showed that most HDACs had the ability to be ubiquitously expressed. A few HDAC promoters exhibited the potential for preferential expression in certain tissues, most notably HDAC11 in gall bladder, while HDAC9 seemed to have less propensity for expression in the nervous system. Conclusions In general, we found evolutionary conservation in HDAC promoters that seems to be more prominent for the ubiquitously expressed HDACs. In turn, when conservation did not follow usual phylogeny, human TFBS patterns indicated possible functional relevance. While we found that HDACs appear to uniformly expressed, we confirm that the functional differences in HDACs may be less a matter of location of activity than a question of which proteins and which acetyl groups they may be acting on. Electronic supplementary material The online version of this article (10.1186/s12864-019-5973-x) contains supplementary material, which is available to authorized users.
Collapse
|
13
|
Single Nucleotide Polymorphisms of NUCB2 and their Genetic Associations with Milk Production Traits in Dairy Cows. Genes (Basel) 2019; 10:genes10060449. [PMID: 31200542 PMCID: PMC6627143 DOI: 10.3390/genes10060449] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2019] [Revised: 06/04/2019] [Accepted: 06/12/2019] [Indexed: 02/07/2023] Open
Abstract
We previously used the RNA sequencing technique to detect the hepatic transcriptome of Chinese Holstein cows among the dry period, early lactation, and peak of lactation, and implied that the nucleobindin 2 (NUCB2) gene might be associated with milk production traits due to its expression being significantly increased in early lactation or peak of lactation as compared to dry period (q value < 0.05). Hence, in this study, we detected the single nucleotide polymorphisms (SNPs) of NUCB2 and analyzed their genetic associations with milk yield, fat yield, fat percentage, protein yield, and protein percentage. We re-sequenced the entire coding and 2000 bp of 5′ and 3′ flanking regions of NUCB2 by pooled sequencing, and identified ten SNPs, including one in 5′ flanking region, two in 3′ untranslated region (UTR), and seven in 3′ flanking region. The single-SNP association analysis results showed that the ten SNPs were significantly associated with milk yield, fat yield, fat percentage, protein yield, or protein percentage in the first or second lactation (p values <= 1 × 10−4 and 0.05). In addition, we estimated the linkage disequilibrium (LD) of the ten SNPs by Haploview 4.2, and found that the SNPs were highly linked in one haplotype block (D′ = 0.98–1.00), and the block was also significantly associated with at least one milk traits in the two lactations (p values: 0.0002–0.047). Further, we predicted the changes of transcription factor binding sites (TFBSs) that are caused by the SNPs in the 5′ flanking region of NUCB2, and considered that g.35735477C>T might affect the expression of NUCB2 by changing the TFBSs for ETS transcription factor 3 (ELF3), caudal type homeobox 2 (CDX2), mammalian C-type LTR TATA box (VTATA), nuclear factor of activated T-cells (NFAT), and v-ets erythroblastosis virus E26 oncogene homolog (ERG) (matrix similarity threshold, MST > 0.85). However, the further study should be performed to verify the regulatory mechanisms of NUCB2 and its polymorphisms on milk traits. Our findings first revealed the genetic effects of NUCB2 on the milk traits in dairy cows, and suggested that the significant SNPs could be used in genomic selection to improve the accuracy of selection for dairy cattle breeding.
Collapse
|
14
|
Gain of transcription factor binding sites is associated to changes in the expression signature of human brain and testis and is correlated to genes with higher expression breadth. SCIENCE CHINA-LIFE SCIENCES 2019; 62:526-534. [PMID: 30919278 DOI: 10.1007/s11427-018-9454-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2018] [Accepted: 10/15/2018] [Indexed: 11/26/2022]
Abstract
The gain of transcription factor binding sites (TFBS) is believed to represent one of the major causes of biological innovation. Here we used strategies based on comparative genomics to identify 21,822 TFBS specific to the human lineage (TFBS-HS), when compared to chimpanzee and gorilla genomes. More than 40% (9,206) of these TFBS-HS are in the vicinity of 1,283 genes. A comparison of the expression pattern of these genes and the corresponding orthologs in chimpanzee and gorilla identified genes differentially expressed in human tissues. These genes show a more divergent expression pattern in the human testis and brain, suggesting a role for positive selection in the fixation of TFBS gains. Genes associated with TFBS-HS were enriched in gene ontology categories related to transcriptional regulation, signaling, differentiation/development and nervous system. Furthermore, genes associated with TFBS-HS present a higher expression breadth when compared to genes in general. This biased distribution is due to a preferential gain of TFBS in genes with higher expression breadth rather than a shift in the expression pattern after the gain of TFBS.
Collapse
|
15
|
Abstract
BACKGROUND In living organisms, small heat shock proteins (sHSPs) are triggered in response to stress situations. This family of proteins is large in plants and, in the case of tomato (Solanum lycopersicum), 33 genes have been identified, most of them related to heat stress response and to the ripening process. Transcriptomic and proteomic studies have revealed complex patterns of expression for these genes. In this work, we investigate the coregulation of these genes by performing a computational analysis of their promoter architecture to find regulatory motifs known as heat shock elements (HSEs). We leverage the presence of sHSP members that originated from tandem duplication events and analyze the promoter architecture diversity of the whole sHSP family, focusing on the identification of HSEs. RESULTS We performed a search for conserved genomic sequences in the promoter regions of the sHSPs of tomato, plus several other proteins (mainly HSPs) that are functionally related to heat stress situations or to ripening. Several computational analyses were performed to build multiple sequence motifs and identify transcription factor binding sites (TFBS) homologous to HSF1AE and HSF21 in Arabidopsis. We also investigated the expression and interaction of these proteins under two heat stress situations in whole tomato plants and in protoplast cells, both in the presence and in the absence of heat shock transcription factor A2 (HsfA2). The results of these analyses indicate that different sHSPs are up-regulated depending on the activation or repression of HsfA2, a key regulator of HSPs. Further, the analysis of protein-protein interaction between the sHSP protein family and other heat shock response proteins (Hsp70, Hsp90 and MBF1c) suggests that several sHSPs are mediating alternative stress response through a regulatory subnetwork that is not dependent on HsfA2. CONCLUSIONS Overall, this study identifies two regulatory motifs (HSF1AE and HSF21) associated with the sHSP family in tomato which are considered genomic HSEs. The study also suggests that, despite the apparent redundancy of these proteins, which has been linked to gene duplication, tomato sHSPs showed different up-regulation and different interaction patterns when analyzed under different stress situations.
Collapse
|
16
|
Simulation Study of cDNA Dataset to Investigate Possible Association of Differentially Expressed Genes of Human THP1-Monocytic Cells in Cancer Progression Affected by Bacterial Shiga Toxins. Front Microbiol 2018; 9:380. [PMID: 29593668 PMCID: PMC5859033 DOI: 10.3389/fmicb.2018.00380] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2018] [Accepted: 02/20/2018] [Indexed: 12/30/2022] Open
Abstract
Shiga toxin (Stxs) is a family of structurally and functionally related bacterial cytotoxins produced by Shigella dysenteriae serotype 1 and shigatoxigenic group of Escherichia coli that cause shigellosis and hemorrhagic colitis, respectively. Until recently, it has been thought that Stxs only inhibits the protein synthesis and induces expression to a limited number of genes in host cells, but recent data showed that Stxs can trigger several signaling pathways in mammalian cells and activate cell cycle and apoptosis. To explore the changes in gene expression induced by Stxs that have been shown in other systems to correlate with cancer progression, we performed the simulated analysis of cDNA dataset and found differentially expressed genes (DEGs) of human THP1-monocytic cells treated with Stxs. In this study, the entire data (treated and untreated replicates) was analyzed by statistical algorithms implemented in Bioconductor packages. The output data was validated by the k-fold cross technique using generalized linear Gaussian models. A total of 50 DEGs were identified. 7 genes including TSLP, IL6, GBP1, CD274, TNFSF13B, OASL, and PNPLA3 were considerably (<0.00005) related to cancer proliferation. The functional enrichment analysis showed 6 down-regulated and 1 up-regulated genes. Among these DEGs, IL6 was associated with several cancers, especially with leukemia, lymphoma, lungs, liver and breast cancers. The predicted regulatory motifs of these genes include conserved RELA, STATI, IRFI, NF-kappaB, PEND, HLF, REL, CEBPA, DI_2, and NFKB1 transcription factor binding sites (TFBS) involved in the complex biological functions. Thus, our findings suggest that Stxs has the potential as a valuable tool for better understanding of treatment strategies for several cancers.
Collapse
|
17
|
In silico comparative analysis of promoters derived from plant pararetroviruses. Virusdisease 2017; 28:416-421. [PMID: 29291233 DOI: 10.1007/s13337-017-0410-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2017] [Accepted: 11/08/2017] [Indexed: 10/18/2022] Open
Abstract
Promoters are specific sequence of nucleotides present upstream of gene coding region involved in initiation and regulation of transcription. Multiple cis-acting element forms the architecture of promoter to which trans-acting nucleic binding factors bind and regulates its activity. Since 1980 genome of pararetrovirus, are being exploited for developing efficient promoters. Among all of them Cauliflower mosaic virus is the most widely used promoter for gene expression. The basic rational behind the strength of promoter lies in the sequence of cis elements and the spacer nucleotide elements between them, thereby strength of these promoter fragments can be regulated by altering these nucleotide sequences. In the present study sequence of eight putative promoters of plant pararetrovirus are retrieved from National Centre for Biotechnology Information (NCBI) website. These sequence are subjected to various bioinformatics tools comprises of Clustal W, Plant Care, Mathinspector, ModelInspector for establishing the phylogenetic similarity, to identify the quantity and quality of present cis-elements, to find the various common transcription factors binding sites and to determine the presence of module for various specific functions respectively. A range of 28.80-56.0 percentage identification was observed in phylogenetic analysis, with the greatest similarity in Mirabilis mosaic virus and Dahlia mosaic virus. A broad range of cis-elements (51), transcription factor binding site (512) was obtained and 60% observed module are in combination with DOFF motif which shows a function relevance with abiotic stress inducibility. The present study had revealed the functional significance of these elements in gene regulation of pararetrovirus genome and also gives a overall idea for designing novel synthetic promoter.
Collapse
|
18
|
EMQIT: a machine learning approach for energy based PWM matrix quality improvement. Biol Direct 2017; 12:17. [PMID: 28764727 PMCID: PMC5539975 DOI: 10.1186/s13062-017-0189-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2017] [Accepted: 07/17/2017] [Indexed: 11/10/2022] Open
Abstract
Background Transcription factor binding affinities to DNA play a key role for the gene regulation. Learning the specificity of the mechanisms of binding TFs to DNA is important both to experimentalists and theoreticians. With the development of high-throughput methods such as, e.g., ChiP-seq the need to provide unbiased models of binding events has been made apparent. We present EMQIT a modification to the approach introduced by Alamanova et al. and later implemented as 3DTF server. We observed that tuning of Boltzmann factor weights, used for conversion of calculated energies to nucleotide probabilities, has a significant impact on the quality of the associated PWM matrix. Results Consequently, we proposed to use receiver operator characteristics curves and the 10-fold cross-validation to learn best weights using experimentally verified data from TRANSFAC database. We applied our method to data available for various TFs. We verified the efficiency of detecting TF binding sites by the 3DTF matrices improved with our technique using experimental data from the TRANSFAC database. The comparison showed a significant similarity and comparable performance between the improved and the experimental matrices (TRANSFAC). Improved 3DTF matrices achieved significantly higher AUC values than the original 3DTF matrices (at least by 0.1) and, at the same time, detected notably more experimentally verified TFBSs. Conclusions The resulting new improved PWM matrices for analyzed factors show similarity to TRANSFAC matrices. Matrices had comparable predictive capabilities. Moreover, improved PWMs achieve better results than matrices downloaded from 3DTF server. Presented approach is general and applicable to any energy-based matrices. EMQIT is available online at http://biosolvers.polsl.pl:3838/emqit. Reviewers This article was reviewed by Oliviero Carugo, Marek Kimmel and István Simon. Electronic supplementary material The online version of this article (doi:10.1186/s13062-017-0189-y) contains supplementary material, which is available to authorized users.
Collapse
|
19
|
Regulatory Features for Odorant Receptor Genes in the Mouse Genome. Front Genet 2017; 8:19. [PMID: 28270833 PMCID: PMC5318403 DOI: 10.3389/fgene.2017.00019] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2016] [Accepted: 02/08/2017] [Indexed: 12/31/2022] Open
Abstract
The odorant receptor genes, seven transmembrane receptor genes constituting the vastest mammalian gene multifamily, are expressed monogenically and monoallelicaly in each sensory neuron in the olfactory epithelium. This characteristic, often referred to as the one neuron-one receptor rule, is driven by mostly uncharacterized molecular dynamics, generally named odorant receptor gene choice. Much attention has been paid by the scientific community to the identification of sequences regulating the expression of odorant receptor genes within their loci, where related genes are usually arranged in genomic clusters. A number of studies identified transcription factor binding sites on odorant receptor promoter sequences. Similar binding sites were also found on a number of enhancers that regulate in cis their transcription, but have been proposed to form interchromosomal networks. Odorant receptor gene choice seems to occur via the local removal of strongly repressive epigenetic markings, put in place during the maturation of the sensory neuron on each odorant receptor locus. Here we review the fast-changing state of art for the study of regulatory features for odorant receptor genes.
Collapse
|
20
|
Data in support of FSH induction of IRS-2 in human granulosa cells: Mapping the transcription factor binding sites in human IRS-2 promoter. Data Brief 2016; 6:162-7. [PMID: 26858982 PMCID: PMC4706562 DOI: 10.1016/j.dib.2015.12.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2015] [Revised: 12/01/2015] [Accepted: 12/06/2015] [Indexed: 01/08/2023] Open
Abstract
Insulin receptor substrate-2 (IRS-2) plays critical role in the regulation of various metabolic processes by insulin and IGF-1. The defects in its expression and/or function are linked to diseases like polycystic ovary syndrome (PCOS), insulin resistance and cancer. To predict the transcription factors (TFs) responsible for the regulation of human IRS-2 gene expression, the transcription factor binding sites (TFBS) and the corresponding TFs were investigated by analysis of IRS-2 promoter sequence using MatInspector Genomatix software (Cartharius et al., 2005 [1]). The ibid data is part of author׳s publication (Anjali et al., 2015 [2]) that explains Follicle stimulating hormone (FSH) mediated IRS-2 promoter activation in human granulosa cells and its importance in the pathophysiology of PCOS. Further analysis was carried out for binary interactions of TF regulatory genes in IRS-2 network using Cytoscape software tool and R-code. In this manuscript, we describe the methodology used for the identification of TFBSs in human IRS-2 promoter region and provide details on experimental procedures, analysis method, validation of data and also the raw files. The purpose of this article is to provide the data on all TFBSs in the promoter region of human IRS-2 gene as it has the potential for prediction of the regulation of IRS-2 gene in normal or diseased cells from patients with metabolic disorders and cancer.
Collapse
|
21
|
Transcriptional regulation of microRNA-100, -146a, and -150 genes by p53 and NFκB p65/RelA in mouse striatal STHdh(Q7)/ Hdh(Q7) cells and human cervical carcinoma HeLa cells. RNA Biol 2016; 12:457-77. [PMID: 25757558 DOI: 10.1080/15476286.2015.1014288] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
MicroRNA (miRNA) genes generally share many features common to those of protein coding genes. Various transcription factors (TFs) and co-regulators are also known to regulate miRNA genes. Here we identify novel p53 and NFκB p65/RelA responsive miRNAs and demonstrate that these 2 TFs bind to the regulatory sequences of miR-100, -146a and -150 in both mouse striatal and human cervical carcinoma cells and regulate their expression. p53 represses the miRNAs while NFκB p65/RelA induces them. Further, we provide evidence that exogenous p53 inhibits NFκB p65/RelA activity by reducing its nuclear content and competing with it for CBP binding. This suggests for the existence of a functional cross-talk between the 2 TFs in regulating miRNA expression. Moreover, promoter occupancy assay reveals that exogenous p53 excludes NFκB p65/RelA from its binding site in the upstream sequence of miR-100 gene thereby causing its repression. Thus, our work identifies novel p53 and NFκB p65/RelA responsive miRNAs in human and mouse and uncovers possible mechanisms of co-regulation of miR-100. It is to be mentioned here that cross-talks between p53 and NFκB p65/RelA have been observed to define the outcome of several biological processes and that the pro-apoptotic effect of p53 and the pro-survival functions of NFκB can be largely mediated via the biological roles of the miRNAs these TFs regulate. Our observation with cell lines thus provides an important platform upon which further work is to be done to establish the biological significance of such co-regulation of miRNAs by p53 and NFκB p65/RelA.
Collapse
Key Words
- ChIP, Chromatin immunoprecipitation
- Co-IP, Co-immunoprecipitation
- NFκB p65/RelA
- NFκB, nuclear factor kappa-light-chain-enhancer of activated B cells
- RLU, Relative light unit
- RNA POL II, RNA Polymerase II
- RNA POL III, RNA Polymerase III
- RT-PCR, Reverse transcription polymerase chain reaction
- TF, Transcriptional factor
- TFBS
- Transcription factor binding site
- WB, Western blot
- miR-100
- miR-146a
- miR-150
- miRNA gene regulation
- miRNAs, microRNAs
- microRNA
- p53
- p53, tumor protein 53
- p65, RELA, RELA
- transcription factor
- v-rel avian reticuloendotheliosis viral oncogene homolog A
Collapse
|
22
|
Methylated Cytosines Mutate to Transcription Factor Binding Sites that Drive Tetrapod Evolution. Genome Biol Evol 2015; 7:3155-69. [PMID: 26507798 PMCID: PMC4994754 DOI: 10.1093/gbe/evv205] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
In mammals, the cytosine in CG dinucleotides is typically methylated producing
5-methylcytosine (5mC), a chemically less stable form of cytosine that can spontaneously
deaminate to thymidine resulting in a T•G mismatched base pair. Unlike other eukaryotes
that efficiently repair this mismatched base pair back to C•G, in mammals, 5mCG
deamination is mutagenic, sometimes producing TG dinucleotides, explaining the depletion
of CG dinucleotides in mammalian genomes. It was suggested that new TG dinucleotides
generate genetic diversity that may be critical for evolutionary change. We tested this
conjecture by examining the DNA sequence properties of regulatory sequences identified by
DNase I hypersensitive sites (DHSs) in human and mouse genomes. We hypothesized that the
new TG dinucleotides generate transcription factor binding sites (TFBS) that become
tissue-specific DHSs (TS-DHSs). We find that 8-mers containing the CG dinucleotide are
enriched in DHSs in both species. However, 8-mers containing a TG and no CG dinucleotide
are preferentially enriched in TS-DHSs when compared with 8-mers with neither a TG nor a
CG dinucleotide. The most enriched 8-mer with a TG and no CG dinucleotide in
tissue-specific regulatory regions in both genomes is the AP-1 motif
(TGAC/GTCAN), and we find evidence that
TG dinucleotides in the AP-1 motif arose from CG dinucleotides. Additional TS-DHS-enriched
TFBS containing the TG/CA dinucleotide are the E-Box motif
(GCAGCTGC), the NF-1 motif (GGCA—TGCC), and the
GR (glucocorticoid receptor) motif (G-ACA—TGT-C). Our results support the
suggestion that cytosine methylation is mutagenic in tetrapods producing TG dinucleotides
that create TFBS that drive evolution.
Collapse
|
23
|
Genomic regions harboring insecticide resistance-associated Cyp genes are enriched by transposable element fragments carrying putative transcription factor binding sites in two sibling Drosophila species. Gene 2013; 537:93-9. [PMID: 24361809 DOI: 10.1016/j.gene.2013.11.080] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2011] [Revised: 11/27/2013] [Accepted: 11/30/2013] [Indexed: 11/27/2022]
Abstract
In the present study, an in silico analysis was performed to identify transposable element (TE) fragments inserted in Cyps with functions associated with resistance to insecticides and developmental regulation as well as in neighboring genes in two sibling species, Drosophila melanogaster and Drosophila simulans. The Cyps associated with insecticide resistance and their neighboring non-Cyp genes have accumulated a greater number of TE fragments than the other Cyps or a random sample of genes, predominantly in the 5'-flanking regions. Most of the insertions were due to DNA transposons, with DNAREP1 fragments being the most common. These fragments carry putative binding sites for transcription factors, which reinforces the hypothesis that DNAREP1 may influence gene regulation and play a role in the adaptation of the Drosophila species.
Collapse
|
24
|
Bovine TWINKLE and mitochondrial ribosomal protein L43 genes are regulated by an evolutionary conserved bidirectional promoter. Gene 2013; 537:154-63. [PMID: 24361965 DOI: 10.1016/j.gene.2013.11.088] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2013] [Accepted: 11/30/2013] [Indexed: 11/16/2022]
Abstract
TWINKLE is a mitochondrial DNA helicase playing an important role in mitochondrial DNA replication. In human, mutations in this gene cause progressive external ophtalmoplegia and mitochondrial DNA depletion syndrome-7. TWINKLE is well conserved among multicellular eukaryotes and is believed to be a key regulator of mitochondrial DNA copy number in mammals. Despite its involvement in several diseases and its important function in mitochondrial DNA metabolism, nothing is known about the regulation of the expression of TWINKLE. We have analysed the 5'-flanking genomic region of the bovine TWINKLE gene and found it was localised adjacent to the MRPL43 gene in a head-to-head orientation, suggesting that both genes are regulated by a shared bidirectional promoter. The bovine 75-bp long intergenic region shows substantial homology across different species and contains several conserved putative transcription factor binding sites. A TATA box, however, was lacking. Using a dual fluorescent reporter system and transient transfection assays, we have analysed the bovine intergenic region between TWINKLE and MRPL43. This small genomic fragment showed a bidirectional promoter activity. As the TWINKLE/MRPL43 bidirectional promoter tested was highly conserved, it is likely that the results we obtained here in cattle may be extended to the other species.
Collapse
|
25
|
Functionally distinct gene classes as bigger or smaller transcription factor traps: a possible stochastic component to sequential gene expression programs in cancer. Gene 2013; 536:398-406. [PMID: 24291030 DOI: 10.1016/j.gene.2013.11.013] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2013] [Revised: 11/04/2013] [Accepted: 11/08/2013] [Indexed: 12/15/2022]
Abstract
In cancer biology, most molecular regulatory mechanisms are casually treated as on/off switches for specific cancer hallmarks, despite the lack of compelling evidence that cancer hallmarks can be exclusively attributed to specific regulatory proteins. To consider a novel paradigm for the basis of regulating a set of effector genes for proliferation, versus apoptosis-effector genes, we used a bioinformatics approach to ascertain differences between the transcription factor binding site occurrences in the two sets of genes. Results indicated that there are more binding sites per gene, for transcription factors that regulate both proliferation and apoptosis, among the proliferation-effector genes than among the apoptosis-effector genes. Proliferation-effector genes also had more open chromatin regions. We also applied this paradigm to the question of why p53 and interferon regulatory factor-1 (IRF-1) first activate cell cycle arrest genes followed by apoptosis genes, with results indicating the cycle arrest genes are bigger p53 and IRF-1 traps. These data support the idea that, as a set of transcription factors becomes active, there is a stochastic component leading to the accumulation of these transcription factors on genes that effect an initial phenotype before their accumulation on genes that effect a subsequent phenotype.
Collapse
|
26
|
Understanding the transcriptional regulation of cervix cancer using microarray gene expression data and promoter sequence analysis of a curated gene set. Gene 2013; 535:233-8. [PMID: 24291025 DOI: 10.1016/j.gene.2013.11.028] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2013] [Revised: 11/12/2013] [Accepted: 11/15/2013] [Indexed: 02/08/2023]
Abstract
Cervical cancer, the malignant neoplasm of the cervix uteri is the second most common cancer among women worldwide and the top-most cancer in India. Several factors are responsible for causing cervical cancer, which alter the expression of oncogenic genes resulting in up or down-regulation of gene expression and inactivation of tumor-suppressor genes/gene products. Gene expression is regulated by interactions between transcription factors (TFs) and specific regulatory elements in the promoter regions of target genes. Thus, it is important to decipher and analyze TFs that bind to regulatory regions of diseased genes and regulate their expression. In the present study, computational methods involving the combination of gene expression data from microarray experiments and promoter sequence analysis of a curated gene set involved in the cervical cancer causation have been utilized for identifying potential regulatory elements. Consensus predictions of two approaches led to the identification of twelve TFs that might be crucial to the regulation of cervical cancer progression. Subsequently, TF enrichment and oncomine expression analysis suggested that the transcription factor family E2F played an important role for the regulation of genes involve in cervical carcinogenesis. Our results suggest that E2F possesses diagnostic/prognostic value and can act as a potential drug target in cervical cancer.
Collapse
|
27
|
Role of antisense RNAs in evolution of yeast regulatory complexity. Genomics 2013; 102:484-90. [PMID: 24200499 DOI: 10.1016/j.ygeno.2013.10.008] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2013] [Revised: 10/01/2013] [Accepted: 10/26/2013] [Indexed: 01/01/2023]
Abstract
Antisense RNAs (asRNAs) are known to regulate gene expression. However, a genome-wide mechanism of asRNA regulation is unclear, and there is no good explanation why partial asRNAs are not functional. To explore its regulatory role, we investigated asRNAs using an evolutionary approach, as genome-wide experimental data are limited. We found that the percentage of genes coupling with asRNAs in Saccharomyces cerevisiae is negatively associated with regulatory complexity and evolutionary age. Nevertheless, asRNAs evolve more slowly when their sense genes are under more complex regulation. Older genes coupling with asRNAs are more likely to demonstrate inverse expression, reflecting the role of these asRNAs as repressors. Our analyses provide novel evidence, suggesting a minor contribution of asRNAs in developing regulatory complexity. Although our results support the leaky hypothesis for asRNA transcription, our evidence also suggests that partial asRNAs may have evolved as repressors. Our study deepens the understanding of asRNA regulatory evolution.
Collapse
|
28
|
DAX1 regulatory networks unveil conserved and potentially new functions. Gene 2013; 530:66-74. [PMID: 23954228 DOI: 10.1016/j.gene.2013.07.052] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2013] [Accepted: 07/09/2013] [Indexed: 11/22/2022]
Abstract
DAX1 is an orphan nuclear receptor with actions in mammalian sex determination, regulation of steroidogenesis, embryonic development and neural differentiation. Conserved patterns of DAX1 gene expression from mammals to fish have been taken to suggest conserved function. In the present study, the European sea bass, Dicentrarchus labrax, DAX1 promoter was isolated and its conserved features compared to other fish and mammalian DAX1 promoters in order to derive common regulators and functional gene networks. Fish and mammalian DAX1 promoters share common sets of transcription factor frameworks which were also present in the promoter region of another 127 genes. Pathway analysis clustered these into candidate gene networks associated with the fish and mammalian DAX1. The networks identified are concordant with described functions for DAX1 in embryogenesis, regulation of transcription, endocrine development and steroid production. Novel candidate gene network partners were also identified, which implicate DAX1 in ion homeostasis and transport, lipid transport and skeletal development. Experimental evidence is provided supporting roles for DAX1 in steroid signalling and osmoregulation in fish. These results highlight the usefulness of the in silico comparative approach to analyse gene regulation for hypothesis generation. Conserved promoter architecture can be used also to predict potentially new gene functions. The approach reported can be applied to genes from model and non-model species.
Collapse
|
29
|
Novel polymorphisms in UTR and coding region of inducible heat shock protein 70.1 gene in tropically adapted Indian zebu cattle (Bos indicus) and riverine buffalo (Bubalus bubalis). Gene 2013; 527:606-15. [PMID: 23792016 DOI: 10.1016/j.gene.2013.05.078] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2012] [Revised: 05/23/2013] [Accepted: 05/29/2013] [Indexed: 11/18/2022]
Abstract
Due to evolutionary divergence, cattle (taurine, and indicine) and buffalo are speculated to have different responses to heat stress condition. Variation in candidate genes associated with a heat-shock response may provide an insight into the dissimilarity and suggest targets for intervention. The present work was undertaken to characterize one of the inducible heat shock protein genes promoter and coding regions in diverse breeds of Indian zebu cattle and buffaloes. The genomic DNA from a panel of 117 unrelated animals representing 14 diversified native cattle breeds and 6 buffalo breeds were utilized to determine the complete sequence and gene diversity of HSP70.1 gene. The coding region of HSP70.1 gene in Indian zebu cattle, Bos taurus and buffalo was similar in length (1,926 bp) encoding a HSP70 protein of 641 amino acids with a calculated molecular weight (Mw) of 70.26 kDa. However buffalo had a longer 5' and 3' untranslated region (UTR) of 204 and 293 nucleotides respectively, in comparison to Indian zebu cattle and Bos taurus wherein length of 5' and 3'-UTR was 172 and 286 nucleotides, respectively. The increased length of buffalo HSP70.1 gene compared to indicine and taurine gene was due to two insertions each in 5' and 3'-UTR. Comparative sequence analysis of cattle (taurine and indicine) and buffalo HSP70.1 gene revealed a total of 54 gene variations (50 SNPs and 4 INDELs) among the three species in the HSP70.1 gene. The minor allele frequencies of these nucleotide variations varied from 0.03 to 0.5 with an average of 0.26. Among the 14 B. indicus cattle breeds studied, a total of 19 polymorphic sites were identified: 4 in the 5'-UTR and 15 in the coding region (of these 2 were non-synonymous). Analysis among buffalo breeds revealed 15 SNPs throughout the gene: 6 at the 5' flanking region and 9 in the coding region. In bubaline 5'-UTR, 2 additional putative transcription factor binding sites (Elk-1 and C-Re1) were identified, other than three common sites (CP2, HSE and Pax-4) observed across all the analyzed animals. No polymorphism was found within the 3'-UTR of Indian cattle or buffalo as it was found to be monomorphic. The promoter sequences generated in 117 individuals showed a rich array of sequence elements known to be involved in transcription regulation. A total of 11 nucleotide changes were observed in the promoter sequence across the analyzed species, 3 of these changes were located within the potential transcription factor binding domains. We also identified 4 microsatellite markers within the buffalo HSP70.1 gene and 3 microsatellites within bovine HSP70.1. The present study identified several distinct changes across indicine, taurine and bubaline HSP70.1 genes that could further be evaluated as molecular markers for thermotolerance.
Collapse
|
30
|
Association between Leptin gene polymorphisms and plasma leptin level in three consanguineous families with obesity. Gene 2013; 527:75-81. [PMID: 23751306 DOI: 10.1016/j.gene.2013.05.064] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2013] [Accepted: 05/27/2013] [Indexed: 12/16/2022]
Abstract
INTRODUCTION Leptin (LEP) gene is one of the most promising candidate genes for obesity. Previous studies have tested the association of polymorphisms in LEP gene with obesity and obesity-related metabolic biomarkers (anthropometric variables, glucose, insulin level, leptin level and lipid profile). However, the results of these studies were still controversial. To determine whether LEP gene is associated with obesity in Tunisian population, we performed a family-based association study between LEP polymorphisms and obesity and obesity-related metabolic biomarkers. METHODS Seven single nucleotide polymorphisms (SNPs) in 5' region of LEP gene were genotyped in three consanguineous families including 33 individuals. The previously reported LEP SNPs (H1328084, H1328082, rs10487506, H1328081, H1328080, G-2548A and A19G) were evaluated by PCR-RFLP and direct sequencing methods. Single SNP association and haplotype association analyses were performed using the family-based association test (FBAT). To determine allele frequencies of these SNPs in general population, 52 unrelated individuals from the general Tunisian population were also analyzed. RESULTS Two SNPs showed significant associations with plasma leptin level (H1328084: A>G, Z=2.058, p=0.039; A19G: G>A, Z=2.058, p=0.039). When haplotypes were constructed with these two-markers, the risk AA haplotype (frequency 57.1%) was positively associated with plasma leptin level (Z=2.058, p=0.039). Moreover, SNPs H1328084 and A19G are predicted to modify transcription-factor binding sites. CONCLUSIONS Our study provided that two functional variants in 5' regulatory region of LEP gene are associated with plasma leptin level as a quantitative trait. It suggested that H1328084 and A19G have an important role in regulating plasma leptin level.
Collapse
|
31
|
Different Functional Gene Clusters in Yeast have Different Spatial Distributions of the Transcription Factor Binding Sites. Bioinform Biol Insights 2011; 5:1-11. [PMID: 21423404 PMCID: PMC3045049 DOI: 10.4137/bbi.s6362] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Transcription factors control gene expression by binding to short specific DNA sequences, called transcription factor binding sites (TFBSs), in the promoter of a gene. Thus, studying the spatial distribution of TFBSs in the promoters may provide insights into the molecular mechanisms of gene regulation. I developed a method to construct the spatial distribution of TFBSs for any set of genes of interest. I found that different functional gene clusters have different spatial distributions of TFBSs, indicating that gene regulation mechanisms may be very different among different functional gene clusters. I also found that the binding sites for different transcription factors (TFs) may have different spatial distributions: a sharp peak, a plateau or no dominant single peak. The spatial distributions of binding sites for many TFs derived from my analyses are valuable prior information for TFBS prediction algorithm because different regions of a promoter can assign different possibilities for TFBS occurrence.
Collapse
|