1
|
Milito A, Aschern M, McQuillan JL, Yang JS. Challenges and advances towards the rational design of microalgal synthetic promoters in Chlamydomonas reinhardtii. JOURNAL OF EXPERIMENTAL BOTANY 2023; 74:3833-3850. [PMID: 37025006 DOI: 10.1093/jxb/erad100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Accepted: 03/24/2023] [Indexed: 06/19/2023]
Abstract
Microalgae hold enormous potential to provide a safe and sustainable source of high-value compounds, acting as carbon-fixing biofactories that could help to mitigate rapidly progressing climate change. Bioengineering microalgal strains will be key to optimizing and modifying their metabolic outputs, and to render them competitive with established industrial biotechnology hosts, such as bacteria or yeast. To achieve this, precise and tuneable control over transgene expression will be essential, which would require the development and rational design of synthetic promoters as a key strategy. Among green microalgae, Chlamydomonas reinhardtii represents the reference species for bioengineering and synthetic biology; however, the repertoire of functional synthetic promoters for this species, and for microalgae generally, is limited in comparison to other commercial chassis, emphasizing the need to expand the current microalgal gene expression toolbox. Here, we discuss state-of-the-art promoter analyses, and highlight areas of research required to advance synthetic promoter development in C. reinhardtii. In particular, we exemplify high-throughput studies performed in other model systems that could be applicable to microalgae, and propose novel approaches to interrogating algal promoters. We lastly outline the major limitations hindering microalgal promoter development, while providing novel suggestions and perspectives for how to overcome them.
Collapse
Affiliation(s)
- Alfonsina Milito
- Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, Campus UAB, Bellaterra, Barcelona, Spain
| | - Moritz Aschern
- Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, Campus UAB, Bellaterra, Barcelona, Spain
| | - Josie L McQuillan
- Department of Chemical and Biological Engineering, University of Sheffield, Mappin Street, Sheffield, S1 3JD, UK
| | - Jae-Seong Yang
- Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, Campus UAB, Bellaterra, Barcelona, Spain
| |
Collapse
|
2
|
Wang Y, Tai S, Zhang S, Sheng N, Xie X. PromGER: Promoter Prediction Based on Graph Embedding and Ensemble Learning for Eukaryotic Sequence. Genes (Basel) 2023; 14:1441. [PMID: 37510345 PMCID: PMC10379012 DOI: 10.3390/genes14071441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Revised: 07/04/2023] [Accepted: 07/10/2023] [Indexed: 07/30/2023] Open
Abstract
Promoters are DNA non-coding regions around the transcription start site and are responsible for regulating the gene transcription process. Due to their key role in gene function and transcriptional activity, the prediction of promoter sequences and their core elements accurately is a crucial research area in bioinformatics. At present, models based on machine learning and deep learning have been developed for promoter prediction. However, these models cannot mine the deeper biological information of promoter sequences and consider the complex relationship among promoter sequences. In this work, we propose a novel prediction model called PromGER to predict eukaryotic promoter sequences. For a promoter sequence, firstly, PromGER utilizes four types of feature-encoding methods to extract local information within promoter sequences. Secondly, according to the potential relationships among promoter sequences, the whole promoter sequences are constructed as a graph. Furthermore, three different scales of graph-embedding methods are applied for obtaining the global feature information more comprehensively in the graph. Finally, combining local features with global features of sequences, PromGER analyzes and predicts promoter sequences through a tree-based ensemble-learning framework. Compared with seven existing methods, PromGER improved the average specificity of 13%, accuracy of 10%, Matthew's correlation coefficient of 16%, precision of 4%, F1 score of 6%, and AUC of 9%. Specifically, this study interpreted the PromGER by the t-distributed stochastic neighbor embedding (t-SNE) method and SHAPley Additive exPlanations (SHAP) value analysis, which demonstrates the interpretability of the model.
Collapse
Affiliation(s)
- Yan Wang
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, China
- School of Artificial Intelligence, Jilin University, Changchun 130012, China
| | - Shiwen Tai
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, China
| | - Shuangquan Zhang
- School of Cyber Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
| | - Nan Sheng
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, China
| | - Xuping Xie
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, China
| |
Collapse
|
3
|
Shujaat M, Kim H, Tayara H, Chong KT. iProm-Sigma54: A CNN Base Prediction Tool for σ54 Promoters. Cells 2023; 12:cells12060829. [PMID: 36980170 PMCID: PMC10047130 DOI: 10.3390/cells12060829] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 02/23/2023] [Accepted: 02/23/2023] [Indexed: 03/11/2023] Open
Abstract
The sigma (σ) factor of RNA holoenzymes is essential for identifying and binding to promoter regions during gene transcription in prokaryotes. σ54 promoters carried out various ancillary methods and environmentally responsive procedures; therefore, it is crucial to accurately identify σ54 promoter sequences to comprehend the underlying process of gene regulation. Herein, we come up with a convolutional neural network (CNN) based prediction tool named “iProm-Sigma54” for the prediction of σ54 promoters. The CNN consists of two one-dimensional convolutional layers, which are followed by max pooling layers and dropout layers. A one-hot encoding scheme was used to extract the input matrix. To determine the prediction performance of iProm-Sigma54, we employed four assessment metrics and five-fold cross-validation; performance was measured using a benchmark and test dataset. According to the findings of this comparison, iProm-Sigma54 outperformed existing methodologies for identifying σ54 promoters. Additionally, a publicly accessible web server was constructed.
Collapse
Affiliation(s)
- Muhammad Shujaat
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea
| | - Hoonjoo Kim
- School of Pharmacy, Jeonbuk National University, Jeonju 54896, Republic of Korea
- Correspondence: (H.K.); (K.T.C.)
| | - Hilal Tayara
- School of International Engineering and Science, Jeonbuk National University, Jeonju 54896, Republic of Korea
| | - Kil To Chong
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea
- Advanced Electronics and Information Research Center, Jeonbuk National University, Jeonju 54896, Republic of Korea
- Correspondence: (H.K.); (K.T.C.)
| |
Collapse
|
4
|
Nguyen-Vo TH, Trinh QH, Nguyen L, Nguyen-Hoang PU, Rahardja S, Nguyen BP. iPromoter-Seqvec: identifying promoters using bidirectional long short-term memory and sequence-embedded features. BMC Genomics 2022; 23:681. [PMID: 36192696 PMCID: PMC9531353 DOI: 10.1186/s12864-022-08829-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 08/08/2022] [Indexed: 11/30/2022] Open
Abstract
Background Promoters, non-coding DNA sequences located at upstream regions of the transcription start site of genes/gene clusters, are essential regulatory elements for the initiation and regulation of transcriptional processes. Furthermore, identifying promoters in DNA sequences and genomes significantly contributes to discovering entire structures of genes of interest. Therefore, exploration of promoter regions is one of the most imperative topics in molecular genetics and biology. Besides experimental techniques, computational methods have been developed to predict promoters. In this study, we propose iPromoter-Seqvec – an efficient computational model to predict TATA and non-TATA promoters in human and mouse genomes using bidirectional long short-term memory neural networks in combination with sequence-embedded features extracted from input sequences. The promoter and non-promoter sequences were retrieved from the Eukaryotic Promoter database and then were refined to create four benchmark datasets. Results The area under the receiver operating characteristic curve (AUCROC) and the area under the precision-recall curve (AUCPR) were used as two key metrics to evaluate model performance. Results on independent test sets showed that iPromoter-Seqvec outperformed other state-of-the-art methods with AUCROC values ranging from 0.85 to 0.99 and AUCPR values ranging from 0.86 to 0.99. Models predicting TATA promoters in both species had slightly higher predictive power compared to those predicting non-TATA promoters. With a novel idea of constructing artificial non-promoter sequences based on promoter sequences, our models were able to learn highly specific characteristics discriminating promoters from non-promoters to improve predictive efficiency. Conclusions iPromoter-Seqvec is a stable and robust model for predicting both TATA and non-TATA promoters in human and mouse genomes. Our proposed method was also deployed as an online web server with a user-friendly interface to support research communities. Links to our source codes and web server are available at https://github.com/mldlproject/2022-iPromoter-Seqvec. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08829-6.
Collapse
Affiliation(s)
- Thanh-Hoang Nguyen-Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Gate 7, Kelburn Parade, 6140, Wellington, New Zealand
| | - Quang H Trinh
- School of Information and Communication Technology, Hanoi University of Science and Technology, 1 Dai Co Viet, 100000, Hanoi, Vietnam
| | - Loc Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Gate 7, Kelburn Parade, 6140, Wellington, New Zealand
| | - Phuong-Uyen Nguyen-Hoang
- Computational Biology Center, International University - VNU HCMC, Quarter 6, Linh Trung Ward, Thu Duc District, 700000, Ho Chi Minh City, Vietnam
| | - Susanto Rahardja
- School of Marine Science and Technology, Northwestern Polytechnical University, 127 West Youyi Road, 710072, Xi'an, China. .,Infocomm Technology Cluster, Singapore Institute of Technology, 10 Dover Drive, 138683, Singapore, Singapore.
| | - Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Gate 7, Kelburn Parade, 6140, Wellington, New Zealand.
| |
Collapse
|
5
|
Transcriptome-Wide Analysis Revealed the Potential of the High-Affinity Potassium Transporter (HKT) Gene Family in Rice Salinity Tolerance via Ion Homeostasis. Bioengineering (Basel) 2022; 9:bioengineering9090410. [PMID: 36134956 PMCID: PMC9495969 DOI: 10.3390/bioengineering9090410] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 08/15/2022] [Indexed: 11/16/2022] Open
Abstract
The high-affinity potassium transporter (HKT) genes are key ions transporters, regulating the plant response to salt stress via sodium (Na+) and potassium (K+) homeostasis. The main goal of this research was to find and understand the HKT genes in rice and their potential biological activities in response to brassinosteroids (BRs), jasmonic acid (JA), seawater, and NaCl stress. The in silico analyses of seven OsHKT genes involved their evolutionary tree, gene structures, conserved motifs, and chemical properties, highlighting the key aspects of OsHKT genes. The Gene Ontology (GO) analysis of HKT genes revealed their roles in growth and stress responses. Promoter analysis showed that the majority of the HKT genes participate in abiotic stress responses. Tissue-specific expression analysis showed higher transcriptional activity of OsHKT genes in roots and leaves. Under NaCl, BR, and JA application, OsHKT1 was expressed differentially in roots and shoots. Similarly, the induced expression pattern of OsHKT1 was recorded in the seawater resistant (SWR) cultivar. Additionally, the Na+ to K+ ratio under different concentrations of NaCl stress has been evaluated. Our data highlighted the important role of the OsHKT gene family in regulating the JA and BR mediated rice salinity tolerance and could be useful for rice future breeding programs.
Collapse
|
6
|
Zhang M, Jia C, Li F, Li C, Zhu Y, Akutsu T, Webb GI, Zou Q, Coin LJM, Song J. Critical assessment of computational tools for prokaryotic and eukaryotic promoter prediction. Brief Bioinform 2022; 23:6502561. [PMID: 35021193 PMCID: PMC8921625 DOI: 10.1093/bib/bbab551] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Revised: 11/12/2021] [Accepted: 11/30/2021] [Indexed: 01/13/2023] Open
Abstract
Promoters are crucial regulatory DNA regions for gene transcriptional activation. Rapid advances in next-generation sequencing technologies have accelerated the accumulation of genome sequences, providing increased training data to inform computational approaches for both prokaryotic and eukaryotic promoter prediction. However, it remains a significant challenge to accurately identify species-specific promoter sequences using computational approaches. To advance computational support for promoter prediction, in this study, we curated 58 comprehensive, up-to-date, benchmark datasets for 7 different species (i.e. Escherichia coli, Bacillus subtilis, Homo sapiens, Mus musculus, Arabidopsis thaliana, Zea mays and Drosophila melanogaster) to assist the research community to assess the relative functionality of alternative approaches and support future research on both prokaryotic and eukaryotic promoters. We revisited 106 predictors published since 2000 for promoter identification (40 for prokaryotic promoter, 61 for eukaryotic promoter, and 5 for both). We systematically evaluated their training datasets, computational methodologies, calculated features, performance and software usability. On the basis of these benchmark datasets, we benchmarked 19 predictors with functioning webservers/local tools and assessed their prediction performance. We found that deep learning and traditional machine learning-based approaches generally outperformed scoring function-based approaches. Taken together, the curated benchmark dataset repository and the benchmarking analysis in this study serve to inform the design and implementation of computational approaches for promoter prediction and facilitate more rigorous comparison of new techniques in the future.
Collapse
Affiliation(s)
| | - Cangzhi Jia
- Corresponding authors: Jiangning Song, Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia. E-mail: ; Lachlan J.M. Coin, Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria 3000, Australia. E-mail: ; Quan Zou, Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China. E-mail: ; Cangzhi Jia, School of Science, Dalian Maritime University, Dalian 116026, China. E-mail:
| | | | | | | | | | - Geoffrey I Webb
- Department of Data Science and Artificial Intelligence, Monash University, Melbourne, VIC 3800, Australia,Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Quan Zou
- Corresponding authors: Jiangning Song, Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia. E-mail: ; Lachlan J.M. Coin, Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria 3000, Australia. E-mail: ; Quan Zou, Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China. E-mail: ; Cangzhi Jia, School of Science, Dalian Maritime University, Dalian 116026, China. E-mail:
| | - Lachlan J M Coin
- Corresponding authors: Jiangning Song, Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia. E-mail: ; Lachlan J.M. Coin, Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria 3000, Australia. E-mail: ; Quan Zou, Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China. E-mail: ; Cangzhi Jia, School of Science, Dalian Maritime University, Dalian 116026, China. E-mail:
| | - Jiangning Song
- Corresponding authors: Jiangning Song, Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia. E-mail: ; Lachlan J.M. Coin, Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria 3000, Australia. E-mail: ; Quan Zou, Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China. E-mail: ; Cangzhi Jia, School of Science, Dalian Maritime University, Dalian 116026, China. E-mail:
| |
Collapse
|
7
|
Identification and in silico characterization of structural and functional impacts of genetic variants in milk protein genes in the Zebu breeds Guzerat and Gyr. Trop Anim Health Prod 2021; 53:524. [PMID: 34705124 DOI: 10.1007/s11250-021-02970-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Accepted: 10/14/2021] [Indexed: 10/20/2022]
Abstract
Whole genome sequencing of bovine breeds has allowed identification of genetic variants in milk protein genes. However, functional repercussion of such variants at a molecular level has seldom been investigated. Here, the results of a multistep Bioinformatic analysis for functional characterization of recently identified genetic variants in Brazilian Gyr and Guzerat breeds is described, including predicted effects on the following: (i) evolutionary conserved nucleotide positions/regions; (ii) protein function, stability, and interactions; (iii) splicing, branching, and miRNA binding sites; (iv) promoters and transcription factor binding sites; and (v) collocation with QTL. Seventy-one genetic variants were identified in the caseins (CSN1S1, CSN2, CSN1S2, and CSN3), LALBA, LGB, and LTF genes. Eleven potentially regulatory variants and two missense mutations were identified. LALBA Ile60Val was predicted to affect protein stability and flexibility, by reducing the number the disulfide bonds established. LTF Thr546Asn is predicted to generate steric clashes, which could mildly affect iron coordination. In addition, LALBA Ile60Val and LTF Thr546Asn affect exonic splicing enhancers and silencers. Consequently, both mutations have the potential of affecting immune response at individual level, not only in the mammary gland. Although laborious, this multistep procedure for classifying variants allowed the identification of potentially functional variants for milk protein genes.
Collapse
|
8
|
Age-related demethylation of the TDP-43 autoregulatory region in the human motor cortex. Commun Biol 2021; 4:1107. [PMID: 34548609 PMCID: PMC8455575 DOI: 10.1038/s42003-021-02621-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Accepted: 09/01/2021] [Indexed: 11/21/2022] Open
Abstract
In amyotrophic lateral sclerosis (ALS), TAR DNA-binding protein 43 (TDP-43), which is encoded by TARDBP, forms aggregates in the motor cortex. This aggregate formation may be triggered by an increase in the TDP-43 level with aging. However, the amount of TDP-43 is autoregulated by alternative splicing of the TARDBP 3′UTR, and how this autoregulation is affected by aging remains to be elucidated. We found that DNA demethylation in the autoregulatory region in the TARDBP 3′UTR reduced alternative splicing and increased TARDBP mRNA expression. Furthermore, in the human motor cortex, we found that this region was demethylated with aging, resulting in increased expression of TARDBP mRNA. The acceleration of DNA demethylation in the motor cortex was associated with the age of ALS onset. In summary, the dysregulation of TDP-43 autoregulation by age-related DNA demethylation in the motor cortex may explain the contribution of aging and motor system selectivity in ALS. In order to assess the effects of aging on the autoregulation of TAR DNA-binding protein 43 (TDP-43) and the potential effects of this on the role of TDP-43 in Amyotrophic Lateral Sclerosis (ALS), Koike et al examined post-mortem motor cortex tissue from ALS patients. They found that DNA demethylation in the autoregulatory region of the TARDBP 3′UTR, which encodes TDP-43, increased with age and was associated with the onset age of ALS and thus could be indicative of a role for dysregulation of TDP-43 autoregulation in ALS pathology.
Collapse
|
9
|
Role of Bioinformatics in Biological Sciences. Adv Bioinformatics 2021. [DOI: 10.1007/978-981-33-6191-1_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
|
10
|
Hitti-Malin RJ, Burmeister LM, Ricketts SL, Lewis TW, Pettitt L, Boursnell M, Schofield EC, Sargan D, Mellersh CS. A LINE-1 insertion situated in the promoter of IMPG2 is associated with autosomal recessive progressive retinal atrophy in Lhasa Apso dogs. BMC Genet 2020; 21:100. [PMID: 32894063 PMCID: PMC7487703 DOI: 10.1186/s12863-020-00911-w] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2019] [Accepted: 08/30/2020] [Indexed: 12/30/2022] Open
Abstract
Background Canine progressive retinal atrophies are a group of hereditary retinal degenerations in dogs characterised by depletion of photoreceptor cells in the retina, which ultimately leads to blindness. PRA in the Lhasa Apso (LA) dog has not previously been clinically characterised or described in the literature, but owners in the UK are advised to have their dog examined through the British Veterinary Association/ Kennel Club/ International Sheep Dog Society (BVA/KC/ISDS) eye scheme annually, and similar schemes that are in operation in other countries. After the exclusion of 25 previously reported canine retinal mutations in LA PRA-affected dogs, we sought to identify the genetic cause of PRA in this breed. Results Analysis of whole-exome sequencing data of three PRA-affected LA and three LA without signs of PRA did not identify any exonic or splice site variants, suggesting the causal variant was non-exonic. We subsequently undertook a genome-wide association study (GWAS), which identified a 1.3 Mb disease-associated region on canine chromosome 33, followed by whole-genome sequencing analysis that revealed a long interspersed element-1 (LINE-1) insertion upstream of the IMPG2 gene. IMPG2 has previously been implicated in human retinal disease; however, until now no canine PRAs have been associated with this gene. The identification of this PRA-associated variant has enabled the development of a DNA test for this form of PRA in the breed, here termed PRA4 to distinguish it from other forms of PRA described in other breeds. This test has been used to determine the genotypes of over 900 LA dogs. A large cohort of genotyped dogs was used to estimate the allele frequency as between 0.07–0.1 in the UK LA population. Conclusions Through the use of GWAS and subsequent sequencing of a PRA case, we have identified a LINE-1 insertion in the retinal candidate gene IMPG2 that is associated with a form of PRA in the LA dog. Validation of this variant in 447 dogs of 123 breeds determined it was private to LA dogs. We envisage that, over time, the developed DNA test will offer breeders the opportunity to avoid producing dogs affected with this form of PRA.
Collapse
Affiliation(s)
- Rebekkah J Hitti-Malin
- Kennel Club Genetics Centre, Animal Health Trust, Lanwades Park, Newmarket, Suffolk, CB8 7UU, UK. .,Department of Veterinary Medicine, University of Cambridge, Cambridge, CB3 0ES, UK.
| | - Louise M Burmeister
- Kennel Club Genetics Centre, Animal Health Trust, Lanwades Park, Newmarket, Suffolk, CB8 7UU, UK
| | - Sally L Ricketts
- Kennel Club Genetics Centre, Animal Health Trust, Lanwades Park, Newmarket, Suffolk, CB8 7UU, UK
| | - Thomas W Lewis
- The Kennel Club, London, W1J 8AB, UK.,School of Veterinary Medicine and Science, The University of Nottingham, Sutton Bonington, Leicestershire, LE12 5RD, UK
| | - Louise Pettitt
- Kennel Club Genetics Centre, Animal Health Trust, Lanwades Park, Newmarket, Suffolk, CB8 7UU, UK
| | - Mike Boursnell
- Kennel Club Genetics Centre, Animal Health Trust, Lanwades Park, Newmarket, Suffolk, CB8 7UU, UK
| | - Ellen C Schofield
- Kennel Club Genetics Centre, Animal Health Trust, Lanwades Park, Newmarket, Suffolk, CB8 7UU, UK
| | - David Sargan
- Department of Veterinary Medicine, University of Cambridge, Cambridge, CB3 0ES, UK
| | - Cathryn S Mellersh
- Kennel Club Genetics Centre, Animal Health Trust, Lanwades Park, Newmarket, Suffolk, CB8 7UU, UK
| |
Collapse
|
11
|
Jex AR, Svärd S, Hagen KD, Starcevich H, Emery-Corbin SJ, Balan B, Nosala C, Dawson SC. Recent advances in functional research in Giardia intestinalis. ADVANCES IN PARASITOLOGY 2020; 107:97-137. [PMID: 32122532 PMCID: PMC7878119 DOI: 10.1016/bs.apar.2019.12.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
This review considers current advances in tools to investigate the functional biology of Giardia, it's coding and non-coding genes, features and cellular and molecular biology. We consider major gaps in current knowledge of the parasite and discuss the present state-of-the-art in its in vivo and in vitro cultivation. Advances in in silico tools, including for the modelling non-coding RNAs and genomic elements, as well as detailed exploration of coding genes through inferred homology to model organisms, have provided significant, primary level insight. Improved methods to model the three-dimensional structure of proteins offer new insights into their function, and binding interactions with ligands, other proteins or precursor drugs, and offer substantial opportunities to prioritise proteins for further study and experimentation. These approaches can be supplemented by the growing and highly accessible arsenal of systems-based methods now being applied to Giardia, led by genomic, transcriptomic and proteomic methods, but rapidly incorporating advanced tools for detection of real-time transcription, evaluation of chromatin states and direct measurement of macromolecular complexes. Methods to directly interrogate and perturb gene function have made major leaps in recent years, with CRISPr-interference now available. These approaches, coupled with protein over-expression, fluorescent labelling and in vitro and in vivo imaging, are set to revolutionize the field and herald an exciting time during which the field may finally realise Giardia's long proposed potential as a model parasite and eukaryote.
Collapse
Affiliation(s)
- Aaron R Jex
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Melbourne, VIC, Australia; Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Parkville, VIC, Australia.
| | - Staffan Svärd
- Centre for Biomedicine, Uppsala University, Uppsala, Sweden
| | - Kari D Hagen
- College of Biological Sciences, University of California-Davis, Davis, CA, United States
| | - Hannah Starcevich
- College of Biological Sciences, University of California-Davis, Davis, CA, United States
| | - Samantha J Emery-Corbin
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Melbourne, VIC, Australia
| | - Balu Balan
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Melbourne, VIC, Australia
| | - Chris Nosala
- College of Biological Sciences, University of California-Davis, Davis, CA, United States
| | - Scott C Dawson
- College of Biological Sciences, University of California-Davis, Davis, CA, United States
| |
Collapse
|
12
|
Chow HM, Shi M, Cheng A, Gao Y, Chen G, Song X, So RWL, Zhang J, Herrup K. Age-related hyperinsulinemia leads to insulin resistance in neurons and cell-cycle-induced senescence. Nat Neurosci 2019; 22:1806-1819. [DOI: 10.1038/s41593-019-0505-1] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2019] [Accepted: 08/21/2019] [Indexed: 01/17/2023]
|
13
|
Li H, Xu G, Yang C, Yang L, Liang Z. Genome-wide identification and expression analysis of HKT transcription factor under salt stress in nine plant species. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2019; 171:435-442. [PMID: 30639869 DOI: 10.1016/j.ecoenv.2019.01.008] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Revised: 12/14/2018] [Accepted: 01/04/2019] [Indexed: 05/11/2023]
Abstract
High-affinity K+ (HKT) gene family is regulated the transport of Na+ and maintain the balance between Na+ and K+ in the process of plant growth, development and response to abiotic stress. Despite this fact, systemic and comprehensive studies on HKT in multiply plants remains unknown. A total of 29 HKT genes distributed on nine species were identified. Phylogenetic tree analysis results indicated that HKT genes were divided into five homology subfamilies. Combining structural analysis with protein contains five highly conservative motifs, HKT family has similar gene structures and special gene characteristics. Finally, the expression patterns of HKT showed two different dramatic changes in different organs and tissues under different salt stress in multiply plants. This study has many implications for research into the comparative genomics analysis of HTK gene family, which revealed regulation mechanism of HKT genes, is valuable for understanding development and response to abiotic stress in plant.
Collapse
Affiliation(s)
- Huayang Li
- Agricultural Big-Data Research Center and College of Plant Protection, Shandong Agricultural University, Taian 271018, China
| | - Guangzhao Xu
- Beijing Key Laboratory of Grape Science and Enology and Key Laboratory of Plant Resource, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Chao Yang
- Agricultural Big-Data Research Center and College of Plant Protection, Shandong Agricultural University, Taian 271018, China
| | - Long Yang
- Agricultural Big-Data Research Center and College of Plant Protection, Shandong Agricultural University, Taian 271018, China.
| | - Zhenchang Liang
- Beijing Key Laboratory of Grape Science and Enology and Key Laboratory of Plant Resource, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China.
| |
Collapse
|
14
|
Oubounyt M, Louadi Z, Tayara H, Chong KT. DeePromoter: Robust Promoter Predictor Using Deep Learning. Front Genet 2019; 10:286. [PMID: 31024615 PMCID: PMC6460014 DOI: 10.3389/fgene.2019.00286] [Citation(s) in RCA: 61] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2019] [Accepted: 03/15/2019] [Indexed: 12/11/2022] Open
Abstract
The promoter region is located near the transcription start sites and regulates transcription initiation of the gene by controlling the binding of RNA polymerase. Thus, promoter region recognition is an important area of interest in the field of bioinformatics. Numerous tools for promoter prediction were proposed. However, the reliability of these tools still needs to be improved. In this work, we propose a robust deep learning model, called DeePromoter, to analyze the characteristics of the short eukaryotic promoter sequences, and accurately recognize the human and mouse promoter sequences. DeePromoter combines a convolutional neural network (CNN) and a long short-term memory (LSTM). Additionally, instead of using non-promoter regions of the genome as a negative set, we derive a more challenging negative set from the promoter sequences. The proposed negative set reconstruction method improves the discrimination ability and significantly reduces the number of false positive predictions. Consequently, DeePromoter outperforms the previously proposed promoter prediction tools. In addition, a web-server for promoter prediction is developed based on the proposed methods and made available at https://home.jbnu.ac.kr/NSCL/deepromoter.htm.
Collapse
Affiliation(s)
- Mhaned Oubounyt
- Department of Information and Electronics Engineering, Chonbuk National University, Jeonju, South Korea
| | - Zakaria Louadi
- Department of Information and Electronics Engineering, Chonbuk National University, Jeonju, South Korea
| | - Hilal Tayara
- Department of Information and Electronics Engineering, Chonbuk National University, Jeonju, South Korea
| | - Kil To Chong
- Advanced Research Center of Information and Electronics Engineering, Chonbuk National University, Jeonju, South Korea
| |
Collapse
|
15
|
Frankl-Vilches C, Gahr M. Androgen and estrogen sensitivity of bird song: a comparative view on gene regulatory levels. J Comp Physiol A Neuroethol Sens Neural Behav Physiol 2018; 204:113-126. [PMID: 29209770 PMCID: PMC5790841 DOI: 10.1007/s00359-017-1236-y] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2017] [Revised: 11/02/2017] [Accepted: 11/09/2017] [Indexed: 12/16/2022]
Abstract
Singing of songbirds is sensitive to testosterone and its androgenic and estrogenic metabolites in a species-specific way. The hormonal effects on song pattern are likely mediated by androgen receptors (AR) and estrogen receptor alpha (ERα), ligand activated transcription factors that are expressed in neurons of various areas of the songbirds' vocal control circuit. The distribution of AR in this circuit is rather similar between species while that of ERα is species variant and concerns a key vocal control area, the HVC (proper name). We discuss the regulation of the expression of the cognate AR and ERα and putative splice variants. In particular, we suggest that transcription factor binding sites in the promoter of these receptors differ between bird species. Further, we suggest that AR- and ERα-dependent gene regulation in vocal areas differs between species due to species-specific DNA binding sites of putative target genes that are required for the transcriptional activity of the receptors. We suggest that species differences in the distribution of AR and ERα in vocal areas and in the genomic sensitivity to these receptors contribute to species-specific hormonal regulation of the song.
Collapse
Affiliation(s)
- Carolina Frankl-Vilches
- Department of Behavioural Neurobiology, Max Planck Institute for Ornithology, 82319, Seewiesen, Germany
| | - Manfred Gahr
- Department of Behavioural Neurobiology, Max Planck Institute for Ornithology, 82319, Seewiesen, Germany.
| |
Collapse
|
16
|
Shahmuradov IA, Umarov RK, Solovyev VV. TSSPlant: a new tool for prediction of plant Pol II promoters. Nucleic Acids Res 2017; 45:e65. [PMID: 28082394 PMCID: PMC5416875 DOI: 10.1093/nar/gkw1353] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2016] [Revised: 12/16/2016] [Accepted: 12/27/2016] [Indexed: 11/22/2022] Open
Abstract
Our current knowledge of eukaryotic promoters indicates their complex architecture that is often composed of numerous functional motifs. Most of known promoters include multiple and in some cases mutually exclusive transcription start sites (TSSs). Moreover, TSS selection depends on cell/tissue, development stage and environmental conditions. Such complex promoter structures make their computational identification notoriously difficult. Here, we present TSSPlant, a novel tool that predicts both TATA and TATA-less promoters in sequences of a wide spectrum of plant genomes. The tool was developed by using large promoter collections from ppdb and PlantProm DB. It utilizes eighteen significant compositional and signal features of plant promoter sequences selected in this study, that feed the artificial neural network-based model trained by the backpropagation algorithm. TSSPlant achieves significantly higher accuracy compared to the next best promoter prediction program for both TATA promoters (MCC≃0.84 and F1-score≃0.91 versus MCC≃0.51 and F1-score≃0.71) and TATA-less promoters (MCC≃0.80, F1-score≃0.89 versus MCC≃0.29 and F1-score≃0.50). TSSPlant is available to download as a standalone program at http://www.cbrc.kaust.edu.sa/download/.
Collapse
Affiliation(s)
- Ilham A. Shahmuradov
- King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia
- Institue of Molecular Biology and Biotechnologies, ANAS, 2 Matbuat strasse, Baku AZ1073, Azerbaijan
| | - Ramzan Kh. Umarov
- King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia
| | | |
Collapse
|
17
|
Jiang X, Feng L, Dai B, Li L, Lu W. Identification of key genes involved in nasopharyngeal carcinoma. Braz J Otorhinolaryngol 2016; 83:670-676. [PMID: 27765529 PMCID: PMC9449222 DOI: 10.1016/j.bjorl.2016.09.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2016] [Revised: 08/05/2016] [Accepted: 09/11/2016] [Indexed: 12/22/2022] Open
Abstract
Introduction Nasopharyngeal carcinoma is the most common cancer originating from the nasopharynx. Objective To study the mechanisms of nasopharyngeal carcinoma, we analyzed GSE12452 microarray data. Methods GSE12452 was downloaded from the Gene Expression Omnibus database and included 31 nasopharyngeal carcinoma samples and 10 normal nasopharyngeal tissue samples. The differentially expressed genes were screened by ANOVA in the PGS package. Using the BiNGO plugin in Cytoscape and pathway enrichment analysis in the PGS package, functional and pathway enrichment analyses were performed separately to predict potential functions of the differentially expressed genes. Furthermore, Transcription factor-differentially expressed gene pairs were searched, and then the transcription factor-differentially expressed gene regulatory network was visualized using Cytoscape software. Results A total of 487 genes were screened as differentially expressed genes between the nasopharyngeal carcinoma samples and the normal nasopharyngeal tissue samples. Enrichment analysis indicated that PTGS2 was involved in the regulation of biological process and small cell lung cancer. ZIC2 and OVOL1 may function in nasopharyngeal carcinoma through targeting significantly up-regulated genes (such as PTGS2, FN1, CXCL9 and CXCL10) in the Transcription factor-differentially expressed gene regulatory network (e.g., ZIC2→PTGS2 and OVOL1→CXCL10). Conclusion PTGS2, FN1, CXCL9, CXCL10, ZIC2 and OVOL1 might play roles in nasopharyngeal carcinoma.
Collapse
Affiliation(s)
- Xue Jiang
- Cangzhou Central Hospital, Department of Otorhinolaryngology, Cangzhou, Hebei, China
| | - Lichun Feng
- Cangzhou Central Hospital, Department of Otorhinolaryngology, Cangzhou, Hebei, China
| | - Baoqiang Dai
- Cangzhou Central Hospital, Department of Otorhinolaryngology, Cangzhou, Hebei, China
| | - Liping Li
- Cangzhou Central Hospital, Department of Otorhinolaryngology, Cangzhou, Hebei, China
| | - Weiwei Lu
- Cangzhou Central Hospital, Department of Otorhinolaryngology, Cangzhou, Hebei, China.
| |
Collapse
|
18
|
MicroRNA Promoter Identification in Arabidopsis Using Multiple Histone Markers. BIOMED RESEARCH INTERNATIONAL 2015; 2015:861402. [PMID: 26425556 PMCID: PMC4573627 DOI: 10.1155/2015/861402] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/11/2014] [Accepted: 03/12/2015] [Indexed: 11/18/2022]
Abstract
A microRNA is a small noncoding RNA molecule, which functions in RNA silencing and posttranscriptional regulation of gene expression. To understand the mechanism of the activation of microRNA genes, the location of promoter regions driving their expression is required to be annotated precisely. Only a fraction of microRNA genes have confirmed transcription start sites (TSSs), which hinders our understanding of the transcription factor binding events. With the development of the next generation sequencing technology, the chromatin states can be inferred precisely by virtue of a combination of specific histone modifications. Using the genome-wide profiles of nine histone markers including H3K4me2, H3K4me3, H3K9Ac, H3K9me2, H3K18Ac, H3K27me1, H3K27me3, H3K36me2, and H3K36me3, we developed a computational strategy to identify the promoter regions of most microRNA genes in Arabidopsis, based upon the assumption that the distribution of histone markers around the TSSs of microRNA genes is similar to the TSSs of protein coding genes. Among 298 miRNA genes, our model identified 42 independent miRNA TSSs and 132 miRNA TSSs, which are located in the promoters of upstream genes. The identification of promoters will provide better understanding of microRNA regulation and can play an important role in the study of diseases at genetic level.
Collapse
|
19
|
Bachmann HS, Meier W, du Bois A, Kimmig R, Kuhlmann JD, Siffert W, Sehouli J, Wollschlaeger K, Huober J, Hillemanns P, Burges A, Schmalfeldt B, Aminossadati B, Wimberger P. The FNTB promoter polymorphism rs11623866 as a potential predictive biomarker for lonafarnib treatment of ovarian cancer patients. Br J Clin Pharmacol 2015; 80:1139-48. [PMID: 26033044 DOI: 10.1111/bcp.12688] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2015] [Revised: 05/22/2015] [Accepted: 05/26/2015] [Indexed: 01/01/2023] Open
Abstract
AIM Despite promising preclinical findings regarding clinical utility of farnesyltransferase inhibitors (FTI), such as lonafarnib, success of clinical trials is limited. A multicentre AGO-OVAR-15 phase II trial reported an unfavourable effect of lonafarnib on the outcome of patients with advanced ovarian cancer. This study was performed as a genetic subgroup analysis of the AGO-OVAR-15 trial, and investigated the utility of the promoter polymorphism rs11623866 of the farnesyltransferase ß-subunit gene (FNTB) in predicting the clinical effectiveness of lonafarnib. METHODS The influence of rs11623866 (c.-609G > C) on FNTB promoter activity was investigated by electrophoretic-mobility-shift assay, luciferase-reporter assay and RT-qPCR. A total of 57 out of 105 patients from the AGO-OVAR-15 trial, treated with carboplatin and paclitaxel ± lonafarnib, was genotyped for rs11623866 by restriction fragment length polymorphism analysis. Genotype-dependent survival analysis was performed by Kaplan-Meier analysis. RESULTS The presence of the G allele was associated with increased FNTB promoter activity compared with the C allele. An unfavourable effect of lonafarnib was limited to patients carrying a GG genotype (HRPFS 6.2, 95%CI = 2.01, 19.41, P = 0.002; HROS 9.6, 95%CI = 1.89, 48.54, P = 0.006). Median progression free survival (PFS) for patients with the GG genotype in the lonafarnib treated arm was 10 months, whereas median PFS without FTI-treatment was 40 months. Median overall survival (OS) in the lonafarnib-treated group was 19 months, whereas median OS was not reached in the untreated group. CONCLUSIONS Discrepancies between preclinical success and clinical failure may be due to the patients' genetic variability of FNTB. Therefore, our results may encourage retrospective evaluation of FNTB polymorphisms in previous FTI studies, especially those reporting positive FTI response.
Collapse
Affiliation(s)
| | - Werner Meier
- Department of Gynecology and Obstetrics, Evangelisches Krankenhaus Duesseldorf, Dusseldorf
| | - Andreas du Bois
- Department of Gynecology and Gynecologic Oncology, Klinikum Essen Mitte, Essen
| | - Rainer Kimmig
- Department of Gynecology and Obstetrics, University of Duisburg-Essen, Essen
| | - Jan Dominik Kuhlmann
- Department of Gynecology and Obstetrics, University of Duisburg-Essen, Essen.,Department of Gynecology and Obstetrics Medical Faculty, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden.,German Cancer Consortium (DKTK), Dresden and German Cancer Research Center (DKFZ), Heidelberg
| | - Winfried Siffert
- Institute of Pharmacogenetics, University of Duisburg-Essen, Essen
| | - Jalid Sehouli
- Department of Gynecology, Charité Medical University of Berlin, Berlin
| | | | - Jens Huober
- Department of Gynecology and Obstetrics, University of Ulm, Ulm
| | - Peter Hillemanns
- Department of Gynecology and Obstetrics, Medical University of Hannover, Hannover
| | - Alexander Burges
- Department of Gynecology and Obstetrics, Klinikum Großhadern, Munich
| | | | - Behnaz Aminossadati
- Coordinating Centre for Clinical Trials (KKS), Philipps University of Marburg, Marburg, Germany
| | - Pauline Wimberger
- Department of Gynecology and Obstetrics, University of Duisburg-Essen, Essen.,Department of Gynecology and Obstetrics Medical Faculty, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden.,German Cancer Consortium (DKTK), Dresden and German Cancer Research Center (DKFZ), Heidelberg
| |
Collapse
|
20
|
Meng X, Yang F, Ouyang T, Liu B, Wu C, Jiang W. Specific gene expression in mouse cortical astrocytes is mediated by a 1740bp-GFAP promoter-driven combined adeno-associated virus 2/5/7/8/9. Neurosci Lett 2015; 593:45-50. [PMID: 25778419 DOI: 10.1016/j.neulet.2015.03.022] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2015] [Revised: 02/20/2015] [Accepted: 03/12/2015] [Indexed: 10/23/2022]
Abstract
We sought to demonstrate the in vivo transduction efficiency and tropism range in astrocytes of a combined-serotype adeno associated virus (AAV2/5/7/8/9). To control expression of enhanced green fluorescent protein (EGFP), a 1740bp glial fibrillary acidic protein (GFAP) promoter was obtained and ligated into vectors of each AAV serotype (2/5/7/8/9). Purified AAVs were then injected into the somatosensory cortex of C57BL/6J mice. Cell-type specific antibodies and subsequent immunofluorescence were used to identify astrocytes (GFAP), neurons (neuronal nuclear antigen, NeuN), microglia (ionized calcium-binding adapter molecule 1, Iba1), and oligodendrocytes (myelin basic protein, MBP), whereby, EGFP expression was measured in each cell type at 1-4 weeks post-injection. Our results indicated that the majority of astrocytes expressed EGFP, while only a small number of neurons expressed EGFP. Both microglia and oligodendrocytes lacked EGFP expression after viral injection. Quantitative analyses revealed that the percentage of EGFP-positive astrocytes was about 98% after viral injection, while the EGFP-positive neuronal percentage was less than 2%. Thus, this study shows that using a combined-serotype AAV carrying a 1740bp GFAP promoter results in successful, cell-type specific infection of the central nervous system, with robust gene expression in murine astrocytes.
Collapse
Affiliation(s)
- Xiandong Meng
- Department of Neurology, Xijing Hospital, Fourth Military Medical University, Xi'an 710032, China; Department of Neurology, Lanzhou General Hospital of PLA, Lanzhou 730050, PR China
| | - Feng Yang
- Department of Neurology, Xijing Hospital, Fourth Military Medical University, Xi'an 710032, China
| | - Tangpeng Ouyang
- Department of Neurology, Xijing Hospital, Fourth Military Medical University, Xi'an 710032, China
| | - Bing Liu
- Shanghai SBO Medical Biotechnology Co. Ltd., Shanghai 201203, China
| | - Chen Wu
- Department of Neurology, Xijing Hospital, Fourth Military Medical University, Xi'an 710032, China
| | - Wen Jiang
- Department of Neurology, Xijing Hospital, Fourth Military Medical University, Xi'an 710032, China.
| |
Collapse
|
21
|
Kumar PL, James PF. Identification and characterization of methylation-dependent/independent DNA regulatory elements in the human SLC9B1 gene. Gene 2015; 561:235-48. [PMID: 25701605 DOI: 10.1016/j.gene.2015.02.050] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2014] [Revised: 02/09/2015] [Accepted: 02/13/2015] [Indexed: 12/15/2022]
Abstract
The human NHEDC1 (hNHEDC1) protein is thought to be essential for sperm motility and fertility however the mechanisms regulating its gene expression are largely unknown. In this study we have identified multiple DNA regulatory elements in the 5' end of the gene encoding hNHEDC1 (SLC9B1) and have explored the role that DNA methylation at these elements plays in the regulation of its expression. We first show that the full-length hNHEDC1 protein is testis-specific for the tissues that we tested and that it localizes to the cells of the seminiferous tubules. In silico analysis of the SLC9B1 gene locus identified two putative promoters (P1 and P2) and two CpG islands - CpGI (overlapping with P1) and CpGII (intragenic) - at the 5' end of the gene. By deletion analysis of P1, we show that the region from -23 bp to +200 bp relative to the transcription start site (TSS) is sufficient for optimal promoter activity in a germ cell line. Additionally, in vitro methylation of the P1 (the -500 bp to +200 bp region relative to the TSS) abolishes its activity in germ cells and somatic cells strongly suggesting that DNA methylation at this promoter could regulate SLC9B1 expression. Furthermore, bisulfite-sequencing analysis of the P1/CpGI uncovered reduced methylation in the testis vs. lung whereas CpGII displayed no differences in methylation between these two tissues. Additionally, treatment of HEK 293 cells with 5-aza-2-Deoxycytidine led to upregulation of NHEDC1 transcript and reduced methylation in the promoter CpGI. Finally, we have uncovered both enhancer and silencer functions of the intragenic SLC9B1 CpGII. In all, our data suggests that SLC9B1 gene expression could be regulated via a concerted action of DNA methylation-dependent and independent mechanisms mediated by these multiple DNA regulatory elements.
Collapse
Affiliation(s)
- Priya L Kumar
- Department of Biology, Miami University, Oxford, OH, United States
| | - Paul F James
- Department of Biology, Miami University, Oxford, OH, United States.
| |
Collapse
|
22
|
Yella VR, Bansal M. In silico Identification of Eukaryotic Promoters. SYSTEMS AND SYNTHETIC BIOLOGY 2015. [DOI: 10.1007/978-94-017-9514-2_4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
23
|
Tornari C, Towers ER, Gale JE, Dawson SJ. Regulation of the orphan nuclear receptor Nr2f2 by the DFNA15 deafness gene Pou4f3. PLoS One 2014; 9:e112247. [PMID: 25372459 PMCID: PMC4221282 DOI: 10.1371/journal.pone.0112247] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2014] [Accepted: 10/08/2014] [Indexed: 12/23/2022] Open
Abstract
Hair cells are the mechanotransducing cells of the inner ear that are essential for hearing and balance. POU4F3--a POU-domain transcription factor selectively expressed by these cells--has been shown to be essential for hair cell differentiation and survival in mice and its mutation in humans underlies late-onset progressive hearing loss (DFNA15). The downstream targets of POU4F3 are required for hair cell differentiation and survival. We aimed to identify such targets in order to elucidate the molecular pathways involved in hair cell production and maintenance. The orphan thyroid nuclear receptor Nr2f2 was identified as a POU4F3 target using a subtractive hybridization strategy and EMSA analysis showed that POU4F3 binds to two sites in the Nr2f2 5' flanking region. These sites were shown to be required for POU4F3 activation as their mutation leads to a reduction in the response of an Nr2f2 5' flanking region reporter construct to POU4F3. Immunocytochemistry was carried out in the developing and adult inner ear in order to investigate the relevance of this interaction in hearing. NR2F2 expression in the postnatal mouse organ of Corti was shown to be detectable in all sensory epithelia examined and characterised. These data demonstrate that Nr2f2 is a direct target of POU4F3 in vitro and that this regulatory relationship may be relevant to hair cell development and survival.
Collapse
Affiliation(s)
| | - Emily R. Towers
- UCL Ear Institute, University College London, London, United Kingdom
| | - Jonathan E. Gale
- UCL Ear Institute, University College London, London, United Kingdom
| | - Sally J. Dawson
- UCL Ear Institute, University College London, London, United Kingdom
| |
Collapse
|
24
|
Rossmann C, Windpassinger C, Brunner D, Kovacevic A, Schweighofer N, Malli R, Schuligoi R, Prokesch A, Kluve-Beckerman B, Graier WF, Kratky D, Sattler W, Malle E. Characterization of rat serum amyloid A4 (SAA4): a novel member of the SAA superfamily. Biochem Biophys Res Commun 2014; 450:1643-9. [PMID: 25044109 PMCID: PMC4145149 DOI: 10.1016/j.bbrc.2014.07.054] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2014] [Accepted: 07/11/2014] [Indexed: 11/16/2022]
Abstract
The full length rat SAA4 (rSAA4) mRNA was characterized by rapid amplification of cDNA ends. rSAA4 mRNA has 1830 bases including a GA dinucleotide tandem repeat in the 5′UTR. Three consecutive C/EBP promoter elements are crucial for transcription of rSAA4. rSAA4 is abundantly expressed in the liver on mRNA and protein level.
The serum amyloid A (SAA) family of proteins is encoded by multiple genes, which display allelic variation and a high degree of homology in mammals. The SAA1/2 genes code for non-glycosylated acute-phase SAA1/2 proteins, that may increase up to 1000-fold during inflammation. The SAA4 gene, well characterized in humans (hSAA4) and mice (mSaa4) codes for a SAA4 protein that is glycosylated only in humans. We here report on a previously uncharacterized SAA4 gene (rSAA4) and its product in Rattus norvegicus, the only mammalian species known not to express acute-phase SAA. The exon/intron organization of rSAA4 is similar to that reported for hSAA4 and mSaa4. By performing 5′- and 3′RACE, we identified a 1830-bases containing rSAA4 mRNA (including a GA-dinucleotide tandem repeat). Highest rSAA4 mRNA expression was detected in rat liver. In McA-RH7777 rat hepatoma cells, rSAA4 transcription was significantly upregulated in response to LPS and IL-6 while IL-1α/β and TNFα were without effect. Luciferase assays with promoter-truncation constructs identified three proximal C/EBP-elements that mediate expression of rSAA4 in McA-RH7777 cells. In line with sequence prediction a 14-kDa non-glycosylated SAA4 protein is abundantly expressed in rat liver. Fluorescence microscopy revealed predominant localization of rSAA4-GFP-tagged fusion protein in the ER.
Collapse
Affiliation(s)
- Christine Rossmann
- Institute of Molecular Biology and Biochemistry, Medical University of Graz, Graz, Austria
| | | | - Daniela Brunner
- Institute of Human Genetics, Medical University of Graz, Graz, Austria
| | - Alenka Kovacevic
- Institute of Molecular Biology and Biochemistry, Medical University of Graz, Graz, Austria
| | - Natascha Schweighofer
- Institute of Molecular Biology and Biochemistry, Medical University of Graz, Graz, Austria
| | - Roland Malli
- Institute of Molecular Biology and Biochemistry, Medical University of Graz, Graz, Austria
| | - Rufina Schuligoi
- Institute of Experimental and Clinical Pharmacology, Medical University of Graz, Graz, Austria
| | - Andreas Prokesch
- Institute of Cell Biology, Histology and Embryology, Medical University of Graz, Graz, Austria; Institute of Biochemistry, Graz University of Technology, Graz, Austria
| | - Barbara Kluve-Beckerman
- Department of Pathology and Laboratory Medicine, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Wolfgang F Graier
- Institute of Molecular Biology and Biochemistry, Medical University of Graz, Graz, Austria
| | - Dagmar Kratky
- Institute of Molecular Biology and Biochemistry, Medical University of Graz, Graz, Austria
| | - Wolfgang Sattler
- Institute of Molecular Biology and Biochemistry, Medical University of Graz, Graz, Austria
| | - Ernst Malle
- Institute of Molecular Biology and Biochemistry, Medical University of Graz, Graz, Austria.
| |
Collapse
|
25
|
Roca H, Pande M, Huo JS, Hernandez J, Cavalcoli JD, Pienta KJ, McEachin RC. A bioinformatics approach reveals novel interactions of the OVOL transcription factors in the regulation of epithelial - mesenchymal cell reprogramming and cancer progression. BMC SYSTEMS BIOLOGY 2014; 8:29. [PMID: 24612742 PMCID: PMC4008156 DOI: 10.1186/1752-0509-8-29] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2013] [Accepted: 03/03/2014] [Indexed: 01/10/2023]
Abstract
Background Mesenchymal to Epithelial Transition (MET) plasticity is critical to cancer progression, and we recently showed that the OVOL transcription factors (TFs) are critical regulators of MET. Results of that work also posed the hypothesis that the OVOLs impact MET in a range of cancers. We now test this hypothesis by developing a model, OVOL Induced MET (OI-MET), and sub-model (OI-MET-TF), to characterize differential gene expression in MET common to prostate cancer (PC) and breast cancer (BC). Results In the OI-MET model, we identified 739 genes differentially expressed in both the PC and BC models. For this gene set, we found significant enrichment of annotation for BC, PC, cancer, and MET, as well as regulation of gene expression by AP1, STAT1, STAT3, and NFKB1. Focusing on the target genes for these four TFs plus the OVOLs, we produced the OI-MET-TF sub-model, which shows even greater enrichment for these annotations, plus significant evidence of cooperation among these five TFs. Based on known gene/drug interactions, we prioritized targets in the OI-MET-TF network for follow-on analysis, emphasizing the clinical relevance of this work. Reflecting these results back to the OI-MET model, we found that binding motifs for the TF pair AP1/MYC are more frequent than expected and that the AP1/MYC pair is significantly enriched in binding in cancer models, relative to non-cancer models, in these promoters. This effect is seen in both MET models (solid tumors) and in non-MET models (leukemia). These results are consistent with our hypothesis that the OVOLs impact cancer susceptibility by regulating MET, and extend the hypothesis to include mechanisms not specific to MET. Conclusions We find significant evidence of the OVOL, AP1, STAT1, STAT3, and NFKB1 TFs having important roles in MET, and more broadly in cancer. We prioritize known gene/drug targets for follow-up in the clinic, and we show that the AP1/MYC TF pair is a strong candidate for intervention.
Collapse
Affiliation(s)
| | | | | | | | | | - Kenneth J Pienta
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| | | |
Collapse
|
26
|
Huang WL, Tung CW, Liaw C, Huang HL, Ho SY. Rule-based knowledge acquisition method for promoter prediction in human and Drosophila species. ScientificWorldJournal 2014; 2014:327306. [PMID: 24955394 PMCID: PMC3927563 DOI: 10.1155/2014/327306] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2013] [Accepted: 10/10/2013] [Indexed: 01/08/2023] Open
Abstract
The rapid and reliable identification of promoter regions is important when the number of genomes to be sequenced is increasing very speedily. Various methods have been developed but few methods investigate the effectiveness of sequence-based features in promoter prediction. This study proposes a knowledge acquisition method (named PromHD) based on if-then rules for promoter prediction in human and Drosophila species. PromHD utilizes an effective feature-mining algorithm and a reference feature set of 167 DNA sequence descriptors (DNASDs), comprising three descriptors of physicochemical properties (absorption maxima, molecular weight, and molar absorption coefficient), 128 top-ranked descriptors of 4-mer motifs, and 36 global sequence descriptors. PromHD identifies two feature subsets with 99 and 74 DNASDs and yields test accuracies of 96.4% and 97.5% in human and Drosophila species, respectively. Based on the 99- and 74-dimensional feature vectors, PromHD generates several if-then rules by using the decision tree mechanism for promoter prediction. The top-ranked informative rules with high certainty grades reveal that the global sequence descriptor, the length of nucleotide A at the first position of the sequence, and two physicochemical properties, absorption maxima and molecular weight, are effective in distinguishing promoters from non-promoters in human and Drosophila species, respectively.
Collapse
Affiliation(s)
- Wen-Lin Huang
- Department of Management Information System, Asia Pacific Institute of Creativity, Miaoli 351, Taiwan
| | - Chun-Wei Tung
- School of Pharmacy, College of Pharmacy, Kaohsiung Medical University, Kaohsiung 807, Taiwan
| | - Chyn Liaw
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu 300, Taiwan
| | - Hui-Ling Huang
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu 300, Taiwan
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu 300, Taiwan
| | - Shinn-Ying Ho
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu 300, Taiwan
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu 300, Taiwan
| |
Collapse
|
27
|
Wu W, Zhang S, Li X, Xue M, Cao S, Chen W. Ets-2 regulates cell apoptosis via the Akt pathway, through the regulation of urothelial cancer associated 1, a long non-coding RNA, in bladder cancer cells. PLoS One 2013; 8:e73920. [PMID: 24069250 PMCID: PMC3771932 DOI: 10.1371/journal.pone.0073920] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2013] [Accepted: 07/24/2013] [Indexed: 02/07/2023] Open
Abstract
The majority of the human genome is transcribed and generates non-coding RNAs (ncRNAs) that fail to encode protein information. Long non-coding RNAs (lncRNAs) are emerging as a novel class of ncRNAs, but our knowledge about these ncRNAs is limited. Previously, our laboratory has identified that a lncRNA, Urothelial cancer associated 1 (UCA1), played an important role in bladder cancer. Despite the recent interest in UCA1 as a diagnostic marker for bladder cancer, little is known about its transcriptional regulation. To elucidate the regulation of UCA1 gene expression, we have characterized the human UCA1 gene promoter. A 2.0-kb fragment of its 5' flanking region was cloned into a luciferase reporter vector. Deletion and mutation analysis suggested that an Ets-2 binding site was critical for UCA1 gene promoter activity. Further analysis of this site by gel shifting, chromatin immune precipitation (ChIP), and co-transfection experiments showed that transcription factor Ets-2 directly bound to the UCA1 promoter region and stimulated UCA1 promoter activity in bladder cancer cells. Taking into account the anti-apoptosis function of Ets-2, our data suggested that Ets-2 regulates apoptosis process by regulating the expression of UCA1, moreover UCA1 may be involved in the activation of Akt signaling pathway by Ets-2 in bladder cancer cells.
Collapse
Affiliation(s)
- Wenjing Wu
- Clinical Laboratory, the First Affiliated Hospital, School of Medicine, Xi’an Jiaotong University, Xi’an, China
| | - Shuwan Zhang
- Clinical Laboratory, the First Affiliated Hospital, School of Medicine, Xi’an Jiaotong University, Xi’an, China
- Clinical Laboratory, Xi’an Children’s Hospital, Xi’an, China
| | - Xu Li
- Translational Medicine Center, the First Affiliated Hospital, School of Medicine, Xi’an Jiaotong University, Xi’an, China
| | - Mei Xue
- Translational Medicine Center, the First Affiliated Hospital, School of Medicine, Xi’an Jiaotong University, Xi’an, China
| | - Sancheng Cao
- Clinical Laboratory, Xi’an Children’s Hospital, Xi’an, China
| | - Wei Chen
- Clinical Laboratory, the First Affiliated Hospital, School of Medicine, Xi’an Jiaotong University, Xi’an, China
- * E-mail:
| |
Collapse
|
28
|
Bicker A, Dietrich D, Gleixner E, Kristiansen G, Gorr TA, Hankeln T. Extensive transcriptional complexity during hypoxia-regulated expression of the myoglobin gene in cancer. Hum Mol Genet 2013; 23:479-90. [DOI: 10.1093/hmg/ddt438] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
|
29
|
Transcriptional Characterization of Porcine Leptin and Leptin Receptor Genes. PLoS One 2013; 8:e66398. [PMID: 23824082 PMCID: PMC3688923 DOI: 10.1371/journal.pone.0066398] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2013] [Accepted: 05/06/2013] [Indexed: 12/17/2022] Open
Abstract
The leptin (LEP) and its receptor (LEPR) regulate food intake and energy balance through hypothalamic signaling. However, the LEP-LEPR axis seems to be more complex and its expression regulation has not been well described. In pigs, LEP and LEPR genes have been widely studied due to their relevance. Previous studies reported significant effects of SNPs located in both genes on growth and fatness traits. The aim of this study was to determine the expression profiles of LEP and LEPR across hypothalamic, adipose, hepatic and muscle tissues in Iberian x Landrace backcrossed pigs and to analyze the effects of gene variants on transcript abundance. To our knowledge, non porcine LEPR isoforms have been described rather than LEPRb. A short porcine LEPR isoform (LEPRa), that encodes a protein lacking the intracellular residues responsible of signal transduction, has been identified for the first time. The LEPRb isoform was only quantifiable in hypothalamus while LEPRa appeared widely expressed across tissues, but at higher levels in liver, suggesting that both isoforms would develop different roles. The unique LEP transcript showed expression in backfat and muscle. The effects of gene variants on transcript expression revealed interesting results. The LEPRc.1987C>T polymorphism showed opposite effects on LEPRb and LEPRa hypothalamic expression. In addition, one out of the 16 polymorphisms identified in the LEPR promoter region revealed high differential expression in hepatic LEPRa. These results suggest a LEPR isoform-specific regulation at tissue level. Conversely, non-differential expression of LEP conditional on the analyzed polymorphisms could be detected, indicating that its regulation is likely affected by other mechanisms rather than gene sequence variants. The present study has allowed a transcriptional characterization of LEP and LEPR isoforms on a range of tissues. Their expression patterns seem to indicate that both molecules develop peripheral roles apart from their known hypothalamic signal transduction function.
Collapse
|
30
|
Khan MAF, Soto-Jimenez LM, Howe T, Streit A, Sosinsky A, Stern CD. Computational tools and resources for prediction and analysis of gene regulatory regions in the chick genome. Genesis 2013; 51:311-24. [PMID: 23355428 PMCID: PMC3664090 DOI: 10.1002/dvg.22375] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2012] [Revised: 01/16/2013] [Accepted: 01/17/2013] [Indexed: 11/07/2022]
Abstract
The discovery of cis-regulatory elements is a challenging problem in bioinformatics, owing to distal locations and context-specific roles of these elements in controlling gene regulation. Here we review the current bioinformatics methodologies and resources available for systematic discovery of cis-acting regulatory elements and conserved transcription factor binding sites in the chick genome. In addition, we propose and make available, a novel workflow using computational tools that integrate CTCF analysis to predict putative insulator elements, enhancer prediction, and TFBS analysis. To demonstrate the usefulness of this computational workflow, we then use it to analyze the locus of the gene Sox2 whose developmental expression is known to be controlled by a complex array of cis-acting regulatory elements. The workflow accurately predicts most of the experimentally verified elements along with some that have not yet been discovered. A web version of the CTCF tool, together with instructions for using the workflow can be accessed from http://toolshed.g2.bx.psu.edu/view/mkhan1980/ctcf_analysis. For local installation of the tool, relevant Perl scripts and instructions are provided in the directory named "code" in the supplementary materials.
Collapse
Affiliation(s)
- Mohsin A F Khan
- Department of Cell & Developmental Biology, University College London, London, United Kingdom
| | | | | | | | | | | |
Collapse
|
31
|
Datta S, Mukhopadhyay S. A composite method based on formal grammar and DNA structural features in detecting human polymerase II promoter region. PLoS One 2013; 8:e54843. [PMID: 23437045 PMCID: PMC3577817 DOI: 10.1371/journal.pone.0054843] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2012] [Accepted: 12/17/2012] [Indexed: 11/25/2022] Open
Abstract
An important step in understanding gene regulation is to identify the promoter regions where the transcription factor binding takes place. Predicting a promoter region de novo has been a theoretical goal for many researchers for a long time. There exists a number of in silico methods to predict the promoter region de novo but most of these methods are still suffering from various shortcomings, a major one being the selection of appropriate features of promoter region distinguishing them from non-promoters. In this communication, we have proposed a new composite method that predicts promoter sequences based on the interrelationship between structural profiles of DNA and primary sequence elements of the promoter regions. We have shown that a Context Free Grammar (CFG) can formalize the relationships between different primary sequence features and by utilizing the CFG, we demonstrate that an efficient parser can be constructed for extracting these relationships from DNA sequences to distinguish the true promoter sequences from non-promoter sequences. Along with CFG, we have extracted the structural features of the promoter region to improve upon the efficiency of our prediction system. Extensive experiments performed on different datasets reveals that our method is effective in predicting promoter sequences on a genome-wide scale and performs satisfactorily as compared to other promoter prediction techniques.
Collapse
Affiliation(s)
- Sutapa Datta
- Department of Biophysics, Molecular Biology and Bioinformatics and Distributed Information Centre for Bioinformatics, University of Calcutta, Kolkata, West Bengal, India.
| | | |
Collapse
|
32
|
Gan Y, Guan J, Zhou S. A comparison study on feature selection of DNA structural properties for promoter prediction. BMC Bioinformatics 2012; 13:4. [PMID: 22226192 PMCID: PMC3280155 DOI: 10.1186/1471-2105-13-4] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2011] [Accepted: 01/07/2012] [Indexed: 01/27/2023] Open
Abstract
Background Promoter prediction is an integrant step for understanding gene regulation and annotating genomes. Traditional promoter analysis is mainly based on sequence compositional features. Recently, many kinds of structural features have been employed in promoter prediction. However, considering the high-dimensionality and overfitting problems, it is unfeasible to utilize all available features for promoter prediction. Thus it is necessary to choose some appropriate features for the prediction task. Results This paper conducts an extensive comparison study on feature selection of DNA structural properties for promoter prediction. Firstly, to examine whether promoters possess some special structures, we carry out a systematical comparison among the profiles of thirteen structural features on promoter and non-promoter sequences. Secondly, we investigate the correlations between these structural features and promoter sequences. Thirdly, both filter and wrapper methods are utilized to select appropriate feature subsets from thirteen different kinds of structural features for promoter prediction, and the predictive power of the selected feature subsets is evaluated. Finally, we compare the prediction performance of the feature subsets selected in this paper with nine existing promoter prediction approaches. Conclusions Experimental results show that the structural features are differentially correlated to promoters. Specifically, DNA-bending stiffness, DNA denaturation and energy-related features are highly correlated with promoters. The predictive power for promoter sequences differentiates greatly among different structural features. Selecting the relevant features can significantly improve the accuracy of promoter prediction.
Collapse
Affiliation(s)
- Yanglan Gan
- Department of Computer Science and Technology, Tongji University, Shanghai, China
| | | | | |
Collapse
|
33
|
Lampidonis A, Theodorou G, Pecorini C, Rebucci R, Baldi A, Politis I. Cloning of the 5′ regulatory regions and functional characterization of the core promoters of ovine PLAU (u-PA) and SERPIN1 (PAI-1). Gene 2011; 489:11-20. [DOI: 10.1016/j.gene.2011.08.020] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2011] [Revised: 07/08/2011] [Accepted: 08/26/2011] [Indexed: 10/17/2022]
|
34
|
Liu LH, Li H, Li JP, Zhong H, Zhang HC, Chen J, Xiao T. miR-125b suppresses the proliferation and migration of osteosarcoma cells through down-regulation of STAT3. Biochem Biophys Res Commun 2011; 416:31-8. [PMID: 22093834 DOI: 10.1016/j.bbrc.2011.10.117] [Citation(s) in RCA: 111] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2011] [Accepted: 10/26/2011] [Indexed: 01/05/2023]
Abstract
There is accumulating evidence that microRNAs are involved in multiple processes in development and tumor progression. Abnormally expressed miR-125b was found to play a fundamental role in several types of cancer; however, whether miR-125b participates in regulating the initiation and progress of osteosarcoma still remains unclear. Here we demonstrate that miR-125b is frequently down-regulated in osteosarcoma samples and human osteosarcoma cell lines. The ectopic restoration of miR-125b expression in human osteosarcoma cells suppresses proliferation and migration in vitro and inhibits tumor formation in vivo. We further identified signal transducer and activator of transcription 3 (STAT3) as the direct and functional downstream target of miR-125b. Interestingly, we discovered that the expression of miR-125b is regulated by STAT3 at the level of transcription. STAT3 binds to the promoter region of miR-125b in vitro and serves as a transactivator. Taken together, our findings point to an important role in the molecular etiology of osteosarcoma and suggest that miR-125b is a potential target in the treatment of osteosarcoma.
Collapse
Affiliation(s)
- Li-hong Liu
- Department of Orthopedics, The Second Xiangya Hospital of Central South University, Changsha 410010, China
| | | | | | | | | | | | | |
Collapse
|
35
|
Dupont PY, Stepien G. Computational analysis of the transcriptional regulation of the adenine nucleotide translocator isoform 4 gene and its role in spermatozoid glycolytic metabolism. Gene 2011; 487:38-45. [PMID: 21827840 DOI: 10.1016/j.gene.2011.07.024] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2011] [Revised: 07/08/2011] [Accepted: 07/14/2011] [Indexed: 01/01/2023]
Abstract
Computational phylogenetic analysis coupled to promoter sequence alignment was used to understand mechanisms of transcriptional regulation and to identify potentially coregulated genes. Our strategy was validated on the human ANT4 gene which encodes the fourth isoform of the mitochondrial adenine nucleotide translocator specifically expressed during spermatogenesis. The movement of sperm flagella is driven mainly by ATP generated by glycolytic pathways, and the specific induction of the mitochondrial ANT4 protein presented an interesting puzzle. We analysed the sequences of the promoters, introns and exons of 30 mammalian ANT4 genes and constructed regulatory models. The whole human genome and promoter database were screened for genes that were potentially regulated by the generated models. 80% of the identified co-regulated genes encoded proteins with specific roles in spermatogenesis and with functions linked to male reproduction. Our in silico study enabled us to precise the specific role of the ANT4 isoform in spermatozoid bioenergetics.
Collapse
|
36
|
Li W, Li X, Wang W, Li X, Tan Y, Yi M, Yang J, McCarthy JB, Xiong W, Wu M, Ma J, Su B, Zhang Z, Liao Q, Xiang B, Li G. NOR1 is an HSF1- and NRF1-regulated putative tumor suppressor inactivated by promoter hypermethylation in nasopharyngeal carcinoma. Carcinogenesis 2011; 32:1305-14. [PMID: 21803736 DOI: 10.1093/carcin/bgr174] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Promoter hypermethylation-mediated silencing of tumor suppressor genes (TSGs) is a hallmark of oncogenesis. Oxidored-nitro domain-containing protein 1 (NOR1) is a candidate TSG that is downregulated in nasopharyngeal carcinoma (NPC). In the present study, we identified a functional NOR1 promoter that is regulated by heat shock factor 1 and nuclear respiratory factor 1. The promoter is located within a CpG island. Hypermethylation of this CpG island was found in NPC tissue samples and cancer cell lines, whereas no aberrant promoter methylation was detected in non-cancerous nasopharyngeal tissue samples or normal nasopharyngeal epithelial cells. Treatment of NPC 6-10B cells and leukemia HL60 cells with 5'-aza-2'-deoxycytidine increased endogenous levels of NOR1 messenger RNA. Ectopic expression of NOR1 in NPC HNE1 cells inhibited tumor cell colony formation and viability. These findings suggest that promoter hypermethylation may participate in transcriptional inactivation of the NOR1 gene in NPC. Frequent epigenetic inactivation of the NOR1 gene in NPC suggests that it may be a critical tumor suppressor involved in the development of NPC.
Collapse
Affiliation(s)
- Wenjuan Li
- Cancer Research Institute, Xiangya School of Medicine, Central South University, Xiangya Road, Changsha, Hunan 410078, China
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
37
|
Bensaïd M, Hmani-Aifa M, Hammami B, Tlili A, Hakim B, Charfeddine I, Ayadi H, Ghorbel A, Castillo ID, Masmoudi S. DFNB66 and DFNB67 loci are non allelic and rarely contribute to autosomal recessive nonsyndromic hearing loss. Eur J Med Genet 2011; 54:e565-9. [PMID: 21816241 DOI: 10.1016/j.ejmg.2011.07.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2010] [Accepted: 07/08/2011] [Indexed: 11/17/2022]
Abstract
We previously mapped the DFNB66 locus to an interval overlapping the DFNB67 region. Mutations in the LHFPL5 gene were identified as a cause of DFNB67 hearing loss (HL). However, screening of the coding exons of LHFPL5 did not reveal any mutation in the DFNB66 family. The objective of this study was to check whether DFNB66 and DFNB67 are distinctive loci and determining their contribution to HL. In the DFNB66 family, sequencing showed absence of mutations in the untranslated regions and the predicted promoter sequence of LHFPL5. Analysis of five microsatellites in the 6p21.31-22.3 region and screening of the LHFPL5 gene by DNA heteroduplex analysis in DHPLC revealed a novel mutation (c.89dup) in one out of 129 unrelated Tunisian families with autosomal recessive nonsyndromic (ARNS) HL. Our findings suggest that two distinct genes are responsible for DFNB66 and DFNB67 HL. These loci are likely to be a rare cause of ARNSHL.
Collapse
Affiliation(s)
- Mariem Bensaïd
- Equipe Procédés de Criblages Moléculaires et Cellulaires, Laboratoire de Microorganismes et de Biomolécules, Centre de Biotechnologie de Sfax, Université de Sfax, Tunisia
| | | | | | | | | | | | | | | | | | | |
Collapse
|
38
|
Mishra H, Singh N, Misra K, Lahiri T. An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data. Bioinformation 2011; 6:240-3. [PMID: 21887014 PMCID: PMC3159145 DOI: 10.6026/97320630006240] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2011] [Accepted: 05/09/2011] [Indexed: 11/23/2022] Open
Abstract
Identification of promoter region is an important part of gene annotation. Identification of promoters in eukaryotes is important as promoters modulate various
metabolic functions and cellular stress responses. In this work, a novel approach utilizing intensity values of tilling microarray data for a model eukaryotic plant
Arabidopsis thaliana, was used to specify promoter region from non-promoter region. A feed-forward back propagation neural network model supported by
genetic algorithm was employed to predict the class of data with a window size of 41. A dataset comprising of 2992 data vectors representing both promoter and
non-promoter regions, chosen randomly from probe intensity vectors for whole genome of Arabidopsis thaliana generated through tilling microarray technique
was used. The classifier model shows prediction accuracy of 69.73% and 65.36% on training and validation sets, respectively. Further, a concept of distance based
class membership was used to validate reliability of classifier, which showed promising results. The study shows the usability of micro-array probe intensities to
predict the promoter regions in eukaryotic genomes.
Collapse
Affiliation(s)
- Hrishikesh Mishra
- Division of Applied Sciences and Indo-Russian Centre for Biotechnology, Indian Institute of Information Technology, Allahabad, India
| | | | | | | |
Collapse
|
39
|
Prokesch A, Bogner-Strauss JG, Hackl H, Rieder D, Neuhold C, Walenta E, Krogsdam A, Scheideler M, Papak C, Wong WC, Vinson C, Eisenhaber F, Trajanoski Z. Arxes: retrotransposed genes required for adipogenesis. Nucleic Acids Res 2010; 39:3224-39. [PMID: 21177646 PMCID: PMC3082915 DOI: 10.1093/nar/gkq1289] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Retrotransposed sequences arise from messenger RNAs (mRNAs) that have been reinserted into genomic DNA by reverse transcription. Usually, these sequences are embedded in dormant regions, collect missense mutations over time and constitute processed, nonfunctional pseudogenes. There are thousands of processed pseudogenes in the mouse and human genome. Here, we report evidence for two paralog genes (termed Arxes1 and Arxes2), which arose by retrotransposition of the signal peptidase Spcs3 followed by a segmental duplication event. They gained a functional promoter that we show to be transactivated by adipogenic transcription factors. We further show that the Arxes mRNAs are highly expressed in adipose tissue and strongly upregulated during adipogenesis in different cell models. Additionally, their expression is elevated by an anti-diabetic agent in vitro and in vivo. Importantly, we provide evidence that the Arxes genes are translated and that the proteins are located in the endoplasmic reticulum. Although the sequence similarity and subcellular location are reminiscent of their parental gene, our data suggest that the Arxes have developed a different function, since their expression is required for adipogenesis, whereas Spcs3 is dispensable. In summary, we report retrotransposed-duplicated genes that evolved from a parental gene to function in a tissue and adipogenesis-specific context.
Collapse
Affiliation(s)
- Andreas Prokesch
- Institute for Genomics and Bioinformatics, Graz University of Technology, Biocenter, Austria.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Identification of TATA and TATA-less promoters in plant genomes by integrating diversity measure, GC-Skew and DNA geometric flexibility. Genomics 2010; 97:112-20. [PMID: 21112384 DOI: 10.1016/j.ygeno.2010.11.002] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2010] [Revised: 11/05/2010] [Accepted: 11/12/2010] [Indexed: 11/20/2022]
Abstract
Accurate identification of core promoters is important for gaining more insight about the understanding of the eukaryotic transcription regulation. In this study, the authors focused on the biologically realistic promoter prediction of plant genomes. By analyzing the correlative conservation, GC-compositional bias and specific structural patterns of TATA and TATA-less promoters in PlantPromDB, a hybrid multi-feature approach based on support vector machine (SVM) for predicting the two types of promoters were developed by integrating local word content, GC-Skew and DNA geometric flexibility. Compared with the TSSP-TCM program on the same test dataset, better prediction results were obtained. Especially for the TATA-less promoter, the accuracy is 10% higher than the result of TSSP-TCM program. The good performance of the hybrid promoters and the experimental data also indicate that our method has the ability to locate the promoter region of the plant genome.
Collapse
|
41
|
Zeng J, Zhao XY, Cao XQ, Yan H. SCS: signal, context, and structure features for genome-wide human promoter recognition. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2010; 7:550-562. [PMID: 20671324 DOI: 10.1109/tcbb.2008.95] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
This paper integrates the signal, context, and structure features for genome-wide human promoter recognition, which is important in improving genome annotation and analyzing transcriptional regulation without experimental supports of ESTs, cDNAs, or mRNAs. First, CpG islands are salient biological signals associated with approximately 50 percent of mammalian promoters. Second, the genomic context of promoters may have biological significance, which is based on n-mers (sequences of n bases long) and their statistics estimated from training samples. Third, sequence-dependent DNA flexibility originates from DNA 3D structures and plays an important role in guiding transcription factors to the target site in promoters. Employing decision trees, we combine above signal, context, and structure features to build a hierarchical promoter recognition system called SCS. Experimental results on controlled data sets and the entire human genome demonstrate that SCS is significantly superior in terms of sensitivity and specificity as compared to other state-of-the-art methods. The SCS promoter recognition system is available online as supplemental materials for academic use and can be found on the Computer Society Digital Library at http://doi.ieeecomputersociety.org/10.1109/TCBB.2008.95.
Collapse
Affiliation(s)
- Jia Zeng
- School of Computer Science and Technology, Soochow University, Suzhou, China.
| | | | | | | |
Collapse
|
42
|
Dunham I, Beare DM, Collins JE. The characteristics of human genes: analysis of human chromosome 22. Comp Funct Genomics 2010; 4:635-46. [PMID: 18629020 PMCID: PMC2447302 DOI: 10.1002/cfg.335] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2003] [Revised: 09/04/2003] [Accepted: 09/08/2003] [Indexed: 11/11/2022] Open
Affiliation(s)
- Ian Dunham
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK.
| | | | | |
Collapse
|
43
|
Chavali VRM, Sommer JR, Petters RM, Ayyagari R. Identification of a promoter for the human C1Q-tumor necrosis factor-related protein-5 gene associated with late-onset retinal degeneration. Invest Ophthalmol Vis Sci 2010; 51:5499-507. [PMID: 20554618 DOI: 10.1167/iovs.10-5543] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
PURPOSE The Complement-1q tumor necrosis factor-related protein 5 (C1QTNF5/CTRP5) gene is located in the 3' untranslated region of the Membrane Frizzled Related Protein (MFRP) gene, and these two genes are reported to be dicistronic. The authors examined the 5' upstream sequence of CTRP5 for the presence of a promoter regulating the expression of this gene. METHODS The sequence upstream of the translational start site of human CTRP5 (hCTRP5) was analyzed by Promoter Inspector software. A series of plasmids containing segments of hCTRP5 putative promoter sequence (-29 bp to -3.6 kb) upstream of the luciferase gene were generated. Cells were transiently transfected with these plasmids, and luciferase activity was measured. 5' RACE analysis was performed to determine the functional transcription start site. V5 tagged-pig CTRP5 (pCTRP5) gene, cloned downstream of the hCTRP5 putative promoter, was expressed in a human retinal cell line (ARPE-19) and a Chinese hamster ovary cell line (CHO-K1) to study the functionality of the putative promoter. RESULTS Bioinformatic analysis identified a putative promoter region between nt -1322 and +1 sequence of hCTRP5. 5' RACE analysis revealed the presence of the transcriptional start site (TSS) at 62 bp upstream of the start codon in the CTRP5. The 1.3-kb sequence of the hCTRP5 predicted promoter produced higher levels of luciferase activity, indicating the strength of the cloned CTRP5 promoter. The promoter sequence between nt -1322 bp to -29 bp upstream of the first ATG of CTRP5 was found to be essential for this promoter activity. The predicted hCTRP5 promoter was found to control the expression of V5-tagged pCTRP5 and nuclear GFP, indicating that the promoter was functional. CONCLUSIONS This study revealed the presence of a functional promoter for the CTRP5 gene located 5' of its start site. Understanding the regulation of CTRP5 gene transcription may provide insights into the possible role of CTRP5 in the retina and the pathology underlying late-onset retinal degeneration caused by mutations in this gene. In addition, these studies will determine whether CTRP5 and MFRP are functionally dicistronic.
Collapse
Affiliation(s)
- Venkata R M Chavali
- Department of Ophthalmology, University of California San Diego, La Jolla, California, USA
| | | | | | | |
Collapse
|
44
|
Brower CS, Veiga L, Jones RH, Varshavsky A. Mouse Dfa is a repressor of TATA-box promoters and interacts with the Abt1 activator of basal transcription. J Biol Chem 2010; 285:17218-34. [PMID: 20356838 DOI: 10.1074/jbc.m110.118638] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Our study of the mouse Ate1 arginyltransferase, a component of the N-end rule pathway, has shown that Ate1 pre-mRNA is produced from a bidirectional promoter that also expresses, in the opposite direction, a previously uncharacterized gene (Hu, R. G., Brower, C. S., Wang, H., Davydov, I. V., Sheng, J., Zhou, J., Kwon, Y. T., and Varshavsky, A. (2006) J. Biol. Chem. 281, 32559-32573). In this work, we began analyzing this gene, termed Dfa (divergent from Ate1). Mouse Dfa was found to be transcribed from both the bidirectional P(Ate1/Dfa) promoter and other nearby promoters. The resulting transcripts are alternatively spliced, yielding a complex set of Dfa mRNAs that are present largely, although not exclusively, in the testis. A specific Dfa mRNA encodes, via its 3'-terminal exon, a 217-residue protein termed Dfa(A). Other Dfa mRNAs also contain this exon. Dfa(A) is sequelogous (similar in sequence) to a region of the human/mouse HTEX4 protein, whose physiological function is unknown. We produced an affinity-purified antibody to recombinant mouse Dfa(A) that detected a 35-kDa protein in the mouse testis and in several cell lines. Experiments in which RNA interference was used to down-regulate Dfa indicated that the 35-kDa protein was indeed Dfa(A). Furthermore, Dfa(A) was present in the interchromatin granule clusters and was also found to bind to the Ggnbp1 gametogenetin-binding protein-1 and to the Abt1 activator of basal transcription that interacts with the TATA-binding protein. Given these results, RNA interference was used to probe the influence of Dfa levels in luciferase reporter assays. We found that Dfa(A) acts as a repressor of TATA-box transcriptional promoters.
Collapse
Affiliation(s)
- Christopher S Brower
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | | | | | | |
Collapse
|
45
|
Luquin N, Yu B, Saunderson RB, Trent RJ, Pamphlett R. Genetic variants in the promoter of TARDBP in sporadic amyotrophic lateral sclerosis. Neuromuscul Disord 2009; 19:696-700. [PMID: 19695877 DOI: 10.1016/j.nmd.2009.07.005] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2009] [Accepted: 07/16/2009] [Indexed: 12/12/2022]
Abstract
All patients with sporadic amyotrophic lateral sclerosis (SALS) have TDP-43 inclusions in their motor neurons, suggesting this protein plays a major role in the disease. Coding mutations in the gene for TDP-43, TARDBP, have been found in only a few patients with SALS. However, the non-coding regulatory regions of TARDBP have not yet been examined in SALS. We therefore sequenced both coding and non-coding regions of TARDBP in 46 tissue-banked SALS brains (brain DNA was used to detect somatic mutations). Non-coding variants (in the promoter or intron 1) were detected in 16 patients (35%) and coding variants in 4 (9%). Two known promoter variants were found more frequently in SALS patients than in controls. Two other variants, found in one patient each but not in controls, have potential regulatory functions. In addition, a novel exon 2 change with predicted functional effects was found in one patient. In summary, variants in the promoter and other non-coding regions of TARDBP may disturb the regulation of this gene in some patients with SALS.
Collapse
Affiliation(s)
- Natasha Luquin
- The Stacey Motor Neuron Disease Laboratory, Department of Pathology, The University of Sydney, Sydney, Australia
| | | | | | | | | |
Collapse
|
46
|
Identification and analysis of the human CD160 promoter: implication of a potential AML-1 binding site in promoter activation. Genes Immun 2009; 10:616-23. [DOI: 10.1038/gene.2009.52] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
47
|
Phylogenetic Analysis and in Silico Characterization of the GARS-AIRS-GART Gene which Codes for a tri-Functional Enzyme Protein Involved in de novo Purine Biosynthesis. Mol Biotechnol 2009; 42:306-19. [DOI: 10.1007/s12033-009-9160-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2008] [Accepted: 02/25/2009] [Indexed: 10/21/2022]
|
48
|
Zeng J, Zhu S, Yan H. Towards accurate human promoter recognition: a review of currently used sequence features and classification methods. Brief Bioinform 2009; 10:498-508. [PMID: 19531545 DOI: 10.1093/bib/bbp027] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
This review describes important advances that have been made during the past decade for genome-wide human promoter recognition. Interest in promoter recognition algorithms on a genome-wide scale is worldwide and touches on a number of practical systems that are important in analysis of gene regulation and in genome annotation without experimental support of ESTs, cDNAs or mRNAs. The main focus of this review is on feature extraction and model selection for accurate human promoter recognition, with descriptions of what they are, what has been accomplished, and what remains to be done.
Collapse
Affiliation(s)
- Jia Zeng
- Department of Computer Science, Hong Kong Baptist University, Kowloon, Hong Kong.
| | | | | |
Collapse
|
49
|
Gan Y, Guan J, Zhou S. A pattern-based nearest neighbor search approach for promoter prediction using DNA structural profiles. ACTA ACUST UNITED AC 2009; 25:2006-12. [PMID: 19515962 DOI: 10.1093/bioinformatics/btp359] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
MOTIVATION Identification of core promoters is a key clue in understanding gene regulations. However, due to the diverse nature of promoter sequences, the accuracy of existing prediction approaches for non-CpG island (simply CGI)-related promoters is not as high as that for CGI-related promoters. This consequently leads to a low genome-wide promoter prediction accuracy. RESULTS In this article, we first systematically analyze the similarities and differences between the two types of promoters (CGI- and non-CGI-related) from a novel structural perspective, and then devise a unified framework, called PNNP (Pattern-based Nearest Neighbor search for Promoter), to predict both CGI- and non-CGI-related promoters based on their structural features. Our comparative analysis on the structural characteristics of promoters reveals two interesting facts: (i) the structural values of CGI- and non-CGI-related promoters are quite different, but they exhibit nearly similar structural patterns; (ii) the structural patterns of promoters are obviously different from that of non-promoter sequences though the sequences have almost similar structural values. Extensive experiments demonstrate that the proposed PNNP approach is effective in capturing the structural patterns of promoters, and can significantly improve genome-wide performance of promoters prediction, especially non-CGI-related promoters prediction. AVAILABILITY The implementation of the program PNNP is available at http://admis.tongji.edu.cn/Projects/pnnp.aspx.
Collapse
Affiliation(s)
- Yanglan Gan
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China
| | | | | |
Collapse
|
50
|
Narlikar L, Ovcharenko I. Identifying regulatory elements in eukaryotic genomes. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2009; 8:215-30. [PMID: 19498043 DOI: 10.1093/bfgp/elp014] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Proper development and functioning of an organism depends on precise spatial and temporal expression of all its genes. These coordinated expression-patterns are maintained primarily through the process of transcriptional regulation. Transcriptional regulation is mediated by proteins binding to regulatory elements on the DNA in a combinatorial manner, where particular combinations of transcription factor binding sites establish specific regulatory codes. In this review, we survey experimental and computational approaches geared towards the identification of proximal and distal gene regulatory elements in the genomes of complex eukaryotes. Available approaches that decipher the genetic structure and function of regulatory elements by exploiting various sources of information like gene expression data, chromatin structure, DNA-binding specificities of transcription factors, cooperativity of transcription factors, etc. are highlighted. We also discuss the relevance of regulatory elements in the context of human health through examples of mutations in some of these regions having serious implications in misregulation of genes and being strongly associated with human disorders.
Collapse
Affiliation(s)
- Leelavati Narlikar
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | |
Collapse
|