Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Li L, Liang Y, Bass RL. GAPWM: a genetic algorithm method for optimizing a position weight matrix. Bioinformatics 2007;23:1188-94. [PMID: 17341493 DOI: 10.1093/bioinformatics/btm080] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open

For:	Li L, Liang Y, Bass RL. GAPWM: a genetic algorithm method for optimizing a position weight matrix. Bioinformatics 2007;23:1188-94. [PMID: 17341493 DOI: 10.1093/bioinformatics/btm080] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open

Number

Cited by Other Article(s)

Geete K, Pandey M. Robust Transcription Factor Binding Site Prediction Using Deep Neural Networks. Curr Bioinform 2021. [DOI: 10.2174/1574893615999200429121156] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Zhang Q, Zhu L, Huang DS. High-Order Convolutional Neural Network Architecture for Predicting DNA-Protein Binding Sites. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019;16:1184-1192. [PMID: 29993783 DOI: 10.1109/tcbb.2018.2819660] [Citation(s) in RCA: 55] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]

Zhang H, Zhu L, Huang DS. DiscMLA: An Efficient Discriminative Motif Learning Algorithm over High-Throughput Datasets. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018;15:1810-1820. [PMID: 27164602 DOI: 10.1109/tcbb.2016.2561930] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Lee NK, Li X, Wang D. A comprehensive survey on genetic algorithms for DNA motif prediction. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2018.07.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Zhu L, Zhang HB, Huang DS. Direct AUC optimization of regulatory motifs. Bioinformatics 2018;33:i243-i251. [PMID: 28881989 PMCID: PMC5870558 DOI: 10.1093/bioinformatics/btx255] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open

Zhu L, Zhang HB, Huang DS. LMMO: A Large Margin Approach for Refining Regulatory Motifs. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018;15:913-925. [PMID: 28391205 DOI: 10.1109/tcbb.2017.2691325] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Zhang H, Zhu L, Huang DS. WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data. Sci Rep 2017;7:3217. [PMID: 28607381 PMCID: PMC5468353 DOI: 10.1038/s41598-017-03554-7] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2016] [Accepted: 05/02/2017] [Indexed: 01/24/2023] Open

Patel RY, Stormo GD. Discriminative motif optimization based on perceptron training. ACTA ACUST UNITED AC 2013;30:941-8. [PMID: 24369152 DOI: 10.1093/bioinformatics/btt748] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]

Wang D, Tapan S. MISCORE: a new scoring function for characterizing DNA regulatory motifs in promoter sequences. BMC SYSTEMS BIOLOGY 2012;6 Suppl 2:S4. [PMID: 23282090 PMCID: PMC3521183 DOI: 10.1186/1752-0509-6-s2-s4] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Abstract

Background

Computational approaches for finding DNA regulatory motifs in promoter sequences are useful to biologists in terms of reducing the experimental costs and speeding up the discovery process of de novo binding sites. It is important for rule-based or clustering-based motif searching schemes to effectively and efficiently evaluate the similarity between a k-mer (a k-length subsequence) and a motif model, without assuming the independence of nucleotides in motif models or without employing computationally expensive Markov chain models to estimate the background probabilities of k-mers. Also, it is interesting and beneficial to use a priori knowledge in developing advanced searching tools.

Results

This paper presents a new scoring function, termed as MISCORE, for functional motif characterization and evaluation. Our MISCORE is free from: (i) any assumption on model dependency; and (ii) the use of Markov chain model for background modeling. It integrates the compositional complexity of motif instances into the function. Performance evaluations with comparison to the well-known Maximum a Posteriori (MAP) score and Information Content (IC) have shown that MISCORE has promising capabilities to separate and recognize functional DNA motifs and its instances from non-functional ones.

Conclusions

MISCORE is a fast computational tool for candidate motif characterization, evaluation and selection. It enables to embed priori known motif models for computing motif-to-motif similarity, which is more advantageous than IC and MAP score. In addition to these merits mentioned above, MISCORE can automatically filter out some repetitive k-mers from a motif model due to the introduction of the compositional complexity in the function. Consequently, the merits of our proposed MISCORE in terms of both motif signal modeling power and computational efficiency will make it more applicable in the development of computational motif discovery tools.

Collapse

Nandi S, Ioshikhes I. Optimizing the GATA-3 position weight matrix to improve the identification of novel binding sites. BMC Genomics 2012;13:416. [PMID: 22913572 PMCID: PMC3481455 DOI: 10.1186/1471-2164-13-416] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2011] [Accepted: 08/02/2012] [Indexed: 11/21/2022] Open

Modular insulators: genome wide search for composite CTCF/thyroid hormone receptor binding sites. PLoS One 2010;5:e10119. [PMID: 20404925 PMCID: PMC2852416 DOI: 10.1371/journal.pone.0010119] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2009] [Accepted: 03/18/2010] [Indexed: 02/07/2023] Open

Le T, Altman T, Gardiner K. HIGEDA: a hierarchical gene-set genetics based algorithm for finding subtle motifs in biological sequences. Bioinformatics 2010;26:302-9. [PMID: 19996163 DOI: 10.1093/bioinformatics/btp676] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Combinatorial binding predicts spatio-temporal cis-regulatory activity. Nature 2009;462:65-70. [PMID: 19890324 DOI: 10.1038/nature08531] [Citation(s) in RCA: 299] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2009] [Accepted: 09/22/2009] [Indexed: 11/09/2022]

Jordan JJ, Menendez D, Inga A, Nourredine M, Bell D, Resnick MA. Noncanonical DNA motifs as transactivation targets by wild type and mutant p53. PLoS Genet 2008;4:e1000104. [PMID: 18714371 PMCID: PMC2518093 DOI: 10.1371/journal.pgen.1000104] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2008] [Accepted: 05/22/2008] [Indexed: 12/31/2022] Open

Abstract

Sequence-specific binding by the human p53 master regulator is critical to its tumor suppressor activity in response to environmental stresses. p53 binds as a tetramer to two decameric half-sites separated by 0–13 nucleotides (nt), originally defined by the consensus RRRCWWGYYY (n = 0–13) RRRCWWGYYY. To better understand the role of sequence, organization, and level of p53 on transactivation at target response elements (REs) by wild type (WT) and mutant p53, we deconstructed the functional p53 canonical consensus sequence using budding yeast and human cell systems. Contrary to early reports on binding in vitro, small increases in distance between decamer half-sites greatly reduces p53 transactivation, as demonstrated for the natural TIGER RE. This was confirmed with human cell extracts using a newly developed, semi–in vitro microsphere binding assay. These results contrast with the synergistic increase in transactivation from a pair of weak, full-site REs in the MDM2 promoter that are separated by an evolutionary conserved 17 bp spacer. Surprisingly, there can be substantial transactivation at noncanonical ½-(a single decamer) and ¾-sites, some of which were originally classified as biologically relevant canonical consensus sequences including PIDD and Apaf-1. p53 family members p63 and p73 yielded similar results. Efficient transactivation from noncanonical elements requires tetrameric p53, and the presence of the carboxy terminal, non-specific DNA binding domain enhanced transactivation from noncanonical sequences. Our findings demonstrate that RE sequence, organization, and level of p53 can strongly impact p53-mediated transactivation, thereby changing the view of what constitutes a functional p53 target. Importantly, inclusion of ½- and ¾-site REs greatly expands the p53 master regulatory network.

Within human cells, the tumor suppressor p53 is the central node of regulation required to elicit multiple biological responses that include cell cycle arrest and death in response to stress or DNA damage, where mutations in p53 are a hallmark of cancer. As a master regulatory gene, p53 controls the action of target genes within its network by directly interacting with a widely accepted consensus DNA binding sequence, composed of two decamer ½-sites that can be separated by up to 13 bases. While mismatches from consensus sequence are frequent, the canonical consensus sequence places a limitation upon the organization and number of target genes within the p53 transcriptional network. Using yeast and human cell systems, our goal was to further understand how the DNA sequence, DNA organization, and level of p53 expression might influence the inclusion of genes within the p53 regulatory network. We found that increases in spacer beyond a few bases greatly reduce responsiveness to p53. Importantly, we established that p53 can function from noncanonical sequences comprising only a decamer ½-site or a ¾-site. These findings further define and expand the universe of potential downstream target genes which may be regulated by p53 and bring further diversity into the p53 regulatory network.

Collapse

Li L, Bass RL, Liang Y. fdrMotif: identifying cis-elements by an EM algorithm coupled with false discovery rate control. ACTA ACUST UNITED AC 2008;24:629-36. [PMID: 18296465 DOI: 10.1093/bioinformatics/btn009] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Abstract

MOTIVATION

Most de novo motif identification methods optimize the motif model first and then separately test the statistical significance of the motif score. In the first stage, a motif abundance parameter needs to be specified or modeled. In the second stage, a Z-score or P-value is used as the test statistic. Error rates under multiple comparisons are not fully considered.

METHODOLOGY

We propose a simple but novel approach, fdrMotif, that selects as many binding sites as possible while controlling a user-specified false discovery rate (FDR). Unlike existing iterative methods, fdrMotif combines model optimization [e.g. position weight matrix (PWM)] and significance testing at each step. By monitoring the proportion of binding sites selected in many sets of background sequences, fdrMotif controls the FDR in the original data. The model is then updated using an expectation (E)- and maximization (M)-like procedure. We propose a new normalization procedure in the E-step for updating the model. This process is repeated until either the model converges or the number of iterations exceeds a maximum.

RESULTS

Simulation studies suggest that our normalization procedure assigns larger weights to the binding sites than do two other commonly used normalization procedures. Furthermore, fdrMotif requires only a user-specified FDR and an initial PWM. When tested on 542 high confidence experimental p53 binding loci, fdrMotif identified 569 p53 binding sites in 505 (93.2%) sequences. In comparison, MEME identified more binding sites but in fewer ChIP sequences than fdrMotif. When tested on 500 sets of simulated 'ChIP' sequences with embedded known p53 binding sites, fdrMotif, compared to MEME, has higher sensitivity with similar positive predictive value. Furthermore, fdrMotif is robust to noise: it selected nearly identical binding sites in data adulterated with 50% added background sequences and the unadulterated data. We suggest that fdrMotif represents an improvement over MEME.

AVAILABILITY

C code can be found at: http://www.niehs.nih.gov/research/resources/software/fdrMotif/.

Collapse