1
|
Karimzadeh M, Hoffman MM. Virtual ChIP-seq: predicting transcription factor binding by learning from the transcriptome. Genome Biol 2022; 23:126. [PMID: 35681170 PMCID: PMC9185870 DOI: 10.1186/s13059-022-02690-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Accepted: 05/16/2022] [Indexed: 11/29/2022] Open
Abstract
Existing methods for computational prediction of transcription factor (TF) binding sites evaluate genomic regions with similarity to known TF sequence preferences. Most TF binding sites, however, do not resemble known TF sequence motifs, and many TFs are not sequence-specific. We developed Virtual ChIP-seq, which predicts binding of individual TFs in new cell types, integrating learned associations with gene expression and binding, TF binding sites from other cell types, and chromatin accessibility data in the new cell type. This approach outperforms methods that predict TF binding solely based on sequence preference, predicting binding for 36 TFs (MCC>0.3).
Collapse
Affiliation(s)
- Mehran Karimzadeh
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.,Princess Margaret Cancer Centre, Toronto, ON, Canada.,Vector Institute, Toronto, ON, Canada
| | - Michael M Hoffman
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada. .,Princess Margaret Cancer Centre, Toronto, ON, Canada. .,Vector Institute, Toronto, ON, Canada. .,Department of Computer Science, University of Toronto, Toronto, ON, Canada.
| |
Collapse
|
2
|
Khan A, Singh K, Jaiswal S, Raza M, Jasrotia RS, Kumar A, Gurjar AKS, Kumari J, Nayan V, Iquebal MA, Angadi UB, Rai A, Datta TK, Kumar D. Whole-Genome-Based Web Genomic Resource for Water Buffalo (Bubalus bubalis). Front Genet 2022; 13:809741. [PMID: 35480326 PMCID: PMC9035531 DOI: 10.3389/fgene.2022.809741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 02/14/2022] [Indexed: 11/13/2022] Open
Abstract
Water buffalo (Bubalus bubalis), belonging to the Bovidae family, is an economically important animal as it is the major source of milk, meat, and drought in numerous countries. It is mainly distributed in tropical and subtropical regions with a global population of approximately 202 million. The advent of low cost and rapid sequencing technologies has opened a new vista for global buffalo researchers. In this study, we utilized the genomic data of five commercially important buffalo breeds, distributed globally, namely, Mediterranean, Egyptian, Bangladesh, Jaffrarabadi, and Murrah. Since there is no whole-genome sequence analysis of these five distinct buffalo breeds, which represent a highly diverse ecosystem, we made an attempt for the same. We report the first comprehensive, holistic, and user-friendly web genomic resource of buffalo (BuffGR) accessible at http://backlin.cabgrid.res.in/buffgr/, that catalogues 6028881 SNPs and 613403 InDels extracted from a set of 31 buffalo tissues. We found a total of 7727122 SNPs and 634124 InDels distributed in four breeds of buffalo (Murrah, Bangladesh, Jaffarabadi, and Egyptian) with reference to the Mediterranean breed. It also houses 4504691 SSR markers from all the breeds along with 1458 unique circRNAs, 37712 lncRNAs, and 938 miRNAs. This comprehensive web resource can be widely used by buffalo researchers across the globe for use of markers in marker trait association, genetic diversity among the different breeds of buffalo, use of ncRNAs as regulatory molecules, post-transcriptional regulations, and role in various diseases/stresses. These SNPs and InDelscan also be used as biomarkers to address adulteration and traceability. This resource can also be useful in buffalo improvement programs and disease/breed management.
Collapse
Affiliation(s)
- Aamir Khan
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Kalpana Singh
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Sarika Jaiswal
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Mustafa Raza
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Rahul Singh Jasrotia
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Animesh Kumar
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Anoop Kishor Singh Gurjar
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Juli Kumari
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Varij Nayan
- ICAR-Central Institute for Research on Buffaloes, Hisar, India
| | - Mir Asif Iquebal
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
- *Correspondence: Mir Asif Iquebal,
| | - U. B. Angadi
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Anil Rai
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | | | - Dinesh Kumar
- Centre for Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| |
Collapse
|
3
|
Mukherjee S, Sokol N. Resources and Methods for the Analysis of MicroRNA Function in Drosophila. Methods Mol Biol 2022; 2540:79-92. [PMID: 35980573 DOI: 10.1007/978-1-0716-2541-5_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Since the widespread discovery of microRNAs (miRNAs) 20 years ago, the Drosophila melanogaster model system has made important contributions to understanding the biology of this class of noncoding RNAs. These contributions are based on the amenability of this model system not only for biochemical analysis but molecular, genetic, and cell biological analyses as well. Nevertheless, while the Drosophila genome is now known to encode 258 miRNA precursors, the function of only a small minority of these have been well characterized. In this review, we summarize the current resources and methods that are available to study miRNA function in Drosophila with a particular focus on the large-scale resources that enable systematic analysis. Application of these methods will accelerate the discovery of ways that miRNAs are embedded into genetic networks that control basic features of metazoan cells.
Collapse
Affiliation(s)
| | - Nicholas Sokol
- Department of Biology, Indiana University, Bloomington, IN, USA.
| |
Collapse
|
4
|
Abstract
Less than 2% of mammalian genomes code for proteins, but 'the majority of its bases can be found in primary transcripts' - a phenomenon termed the pervasive transcription, which was first reported in 2007. Even though most of the transcripts do not code for proteins, they play a variety of biological functions, with regulation of gene expression appearing as the most common one. Those transcripts are divided into two groups based on their length: small non-coding RNAs, which are maximally 200 bp long, and long non-coding RNAs (lncRNAs), which are longer than 200 nucleotides. The advances in next-generation sequencing methods provided a new possibility of investigating the full set of RNA molecules in the cell. In this review, we summarized the current state of knowledge on lncRNAs in three major livestock species - Sus scrofa, Bos taurus and Gallus gallus, based on the literature and the content of biological databases. In the NONCODE database, the largest number of identified lncRNA transcripts is available for pigs, but cattle have the largest number of lncRNA genes. Poultry is represented by less than a half of records. Genomic annotation of lncRNAs showed that the majority of them are assigned to introns (pig, poultry) or intergenic (cattle). The comparison with well-annotated human and mouse genomes indicates that such annotation is a result of lack of proper lncRNA annotation data. Since lncRNAs play an important role in genomic studies, their characterization in farm animals' genomes is critical in bridging the gap between genotype and phenotype.
Collapse
|
5
|
Gonçalves-Pimentel C, Mazaud D, Kottler B, Proelss S, Hirth F, Fanto M. A miRNA screen procedure identifies garz as an essential factor in adult glia functions and validates Drosophila as a beneficial 3Rs model to study glial functions and GBF1 biology. F1000Res 2020; 9:317. [PMID: 32595956 PMCID: PMC7309417 DOI: 10.12688/f1000research.23154.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 04/21/2020] [Indexed: 03/21/2024] Open
Abstract
Invertebrate glia performs most of the key functions controlled by mammalian glia in the nervous system and provides an ideal model for genetic studies of glial functions. To study the influence of adult glial cells in ageing we have performed a genetic screen in Drosophila using a collection of transgenic lines providing conditional expression of micro-RNAs (miRNAs). Here, we describe a methodological algorithm to identify and rank genes that are candidate to be targeted by miRNAs that shorten lifespan when expressed in adult glia. We have used four different databases for miRNA target prediction in Drosophila but find little agreement between them, overall. However, top candidate gene analysis shows potential to identify essential genes involved in adult glial functions. One example from our top candidates' analysis is gartenzwerg ( garz). We establish that garz is necessary in many glial cell types, that it affects motor behaviour and, at the sub-cellular level, is responsible for defects in cellular membranes, autophagy and mitochondria quality control. We also verify the remarkable conservation of functions between garz and its mammalian orthologue, GBF1, validating the use of Drosophila as an alternative 3Rs-beneficial model to knock-out mice for studying the biology of GBF1, potentially involved in human neurodegenerative diseases.
Collapse
Affiliation(s)
- Catarina Gonçalves-Pimentel
- Department of Basic and Clinical Neuroscience, King's College London, London, SE5 9NU, UK
- Champalimaud Research, Champalimaud Foundation, Av. Brasília, Lisbon, 1400-038, Portugal
| | - David Mazaud
- Department of Basic and Clinical Neuroscience, King's College London, London, SE5 9NU, UK
| | - Benjamin Kottler
- Department of Basic and Clinical Neuroscience, King's College London, London, SE5 9NU, UK
| | - Sandra Proelss
- Department of Basic and Clinical Neuroscience, King's College London, London, SE5 9NU, UK
| | - Frank Hirth
- Department of Basic and Clinical Neuroscience, King's College London, London, SE5 9NU, UK
| | - Manolis Fanto
- Department of Basic and Clinical Neuroscience, King's College London, London, SE5 9NU, UK
- Institut du Cerveau et de la Moelle épinière (ICM), 47, bd de l'hôpital, Paris, F-75013, France
| |
Collapse
|
6
|
Gonçalves-Pimentel C, Mazaud D, Kottler B, Proelss S, Hirth F, Fanto M. A miRNA screen procedure identifies garz as an essential factor in adult glia functions and validates Drosophila as a beneficial 3Rs model to study glial functions and GBF1 biology. F1000Res 2020; 9:317. [PMID: 32595956 PMCID: PMC7309417 DOI: 10.12688/f1000research.23154.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/20/2020] [Indexed: 11/25/2022] Open
Abstract
Invertebrate glia performs most of the key functions controlled by mammalian glia in the nervous system and provides an ideal model for genetic studies of glial functions. To study the influence of adult glial cells in ageing we have performed a genetic screen in Drosophila using a collection of transgenic lines providing conditional expression of micro-RNAs (miRNAs). Here, we describe a methodological algorithm to identify and rank genes that are candidate to be targeted by miRNAs that shorten lifespan when expressed in adult glia. We have used four different databases for miRNA target prediction in Drosophila but find little agreement between them, overall. However, top candidate gene analysis shows potential to identify essential genes involved in adult glial functions. One example from our top candidates' analysis is gartenzwerg ( garz). We establish that garz is necessary in many glial cell types, that it affects motor behaviour and, at the sub-cellular level, is responsible for defects in cellular membranes, autophagy and mitochondria quality control. We also verify the remarkable conservation of functions between garz and its mammalian orthologue, GBF1, validating the use of Drosophila as an alternative 3Rs-beneficial model to knock-out mice for studying the biology of GBF1, potentially involved in human neurodegenerative diseases.
Collapse
Affiliation(s)
- Catarina Gonçalves-Pimentel
- Department of Basic and Clinical Neuroscience, King's College London, London, SE5 9NU, UK
- Champalimaud Research, Champalimaud Foundation, Av. Brasília, Lisbon, 1400-038, Portugal
| | - David Mazaud
- Department of Basic and Clinical Neuroscience, King's College London, London, SE5 9NU, UK
| | - Benjamin Kottler
- Department of Basic and Clinical Neuroscience, King's College London, London, SE5 9NU, UK
| | - Sandra Proelss
- Department of Basic and Clinical Neuroscience, King's College London, London, SE5 9NU, UK
| | - Frank Hirth
- Department of Basic and Clinical Neuroscience, King's College London, London, SE5 9NU, UK
| | - Manolis Fanto
- Department of Basic and Clinical Neuroscience, King's College London, London, SE5 9NU, UK
- Institut du Cerveau et de la Moelle épinière (ICM), 47, bd de l'hôpital, Paris, F-75013, France
| |
Collapse
|
7
|
Kim DH, Shin M, Jung SH, Kim YJ, Jones WD. A fat-derived metabolite regulates a peptidergic feeding circuit in Drosophila. PLoS Biol 2017; 15:e2000532. [PMID: 28350856 PMCID: PMC5369665 DOI: 10.1371/journal.pbio.2000532] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2016] [Accepted: 02/28/2017] [Indexed: 12/04/2022] Open
Abstract
Here, we show that the enzymatic cofactor tetrahydrobiopterin (BH4) inhibits feeding in Drosophila. BH4 biosynthesis requires the sequential action of the conserved enzymes Punch, Purple, and Sepiapterin Reductase (Sptr). Although we observe increased feeding upon loss of Punch and Purple in the adult fat body, loss of Sptr must occur in the brain. We found Sptr expression is required in four adult neurons that express neuropeptide F (NPF), the fly homologue of the vertebrate appetite regulator neuropeptide Y (NPY). As expected, feeding flies BH4 rescues the loss of Punch and Purple in the fat body and the loss of Sptr in NPF neurons. Mechanistically, we found BH4 deficiency reduces NPF staining, likely by promoting its release, while excess BH4 increases NPF accumulation without altering its expression. We thus show that, because of its physically distributed biosynthesis, BH4 acts as a fat-derived signal that induces satiety by inhibiting the activity of the NPF neurons.
Collapse
Affiliation(s)
- Do-Hyoung Kim
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South Korea
| | - Minjung Shin
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South Korea
| | - Sung-Hwan Jung
- School of Life Sciences, Gwangju Institute of Science and Technology (GIST), Gwangju, South Korea
| | - Young-Joon Kim
- School of Life Sciences, Gwangju Institute of Science and Technology (GIST), Gwangju, South Korea
| | - Walton D. Jones
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South Korea
| |
Collapse
|