51
|
Backes C, Meder B, Hart M, Ludwig N, Leidinger P, Vogel B, Galata V, Roth P, Menegatti J, Grässer F, Ruprecht K, Kahraman M, Grossmann T, Haas J, Meese E, Keller A. Prioritizing and selecting likely novel miRNAs from NGS data. Nucleic Acids Res 2015; 44:e53. [PMID: 26635395 PMCID: PMC4824081 DOI: 10.1093/nar/gkv1335] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2015] [Accepted: 11/16/2015] [Indexed: 12/16/2022] Open
Abstract
Small non-coding RNAs play a key role in many physiological and pathological processes. Since 2004, miRNA sequences have been catalogued in miRBase, which is currently in its 21st version. We investigated sequence and structural features of miRNAs annotated in the miRBase and compared them between different versions of this reference database. We have identified that the two most recent releases (v20 and v21) are influenced by next-generation sequencing based miRNA predictions and show significant deviation from miRNAs discovered prior to the high-throughput profiling period. From the analysis of miRBase, we derived a set of key characteristics to predict new miRNAs and applied the implemented algorithm to evaluate novel blood-borne miRNA candidates. We carried out 705 individual whole miRNA sequencings of blood cells and collected a total of 9.7 billion reads. Using miRDeep2 we initially predicted 1452 potentially novel miRNAs. After excluding false positives, 518 candidates remained. These novel candidates were ranked according to their distance to the features in the early miRBase versions allowing for an easier selection of a subset of putative miRNAs for validation. Selected candidates were successfully validated by qRT-PCR and northern blotting. In addition, we implemented a web-server for ranking potential miRNA candidates, which is available at: www.ccb.uni-saarland.de/novomirank.
Collapse
Affiliation(s)
- Christina Backes
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, Germany
| | - Benjamin Meder
- Internal Medicine II, University Hospital Heidelberg, Heidelberg, Germany
| | - Martin Hart
- Department of Human Genetics, Saarland University, Homburg, Germany
| | - Nicole Ludwig
- Department of Human Genetics, Saarland University, Homburg, Germany
| | - Petra Leidinger
- Department of Human Genetics, Saarland University, Homburg, Germany
| | - Britta Vogel
- Internal Medicine II, University Hospital Heidelberg, Heidelberg, Germany
| | - Valentina Galata
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, Germany
| | - Patrick Roth
- University Hospital Zurich, Department of Neurology and University of Zurich, Switzerland
| | - Jennifer Menegatti
- Department of Virology, Saarland University Medical School, Homburg, Germany
| | - Friedrich Grässer
- Department of Virology, Saarland University Medical School, Homburg, Germany
| | - Klemens Ruprecht
- Department of Neurology, Charité Universitätsmedizin Berlin, Germany
| | - Mustafa Kahraman
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, Germany
| | - Thomas Grossmann
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, Germany
| | - Jan Haas
- Internal Medicine II, University Hospital Heidelberg, Heidelberg, Germany
| | - Eckart Meese
- Department of Human Genetics, Saarland University, Homburg, Germany
| | - Andreas Keller
- Chair for Clinical Bioinformatics, Saarland University, Saarbrücken, Germany
| |
Collapse
|
52
|
Quek C, Jung CH, Bellingham SA, Lonie A, Hill AF. iSRAP - a one-touch research tool for rapid profiling of small RNA-seq data. J Extracell Vesicles 2015; 4:29454. [PMID: 26561006 PMCID: PMC4641893 DOI: 10.3402/jev.v4.29454] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2015] [Revised: 10/12/2015] [Accepted: 10/14/2015] [Indexed: 12/23/2022] Open
Abstract
Small non-coding RNAs have been significantly recognized as the key modulators in many biological processes, and are emerging as promising biomarkers for several diseases. These RNA species are transcribed in cells and can be packaged in extracellular vesicles, which are small vesicles released from many biotypes, and are involved in intercellular communication. Currently, the advent of next-generation sequencing (NGS) technology for high-throughput profiling has further advanced the biological insights of non-coding RNA on a genome-wide scale and has become the preferred approach for the discovery and quantification of non-coding RNA species. Despite the routine practice of NGS, the processing of large data sets poses difficulty for analysis before conducting downstream experiments. Often, the current analysis tools are designed for specific RNA species, such as microRNA, and are limited in flexibility for modifying parameters for optimization. An analysis tool that allows for maximum control of different software is essential for drawing concrete conclusions for differentially expressed transcripts. Here, we developed a one-touch integrated small RNA analysis pipeline (iSRAP) research tool that is composed of widely used tools for rapid profiling of small RNAs. The performance test of iSRAP using publicly and in-house available data sets shows its ability of comprehensive profiling of small RNAs of various classes, and analysis of differentially expressed small RNAs. iSRAP offers comprehensive analysis of small RNA sequencing data that leverage informed decisions on the downstream analyses of small RNA studies, including extracellular vesicles such as exosomes.
Collapse
Affiliation(s)
- Camelia Quek
- Department of Biochemistry and Molecular Biology, Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Melbourne, VIC, Australia
| | - Chol-Hee Jung
- Victorian Life Sciences Computation Initiative (VLSCI), The University of Melbourne, Melbourne, VIC, Australia
| | - Shayne A Bellingham
- Department of Biochemistry and Molecular Biology, Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Melbourne, VIC, Australia
| | - Andrew Lonie
- Victorian Life Sciences Computation Initiative (VLSCI), The University of Melbourne, Melbourne, VIC, Australia
| | - Andrew F Hill
- Department of Biochemistry and Molecular Biology, Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Melbourne, VIC, Australia.,Department of Biochemistry and Genetics, La Trobe Institute for Molecular Science, La Trobe University, Melbourne, VIC, Australia;
| |
Collapse
|
53
|
Evers M, Huttner M, Dueck A, Meister G, Engelmann JC. miRA: adaptable novel miRNA identification in plants using small RNA sequencing data. BMC Bioinformatics 2015; 16:370. [PMID: 26542525 PMCID: PMC4635600 DOI: 10.1186/s12859-015-0798-3] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2015] [Accepted: 10/22/2015] [Indexed: 12/21/2022] Open
Abstract
Background MicroRNAs (miRNAs) are short regulatory RNAs derived from longer precursor RNAs. miRNA biogenesis has been studied in animals and plants, recently elucidating more complex aspects, such as non-conserved, species-specific, and heterogeneous miRNA precursor populations. Small RNA sequencing data can help in computationally identifying genomic loci of miRNA precursors. The challenge is to predict a valid miRNA precursor from inhomogeneous read coverage from a complex RNA library: while the mature miRNA typically produces many sequence reads, the remaining part of the precursor is covered very sparsely. As recent results suggest, alternative miRNA biogenesis pathways may lead to a more diverse miRNA precursor population than previously assumed. In plants, the latter manifests itself in e.g. complex secondary structures and expression from multiple loci within precursors. Current miRNA identification algorithms often depend on already existing gene annotation, and/or make use of specific miRNA precursor features such as precursor lengths, secondary structures etc. Consequently and in view of the emerging new understanding of a more complex miRNA biogenesis in plants, current tools may fail to characterise organism-specific and heterogeneous miRNA populations. Results miRA is a new tool to identify miRNA precursors in plants, allowing for heterogeneous and complex precursor populations. miRA requires small RNA sequencing data and a corresponding reference genome, and evaluates precursor secondary structures and precursor processing accuracy; key parameters can be adapted based on the specific organism under investigation. We show that miRA outperforms the currently best plant miRNA prediction tools both in sensitivity and specificity, for data involving Arabidopsis thaliana and the Volvocine algae Chlamydomonas reinhardtii; the latter organism has been shown to exhibit a heterogeneous and complex precursor population with little cross-species miRNA sequence conservation, and therefore constitutes an ideal model organism. Furthermore we identify novel miRNAs in the Chlamydomonas-related organism Volvox carteri. Conclusions We propose miRA, a new plant miRNA identification tool that is well adapted to complex precursor populations. miRA is particularly suited for organisms with no existing miRNA annotation, or without a known related organism with well characterized miRNAs. Moreover, miRA has proven its ability to identify species-specific miRNAs. miRA is flexible in its parameter settings, and produces user-friendly output files in various formats (pdf, csv, genome-browser-suitable annotation files, etc.). It is freely available at https://github.com/mhuttner/miRA. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0798-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Maurits Evers
- Institute of Functional Genomics, University of Regensburg, Regensburg, Germany.
| | - Michael Huttner
- Institute of Functional Genomics, University of Regensburg, Regensburg, Germany
| | - Anne Dueck
- Biochemistry Center Regensburg (BZR), Laboratory for RNA Biology, University of Regensburg, Regensburg, Germany
| | - Gunter Meister
- Biochemistry Center Regensburg (BZR), Laboratory for RNA Biology, University of Regensburg, Regensburg, Germany
| | - Julia C Engelmann
- Institute of Functional Genomics, University of Regensburg, Regensburg, Germany
| |
Collapse
|
54
|
Ku YS, Wong JWH, Mui Z, Liu X, Hui JHL, Chan TF, Lam HM. Small RNAs in Plant Responses to Abiotic Stresses: Regulatory Roles and Study Methods. Int J Mol Sci 2015; 16:24532-54. [PMID: 26501263 PMCID: PMC4632763 DOI: 10.3390/ijms161024532] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2015] [Revised: 09/21/2015] [Accepted: 10/08/2015] [Indexed: 12/31/2022] Open
Abstract
To survive under abiotic stresses in the environment, plants trigger a reprogramming of gene expression, by transcriptional regulation or translational regulation, to turn on protective mechanisms. The current focus of research on how plants cope with abiotic stresses has transitioned from transcriptomic analyses to small RNA investigations. In this review, we have summarized and evaluated the current methodologies used in the identification and validation of small RNAs and their targets, in the context of plant responses to abiotic stresses.
Collapse
Affiliation(s)
- Yee-Shan Ku
- Center for Soybean Research of State Key Laboratory of Agrobiotechnology and School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong.
| | - Johanna Wing-Hang Wong
- Center for Soybean Research of State Key Laboratory of Agrobiotechnology and School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong.
| | - Zeta Mui
- Center for Soybean Research of State Key Laboratory of Agrobiotechnology and School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong.
| | - Xuan Liu
- Department of Computer Science, The University of Hong Kong, Pokfulam, Hong Kong.
| | - Jerome Ho-Lam Hui
- Center for Soybean Research of State Key Laboratory of Agrobiotechnology and School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong.
| | - Ting-Fung Chan
- Center for Soybean Research of State Key Laboratory of Agrobiotechnology and School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong.
| | - Hon-Ming Lam
- Center for Soybean Research of State Key Laboratory of Agrobiotechnology and School of Life Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong.
| |
Collapse
|
55
|
Nandety RS, Sharif A, Kamita SG, Ramasamy A, Falk BW. Identification of Novel and Conserved microRNAs in Homalodisca vitripennis, the Glassy-Winged Sharpshooter by Expression Profiling. PLoS One 2015; 10:e0139771. [PMID: 26440407 PMCID: PMC4595010 DOI: 10.1371/journal.pone.0139771] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2015] [Accepted: 09/17/2015] [Indexed: 11/21/2022] Open
Abstract
The glassy-winged sharpshooter (GWSS) Homalodisca vitripennis (Hemiptera: Cicadellidae), is a xylem-feeding leafhopper and an important vector of the bacterium Xylella fastidiosa; the causal agent of Pierce’s disease of grapevines. MicroRNAs are a class of small RNAs that play an important role in the functional development of various organisms including insects. In H. vitripennis, we identified microRNAs using high-throughput deep sequencing of adults followed by computational and manual annotation. A total of 14 novel microRNAs that are not found in the miRBase were identified from adult H. vitripennis. Conserved microRNAs were also found in our datasets. By comparison to our previously determined transcriptome sequence of H. vitripennis, we identified the potential targets of the microRNAs in the transcriptome. This microRNA profile information not only provides a more nuanced understanding of the biological and physiological mechanisms that govern gene expression in H. vitripennis, but may also lead to the identification of novel mechanisms for biorationally designed management strategies through the use of microRNAs.
Collapse
Affiliation(s)
- Raja Sekhar Nandety
- Department of Plant Pathology, University of California, Davis, California, United States of America
| | - Almas Sharif
- Department of Plant Pathology, University of California, Davis, California, United States of America
| | - Shizuo G. Kamita
- Department of Entomology & Nematology, University of California, Davis, California, United States of America
| | - Asokan Ramasamy
- Division of Biotechnology, Indian Institute of Horticultural Research, Bangalore, India
| | - Bryce W. Falk
- Department of Plant Pathology, University of California, Davis, California, United States of America
- * E-mail:
| |
Collapse
|
56
|
Qin Q, Wang J, Dai J, Wang Y, Liu Y, Liu S. Induced All-Female Autotriploidy in the Allotetraploids of Carassius auratus red var. (♀) × Megalobrama amblycephala (♂). MARINE BIOTECHNOLOGY (NEW YORK, N.Y.) 2015; 17:1-7. [PMID: 26242753 DOI: 10.1007/s10126-014-9593-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2014] [Accepted: 07/15/2014] [Indexed: 05/27/2023]
Abstract
Following activation by UV-irradiated BSB sperm, the fertilized eggs of tetraploid hybrids (abbreviated as 4nF1) (4n = 148, AABB) of Carassius auratus red var. (abbreviated as RCC) (2n = 100, AA) (♀) × Megalobrama amblycephala (abbreviated as BSB) (2n = 48, BB) (♂) developed into normal live gynogenetic offspring without chromosome doubling treatment. Some of these were autotriploids with three sets of red crucian carp chromosomes (abbreviated as G1) (3n = 150, AAA). G1 were all-females, and can produce unreduced (3n) eggs at age 1 year. After activation by UV-irradiated BSB sperm, the fertilized eggs of G1 developed into a second generation of autotriploid gynogenetic offspring (abbreviated as G2) (3n = 150) without chromosome doubling treatment. G1 were obviously different from both 4nF1 and RCC in their morphological traits and showed a significantly higher growth rate than RCC. In aquaculture, the autotriploid fish could provide an important source of gametes for the production of all-female triploid fish and for the establishment of autotriploid gynogenetic lines.
Collapse
Affiliation(s)
- Qinbo Qin
- Key Laboratory of Protein Chemistry and Developmental Biology of the State Education Ministry of China, College of Life Sciences, Hunan Normal University, Changsha, 410081, People's Republic of China
| | | | | | | | | | | |
Collapse
|
57
|
Xue B. Improving MiRNA prediction accuracy by deep learning strategies. PROCEEDINGS OF THE 6TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND HEALTH INFORMATICS 2015. [DOI: 10.1145/2808719.2811417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
Affiliation(s)
- Bin Xue
- University of South Florida, Tampa, FL
| |
Collapse
|
58
|
Peace RJ, Biggar KK, Storey KB, Green JR. A framework for improving microRNA prediction in non-human genomes. Nucleic Acids Res 2015; 43:e138. [PMID: 26163062 PMCID: PMC4787757 DOI: 10.1093/nar/gkv698] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2014] [Accepted: 06/28/2015] [Indexed: 11/12/2022] Open
Abstract
The prediction of novel pre-microRNA (miRNA) from genomic sequence has received considerable attention recently. However, the majority of studies have focused on the human genome. Previous studies have demonstrated that sensitivity (correctly detecting true miRNA) is sustained when human-trained methods are applied to other species, however they have failed to report the dramatic drop in specificity (the ability to correctly reject non-miRNA sequences) in non-human genomes. Considering the ratio of true miRNA sequences to pseudo-miRNA sequences is on the order of 1:1000, such low specificity prevents the application of most existing tools to non-human genomes, as the number of false positives overwhelms the true predictions. We here introduce a framework (SMIRP) for creating species-specific miRNA prediction systems, leveraging sequence conservation and phylogenetic distance information. Substantial improvements in specificity and precision are obtained for four non-human test species when our framework is applied to three different prediction systems representing two types of classifiers (support vector machine and Random Forest), based on three different feature sets, with both human-specific and taxon-wide training data. The SMIRP framework is potentially applicable to all miRNA prediction systems and we expect substantial improvement in precision and specificity, while sustaining sensitivity, independent of the machine learning technique chosen.
Collapse
Affiliation(s)
- Robert J Peace
- Department of Systems and Computer Engineering, Carleton University, Ottawa, Canada
| | - Kyle K Biggar
- Institute of Biochemistry and Department of Biology, Carleton University, Ottawa, Canada Department of Biochemistry, University of Western Ontario, London, Canada
| | - Kenneth B Storey
- Institute of Biochemistry and Department of Biology, Carleton University, Ottawa, Canada
| | - James R Green
- Department of Systems and Computer Engineering, Carleton University, Ottawa, Canada
| |
Collapse
|
59
|
Verdelli C, Forno I, Vaira V, Corbetta S. Epigenetic alterations in human parathyroid tumors. Endocrine 2015; 49:324-32. [PMID: 25722013 DOI: 10.1007/s12020-015-0555-4] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/16/2014] [Accepted: 02/13/2015] [Indexed: 12/14/2022]
Abstract
Epigenetics alterations are involved in tumorigenesis and have been identified in endocrine neoplasia. In particular, DNA methylation, microRNAs deregulations and histone methylation impairment are detected in tumors of the parathyroid glands. Parathyroid tumors are the second most common endocrine neoplasia following thyroid cancer in women, and it is associated with primary hyperparathyroidism, a disease sustained by PTH hypersecretion. Despite the hallmark of global promoter hypomethylations was not detectable in parathyroid tumors, increase of hypermethylation in specific CpG islands was detected in the progression from benign to malignant parathyroid tumors. Furthermore, deregulation of a panel of embryonic-related microRNAs (miRNAs) was documented in parathyroid tumors compared with normal glands. Impaired expression of the histone methyltransferases EZH2, BMI1, and RIZ1 have been described in parathyroid tumors. Moreover, histone methyltransferases have been shown to be modulated by the oncosuppressors HIC1, MEN1, and HRPT2/CDC73 gene products that characterize tumorigenesis of parathyroid adenomas and carcinomas, respectively. The epigenetic scenario in parathyroid tumors have just began to be decoded but emerging data highlight the involvement of an embryonic gene signature in parathyroid tumor development.
Collapse
Affiliation(s)
- Chiara Verdelli
- Laboratory of Molecular Biology, IRCCS Policlinico San Donato, San Donato Milanese, MI, Italy
| | | | | | | |
Collapse
|
60
|
Higashi S, Fournier C, Gautier C, Gaspin C, Sagot MF. Mirinho: An efficient and general plant and animal pre-miRNA predictor for genomic and deep sequencing data. BMC Bioinformatics 2015; 16:179. [PMID: 26022464 PMCID: PMC4448272 DOI: 10.1186/s12859-015-0594-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Accepted: 04/23/2015] [Indexed: 12/13/2022] Open
Abstract
Background Several methods exist for the prediction of precursor miRNAs (pre-miRNAs) in genomic or sRNA-seq (small RNA sequences) data produced by NGS (Next Generation Sequencing). One key information used for this task is the characteristic hairpin structure adopted by pre-miRNAs, that in general are identified using RNA folders whose complexity is cubic in the size of the input. The vast majority of pre-miRNA predictors then rely on further information learned from previously validated miRNAs from the same or a closely related genome for the final prediction of new miRNAs. With this paper, we wished to address three main issues. The first was methodological and aimed at obtaining a more time-efficient predictor, however without losing in accuracy which represented a second issue. We indeed aimed at better predicting miRNAs at a genome scale, but also from sRNAseq data where in some cases, notably of plants, the current folding methods often infer the wrong structure. The third issue is related to the fact that it is important to rely as little as possible on previously recorded examples of miRNAs. We therefore also sought a method that is less dependent on previous miRNA records. Results As concerns the first and second issues, we present a novel alternative to a classical folder based on a thermodynamic Nearest-Neighbour (NN) model for computing the free energy and predicting the classical hairpin structure of a pre-miRNA. We show that the free energies thus computed correlate well with those of RNAfold. This novel method, called Mirinho, has quadratic instead of cubic complexity and is much more efficient also in practice. When applied to sRNAseq data of plants, it gives in general better results than classical folders. On the third issue, we show that Mirinho, which uses as only knowledge the length of the loops and stem-arms and the free energy of the pre-miRNA hairpin, compares well with algorithms that require more information. The results, obtained with different datasets, are indeed similar to those of other approaches with which such a comparison was possible. These needed to be publicly available softwares that could be used on a large input. In some cases, Mirinho is even better in terms of sensitivity or precision. Conclusion We provide a simpler and much faster method with very reasonable sensitivity and precision, which can be applied without special adaptation to the prediction of both animal and plant pre-miRNAs, using as input either genomic sequences or sRNA-seq data. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0594-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Susan Higashi
- ERABLE team, Inria Grenoble Rhône-Alpes, Montbonnot Saint-Martin, 38330, France. .,Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, Villeurbanne, F-69622, France.
| | - Cyril Fournier
- Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, Villeurbanne, F-69622, France.
| | - Christian Gautier
- ERABLE team, Inria Grenoble Rhône-Alpes, Montbonnot Saint-Martin, 38330, France. .,Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, Villeurbanne, F-69622, France.
| | - Christine Gaspin
- INRA, UBIA & Plateforme Bioinformatique, 24 Chemin de Borde Rouge, Auzeville, POBOX 5627, Castanet Tolosan, 31326, France.
| | - Marie-France Sagot
- ERABLE team, Inria Grenoble Rhône-Alpes, Montbonnot Saint-Martin, 38330, France. .,Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, Villeurbanne, F-69622, France.
| |
Collapse
|
61
|
Zhao F, Wang Z, Lang H, Liu X, Zhang D, Wang X, Zhang T, Wang R, Shi P, Pang X. Dynamic Expression of Novel MiRNA Candidates and MiRNA-34 Family Members in Early- to Mid-Gestational Fetal Keratinocytes Contributes to Scarless Wound Healing by Targeting the TGF-β Pathway. PLoS One 2015; 10:e0126087. [PMID: 25978377 PMCID: PMC4433274 DOI: 10.1371/journal.pone.0126087] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2014] [Accepted: 03/30/2015] [Indexed: 12/21/2022] Open
Abstract
Background Early- to mid-gestational fetal mammalian skin wounds heal rapidly and without scarring. Keratinocytes (KCs) have been found to exert important effects on the regulation of fibroblasts. There may be significant differences of gestational fetal KCs at different ages. The advantages in early- to mid-gestational fetal KCs could lead to fetal scarless wound healing. Methods KCs from six human fetal skin samples were divided into two groups: a mid-gestation group (less than 28 weeks of gestational age) and a late-gestation group (more than 28 weeks of gestational age). RNA extracted from KCs was used to prepare a library of small RNAs for next-generation sequencing (NGS). To uncover potential novel microRNA (miRNAs), the mirTools 2.0 web server was used to identify candidate novel human miRNAs from the NGS data. Other bioinformatical analyses were used to further validate the novel miRNAs. The expression levels of the miRNAs were further confirmed by real-time quantitative RT-PCR. Results A total of 61.59 million reads were mapped to 1,170 known human miRNAs in miRBase. Among a total of 202 potential novel miRNAs uncovered, 106 candidates have a higher probability of being novel human miRNAs. A total of 110 miRNAs, including 22 novel miRNA candidates, were significantly differently expressed between mid- and late-gestational fetal KCs. Thirty-three differentially expressed miRNAs and miR-34 family members are correlated with the transforming growth factor-β (TGF-β) pathway. Conclusions Taken together, our results provide compelling evidence supporting the existence of 106 novel miRNAs and the dynamic expression of miRNAs that extensively targets the TGF-β pathway at different gestational ages in fetal KCs. MiRNAs showing altered expression at different gestational ages in fetal KCs may contribute to scarless wound healing in early- to mid-gestational fetal KCs, and thus may be new targets for potential scar prevention and reduction therapies.
Collapse
Affiliation(s)
- Feng Zhao
- Department of Stem Cells and Regenerative Medicine, Key Laboratory of Cell Biology, Ministry of Public Health and Key Laboratory of Medical Cell Biology, Ministry of Education, China Medical University, 77 Puhe Street, Shenbei New District, Shenyang City 110013, Liaoning Province, China
| | - Zhe Wang
- Department of Blood Transfusion, Shengjing Hospital of China Medical University, 39 Huaxiang Street, Tiexi District, Shenyang City 110004, Liaoning Province, China
| | - Hongxin Lang
- Department of Stem Cells and Regenerative Medicine, Key Laboratory of Cell Biology, Ministry of Public Health and Key Laboratory of Medical Cell Biology, Ministry of Education, China Medical University, 77 Puhe Street, Shenbei New District, Shenyang City 110013, Liaoning Province, China
| | - Xiaoyu Liu
- Department of Stem Cells and Regenerative Medicine, Key Laboratory of Cell Biology, Ministry of Public Health and Key Laboratory of Medical Cell Biology, Ministry of Education, China Medical University, 77 Puhe Street, Shenbei New District, Shenyang City 110013, Liaoning Province, China
| | - Dianbao Zhang
- Department of Stem Cells and Regenerative Medicine, Key Laboratory of Cell Biology, Ministry of Public Health and Key Laboratory of Medical Cell Biology, Ministry of Education, China Medical University, 77 Puhe Street, Shenbei New District, Shenyang City 110013, Liaoning Province, China
| | - Xiliang Wang
- Department of Stem Cells and Regenerative Medicine, Key Laboratory of Cell Biology, Ministry of Public Health and Key Laboratory of Medical Cell Biology, Ministry of Education, China Medical University, 77 Puhe Street, Shenbei New District, Shenyang City 110013, Liaoning Province, China
| | - Tao Zhang
- Department of Stem Cells and Regenerative Medicine, Key Laboratory of Cell Biology, Ministry of Public Health and Key Laboratory of Medical Cell Biology, Ministry of Education, China Medical University, 77 Puhe Street, Shenbei New District, Shenyang City 110013, Liaoning Province, China
| | - Rui Wang
- Department of Stem Cells and Regenerative Medicine, Key Laboratory of Cell Biology, Ministry of Public Health and Key Laboratory of Medical Cell Biology, Ministry of Education, China Medical University, 77 Puhe Street, Shenbei New District, Shenyang City 110013, Liaoning Province, China
| | - Ping Shi
- Department of General Practice, First Hospital of China Medical University, 155 North Nanjing Street, Heping District, Shenyang City 110001, Liaoning Province, China
| | - Xining Pang
- Department of Stem Cells and Regenerative Medicine, Key Laboratory of Cell Biology, Ministry of Public Health and Key Laboratory of Medical Cell Biology, Ministry of Education, China Medical University, 77 Puhe Street, Shenbei New District, Shenyang City 110013, Liaoning Province, China
- * E-mail:
| |
Collapse
|
62
|
Karathanasis N, Tsamardinos I, Poirazi P. MiRduplexSVM: A High-Performing MiRNA-Duplex Prediction and Evaluation Methodology. PLoS One 2015; 10:e0126151. [PMID: 25961860 PMCID: PMC4427487 DOI: 10.1371/journal.pone.0126151] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2014] [Accepted: 03/29/2015] [Indexed: 12/26/2022] Open
Abstract
We address the problem of predicting the position of a miRNA duplex on a microRNA hairpin via the development and application of a novel SVM-based methodology. Our method combines a unique problem representation and an unbiased optimization protocol to learn from mirBase19.0 an accurate predictive model, termed MiRduplexSVM. This is the first model that provides precise information about all four ends of the miRNA duplex. We show that (a) our method outperforms four state-of-the-art tools, namely MaturePred, MiRPara, MatureBayes, MiRdup as well as a Simple Geometric Locator when applied on the same training datasets employed for each tool and evaluated on a common blind test set. (b) In all comparisons, MiRduplexSVM shows superior performance, achieving up to a 60% increase in prediction accuracy for mammalian hairpins and can generalize very well on plant hairpins, without any special optimization. (c) The tool has a number of important applications such as the ability to accurately predict the miRNA or the miRNA*, given the opposite strand of a duplex. Its performance on this task is superior to the 2nts overhang rule commonly used in computational studies and similar to that of a comparative genomic approach, without the need for prior knowledge or the complexity of performing multiple alignments. Finally, it is able to evaluate novel, potential miRNAs found either computationally or experimentally. In relation with recent confidence evaluation methods used in miRBase, MiRduplexSVM was successful in identifying high confidence potential miRNAs.
Collapse
Affiliation(s)
- Nestoras Karathanasis
- Department of Biology, University of Crete, Heraklion, Greece
- Institute of Molecular Biology and Biotechnology (IMBB), Foundation of Research and Technology Hellas (FORTH), Heraklion, Greece
| | - Ioannis Tsamardinos
- Department of Computer Science, University of Crete, Heraklion, Greece
- Institute of Computer Science (ICS), Foundation of Research and Technology Hellas (FORTH), Heraklion, Greece
| | - Panayiota Poirazi
- Institute of Molecular Biology and Biotechnology (IMBB), Foundation of Research and Technology Hellas (FORTH), Heraklion, Greece
- * E-mail:
| |
Collapse
|
63
|
Tran VDT, Tempel S, Zerath B, Zehraoui F, Tahi F. miRBoost: boosting support vector machines for microRNA precursor classification. RNA (NEW YORK, N.Y.) 2015; 21:775-85. [PMID: 25795417 PMCID: PMC4408786 DOI: 10.1261/rna.043612.113] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/22/2013] [Accepted: 12/20/2014] [Indexed: 06/04/2023]
Abstract
Identification of microRNAs (miRNAs) is an important step toward understanding post-transcriptional gene regulation and miRNA-related pathology. Difficulties in identifying miRNAs through experimental techniques combined with the huge amount of data from new sequencing technologies have made in silico discrimination of bona fide miRNA precursors from non-miRNA hairpin-like structures an important topic in bioinformatics. Among various techniques developed for this classification problem, machine learning approaches have proved to be the most promising. However these approaches require the use of training data, which is problematic due to an imbalance in the number of miRNAs (positive data) and non-miRNAs (negative data), which leads to a degradation of their performance. In order to address this issue, we present an ensemble method that uses a boosting technique with support vector machine components to deal with imbalanced training data. Classification is performed following a feature selection on 187 novel and existing features. The algorithm, miRBoost, performed better in comparison with state-of-the-art methods on imbalanced human and cross-species data. It also showed the highest ability among the tested methods for discovering novel miRNA precursors. In addition, miRBoost was over 1400 times faster than the second most accurate tool tested and was significantly faster than most of the other tools. miRBoost thus provides a good compromise between prediction efficiency and execution time, making it highly suitable for use in genome-wide miRNA precursor prediction. The software miRBoost is available on our web server http://EvryRNA.ibisc.univ-evry.fr.
Collapse
Affiliation(s)
- Van Du T Tran
- IBISC - IBGBI, University of Evry, 91037 Evry CEDEX, France Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Sebastien Tempel
- IBISC - IBGBI, University of Evry, 91037 Evry CEDEX, France LCB, CNRS UMR 7283, 13009 Marseille, France
| | | | | | - Fariza Tahi
- IBISC - IBGBI, University of Evry, 91037 Evry CEDEX, France
| |
Collapse
|
64
|
Kang W, Friedländer MR. Computational Prediction of miRNA Genes from Small RNA Sequencing Data. Front Bioeng Biotechnol 2015; 3:7. [PMID: 25674563 PMCID: PMC4306309 DOI: 10.3389/fbioe.2015.00007] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2014] [Accepted: 01/07/2015] [Indexed: 01/19/2023] Open
Abstract
Next-generation sequencing now for the first time allows researchers to gage the depth and variation of entire transcriptomes. However, now as rare transcripts can be detected that are present in cells at single copies, more advanced computational tools are needed to accurately annotate and profile them. microRNAs (miRNAs) are 22 nucleotide small RNAs (sRNAs) that post-transcriptionally reduce the output of protein coding genes. They have established roles in numerous biological processes, including cancers and other diseases. During miRNA biogenesis, the sRNAs are sequentially cleaved from precursor molecules that have a characteristic hairpin RNA structure. The vast majority of new miRNA genes that are discovered are mined from small RNA sequencing (sRNA-seq), which can detect more than a billion RNAs in a single run. However, given that many of the detected RNAs are degradation products from all types of transcripts, the accurate identification of miRNAs remain a non-trivial computational problem. Here, we review the tools available to predict animal miRNAs from sRNA sequencing data. We present tools for generalist and specialist use cases, including prediction from massively pooled data or in species without reference genome. We also present wet-lab methods used to validate predicted miRNAs, and approaches to computationally benchmark prediction accuracy. For each tool, we reference validation experiments and benchmarking efforts. Last, we discuss the future of the field.
Collapse
Affiliation(s)
- Wenjing Kang
- Science for Life Laboratory, Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University , Stockholm , Sweden
| | - Marc R Friedländer
- Science for Life Laboratory, Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University , Stockholm , Sweden
| |
Collapse
|
65
|
Oulas A, Karathanasis N, Louloupi A, Pavlopoulos GA, Poirazi P, Kalantidis K, Iliopoulos I. Prediction of miRNA targets. Methods Mol Biol 2015; 1269:207-29. [PMID: 25577381 DOI: 10.1007/978-1-4939-2291-8_13] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Computational methods for miRNA target prediction are currently undergoing extensive review and evaluation. There is still a great need for improvement of these tools and bioinformatics approaches are looking towards high-throughput experiments in order to validate predictions. The combination of large-scale techniques with computational tools will not only provide greater credence to computational predictions but also lead to the better understanding of specific biological questions. Current miRNA target prediction tools utilize probabilistic learning algorithms, machine learning methods and even empirical biologically defined rules in order to build models based on experimentally verified miRNA targets. Large-scale protein downregulation assays and next-generation sequencing (NGS) are now being used to validate methodologies and compare the performance of existing tools. Tools that exhibit greater correlation between computational predictions and protein downregulation or RNA downregulation are considered the state of the art. Moreover, efficiency in prediction of miRNA targets that are concurrently verified experimentally provides additional validity to computational predictions and further highlights the competitive advantage of specific tools and their efficacy in extracting biologically significant results. In this review paper, we discuss the computational methods for miRNA target prediction and provide a detailed comparison of methodologies and features utilized by each specific tool. Moreover, we provide an overview of current state-of-the-art high-throughput methods used in miRNA target prediction.
Collapse
Affiliation(s)
- Anastasis Oulas
- Institute of Marine Biology, Biotechnology and Aquaculture-HCMR, Heraklion, Crete, Greece
| | | | | | | | | | | | | |
Collapse
|
66
|
Hattori H, Janky R, Nietfeld W, Aerts S, Madan Babu M, Venkitaraman AR. p53 shapes genome-wide and cell type-specific changes in microRNA expression during the human DNA damage response. Cell Cycle 2014; 13:2572-86. [PMID: 25486198 PMCID: PMC4601526 DOI: 10.4161/15384101.2015.942209] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
The human DNA damage response (DDR) triggers profound changes in gene expression, whose nature and regulation remain uncertain. Although certain micro-(mi)RNA species including miR34, miR-18, miR-16 and miR-143 have been implicated in the DDR, there is as yet no comprehensive description of genome-wide changes in the expression of miRNAs triggered by DNA breakage in human cells. We have used next-generation sequencing (NGS), combined with rigorous integrative computational analyses, to describe genome-wide changes in the expression of miRNAs during the human DDR. The changes affect 150 of 1523 miRNAs known in miRBase v18 from 4-24 h after the induction of DNA breakage, in cell-type dependent patterns. The regulatory regions of the most-highly regulated miRNA species are enriched in conserved binding sites for p53. Indeed, genome-wide changes in miRNA expression during the DDR are markedly altered in TP53-/- cells compared to otherwise isogenic controls. The expression levels of certain damage-induced, p53-regulated miRNAs in cancer samples correlate with patient survival. Our work reveals genome-wide and cell type-specific alterations in miRNA expression during the human DDR, which are regulated by the tumor suppressor protein p53. These findings provide a genomic resource to identify new molecules and mechanisms involved in the DDR, and to examine their role in tumor suppression and the clinical outcome of cancer patients.
Collapse
Key Words
- AP-1, activator protein-1
- DDR, DNA damage response
- DNA damage response
- E2F1, transcription factor E2F1
- FoxM1, forkhead box protein M1
- NF-k B, nuclear factor-k B
- NGS, next-generation sequencing
- TF, transcription factor
- TP53, tumour protein p53
- clinical outcome
- computational analysis
- double stranded DNA breaks, DSBs
- ionizing radiation, IR
- miRNA, micro-RNA
- micro-RNA
- misc RNA, miscellaneous RNA
- next-generation sequencing
- p53
- scRNA, small cytoplasmic RNA
- snRNA, small nuclear RNA
- snoRNA, small nucleolar RNA
- tRNA, transfer RNA
Collapse
Affiliation(s)
- Hiroyoshi Hattori
- University of Cambridge; Medical Research Council Cancer Unit; Hutchison/MRC Research Center; Cambridge, UK,These authors contributed equally to the work.,Present address: Laboratory of Advanced Therapy; Clinical Research Center; National Hospital Organization; Nagoya Medical Center; Aichi, Japan
| | - Rekin’s Janky
- Medical Research Council Laboratory of Molecular Biology; Cambridge, UK,Center for Human Genetics; KU Leuven; Campus Gasthuisberg; Leuven, Belgium,These authors contributed equally to the work.
| | | | - Stein Aerts
- Center for Human Genetics; KU Leuven; Campus Gasthuisberg; Leuven, Belgium
| | - M Madan Babu
- Medical Research Council Laboratory of Molecular Biology; Cambridge, UK
| | - Ashok R Venkitaraman
- University of Cambridge; Medical Research Council Cancer Unit; Hutchison/MRC Research Center; Cambridge, UK,Correspondence to: Ashok R Venkitaraman;
| |
Collapse
|
67
|
Etemadi M, Gutjahr C, Couzigou JM, Zouine M, Lauressergues D, Timmers A, Audran C, Bouzayen M, Bécard G, Combier JP. Auxin perception is required for arbuscule development in arbuscular mycorrhizal symbiosis. PLANT PHYSIOLOGY 2014; 166:281-92. [PMID: 25096975 PMCID: PMC4149713 DOI: 10.1104/pp.114.246595] [Citation(s) in RCA: 121] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Accepted: 08/04/2014] [Indexed: 05/02/2023]
Abstract
Most land plant species live in symbiosis with arbuscular mycorrhizal fungi. These fungi differentiate essential functional structures called arbuscules in root cortical cells from which mineral nutrients are released to the plant. We investigated the role of microRNA393 (miR393), an miRNA that targets several auxin receptors, in arbuscular mycorrhizal root colonization. Expression of the precursors of the miR393 was down-regulated during mycorrhization in three different plant species: Solanum lycopersicum, Medicago truncatula, and Oryza sativa. Treatment of S. lycopersicum, M. truncatula, and O. sativa roots with concentrations of synthetic auxin analogs that did not affect root development stimulated mycorrhization, particularly arbuscule formation. DR5-GUS, a reporter for auxin response, was preferentially expressed in root cells containing arbuscules. Finally, overexpression of miR393 in root tissues resulted in down-regulation of auxin receptor genes (transport inhibitor response1 and auxin-related F box) and underdeveloped arbuscules in all three plant species. These results support the conclusion that miR393 is a negative regulator of arbuscule formation by hampering auxin perception in arbuscule-containing cells.
Collapse
Affiliation(s)
- Mohammad Etemadi
- Université Paul Sabatier Toulouse, Unité Mixte de Recherche 5546, Laboratoire de Recherche en Sciences Végétales, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Centre National de la Recherche Scientifique, Unité Mixte de Recherche 5546, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Institut National Polytechnique-Ecole Nationale Supérieure Agronomique Toulouse, Génomique et Biotechnologie des Fruits, F-31326 Castanet-Tolosan, France (M.E., M.Z., C.A., M.B.);Institut National de la Recherche Agronomique, Génomique et Biotechnologie des Fruits, F-52627 Auzeville, France (M.E., M.Z., C.A., M.B.);Faculty of Biology, Genetics, University of Munich, 82152 Martinsried, Germany (C.G.); andLaboratoire des Interactions Plantes-Microorganismes, Unité Mixte de Recherche 441/2594 Institut National de la Recherche Agronomique-Centre National de la Recherche Scientifique, F-31326 Castanet-Tolosan cedex, France (A.T.)
| | - Caroline Gutjahr
- Université Paul Sabatier Toulouse, Unité Mixte de Recherche 5546, Laboratoire de Recherche en Sciences Végétales, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Centre National de la Recherche Scientifique, Unité Mixte de Recherche 5546, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Institut National Polytechnique-Ecole Nationale Supérieure Agronomique Toulouse, Génomique et Biotechnologie des Fruits, F-31326 Castanet-Tolosan, France (M.E., M.Z., C.A., M.B.);Institut National de la Recherche Agronomique, Génomique et Biotechnologie des Fruits, F-52627 Auzeville, France (M.E., M.Z., C.A., M.B.);Faculty of Biology, Genetics, University of Munich, 82152 Martinsried, Germany (C.G.); andLaboratoire des Interactions Plantes-Microorganismes, Unité Mixte de Recherche 441/2594 Institut National de la Recherche Agronomique-Centre National de la Recherche Scientifique, F-31326 Castanet-Tolosan cedex, France (A.T.)
| | - Jean-Malo Couzigou
- Université Paul Sabatier Toulouse, Unité Mixte de Recherche 5546, Laboratoire de Recherche en Sciences Végétales, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Centre National de la Recherche Scientifique, Unité Mixte de Recherche 5546, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Institut National Polytechnique-Ecole Nationale Supérieure Agronomique Toulouse, Génomique et Biotechnologie des Fruits, F-31326 Castanet-Tolosan, France (M.E., M.Z., C.A., M.B.);Institut National de la Recherche Agronomique, Génomique et Biotechnologie des Fruits, F-52627 Auzeville, France (M.E., M.Z., C.A., M.B.);Faculty of Biology, Genetics, University of Munich, 82152 Martinsried, Germany (C.G.); andLaboratoire des Interactions Plantes-Microorganismes, Unité Mixte de Recherche 441/2594 Institut National de la Recherche Agronomique-Centre National de la Recherche Scientifique, F-31326 Castanet-Tolosan cedex, France (A.T.)
| | - Mohamed Zouine
- Université Paul Sabatier Toulouse, Unité Mixte de Recherche 5546, Laboratoire de Recherche en Sciences Végétales, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Centre National de la Recherche Scientifique, Unité Mixte de Recherche 5546, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Institut National Polytechnique-Ecole Nationale Supérieure Agronomique Toulouse, Génomique et Biotechnologie des Fruits, F-31326 Castanet-Tolosan, France (M.E., M.Z., C.A., M.B.);Institut National de la Recherche Agronomique, Génomique et Biotechnologie des Fruits, F-52627 Auzeville, France (M.E., M.Z., C.A., M.B.);Faculty of Biology, Genetics, University of Munich, 82152 Martinsried, Germany (C.G.); andLaboratoire des Interactions Plantes-Microorganismes, Unité Mixte de Recherche 441/2594 Institut National de la Recherche Agronomique-Centre National de la Recherche Scientifique, F-31326 Castanet-Tolosan cedex, France (A.T.)
| | - Dominique Lauressergues
- Université Paul Sabatier Toulouse, Unité Mixte de Recherche 5546, Laboratoire de Recherche en Sciences Végétales, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Centre National de la Recherche Scientifique, Unité Mixte de Recherche 5546, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Institut National Polytechnique-Ecole Nationale Supérieure Agronomique Toulouse, Génomique et Biotechnologie des Fruits, F-31326 Castanet-Tolosan, France (M.E., M.Z., C.A., M.B.);Institut National de la Recherche Agronomique, Génomique et Biotechnologie des Fruits, F-52627 Auzeville, France (M.E., M.Z., C.A., M.B.);Faculty of Biology, Genetics, University of Munich, 82152 Martinsried, Germany (C.G.); andLaboratoire des Interactions Plantes-Microorganismes, Unité Mixte de Recherche 441/2594 Institut National de la Recherche Agronomique-Centre National de la Recherche Scientifique, F-31326 Castanet-Tolosan cedex, France (A.T.)
| | - Antonius Timmers
- Université Paul Sabatier Toulouse, Unité Mixte de Recherche 5546, Laboratoire de Recherche en Sciences Végétales, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Centre National de la Recherche Scientifique, Unité Mixte de Recherche 5546, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Institut National Polytechnique-Ecole Nationale Supérieure Agronomique Toulouse, Génomique et Biotechnologie des Fruits, F-31326 Castanet-Tolosan, France (M.E., M.Z., C.A., M.B.);Institut National de la Recherche Agronomique, Génomique et Biotechnologie des Fruits, F-52627 Auzeville, France (M.E., M.Z., C.A., M.B.);Faculty of Biology, Genetics, University of Munich, 82152 Martinsried, Germany (C.G.); andLaboratoire des Interactions Plantes-Microorganismes, Unité Mixte de Recherche 441/2594 Institut National de la Recherche Agronomique-Centre National de la Recherche Scientifique, F-31326 Castanet-Tolosan cedex, France (A.T.)
| | - Corinne Audran
- Université Paul Sabatier Toulouse, Unité Mixte de Recherche 5546, Laboratoire de Recherche en Sciences Végétales, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Centre National de la Recherche Scientifique, Unité Mixte de Recherche 5546, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Institut National Polytechnique-Ecole Nationale Supérieure Agronomique Toulouse, Génomique et Biotechnologie des Fruits, F-31326 Castanet-Tolosan, France (M.E., M.Z., C.A., M.B.);Institut National de la Recherche Agronomique, Génomique et Biotechnologie des Fruits, F-52627 Auzeville, France (M.E., M.Z., C.A., M.B.);Faculty of Biology, Genetics, University of Munich, 82152 Martinsried, Germany (C.G.); andLaboratoire des Interactions Plantes-Microorganismes, Unité Mixte de Recherche 441/2594 Institut National de la Recherche Agronomique-Centre National de la Recherche Scientifique, F-31326 Castanet-Tolosan cedex, France (A.T.)
| | - Mondher Bouzayen
- Université Paul Sabatier Toulouse, Unité Mixte de Recherche 5546, Laboratoire de Recherche en Sciences Végétales, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Centre National de la Recherche Scientifique, Unité Mixte de Recherche 5546, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Institut National Polytechnique-Ecole Nationale Supérieure Agronomique Toulouse, Génomique et Biotechnologie des Fruits, F-31326 Castanet-Tolosan, France (M.E., M.Z., C.A., M.B.);Institut National de la Recherche Agronomique, Génomique et Biotechnologie des Fruits, F-52627 Auzeville, France (M.E., M.Z., C.A., M.B.);Faculty of Biology, Genetics, University of Munich, 82152 Martinsried, Germany (C.G.); andLaboratoire des Interactions Plantes-Microorganismes, Unité Mixte de Recherche 441/2594 Institut National de la Recherche Agronomique-Centre National de la Recherche Scientifique, F-31326 Castanet-Tolosan cedex, France (A.T.)
| | - Guillaume Bécard
- Université Paul Sabatier Toulouse, Unité Mixte de Recherche 5546, Laboratoire de Recherche en Sciences Végétales, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Centre National de la Recherche Scientifique, Unité Mixte de Recherche 5546, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Institut National Polytechnique-Ecole Nationale Supérieure Agronomique Toulouse, Génomique et Biotechnologie des Fruits, F-31326 Castanet-Tolosan, France (M.E., M.Z., C.A., M.B.);Institut National de la Recherche Agronomique, Génomique et Biotechnologie des Fruits, F-52627 Auzeville, France (M.E., M.Z., C.A., M.B.);Faculty of Biology, Genetics, University of Munich, 82152 Martinsried, Germany (C.G.); andLaboratoire des Interactions Plantes-Microorganismes, Unité Mixte de Recherche 441/2594 Institut National de la Recherche Agronomique-Centre National de la Recherche Scientifique, F-31326 Castanet-Tolosan cedex, France (A.T.)
| | - Jean-Philippe Combier
- Université Paul Sabatier Toulouse, Unité Mixte de Recherche 5546, Laboratoire de Recherche en Sciences Végétales, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Centre National de la Recherche Scientifique, Unité Mixte de Recherche 5546, F-31326 Castanet-Tolosan cedex, France (M.E., J.-M.C., D.L., G.B., J.-P.C.);Institut National Polytechnique-Ecole Nationale Supérieure Agronomique Toulouse, Génomique et Biotechnologie des Fruits, F-31326 Castanet-Tolosan, France (M.E., M.Z., C.A., M.B.);Institut National de la Recherche Agronomique, Génomique et Biotechnologie des Fruits, F-52627 Auzeville, France (M.E., M.Z., C.A., M.B.);Faculty of Biology, Genetics, University of Munich, 82152 Martinsried, Germany (C.G.); andLaboratoire des Interactions Plantes-Microorganismes, Unité Mixte de Recherche 441/2594 Institut National de la Recherche Agronomique-Centre National de la Recherche Scientifique, F-31326 Castanet-Tolosan cedex, France (A.T.)
| |
Collapse
|
68
|
Rogato A, Richard H, Sarazin A, Voss B, Cheminant Navarro S, Champeimont R, Navarro L, Carbone A, Hess WR, Falciatore A. The diversity of small non-coding RNAs in the diatom Phaeodactylum tricornutum. BMC Genomics 2014; 15:698. [PMID: 25142710 PMCID: PMC4247016 DOI: 10.1186/1471-2164-15-698] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2014] [Accepted: 07/30/2014] [Indexed: 11/10/2022] Open
Abstract
Background Marine diatoms constitute a major component of eukaryotic phytoplankton and stand at the crossroads of several evolutionary lineages. These microalgae possess peculiar genomic features and novel combinations of genes acquired from bacterial, animal and plant ancestors. Furthermore, they display both DNA methylation and gene silencing activities. Yet, the biogenesis and regulatory function of small RNAs (sRNAs) remain ill defined in diatoms. Results Here we report the first comprehensive characterization of the sRNA landscape and its correlation with genomic and epigenomic information in Phaeodactylum tricornutum. The majority of sRNAs is 25 to 30 nt-long and maps to repetitive and silenced Transposable Elements marked by DNA methylation. A subset of this population also targets DNA methylated protein-coding genes, suggesting that gene body methylation might be sRNA-driven in diatoms. Remarkably, 25-30 nt sRNAs display a well-defined and unprecedented 180 nt-long periodic distribution at several highly methylated regions that awaits characterization. While canonical miRNAs are not detectable, other 21-25 nt sRNAs of unknown origin are highly expressed. Besides, non-coding RNAs with well-described function, namely tRNAs and U2 snRNA, constitute a major source of 21-25 nt sRNAs and likely play important roles under stressful environmental conditions. Conclusions P. tricornutum has evolved diversified sRNA pathways, likely implicated in the regulation of largely still uncharacterized genetic and epigenetic processes. These results uncover an unexpected complexity of diatom sRNA population and previously unappreciated features, providing new insights into the diversification of sRNA-based processes in eukaryotes. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-698) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | - Hugues Richard
- Sorbonne Universités, UPMC Univ Paris 06, UMR 7238, Laboratory of Computational and Quantitative Biology, F-75006 Paris, France.
| | | | | | | | | | | | | | | | | |
Collapse
|
69
|
Omer A, Singh P, Yadav NK, Singh RK. microRNAs: role in leukemia and their computational perspective. WILEY INTERDISCIPLINARY REVIEWS-RNA 2014; 6:65-78. [PMID: 25132152 DOI: 10.1002/wrna.1256] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2014] [Revised: 06/19/2014] [Accepted: 06/26/2014] [Indexed: 12/22/2022]
Abstract
MicroRNAs (miRNAs) belong to the family of noncoding RNAs (ncRNAs) and had gained importance due to its role in complex biochemical pathways. Changes in the expression of protein coding genes are the major cause of leukemia. Role of miRNAs as tumor suppressors has provided a new insight in the field of leukemia research. Particularly, the miRNAs mediated gene regulation involves the modulation of multiple mRNAs and cooperative action of different miRNAs to regulate a particular gene expression. This highly complex array of regulatory pathway network indicates the great possibility in analyzing and identifying novel findings. Owing to the conventional, slow experimental identification process of miRNAs and their targets, the last decade has witnessed the development of a large amount of computational approaches to deal with the complex interrelations present within biological systems. This article describes the various roles played by miRNAs in regulating leukemia and the role of computational approaches in exploring new possibilities.
Collapse
Affiliation(s)
- Ankur Omer
- Division of Toxicology, CSIR-Central Drug Research Institute, Lucknow, India; Academy of Scientific & Innovative Research (AcSIR), New Delhi, India
| | | | | | | |
Collapse
|
70
|
Lei J, Sun Y. miR-PREFeR: an accurate, fast and easy-to-use plant miRNA prediction tool using small RNA-Seq data. ACTA ACUST UNITED AC 2014; 30:2837-9. [PMID: 24930140 DOI: 10.1093/bioinformatics/btu380] [Citation(s) in RCA: 82] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
SUMMARY Plant microRNA prediction tools that use small RNA-sequencing data are emerging quickly. These existing tools have at least one of the following problems: (i) high false-positive rate; (ii) long running time; (iii) work only for genomes in their databases; (iv) hard to install or use. We developed miR-PREFeR (miRNA PREdiction From small RNA-Seq data), which uses expression patterns of miRNA and follows the criteria for plant microRNA annotation to accurately predict plant miRNAs from one or more small RNA-Seq data samples of the same species. We tested miR-PREFeR on several plant species. The results show that miR-PREFeR is sensitive, accurate, fast and has low-memory footprint. AVAILABILITY AND IMPLEMENTATION https://github.com/hangelwen/miR-PREFeR
Collapse
Affiliation(s)
- Jikai Lei
- Department of Computer Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Yanni Sun
- Department of Computer Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
71
|
Sadeghi B, Ahmadi H, Azimzadeh-Jamalkandi S, Nassiri MR, Masoudi-Nejad A. BosFinder: a novell pre-microRNA gene prediction algorithm inBos taurus. Anim Genet 2014; 45:479-84. [DOI: 10.1111/age.12170] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/01/2014] [Indexed: 01/27/2023]
Affiliation(s)
- B. Sadeghi
- Laboratory of Molecular Genetics and Bioinformatics (LMGB); Department of Animal Science; College of Agriculture Science and Natural Resource; Gonbad Kavous University; Gonbad Kavous Iran
- Laboratory of System Biology and Bioinformatics (LBB); Institute of Biochemistry and Biophysics; University of Tehran; Tehran Iran
| | - H. Ahmadi
- Laboratory of System Biology and Bioinformatics (LBB); Institute of Biochemistry and Biophysics; University of Tehran; Tehran Iran
| | | | - M. R. Nassiri
- Department of Animal Science and Research Institute of Biotechnology; Ferdowsi University of Mashhad; Mashhad Iran
| | - A. Masoudi-Nejad
- Laboratory of System Biology and Bioinformatics (LBB); Institute of Biochemistry and Biophysics; University of Tehran; Tehran Iran
| |
Collapse
|
72
|
Lopes IDON, Schliep A, de Carvalho ACPDLF. The discriminant power of RNA features for pre-miRNA recognition. BMC Bioinformatics 2014; 15:124. [PMID: 24884650 PMCID: PMC4046174 DOI: 10.1186/1471-2105-15-124] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2013] [Accepted: 04/08/2014] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND Computational discovery of microRNAs (miRNA) is based on pre-determined sets of features from miRNA precursors (pre-miRNA). Some feature sets are composed of sequence-structure patterns commonly found in pre-miRNAs, while others are a combination of more sophisticated RNA features. In this work, we analyze the discriminant power of seven feature sets, which are used in six pre-miRNA prediction tools. The analysis is based on the classification performance achieved with these feature sets for the training algorithms used in these tools. We also evaluate feature discrimination through the F-score and feature importance in the induction of random forests. RESULTS Small or non-significant differences were found among the estimated classification performances of classifiers induced using sets with diversification of features, despite the wide differences in their dimension. Inspired in these results, we obtained a lower-dimensional feature set, which achieved a sensitivity of 90% and a specificity of 95%. These estimates are within 0.1% of the maximal values obtained with any feature set (SELECT, Section "Results and discussion") while it is 34 times faster to compute. Even compared to another feature set (FS2, see Section "Results and discussion"), which is the computationally least expensive feature set of those from the literature which perform within 0.1% of the maximal values, it is 34 times faster to compute. The results obtained by the tools used as references in the experiments carried out showed that five out of these six tools have lower sensitivity or specificity. CONCLUSION In miRNA discovery the number of putative miRNA loci is in the order of millions. Analysis of putative pre-miRNAs using a computationally expensive feature set would be wasteful or even unfeasible for large genomes. In this work, we propose a relatively inexpensive feature set and explore most of the learning aspects implemented in current ab-initio pre-miRNA prediction tools, which may lead to the development of efficient ab-initio pre-miRNA discovery tools.The material to reproduce the main results from this paper can be downloaded from http://bioinformatics.rutgers.edu/Static/Software/discriminant.tar.gz.
Collapse
Affiliation(s)
- Ivani de O N Lopes
- Empresa Brasileira de Pesquisa Agropecuária, Embrapa Soja, Caixa Postal 231, Londrina-PR, CEP 86001-970, Brasil.
| | | | | |
Collapse
|
73
|
Zhang R, Murat F, Pont C, Langin T, Salse J. Paleo-evolutionary plasticity of plant disease resistance genes. BMC Genomics 2014; 15:187. [PMID: 24617999 PMCID: PMC4234491 DOI: 10.1186/1471-2164-15-187] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2013] [Accepted: 02/25/2014] [Indexed: 01/28/2023] Open
Abstract
Background The recent access to a large set of genome sequences, combined with a robust evolutionary scenario of modern monocot (i.e. grasses) and eudicot (i.e. rosids) species from their founder ancestors, offered the opportunity to gain insights into disease resistance genes (R-genes) evolutionary plasticity. Results We unravel in the current article (i) a R-genes repertoire consisting in 7883 for monocots and 15758 for eudicots, (ii) a contrasted R-genes conservation with 23.8% for monocots and 6.6% for dicots, (iii) a minimal ancestral founder pool of 384 R-genes for the monocots and 150 R-genes for the eudicots, (iv) a general pattern of organization in clusters accounting for more than 60% of mapped R-genes, (v) a biased deletion of ancestral duplicated R-genes between paralogous blocks possibly compensated by clusterization, (vi) a bias in R-genes clusterization where Leucine-Rich Repeats act as a ‘glue’ for domain association, (vii) a R-genes/miRNAs interome enriched toward duplicated R-genes. Conclusions Together, our data may suggest that R-genes family plasticity operated during plant evolution (i) at the structural level through massive duplicates loss counterbalanced by massive clusterization following polyploidization; as well as at (ii) the regulation level through microRNA/R-gene interactions acting as a possible source of functional diploidization of structurally retained R-genes duplicates. Such evolutionary shuffling events leaded to CNVs (i.e. Copy Number Variation) and PAVs (i.e. Presence Absence Variation) between related species operating in the decay of R-genes colinearity between plant species.
Collapse
Affiliation(s)
| | | | | | | | - Jerome Salse
- INRA/UBP UMR 1095 GDEC 'Génétique, Diversité et Ecophysiologie des Céréales', 5 chemin de Beaulieu, 63100 Clermont-Ferrand, France.
| |
Collapse
|
74
|
Jin W, Ibeagha-Awemu EM, Liang G, Beaudoin F, Zhao X, Guan LL. Transcriptome microRNA profiling of bovine mammary epithelial cells challenged with Escherichia coli or Staphylococcus aureus bacteria reveals pathogen directed microRNA expression profiles. BMC Genomics 2014. [PMID: 24606609 DOI: 10.1186/1471‐2164‐15‐181] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND MicroRNAs (miRNAs) can post-transcriptionally regulate gene expression and have been shown to be critical regulators to the fine-tuning of epithelial immune responses. However, the role of miRNAs in bovine responses to E. coli and S. aureus, two mastitis causing pathogens, is not well understood. RESULTS The global expression of miRNAs in bovine mammary epithelial cells (MAC-T cells) challenged with and without heat-inactivated Staphylococcus aureus (S. aureus) or Escherichia coli (E. coli) bacteria at 0, 6, 12, 24, and 48 hr was profiled using RNA-Seq. A total of 231 known bovine miRNAs were identified with more than 10 counts per million in at least one of 13 libraries and 5 miRNAs including bta-miR-21-5p, miR-27b, miR-22-3p, miR-184 and let-7f represented more than 50% of the abundance. One hundred and thirteen novel miRNAs were also identified and more than one third of them belong to the bta-miR-2284 family. Seventeen miRNAs were significantly (P < 0.05) differentially regulated by the presence of pathogens. E. coli initiated an earlier regulation of miRNAs (6 miRNAs differentially regulated within the first 6 hrs post challenge as compared to 1 miRNA for S. aureus) while S. aureus presented a delayed response. Five differentially expressed miRNAs (bta-miR-184, miR-24-3p, miR-148, miR-486 and let-7a-5p) were unique to E. coli while four (bta-miR-2339, miR-499, miR-23a and miR-99b) were unique to S. aureus. In addition, our study revealed a temporal differential regulation of five miRNAs (bta-miR-193a-3p, miR-423-5p, miR-30b-5p, miR-29c and miR-un116) in unchallenged cells. Target gene predictions of pathogen differentially expressed miRNAs indicate a significant enrichment in gene ontology functional categories in development/cellular processes, biological regulation as well as cell growth and death. Furthermore, target genes were significantly enriched in several KEGG pathways including immune system, signal transduction, cellular process, nervous system, development and human diseases. CONCLUSION Using next-generation sequencing, our study identified a pathogen directed differential regulation of miRNAs in MAC-T cells with roles in immunity and development. Our study provides a further confirmation of the involvement of mammary epithelia cells in contributing to the immune response to infecting pathogens and suggests the potential of miRNAs to serve as biomarkers for diagnosis and development of control measures.
Collapse
Affiliation(s)
| | | | | | | | - Xin Zhao
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB T6G2P5, Canada.
| | | |
Collapse
|
75
|
Jin W, Ibeagha-Awemu EM, Liang G, Beaudoin F, Zhao X, Guan LL. Transcriptome microRNA profiling of bovine mammary epithelial cells challenged with Escherichia coli or Staphylococcus aureus bacteria reveals pathogen directed microRNA expression profiles. BMC Genomics 2014; 15:181. [PMID: 24606609 PMCID: PMC4029070 DOI: 10.1186/1471-2164-15-181] [Citation(s) in RCA: 131] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2013] [Accepted: 02/25/2014] [Indexed: 01/23/2023] Open
Abstract
Background MicroRNAs (miRNAs) can post-transcriptionally regulate gene expression and have been shown to be critical regulators to the fine-tuning of epithelial immune responses. However, the role of miRNAs in bovine responses to E. coli and S. aureus, two mastitis causing pathogens, is not well understood. Results The global expression of miRNAs in bovine mammary epithelial cells (MAC-T cells) challenged with and without heat-inactivated Staphylococcus aureus (S. aureus) or Escherichia coli (E. coli) bacteria at 0, 6, 12, 24, and 48 hr was profiled using RNA-Seq. A total of 231 known bovine miRNAs were identified with more than 10 counts per million in at least one of 13 libraries and 5 miRNAs including bta-miR-21-5p, miR-27b, miR-22-3p, miR-184 and let-7f represented more than 50% of the abundance. One hundred and thirteen novel miRNAs were also identified and more than one third of them belong to the bta-miR-2284 family. Seventeen miRNAs were significantly (P < 0.05) differentially regulated by the presence of pathogens. E. coli initiated an earlier regulation of miRNAs (6 miRNAs differentially regulated within the first 6 hrs post challenge as compared to 1 miRNA for S. aureus) while S. aureus presented a delayed response. Five differentially expressed miRNAs (bta-miR-184, miR-24-3p, miR-148, miR-486 and let-7a-5p) were unique to E. coli while four (bta-miR-2339, miR-499, miR-23a and miR-99b) were unique to S. aureus. In addition, our study revealed a temporal differential regulation of five miRNAs (bta-miR-193a-3p, miR-423-5p, miR-30b-5p, miR-29c and miR-un116) in unchallenged cells. Target gene predictions of pathogen differentially expressed miRNAs indicate a significant enrichment in gene ontology functional categories in development/cellular processes, biological regulation as well as cell growth and death. Furthermore, target genes were significantly enriched in several KEGG pathways including immune system, signal transduction, cellular process, nervous system, development and human diseases. Conclusion Using next-generation sequencing, our study identified a pathogen directed differential regulation of miRNAs in MAC-T cells with roles in immunity and development. Our study provides a further confirmation of the involvement of mammary epithelia cells in contributing to the immune response to infecting pathogens and suggests the potential of miRNAs to serve as biomarkers for diagnosis and development of control measures.
Collapse
Affiliation(s)
| | | | | | | | - Xin Zhao
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB T6G2P5, Canada.
| | | |
Collapse
|
76
|
Yuan T, Huang X, Dittmar RL, Du M, Kohli M, Boardman L, Thibodeau SN, Wang L. eRNA: a graphic user interface-based tool optimized for large data analysis from high-throughput RNA sequencing. BMC Genomics 2014; 15:176. [PMID: 24593312 PMCID: PMC4029068 DOI: 10.1186/1471-2164-15-176] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2013] [Accepted: 02/26/2014] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND RNA sequencing (RNA-seq) is emerging as a critical approach in biological research. However, its high-throughput advantage is significantly limited by the capacity of bioinformatics tools. The research community urgently needs user-friendly tools to efficiently analyze the complicated data generated by high throughput sequencers. RESULTS We developed a standalone tool with graphic user interface (GUI)-based analytic modules, known as eRNA. The capacity of performing parallel processing and sample management facilitates large data analyses by maximizing hardware usage and freeing users from tediously handling sequencing data. The module miRNA identification" includes GUIs for raw data reading, adapter removal, sequence alignment, and read counting. The module "mRNA identification" includes GUIs for reference sequences, genome mapping, transcript assembling, and differential expression. The module "Target screening" provides expression profiling analyses and graphic visualization. The module "Self-testing" offers the directory setups, sample management, and a check for third-party package dependency. Integration of other GUIs including Bowtie, miRDeep2, and miRspring extend the program's functionality. CONCLUSIONS eRNA focuses on the common tools required for the mapping and quantification analysis of miRNA-seq and mRNA-seq data. The software package provides an additional choice for scientists who require a user-friendly computing environment and high-throughput capacity for large data analysis. eRNA is available for free download at https://sourceforge.net/projects/erna/?source=directory.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Liang Wang
- Department of Pathology and MCW Cancer Center, Medical College of Wisconsin, Milwaukee WI 53226, USA.
| |
Collapse
|
77
|
Chaturvedi A, Raeymaekers JAM, Volckaert FAM. Computational identification of miRNAs, their targets and functions in three-spined stickleback (Gasterosteus aculeatus). Mol Ecol Resour 2014; 14:768-77. [DOI: 10.1111/1755-0998.12223] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2013] [Revised: 12/27/2013] [Accepted: 01/03/2014] [Indexed: 12/12/2022]
Affiliation(s)
- Anurag Chaturvedi
- Laboratory of Biodiversity and Evolutionary Genomics; University of Leuven; Ch. Deberiotstraat 32 Leuven B-3000 Belgium
| | - Joost A. M. Raeymaekers
- Laboratory of Biodiversity and Evolutionary Genomics; University of Leuven; Ch. Deberiotstraat 32 Leuven B-3000 Belgium
- Zoological Institute; University of Basel; Vesalgasse 1 Basel CH-4051 Switzerland
| | - Filip A. M. Volckaert
- Laboratory of Biodiversity and Evolutionary Genomics; University of Leuven; Ch. Deberiotstraat 32 Leuven B-3000 Belgium
| |
Collapse
|
78
|
Abstract
Small RNAs are important transcriptional regulators within cells. With the advent of powerful Next Generation Sequencing platforms, sequencing small RNAs seems to be an obvious choice to understand their expression and its downstream effect. Additionally, sequencing provides an opportunity to identify novel and polymorphic miRNA. However, the biggest challenge is the appropriate data analysis pipeline, which is still in phase of active development by various academic groups. This chapter describes basic and advanced steps for small RNA sequencing analysis including quality control, small RNA alignment and quantification, differential expression analysis, novel small RNA identification, target prediction, and downstream analysis. We also provide a list of various resources for small RNA analysis.
Collapse
Affiliation(s)
- Jai Prakash Mehta
- Systems Biology Ireland, UCD Conway Institute of Biomolecular & Biomedical Research, University College Dublin, Dublin-4, Ireland,
| |
Collapse
|
79
|
Billoud B, Nehr Z, Le Bail A, Charrier B. Computational prediction and experimental validation of microRNAs in the brown alga Ectocarpus siliculosus. Nucleic Acids Res 2014; 42:417-29. [PMID: 24078085 PMCID: PMC3874173 DOI: 10.1093/nar/gkt856] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2013] [Revised: 08/30/2013] [Accepted: 09/03/2013] [Indexed: 12/16/2022] Open
Abstract
We used an in silico approach to predict microRNAs (miRNAs) genome-wide in the brown alga Ectocarpus siliculosus. As brown algae are phylogenetically distant from both animals and land plants, our approach relied on features shared by all known organisms, excluding sequence conservation, genome localization and pattern of base-pairing with the target. We predicted between 500 and 1500 miRNAs candidates, depending on the values of the energetic parameters used to filter the potential precursors. Using quantitative polymerase chain reaction assays, we confirmed the existence of 22 miRNAs among 72 candidates tested, and of 8 predicted precursors. In addition, we compared the expression of miRNAs and their precursors in two life cycle states (sporophyte, gametophyte) and under salt stress. Several miRNA precursors, Argonaute and DICER messenger RNAs were differentially expressed in these conditions. Finally, we analyzed the gene organization and the target functions of the predicted candidates. This showed that E. siliculosus miRNA genes are, like plant miRNA genes, rarely clustered and, like animal miRNA genes, often located in introns. Among the predicted targets, several widely conserved functional domains are significantly overrepresented, like kinesin, nucleotide-binding/APAF-1, R proteins and CED-4 (NB-ARC) and tetratricopeptide repeats. The combination of computational and experimental approaches thus emphasizes the originality of molecular and cellular processes in brown algae.
Collapse
Affiliation(s)
- Bernard Billoud
- Université Pierre et Marie Curie (UPMC), UMR 7139 Végétaux marins et Biomolécules, Station Biologique, CS 90074, F29688, Roscoff, France and Centre National de la Recherche Scientifique (CNRS), UMR 7139 Végétaux marins et Biomolécules, Station Biologique, CS 90074, F29688, Roscoff, France
| | - Zofia Nehr
- Université Pierre et Marie Curie (UPMC), UMR 7139 Végétaux marins et Biomolécules, Station Biologique, CS 90074, F29688, Roscoff, France and Centre National de la Recherche Scientifique (CNRS), UMR 7139 Végétaux marins et Biomolécules, Station Biologique, CS 90074, F29688, Roscoff, France
| | - Aude Le Bail
- Université Pierre et Marie Curie (UPMC), UMR 7139 Végétaux marins et Biomolécules, Station Biologique, CS 90074, F29688, Roscoff, France and Centre National de la Recherche Scientifique (CNRS), UMR 7139 Végétaux marins et Biomolécules, Station Biologique, CS 90074, F29688, Roscoff, France
| | - Bénédicte Charrier
- Université Pierre et Marie Curie (UPMC), UMR 7139 Végétaux marins et Biomolécules, Station Biologique, CS 90074, F29688, Roscoff, France and Centre National de la Recherche Scientifique (CNRS), UMR 7139 Végétaux marins et Biomolécules, Station Biologique, CS 90074, F29688, Roscoff, France
| |
Collapse
|
80
|
Wei L, Liao M, Gao Y, Ji R, He Z, Zou Q. Improved and Promising Identification of Human MicroRNAs by Incorporating a High-Quality Negative Set. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2014; 11:192-201. [PMID: 26355518 DOI: 10.1109/tcbb.2013.146] [Citation(s) in RCA: 169] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
MicroRNA (miRNA) plays an important role as a regulator in biological processes. Identification of (pre-) miRNAs helps in understanding regulatory processes. Machine learning methods have been designed for pre-miRNA identification. However, most of them cannot provide reliable predictive performances on independent testing data sets. We assumed this is because the training sets, especially the negative training sets, are not sufficiently representative. To generate a representative negative set, we proposed a novel negative sample selection technique, and successfully collected negative samples with improved quality. Two recent classifiers rebuilt with the proposed negative set achieved an improvement of ~6 percent in their predictive performance, which confirmed this assumption. Based on the proposed negative set, we constructed a training set, and developed an online system called miRNApre specifically for human pre-miRNA identification. We showed that miRNApre achieved accuracies on updated human and non-human data sets that were 34.3 and 7.6 percent higher than those achieved by current methods. The results suggest that miRNApre is an effective tool for pre-miRNA identification. Additionally, by integrating miRNApre, we developed a miRNA mining tool, mirnaDetect, which can be applied to find potential miRNAs in genome-scale data. MirnaDetect achieved a comparable mining performance on human chromosome 19 data as other existing methods.
Collapse
|
81
|
Ballén-Taborda C, Plata G, Ayling S, Rodríguez-Zapata F, Becerra Lopez-Lavalle LA, Duitama J, Tohme J. Identification of Cassava MicroRNAs under Abiotic Stress. Int J Genomics 2013; 2013:857986. [PMID: 24328029 PMCID: PMC3845235 DOI: 10.1155/2013/857986] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2013] [Accepted: 10/11/2013] [Indexed: 11/18/2022] Open
Abstract
The study of microRNAs (miRNAs) in plants has gained significant attention in recent years due to their regulatory role during development and in response to biotic and abiotic stresses. Although cassava (Manihot esculenta Crantz) is tolerant to drought and other adverse conditions, most cassava miRNAs have been predicted using bioinformatics alone or through sequencing of plants challenged by biotic stress. Here, we use high-throughput sequencing and different bioinformatics methods to identify potential cassava miRNAs expressed in different tissues subject to heat and drought conditions. We identified 60 miRNAs conserved in other plant species and 821 potential cassava-specific miRNAs. We also predicted 134 and 1002 potential target genes for these two sets of sequences. Using real time PCR, we verified the condition-specific expression of 5 cassava small RNAs relative to a non-stress control. We also found, using publicly available expression data, a significantly lower expression of the predicted target genes of conserved and nonconserved miRNAs under drought stress compared to other cassava genes. Gene Ontology enrichment analysis along with condition specific expression of predicted miRNA targets, allowed us to identify several interesting miRNAs which may play a role in stress-induced posttranscriptional regulation in cassava and other plants.
Collapse
Affiliation(s)
- Carolina Ballén-Taborda
- Agrobiodiversity and Biotechnology Project, International Center for Tropical Agriculture (CIAT), A.A. 6713, Cali, Colombia
| | - Germán Plata
- Department of Systems Biology, Columbia University, 1130 Saint Nicholas Avenue, New York, NY 10032, USA
| | - Sarah Ayling
- The Genome Analysis Centre, Norwich Research Park, Norwich NR4 7UH, UK
| | - Fausto Rodríguez-Zapata
- Agrobiodiversity and Biotechnology Project, International Center for Tropical Agriculture (CIAT), A.A. 6713, Cali, Colombia
| | | | - Jorge Duitama
- Agrobiodiversity and Biotechnology Project, International Center for Tropical Agriculture (CIAT), A.A. 6713, Cali, Colombia
| | - Joe Tohme
- Agrobiodiversity and Biotechnology Project, International Center for Tropical Agriculture (CIAT), A.A. 6713, Cali, Colombia
| |
Collapse
|
82
|
Leclercq M, Diallo AB, Blanchette M. Computational prediction of the localization of microRNAs within their pre-miRNA. Nucleic Acids Res 2013; 41:7200-11. [PMID: 23748953 PMCID: PMC3753617 DOI: 10.1093/nar/gkt466] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2013] [Revised: 04/30/2013] [Accepted: 05/05/2013] [Indexed: 12/19/2022] Open
Abstract
MicroRNAs (miRNAs) are short RNA species derived from hairpin-forming miRNA precursors (pre-miRNA) and acting as key posttranscriptional regulators. Most computational tools labeled as miRNA predictors are in fact pre-miRNA predictors and provide no information about the putative miRNA location within the pre-miRNA. Sequence and structural features that determine the location of the miRNA, and the extent to which these properties vary from species to species, are poorly understood. We have developed miRdup, a computational predictor for the identification of the most likely miRNA location within a given pre-miRNA or the validation of a candidate miRNA. MiRdup is based on a random forest classifier trained with experimentally validated miRNAs from miRbase, with features that characterize the miRNA-miRNA* duplex. Because we observed that miRNAs have sequence and structural properties that differ between species, mostly in terms of duplex stability, we trained various clade-specific miRdup models and obtained increased accuracy. MiRdup self-trains on the most recent version of miRbase and is easy to use. Combined with existing pre-miRNA predictors, it will be valuable for both de novo mapping of miRNAs and filtering of large sets of candidate miRNAs obtained from transcriptome sequencing projects. MiRdup is open source under the GPLv3 and available at http://www.cs.mcgill.ca/∼blanchem/mirdup/.
Collapse
Affiliation(s)
- Mickael Leclercq
- School of Computer Science and McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, Canada H3A2B2 and Laboratoire de bioinformatique du département informatique, Université du Québec À Montréal, Montreal, Quebec, Canada H2X3Y7
| | - Abdoulaye Banire Diallo
- School of Computer Science and McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, Canada H3A2B2 and Laboratoire de bioinformatique du département informatique, Université du Québec À Montréal, Montreal, Quebec, Canada H2X3Y7
| | - Mathieu Blanchette
- School of Computer Science and McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, Canada H3A2B2 and Laboratoire de bioinformatique du département informatique, Université du Québec À Montréal, Montreal, Quebec, Canada H2X3Y7
| |
Collapse
|
83
|
Zhong L, Wang JTL, Wen D, Aris V, Soteropoulos P, Shapiro BA. Effective classification of microRNA precursors using feature mining and AdaBoost algorithms. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2013; 17:486-93. [PMID: 23808606 DOI: 10.1089/omi.2013.0011] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MicroRNAs play important roles in most biological processes, including cell proliferation, tissue differentiation, and embryonic development, among others. They originate from precursor transcripts (pre-miRNAs), which contain phylogenetically conserved stem-loop structures. An important bioinformatics problem is to distinguish the pre-miRNAs from pseudo pre-miRNAs that have similar stem-loop structures. We present here a novel method for tackling this bioinformatics problem. Our method, named MirID, accepts an RNA sequence as input, and classifies the RNA sequence either as positive (i.e., a real pre-miRNA) or as negative (i.e., a pseudo pre-miRNA). MirID employs a feature mining algorithm for finding combinations of features suitable for building pre-miRNA classification models. These models are implemented using support vector machines, which are combined to construct a classifier ensemble. The accuracy of the classifier ensemble is further enhanced by the utilization of an AdaBoost algorithm. When compared with two closely related tools on twelve species analyzed with these tools, MirID outperforms the existing tools on the majority of the twelve species. MirID was also tested on nine additional species, and the results showed high accuracies on the nine species. The MirID web server is fully operational and freely accessible at http://bioinformatics.njit.edu/MirID/ . Potential applications of this software in genomics and medicine are also discussed.
Collapse
Affiliation(s)
- Ling Zhong
- Bioinformatics Program and Department of Computer Science, New Jersey Institute of Technology, Newark, New Jersey 07102, USA
| | | | | | | | | | | |
Collapse
|
84
|
Wu J, Liu Q, Wang X, Zheng J, Wang T, You M, Sheng Sun Z, Shi Q. mirTools 2.0 for non-coding RNA discovery, profiling, and functional annotation based on high-throughput sequencing. RNA Biol 2013; 10:1087-92. [PMID: 23778453 DOI: 10.4161/rna.25193] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Next-generation sequencing has been widely applied to understand the complexity of non-coding RNAs (ncRNAs) in a cost-effective way. In this study, we developed mirTools 2.0, an updated version of mirTools 1.0, which includes the following new features. (1) From miRNA discovery in mirTools 1.0, mirTools 2.0 allows users to detect and profile various types of ncRNAs, such as miRNA, tRNA, snRNA, snoRNA, rRNA, and piRNA. (2) From miRNA profiling in mirTools 1.0, mirTools 2.0 allows users to identify miRNA-targeted genes and performs detailed functional annotation of miRNA targets, including Gene Ontology, KEGG pathway and protein-protein interaction. (3) From comparison of two samples for differentially expressed miRNAs in mirTools 1.0, mirTools 2.0 allows users to detect differentially expressed ncRNAs between two experimental groups or among multiple samples. (4) Other significant improvements include strategies used to detect novel miRNAs and piRNAs, more taxonomy categories to discover more known miRNAs and a stand-alone version of mirTools 2.0. In conclusion, we believe that mirTools 2.0 (122.228.158.106/mr2_dev and centre.bioinformatics.zj.cn/mr2_dev) will provide researchers with more detailed insight into small RNA transcriptomes.
Collapse
Affiliation(s)
- Jinyu Wu
- Hefei National Laboratory for Physical Sciences at Microscale and School of Life Sciences; University of Science and Technology of China; Hefei, China; Institute of Genomic Medicine; Wenzhou Medical College; Wenzhou, China
| | | | | | | | | | | | | | | |
Collapse
|
85
|
Mathelier A, Carbone A. Large scale chromosomal mapping of human microRNA structural clusters. Nucleic Acids Res 2013; 41:4392-408. [PMID: 23444140 PMCID: PMC3632110 DOI: 10.1093/nar/gkt112] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2012] [Revised: 01/08/2013] [Accepted: 02/03/2013] [Indexed: 02/06/2023] Open
Abstract
MicroRNAs (miRNAs) can group together along the human genome to form stable secondary structures made of several hairpins hosting miRNAs in their stems. The few known examples of such structures are all involved in cancer development. A large scale computational analysis of human chromosomes crossing sequence analysis and deep sequencing data revealed the presence of >400 structural clusters of miRNAs in the human genome. An a posteriori analysis validates predictions as bona fide miRNAs. A functional analysis of structural clusters position along the chromosomes co-localizes them with genes involved in several key cellular processes like immune systems, sensory systems, signal transduction and development. Immune systems diseases, infectious diseases and neurodegenerative diseases are characterized by genes that are especially well organized around structural clusters of miRNAs. Target genes functional analysis strongly supports a regulatory role of most predicted miRNAs and, notably, a strong involvement of predicted miRNAs in the regulation of cancer pathways. This analysis provides new fundamental insights on the genomic organization of miRNAs in human chromosomes.
Collapse
Affiliation(s)
| | - Alessandra Carbone
- Université Pierre et Marie Curie, UMR7238, 15, rue de l’Ecole de Médecine, 75006 Paris, France and CNRS, UMR7238, Laboratoire de Génomique des Microorganismes, 75006 Paris, France
| |
Collapse
|
86
|
Kleftogiannis D, Korfiati A, Theofilatos K, Likothanassis S, Tsakalidis A, Mavroudi S. Where we stand, where we are moving: Surveying computational techniques for identifying miRNA genes and uncovering their regulatory role. J Biomed Inform 2013; 46:563-73. [PMID: 23501016 DOI: 10.1016/j.jbi.2013.02.002] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2012] [Revised: 01/08/2013] [Accepted: 02/12/2013] [Indexed: 12/19/2022]
Abstract
Traditional biology was forced to restate some of its principles when the microRNA (miRNA) genes and their regulatory role were firstly discovered. Typically, miRNAs are small non-coding RNA molecules which have the ability to bind to the 3'untraslated region (UTR) of their mRNA target genes for cleavage or translational repression. Existing experimental techniques for their identification and the prediction of the target genes share some important limitations such as low coverage, time consuming experiments and high cost reagents. Hence, many computational methods have been proposed for these tasks to overcome these limitations. Recently, many researchers emphasized on the development of computational approaches to predict the participation of miRNA genes in regulatory networks and to analyze their transcription mechanisms. All these approaches have certain advantages and disadvantages which are going to be described in the present survey. Our work is differentiated from existing review papers by updating the methodologies list and emphasizing on the computational issues that arise from the miRNA data analysis. Furthermore, in the present survey, the various miRNA data analysis steps are treated as an integrated procedure whose aims and scope is to uncover the regulatory role and mechanisms of the miRNA genes. This integrated view of the miRNA data analysis steps may be extremely useful for all researchers even if they work on just a single step.
Collapse
Affiliation(s)
- Dimitrios Kleftogiannis
- King Abdullah University of Science and Technology (KAUST), Computer Science and Mathematical Sciences and Engineering Division, Thuwal, Saudi Arabia
| | | | | | | | | | | |
Collapse
|
87
|
Uva P, Da Sacco L, Del Cornò M, Baldassarre A, Sestili P, Orsini M, Palma A, Gessani S, Masotti A. Rat mir-155 generated from the lncRNA Bic is 'hidden' in the alternate genomic assembly and reveals the existence of novel mammalian miRNAs and clusters. RNA (NEW YORK, N.Y.) 2013; 19:365-79. [PMID: 23329697 PMCID: PMC3677247 DOI: 10.1261/rna.035394.112] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
MicroRNAs (miRNAs) are a class of small noncoding RNAs acting as post-transcriptional gene expression regulators in many physiological and pathological conditions. During the last few years, many novel mammalian miRNAs have been predicted experimentally with bioinformatics approaches and validated by next-generation sequencing. Although these strategies have prompted the discovery of several miRNAs, the total number of these genes still seems larger. Here, by exploiting the species conservation of human, mouse, and rat hairpin miRNAs, we discovered a novel rat microRNA, mir-155. We found that mature miR-155 is overexpressed in rat spleen myeloid cells treated with LPS, similarly to humans and mice. Rat mir-155 is annotated only on the alternate genome, suggesting the presence of other "hidden" miRNAs on this assembly. Therefore, we comprehensively extended the homology search also to mice and humans, finally validating 34 novel mammalian miRNAs (two in humans, five in mice, and up to 27 in rats). Surprisingly, 15 of these novel miRNAs (one for mice and 14 for rats) were found only on the alternate and not on the reference genomic assembly. To date, our findings indicate that the choice of genomic assembly, when mapping small RNA reads, is an important option that should be carefully considered, at least for these animal models. Finally, the discovery of these novel mammalian miRNA genes may contribute to a better understanding of already acquired experimental data, thereby paving the way to still unexplored investigations and to unraveling the function of miRNAs in disease models.
Collapse
Affiliation(s)
- Paolo Uva
- CRS4 Bioinformatics Laboratory, Parco Scientifico e Tecnologico POLARIS, 09010 Pula, Cagliari, Italy
| | - Letizia Da Sacco
- Gene Expression–Microarrays Laboratory, Bambino Gesù Children’s Hospital, IRCCS, 00165 Rome, Italy
| | - Manuela Del Cornò
- Department of Hematology, Oncology, and Molecular Medicine, Istituto Superiore di Sanità, 00161 Rome, Italy
| | - Antonella Baldassarre
- Gene Expression–Microarrays Laboratory, Bambino Gesù Children’s Hospital, IRCCS, 00165 Rome, Italy
| | - Paola Sestili
- Department of Hematology, Oncology, and Molecular Medicine, Istituto Superiore di Sanità, 00161 Rome, Italy
| | - Massimiliano Orsini
- CRS4 Bioinformatics Laboratory, Parco Scientifico e Tecnologico POLARIS, 09010 Pula, Cagliari, Italy
| | - Alessia Palma
- Genomic Core Facility, Bambino Gesù Children’s Hospital, IRCCS, 00139 Rome, Italy
| | - Sandra Gessani
- Department of Hematology, Oncology, and Molecular Medicine, Istituto Superiore di Sanità, 00161 Rome, Italy
| | - Andrea Masotti
- Gene Expression–Microarrays Laboratory, Bambino Gesù Children’s Hospital, IRCCS, 00165 Rome, Italy
- Corresponding authorE-mail E-mail
| |
Collapse
|
88
|
Shi Z, Montgomery TA, Qi Y, Ruvkun G. High-throughput sequencing reveals extraordinary fluidity of miRNA, piRNA, and siRNA pathways in nematodes. Genome Res 2013; 23:497-508. [PMID: 23363624 PMCID: PMC3589538 DOI: 10.1101/gr.149112.112] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
The nematode Caenorhabditis elegans contains each of the broad classes of eukaryotic small RNAs, including microRNAs (miRNAs), endogenous small-interfering RNAs (endo-siRNAs), and piwi-interacting RNAs (piRNAs). To better understand the evolution of these regulatory RNAs, we deep-sequenced small RNAs from C. elegans and three closely related nematodes: C. briggsae, C. remanei, and C. brenneri. The results reveal a fluid landscape of small RNA pathways with essentially no conservation of individual sequences aside from a subset of miRNAs. We identified 54 miRNA families that are conserved in each of the four species, as well as numerous miRNAs that are species-specific or shared between only two or three species. Despite a lack of conservation of individual piRNAs and siRNAs, many of the features of each pathway are conserved between the different species. We show that the genomic distribution of 26G siRNAs and the tendency for piRNAs to cluster is conserved between C. briggsae and C. elegans. We also show that, in each species, 26G siRNAs trigger stage-specific secondary siRNA formation. piRNAs in each species also trigger secondary siRNA formation from targets containing up to three mismatches. Finally, we show that the production of male- and female-specific piRNAs is conserved in all four species, suggesting distinct roles for piRNAs in male and female germlines.
Collapse
Affiliation(s)
- Zhen Shi
- Department of Molecular Biology, Massachusetts General Hospital, Boston, MA 02114, USA
| | | | | | | |
Collapse
|
89
|
Rossi S, Calin GA. Bioinformatics, non-coding RNAs and its possible application in personalized medicine. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2013; 774:21-37. [PMID: 23377966 DOI: 10.1007/978-94-007-5590-1_2] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Non-coding RNAs are important actors in human biology. A massive amount of data has been created and manipulated, and important findings have been extracted thanks in part to bioinformatics approaches and consequent experimental validation; many of these results are for a specific class of non-coding RNAs, the microRNAs (miRNAs), that are important regulators of gene expression although their transcriptional regulation is not yet well understood. Their involvement in cancer development and progression makes the related research field an integrated one, composed of bioinformaticians, clinicians, statisticians and biologists, as well as informaticians and data miners that cure data manipulation and storage especially due to the output of the latest technologies, like the Next Generation Sequencers.In this chapter we report the main miRNA findings of the last 10 years in terms of identification and prediction techniques, data generation and manipulation methods, as well as possible use in clinical practice.
Collapse
Affiliation(s)
- Simona Rossi
- Swiss Institute of Bioinformatics, Lausanne, Switzerland.
| | | |
Collapse
|
90
|
An J, Lai J, Lehman ML, Nelson CC. miRDeep*: an integrated application tool for miRNA identification from RNA sequencing data. Nucleic Acids Res 2012; 41:727-37. [PMID: 23221645 PMCID: PMC3553977 DOI: 10.1093/nar/gks1187] [Citation(s) in RCA: 171] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
miRDeep and its varieties are widely used to quantify known and novel micro RNA (miRNA) from small RNA sequencing (RNAseq). This article describes miRDeep*, our integrated miRNA identification tool, which is modeled off miRDeep, but the precision of detecting novel miRNAs is improved by introducing new strategies to identify precursor miRNAs. miRDeep* has a user-friendly graphic interface and accepts raw data in FastQ and Sequence Alignment Map (SAM) or the binary equivalent (BAM) format. Known and novel miRNA expression levels, as measured by the number of reads, are displayed in an interface, which shows each RNAseq read relative to the pre-miRNA hairpin. The secondary pre-miRNA structure and read locations for each predicted miRNA are shown and kept in a separate figure file. Moreover, the target genes of known and novel miRNAs are predicted using the TargetScan algorithm, and the targets are ranked according to the confidence score. miRDeep* is an integrated standalone application where sequence alignment, pre-miRNA secondary structure calculation and graphical display are purely Java coded. This application tool can be executed using a normal personal computer with 1.5 GB of memory. Further, we show that miRDeep* outperformed existing miRNA prediction tools using our LNCaP and other small RNAseq datasets. miRDeep* is freely available online at http://www.australianprostatecentre.org/research/software/mirdeep-star.
Collapse
Affiliation(s)
- Jiyuan An
- Australian Prostate Cancer Research Centre-Queensland, Institute of Health and Biomedical Innovation (IHBI), Queensland University of Technology, Princess Alexandra Hospital, Level 1, Building 1, Ipswich Road, Brisbane, Queensland, QLD 4102, Australia.
| | | | | | | |
Collapse
|
91
|
Cheng WC, Chung IF, Huang TS, Chang ST, Sun HJ, Tsai CF, Liang ML, Wong TT, Wang HW. YM500: a small RNA sequencing (smRNA-seq) database for microRNA research. Nucleic Acids Res 2012. [PMID: 23203880 PMCID: PMC3531161 DOI: 10.1093/nar/gks1238] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
MicroRNAs (miRNAs) are small RNAs ∼22 nt in length that are involved in the regulation of a variety of physiological and pathological processes. Advances in high-throughput small RNA sequencing (smRNA-seq), one of the next-generation sequencing applications, have reshaped the miRNA research landscape. In this study, we established an integrative database, the YM500 (http://ngs.ym.edu.tw/ym500/), containing analysis pipelines and analysis results for 609 human and mice smRNA-seq results, including public data from the Gene Expression Omnibus (GEO) and some private sources. YM500 collects analysis results for miRNA quantification, for isomiR identification (incl. RNA editing), for arm switching discovery, and, more importantly, for novel miRNA predictions. Wetlab validation on >100 miRNAs confirmed high correlation between miRNA profiling and RT-qPCR results (R = 0.84). This database allows researchers to search these four different types of analysis results via our interactive web interface. YM500 allows researchers to define the criteria of isomiRs, and also integrates the information of dbSNP to help researchers distinguish isomiRs from SNPs. A user-friendly interface is provided to integrate miRNA-related information and existing evidence from hundreds of sequencing datasets. The identified novel miRNAs and isomiRs hold the potential for both basic research and biotech applications.
Collapse
Affiliation(s)
- Wei-Chung Cheng
- Division of Pediatric Neurosurgery, Neurological Institute, Taipei Veterans General Hospital, Taipei 11217, Taiwan
| | | | | | | | | | | | | | | | | |
Collapse
|
92
|
miR-BAG: bagging based identification of microRNA precursors. PLoS One 2012; 7:e45782. [PMID: 23049860 PMCID: PMC3458082 DOI: 10.1371/journal.pone.0045782] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2012] [Accepted: 08/23/2012] [Indexed: 12/21/2022] Open
Abstract
Non-coding elements such as miRNAs play key regulatory roles in living systems. These ultra-short, ∼21 bp long, RNA molecules are derived from their hairpin precursors and usually participate in negative gene regulation by binding the target mRNAs. Discovering miRNA candidate regions across the genome has been a challenging problem. Most of the existing tools work reliably only for limited datasets. Here, we have presented a novel reliable approach, miR-BAG, developed to identify miRNA candidate regions in genomes by scanning sequences as well as by using next generation sequencing (NGS) data. miR-BAG utilizes a bootstrap aggregation based machine learning approach, successfully creating an ensemble of complementary learners to attain high accuracy while balancing sensitivity and specificity. miR-BAG was developed for wide range of species and tested extensively for performance over a wide range of experimentally validated data. Consideration of position-specific variation of triplet structural profiles and mature miRNA anchored structural profiles had a positive impact on performance. miR-BAG’s performance was found consistent and the accuracy level was observed to be >90% for most of the species considered in the present study. In a detailed comparative analysis, miR-BAG performed better than six existing tools. Using miR-BAG NGS module, we identified a total of 22 novel miRNA candidate regions in cow genome in addition to a total of 42 cow specific miRNA regions. In practice, discovery of miRNA regions in a genome demands high-throughput data analysis, requiring large amount of processing. Considering this, miR-BAG has been developed in multi-threaded parallel architecture as a web server as well as a user friendly GUI standalone version.
Collapse
|
93
|
Washietl S, Will S, Hendrix DA, Goff LA, Rinn JL, Berger B, Kellis M. Computational analysis of noncoding RNAs. WILEY INTERDISCIPLINARY REVIEWS-RNA 2012; 3:759-78. [PMID: 22991327 DOI: 10.1002/wrna.1134] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Noncoding RNAs have emerged as important key players in the cell. Understanding their surprisingly diverse range of functions is challenging for experimental and computational biology. Here, we review computational methods to analyze noncoding RNAs. The topics covered include basic and advanced techniques to predict RNA structures, annotation of noncoding RNAs in genomic data, mining RNA-seq data for novel transcripts and prediction of transcript structures, computational aspects of microRNAs, and database resources.
Collapse
Affiliation(s)
- Stefan Washietl
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | | | | | | | | | | | | |
Collapse
|
94
|
Krzyzanowski PM, Muro EM, Andrade-Navarro MA. Computational approaches to discovering noncoding RNA. WILEY INTERDISCIPLINARY REVIEWS-RNA 2012; 3:567-79. [PMID: 22555938 DOI: 10.1002/wrna.1121] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
New developments are being brought to the field of molecular biology with the mounting evidence that RNA transcripts not translated into protein (noncoding RNAs, ncRNAs) hold a variety of biological functions. Computational discovery of ncRNAs is one of these developments, fueled not only by the urge to characterize these sequences but also by necessity to prioritize ones with the most relevant functions for experimental verification. The heterogeneity in size and mode of activity of ncRNAs is reflected in the corresponding diversity of computational methods for their study. Sequence and structural analysis, conservation across species, and relative position to other genomic elements are being used for ncRNA detection. In addition, the recent development of techniques that allow deep sequencing of cell transcripts either globally or from isolated ncRNA-related material is leading the field toward increased use of such high-throughput data. We expect that imminent breakthroughs will include the classification of newer types of ncRNA and new insights into miRNA and piRNA biology, eventually leading toward the completion of a catalog of all human ncRNAs.
Collapse
|
95
|
Meng F, Hackenberg M, Li Z, Yan J, Chen T. Discovery of novel microRNAs in rat kidney using next generation sequencing and microarray validation. PLoS One 2012; 7:e34394. [PMID: 22470567 PMCID: PMC3314633 DOI: 10.1371/journal.pone.0034394] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2012] [Accepted: 02/27/2012] [Indexed: 01/12/2023] Open
Abstract
MicroRNAs (miRNAs) are small non-coding RNAs that regulate a variety of biological processes. The latest version of the miRBase database (Release 18) includes 1,157 mouse and 680 rat mature miRNAs. Only one new rat mature miRNA was added to the rat miRNA database from version 16 to version 18 of miRBase, suggesting that many rat miRNAs remain to be discovered. Given the importance of rat as a model organism, discovery of the completed set of rat miRNAs is necessary for understanding rat miRNA regulation. In this study, next generation sequencing (NGS), microarray analysis and bioinformatics technologies were applied to discover novel miRNAs in rat kidneys. MiRanalyzer was utilized to analyze the sequences of the small RNAs generated from NGS analysis of rat kidney samples. Hundreds of novel miRNA candidates were examined according to the mappings of their reads to the rat genome, presence of sequences that can form a miRNA hairpin structure around the mapped locations, Dicer cleavage patterns, and the levels of their expression determined by both NGS and microarray analyses. Nine novel rat hairpin precursor miRNAs (pre-miRNA) were discovered with high confidence. Five of the novel pre-miRNAs are also reported in other species while four of them are rat specific. In summary, 9 novel pre-miRNAs (14 novel mature miRNAs) were identified via combination of NGS, microarray and bioinformatics high-throughput technologies.
Collapse
Affiliation(s)
- Fanxue Meng
- Division of Genetic and Molecular Toxicology, National Center for Toxicological Research, Food and Drug Administration, Jefferson, Arkansas, United States of America
| | - Michael Hackenberg
- Dpto. de Genética, Facultad de Ciencias, Universidad de Granada, Granada, Spain
| | - Zhiguang Li
- Division of Genetic and Molecular Toxicology, National Center for Toxicological Research, Food and Drug Administration, Jefferson, Arkansas, United States of America
| | - Jian Yan
- Division of Genetic and Molecular Toxicology, National Center for Toxicological Research, Food and Drug Administration, Jefferson, Arkansas, United States of America
| | - Tao Chen
- Division of Genetic and Molecular Toxicology, National Center for Toxicological Research, Food and Drug Administration, Jefferson, Arkansas, United States of America
- * E-mail:
| |
Collapse
|
96
|
Abstract
miRNAs are small non coding RNA structures which play important roles in biological processes. Finding miRNA precursors in genomes is therefore an important task, where computational methods are required. The goal of these methods is to select potential pre-miRNAs which could be validated by experimental methods. With the new generation of sequencing techniques, it is important to have fast algorithms that are able to treat whole genomes in acceptable times. We developed an algorithm based on an original method where an approximation of miRNA hairpins are first searched, before reconstituting the pre-miRNA structure. The approximation step allows a substantial decrease in the number of possibilities and thus the time required for searching. Our method was tested on different genomic sequences, and was compared with CID-miRNA, miRPara and VMir. It gives in almost all cases better sensitivity and selectivity. It is faster than CID-miRNA, miRPara and VMir: it takes ≈ 30 s to process a 1 MB sequence, when VMir takes 30 min, miRPara takes 20 h and CID-miRNA takes 55 h. We present here a fast ab-initio algorithm for searching for pre-miRNA precursors in genomes, called miRNAFold. miRNAFold is available at http://EvryRNA.ibisc.univ-evry.fr/.
Collapse
Affiliation(s)
- Sébastien Tempel
- Laboratoire IBISC, Université d'Evry-Val d'Essonne/Genopole, 23 Boulevard de France, 91034 Evry, France
| | | |
Collapse
|
97
|
Xie F, Xiao P, Chen D, Xu L, Zhang B. miRDeepFinder: a miRNA analysis tool for deep sequencing of plant small RNAs. PLANT MOLECULAR BIOLOGY 2012; 80:75-84. [PMID: 22290409 DOI: 10.1007/s11103-012-9885-2] [Citation(s) in RCA: 1010] [Impact Index Per Article: 77.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2011] [Accepted: 01/12/2012] [Indexed: 05/19/2023]
Abstract
miRDeepFinder is a software package developed to identify and functionally analyze plant microRNAs (miRNAs) and their targets from small RNA datasets obtained from deep sequencing. The functions available in miRDeepFinder include pre-processing of raw data, identifying conserved miRNAs, mining and classifying novel miRNAs, miRNA expression profiling, predicting miRNA targets, and gene pathway and gene network analysis involving miRNAs. The fundamental design of miRDeepFinder is based on miRNA biogenesis, miRNA-mediated gene regulation and target recognition, such as perfect or near perfect hairpin structures, different read abundances of miRNA and miRNA*, and targeting patterns of plant miRNAs. To test the accuracy and robustness of miRDeepFinder, we analyzed a small RNA deep sequencing dataset of Arabidopsis thaliana published in the GEO database of NCBI. Our test retrieved 128 of 131 (97.7%) known miRNAs that have a more than 3 read count in Arabidopsis. Because many known miRNAs are not associated with miRNA*s in small RNA datasets, miRDeepFinder was also designed to recover miRNA candidates without the presence of miRNA*. To mine as many miRNAs as possible, miRDeepFinder allows users to compare mature miRNAs and their miRNA*s with other small RNA datasets from the same species. Cleaveland software package was also incorporated into miRDeepFinder for miRNA target identification using degradome sequencing analysis. Using this new computational tool, we identified 13 novel miRNA candidates with miRNA*s from Arabidopsis and validated 12 of them experimentally. Interestingly, of the 12 verified novel miRNAs, a miRNA named AC1 spans the exons of two genes (UTG71C4 and UGT71C3). Both the mature AC1 miRNA and its miRNA* were also found in four other small RNA datasets. We also developed a tool, "miRNA primer designer" to design primers for any type of miRNAs. miRDeepFinder provides a powerful tool for analyzing small RNA datasets from all species, with or without the availability of genome information. miRDeepFinder and miRNA primer designer are freely available at http://www.leonxie.com/DeepFinder.php and at http://www.leonxie.com/miRNAprimerDesigner.php , respectively. A program (called RefFinder: http://www.leonxie.com/referencegene.php ) was also developed for assessing the reliable reference genes for gene expression analysis, including miRNAs.
Collapse
Affiliation(s)
- Fuliang Xie
- Department of Biology, East Carolina University, Greenville, NC, 27858, USA
| | | | | | | | | |
Collapse
|
98
|
Li Y, Zhang Z, Liu F, Vongsangnak W, Jing Q, Shen B. Performance comparison and evaluation of software tools for microRNA deep-sequencing data analysis. Nucleic Acids Res 2012; 40:4298-305. [PMID: 22287634 PMCID: PMC3378883 DOI: 10.1093/nar/gks043] [Citation(s) in RCA: 118] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
With the development of next-generation sequencing (NGS) techniques, many software tools have emerged for the discovery of novel microRNAs (miRNAs) and for analyzing the miRNAs expression profiles. An overall evaluation of these diverse software tools is lacking. In this study, we evaluated eight software tools based on their common feature and key algorithms. Three deep-sequencing data sets were collected from different species and used to assess the computational time, sensitivity and accuracy of detecting known miRNAs as well as their capacity for predicting novel miRNAs. Our results provide useful information for researchers to facilitate their selection of the optimal software tools for miRNA analysis depending on their specific requirements, i.e. novel miRNAs discovery or miRNA expression profile analysis of sequencing data sets.
Collapse
Affiliation(s)
- Yue Li
- Center for Systems Biology, Soochow University, Suzhou 215006, China
| | | | | | | | | | | |
Collapse
|
99
|
Mackowiak SD. Identification of Novel and Known miRNAs in Deep‐Sequencing Data with miRDeep2. ACTA ACUST UNITED AC 2011; Chapter 12:12.10.1-12.10.15. [DOI: 10.1002/0471250953.bi1210s36] [Citation(s) in RCA: 84] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Sebastian D. Mackowiak
- Laboratory for Systems Biology of Gene Regulatory Elements, Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine Berlin Germany
| |
Collapse
|
100
|
Friedländer MR, Mackowiak SD, Li N, Chen W, Rajewsky N. miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades. Nucleic Acids Res 2011; 40:37-52. [PMID: 21911355 PMCID: PMC3245920 DOI: 10.1093/nar/gkr688] [Citation(s) in RCA: 2207] [Impact Index Per Article: 157.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
microRNAs (miRNAs) are a large class of small non-coding RNAs which post-transcriptionally regulate the expression of a large fraction of all animal genes and are important in a wide range of biological processes. Recent advances in high-throughput sequencing allow miRNA detection at unprecedented sensitivity, but the computational task of accurately identifying the miRNAs in the background of sequenced RNAs remains challenging. For this purpose, we have designed miRDeep2, a substantially improved algorithm which identifies canonical and non-canonical miRNAs such as those derived from transposable elements and informs on high-confidence candidates that are detected in multiple independent samples. Analyzing data from seven animal species representing the major animal clades, miRDeep2 identified miRNAs with an accuracy of 98.6-99.9% and reported hundreds of novel miRNAs. To test the accuracy of miRDeep2, we knocked down the miRNA biogenesis pathway in a human cell line and sequenced small RNAs before and after. The vast majority of the >100 novel miRNAs expressed in this cell line were indeed specifically downregulated, validating most miRDeep2 predictions. Last, a new miRNA expression profiling routine, low time and memory usage and user-friendly interactive graphic output can make miRDeep2 useful to a wide range of researchers.
Collapse
Affiliation(s)
- Marc R. Friedländer
- Laboratory for Systems Biology of Gene Regulatory Elements and Laboratory for New Sequencing Technology, Berlin Institute for Medical Systems Biology at the Max-Delbrück-Center for Molecular Medicine, Berlin-Buch 13125, Germany
| | - Sebastian D. Mackowiak
- Laboratory for Systems Biology of Gene Regulatory Elements and Laboratory for New Sequencing Technology, Berlin Institute for Medical Systems Biology at the Max-Delbrück-Center for Molecular Medicine, Berlin-Buch 13125, Germany
| | - Na Li
- Laboratory for Systems Biology of Gene Regulatory Elements and Laboratory for New Sequencing Technology, Berlin Institute for Medical Systems Biology at the Max-Delbrück-Center for Molecular Medicine, Berlin-Buch 13125, Germany
| | - Wei Chen
- Laboratory for Systems Biology of Gene Regulatory Elements and Laboratory for New Sequencing Technology, Berlin Institute for Medical Systems Biology at the Max-Delbrück-Center for Molecular Medicine, Berlin-Buch 13125, Germany
| | - Nikolaus Rajewsky
- Laboratory for Systems Biology of Gene Regulatory Elements and Laboratory for New Sequencing Technology, Berlin Institute for Medical Systems Biology at the Max-Delbrück-Center for Molecular Medicine, Berlin-Buch 13125, Germany
- * To whom correspondence should be addressed. Tel: +49 30 9406 2999; Fax: +49 30 9406 3068;
| |
Collapse
|