Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Bao E, Jiang T, Girke T. BRANCH: boosting RNA-Seq assemblies with partial or related genomic sequences. ACTA ACUST UNITED AC 2013;29:1250-9. [PMID: 23493323 DOI: 10.1093/bioinformatics/btt127] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

For:	Bao E, Jiang T, Girke T. BRANCH: boosting RNA-Seq assemblies with partial or related genomic sequences. ACTA ACUST UNITED AC 2013;29:1250-9. [PMID: 23493323 DOI: 10.1093/bioinformatics/btt127] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Number

Cited by Other Article(s)

Huo J, Liu Z, Zhang X, Li C, Xiang D, Fu G, Lin W, Wu L, Gong S, Zhao J, Wang Z, Wang X, Xiao Z, Hao F, Ren Y, Sun YH, Zhao G. Comprehensive visceral transcriptome profiling of three pig breeds along altitudinal gradients in Yunnan. Sci Data 2025;12:735. [PMID: 40319063 PMCID: PMC12049544 DOI: 10.1038/s41597-025-05070-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2025] [Accepted: 04/24/2025] [Indexed: 05/07/2025] Open

Affiliation(s)

Jinlong Huo College of Animal Science and Technology, Yunnan Agricultural University, Kunming, 650201, Yunnan, China.
Zhipeng Liu College of Animal Science and Technology, Yunnan Agricultural University, Kunming, 650201, Yunnan, China
Xia Zhang Department of Biological and Food Engineering, Lyuliang University, Lvliang, 033001, Shanxi, China
Changyao Li College of Animal Science and Technology, Yunnan Agricultural University, Kunming, 650201, Yunnan, China
Decai Xiang Yunnan Academy of Animal Husbandry and Veterinary Sciences, Kunming, 650224, Yunnan, China
Guowen Fu College of Veterinary Medicine, Yunnan Agricultural University, Kunming, 650201, Yunnan, China
Wan Lin College of Animal Science and Technology, Yunnan Agricultural University, Kunming, 650201, Yunnan, China
Lingxiang Wu College of Animal Science and Technology, Yunnan Agricultural University, Kunming, 650201, Yunnan, China
Shaorong Gong Baoshan Pig Research Institute, Baoshan, 678200, Yunnan, China
Jiading Zhao Baoshan Pig Research Institute, Baoshan, 678200, Yunnan, China
Zhen Wang Institute of Animal Husbandry and Veterinary Science of Diqing Tibetan Autonomous Prefecture, Diqing, 674499, Yunnan, China
Xiaohong Wang Animal Health Supervision Institute, Bureau of Agriculture and Rural Affairs of Shangri-la, Shangri-la, 674499, Yunnan, China
Zhiping Xiao Pure Land Agricultural Development Co., LTD, Shangri-la, 674401, Yunnan, China
Fanfan Hao School of Medicine and Dentistry, University of Rochester Medical center, Rochester, New York, 14642, USA
Yue Ren School of Medicine and Dentistry, University of Rochester Medical center, Rochester, New York, 14642, USA
Yu H Sun Department of Biology, University of Rochester, Rochester, New York, 14627, USA.
Guiying Zhao College of Animal Science and Technology, Yunnan Agricultural University, Kunming, 650201, Yunnan, China.

Collapse

Castric V, Batista RA, Carré A, Mousavi S, Mazoyer C, Godé C, Gallina S, Ponitzki C, Theron A, Bellec A, Marande W, Santoni S, Mariotti R, Rubini A, Legrand S, Billiard S, Vekemans X, Vernet P, Saumitou-Laprade P. The homomorphic self-incompatibility system in Oleaceae is controlled by a hemizygous genomic region expressing a gibberellin pathway gene. Curr Biol 2024;34:1967-1976.e6. [PMID: 38626763 DOI: 10.1016/j.cub.2024.03.047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 02/29/2024] [Accepted: 03/25/2024] [Indexed: 04/18/2024]

Shabbir M, Mithani A. Roast: a tool for reference-free optimization of supertranscriptome assemblies. BMC Bioinformatics 2024;25:2. [PMID: 38166712 PMCID: PMC10763045 DOI: 10.1186/s12859-023-05614-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 12/12/2023] [Indexed: 01/05/2024] Open

Williams L, Tomescu AI, Mumey B. Flow Decomposition With Subpath Constraints. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023;20:360-370. [PMID: 35104222 DOI: 10.1109/tcbb.2022.3147697] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Caceres M, Mumey B, Husic E, Rizzi R, Cairo M, Sahlin K, Tomescu AI. Safety in Multi-Assembly via Paths Appearing in All Path Covers of a DAG. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022;19:3673-3684. [PMID: 34847041 DOI: 10.1109/tcbb.2021.3131203] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

Lee SG, Na D, Park C. Comparability of reference-based and reference-free transcriptome analysis approaches at the gene expression level. BMC Bioinformatics 2021;22:310. [PMID: 34674628 PMCID: PMC8529712 DOI: 10.1186/s12859-021-04226-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2021] [Accepted: 06/01/2021] [Indexed: 11/10/2022] Open

Luo Y, Liao X, Wu FX, Wang J. Computational Approaches for Transcriptome Assembly Based on Sequencing Technologies. Curr Bioinform 2020. [DOI: 10.2174/1574893614666190410155603] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Utilization of Tissue Ploidy Level Variation in de Novo Transcriptome Assembly of Pinus sylvestris. G3-GENES GENOMES GENETICS 2019;9:3409-3421. [PMID: 31427456 PMCID: PMC6778806 DOI: 10.1534/g3.119.400357] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]

Abstract

Compared to angiosperms, gymnosperms lag behind in the availability of assembled and annotated genomes. Most genomic analyses in gymnosperms, especially conifer tree species, rely on the use of de novo assembled transcriptomes. However, the level of allelic redundancy and transcript fragmentation in these assembled transcriptomes, and their effect on downstream applications have not been fully investigated. Here, we assessed three assembly strategies for short-reads data, including the utility of haploid megagametophyte tissue during de novo assembly as single-allele guides, for six individuals and five different tissues in Pinus sylvestris. We then contrasted haploid and diploid tissue genotype calls obtained from the assembled transcriptomes to evaluate the extent of paralog mapping. The use of the haploid tissue during assembly increased its completeness without reducing the number of assembled transcripts. Our results suggest that current strategies that rely on available genomic resources as guidance to minimize allelic redundancy are less effective than the application of strategies that cluster redundant assembled transcripts. The strategy yielding the lowest levels of allelic redundancy among the assembled transcriptomes assessed here was the generation of SuperTranscripts with Lace followed by CD-HIT clustering. However, we still observed some levels of heterozygosity (multiple gene fragments per transcript reflecting allelic redundancy) in this assembled transcriptome on the haploid tissue, indicating that further filtering is required before using these assemblies for downstream applications. We discuss the influence of allelic redundancy when these reference transcriptomes are used to select regions for probe design of exome capture baits and for estimation of population genetic diversity.

Collapse

Rey C, Veber P, Boussau B, Sémon M. CAARS: comparative assembly and annotation of RNA-Seq data. Bioinformatics 2019;35:2199-2207. [PMID: 30452539 PMCID: PMC6596894 DOI: 10.1093/bioinformatics/bty903] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2017] [Revised: 09/13/2018] [Accepted: 11/16/2018] [Indexed: 02/05/2023] Open

Fu S, Chang PL, Friesen ML, Teakle NL, Tarone AM, Sze SH. Identifying similar transcripts in a related organism from de Bruijn graphs of RNA-Seq data, with applications to the study of salt and waterlogging tolerance in Melilotus. BMC Genomics 2019;20:425. [PMID: 31167652 PMCID: PMC6551239 DOI: 10.1186/s12864-019-5702-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Lowe EK, Cuomo C, Arnone MI. Omics approaches to study gene regulatory networks for development in echinoderms. Brief Funct Genomics 2018;16:299-308. [PMID: 28957458 DOI: 10.1093/bfgp/elx012] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open

Armero A, Baudouin L, Bocs S, This D. Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut. PLoS One 2017;12:e0173300. [PMID: 28334050 PMCID: PMC5363918 DOI: 10.1371/journal.pone.0173300] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Accepted: 02/17/2017] [Indexed: 01/20/2023] Open

Abstract

The palms are a family of tropical origin and one of the main constituents of the ecosystems of these regions around the world. The two main species of palm represent different challenges: coconut (Cocos nucifera L.) is a source of multiple goods and services in tropical communities, while oil palm (Elaeis guineensis Jacq) is the main protagonist of the oil market. In this study, we present a workflow that exploits the comparative genomics between a target species (coconut) and a reference species (oil palm) to improve the transcriptomic data, providing a proteome useful to answer functional or evolutionary questions. This workflow reduces redundancy and fragmentation, two inherent problems of transcriptomic data, while preserving the functional representation of the target species. Our approach was validated in Arabidopsis thaliana using Arabidopsis lyrata and Capsella rubella as references species. This analysis showed the high sensitivity and specificity of our strategy, relatively independent of the reference proteome. The workflow increased the length of proteins products in A. thaliana by 13%, allowing, often, to recover 100% of the protein sequence length. In addition redundancy was reduced by a factor greater than 3. In coconut, the approach generated 29,366 proteins, 1,246 of these proteins deriving from new contigs obtained with the BRANCH software. The coconut proteome presented a functional profile similar to that observed in rice and an important number of metabolic pathways related to secondary metabolism. The new sequences found with BRANCH software were enriched in functions related to biotic stress. Our strategy can be used as a complementary step to de novo transcriptome assembly to get a representative proteome of a target species. The results of the current analysis are available on the website PalmComparomics (http://palm-comparomics.southgreen.fr/).

Collapse

Kong F, Saldarriaga OA, Spratt H, Osorio EY, Travi BL, Luxon BA, Melby PC. Transcriptional Profiling in Experimental Visceral Leishmaniasis Reveals a Broad Splenic Inflammatory Environment that Conditions Macrophages toward a Disease-Promoting Phenotype. PLoS Pathog 2017;13:e1006165. [PMID: 28141856 PMCID: PMC5283737 DOI: 10.1371/journal.ppat.1006165] [Citation(s) in RCA: 58] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2016] [Accepted: 01/03/2017] [Indexed: 11/23/2022] Open

Abstract

Visceral Leishmaniasis (VL), caused by the intracellular protozoan Leishmania donovani, is characterized by relentlessly increasing visceral parasite replication, cachexia, massive splenomegaly, pancytopenia and ultimately death. Progressive disease is considered to be due to impaired effector T cell function and/or failure of macrophages to be activated to kill the intracellular parasite. In previous studies, we used the Syrian hamster (Mesocricetus auratus) as a model because it mimics the progressive nature of active human VL. We demonstrated previously that mixed expression of macrophage-activating (IFN-γ) and regulatory (IL-4, IL-10, IL-21) cytokines, parasite-induced expression of macrophage arginase 1 (Arg1), and decreased production of nitric oxide are key immunopathologic factors. Here we examined global changes in gene expression to define the splenic environment and phenotype of splenic macrophages during progressive VL. We used RNA sequencing coupled with de novo transcriptome assembly, because the Syrian hamster does not have a fully sequenced and annotated reference genome. Differentially expressed transcripts identified a highly inflammatory spleen environment with abundant expression of type I and type II interferon response genes. However, high IFN-γ expression was ineffective in directing exclusive M1 macrophage polarization, suppressing M2-associated gene expression, and restraining parasite replication and disease. While many IFN-inducible transcripts were upregulated in the infected spleen, fewer were induced in splenic macrophages in VL. Paradoxically, IFN-γ enhanced parasite growth and induced the counter-regulatory molecules Arg1, Ido1 and Irg1 in splenic macrophages. This was mediated, at least in part, through IFN-γ-induced activation of STAT3 and expression of IL-10, which suggests that splenic macrophages in VL are conditioned to respond to macrophage activation signals with a counter-regulatory response that is ineffective and even disease-promoting. Accordingly, inhibition of STAT3 activation led to a reduced parasite load in infected macrophages. Thus, the STAT3 pathway offers a rational target for adjunctive host-directed therapy to interrupt the pathogenesis of VL.

Visceral leishmaniasis (VL) is a neglected parasitic disease that is caused by the intracellular protozoan Leishmania donovani. Patients with this disease suffer from muscle wasting, enlargement of the spleen, reduced blood counts and ultimately will die without treatment. Progressive disease is considered to be due to impaired cellular immunity, with T cell or macrophage dysfunction, or both. We studied the Syrian hamster as an infection model because it mimics the progressive nature of human disease. We examined global changes in gene expression in the spleen and splenic macrophages during experimental VL and identified a highly inflammatory spleen environment with abundant expression of interferon and interferon-response genes that would be expected to control the infection. However, the high level of IFN-γ expression was ineffective in mediating a protective macrophage response, restraining parasite replication and halting progression of disease. We found that IFN-γ itself stimulated parasite growth in splenic macrophages and induced expression of counter-regulatory molecules, which may paradoxically make the host more susceptible. These data give insights into the nature of the immune response that promotes the infection, and identifies potential targets for therapeutic intervention.

Collapse

Affiliation(s)

Fanping Kong Bioinformatics Program, University of Texas Medical Branch, Galveston, Texas, United States of America Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, Texas, United States of America
Omar A. Saldarriaga Department of Internal Medicine, Division of Infectious Diseases, University of Texas Medical Branch, Galveston, Texas, United States of America
Heidi Spratt Bioinformatics Program, University of Texas Medical Branch, Galveston, Texas, United States of America Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, Texas, United States of America Department of Preventive Medicine and Community Health, University of Texas Medical Branch, Galveston, Texas, United States of America * E-mail: (PCM); (HS)
E. Yaneth Osorio Department of Internal Medicine, Division of Infectious Diseases, University of Texas Medical Branch, Galveston, Texas, United States of America
Bruno L. Travi Department of Internal Medicine, Division of Infectious Diseases, University of Texas Medical Branch, Galveston, Texas, United States of America Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston, Texas, United States of America Center for Tropical Diseases and Institute for Human Infection and Immunity, University of Texas Medical Branch, Galveston, Texas, United States of America
Bruce A. Luxon Bioinformatics Program, University of Texas Medical Branch, Galveston, Texas, United States of America Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, Texas, United States of America
Peter C. Melby Department of Internal Medicine, Division of Infectious Diseases, University of Texas Medical Branch, Galveston, Texas, United States of America Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston, Texas, United States of America Center for Tropical Diseases and Institute for Human Infection and Immunity, University of Texas Medical Branch, Galveston, Texas, United States of America Department of Pathology, University of Texas Medical Branch, Galveston, Texas, United States of America * E-mail: (PCM); (HS)

Collapse

Huang X, Chen XG, Armbruster PA. Comparative performance of transcriptome assembly methods for non-model organisms. BMC Genomics 2016;17:523. [PMID: 27464550 PMCID: PMC4964045 DOI: 10.1186/s12864-016-2923-8] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2015] [Accepted: 07/07/2016] [Indexed: 12/19/2022] Open

Abstract

Background

The technological revolution in next-generation sequencing has brought unprecedented opportunities to study any organism of interest at the genomic or transcriptomic level. Transcriptome assembly is a crucial first step for studying the molecular basis of phenotypes of interest using RNA-Sequencing (RNA-Seq). However, the optimal strategy for assembling vast amounts of short RNA-Seq reads remains unresolved, especially for organisms without a sequenced genome. This study compared four transcriptome assembly methods, including a widely used de novo assembler (Trinity), two transcriptome re-assembly strategies utilizing proteomic and genomic resources from closely related species (reference-based re-assembly and TransPS) and a genome-guided assembler (Cufflinks).

Results

These four assembly strategies were compared using a comprehensive transcriptomic database of Aedes albopictus, for which a genome sequence has recently been completed. The quality of the various assemblies was assessed by the number of contigs generated, contig length distribution, percent paired-end read mapping, and gene model representation via BLASTX. Our results reveal that de novo assembly generates a similar number of gene models relative to genome-guided assembly with a fragmented reference, but produces the highest level of redundancy and requires the most computational power. Using a closely related reference genome to guide transcriptome assembly can generate biased contig sequences. Increasing the number of reads used in the transcriptome assembly tends to increase the redundancy within the assembly and decrease both median contig length and percent identity between contigs and reference protein sequences.

Conclusions

This study provides general guidance for transcriptome assembly of RNA-Seq data from organisms with or without a sequenced genome. The optimal transcriptome assembly strategy will depend upon the subsequent downstream analyses. However, our results emphasize the efficacy of de novo assembly, which can be as effective as genome-guided assembly when the reference genome assembly is fragmented. If a genome assembly and sufficient computational resources are available, it can be beneficial to combine de novo and genome-guided assemblies. Caution should be taken when using a closely related reference genome to guide transcriptome assembly. The quantity of read pairs used in the transcriptome assembly does not necessarily correlate with the quality of the assembly.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-016-2923-8) contains supplementary material, which is available to authorized users.

Collapse

Bonizzoni P, Dondi R, Klau GW, Pirola Y, Pisanti N, Zaccaria S. On the Minimum Error Correction Problem for Haplotype Assembly in Diploid and Polyploid Genomes. J Comput Biol 2016;23:718-36. [PMID: 27280382 DOI: 10.1089/cmb.2015.0220] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Abstract

In diploid genomes, haplotype assembly is the computational problem of reconstructing the two parental copies, called haplotypes, of each chromosome starting from sequencing reads, called fragments, possibly affected by sequencing errors. Minimum error correction (MEC) is a prominent computational problem for haplotype assembly and, given a set of fragments, aims at reconstructing the two haplotypes by applying the minimum number of base corrections. MEC is computationally hard to solve, but some approximation-based or fixed-parameter approaches have been proved capable of obtaining accurate results on real data. In this work, we expand the current characterization of the computational complexity of MEC from the approximation and the fixed-parameter tractability point of view. In particular, we show that MEC is not approximable within a constant factor, whereas it is approximable within a logarithmic factor in the size of the input. Furthermore, we answer open questions on the fixed-parameter tractability for parameters of classical or practical interest: the total number of corrections and the fragment length. In addition, we present a direct 2-approximation algorithm for a variant of the problem that has also been applied in the framework of clustering data. Finally, since polyploid genomes, such as those of plants and fishes, are composed of more than two copies of the chromosomes, we introduce a novel formulation of MEC, namely the k-ploid MEC problem, that extends the traditional problem to deal with polyploid genomes. We show that the novel formulation is still both computationally hard and hard to approximate. Nonetheless, from the parameterized point of view, we prove that the problem is tractable for parameters of practical interest such as the number of haplotypes and the coverage, or the number of haplotypes and the fragment length.

Collapse

Bar I, Cummins S, Elizur A. Transcriptome analysis reveals differentially expressed genes associated with germ cell and gonad development in the Southern bluefin tuna (Thunnus maccoyii). BMC Genomics 2016;17:217. [PMID: 26965070 PMCID: PMC4785667 DOI: 10.1186/s12864-016-2397-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2015] [Accepted: 01/14/2016] [Indexed: 12/12/2022] Open

Abstract

BACKGROUND

Controlling and managing the breeding of bluefin tuna (Thunnus spp.) in captivity is an imperative step towards obtaining a sustainable supply of these fish in aquaculture production systems. Germ cell transplantation (GCT) is an innovative technology for the production of inter-species surrogates, by transplanting undifferentiated germ cells derived from a donor species into larvae of a host species. The transplanted surrogates will then grow and mature to produce donor-derived seed, thus providing a simpler alternative to maintaining large-bodied broodstock such as the bluefin tuna. Implementation of GCT for new species requires the development of molecular tools to follow the fate of the transplanted germ cells. These tools are based on key reproductive and germ cell-specific genes. RNA-Sequencing (RNA-Seq) provides a rapid, cost-effective method for high throughput gene identification in non-model species. This study utilized RNA-Seq to identify key genes expressed in the gonads of Southern bluefin tuna (Thunnus maccoyii, SBT) and their specific expression patterns in male and female gonad cells.

RESULTS

Key genes involved in the reproductive molecular pathway and specifically, germ cell development in gonads, were identified using analysis of RNA-Seq transcriptomes of male and female SBT gonad cells. Expression profiles of transcripts from ovary and testis cells were compared, as well as testis germ cell-enriched fraction prepared with Percoll gradient, as used in GCT studies. Ovary cells demonstrated over-expression of genes related to stem cell maintenance, while in testis cells, transcripts encoding for reproduction-associated receptors, sex steroids and hormone synthesis and signaling genes were over-expressed. Within the testis cells, the Percoll-enriched fraction showed over-expression of genes that are related to post-meiosis germ cell populations.

CONCLUSIONS

Gonad development and germ cell related genes were identified from SBT gonads and their expression patterns in ovary and testis cells were determined. These expression patterns correlate with the reproductive developmental stage of the sampled fish. The majority of the genes described in this study were sequenced for the first time in T. maccoyii. The wealth of SBT gonadal and germ cell-related gene sequences made publicly available by this study provides an extensive resource for further GCT and reproductive molecular biology studies of this commercially valuable fish.

Collapse

Liu J, Li G, Chang Z, Yu T, Liu B, McMullen R, Chen P, Huang X. BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data. PLoS Comput Biol 2016;12:e1004772. [PMID: 26894997 PMCID: PMC4760927 DOI: 10.1371/journal.pcbi.1004772] [Citation(s) in RCA: 80] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2015] [Accepted: 01/18/2016] [Indexed: 02/06/2023] Open

Abstract

High-throughput RNA-seq technology has provided an unprecedented opportunity to reveal the very complex structures of transcriptomes. However, it is an important and highly challenging task to assemble vast amounts of short RNA-seq reads into transcriptomes with alternative splicing isoforms. In this study, we present a novel de novo assembler, BinPacker, by modeling the transcriptome assembly problem as tracking a set of trajectories of items with their sizes representing coverage of their corresponding isoforms by solving a series of bin-packing problems. This approach, which subtly integrates coverage information into the procedure, has two exclusive features: 1) only splicing junctions are involved in the assembling procedure; 2) massive pell-mell reads are assembled seemingly by moving a comb along junction edges on a splicing graph. Being tested on both real and simulated RNA-seq datasets, it outperforms almost all the existing de novo assemblers on all the tested datasets, and even outperforms those ab initio assemblers on the real dog dataset. In addition, it runs substantially faster and requires less memory space than most of the assemblers. BinPacker is published under GNU GENERAL PUBLIC LICENSE and the source is available from: http://sourceforge.net/projects/transcriptomeassembly/files/BinPacker_1.0.tar.gz/download. Quick installation version is available from: http://sourceforge.net/projects/transcriptomeassembly/files/BinPacker_binary.tar.gz/download.

The availability of RNA-seq technology drives the development of algorithms for transcriptome assembly from very short RNA sequences. However, the problem of how to (de novo) assemble transcriptome using RNA-seq datasets has not been modeled well; e.g. sequence coverage information has even not been accurately and effectively integrated into the appropriate assembling procedure, leading to a bottleneck that all the existing (de novo) strategies have encountered. We present a novel approach to remodel the problem as tracking a set of trajectories of items with their sizes representing the coverage of their corresponding isoforms by solving a series of bin-packing problems. This approach, which subtly integrates the coverage information into the procedure, has two exclusive features: 1) only splicing junctions are involved in the assembling procedure; 2) massive pell-mell reads are assembled seemingly by moving a comb along junction edges on a splicing graph. Being tested on both real and simulated RNA-seq datasets, it outperforms almost all existing de novo assemblers on all the tested datasets, even outperforms those ab initio assemblers on the dog dataset, in terms of commonly used comparison standards.

Collapse

Deng F, Chen SY. dbHT-Trans: An Efficient Tool for Filtering the Protein-Encoding Transcripts Assembled by RNA-Seq According to Search for Homologous Proteins. J Comput Biol 2015;23:1-9. [PMID: 26484655 DOI: 10.1089/cmb.2015.0137] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Chang Z, Li G, Liu J, Zhang Y, Ashby C, Liu D, Cramer CL, Huang X. Bridger: a new framework for de novo transcriptome assembly using RNA-seq data. Genome Biol 2015;16:30. [PMID: 25723335 PMCID: PMC4342890 DOI: 10.1186/s13059-015-0596-2] [Citation(s) in RCA: 191] [Impact Index Per Article: 19.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2014] [Accepted: 01/23/2015] [Indexed: 11/24/2022] Open

Legeai F, Derrien T. Identification of long non-coding RNAs in insects genomes. CURRENT OPINION IN INSECT SCIENCE 2015;7:37-44. [PMID: 32846672 DOI: 10.1016/j.cois.2015.01.003] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2014] [Revised: 01/07/2015] [Accepted: 01/07/2015] [Indexed: 06/11/2023]

Rizzi R, Tomescu AI, Mäkinen V. On the complexity of Minimum Path Cover with Subpath Constraints for multi-assembly. BMC Bioinformatics 2014;15 Suppl 9:S5. [PMID: 25252805 PMCID: PMC4168716 DOI: 10.1186/1471-2105-15-s9-s5] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open

Abubucker S, McNulty SN, Rosa BA, Mitreva M. Identification and characterization of alternative splicing in parasitic nematode transcriptomes. Parasit Vectors 2014;7:151. [PMID: 24690220 PMCID: PMC3997825 DOI: 10.1186/1756-3305-7-151] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2014] [Accepted: 03/14/2014] [Indexed: 12/05/2022] Open

Abstract

Background

Alternative splicing (AS) of mRNA is a vital mechanism for enhancing genomic complexity in eukaryotes. Spliced isoforms of the same gene can have diverse molecular and biological functions and are often differentially expressed across various tissues, times, and conditions. Thus, AS has important implications in the study of parasitic nematodes with complex life cycles. Transcriptomic datasets are available from many species, but data must be revisited with splice-aware assembly protocols to facilitate the study of AS in helminthes.

Methods

We sequenced cDNA from the model worm Caenorhabditis elegans using 454/Roche technology for use as an experimental dataset. Reads were assembled with Newbler software, invoking the cDNA option. Several combinations of parameters were tested and assembled transcripts were verified by comparison with previously reported C. elegans genes and transcript isoforms and with Illumina RNAseq data.

Results

Thoughtful adjustment of program parameters increased the percentage of assembled transcripts that matched known C. elegans sequences, decreased mis-assembly rates (i.e., cis- and trans-chimeras), and improved the coverage of the geneset. The optimized protocol was used to update de novo transcriptome assemblies from nine parasitic nematode species, including important pathogens of humans and domestic animals. Our assemblies indicated AS rates in the range of 20-30%, typically with 2-3 transcripts per AS locus, depending on the species. Transcript isoforms from the nine species were translated and searched for similarity to known proteins and functional domains. Some 21 InterPro domains, including several involved in nucleotide and chromatin binding, were statistically correlated with AS genetic loci. In most cases, the Roche/454 data explored in this study are the only sequences available from the species in question; however, the recently published genome of the human hookworm Necator americanus provided an additional opportunity to validate our results.

Conclusions

Our optimized assembly parameters facilitated the first survey of AS among parasitic nematodes. The nine transcriptome assemblies, their protein translations, and basic annotations are available from Nematode.net as a resource for the research community. These should be useful for studies of specific genes and gene families of interest as well as for curating draft genome assemblies as they become available.

Collapse