1
|
Wang J, Chen Y, Zou Q. Inferring gene regulatory network from single-cell transcriptomes with graph autoencoder model. PLoS Genet 2023; 19:e1010942. [PMID: 37703293 PMCID: PMC10519590 DOI: 10.1371/journal.pgen.1010942] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Revised: 09/25/2023] [Accepted: 08/29/2023] [Indexed: 09/15/2023] Open
Abstract
The gene regulatory structure of cells involves not only the regulatory relationship between two genes, but also the cooperative associations of multiple genes. However, most gene regulatory network inference methods for single cell only focus on and infer the regulatory relationships of pairs of genes, ignoring the global regulatory structure which is crucial to identify the regulations in the complex biological systems. Here, we proposed a graph-based Deep learning model for Regulatory networks Inference among Genes (DeepRIG) from single-cell RNA-seq data. To learn the global regulatory structure, DeepRIG builds a prior regulatory graph by transforming the gene expression of data into the co-expression mode. Then it utilizes a graph autoencoder model to embed the global regulatory information contained in the graph into gene latent embeddings and to reconstruct the gene regulatory network. Extensive benchmarking results demonstrate that DeepRIG can accurately reconstruct the gene regulatory networks and outperform existing methods on multiple simulated networks and real-cell regulatory networks. Additionally, we applied DeepRIG to the samples of human peripheral blood mononuclear cells and triple-negative breast cancer, and presented that DeepRIG can provide accurate cell-type-specific gene regulatory networks inference and identify novel regulators of progression and inhibition.
Collapse
Affiliation(s)
- Jiacheng Wang
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, China
| | - Yaojia Chen
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, China
| |
Collapse
|
2
|
Zhou HX. Power law in a bounded range: Estimating the lower and upper bounds from sample data. J Chem Phys 2023; 158:191103. [PMID: 37184002 PMCID: PMC10188205 DOI: 10.1063/5.0151614] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 04/26/2023] [Indexed: 05/16/2023] Open
Abstract
Power law distributions are widely observed in chemical physics, geophysics, biology, and beyond. The independent variable x of these distributions has an obligatory lower bound and, in many cases, also an upper bound. Estimating these bounds from sample data is notoriously difficult, with a recent method involving O(N3) operations, where N denotes sample size. Here I develop an approach for estimating the lower and upper bounds that involve O(N) operations. The approach centers on calculating the mean values, x̂min and x̂max, of the smallest x and the largest x in N-point samples. A fit of x̂min or x̂max as a function of N yields the estimate for the lower or upper bound. Application to synthetic data demonstrates the accuracy and reliability of this approach.
Collapse
Affiliation(s)
- Huan-Xiang Zhou
- Department of Chemistry and Department of Physics, University of Illinois Chicago, Chicago, Illinois 60607, USA
| |
Collapse
|
3
|
Lazzardi S, Valle F, Mazzolini A, Scialdone A, Caselle M, Osella M. Emergent statistical laws in single-cell transcriptomic data. Phys Rev E 2023; 107:044403. [PMID: 37198814 DOI: 10.1103/physreve.107.044403] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Accepted: 03/24/2023] [Indexed: 05/19/2023]
Abstract
Large-scale data on single-cell gene expression have the potential to unravel the specific transcriptional programs of different cell types. The structure of these expression datasets suggests a similarity with several other complex systems that can be analogously described through the statistics of their basic building blocks. Transcriptomes of single cells are collections of messenger RNA abundances transcribed from a common set of genes just as books are different collections of words from a shared vocabulary, genomes of different species are specific compositions of genes belonging to evolutionary families, and ecological niches can be described by their species abundances. Following this analogy, we identify several emergent statistical laws in single-cell transcriptomic data closely similar to regularities found in linguistics, ecology, or genomics. A simple mathematical framework can be used to analyze the relations between different laws and the possible mechanisms behind their ubiquity. Importantly, treatable statistical models can be useful tools in transcriptomics to disentangle the actual biological variability from general statistical effects present in most component systems and from the consequences of the sampling process inherent to the experimental technique.
Collapse
Affiliation(s)
- Silvia Lazzardi
- Department of Physics, University of Turin and INFN, via P. Giuria 1, 10125 Turin, Italy
| | - Filippo Valle
- Department of Physics, University of Turin and INFN, via P. Giuria 1, 10125 Turin, Italy
| | - Andrea Mazzolini
- Laboratoire de Physique de l'École Normale Supérieure (PSL University), CNRS, Sorbonne Université and Université de Paris, 75005 Paris, France
| | - Antonio Scialdone
- Institute of Epigenetics and Stem Cells, Helmholtz Zentrum München, Feodor-Lynen-Straße 21, 81377 München, Germany and Institute of Functional Epigenetics and Institute of Computational Biology, Helmholtz Zentrum München, Ingolstädter Landstraße 1, 85764 Neuherberg, Germany
| | - Michele Caselle
- Department of Physics, University of Turin and INFN, via P. Giuria 1, 10125 Turin, Italy
| | - Matteo Osella
- Department of Physics, University of Turin and INFN, via P. Giuria 1, 10125 Turin, Italy
| |
Collapse
|
4
|
González-Arrué N, Inostroza I, Conejeros R, Rivas-Astroza M. Phenotype-specific estimation of metabolic fluxes using gene expression data. iScience 2023; 26:106201. [PMID: 36915687 PMCID: PMC10006673 DOI: 10.1016/j.isci.2023.106201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 11/30/2022] [Accepted: 02/10/2023] [Indexed: 02/17/2023] Open
Abstract
A cell's genome influences its metabolism via the expression of enzyme-related genes, but transcriptome and fluxome are not perfectly correlated as post-transcriptional mechanisms also regulate reaction's kinetics. Here, we addressed the question: given a transcriptome, how unobserved mechanisms of reaction kinetics should be systematically accounted for when inferring the fluxome? To infer the most likely and least biased fluxome, we present Pheflux, a constraint-based model maximizing Shannon's entropy of fluxes per mRNA. Benchmarked against 13C fluxes of yeast and bacteria, Pheflux accurately estimates the carbon core metabolism. We applied Pheflux to thousands of normal and tumor cell transcriptomes obtained from The Cancer Genome Atlas. Pheflux showed statistically significantly higher glucose yields on lactate in breast, kidney, and bronchus-lung tumoral cells than their normal counterparts. Results are consistent with the Warburg effect, a hallmark of cancer metabolism, suggesting that Pheflux can be efficiently used to study the metabolism of eukaryotic cells.
Collapse
Affiliation(s)
- Nicolás González-Arrué
- Universidad Tecnológica Metropolitana, Departamento de Biotecnología, Ñuñoa, Santiago 7800003, Chile
| | - Isidora Inostroza
- Universidad Tecnológica Metropolitana, Departamento de Biotecnología, Ñuñoa, Santiago 7800003, Chile
| | - Raúl Conejeros
- Pontificia Universidad Católica de Valparaíso, Escuela de Ingeniería Bioquímica, Valparaíso, 2362803, Chile
| | - Marcelo Rivas-Astroza
- Universidad Tecnológica Metropolitana, Departamento de Biotecnología, Ñuñoa, Santiago 7800003, Chile
| |
Collapse
|
5
|
Zhang J, Xu C. Gene product diversity: adaptive or not? Trends Genet 2022; 38:1112-1122. [PMID: 35641344 PMCID: PMC9560964 DOI: 10.1016/j.tig.2022.05.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Revised: 04/30/2022] [Accepted: 05/03/2022] [Indexed: 01/24/2023]
Abstract
One gene does not equal one RNA or protein. The genomic revolution has revealed numerous different RNA and protein molecules that can be produced from one gene, such as circular RNAs generated by back-splicing, proteins with residues mismatching the genomic encoding because of RNA editing, and proteins extended in the C terminus via stop codon readthrough in translation. Are these diverse products results of exquisite gene regulations or imprecise biological processes? While there are cases where the gene product diversity appears beneficial, genome-scale patterns suggest that much of this diversity arises from nonadaptive, molecular errors. This finding has important implications for studying the functions of diverse gene products and for understanding the fundamental properties and evolution of cellular life.
Collapse
Affiliation(s)
- Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA.
| | - Chuan Xu
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
| |
Collapse
|
6
|
Identifying toggle genes from transcriptome-wide scatter: A new perspective for biological regulation. Genomics 2021; 114:215-228. [PMID: 34843905 DOI: 10.1016/j.ygeno.2021.11.027] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 10/28/2021] [Accepted: 11/23/2021] [Indexed: 11/21/2022]
Abstract
The study of gene expression variability, especially for cancer and cell differentiation studies, has become important. Here, we investigate transcriptome-wide scatter of 23 cell types and conditions across different levels of biological complexity. We focused on genes that act like toggle switches between pairwise replicates of the same cell type, i.e. genes expressed in one replicate and not expressed in the other, sometimes also referred as ON/OFF genes. The proportion of these toggle genes dramatically increases from unicellular to multicellular organization, especially for development and cancer cells. A relevant portion of toggle switches are non-coding genes: in unicellular systems the most represented classes are tRNA and rRNA, while multicellular systems more frequently show lncRNA, sncRNA and pseudogenes. Notably, disease associated microRNAs (miRNAs), pseudogenes and numerous uncharacterized transcripts are present in both development and cancer cells. On top of the known intrinsic and extrinsic factors, our work indicates toggle genes as a novel collective component creating transcriptome-wide variability. This requires further investigation for elucidating both evolutionary and disease processes.
Collapse
|
7
|
Borella M, Martello G, Risso D, Romualdi C. PsiNorm: a scalable normalization for single-cell RNA-seq data. Bioinformatics 2021; 38:164-172. [PMID: 34499096 PMCID: PMC8696108 DOI: 10.1093/bioinformatics/btab641] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Revised: 08/30/2021] [Accepted: 09/06/2021] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION Single-cell RNA sequencing (scRNA-seq) enables transcriptome-wide gene expression measurements at single-cell resolution providing a comprehensive view of the compositions and dynamics of tissue and organism development. The evolution of scRNA-seq protocols has led to a dramatic increase of cells throughput, exacerbating many of the computational and statistical issues that previously arose for bulk sequencing. In particular, with scRNA-seq data all the analyses steps, including normalization, have become computationally intensive, both in terms of memory usage and computational time. In this perspective, new accurate methods able to scale efficiently are desirable. RESULTS Here, we propose PsiNorm, a between-sample normalization method based on the power-law Pareto distribution parameter estimate. Here, we show that the Pareto distribution well resembles scRNA-seq data, especially those coming from platforms that use unique molecular identifiers. Motivated by this result, we implement PsiNorm, a simple and highly scalable normalization method. We benchmark PsiNorm against seven other methods in terms of cluster identification, concordance and computational resources required. We demonstrate that PsiNorm is among the top performing methods showing a good trade-off between accuracy and scalability. Moreover, PsiNorm does not need a reference, a characteristic that makes it useful in supervised classification settings, in which new out-of-sample data need to be normalized. AVAILABILITY AND IMPLEMENTATION PsiNorm is implemented in the scone Bioconductor package and available at https://bioconductor.org/packages/scone/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Matteo Borella
- Department of Biology, University of Padova, Padua 35121, Italy
| | | | | | | |
Collapse
|
8
|
Vichi J, Salazar E, Jacinto VJ, Rodriguez LO, Grande R, Dantán-González E, Morett E, Hernández-Mendoza A. High-throughput transcriptome sequencing and comparative analysis of Escherichia coli and Schizosaccharomyces pombe in respiratory and fermentative growth. PLoS One 2021; 16:e0248513. [PMID: 33730068 PMCID: PMC7968713 DOI: 10.1371/journal.pone.0248513] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Accepted: 02/26/2021] [Indexed: 12/13/2022] Open
Abstract
In spite of increased complexity in eukaryotes compared to prokaryotes, several basic metabolic and regulatory processes are conserved. Here we explored analogies in the eubacteria Escherichia coli and the unicellular fission yeast Schizosaccharomyces pombe transcriptomes under two carbon sources: 2% glucose; or a mix of 2% glycerol and 0.2% sodium acetate using the same growth media and growth phase. Overall, twelve RNA-seq libraries were constructed. A total of 593 and 860 genes were detected as differentially expressed for E. coli and S. pombe, respectively, with a log2 of the Fold Change ≥ 1 and False Discovery Rate ≤ 0.05. In aerobic glycolysis, most of the expressed genes were associated with cell proliferation in both organisms, including amino acid metabolism and glycolysis. In contrast in glycerol/acetate condition, genes related to flagellar assembly and membrane proteins were differentially expressed such as the general transcription factors fliA, flhD, flhC, and flagellum assembly genes were detected in E. coli, whereas in S. pombe genes for hexose transporters, integral membrane proteins, galactose metabolism, and ncRNAs related to cellular stress were overexpressed. In general, our study shows that a conserved "foraging behavior" response is observed in these eukaryotic and eubacterial organisms in gluconeogenic carbon sources.
Collapse
Affiliation(s)
- Joivier Vichi
- Centro de Investigación en Dinámica Celular, Instituto de Investigación en Ciencias Básicas y Aplicadas, Universidad Autónoma del Estado de Morelos, Cuernavaca, Morelos, México
| | - Emmanuel Salazar
- Centro de Investigación en Dinámica Celular, Instituto de Investigación en Ciencias Básicas y Aplicadas, Universidad Autónoma del Estado de Morelos, Cuernavaca, Morelos, México
| | - Verónica Jiménez Jacinto
- Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Leticia Olvera Rodriguez
- Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Ricardo Grande
- Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Edgar Dantán-González
- Centro de Investigación en Biotecnología, Universidad Autónoma del Estado de Morelos, Cuernavaca, Morelos, México
| | - Enrique Morett
- Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Armando Hernández-Mendoza
- Centro de Investigación en Dinámica Celular, Instituto de Investigación en Ciencias Básicas y Aplicadas, Universidad Autónoma del Estado de Morelos, Cuernavaca, Morelos, México
- * E-mail:
| |
Collapse
|
9
|
Bui TT, Lee D, Selvarajoo K. ScatLay: utilizing transcriptome-wide noise for identifying and visualizing differentially expressed genes. Sci Rep 2020; 10:17483. [PMID: 33060728 PMCID: PMC7566603 DOI: 10.1038/s41598-020-74564-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2020] [Accepted: 09/28/2020] [Indexed: 01/10/2023] Open
Abstract
Differential expressed (DE) genes analysis is valuable for understanding comparative transcriptomics between cells, conditions or time evolution. However, the predominant way of identifying DE genes is to use arbitrary threshold fold or expression changes as cutoff. Here, we developed a more objective method, Scatter Overlay or ScatLay, to extract and graphically visualize DE genes across any two samples by utilizing their pair-wise scatter or transcriptome-wide noise, while factoring replicate variabilities. We tested ScatLay for 3 cell types: between time points for Escherichia coli aerobiosis and Saccharomyces cerevisiae hypoxia, and between untreated and Etomoxir treated Mus Musculus embryonic stem cell. As a result, we obtain 1194, 2061 and 2932 DE genes, respectively. Next, we compared these data with two widely used current approaches (DESeq2 and NOISeq) with typical twofold expression changes threshold, and show that ScatLay reveals significantly larger number of DE genes. Hence, our method provides a wider coverage of DE genes, and will likely pave way for finding more novel regulatory genes in future works.
Collapse
Affiliation(s)
- Thuy Tien Bui
- Singapore Institute of Food and Biotechnology Innovation, Agency for Science, Technology & Research (A*STAR), 61 Biopolis Drive, Singapore, 138673, Singapore
| | - Daniel Lee
- School of Computer Science and Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore, 639798, Singapore
| | - Kumar Selvarajoo
- Singapore Institute of Food and Biotechnology Innovation, Agency for Science, Technology & Research (A*STAR), 61 Biopolis Drive, Singapore, 138673, Singapore. .,Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, 28 Medical Drive, Singapore, 117456, Singapore.
| |
Collapse
|
10
|
Summers KM, Bush SJ, Hume DA. Network analysis of transcriptomic diversity amongst resident tissue macrophages and dendritic cells in the mouse mononuclear phagocyte system. PLoS Biol 2020; 18:e3000859. [PMID: 33031383 PMCID: PMC7575120 DOI: 10.1371/journal.pbio.3000859] [Citation(s) in RCA: 84] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2020] [Revised: 10/20/2020] [Accepted: 09/08/2020] [Indexed: 02/07/2023] Open
Abstract
The mononuclear phagocyte system (MPS) is a family of cells including progenitors, circulating blood monocytes, resident tissue macrophages, and dendritic cells (DCs) present in every tissue in the body. To test the relationships between markers and transcriptomic diversity in the MPS, we collected from National Center for Biotechnology Information Gene Expression Omnibus (NCBI-GEO) a total of 466 quality RNA sequencing (RNA-seq) data sets generated from mouse MPS cells isolated from bone marrow, blood, and multiple tissues. The primary data were randomly downsized to a depth of 10 million reads and requantified. The resulting data set was clustered using the network analysis tool BioLayout. A sample-to-sample matrix revealed that MPS populations could be separated based upon tissue of origin. Cells identified as classical DC subsets, cDC1s and cDC2s, and lacking Fcgr1 (encoding the protein CD64) were contained within the MPS cluster, no more distinct than other MPS cells. A gene-to-gene correlation matrix identified large generic coexpression clusters associated with MPS maturation and innate immune function. Smaller coexpression gene clusters, including the transcription factors that drive them, showed higher expression within defined isolated cells, including monocytes, macrophages, and DCs isolated from specific tissues. They include a cluster containing Lyve1 that implies a function in endothelial cell (EC) homeostasis, a cluster of transcripts enriched in intestinal macrophages, and a generic lymphoid tissue cDC cluster associated with Ccr7. However, transcripts encoding Adgre1, Itgax, Itgam, Clec9a, Cd163, Mertk, Mrc1, Retnla, and H2-a/e (encoding class II major histocompatibility complex [MHC] proteins) and many other proposed macrophage subset and DC lineage markers each had idiosyncratic expression profiles. Coexpression of immediate early genes (for example, Egr1, Fos, Dusp1) and inflammatory cytokines and chemokines (tumour necrosis factor [Tnf], Il1b, Ccl3/4) indicated that all tissue disaggregation and separation protocols activate MPS cells. Tissue-specific expression clusters indicated that all cell isolation procedures also co-purify other unrelated cell types that may interact with MPS cells in vivo. Comparative analysis of RNA-seq and single-cell RNA-seq (scRNA-seq) data from the same lung cell populations indicated that MPS heterogeneity implied by global cluster analysis may be even greater at a single-cell level. This analysis highlights the power of large data sets to identify the diversity of MPS cellular phenotypes and the limited predictive value of surface markers to define lineages, functions, or subpopulations.
Collapse
Affiliation(s)
- Kim M. Summers
- Mater Research Institute-University of Queensland, Translational Research Institute, Brisbane, Queensland, Australia
| | - Stephen J. Bush
- Nuffield Department of Clinical Medicine, John Radcliffe Hospital, University of Oxford, Oxford, United Kingdom
| | - David A. Hume
- Mater Research Institute-University of Queensland, Translational Research Institute, Brisbane, Queensland, Australia
- * E-mail:
| |
Collapse
|
11
|
Townes FW, Irizarry RA. Quantile normalization of single-cell RNA-seq read counts without unique molecular identifiers. Genome Biol 2020; 21:160. [PMID: 32620142 PMCID: PMC7333325 DOI: 10.1186/s13059-020-02078-0] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Accepted: 06/19/2020] [Indexed: 12/15/2022] Open
Abstract
Single-cell RNA-seq (scRNA-seq) profiles gene expression of individual cells. Unique molecular identifiers (UMIs) remove duplicates in read counts resulting from polymerase chain reaction, a major source of noise. For scRNA-seq data lacking UMIs, we propose quasi-UMIs: quantile normalization of read counts to a compound Poisson distribution empirically derived from UMI datasets. When applied to ground-truth datasets having both reads and UMIs, quasi-UMI normalization has higher accuracy than competing methods. Using quasi-UMIs enables methods designed specifically for UMI data to be applied to non-UMI scRNA-seq datasets.
Collapse
Affiliation(s)
- F. William Townes
- Department of Computer Science, Princeton University, Princeton, NJ USA
| | - Rafael A. Irizarry
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA USA
- Department of Biostatistics, Harvard University, Cambridge, MA USA
| |
Collapse
|
12
|
Nagai M, Kurokawa M, Ying BW. The highly conserved chromosomal periodicity of transcriptomes and the correlation of its amplitude with the growth rate in Escherichia coli. DNA Res 2020; 27:5899727. [PMID: 32866232 PMCID: PMC7508348 DOI: 10.1093/dnares/dsaa018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Accepted: 08/24/2020] [Indexed: 11/12/2022] Open
Abstract
The growth rate, representing the fitness of a bacterial population, is determined by the transcriptome. Chromosomal periodicity, which is known as the periodic spatial pattern of a preferred chromosomal distance in microbial genomes, is a representative overall feature of the transcriptome; however, whether and how it is associated with the bacterial growth rate are unknown. To address these questions, we analysed a total of 213 transcriptomes of multiple Escherichia coli strains growing in an assortment of culture conditions varying in terms of temperature, nutrition level and osmotic pressure. Intriguingly, Fourier transform analyses of the transcriptome identified a common chromosomal periodicity of transcriptomes, which was independent of the variation in genomes and environments. In addition, fitting of the data to a theoretical model, we found that the amplitudes of the periodic transcriptomes were significantly correlated with the growth rates. These results indicated that the amplitude of periodic transcriptomes is a parameter representing the global pattern of gene expression in correlation with the bacterial growth rate. Thus, our study provides a novel parameter for evaluating the adaptiveness of a growing bacterial population and quantitatively predicting the growth dynamics according to the global expression pattern.
Collapse
Affiliation(s)
- Motoki Nagai
- Graduate School of Life and Environmental Sciences, University of Tsukuba, Tsukuba, Ibaraki 305-8572, Japan
| | - Masaomi Kurokawa
- Graduate School of Life and Environmental Sciences, University of Tsukuba, Tsukuba, Ibaraki 305-8572, Japan
| | - Bei-Wen Ying
- Graduate School of Life and Environmental Sciences, University of Tsukuba, Tsukuba, Ibaraki 305-8572, Japan
| |
Collapse
|
13
|
Vislova A, Sosa OA, Eppley JM, Romano AE, DeLong EF. Diel Oscillation of Microbial Gene Transcripts Declines With Depth in Oligotrophic Ocean Waters. Front Microbiol 2019; 10:2191. [PMID: 31608031 PMCID: PMC6769238 DOI: 10.3389/fmicb.2019.02191] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2019] [Accepted: 09/06/2019] [Indexed: 11/13/2022] Open
Abstract
Diel oscillations in primary and secondary production, growth, metabolic activity, and gene expression commonly occur in marine microbial communities in ocean surface waters. Diel periodicity of gene transcription has been demonstrated in photoautotrophic and heterotrophic microbes in both coastal and open ocean environments. To better define the spatiotemporal distribution and patterns of these daily oscillations, we investigated how diel periodicity in gene transcripts changed with depth from the surface waters to the upper mesopelagic. We postulated that diel oscillation of transcript abundances would diminish at greater depths across the collective microbial community due to decreasing light availability. The results showed that the number and total proportion of gene transcripts and taxa exhibiting diel periodicity were greatest in the shallow sunlit mixed layer, diminished rapidly with increasing depth to the base of the euphotic zone, and could not be detected in the mesopelagic. The results confirmed an overall decrease in microbial diel transcript oscillation with depth through the euphotic zone and suggested a relationship between abundance of diel oscillating transcripts and the daily integrated light exposure experienced by planktonic microbes in the water column. Local dissolved macronutrient concentration also appeared to influence the diel transcriptional patterns of specific microbial genes. The diminishing diel transcript oscillations found at increasing depths suggest that diel patterns of other microbial processes and interactions may likewise be attenuated at depth.
Collapse
Affiliation(s)
- Alice Vislova
- Daniel K. Inouye Center for Microbial Oceanography: Research and Education, University of Hawaii, Honolulu, HI, United States
| | - Oscar A Sosa
- Daniel K. Inouye Center for Microbial Oceanography: Research and Education, University of Hawaii, Honolulu, HI, United States
| | - John M Eppley
- Daniel K. Inouye Center for Microbial Oceanography: Research and Education, University of Hawaii, Honolulu, HI, United States
| | - Anna E Romano
- Daniel K. Inouye Center for Microbial Oceanography: Research and Education, University of Hawaii, Honolulu, HI, United States
| | - Edward F DeLong
- Daniel K. Inouye Center for Microbial Oceanography: Research and Education, University of Hawaii, Honolulu, HI, United States
| |
Collapse
|
14
|
Naiyer S, Bhattacharya A, Bhattacharya S. Advances in Entamoeba histolytica Biology Through Transcriptomic Analysis. Front Microbiol 2019; 10:1921. [PMID: 31481949 PMCID: PMC6710346 DOI: 10.3389/fmicb.2019.01921] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Accepted: 08/05/2019] [Indexed: 12/13/2022] Open
Abstract
A large number of transcriptome-level studies in Entamoeba histolytica, the protozoan parasite that causes amoebiasis, have investigated gene expression patterns to help understand the pathology and biology of the organism. They have compared virulent and avirulent strains in lab culture and after tissue invasion, cells grown under different stress conditions, response to anti-amoebic drug treatments, and gene expression changes during the process of encystation. These studies have revealed interesting molecules/pathways that will help increase our mechanistic understanding of differentially expressed genes during growth perturbations and tissue invasion. Some of the important insights obtained from transcriptome studies include the observations that regulation of carbohydrate metabolism may be an important determinant for tissue invasion, while the novel up-regulated genes during encystation include phospholipase D, and meiotic genes, suggesting the possibility of meiosis during the process. Classification of genes according to expression levels showed that amongst the highly transcribed genes in cultured E. histolytica trophozoites were some virulence factors, raising the question of the role of these factors in normal parasite growth. Promoter motifs associated with differential gene expression and regulation were identified. Some of these motifs associated with high gene expression were located downstream of start codon, and were required for efficient transcription. The listing of E. histolytica genes according to transcript expression levels will help us determine the scale of post-transcriptional regulation, and the possible roles of predicted promoter motifs. The small RNA transcriptome is a valuable resource for detailed structural and functional analysis of these molecules and their regulatory roles. These studies provide new drug targets and enhance our understanding of gene regulation in E. histolytica.
Collapse
Affiliation(s)
- Sarah Naiyer
- School of Environmental Sciences, Jawaharlal Nehru University, New Delhi, India
| | - Alok Bhattacharya
- School of Life Sciences, Jawaharlal Nehru University, New Delhi, India
| | - Sudha Bhattacharya
- School of Environmental Sciences, Jawaharlal Nehru University, New Delhi, India
| |
Collapse
|
15
|
Matsunaga T, Muramatsu M. Self-organizing scale-free patterns in a phase-modulated periodic connecting system. BMC Res Notes 2019; 12:122. [PMID: 30836993 PMCID: PMC6402156 DOI: 10.1186/s13104-019-4149-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2018] [Accepted: 02/22/2019] [Indexed: 11/10/2022] Open
Abstract
OBJECTIVE The regularity of scale-free patterns in rank-size relations has been observed in word frequency, city size distribution, firm size distribution, and gene expression. Because of the common emergence of this regularity, understanding its mechanisms has been of great interest. For obtaining the scale-free pattern regularity, various models based on the rich-get-richer mechanism have been proposed; however, the overarching procedure of searching for the "rich" is in disagreement with the locally interacting behaviors seen in the aforementioned natural and social phenomena. RESULTS We implemented a computational model of a resource distribution system inspired by observations of word connectivity, which is created by local constraints with periodic and phase modulatory features. Here, we empirically demonstrated that a phase-modulated periodic connecting system can reach a dynamic equilibrium state as the most probable case, with the self-organizing scale-free patterns. The regularity could be a result of the configurational balance in spatiotemporal inequity during the resource distribution process with an adaptive constrained connectivity. Our results suggest that investigations of interferences of oscillating fluctuations in the system will elucidate the autoregulatory dynamic behavior.
Collapse
Affiliation(s)
- Tsutomu Matsunaga
- Research and Development Headquarters, NTT DATA Corporation, 3-3-9, Toyosu, Koto-ku, Tokyo, 135-8671, Japan.
| | - Masaaki Muramatsu
- Medical Research Institute, Tokyo Medical and Dental University, 1-5-45, Yushima, Bunkyo-ku, Tokyo, 113-8510, Japan
| |
Collapse
|
16
|
Ying BW, Yama K. Gene Expression Order Attributed to Genome Reduction and the Steady Cellular State in Escherichia coli. Front Microbiol 2018; 9:2255. [PMID: 30294319 PMCID: PMC6158460 DOI: 10.3389/fmicb.2018.02255] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Accepted: 09/04/2018] [Indexed: 11/13/2022] Open
Abstract
Transcriptomes not only reflect the growth status but also link to the genome in bacteria. To investigate if and how genome or cellular state changes contribute to the gene expression order, the growth profile-associated transcriptomes of an assortment of genetically differentiated Escherichia coli either exponentially growing under varied conditions or in response to environmental disturbance were analyzed. A total of 168 microarray data sets representing 56 transcriptome variations, were categorized by genome size (full length or reduced) and cellular state (steady or unsteady). At the genome-wide level, the power-law distribution of gene expression was found to be significantly disturbed by the genome size but not the cellular state. At the regulatory network level, more networks with improved coordination of growth rates were observed in genome reduction than at the steady state. At the single-gene level, both genome reduction and steady state increased the correlation of gene expression to growth rate, but the enriched gene categories with improved correlations were different. These findings not only illustrate the order of gene expression attributed to genome reduction and steady cellular state but also indicate that the accessory sequences acquired during genome evolution largely participated in the coordination of transcriptomes to growth fitness.
Collapse
Affiliation(s)
- Bei-Wen Ying
- Institute of Biology and Information Science, East China Normal University, Shanghai, China.,Faculty of Life and Environmental Sciences, University of Tsukuba, Tsukuba, Japan
| | - Kazuma Yama
- Advanced Analytical Science Laboratories, Research & Development Headquarters, Lion Corporation, Tokyo, Japan
| |
Collapse
|
17
|
Sato S, Horikawa M, Kondo T, Sato T, Setou M. A power law distribution of metabolite abundance levels in mice regardless of the time and spatial scale of analysis. Sci Rep 2018; 8:10315. [PMID: 29985415 PMCID: PMC6037760 DOI: 10.1038/s41598-018-28667-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2017] [Accepted: 06/26/2018] [Indexed: 11/29/2022] Open
Abstract
Biomolecule abundance levels change with the environment and enable a living system to adapt to the new conditions. Although, the living system maintains at least some characteristics, e.g. homeostasis. One of the characteristics maintained by a living system is a power law distribution of biomolecule abundance levels. Previous studies have pointed to a universal characteristic of biochemical reaction networks, with data obtained from lysates of multiple cells. As a result, the spatial scale of the data related to the power law distribution of biomolecule abundance levels is not clear. In this study, we researched the scaling law of metabolites in mouse tissue with a spatial scale of quantification that was changed stepwise between a whole-tissue section and a single-point analysis (25 μm). As a result, metabolites in mouse tissues were found to follow the power law distribution independently of the spatial scale of analysis. Additionally, we tested the temporal changes by comparing data from younger and older mice. Both followed similar power law distributions, indicating that metabolite composition is not diversified by aging to disrupt the power law distribution. The power law distribution of metabolite abundance is thus a robust characteristic of a living system regardless of time and space.
Collapse
Affiliation(s)
- Shumpei Sato
- Department of Cellular and Molecular Anatomy, Hamamatsu University School of Medicine, 1-20-1 Handayama Higashi-ku, Hamamatsu, Shizuoka, 431-3192, Japan
| | - Makoto Horikawa
- Department of Cellular and Molecular Anatomy, Hamamatsu University School of Medicine, 1-20-1 Handayama Higashi-ku, Hamamatsu, Shizuoka, 431-3192, Japan
- International Mass Imaging Center, Hamamatsu University School of Medicine, 1-20-1 Handayama Higashi-ku, Hamamatsu, Shizuoka, 431-3192, Japan
| | - Takeshi Kondo
- Department of Cellular and Molecular Anatomy, Hamamatsu University School of Medicine, 1-20-1 Handayama Higashi-ku, Hamamatsu, Shizuoka, 431-3192, Japan
- International Mass Imaging Center, Hamamatsu University School of Medicine, 1-20-1 Handayama Higashi-ku, Hamamatsu, Shizuoka, 431-3192, Japan
| | - Tomohito Sato
- Department of Cellular and Molecular Anatomy, Hamamatsu University School of Medicine, 1-20-1 Handayama Higashi-ku, Hamamatsu, Shizuoka, 431-3192, Japan
| | - Mitsutoshi Setou
- Department of Cellular and Molecular Anatomy, Hamamatsu University School of Medicine, 1-20-1 Handayama Higashi-ku, Hamamatsu, Shizuoka, 431-3192, Japan.
- International Mass Imaging Center, Hamamatsu University School of Medicine, 1-20-1 Handayama Higashi-ku, Hamamatsu, Shizuoka, 431-3192, Japan.
- Preeminent Medical Photonics Education & Research Center, 1-20-1 Handayama, Higashi-ku, Hamamatsu, Shizuoka, 431-3192, Japan.
- Department of Anatomy, The University of Hong Kong, 6/F, William MW Mong Block 21 Sassoon Road, Pokfulam, Hong Kong SAR, China.
| |
Collapse
|
18
|
Wong WC, Ng HK, Tantoso E, Soong R, Eisenhaber F. Finite-size effects in transcript sequencing count distribution: its power-law correction necessarily precedes downstream normalization and comparative analysis. Biol Direct 2018; 13:2. [PMID: 29433547 PMCID: PMC5809866 DOI: 10.1186/s13062-018-0204-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2017] [Accepted: 01/23/2018] [Indexed: 11/28/2022] Open
Abstract
BACKGROUND Though earlier works on modelling transcript abundance from vertebrates to lower eukaroytes have specifically singled out the Zip's law, the observed distributions often deviate from a single power-law slope. In hindsight, while power-laws of critical phenomena are derived asymptotically under the conditions of infinite observations, real world observations are finite where the finite-size effects will set in to force a power-law distribution into an exponential decay and consequently, manifests as a curvature (i.e., varying exponent values) in a log-log plot. If transcript abundance is truly power-law distributed, the varying exponent signifies changing mathematical moments (e.g., mean, variance) and creates heteroskedasticity which compromises statistical rigor in analysis. The impact of this deviation from the asymptotic power-law on sequencing count data has never truly been examined and quantified. RESULTS The anecdotal description of transcript abundance being almost Zipf's law-like distributed can be conceptualized as the imperfect mathematical rendition of the Pareto power-law distribution when subjected to the finite-size effects in the real world; This is regardless of the advancement in sequencing technology since sampling is finite in practice. Our conceptualization agrees well with our empirical analysis of two modern day NGS (Next-generation sequencing) datasets: an in-house generated dilution miRNA study of two gastric cancer cell lines (NUGC3 and AGS) and a publicly available spike-in miRNA data; Firstly, the finite-size effects causes the deviations of sequencing count data from Zipf's law and issues of reproducibility in sequencing experiments. Secondly, it manifests as heteroskedasticity among experimental replicates to bring about statistical woes. Surprisingly, a straightforward power-law correction that restores the distribution distortion to a single exponent value can dramatically reduce data heteroskedasticity to invoke an instant increase in signal-to-noise ratio by 50% and the statistical/detection sensitivity by as high as 30% regardless of the downstream mapping and normalization methods. Most importantly, the power-law correction improves concordance in significant calls among different normalization methods of a data series averagely by 22%. When presented with a higher sequence depth (4 times difference), the improvement in concordance is asymmetrical (32% for the higher sequencing depth instance versus 13% for the lower instance) and demonstrates that the simple power-law correction can increase significant detection with higher sequencing depths. Finally, the correction dramatically enhances the statistical conclusions and eludes the metastasis potential of the NUGC3 cell line against AGS of our dilution analysis. CONCLUSIONS The finite-size effects due to undersampling generally plagues transcript count data with reproducibility issues but can be minimized through a simple power-law correction of the count distribution. This distribution correction has direct implication on the biological interpretation of the study and the rigor of the scientific findings. REVIEWERS This article was reviewed by Oliviero Carugo, Thomas Dandekar and Sandor Pongor.
Collapse
Affiliation(s)
- Wing-Cheong Wong
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore, 138671 Singapore
| | - Hong-kiat Ng
- Cancer Science Institute of Singapore, National University of Singapore, Singapore, Singapore
| | - Erwin Tantoso
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore, 138671 Singapore
| | - Richie Soong
- Cancer Science Institute of Singapore, National University of Singapore, Singapore, Singapore
| | - Frank Eisenhaber
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore, 138671 Singapore
- School of Computer Engineering (SCE), Nanyang Technological University (NTU), 50 Nanyang Drive, Singapore, 637553 Singapore
| |
Collapse
|
19
|
Farkas Z, Kalapis D, Bódi Z, Szamecz B, Daraba A, Almási K, Kovács K, Boross G, Pál F, Horváth P, Balassa T, Molnár C, Pettkó-Szandtner A, Klement É, Rutkai E, Szvetnik A, Papp B, Pál C. Hsp70-associated chaperones have a critical role in buffering protein production costs. eLife 2018; 7:29845. [PMID: 29377792 PMCID: PMC5788500 DOI: 10.7554/elife.29845] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2017] [Accepted: 12/23/2017] [Indexed: 02/01/2023] Open
Abstract
Proteins are necessary for cellular growth. Concurrently, however, protein production has high energetic demands associated with transcription and translation. Here, we propose that activity of molecular chaperones shape protein burden, that is the fitness costs associated with expression of unneeded proteins. To test this hypothesis, we performed a genome-wide genetic interaction screen in baker's yeast. Impairment of transcription, translation, and protein folding rendered cells hypersensitive to protein burden. Specifically, deletion of specific regulators of the Hsp70-associated chaperone network increased protein burden. In agreement with expectation, temperature stress, increased mistranslation and a chemical misfolding agent all substantially enhanced protein burden. Finally, unneeded protein perturbed interactions between key components of the Hsp70-Hsp90 network involved in folding of native proteins. We conclude that specific chaperones contribute to protein burden. Our work indicates that by minimizing the damaging impact of gratuitous protein overproduction, chaperones enable tolerance to massive changes in genomic expression.
Collapse
Affiliation(s)
- Zoltán Farkas
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Dorottya Kalapis
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Zoltán Bódi
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Béla Szamecz
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Andreea Daraba
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Karola Almási
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Károly Kovács
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Gábor Boross
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Ferenc Pál
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Péter Horváth
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Tamás Balassa
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Csaba Molnár
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Aladár Pettkó-Szandtner
- Institute of Plant Biology, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary.,Laboratory of Proteomic Research, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Éva Klement
- Laboratory of Proteomic Research, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Edit Rutkai
- Division for Biotechnology, Bay Zoltán Nonprofit Ltd, Budapest, Hungary
| | - Attila Szvetnik
- Division for Biotechnology, Bay Zoltán Nonprofit Ltd, Budapest, Hungary
| | - Balázs Papp
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Csaba Pál
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| |
Collapse
|
20
|
Mao AS, Shin JW, Mooney DJ. Effects of substrate stiffness and cell-cell contact on mesenchymal stem cell differentiation. Biomaterials 2016; 98:184-91. [PMID: 27203745 DOI: 10.1016/j.biomaterials.2016.05.004] [Citation(s) in RCA: 203] [Impact Index Per Article: 22.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2015] [Revised: 04/18/2016] [Accepted: 05/02/2016] [Indexed: 11/19/2022]
Abstract
The mechanical properties of the microenvironment and direct contact-mediated cell-cell interactions are two variables known to be important in the determination of stem cell differentiation fate, but little is known about the interplay of these cues. Here, we use a micropatterning approach on polyacrylamide gels of tunable stiffnesses to study how homotypic cell-cell contacts and mechanical stiffness affect different stages of osteogenesis of mesenchymal stem cells (MSCs). Nuclear localization of transcription factors associated with osteogenesis depended on substrate stiffness and was independent of the degree of cell-cell contact. However, expression of alkaline phosphatase, an early protein marker for osteogenesis, increased only in cells with both direct contact with neighboring cells and adhesion to stiffer substrates. Finally, mature osteogenesis, as assessed by calcium deposition, was low in micropatterned cells, even on stiff substrates and in multicellular clusters. These results indicate that substrate stiffness and the presence of neighboring cells regulate osteogenesis in MSCs.
Collapse
Affiliation(s)
- Angelo S Mao
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, 29 Oxford St, Cambridge, MA 02138, USA; Wyss Institute for Biologically Inspired Engineering, Harvard University, 3 Blackfan Circle, Boston, MA 02115, USA
| | - Jae-Won Shin
- Department of Pharmacology, University of Illinois College of Medicine, Chicago, IL 60612, USA
| | - David J Mooney
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, 29 Oxford St, Cambridge, MA 02138, USA; Wyss Institute for Biologically Inspired Engineering, Harvard University, 3 Blackfan Circle, Boston, MA 02115, USA.
| |
Collapse
|
21
|
Chikashige Y, Arakawa S, Leibnitz K, Tsutsumi C, Mori C, Osakada H, Murata M, Haraguchi T, Hiraoka Y. Cellular economy in fission yeast cells continuously cultured with limited nitrogen resources. Sci Rep 2015; 5:15617. [PMID: 26486373 PMCID: PMC4614384 DOI: 10.1038/srep15617] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2015] [Accepted: 09/29/2015] [Indexed: 11/12/2022] Open
Abstract
In ribosome biogenesis, a large fraction of ribosomes is used for producing ribosomal proteins themselves. Here, we applied simulation and experimentation to determine what fraction of ribosomes should be allocated for the synthesis of ribosomal proteins to optimize cellular economy for growth. We define the “r-fraction” as the fraction of mRNA of the ribosomal protein genes out of the total mRNA, and we simulated the effect of the r-fraction on the number of ribosomes. We then empirically measured the amount of protein and RNA in fission yeast cells cultured with high and low nitrogen sources. In the cells cultured with a low nitrogen source, the r-fraction decreased from 0.46 to 0.42 with a 40% reduction of rRNA, but the reduction of the total protein was smaller at 30%. These results indicate that the r-fraction is internally controlled to optimize the efficiency of protein synthesis at a limited cellular cost.
Collapse
Affiliation(s)
- Yuji Chikashige
- Advanced ICT Research Institute Kobe, National Institute of Information and Communications Technology, 588-2 Iwaoka, Iwaoka-cho, Nishi-ku, Kobe 651-2492, Japan
| | - Shin'ichi Arakawa
- Graduate School of Information Science and Technology, Osaka University, 1-5 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Kenji Leibnitz
- Center for Information and Neural Networks, National Institute of Information and Communications Technology, 1-4 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Chihiro Tsutsumi
- Advanced ICT Research Institute Kobe, National Institute of Information and Communications Technology, 588-2 Iwaoka, Iwaoka-cho, Nishi-ku, Kobe 651-2492, Japan
| | - Chie Mori
- Advanced ICT Research Institute Kobe, National Institute of Information and Communications Technology, 588-2 Iwaoka, Iwaoka-cho, Nishi-ku, Kobe 651-2492, Japan
| | - Hiroko Osakada
- Advanced ICT Research Institute Kobe, National Institute of Information and Communications Technology, 588-2 Iwaoka, Iwaoka-cho, Nishi-ku, Kobe 651-2492, Japan
| | - Masayuki Murata
- Graduate School of Information Science and Technology, Osaka University, 1-5 Yamadaoka, Suita, Osaka 565-0871, Japan
| | - Tokuko Haraguchi
- Advanced ICT Research Institute Kobe, National Institute of Information and Communications Technology, 588-2 Iwaoka, Iwaoka-cho, Nishi-ku, Kobe 651-2492, Japan
| | - Yasushi Hiraoka
- Graduate School of Frontier Biosciences, Osaka University, 1-3 Yamadaoka, Suita, Osaka 565-0871, Japan
| |
Collapse
|
22
|
Simeoni O, Piras V, Tomita M, Selvarajoo K. Tracking global gene expression responses in T cell differentiation. Gene 2015; 569:259-66. [PMID: 26028587 DOI: 10.1016/j.gene.2015.05.061] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2015] [Revised: 04/28/2015] [Accepted: 05/26/2015] [Indexed: 12/24/2022]
Abstract
Upon receiving antigens from the innate immune cells, CD4(+) T cells differentiate into distinct effector cells. To probe the global responses of distinct effector cells, we analyzed transcriptome-wide expressions of Th1, Th2, Treg and Th17 using Pearson correlation, entropy and principal component analyses, with Th0 as a control. Although the global response of Th0 was quite distinct from Th17, surprisingly, it was highly similar to Th1, Th2 and Treg. Moreover, 8 major temporal groups consisting of 5704 differentially expressed genes were revealed for both Th0 and Th17. Gene functional enrichment analysis showed immune responses and metabolic processes were mainly activated between Th0 and Th17, while genes related to cell cycle and replication were differentially regulated. Moreover, we found the upregulation of several novel genes for Th0 and Th17. Overall, we deduce that Th0 is globally similar to Th1, Th2 and Treg. Our results indicate that Th0 is a differentiated state and, therefore, may not be used as a control cell type.
Collapse
Affiliation(s)
- Oriane Simeoni
- ENSEIRB-MATMECA, Bordeaux Institute of Technology, Talence, France; Institute for Advanced Biosciences, Keio University, Tsuruoka, Japan
| | - Vincent Piras
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Japan; Systems Biology Program, Graduate School of Media and Governance, Keio University, Fujisawa, Japan
| | - Masaru Tomita
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Japan; Systems Biology Program, Graduate School of Media and Governance, Keio University, Fujisawa, Japan
| | - Kumar Selvarajoo
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Japan; Systems Biology Program, Graduate School of Media and Governance, Keio University, Fujisawa, Japan.
| |
Collapse
|
23
|
Suzuki A, Matsushima K, Makinoshima H, Sugano S, Kohno T, Tsuchihara K, Suzuki Y. Single-cell analysis of lung adenocarcinoma cell lines reveals diverse expression patterns of individual cells invoked by a molecular target drug treatment. Genome Biol 2015; 16:66. [PMID: 25887790 PMCID: PMC4450998 DOI: 10.1186/s13059-015-0636-y] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2014] [Accepted: 03/18/2015] [Indexed: 12/20/2022] Open
Abstract
Background To understand the heterogeneous behaviors of individual cancer cells, it is essential to investigate gene expression levels as well as their divergence between different individual cells. Recent advances in next-generation sequencing-related technologies have enabled us to conduct a single-cell RNA-Seq analysis of a series of lung adenocarcinoma cell lines. Results We analyze a total of 336 single-cell RNA-Seq libraries from seven cell lines. The results are highly robust regarding both average expression levels and the relative gene expression differences between individual cells. Gene expression diversity is characteristic depending on genes and pathways. Analyses of individual cells treated with the multi-tyrosine kinase inhibitor vandetanib reveal that, while the ribosomal genes and many other so-called house-keeping genes reduce their relative expression diversity during the drug treatment, the genes that are directly targeted by vandetanib, the EGFR and RET genes, remain constant. Rigid transcriptional control of these genes may not allow plastic changes of their expression with the drug treatment or during the cellular acquisition of drug resistance. Additionally, we find that the gene expression patterns of cancer-related genes are sometimes more diverse than expected based on the founder cells. Furthermore, we find that this diversity is occasionally latent in a normal state and initially becomes apparent after the drug treatment. Conclusions Characteristic patterns in gene expression divergence, which would not be revealed by transcriptome analysis of bulk cells, may also play important roles when cells acquire drug resistance, perhaps by providing a cellular reservoir for gene expression programs. Electronic supplementary material The online version of this article (doi:10.1186/s13059-015-0636-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ayako Suzuki
- Department of Medical Genome Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 277-8562, Japan.
| | - Koutatsu Matsushima
- Division of TR, The Exploratory Oncology Research and Clinical Trial Center, National Cancer Center, Chiba, 277-8577, Japan.
| | - Hideki Makinoshima
- Division of TR, The Exploratory Oncology Research and Clinical Trial Center, National Cancer Center, Chiba, 277-8577, Japan.
| | - Sumio Sugano
- Department of Medical Genome Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 277-8562, Japan.
| | - Takashi Kohno
- Division of Genome Biology, National Cancer Center Research Institute, Tokyo, 104-0045, Japan. .,Division of TR, The Exploratory Oncology Research and Clinical Trial Center, National Cancer Center, Tokyo, 104-0045, Japan.
| | - Katsuya Tsuchihara
- Division of TR, The Exploratory Oncology Research and Clinical Trial Center, National Cancer Center, Chiba, 277-8577, Japan.
| | - Yutaka Suzuki
- Department of Medical Genome Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 277-8562, Japan. .,Department of Computational Biology, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 277-8562, Japan.
| |
Collapse
|
24
|
The reduction of gene expression variability from single cells to populations follows simple statistical laws. Genomics 2015; 105:137-44. [DOI: 10.1016/j.ygeno.2014.12.007] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2014] [Revised: 12/19/2014] [Accepted: 12/19/2014] [Indexed: 12/31/2022]
|
25
|
Behura SK, Severson DW. Bidirectional promoters of insects: genome-wide comparison, evolutionary implication and influence on gene expression. J Mol Biol 2014; 427:521-36. [PMID: 25463441 DOI: 10.1016/j.jmb.2014.11.008] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2014] [Revised: 10/31/2014] [Accepted: 11/07/2014] [Indexed: 10/24/2022]
Abstract
Bidirectional promoters are widespread in insect genomes. By analyzing 23 insect genomes we show that the frequency of bidirectional gene pairs varies according to genome compactness and density of genes among the species. The density of bidirectional genes expected based on number of genes per megabase of genome explains the observed density suggesting that bidirectional pairing of genes may be due to random event. We identified specific transcription factor binding motifs that are enriched in bidirectional promoters across insect species. Furthermore, we observed that bidirectional promoters may act as transcriptional hotspots in insect genomes where protein coding genes tend to aggregate in significantly biased (p < 0.001) manner compared to unidirectional promoters. Natural selection seems to have an association with the extent of bidirectionality of genes among the species. The rate of non-synonymous-to-synonymous changes (dN/dS) shows a second-order polynomial distribution with bidirectionality between species indicating that bidirectionality is dependent upon evolutionary pressure acting on the genomes. Analysis of genome-wide microarray expression data of multiple insect species suggested that bidirectionality has a similar association with transcriptome variation across species. Furthermore, bidirectional promoters show significant association with correlated expression of the divergent gene pairs depending upon their motif composition. Analysis of gene ontology showed that bidirectional genes tend to have a common association with functions related to "binding" (including ion binding, nucleotide binding and protein binding) across genomes. Such functional constraint of bidirectional genes may explain their widespread persistence in genome of diverse insect species.
Collapse
Affiliation(s)
- Susanta K Behura
- Eck Institute for Global Health and Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556, USA.
| | - David W Severson
- Eck Institute for Global Health and Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556, USA
| |
Collapse
|
26
|
Toseland A, Moxon S, Mock T, Moulton V. Metatranscriptomes from diverse microbial communities: assessment of data reduction techniques for rigorous annotation. BMC Genomics 2014; 15:901. [PMID: 25318651 PMCID: PMC4209020 DOI: 10.1186/1471-2164-15-901] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2014] [Accepted: 09/29/2014] [Indexed: 05/06/2023] Open
Abstract
Background Metatranscriptome sequence data can contain highly redundant sequences from diverse populations of microbes and so data reduction techniques are often applied before taxonomic and functional annotation. For metagenomic data, it has been observed that the variable coverage and presence of closely related organisms can lead to fragmented assemblies containing chimeric contigs that may reduce the accuracy of downstream analyses and some advocate the use of alternate data reduction techniques. However, it is unclear how such data reduction techniques impact the annotation of metatranscriptome data and thus affect the interpretation of the results. Results To investigate the effect of such techniques on the annotation of metatranscriptome data we assess two commonly employed methods: clustering and de-novo assembly. To do this, we also developed an approach to simulate 454 and Illumina metatranscriptome data sets with varying degrees of taxonomic diversity. For the Illumina simulations, we found that a two-step approach of assembly followed by clustering of contigs and unassembled sequences produced the most accurate reflection of the real protein domain content of the sample. For the 454 simulations, the combined annotation of contigs and unassembled reads produced the most accurate protein domain annotations. Conclusions Based on these data we recommend that assembly be attempted, and that unassembled reads be included in the final annotation for metatranscriptome data, even from highly diverse environments as the resulting annotations should lead to a more accurate reflection of the transcriptional behaviour of the microbial population under investigation. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-901) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Andrew Toseland
- School of Environmental Sciences, University of East Anglia, Norwich Research Park, Norwich, Norfolk NR4 7TJ, UK.
| | | | | | | |
Collapse
|
27
|
Yamagishi J, Wakaguri H, Yokoyama N, Yamashita R, Suzuki Y, Xuan X, Igarashi I. The Babesia bovis gene and promoter model: an update from full-length EST analysis. BMC Genomics 2014; 15:678. [PMID: 25124460 PMCID: PMC4148916 DOI: 10.1186/1471-2164-15-678] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2013] [Accepted: 07/08/2014] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND Babesia bovis is an apicomplexan parasite that causes babesiosis in infected cattle. Genomes of pathogens contain promising information that can facilitate the development of methods for controlling infections. Although the genome of B. bovis is publically available, annotated gene models are not highly reliable prior to experimental validation. Therefore, we validated a preproposed gene model of B. bovis and extended the associated annotations on the basis of experimentally obtained full-length expressed sequence tags (ESTs). RESULTS From in vitro cultured merozoites, 12,286 clones harboring full-length cDNAs were sequenced from both ends using the Sanger method, and 6,787 full-length cDNAs were assembled. These were then clustered, and a nonredundant referential data set of 2,115 full-length cDNA sequences was constructed. The comparison of the preproposed gene model with our data set identified 310 identical genes, 342 almost identical genes, 1,054 genes with potential structural inconsistencies, and 409 novel genes. The median length of 5' untranslated regions (UTRs) was 152 nt. Subsequently, we identified 4,086 transcription start sites (TSSs) and 2,023 transcriptionally active regions (TARs) by examining 5' ESTs. We identified ATGGGG and CCCCAT sites as consensus motifs in TARs that were distributed around -50 bp from TSSs. In addition, we found ACACA, TGTGT, and TATAT sites, which were distributed periodically around TSSs in cycles of approximately 150 bp. Moreover, related periodical distributions were not observed in mammalian promoter regions. CONCLUSIONS The observations in this study indicate the utility of integrated bioinformatics and experimental data for improving genome annotations. In particular, full-length cDNAs with one-base resolution for TSSs enabled the identification of consensus motifs in promoter sequences and demonstrated clear distributions of identified motifs. These observations allowed the illustration of a model promoter composition, which supports the differences in transcriptional regulation frameworks between apicomplexan parasites and mammals.
Collapse
Affiliation(s)
- Junya Yamagishi
- />Tohoku Medical Megabank Organization, Tohoku University, 6-3-09, aza Aoba, Sendai, Miyagi 980-8579 Japan
- />National Research Center for Protozoan Diseases, Obihiro University of Agriculture and Veterinary Medicine, Inada-cho west 2-13, Obihiro, Hokkaido 080-8555 Japan
| | - Hiroyuki Wakaguri
- />Department of Medical Genome Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8562 Japan
| | - Naoaki Yokoyama
- />National Research Center for Protozoan Diseases, Obihiro University of Agriculture and Veterinary Medicine, Inada-cho west 2-13, Obihiro, Hokkaido 080-8555 Japan
| | - Riu Yamashita
- />Tohoku Medical Megabank Organization, Tohoku University, 6-3-09, aza Aoba, Sendai, Miyagi 980-8579 Japan
| | - Yutaka Suzuki
- />Department of Medical Genome Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8562 Japan
| | - Xuenan Xuan
- />National Research Center for Protozoan Diseases, Obihiro University of Agriculture and Veterinary Medicine, Inada-cho west 2-13, Obihiro, Hokkaido 080-8555 Japan
| | - Ikuo Igarashi
- />National Research Center for Protozoan Diseases, Obihiro University of Agriculture and Veterinary Medicine, Inada-cho west 2-13, Obihiro, Hokkaido 080-8555 Japan
| |
Collapse
|
28
|
Kivelä M, Arnaud-Haond S, Saramäki J. EDENetworks: A user-friendly software to build and analyse networks in biogeography, ecology and population genetics. Mol Ecol Resour 2014; 15:117-22. [DOI: 10.1111/1755-0998.12290] [Citation(s) in RCA: 71] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2013] [Revised: 06/01/2014] [Accepted: 06/02/2014] [Indexed: 11/27/2022]
Affiliation(s)
- Mikko Kivelä
- Oxford Centre for Industrial and Applied Mathematics; Mathematical Institute; University of Oxford; Oxford UK
- Department of Biomedical Engineering and Computational Science; School of Science; Aalto University; Helsinki Finland
| | - Sophie Arnaud-Haond
- Ifremer; UMR «Ecosystèmes Marins Exploités»; Bd Jean Monnet BP 171 34203 Sète Cedex France
| | - Jari Saramäki
- Department of Biomedical Engineering and Computational Science; School of Science; Aalto University; Helsinki Finland
| |
Collapse
|
29
|
Kohen R, Dobra A, Tracy JH, Haugen E. Transcriptome profiling of human hippocampus dentate gyrus granule cells in mental illness. Transl Psychiatry 2014; 4:e366. [PMID: 24594777 PMCID: PMC3966046 DOI: 10.1038/tp.2014.9] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/19/2013] [Accepted: 01/06/2014] [Indexed: 12/20/2022] Open
Abstract
This study is, to the best of our knowledge, the first application of whole transcriptome sequencing (RNA-seq) to cells isolated from postmortem human brain by laser capture microdissection. We investigated the transcriptome of dentate gyrus (DG) granule cells in postmortem human hippocampus in 79 subjects with mental illness (schizophrenia, bipolar disorder, major depression) and nonpsychiatric controls. We show that the choice of normalization approach for analysis of RNA-seq data had a strong effect on results; under our experimental conditions a nonstandard normalization method gave superior results. We found evidence of disrupted signaling by miR-182 in mental illness. This was confirmed using a novel method of leveraging microRNA genetic variant information to indicate active targeting. In healthy subjects and those with bipolar disorder, carriers of a high- vs those with a low-expressing genotype of miR-182 had different levels of miR-182 target gene expression, indicating an active role of miR-182 in shaping the DG transcriptome for those subject groups. By contrast, comparing the transcriptome between carriers of different genotypes among subjects with major depression and schizophrenia suggested a loss of DG miR-182 signaling in these conditions.
Collapse
Affiliation(s)
- R Kohen
- Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA,Department of Psychiatry and Behavioral Sciences, University of Washington, 1959 Pacific Avenue NE, Seattle, WA 98195-6560, USA. E-mail:
| | - A Dobra
- Department of Statistics, University of Washington, Seattle, WA, USA,Department of Biobehavioral Nursing and Health Systems, University of Washington, Seattle, WA, USA,Center for Statistics and The Social Sciences, University of Washington, Seattle, WA, USA
| | - J H Tracy
- Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, WA, USA
| | - E Haugen
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| |
Collapse
|
30
|
Bhargava V, Head SR, Ordoukhanian P, Mercola M, Subramaniam S. Technical variations in low-input RNA-seq methodologies. Sci Rep 2014; 4:3678. [PMID: 24419370 PMCID: PMC3890974 DOI: 10.1038/srep03678] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2013] [Accepted: 12/13/2013] [Indexed: 12/16/2022] Open
Abstract
Recent advances in RNA-seq methodologies from limiting amounts of mRNA have facilitated the characterization of rare cell-types in various biological systems. So far, however, technical variations in these methods have not been adequately characterized, vis-à-vis sensitivity, starting with reduced levels of mRNA. Here, we generated sequencing libraries from limiting amounts of mRNA using three amplification-based methods, viz. Smart-seq, DP-seq and CEL-seq, and demonstrated significant technical variations in these libraries. Reduction in mRNA levels led to inefficient amplification of the majority of low to moderately expressed transcripts. Furthermore, noise in primer hybridization and/or enzyme incorporation was magnified during the amplification step resulting in significant distortions in fold changes of the transcripts. Consequently, the majority of the differentially expressed transcripts identified were either high-expressed and/or exhibited high fold changes. High technical variations ultimately masked subtle biological differences mandating the development of improved amplification-based strategies for quantitative transcriptomics from limiting amounts of mRNA.
Collapse
Affiliation(s)
- Vipul Bhargava
- Bioinformatics and Systems Biology Graduate Program, University of California at San Diego, La Jolla, California, USA
| | - Steven R Head
- Next Generation Sequencing Core Facility, The Scripps Research Institute, La Jolla, California, USA
| | - Phillip Ordoukhanian
- Next Generation Sequencing Core Facility, The Scripps Research Institute, La Jolla, California, USA
| | - Mark Mercola
- 1] Department of Bioengineering, University of California at San Diego, La Jolla, California, USA [2] Sanford-Burnham Medical Research Institute, La Jolla, California, USA
| | - Shankar Subramaniam
- 1] Bioinformatics and Systems Biology Graduate Program, University of California at San Diego, La Jolla, California, USA [2] Department of Bioengineering, University of California at San Diego, La Jolla, California, USA [3] Departments of Cellular and Molecular Medicine and Chemistry and Biochemistry, University of California at San Diego, La Jolla, California, USA
| |
Collapse
|
31
|
Nowrousian M. Fungal gene expression levels do not display a common mode of distribution. BMC Res Notes 2013; 6:559. [PMID: 24373411 PMCID: PMC3877863 DOI: 10.1186/1756-0500-6-559] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2013] [Accepted: 12/23/2013] [Indexed: 12/01/2022] Open
Abstract
Background RNA-seq studies in metazoa have revealed a distinct, double-peaked (bimodal) distribution of gene expression independent of species and cell type. However, two studies in filamentous fungi yielded conflicting results, with a bimodal distribution in Pyronema confluens and varying distributions in Sordaria macrospora. To obtain a broader overview of global gene expression distributions in fungi, an additional 60 publicly available RNA-seq data sets from six ascomycetes and one basidiomycete were analyzed with respect to gene expression distributions. Results Clustering of normalized, log2-transformed gene expression levels for each RNA-seq data set yielded distributions with one to five peaks. When only major peaks comprising at least 15% of all analyzed genes were considered, distributions ranged from one to three major peaks, suggesting that fungal gene expression is not generally bimodal. The number of peaks was not correlated with the phylogenetic position of a species; however, higher filamentous asco- and basidiomycetes showed up to three major peaks, whereas gene expression levels in the yeasts Saccharomyces cerevisiae and Schizosaccharomyces pombe had only one to two major peaks, with one predominant peak containing at least 70% of all expressed genes. In several species, the number of peaks varied even within a single species, e.g. depending on the growth conditions as evidenced in the one to three major peaks in different samples from Neurospora crassa. Earlier studies based on microarray and SAGE data revealed distributions of gene expression level that followed Zipf’s law, i.e. log-transformed gene expression levels were inversely proportional to the log-transformed expression rank of a gene. However, analyses of the fungal RNA-seq data sets could not identify any that confirmed to Zipf’s law. Conclusions Fungal gene expression patterns cannot generally be described by a single type of distribution (bimodal or Zipf’s law). One hypothesis to explain this finding might be that gene expression in fungi is highly dynamic, and fine-tuned at the level of transcription not only for individual genes, but also at a global level.
Collapse
Affiliation(s)
- Minou Nowrousian
- Lehrstuhl für Allgemeine und Molekulare Botanik, Ruhr-Universität Bochum, 44780 Bochum, Germany.
| |
Collapse
|
32
|
Hindmarch CCT, Franses P, Goodwin B, Murphy D. Whole transcriptome organisation in the dehydrated supraoptic nucleus. Braz J Med Biol Res 2013; 46:1000-1006. [PMID: 24345907 PMCID: PMC3935270 DOI: 10.1590/1414-431x20133328] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2013] [Accepted: 09/11/2013] [Indexed: 01/20/2023] Open
Abstract
The supraoptic nucleus (SON) is part of the central osmotic circuitry that synthesises the hormone vasopressin (Avp) and transports it to terminals in the posterior lobe of the pituitary. Following osmotic stress such as dehydration, this tissue undergoes morphological, electrical and transcriptional changes to facilitate the appropriate regulation and release of Avp into the circulation where it conserves water at the level of the kidney. Here, the organisation of the whole transcriptome following dehydration is modelled to fit Zipf's law, a natural power law that holds true for all natural languages, that states if the frequency of word usage is plotted against its rank, then the log linear regression of this is -1. We have applied this model to our previously published euhydrated and dehydrated SON data to observe this trend and how it changes following dehydration. In accordance with other studies, our whole transcriptome data fit well with this model in the euhydrated SON microarrays, but interestingly, fit better in the dehydrated arrays. This trend was observed in a subset of differentially regulated genes and also following network reconstruction using a third-party database that mines public data. We make use of language as a metaphor that helps us philosophise about the role of the whole transcriptome in providing a suitable environment for the delivery of Avp following a survival threat like dehydration.
Collapse
Affiliation(s)
- C C T Hindmarch
- University of Bristol, The Henry Wellcome Laboratories for Integrative Neuroscience and Endocrinology, Bristol, UK
| | | | | | - D Murphy
- University of Bristol, The Henry Wellcome Laboratories for Integrative Neuroscience and Endocrinology, Bristol, UK
| |
Collapse
|
33
|
Fang G, Passalacqua KD, Hocking J, Llopis PM, Gerstein M, Bergman NH, Jacobs-Wagner C. Transcriptomic and phylogenetic analysis of a bacterial cell cycle reveals strong associations between gene co-expression and evolution. BMC Genomics 2013; 14:450. [PMID: 23829427 PMCID: PMC3829707 DOI: 10.1186/1471-2164-14-450] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2012] [Accepted: 05/13/2013] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND The genetic network involved in the bacterial cell cycle is poorly understood even though it underpins the remarkable ability of bacteria to proliferate. How such network evolves is even less clear. The major aims of this work were to identify and examine the genes and pathways that are differentially expressed during the Caulobacter crescentus cell cycle, and to analyze the evolutionary features of the cell cycle network. RESULTS We used deep RNA sequencing to obtain high coverage RNA-Seq data of five C. crescentus cell cycle stages, each with three biological replicates. We found that 1,586 genes (over a third of the genome) display significant differential expression between stages. This gene list, which contains many genes previously unknown for their cell cycle regulation, includes almost half of the genes involved in primary metabolism, suggesting that these "house-keeping" genes are not constitutively transcribed during the cell cycle, as often assumed. Gene and module co-expression clustering reveal co-regulated pathways and suggest functionally coupled genes. In addition, an evolutionary analysis of the cell cycle network shows a high correlation between co-expression and co-evolution. Most co-expression modules have strong phylogenetic signals, with broadly conserved genes and clade-specific genes predominating different substructures of the cell cycle co-expression network. We also found that conserved genes tend to determine the expression profile of their module. CONCLUSION We describe the first phylogenetic and single-nucleotide-resolution transcriptomic analysis of a bacterial cell cycle network. In addition, the study suggests how evolution has shaped this network and provides direct biological network support that selective pressure is not on individual genes but rather on the relationship between genes, which highlights the importance of integrating phylogenetic analysis into biological network studies.
Collapse
Affiliation(s)
- Gang Fang
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06511, USA.
| | | | | | | | | | | | | |
Collapse
|
34
|
Nanostructured surfaces and detection instrumentation for photonic crystal enhanced fluorescence. SENSORS 2013; 13:5561-84. [PMID: 23624689 PMCID: PMC3690015 DOI: 10.3390/s130505561] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/21/2013] [Revised: 03/26/2013] [Accepted: 03/27/2013] [Indexed: 01/15/2023]
Abstract
Photonic crystal (PC) surfaces have been demonstrated as a compelling platform for improving the sensitivity of surface-based fluorescent assays used in disease diagnostics and life science research. PCs can be engineered to support optical resonances at specific wavelengths at which strong electromagnetic fields are utilized to enhance the intensity of surface-bound fluorophore excitation. Meanwhile, the leaky resonant modes of PCs can be used to direct emitted photons within a narrow range of angles for more efficient collection by a fluorescence detection system. The multiplicative effects of enhanced excitation combined with enhanced photon extraction combine to provide improved signal-to-noise ratios for detection of fluorescent emitters, which in turn can be used to reduce the limits of detection of low concentration analytes, such as disease biomarker proteins. Fabrication of PCs using inexpensive manufacturing methods and materials that include replica molding on plastic, nano-imprint lithography on quartz substrates result in devices that are practical for single-use disposable applications. In this review, we will describe the motivation for implementing high-sensitivity fluorescence detection in the context of molecular diagnosis and gene expression analysis though the use of PC surfaces. Recent efforts to improve the design and fabrication of PCs and their associated detection instrumentation are summarized, including the use of PCs coupled with Fabry-Perot cavities and external cavity lasers.
Collapse
|
35
|
Yoshioka W, Endo N, Kurashige A, Haijima A, Endo T, Shibata T, Nishiyama R, Kakeyama M, Tohyama C. Fluorescence laser microdissection reveals a distinct pattern of gene activation in the mouse hippocampal region. Sci Rep 2012; 2:783. [PMID: 23136640 PMCID: PMC3491666 DOI: 10.1038/srep00783] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2012] [Accepted: 10/10/2012] [Indexed: 02/06/2023] Open
Abstract
A histoanatomical context is imperative in an analysis of gene expression in a cell in a tissue to elucidate physiological function of the cell. In this study, we made technical advances in fluorescence laser microdissection (LMD) in combination with the absolute quantification of small amounts of mRNAs from a region of interest (ROI) in fluorescence-labeled tissue sections. We demonstrate that our fluorescence LMD-RTqPCR method has three orders of dynamic range, with the lower limit of ROI-size corresponding to a single cell. The absolute quantification of the expression levels of the immediate early genes in an ROI equivalent to a few hundred neurons in the hippocampus revealed that mice transferred from their home cage to a novel environment have distinct activation profiles in the hippocampal regions (CA1, CA3, and DG) and that the gene expression pattern in CA1, but not in the other regions, follows a power law distribution.
Collapse
Affiliation(s)
- Wataru Yoshioka
- Laboratory of Environmental Health Sciences, Center for Disease Biology and Integrative Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo 113-0033, Japan
| | - Nozomi Endo
- Laboratory of Environmental Health Sciences, Center for Disease Biology and Integrative Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo 113-0033, Japan
| | - Akie Kurashige
- Laboratory of Environmental Health Sciences, Center for Disease Biology and Integrative Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo 113-0033, Japan
| | - Asahi Haijima
- Laboratory of Environmental Health Sciences, Center for Disease Biology and Integrative Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo 113-0033, Japan
- Current address: Department of Integrative Physiology, Gunma University Graduate School of Medicine, Gunma 371-8511, Japan
| | - Toshihiro Endo
- Laboratory of Environmental Health Sciences, Center for Disease Biology and Integrative Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo 113-0033, Japan
| | - Toshiyuki Shibata
- Laboratory of Environmental Health Sciences, Center for Disease Biology and Integrative Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo 113-0033, Japan
- Department of Human Ecology, Graduate School of Medicine, The University of Tokyo, Tokyo 113-0033, Japan
| | - Ryutaro Nishiyama
- Research/Clinical/Industrial Division, Leica Microsystems K.K., Tokyo 108-0072, Japan
| | - Masaki Kakeyama
- Laboratory of Environmental Health Sciences, Center for Disease Biology and Integrative Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo 113-0033, Japan
| | - Chiharu Tohyama
- Laboratory of Environmental Health Sciences, Center for Disease Biology and Integrative Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo 113-0033, Japan
| |
Collapse
|
36
|
Wirth H, von Bergen M, Binder H. Mining SOM expression portraits: feature selection and integrating concepts of molecular function. BioData Min 2012; 5:18. [PMID: 23043905 PMCID: PMC3599960 DOI: 10.1186/1756-0381-5-18] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2011] [Accepted: 09/14/2012] [Indexed: 11/30/2022] Open
Abstract
Background Self organizing maps (SOM) enable the straightforward portraying of high-dimensional data of large sample collections in terms of sample-specific images. The analysis of their texture provides so-called spot-clusters of co-expressed genes which require subsequent significance filtering and functional interpretation. We address feature selection in terms of the gene ranking problem and the interpretation of the obtained spot-related lists using concepts of molecular function. Results Different expression scores based either on simple fold change-measures or on regularized Student’s t-statistics are applied to spot-related gene lists and compared with special emphasis on the error characteristics of microarray expression data. The spot-clusters are analyzed using different methods of gene set enrichment analysis with the focus on overexpression and/or overrepresentation of predefined sets of genes. Metagene-related overrepresentation of selected gene sets was mapped into the SOM images to assign gene function to different regions. Alternatively we estimated set-related overexpression profiles over all samples studied using a gene set enrichment score. It was also applied to the spot-clusters to generate lists of enriched gene sets. We used the tissue body index data set, a collection of expression data of human tissues as an illustrative example. We found that tissue related spots typically contain enriched populations of gene sets well corresponding to molecular processes in the respective tissues. In addition, we display special sets of housekeeping and of consistently weak and high expressed genes using SOM data filtering. Conclusions The presented methods allow the comprehensive downstream analysis of SOM-transformed expression data in terms of cluster-related gene lists and enriched gene sets for functional interpretation. SOM clustering implies the ability to define either new gene sets using selected SOM spots or to verify and/or to amend existing ones.
Collapse
Affiliation(s)
- Henry Wirth
- Interdisciplinary Centre for Bioinformatics of Leipzig University, Härtelstr, 16-18, D-4107, Leipzig, Germany.
| | | | | |
Collapse
|
37
|
|
38
|
Li S, Pandey S, Gookin TE, Zhao Z, Wilson L, Assmann SM. Gene-sharing networks reveal organizing principles of transcriptomes in Arabidopsis and other multicellular organisms. THE PLANT CELL 2012; 24:1362-78. [PMID: 22517316 PMCID: PMC3398552 DOI: 10.1105/tpc.111.094748] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2011] [Revised: 02/29/2012] [Accepted: 03/16/2012] [Indexed: 05/18/2023]
Abstract
Understanding tissue-related gene expression patterns can provide important insights into gene, tissue, and organ function. Transcriptome analyses often have focused on housekeeping or tissue-specific genes or on gene coexpression. However, by analyzing thousands of single-gene expression distributions in multiple tissues of Arabidopsis thaliana, rice (Oryza sativa), human (Homo sapiens), and mouse (Mus musculus), we found that these organisms primarily operate by gene sharing, a phenomenon where, in each organism, most genes exhibit a high expression level in a few key tissues. We designed an analytical pipeline to characterize this phenomenon and then derived Arabidopsis and human gene-sharing networks, in which tissues are connected solely based on the extent of shared preferentially expressed genes. The results show that tissues or cell types from the same organ system tend to group together to form network modules. Tissues that are in consecutive developmental stages or have common physiological functions are connected in these networks, revealing the importance of shared preferentially expressed genes in conferring specialized functions of each tissue type. The networks provide predictive power for each tissue type regarding gene functions of both known and heretofore unknown genes, as shown by the identification of four new genes with functions in guard cell and abscisic acid response. We provide a Web interface that enables, based on the extent of gene sharing, both prediction of tissue-related functions for any Arabidopsis gene of interest and predictions concerning the relatedness of tissues. Common gene-sharing patterns observed in the four model organisms suggest that gene sharing evolved as a fundamental organizing principle of gene expression in diverse multicellular eukaryotes.
Collapse
Affiliation(s)
- Song Li
- Biology Department, Pensylvania State University, University Park, Pensylvania 16802, USA.
| | | | | | | | | | | |
Collapse
|
39
|
Protein misinteraction avoidance causes highly expressed proteins to evolve slowly. Proc Natl Acad Sci U S A 2012; 109:E831-40. [PMID: 22416125 DOI: 10.1073/pnas.1117408109] [Citation(s) in RCA: 129] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The tempo and mode of protein evolution have been central questions in biology. Genomic data have shown a strong influence of the expression level of a protein on its rate of sequence evolution (E-R anticorrelation), which is currently explained by the protein misfolding avoidance hypothesis. Here, we show that this hypothesis does not fully explain the E-R anticorrelation, especially for protein surface residues. We propose that natural selection against protein-protein misinteraction, which wastes functional molecules and is potentially toxic, constrains the evolution of surface residues. Because highly expressed proteins are under stronger pressures to avoid misinteraction, surface residues are expected to show an E-R anticorrelation. Our molecular-level evolutionary simulation and yeast genomic analysis confirm multiple predictions of the hypothesis. These findings show a pluralistic origin of the E-R anticorrelation and reveal the role of protein misinteraction, an inherent property of complex cellular systems, in constraining protein evolution.
Collapse
|
40
|
Lerch JK, Kuo F, Motti D, Morris R, Bixby JL, Lemmon VP. Isoform diversity and regulation in peripheral and central neurons revealed through RNA-Seq. PLoS One 2012; 7:e30417. [PMID: 22272348 PMCID: PMC3260295 DOI: 10.1371/journal.pone.0030417] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2011] [Accepted: 12/15/2011] [Indexed: 11/19/2022] Open
Abstract
To fully understand cell type identity and function in the nervous system there is a need to understand neuronal gene expression at the level of isoform diversity. Here we applied Next Generation Sequencing of the transcriptome (RNA-Seq) to purified sensory neurons and cerebellar granular neurons (CGNs) grown on an axonal growth permissive substrate. The goal of the analysis was to uncover neuronal type specific isoforms as a prelude to understanding patterns of gene expression underlying their intrinsic growth abilities. Global gene expression patterns were comparable to those found for other cell types, in that a vast majority of genes were expressed at low abundance. Nearly 18% of gene loci produced more than one transcript. More than 8000 isoforms were differentially expressed, either to different degrees in different neuronal types or uniquely expressed in one or the other. Sensory neurons expressed a larger number of genes and gene isoforms than did CGNs. To begin to understand the mechanisms responsible for the differential gene/isoform expression we identified transcription factor binding sites present specifically in the upstream genomic sequences of differentially expressed isoforms, and analyzed the 3′ untranslated regions (3′ UTRs) for microRNA (miRNA) target sites. Our analysis defines isoform diversity for two neuronal types with diverse axon growth capabilities and begins to elucidate the complex transcriptional landscape in two neuronal populations.
Collapse
Affiliation(s)
- Jessica K. Lerch
- The Miami Project to Cure Paralysis, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
| | - Frank Kuo
- The Miami Project to Cure Paralysis, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
| | - Dario Motti
- The Miami Project to Cure Paralysis, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
| | - Richard Morris
- The Miami Project to Cure Paralysis, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
- The Hussman Institute for Human Genomics, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
| | - John L. Bixby
- The Miami Project to Cure Paralysis, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
- The Hussman Institute for Human Genomics, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
- Department of Cellular and Molecular Pharmacology, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
- * E-mail: (VPL); (JLB)
| | - Vance P. Lemmon
- The Miami Project to Cure Paralysis, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
- Department of Neurological Surgery, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
- * E-mail: (VPL); (JLB)
| |
Collapse
|
41
|
Rammelt C, Bilen B, Zavolan M, Keller W. PAPD5, a noncanonical poly(A) polymerase with an unusual RNA-binding motif. RNA (NEW YORK, N.Y.) 2011; 17:1737-46. [PMID: 21788334 PMCID: PMC3162338 DOI: 10.1261/rna.2787011] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
PAPD5 is one of the seven members of the family of noncanonical poly(A) polymerases in human cells. PAPD5 was shown to polyadenylate aberrant pre-ribosomal RNAs in vivo, similar to degradation-mediating polyadenylation by the noncanonical poly(A) polymerase Trf4p in yeast. PAPD5 has been reported to be also involved in the uridylation-dependent degradation of histone mRNAs. To test whether PAPD5 indeed catalyzes adenylation as well as uridylation of RNA substrates, we analyzed the in vitro properties of recombinant PAPD5 expressed in mammalian cells as well as in bacteria. Our results show that PAPD5 catalyzes the polyadenylation of different types of RNA substrates in vitro. Interestingly, PAPD5 is active without a protein cofactor, whereas its yeast homolog Trf4p is the catalytic subunit of a bipartite poly(A) polymerase in which a separate RNA-binding subunit is needed for activity. In contrast to the yeast protein, the C terminus of PAPD5 contains a stretch of basic amino acids that is involved in binding the RNA substrate.
Collapse
Affiliation(s)
- Christiane Rammelt
- Institute of Biochemistry and Biotechnology, Martin Luther University Halle-Wittenberg, D-06099 Halle, Germany
- Corresponding authors.E-mail .E-mail .
| | - Biter Bilen
- Biozentrum, University of Basel, CH-4056 Basel, Switzerland
| | | | - Walter Keller
- Biozentrum, University of Basel, CH-4056 Basel, Switzerland
- Corresponding authors.E-mail .E-mail .
| |
Collapse
|
42
|
Nacher JC, Ryabov VB. Nonlinear response of gene expression to chemical perturbations: a noise-detector model and its predictions. Biosystems 2011; 107:9-17. [PMID: 21871947 DOI: 10.1016/j.biosystems.2011.08.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2011] [Revised: 08/08/2011] [Accepted: 08/12/2011] [Indexed: 11/17/2022]
Abstract
The widespread use of microarrays provided a first glimpse at some simple laws and organizing principles that govern the transcriptome. Previous analyses have shown that the transcriptional organization is very heterogeneous and characterized by a power-law decay for gene expression levels. Moreover, a simple law was unveiled suggesting that gene expression dynamic changes under stress are proportional to their initial expression values. However, to elucidate and assess the underlying governing principles of transcriptional organization, we do not only need to identify them, but also provide theoretical models that are able to faithfully capture and reproduce them. Here we present a method to investigate the gene expression dynamics inspired by the theory of nonlinear transformation of random signals and noise. The model is able to explain not only the well-known power-law decay for abundance of expression levels, but also to reproduce the linear dependence of the standard deviation of gene expression change with respect to the initial expression value (also known as rich-travels-more dynamics). To our knowledge, this is the first model applied to gene expression dynamics that is able to simultaneously predict both statistical features. The theoretical framework derives an indicator to measure the coupling between gene expression and specific perturbations. Using genome-wide transcriptional data, this indicator identifies genes strongly coupled to specific inflammatory responses to different pathogens. The novel application of signal and noise theory to study intracellular responses and gene expression changes offers not only a new theoretical avenue to study transcriptional responses to environmental stresses and chemical signals but also provides predictive capability at the genome scale.
Collapse
Affiliation(s)
- Jose C Nacher
- Department of Complex and Intelligent Systems, Future University Hakodate, Kamedanakano-cho, Hokkaido, Japan.
| | | |
Collapse
|
43
|
Hebenstreit D, Teichmann SA. Analysis and simulation of gene expression profiles in pure and mixed cell populations. Phys Biol 2011; 8:035013. [PMID: 21572174 DOI: 10.1088/1478-3975/8/3/035013] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
For analysis and interpretation of data obtained from experimental readouts of gene expression, such as microarrays and RNA-sequencing, log transformation is routinely applied. This is because expression data, like many biological parameters, are strongly skewed. We show here that gene expression levels in multicellular organisms often deviate from simple (log) normal distributions and instead exhibit shouldered or bimodal distributions. Based on a mathematical model and numerical simulations, we demonstrate that many observed distributions can be explained as mixtures of bimodal two-component lognormal models. This is due to the fact that after log-transformation, the resulting distributions display reductions in the first peak rather than increasing overlaps over a wide range of parameter values. By comparing the theoretical results with biological datasets, our findings suggest that the distributions are generally bimodal for single cell types and get obscured by the different cell types that are present in tissue samples. Our analysis thus provides an initial explanation for the various types of expression level distributions that are found for different datasets. This will be important for the interpretation of next-generation sequencing data such as transcriptomics by mRNA-sequencing and ChIP-sequencing of epigenetic marks.
Collapse
|
44
|
Garcia-Reyero N, Habib T, Pirooznia M, Gust KA, Gong P, Warner C, Wilbanks M, Perkins E. Conserved toxic responses across divergent phylogenetic lineages: a meta-analysis of the neurotoxic effects of RDX among multiple species using toxicogenomics. ECOTOXICOLOGY (LONDON, ENGLAND) 2011; 20:580-594. [PMID: 21516383 DOI: 10.1007/s10646-011-0623-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 02/14/2011] [Indexed: 05/28/2023]
Abstract
At military training sites, a variety of pollutants such as hexahydro-1,3,5-trinitro-1,3,5-triazine (RDX), may contaminate the area originating from used munitions. Studies investigating the mechanism of toxicity of RDX have shown that it affects the central nervous system causing seizures in humans and animals. Environmental pollutants such as RDX have the potential to affect many different species, therefore it is important to establish how phylogenetically distant species may respond to these types of emerging pollutants. In this paper, we have used a transcriptional network approach to compare and contrast the neurotoxic effects of RDX among five phylogenetically disparate species: rat (Sprague-Dawley), Northern bobwhite quail (Colinus virginianus), fathead minnow (Pimephales promelas), earthworm (Eisenia fetida), and coral (Acropora formosa). Pathway enrichment analysis indicated a conservation of RDX impacts on pathways related to neuronal function in rat, Northern bobwhite quail, fathead minnows and earthworm, but not in coral. As evolutionary distance increased common responses decreased with impacts on energy and metabolism dominating effects in coral. A neurotransmission related transcriptional network based on whole rat brain responses to RDX exposure was used to identify functionally related modules of genes, components of which were conserved across species depending upon evolutionary distance. Overall, the meta-analysis using genomic data of the effects of RDX on several species suggested a common and conserved mode of action of the chemical throughout phylogenetically remote organisms.
Collapse
|
45
|
Abstract
The next-generation sequencing technologies are being rapidly applied in biological research. Tens of millions of short sequences generated in a single experiment provide us enormous information on genome composition, genetic variants, gene expression levels and protein binding sites depending on the applications. Various methods are being developed for analyzing the data generated by these technologies. However, the relevant experimental design issues have rarely been discussed. In this review, we use RNA-seq as an example to bring this topic into focus and to discuss experimental design and validation issues pertaining to next-generation sequencing in the quantification of transcripts.
Collapse
Affiliation(s)
- Zhide Fang
- Biostatistics Program, School of Public Health, Louisiana State University Health Sciences Center, New Orleans, USA
| | | |
Collapse
|
46
|
Yamashita R, Sathira NP, Kanai A, Tanimoto K, Arauchi T, Tanaka Y, Hashimoto SI, Sugano S, Nakai K, Suzuki Y. Genome-wide characterization of transcriptional start sites in humans by integrative transcriptome analysis. Genome Res 2011; 21:775-89. [PMID: 21372179 DOI: 10.1101/gr.110254.110] [Citation(s) in RCA: 106] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
We performed a genome-wide analysis of transcriptional start sites (TSSs) in human genes by multifaceted use of a massively parallel sequencer. By analyzing 800 million sequences that were obtained from various types of transcriptome analyses, we characterized 140 million TSS tags in 12 human cell types. Despite the large number of TSS clusters (TSCs), the number of TSCs was observed to decrease sharply with increasing expression levels. Highly expressed TSCs exhibited several characteristic features: Nucleosome-seq analysis revealed highly ordered nucleosome structures, ChIP-seq analysis detected clear RNA polymerase II binding signals in their surrounding regions, evaluations of previously sequenced and newly shotgun-sequenced complete cDNA sequences showed that they encode preferable transcripts for protein translation, and RNA-seq analysis of polysome-incorporated RNAs yielded direct evidence that those transcripts are actually translated into proteins. We also demonstrate that integrative interpretation of transcriptome data is essential for the selection of putative alternative promoter TSCs, two of which also have protein consequences. Furthermore, discriminative chromatin features that separate TSCs at different expression levels were found for both genic TSCs and intergenic TSCs. The collected integrative information should provide a useful basis for future biological characterization of TSCs.
Collapse
Affiliation(s)
- Riu Yamashita
- Frontier Research Initiative, Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan
| | | | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Abstract
With evolving interest in multiscalar biological systems one could assume that reductionist approaches may not fully describe biological complexity. Instead, tools such as mathematical modeling, network analysis, and other multiplexed clinical- and research-oriented tests enable rapid analyses of high-throughput data parsed at the genomic, proteomic, metabolomic, and physiomic levels. A physiomic-level approach allows for recursive horizontal and vertical integration of subsystem coupling across and within spatiotemporal scales. Additionally, this methodology recognizes previously ignored subsystems and the strong, nonintuitively obvious and indirect connections among physiological events that potentially account for the uncertainties in medicine. In this review, we flip the reductionist research paradigm and review the concept of systems biology and its applications to bone pathophysiology. Specifically, a bone-centric physiome model is presented that incorporates systemic-level processes with their respective therapeutic implications.
Collapse
Affiliation(s)
- Aaron J Weiss
- Division of Endocrinology, Diabetes, and Bone Disease, Mount Sinai School of Medicine, New York, New York, USA
| | | | | |
Collapse
|
48
|
Li YF, Arnold RJ, Tang H, Radivojac P. The importance of peptide detectability for protein identification, quantification, and experiment design in MS/MS proteomics. J Proteome Res 2010; 9:6288-97. [PMID: 21067214 PMCID: PMC3006185 DOI: 10.1021/pr1005586] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Peptide detectability is defined as the probability that a peptide is identified in an LC-MS/MS experiment and has been useful in providing solutions to protein inference and label-free quantification. Previously, predictors for peptide detectability trained on standard or complex samples were proposed. Although the models trained on complex samples may benefit from the large training data sets, it is unclear to what extent they are affected by the unequal abundances of identified proteins. To address this challenge and improve detectability prediction, we present a new algorithm for the iterative learning of peptide detectability from complex mixtures. We provide evidence that the new method approximates detectability with useful accuracy and, based on its design, can be used to interpret the outcome of other learning strategies. We studied the properties of peptides from the bacterium Deinococcus radiodurans and found that at standard quantities, its tryptic peptides can be roughly classified as either detectable or undetectable, with a relatively small fraction having medium detectability. We extend the concept of detectability from peptides to proteins and apply the model to predict the behavior of a replicate LC-MS/MS experiment from a single analysis. Finally, our study summarizes a theoretical framework for peptide/protein identification and label-free quantification.
Collapse
Affiliation(s)
- Yong Fuga Li
- School of Informatics and Computing, Indiana University, Bloomington, IN 47408
| | - Randy J. Arnold
- Department of Chemistry, Indiana University, Bloomington, IN 47405
| | - Haixu Tang
- School of Informatics and Computing, Indiana University, Bloomington, IN 47408
| | - Predrag Radivojac
- School of Informatics and Computing, Indiana University, Bloomington, IN 47408
| |
Collapse
|
49
|
López García De Lomana A, Beg QK, De Fabritiis G, Villà-Freixa J. Statistical analysis of global connectivity and activity distributions in cellular networks. J Comput Biol 2010; 17:869-78. [PMID: 20632868 DOI: 10.1089/cmb.2008.0240] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Various molecular interaction networks have been claimed to follow power-law decay for their global connectivity distribution. It has been proposed that there may be underlying generative models that explain this heavy-tailed behavior by self-reinforcement processes such as classical or hierarchical scale-free network models. Here, we analyze a comprehensive data set of protein-protein and transcriptional regulatory interaction networks in yeast, an Escherichia coli metabolic network, and gene activity profiles for different metabolic states in both organisms. We show that in all cases the networks have a heavy-tailed distribution, but most of them present significant differences from a power-law model according to a stringent statistical test. Those few data sets that have a statistically significant fit with a power-law model follow other distributions equally well. Thus, while our analysis supports that both global connectivity interaction networks and activity distributions are heavy-tailed, they are not generally described by any specific distribution model, leaving space for further inferences on generative models. Supplementary Material is available online at www.liebertonline.com.
Collapse
Affiliation(s)
- Adrián López García De Lomana
- Computational Biochemistry and Biophysics Laboratory, Research Unit on Biomedical Informatics, IMIM/Universitat Pompeu Fabra, Barcelona, Spain
| | | | | | | |
Collapse
|
50
|
Mathias PC, Jones SI, Wu HY, Yang F, Ganesh N, Gonzalez DO, Bollero G, Vodkin LO, Cunningham BT. Improved sensitivity of DNA microarrays using photonic crystal enhanced fluorescence. Anal Chem 2010; 82:6854-61. [PMID: 20704375 PMCID: PMC2921648 DOI: 10.1021/ac100841d] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
DNA microarrays are used to profile changes in gene expression between samples in a high-throughput manner, but measurements of genes with low expression levels can be problematic with standard microarray substrates. In this work, we expand the detection capabilities of a standard microarray experiment using a photonic crystal (PC) surface that enhances fluorescence observed from microarray spots. This PC is inexpensively and uniformly fabricated using a nanoreplica molding technique, with very little variation in its optical properties within- and between-devices. By using standard protocols to process glass microarray substrates in parallel with PCs, we evaluated the impact of this substrate on a one-color microarray experiment comparing gene expression in two developmental stages of Glycine max. The PCs enhanced the signal-to-noise ratio observed from microarray spots by 1 order of magnitude, significantly increasing the number of genes detected above substrate fluorescence noise. PC substrates more than double the number of genes classified as differentially expressed, detecting changes in expression even for low expression genes. This approach increases the dynamic range of a surface-bound fluorescence-based assay to reliably quantify small quantities of DNA that would be impossible with standard substrates.
Collapse
|