1
|
Gao Y, Liu GE, Ma L, Fang L, Li CJ, Baldwin RL. Transcriptomic profiling of gastrointestinal tracts in dairy cattle during lactation reveals molecular adaptations for milk synthesis. J Adv Res 2025; 71:67-80. [PMID: 38925453 DOI: 10.1016/j.jare.2024.06.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Revised: 06/11/2024] [Accepted: 06/21/2024] [Indexed: 06/28/2024] Open
Abstract
During lactation, dairy cattle's digestive tract requires significant adaptations to meet the increased nutrient demands for milk production. As we attempt to improve milk-related traits through selective pressure, it is crucial to understand the biological functions of the epithelia of the rumen, small intestine, and colonic tissues in response to changes in physiological state driven by changes in nutrient demands for milk synthesis. In this study, we obtained a total of 108 transcriptome profiles from three tissues (epithelia of the colon, duodenum, and rumen) of five Holstein cows, spanning eight time points from the early, mid, late lactation periods to the dry period. On average 97.06% of reads were successfully mapped to the reference genome assembly ARS-UCD1.2. We analyzed 27,607 gene expression patterns at multiple periods, enabling direct comparisons within and among tissues during different lactation stages, including early and peak lactation. We identified 1645, 813, and 2187 stage-specific genes in the colon, duodenum, and rumen, respectively, which were enriched for common or specific biological functions among different tissues. Time series analysis categorized the expressed genes within each tissue into four clusters. Furthermore, when the three tissues were analyzed collectively, 36 clusters of similarly expressed genes were identified. By integrating other comprehensive approaches such as gene co-expression analyses, functional enrichment, and cell type deconvolution, we gained profound insights into cattle lactation, revealing tissue-specific characteristics of the gastrointestinal tract and shedding light on the intricate molecular adaptations involved in nutrient absorption, immune regulation, and cellular processes for milk synthesis during lactation.
Collapse
Affiliation(s)
- Yahui Gao
- Animal Genomics and Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, United States Department of Agriculture, Beltsville, MD 20705, USA; Department of Animal and Avian Sciences, University of Maryland, College Park, MD 20742, USA; State Key Laboratory of Livestock and Poultry Breeding, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510642, China
| | - George E Liu
- Animal Genomics and Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, United States Department of Agriculture, Beltsville, MD 20705, USA
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD 20742, USA
| | - Lingzhao Fang
- Center for Quantitative Genetics and Genomics (QGG), Aarhus University, Aarhus, Denmark
| | - Cong-Jun Li
- Animal Genomics and Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, United States Department of Agriculture, Beltsville, MD 20705, USA
| | - Ransom L Baldwin
- Animal Genomics and Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, United States Department of Agriculture, Beltsville, MD 20705, USA.
| |
Collapse
|
2
|
Zhang J, Li Y, Zhu F, Guo X, Huang Y. Time-/dose- series transcriptome data analysis and traditional Chinese medicine treatment of pneumoconiosis. Int J Biol Macromol 2024; 267:131515. [PMID: 38614165 DOI: 10.1016/j.ijbiomac.2024.131515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Revised: 04/07/2024] [Accepted: 04/09/2024] [Indexed: 04/15/2024]
Abstract
Pneumoconiosis' pathogenesis is still unclear and specific drugs for its treatment are lacking. Analysis of series transcriptome data often uses a single comparison method, and there are few reports on using such data to predict the treatment of pneumoconiosis with traditional Chinese medicine (TCM). Here, we proposed a new method for analyzing series transcriptomic data, series difference analysis (SDA), and applied it to pneumoconiosis. By comparison with 5 gene sets including existing pneumoconiosis-related genes and gene set functional enrichment analysis, we demonstrated that the new method was not inferior to two existing traditional analysis methods. Furthermore, based on the TCM-drug target interaction network, we predicted the TCM corresponding to the common pneumoconiosis-related genes obtained by multiple methods, and combined them with the high-frequency TCM for its treatment obtained through literature mining to form a new TCM formula for it. After feeding it to pneumoconiosis modeling mice for two months, compared with the untreated group, the coat color, mental state and tissue sections of the mice in the treated group were markedly improved, indicating that the new TCM formula has a certain efficacy. Our study provides new insights into method development for series transcriptomic data analysis and treatment of pneumoconiosis.
Collapse
Affiliation(s)
- Jifeng Zhang
- Key Laboratory of Industrial Dust Prevention and Control & Occupational Health and Safety, Ministry of Education, Anhui University of Science and Technology, Huainan, Anhui 232001, China; School of Biological Engineering & Institute of Digital Ecology and Health, Huainan Normal University, Huainan, China
| | - Yaobin Li
- Key Laboratory of Industrial Dust Prevention and Control & Occupational Health and Safety, Ministry of Education, Anhui University of Science and Technology, Huainan, Anhui 232001, China.
| | - Fenglin Zhu
- Key Laboratory of Industrial Dust Prevention and Control & Occupational Health and Safety, Ministry of Education, Anhui University of Science and Technology, Huainan, Anhui 232001, China
| | - Xiaodi Guo
- School of Biological Engineering & Institute of Digital Ecology and Health, Huainan Normal University, Huainan, China
| | - Yuqing Huang
- School of Biological Engineering & Institute of Digital Ecology and Health, Huainan Normal University, Huainan, China
| |
Collapse
|
3
|
Scharl T, Grün B. A clustering procedure for three-way RNA sequencing data using data transformations and matrix-variate Gaussian mixture models. BMC Bioinformatics 2024; 25:90. [PMID: 38429687 PMCID: PMC10905927 DOI: 10.1186/s12859-024-05717-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 02/21/2024] [Indexed: 03/03/2024] Open
Abstract
RNA sequencing of time-course experiments results in three-way count data where the dimensions are the genes, the time points and the biological units. Clustering RNA-seq data allows to extract groups of co-expressed genes over time. After standardisation, the normalised counts of individual genes across time points and biological units have similar properties as compositional data. We propose the following procedure to suitably cluster three-way RNA-seq data: (1) pre-process the RNA-seq data by calculating the normalised expression profiles, (2) transform the data using the additive log ratio transform to map the composition in the D-part Aitchison simplex to a D - 1 -dimensional Euclidean vector, (3) cluster the transformed RNA-seq data using matrix-variate Gaussian mixture models and (4) assess the quality of the overall cluster solution and of individual clusters based on cluster separation in the transformed space using density-based silhouette information and on compactness of the cluster in the original space using cluster maps as a suitable visualisation. The proposed procedure is illustrated on RNA-seq data from fission yeast and results are also compared to an analogous two-way approach after flattening out the biological units.
Collapse
Affiliation(s)
- Theresa Scharl
- Institute of Statistics, University of Natural Resources and Life Sciences, Vienna, Austria.
| | - Bettina Grün
- Institute for Statistics and Mathematics, Vienna University of Economics and Business, Vienna, Austria
| |
Collapse
|
4
|
Zhang N, Tan Z, Wei J, Zhang S, Liu Y, Miao Y, Ding Q, Yi W, Gan M, Li C, Liu B, Wang H, Zheng Z. Identification of novel anti-ZIKV drugs from viral-infection temporal gene expression profiles. Emerg Microbes Infect 2023; 12:2174777. [PMID: 36715162 PMCID: PMC9946313 DOI: 10.1080/22221751.2023.2174777] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Zika virus (ZIKV) infections are typically asymptomatic but cause severe neurological complications (e.g. Guillain-Barré syndrome in adults, and microcephaly in newborns). There are currently no specific therapy or vaccine options available to prevent ZIKV infections. Temporal gene expression profiles of ZIKV-infected human brain microvascular endothelial cells (HBMECs) were used in this study to identify genes essential for viral replication. These genes were then used to identify novel anti-ZIKV agents and validated in publicly available data and functional wet-lab experiments. Here, we found that ZIKV effectively evaded activation of immune response-related genes and completely reprogrammed cellular transcriptional architectures. Knockdown of genes, which gradually upregulated during viral infection but showed distinct expression patterns between ZIKV- and mock infection, discovered novel proviral and antiviral factors. One-third of the 74 drugs found through signature-based drug repositioning and cross-reference with the Drug Gene Interaction Database (DGIdb) were known anti-ZIKV agents. In cellular assays, two promising antiviral candidates (Luminespib/NVP-AUY922, L-161982) were found to reduce viral replication without causing cell toxicity. Overall, our time-series transcriptome-based methods offer a novel and feasible strategy for antiviral drug discovery. Our strategies, which combine conventional and data-driven analysis, can be extended for other pathogens causing pandemics in the future.
Collapse
Affiliation(s)
- Nailou Zhang
- CAS Key Laboratory of Special Pathogens and Biosafety, Center for Emerging Infectious Diseases, Wuhan Institute of Virology, Center for Biosafety Mega-Science, Chinese Academy of Sciences, Wuhan, People’s Republic of China
| | - Zhongyuan Tan
- The Joint Laboratory for Translational Precision Medicine, a. Guangzhou Women and Children's Medical Center, Guangzhou Medical University, Guangzhou, People's Republic of China and b. Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, Hubei, People's Republic of China
| | - Jinbo Wei
- CAS Key Laboratory of Special Pathogens and Biosafety, Center for Emerging Infectious Diseases, Wuhan Institute of Virology, Center for Biosafety Mega-Science, Chinese Academy of Sciences, Wuhan, People’s Republic of China
| | - Sai Zhang
- CAS Key Laboratory of Special Pathogens and Biosafety, Center for Emerging Infectious Diseases, Wuhan Institute of Virology, Center for Biosafety Mega-Science, Chinese Academy of Sciences, Wuhan, People’s Republic of China
| | - Yan Liu
- CAS Key Laboratory of Special Pathogens and Biosafety, Center for Emerging Infectious Diseases, Wuhan Institute of Virology, Center for Biosafety Mega-Science, Chinese Academy of Sciences, Wuhan, People’s Republic of China
| | - Yuanjiu Miao
- CAS Key Laboratory of Special Pathogens and Biosafety, Center for Emerging Infectious Diseases, Wuhan Institute of Virology, Center for Biosafety Mega-Science, Chinese Academy of Sciences, Wuhan, People’s Republic of China
| | - Qingwen Ding
- CAS Key Laboratory of Special Pathogens and Biosafety, Center for Emerging Infectious Diseases, Wuhan Institute of Virology, Center for Biosafety Mega-Science, Chinese Academy of Sciences, Wuhan, People’s Republic of China
| | - Wenfu Yi
- CAS Key Laboratory of Special Pathogens and Biosafety, Center for Emerging Infectious Diseases, Wuhan Institute of Virology, Center for Biosafety Mega-Science, Chinese Academy of Sciences, Wuhan, People’s Republic of China
| | - Min Gan
- CAS Key Laboratory of Special Pathogens and Biosafety, Center for Emerging Infectious Diseases, Wuhan Institute of Virology, Center for Biosafety Mega-Science, Chinese Academy of Sciences, Wuhan, People’s Republic of China
| | - Chunjie Li
- CAS Key Laboratory of Special Pathogens and Biosafety, Center for Emerging Infectious Diseases, Wuhan Institute of Virology, Center for Biosafety Mega-Science, Chinese Academy of Sciences, Wuhan, People’s Republic of China
| | - Bin Liu
- Characteristic Medical Center of Chinese People’s Armed Police Forces, Tianjin, People’s Republic of China
| | - Hanzhong Wang
- CAS Key Laboratory of Special Pathogens and Biosafety, Center for Emerging Infectious Diseases, Wuhan Institute of Virology, Center for Biosafety Mega-Science, Chinese Academy of Sciences, Wuhan, People’s Republic of China
| | - Zhenhua Zheng
- CAS Key Laboratory of Special Pathogens and Biosafety, Center for Emerging Infectious Diseases, Wuhan Institute of Virology, Center for Biosafety Mega-Science, Chinese Academy of Sciences, Wuhan, People’s Republic of China, Zhenhua Zheng CAS Key Laboratory of Special Pathogens and Biosafety, Center for Emerging Infectious Diseases, Wuhan Institute of Virology, Center for Biosafety Mega-Science, Chinese Academy of Sciences, Wuhan430071, People’s Republic of China
| |
Collapse
|
5
|
Vu QT, Song K, Park S, Xu L, Nam HG, Hong S. An auxin-mediated ultradian rhythm positively influences root regeneration via EAR1/EUR1 in Arabidopsis. FRONTIERS IN PLANT SCIENCE 2023; 14:1136445. [PMID: 37351216 PMCID: PMC10282773 DOI: 10.3389/fpls.2023.1136445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Accepted: 04/04/2023] [Indexed: 06/24/2023]
Abstract
Ultradian rhythms have been proved to be critical for diverse biological processes. However, comprehensive understanding of the short-period rhythms remains limited. Here, we discover that leaf excision triggers a gene expression rhythm with ~3-h periodicity, named as the excision ultradian rhythm (UR), which is regulated by the plant hormone auxin. Promoter-luciferase analyses showed that the spatiotemporal patterns of the excision UR were positively associated with de novo root regeneration (DNRR), a post-embryonic developmental process. Transcriptomic analysis indicated more than 4,000 genes including DNRR-associated genes were reprogramed toward ultradian oscillation. Genetic studies showed that EXCISION ULTRADIAN RHYTHM 1 (EUR1) encoding ENHANCER OF ABSCISIC ACID CO-RECEPTOR1 (EAR1), an abscisic acid signaling regulator, was required to generate the excision ultradian rhythm and enhance root regeneration. The eur1 mutant exhibited the absence of auxin-induced excision UR generation and partial failure during rescuing root regeneration. Our results demonstrate a link between the excision UR and adventitious root formation via EAR1/EUR1, implying an additional regulatory layer in plant regeneration.
Collapse
Affiliation(s)
- Quy Thi Vu
- Center for Plant Aging Research, Institute for Basic Science, Daegu, Republic of Korea
- Department of New Biology, Daegu Gyeongbuk Institute of Science and Technology (DGIST), Daegu, Republic of Korea
| | - Kitae Song
- Center for Plant Aging Research, Institute for Basic Science, Daegu, Republic of Korea
| | - Sungjin Park
- Center for Plant Aging Research, Institute for Basic Science, Daegu, Republic of Korea
| | - Lin Xu
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, China
| | - Hong Gil Nam
- Center for Plant Aging Research, Institute for Basic Science, Daegu, Republic of Korea
- Department of New Biology, Daegu Gyeongbuk Institute of Science and Technology (DGIST), Daegu, Republic of Korea
| | - Sunghyun Hong
- Center for Plant Aging Research, Institute for Basic Science, Daegu, Republic of Korea
| |
Collapse
|
6
|
Jiang X, Liu K, Peng H, Fang J, Zhang A, Han Y, Zhang X. Comparative network analysis reveals the dynamics of organic acid diversity during fruit ripening in peach (Prunus persica L. Batsch). BMC PLANT BIOLOGY 2023; 23:16. [PMID: 36617558 PMCID: PMC9827700 DOI: 10.1186/s12870-023-04037-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Accepted: 01/02/2023] [Indexed: 06/17/2023]
Abstract
BACKGROUND Organic acids are important components that determine the fruit flavor of peach (Prunus persica L. Batsch). However, the dynamics of organic acid diversity during fruit ripening and the key genes that modulate the organic acids metabolism remain largely unknown in this kind of fruit tree which yield ranks sixth in the world. RESULTS In this study, we used 3D transcriptome data containing three dimensions of information, namely time, phenotype and gene expression, from 5 different varieties of peach to construct gene co-expression networks throughout fruit ripening of peach. With the network inferred, the time-ordered network comparative analysis was performed to select high-acid specific gene co-expression network and then clarify the regulatory factors controlling organic acid accumulation. As a result, network modules related to organic acid synthesis and metabolism under high-acid and low-acid comparison conditions were identified for our following research. In addition, we obtained 20 candidate genes as regulatory factors related to organic acid metabolism in peach. CONCLUSIONS The study provides new insights into the dynamics of organic acid accumulation during fruit ripening, complements the results of classical co-expression network analysis and establishes a foundation for key genes discovery from time-series multiple species transcriptome data.
Collapse
Affiliation(s)
- Xiaohan Jiang
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, Hubei, China
- Center of Economic Botany, Core Botanical Gardens, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, Hubei, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Kangchen Liu
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, Hubei, China
- Center of Economic Botany, Core Botanical Gardens, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, Hubei, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Huixiang Peng
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, Hubei, China
- Center of Economic Botany, Core Botanical Gardens, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, Hubei, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Jing Fang
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, Hubei, China
- Center of Economic Botany, Core Botanical Gardens, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, Hubei, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Aidi Zhang
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, Hubei, China
- Center of Economic Botany, Core Botanical Gardens, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, Hubei, China
| | - Yuepeng Han
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, Hubei, China.
- Center of Economic Botany, Core Botanical Gardens, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, Hubei, China.
| | - Xiujun Zhang
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, Hubei, China.
- Center of Economic Botany, Core Botanical Gardens, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, Hubei, China.
| |
Collapse
|
7
|
Gao Y, Selee B, Schnabel EL, Poehlman WL, Chavan SA, Frugoli JA, Feltus FA. Time Series Transcriptome Analysis in Medicago truncatula Shoot and Root Tissue During Early Nodulation. FRONTIERS IN PLANT SCIENCE 2022; 13:861639. [PMID: 35463395 PMCID: PMC9021838 DOI: 10.3389/fpls.2022.861639] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 02/28/2022] [Indexed: 06/14/2023]
Abstract
In response to colonization by rhizobia bacteria, legumes are able to form nitrogen-fixing nodules in their roots, allowing the plants to grow efficiently in nitrogen-depleted environments. Legumes utilize a complex, long-distance signaling pathway to regulate nodulation that involves signals in both roots and shoots. We measured the transcriptional response to treatment with rhizobia in both the shoots and roots of Medicago truncatula over a 72-h time course. To detect temporal shifts in gene expression, we developed GeneShift, a novel computational statistics and machine learning workflow that addresses the time series replicate the averaging issue for detecting gene expression pattern shifts under different conditions. We identified both known and novel genes that are regulated dynamically in both tissues during early nodulation including leginsulin, defensins, root transporters, nodulin-related, and circadian clock genes. We validated over 70% of the expression patterns that GeneShift discovered using an independent M. truncatula RNA-Seq study. GeneShift facilitated the discovery of condition-specific temporally differentially expressed genes in the symbiotic nodulation biological system. In principle, GeneShift should work for time-series gene expression profiling studies from other systems.
Collapse
Affiliation(s)
- Yueyao Gao
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States
| | - Bradley Selee
- Department of Electrical and Computer Engineering, Clemson University, Clemson, SC, United States
| | - Elise L. Schnabel
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States
| | - William L. Poehlman
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States
- Sage Bionetworks, Seattle, WA, United States
| | - Suchitra A. Chavan
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States
| | - Julia A. Frugoli
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States
| | - Frank Alex Feltus
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States
- Biomedical Data Science and Informatics Program, Clemson University, Clemson, SC, United States
- Clemson Center for Human Genetics, Greenwood, SC, United States
| |
Collapse
|
8
|
Jang J, Hwang I, Jung I. TimesVector-Web: A Web Service for Analysing Time Course Transcriptome Data with Multiple Conditions. Genes (Basel) 2021; 13:73. [PMID: 35052413 PMCID: PMC8775016 DOI: 10.3390/genes13010073] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2021] [Revised: 12/19/2021] [Accepted: 12/22/2021] [Indexed: 11/27/2022] Open
Abstract
From time course gene expression data, we may identify genes that modulate in a certain pattern across time. Such patterns are advantageous to investigate the transcriptomic response to a certain condition. Especially, it is of interest to compare two or more conditions to detect gene expression patterns that significantly differ between them. Time course analysis can become difficult using traditional differentially expressed gene (DEG) analysis methods since they are based on pair-wise sample comparison instead of a series of time points. Most importantly, the related tools are mostly available as local Software, requiring technical expertise. Here, we present TimesVector-web, which is an easy to use web service for analysing time course gene expression data with multiple conditions. The web-service was developed to (1) alleviate the burden for analyzing multi-class time course data and (2) provide downstream analysis on the results for biological interpretation including TF, miRNA target, gene ontology and pathway analysis. TimesVector-web was validated using three case studies that use both microarray and RNA-seq time course data and showed that the results captured important biological findings from the original studies.
Collapse
Affiliation(s)
| | | | - Inuk Jung
- Department of Computer Science and Engineering, Kyungpook National University, Buk-gu, Deagu 41566, Korea; (J.J.); (I.H.)
| |
Collapse
|
9
|
Charting Shifts in Saccharomyces cerevisiae Gene Expression across Asynchronous Time Trajectories with Diffusion Maps. mBio 2021; 12:e0234521. [PMID: 34607457 PMCID: PMC8546541 DOI: 10.1128/mbio.02345-21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
During fermentation, Saccharomyces cerevisiae metabolizes sugars and other nutrients to obtain energy for growth and survival, while also modulating these activities in response to cell-environment interactions. Here, differences in S. cerevisiae gene expression were explored over a time course of fermentation and used to differentiate fermentations, using Pinot noir grapes from 15 unique sites. Data analysis was complicated by the fact that the fermentations proceeded at different rates, making a direct comparison of time series gene expression data difficult with conventional differential expression tools. This led to the development of a novel approach combining diffusion mapping with continuous differential expression analysis (termed DMap-DE). Using this method, site-specific deviations in gene expression were identified, including changes in gene expression correlated with the non-Saccharomyces yeast Hanseniaspora uvarum, as well as initial nitrogen concentrations in grape musts. These results highlight novel relationships between site-specific variables and Saccharomyces cerevisiae gene expression that are linked to repeated fermentation outcomes. It was also demonstrated that DMap-DE can extract biologically relevant gene expression patterns from other contexts (e.g., hypoxic response of Saccharomyces cerevisiae) and offers advantages over other data dimensionality reduction approaches, indicating that DMap-DE offers a robust method for investigating asynchronous time series gene expression data.
Collapse
|
10
|
Oh VKS, Li RW. Temporal Dynamic Methods for Bulk RNA-Seq Time Series Data. Genes (Basel) 2021; 12:352. [PMID: 33673721 PMCID: PMC7997275 DOI: 10.3390/genes12030352] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Revised: 02/19/2021] [Accepted: 02/22/2021] [Indexed: 02/06/2023] Open
Abstract
Dynamic studies in time course experimental designs and clinical approaches have been widely used by the biomedical community. These applications are particularly relevant in stimuli-response models under environmental conditions, characterization of gradient biological processes in developmental biology, identification of therapeutic effects in clinical trials, disease progressive models, cell-cycle, and circadian periodicity. Despite their feasibility and popularity, sophisticated dynamic methods that are well validated in large-scale comparative studies, in terms of statistical and computational rigor, are less benchmarked, comparing to their static counterparts. To date, a number of novel methods in bulk RNA-Seq data have been developed for the various time-dependent stimuli, circadian rhythms, cell-lineage in differentiation, and disease progression. Here, we comprehensively review a key set of representative dynamic strategies and discuss current issues associated with the detection of dynamically changing genes. We also provide recommendations for future directions for studying non-periodical, periodical time course data, and meta-dynamic datasets.
Collapse
Affiliation(s)
- Vera-Khlara S. Oh
- Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agricultural Research Service, Beltsville, MD 20705, USA;
- Department of Computer Science and Statistics, College of Natural Sciences, Jeju National University, Jeju City 63243, Korea
| | - Robert W. Li
- Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agricultural Research Service, Beltsville, MD 20705, USA;
| |
Collapse
|
11
|
Development of genetic quality tests for good manufacturing practice-compliant induced pluripotent stem cells and their derivatives. Sci Rep 2020; 10:3939. [PMID: 32127560 PMCID: PMC7054319 DOI: 10.1038/s41598-020-60466-9] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Accepted: 02/05/2020] [Indexed: 02/06/2023] Open
Abstract
Although human induced pluripotent stem cell (hiPSC) lines are karyotypically normal, they retain the potential for mutation in the genome. Accordingly, intensive and relevant quality controls for clinical-grade hiPSCs remain imperative. As a conceptual approach, we performed RNA-seq-based broad-range genetic quality tests on GMP-compliant human leucocyte antigen (HLA)-homozygous hiPSCs and their derivatives under postdistribution conditions to investigate whether sequencing data could provide a basis for future quality control. We found differences in the degree of single-nucleotide polymorphism (SNP) occurring in cells cultured at three collaborating institutes. However, the cells cultured at each centre showed similar trends, in which more SNPs occurred in late-passage hiPSCs than in early-passage hiPSCs after differentiation. In eSNP karyotyping analysis, none of the predicted copy number variations (CNVs) were identified, which confirmed the results of SNP chip-based CNV analysis. HLA genotyping analysis revealed that each cell line was homozygous for HLA-A, HLA-B, and DRB1 and heterozygous for HLA-DPB type. Gene expression profiling showed a similar differentiation ability of early- and late-passage hiPSCs into cardiomyocyte-like, hepatic-like, and neuronal cell types. However, time-course analysis identified five clusters showing different patterns of gene expression, which were mainly related to the immune response. In conclusion, RNA-seq analysis appears to offer an informative genetic quality testing approach for such cell types and allows the early screening of candidate hiPSC seed stocks for clinical use by facilitating safety and potential risk evaluation.
Collapse
|
12
|
Ahn H, Jung I, Chae H, Kang D, Jung W, Kim S. HTRgene: a computational method to perform the integrated analysis of multiple heterogeneous time-series data: case analysis of cold and heat stress response signaling genes in Arabidopsis. BMC Bioinformatics 2019; 20:588. [PMID: 31787073 PMCID: PMC6886170 DOI: 10.1186/s12859-019-3072-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Background Integrated analysis that uses multiple sample gene expression data measured under the same stress can detect stress response genes more accurately than analysis of individual sample data. However, the integrated analysis is challenging since experimental conditions (strength of stress and the number of time points) are heterogeneous across multiple samples. Results HTRgene is a computational method to perform the integrated analysis of multiple heterogeneous time-series data measured under the same stress condition. The goal of HTRgene is to identify “response order preserving DEGs” that are defined as genes not only which are differentially expressed but also whose response order is preserved across multiple samples. The utility of HTRgene was demonstrated using 28 and 24 time-series sample gene expression data measured under cold and heat stress in Arabidopsis. HTRgene analysis successfully reproduced known biological mechanisms of cold and heat stress in Arabidopsis. Also, HTRgene showed higher accuracy in detecting the documented stress response genes than existing tools. Conclusions HTRgene, a method to find the ordering of response time of genes that are commonly observed among multiple time-series samples, successfully integrated multiple heterogeneous time-series gene expression datasets. It can be applied to many research problems related to the integration of time series data analysis.
Collapse
Affiliation(s)
- Hongryul Ahn
- Department of Computer Science and Engineering, Seoul National University, Seoul, Korea
| | - Inuk Jung
- Department of Computer Science and Engineering, Kyungpook National University, Daegu, Korea
| | - Heejoon Chae
- Division of Computer Science, Sookmyung Women's University, Seoul, Korea
| | - Dongwon Kang
- Department of Computer Science and Engineering, Seoul National University, Seoul, Korea
| | - Woosuk Jung
- Department of Crop Science, Konkuk University, Seoul, Korea.
| | - Sun Kim
- Department of Computer Science and Engineering, Seoul National University, Seoul, Korea. .,Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Korea. .,Bioinformatics Institute, Seoul National University, Seoul, Korea.
| |
Collapse
|
13
|
Ahn H, Jo K, Jeong D, Pak M, Hur J, Jung W, Kim S. PropaNet: Time-Varying Condition-Specific Transcriptional Network Construction by Network Propagation. FRONTIERS IN PLANT SCIENCE 2019; 10:698. [PMID: 31258543 PMCID: PMC6587906 DOI: 10.3389/fpls.2019.00698] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/04/2019] [Accepted: 05/09/2019] [Indexed: 06/09/2023]
Abstract
Transcription factor (TF) has a significant influence on the state of a cell by regulating multiple down-stream genes. Thus, experimental and computational biologists have made great efforts to construct TF gene networks for regulatory interactions between TFs and their target genes. Now, an important research question is how to utilize TF networks to investigate the response of a plant to stress at the transcription control level using time-series transcriptome data. In this article, we present a new computational network, PropaNet, to investigate dynamics of TF networks from time-series transcriptome data using two state-of-the-art network analysis techniques, influence maximization and network propagation. PropaNet uses the influence maximization technique to produce a ranked list of TFs, in the order of TF that explains differentially expressed genes (DEGs) better at each time point. Then, a network propagation technique is used to select a group of TFs that explains DEGs best as a whole. For the analysis of Arabidopsis time series datasets from AtGenExpress, we used PlantRegMap as a template TF network and performed PropaNet analysis to investigate transcriptional dynamics of Arabidopsis under cold and heat stress. The time varying TF networks showed that Arabidopsis responded to cold and heat stress quite differently. For cold stress, bHLH and bZIP type TFs were the first responding TFs and the cold signal influenced histone variants, various genes involved in cell architecture, osmosis and restructuring of cells. However, the consequences of plants under heat stress were up-regulation of genes related to accelerating differentiation and starting re-differentiation. In terms of energy metabolism, plants under heat stress show elevated metabolic process and resulting in an exhausted status. We believe that PropaNet will be useful for the construction of condition-specific time-varying TF network for time-series data analysis in response to stress. PropaNet is available at http://biohealth.snu.ac.kr/software/PropaNet.
Collapse
Affiliation(s)
- Hongryul Ahn
- Bioinformatics Institute, Seoul National University, Seoul, South Korea
| | - Kyuri Jo
- Bioinformatics Institute, Seoul National University, Seoul, South Korea
| | - Dabin Jeong
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, South Korea
| | - Minwoo Pak
- Department of Computer Science and Engineering, Seoul National University, Seoul, South Korea
| | - Jihye Hur
- Department of Crop Science, Konkuk University, Seoul, South Korea
| | - Woosuk Jung
- Department of Crop Science, Konkuk University, Seoul, South Korea
| | - Sun Kim
- Bioinformatics Institute, Seoul National University, Seoul, South Korea
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, South Korea
- Department of Computer Science and Engineering, Seoul National University, Seoul, South Korea
| |
Collapse
|
14
|
Vitali F, Li Q, Schissler AG, Berghout J, Kenost C, Lussier YA. Developing a 'personalome' for precision medicine: emerging methods that compute interpretable effect sizes from single-subject transcriptomes. Brief Bioinform 2019; 20:789-805. [PMID: 29272327 PMCID: PMC6585155 DOI: 10.1093/bib/bbx149] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2017] [Revised: 10/06/2017] [Indexed: 12/13/2022] Open
Abstract
The development of computational methods capable of analyzing -omics data at the individual level is critical for the success of precision medicine. Although unprecedented opportunities now exist to gather data on an individual's -omics profile ('personalome'), interpreting and extracting meaningful information from single-subject -omics remain underdeveloped, particularly for quantitative non-sequence measurements, including complete transcriptome or proteome expression and metabolite abundance. Conventional bioinformatics approaches have largely been designed for making population-level inferences about 'average' disease processes; thus, they may not adequately capture and describe individual variability. Novel approaches intended to exploit a variety of -omics data are required for identifying individualized signals for meaningful interpretation. In this review-intended for biomedical researchers, computational biologists and bioinformaticians-we survey emerging computational and translational informatics methods capable of constructing a single subject's 'personalome' for predicting clinical outcomes or therapeutic responses, with an emphasis on methods that provide interpretable readouts. Key points: (i) the single-subject analytics of the transcriptome shows the greatest development to date and, (ii) the methods were all validated in simulations, cross-validations or independent retrospective data sets. This survey uncovers a growing field that offers numerous opportunities for the development of novel validation methods and opens the door for future studies focusing on the interpretation of comprehensive 'personalomes' through the integration of multiple -omics, providing valuable insights into individual patient outcomes and treatments.
Collapse
Affiliation(s)
| | - Qike Li
- BIO5 Institute, University of Arizona, Tucson, AZ, USA
| | | | | | | | | |
Collapse
|
15
|
Williams JR, Yang R, Clifford JL, Watson D, Campbell R, Getnet D, Kumar R, Hammamieh R, Jett M. Functional Heatmap: an automated and interactive pattern recognition tool to integrate time with multi-omics assays. BMC Bioinformatics 2019; 20:81. [PMID: 30770734 PMCID: PMC6377781 DOI: 10.1186/s12859-019-2657-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2018] [Accepted: 01/28/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Life science research is moving quickly towards large-scale experimental designs that are comprised of multiple tissues, time points, and samples. Omic time-series experiments offer answers to three big questions: what collective patterns do most analytes follow, which analytes follow an identical pattern or synchronize across multiple cohorts, and how do biological functions evolve over time. Existing tools fall short of robustly answering and visualizing all three questions in a unified interface. RESULTS Functional Heatmap offers time-series data visualization through a Master Panel page, and Combined page to answer each of the three time-series questions. It dissects the complex multi-omics time-series readouts into patterned clusters with associated biological functions. It allows users to identify a cascade of functional changes over a time variable. Inversely, Functional Heatmap can compare a pattern with specific biology respond to multiple experimental conditions. All analyses are interactive, searchable, and exportable in a form of heatmap, line-chart, or text, and the results are easy to share, maintain, and reproduce on the web platform. CONCLUSIONS Functional Heatmap is an automated and interactive tool that enables pattern recognition in time-series multi-omics assays. It significantly reduces the manual labour of pattern discovery and comparison by transferring statistical models into visual clues. The new pattern recognition feature will help researchers identify hidden trends driven by functional changes using multi-tissues/conditions on a time-series fashion from omic assays.
Collapse
Affiliation(s)
- Joshua R. Williams
- Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, Frederick, MD 21702-5010 USA
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| | - Ruoting Yang
- Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, Frederick, MD 21702-5010 USA
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| | - John L. Clifford
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| | - Daniel Watson
- Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, Frederick, MD 21702-5010 USA
| | - Ross Campbell
- Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, Frederick, MD 21702-5010 USA
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| | - Derese Getnet
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| | - Raina Kumar
- Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, Frederick, MD 21702-5010 USA
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| | - Rasha Hammamieh
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| | - Marti Jett
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| |
Collapse
|
16
|
Comparative transcriptomics method to infer gene coexpression networks and its applications to maize and rice leaf transcriptomes. Proc Natl Acad Sci U S A 2019; 116:3091-3099. [PMID: 30718437 DOI: 10.1073/pnas.1817621116] [Citation(s) in RCA: 72] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Time-series transcriptomes of a biological process obtained under different conditions are useful for identifying the regulators of the process and their regulatory networks. However, such data are 3D (gene expression, time, and condition), and there is currently no method that can deal with their full complexity. Here, we developed a method that avoids time-point alignment and normalization between conditions. We applied it to analyze time-series transcriptomes of developing maize leaves under light-dark cycles and under total darkness and obtained eight time-ordered gene coexpression networks (TO-GCNs), which can be used to predict upstream regulators of any genes in the GCNs. One of the eight TO-GCNs is light-independent and likely includes all genes involved in the development of Kranz anatomy, which is a structure crucial for the high efficiency of photosynthesis in C4 plants. Using this TO-GCN, we predicted and experimentally validated a regulatory cascade upstream of SHORTROOT1, a key Kranz anatomy regulator. Moreover, we applied the method to compare transcriptomes from maize and rice leaf segments and identified regulators of maize C4 enzyme genes and RUBISCO SMALL SUBUNIT2 Our study provides not only a powerful method but also novel insights into the regulatory networks underlying Kranz anatomy development and C4 photosynthesis.
Collapse
|