1
|
Ichikawa C, Kadota K. MBCdeg4: A modified clustering-based method for identifying differentially expressed genes from RNA-seq data. MethodsX 2025; 14:103149. [PMID: 39866202 PMCID: PMC11762142 DOI: 10.1016/j.mex.2024.103149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2024] [Accepted: 12/29/2024] [Indexed: 01/28/2025] Open
Abstract
RNA-seq is a commonly employed methodology for the measurement of transcriptomes, particularly for the identification of differentially expressed genes (DEGs) between different conditions or groups. In a previous report, we described a clustering-based method for identifying DEGs, designated MBCdeg1 and MBCdeg2. and a modified version, MBCdeg3. This study presents a further improved version, designated MBCdeg4. The four versions of MBCdeg employ an R package, designated MBCluster.Seq, internally. The sole distinction between them is the manner of data normalization. MBCdeg4 employs normalization factors derived from a robust normalization algorithm, designated as DEGES. Seven competing methods were compared: the four versions of MBCdeg and three conventional R packages (edgeR, DESeq2, and TCC). MBCdeg4 demonstrated superior performance in a multitude of simulation scenarios involving RNA-seq count data. Therefore, MBCdeg4 is recommended for use in preference to the earlier versions, MBCdeg1-3.•MBCdeg4 is a method for both identification and classification of DEGs from RNA-seq count data.•MBCdeg4 is available as an R function and performs well in a wide variety of simulation scenarios.
Collapse
Affiliation(s)
- Chiharu Ichikawa
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo 113-8657, Japan
| | - Koji Kadota
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo 113-8657, Japan
- Interfaculty Initiative in Information Studies, The University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo 113-0033, Japan
- Collaborative Research Institute for Innovative Microbiology, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo 113-8657, Japan
| |
Collapse
|
2
|
Kadooka C, Izumitsu K, Asai T, Hiramatsu K, Mori K, Okutsu K, Yoshizaki Y, Takamine K, Goto M, Tamaki H, Futagami T. Overexpression of the RNA-binding protein NrdA affects global gene expression and secondary metabolism in Aspergillus species. mSphere 2025; 10:e0084924. [PMID: 39853104 PMCID: PMC11852746 DOI: 10.1128/msphere.00849-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2024] [Accepted: 12/16/2024] [Indexed: 01/26/2025] Open
Abstract
RNA-binding protein Nrd1 plays a role in RNA polymerase II transcription termination. In this study, we showed that the orthologous NrdA is important in global mRNA expression and secondary metabolism in Aspergillus species. We constructed an nrdA conditional expression strain using the Tet-On system in Aspergillus luchuenesis mut. kawachii. Downregulation of nrdA caused a severe growth defect, indicating that NrdA is essential for the proliferation of A. kawachii. Parallel RNA-sequencing and RNA immunoprecipitation-sequencing analysis identified potential NrdA-interacting transcripts, corresponding to 32% of the predicted protein-coding genes of A. kawachii. Subsequent gene ontology analysis suggested that overexpression of NrdA affects the production of secondary metabolites. To clarify this, we constructed Aspergillus nidulans, Aspergillus fumigatus, and Aspergillus oryzae strains overexpressing NrdA in the early developmental stage. Overexpression of NrdA reduced the production of sterigmatocystin and penicillin in A. nidulans, as well as that of helvolic acid and pyripyropene A in A. fumigatus. Moreover, it increased the production of kojic acid and reduced the production of penicillin in A. oryzae. These effects were accompanied by almost consistent changes in the mRNA levels of relevant genes. Collectively, these results suggest that NrdA is the essential RNA-binding protein, which plays a significant role in global gene expression and secondary metabolism in Aspergillus species.IMPORTANCENrd1, a component of the Nrd1-Nab3-Sen1 complex, is an essential RNA-binding protein involved in transcriptional termination in yeast. However, its role in filamentous fungi has not been studied. In this study, we characterized an orthologous NrdA in the Aspergillus species, identified potential NrdA-interacting mRNA, and investigated the effect of overexpression of NrdA on mRNA expression in Aspergillus luchuensis mut. kawachii. The results indicated that NrdA controls global gene expression involved in versatile metabolic pathways, including the secondary metabolic process, at least in the early developmental stage. We demonstrated that NrdA overexpression significantly affected the production of secondary metabolites in Aspergillus nidulans, Aspergillus oryzae, and Aspergillus fumigatus. Our findings are of importance to the fungal research community because the secondary metabolism is an industrially and clinically important aspect for the Aspergillus species.
Collapse
Affiliation(s)
- Chihiro Kadooka
- United Graduate School of Agricultural Sciences, Kagoshima University, Korimoto, Kagoshima, Japan
- Education and Research Centre for Fermentation Studies, Faculty of Agriculture, Kagoshima University, Korimoto, Kagoshima, Japan
- Department of Biotechnology and Life Sciences, Faculty of Biotechnology and Life Sciences, Sojo University, Nishi-ku, Kumamoto, Japan
| | - Kosuke Izumitsu
- Graduate School of Environmental Science, University of Shiga Prefecture, Hikone, Shiga, Japan
| | - Teigo Asai
- Graduate School of Pharmaceutical Sciences, Tohoku University, Aramaki, Aoba-ku, Sendai, Miyagi, Japan
| | - Kentaro Hiramatsu
- United Graduate School of Agricultural Sciences, Kagoshima University, Korimoto, Kagoshima, Japan
| | - Kazuki Mori
- Department of Bioscience and Biotechnology, Faculty of Agriculture, Kyushu University, Nishi-ku, Fukuoka, Japan
- Cell Innovator Co., Ltd., Higashi-ku, Fukuoka, Japan
| | - Kayu Okutsu
- Education and Research Centre for Fermentation Studies, Faculty of Agriculture, Kagoshima University, Korimoto, Kagoshima, Japan
| | - Yumiko Yoshizaki
- United Graduate School of Agricultural Sciences, Kagoshima University, Korimoto, Kagoshima, Japan
- Education and Research Centre for Fermentation Studies, Faculty of Agriculture, Kagoshima University, Korimoto, Kagoshima, Japan
| | - Kazunori Takamine
- United Graduate School of Agricultural Sciences, Kagoshima University, Korimoto, Kagoshima, Japan
- Education and Research Centre for Fermentation Studies, Faculty of Agriculture, Kagoshima University, Korimoto, Kagoshima, Japan
| | - Masatoshi Goto
- United Graduate School of Agricultural Sciences, Kagoshima University, Korimoto, Kagoshima, Japan
- Department of Applied Biochemistry and Food Science, Faculty of Agriculture, Saga University, Saga, Japan
| | - Hisanori Tamaki
- United Graduate School of Agricultural Sciences, Kagoshima University, Korimoto, Kagoshima, Japan
- Education and Research Centre for Fermentation Studies, Faculty of Agriculture, Kagoshima University, Korimoto, Kagoshima, Japan
| | - Taiki Futagami
- United Graduate School of Agricultural Sciences, Kagoshima University, Korimoto, Kagoshima, Japan
- Education and Research Centre for Fermentation Studies, Faculty of Agriculture, Kagoshima University, Korimoto, Kagoshima, Japan
| |
Collapse
|
3
|
Fathi Kazerooni A, Kraya A, Rathi KS, Kim MC, Vossough A, Khalili N, Familiar AM, Gandhi D, Khalili N, Kesherwani V, Haldar D, Anderson H, Jin R, Mahtabfar A, Bagheri S, Guo Y, Li Q, Huang X, Zhu Y, Sickler A, Lueder MR, Phul S, Koptyra M, Storm PB, Ware JB, Song Y, Davatzikos C, Foster JB, Mueller S, Fisher MJ, Resnick AC, Nabavizadeh A. Multiparametric MRI along with machine learning predicts prognosis and treatment response in pediatric low-grade glioma. Nat Commun 2025; 16:340. [PMID: 39747214 PMCID: PMC11697432 DOI: 10.1038/s41467-024-55659-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2024] [Accepted: 12/19/2024] [Indexed: 01/04/2025] Open
Abstract
Pediatric low-grade gliomas (pLGGs) exhibit heterogeneous prognoses and variable responses to treatment, leading to tumor progression and adverse outcomes in cases where complete resection is unachievable. Early prediction of treatment responsiveness and suitability for immunotherapy has the potential to improve clinical management and outcomes. Here, we present a radiogenomic analysis of pLGGs, integrating MRI and RNA sequencing data. We identify three immunologically distinct clusters, with one group characterized by increased immune activity and poorer prognosis, indicating potential benefit from immunotherapies. We develop a radiomic signature that predicts these immune profiles with over 80% accuracy. Furthermore, our clinicoradiomic model predicts progression-free survival and correlates with treatment response. We also identify genetic variants and transcriptomic pathways associated with progression risk, highlighting links to tumor growth and immune response. This radiogenomic study in pLGGs provides a framework for the identification of high-risk patients who may benefit from targeted therapies.
Collapse
Affiliation(s)
- Anahita Fathi Kazerooni
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA.
- AI2D Center for AI and Data Science for Integrated Diagnostics, University of Pennsylvania, Philadelphia, PA, USA.
- Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
- Department of Neurosurgery, The Children's Hospital of Philadelphia, Philadelphia, PA, USA.
| | - Adam Kraya
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Komal S Rathi
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Meen Chul Kim
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Arastoo Vossough
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Division of Radiology, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Radiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Nastaran Khalili
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Ariana M Familiar
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Deep Gandhi
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Neda Khalili
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Varun Kesherwani
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Debanjan Haldar
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Neurosurgery, Thomas Jefferson University, Philadelphia, PA, USA
| | - Hannah Anderson
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Radiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Run Jin
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Aria Mahtabfar
- Department of Neurosurgery, Thomas Jefferson University, Philadelphia, PA, USA
| | - Sina Bagheri
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Radiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Yiran Guo
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Qi Li
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Xiaoyan Huang
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Yuankun Zhu
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Alex Sickler
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Neurosurgery, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Matthew R Lueder
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Neurosurgery, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Pathology and Laboratory Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Saksham Phul
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Mateusz Koptyra
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Phillip B Storm
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Neurosurgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Department of Neurosurgery, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Jeffrey B Ware
- Department of Radiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Yuanquan Song
- Department of Pathology and Laboratory Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, University of Pennsylvania, Philadelphia, PA, USA
| | - Christos Davatzikos
- AI2D Center for AI and Data Science for Integrated Diagnostics, University of Pennsylvania, Philadelphia, PA, USA
- Department of Radiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Jessica B Foster
- Division of Oncology, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Sabine Mueller
- Department of Neurology and Pediatrics, University of California San Francisco, San Francisco, CA, USA
| | - Michael J Fisher
- Division of Oncology, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Adam C Resnick
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Neurosurgery, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Ali Nabavizadeh
- Center for Data-Driven Discovery in Biomedicine (D3b), The Children's Hospital of Philadelphia, Philadelphia, PA, USA.
- Department of Radiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
4
|
Prigent L, Quéré J, Plus M, Le Gac M. Sexual reproduction during diatom bloom. ISME COMMUNICATIONS 2025; 5:ycae169. [PMID: 39839889 PMCID: PMC11749564 DOI: 10.1093/ismeco/ycae169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/07/2024] [Revised: 11/22/2024] [Accepted: 01/06/2025] [Indexed: 01/23/2025]
Abstract
Phytoplankton supports food webs in all aquatic ecosystems. Ecological studies highlighted the links between environmental variables and species successions in situ. However, the role of life cycle characteristics on phytoplankton community dynamics remains poorly characterized. In diatoms, sexual reproduction creates new genetic combinations and prevents excessive cell size miniaturization. It has been extensively studied in vitro but seldom in the natural environment. Here, analyzing metatranscriptomic data in the light of the expression patterns previously characterized in vitro, we identified a synchronized and transient sexual reproduction event during a bloom of the toxic diatom species Pseudo-nitzschia australis. Despite the complexity of environmental conditions encountered in situ, sexual reproduction appeared to be the strongest differential gene expression signal that occurred during the bloom. The potential link between environmental conditions and the initiation of sexual reproduction remain to be determined, but sexual reproduction probably had a major impact on the bloom dynamic.
Collapse
|
5
|
Machitani M, Nomura A, Yamashita T, Yasukawa M, Ueki S, Fujita KI, Ueno T, Yamashita A, Tanzawa Y, Watanabe M, Taniguchi T, Saitoh N, Kaneko S, Kato Y, Mano H, Masutomi K. Maintenance of R-loop structures by phosphorylated hTERT preserves genome integrity. Nat Cell Biol 2024; 26:932-945. [PMID: 38806647 DOI: 10.1038/s41556-024-01427-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 04/23/2024] [Indexed: 05/30/2024]
Abstract
As aberrant accumulation of RNA-DNA hybrids (R-loops) causes DNA damage and genome instability, cells express regulators of R-loop structures. Here we report that RNA-dependent RNA polymerase (RdRP) activity of human telomerase reverse transcriptase (hTERT) regulates R-loop formation. We found that the phosphorylated form of hTERT (p-hTERT) exhibits RdRP activity in nuclear speckles both in telomerase-positive cells and telomerase-negative cells with alternative lengthening of telomeres (ALT) activity. The p-hTERT did not associate with telomerase RNA component in nuclear speckles but, instead, with TERRA RNAs to resolve R-loops. Targeting of the TERT gene in ALT cells ablated RdRP activity and impaired tumour growth. Using a genome-scale CRISPR loss-of-function screen, we identified Fanconi anaemia/BRCA genes as synthetic lethal partners of hTERT RdRP. Inactivation of RdRP and Fanconi anaemia/BRCA genes caused accumulation of R-loop structures and DNA damage. These findings indicate that RdRP activity of p-hTERT guards against genome instability by removing R-loop structures.
Collapse
Affiliation(s)
- Mitsuhiro Machitani
- Division of Cancer Stem Cell, National Cancer Center Research Institute, Tokyo, Japan
| | - Akira Nomura
- Division of Cancer Stem Cell, National Cancer Center Research Institute, Tokyo, Japan
- Department of Orthopaedic Surgery, Surgical Science, Tokai University School of Medicine, Isehara, Japan
| | - Taro Yamashita
- Department of Gastroenterology, Kanazawa University Graduate School of Medical Sciences, Kanazawa, Japan
| | - Mami Yasukawa
- Division of Cancer Stem Cell, National Cancer Center Research Institute, Tokyo, Japan
| | - Saori Ueki
- Division of Cancer Stem Cell, National Cancer Center Research Institute, Tokyo, Japan
| | - Ken-Ichi Fujita
- Division of Cancer Stem Cell, National Cancer Center Research Institute, Tokyo, Japan
| | - Toshihide Ueno
- Division of Cellular Signaling, National Cancer Center Research Institute, Tokyo, Japan
| | - Akio Yamashita
- Department of Investigative Medicine, University of the Ryukyus Graduate School of Medicine, Nakagami, Japan
| | - Yoshikazu Tanzawa
- Department of Orthopaedic Surgery, Surgical Science, Tokai University School of Medicine, Isehara, Japan
| | - Masahiko Watanabe
- Department of Orthopaedic Surgery, Surgical Science, Tokai University School of Medicine, Isehara, Japan
| | - Toshiyasu Taniguchi
- Department of Molecular Life Science, Tokai University School of Medicine, Isehara, Japan
| | - Noriko Saitoh
- Division of Cancer Biology, The Cancer Institute of JFCR, Tokyo, Japan
| | - Shuichi Kaneko
- Department of Gastroenterology, Kanazawa University Graduate School of Medical Sciences, Kanazawa, Japan
| | - Yukinari Kato
- Department of Antibody Drug Development, Tohoku University Graduate School of Medicine, Sendai, Japan
- Department of Molecular Pharmacology, Tohoku University Graduate School of Medicine, Sendai, Japan
| | - Hiroyuki Mano
- Division of Cellular Signaling, National Cancer Center Research Institute, Tokyo, Japan
| | - Kenkichi Masutomi
- Division of Cancer Stem Cell, National Cancer Center Research Institute, Tokyo, Japan.
| |
Collapse
|
6
|
Makino M, Shimizu K, Kadota K. Enhanced clustering-based differential expression analysis method for RNA-seq data. MethodsX 2024; 12:102518. [PMID: 38179066 PMCID: PMC10764243 DOI: 10.1016/j.mex.2023.102518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Accepted: 12/10/2023] [Indexed: 01/06/2024] Open
Abstract
RNA-seq is a tool for measuring gene expression and is commonly used to identify differentially expressed genes (DEGs). Gene clustering has been widely used to classify DEGs with similar expression patterns, but rarely used to identify DEGs themselves. We recently reported that the clustering-based method (called MBCdeg1 and 2) for identifying DEGs has great potential. However, these methods left room for improvement. This study reports on the improvement (named MBCdeg3). We compared a total of six competing methods: three conventional R packages (edgeR, DESeq2, and TCC) and three versions of MBCdeg (i.e., MBCdeg1, 2, and 3) corresponding to three different normalization algorithms. As MBCdeg3 performs well in many simulation scenarios of RNA-seq count data, MBCdeg3 replaces MBCdeg1 and 2 in our previous report. •MBCdeg3 is a method for both identification and classification of DEGs from RNA-seq count data.•MBCdeg3 is available as a function of R, which is common in the field of expression analysis.•MBCdeg3 performs well in a variety of simulation scenarios for RNA-seq count data.
Collapse
Affiliation(s)
- Manon Makino
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo 113-8657, Japan
| | - Kentaro Shimizu
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo 113-8657, Japan
| | - Koji Kadota
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo 113-8657, Japan
- Interfaculty Initiative in Information Studies, The University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo 113-0033, Japan
- Collaborative Research Institute for Innovative Microbiology, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo 113-8657, Japan
| |
Collapse
|
7
|
Rutter KJ, Peake M, Hawkshaw NJ, Scholey R, Bulfone-Paus S, Friedmann PS, Farrar MD, Rhodes LE. Solar urticaria involves rapid mast cell STAT3 activation and neutrophil recruitment, with FcεRI as an upstream regulator. J Allergy Clin Immunol 2024; 153:1369-1380.e15. [PMID: 38184075 DOI: 10.1016/j.jaci.2023.12.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 12/01/2023] [Accepted: 12/21/2023] [Indexed: 01/08/2024]
Abstract
BACKGROUND Solar urticaria is a rare photodermatosis characterized by rapid-onset sunlight-induced urticaria, but its pathophysiology is not well understood. OBJECTIVE We sought to define cutaneous cellular and molecular events in the evolution of solar urticaria following its initiation by solar-simulated UV radiation (SSR) and compare with healthy controls (HC). METHODS Cutaneous biopsy specimens were taken from unexposed skin and skin exposed to a single low (physiologic) dose of SSR at 30 minutes, 3 hours, and 24 hours after exposure in 6 patients with solar urticaria and 6 HC. Biopsy specimens were assessed by immunohistochemistry and bulk RNA-sequencing analysis. RESULTS In solar urticaria specimens, there was enrichment of several innate immune pathways, with striking early involvement of neutrophils, which was not observed in HC. Multiple proinflammatory cytokine and chemokine genes were upregulated (including IL20, IL6, and CXCL8) or identified as upstream regulators (including TNF, IL-1β, and IFN-γ). IgE and FcεRI were identified as upstream regulators, and phosphorylated signal transducer and activator of transcription 3 expression in mast cells was increased in solar urticaria at 30 minutes and 3 hours after SSR exposure, suggesting a mechanism of mast cell activation. Clinical resolution of solar urticaria by 24 hours mirrored resolution of inflammatory gene signature profiles. Comparison with available datasets of chronic spontaneous urticaria showed transcriptomic similarities relating to immune activation, but several transcripts were identified solely in solar urticaria, including CXCL8 and CSF2/3. CONCLUSIONS Solar urticaria is characterized by rapid signal transducer and activator of transcription 3 activation in mast cells and involvement of multiple chemotactic and innate inflammatory pathways, with FcεRI engagement indicated as an early event.
Collapse
Affiliation(s)
- Kirsty J Rutter
- Centre for Dermatology Research, Division of Musculoskeletal and Dermatological Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, NIHR Manchester Biomedical Research Centre, Manchester, United Kingdom; Salford Royal Hospital, Northern Care Alliance NHS Foundation Trust, Manchester Academic Health Science Centre, Greater Manchester, United Kingdom.
| | - Michael Peake
- Centre for Dermatology Research, Division of Musculoskeletal and Dermatological Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, NIHR Manchester Biomedical Research Centre, Manchester, United Kingdom
| | - Nathan J Hawkshaw
- Centre for Dermatology Research, Division of Musculoskeletal and Dermatological Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, NIHR Manchester Biomedical Research Centre, Manchester, United Kingdom
| | - Rachel Scholey
- Genomic Technologies Core Facility, University of Manchester, Manchester, United Kingdom
| | - Silvia Bulfone-Paus
- Centre for Dermatology Research, Division of Musculoskeletal and Dermatological Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, NIHR Manchester Biomedical Research Centre, Manchester, United Kingdom
| | - Peter S Friedmann
- Clinical and Experimental Sciences, Faculty of Medicine, University of Southampton, Southampton, United Kingdom
| | - Mark D Farrar
- Centre for Dermatology Research, Division of Musculoskeletal and Dermatological Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, NIHR Manchester Biomedical Research Centre, Manchester, United Kingdom; Salford Royal Hospital, Northern Care Alliance NHS Foundation Trust, Manchester Academic Health Science Centre, Greater Manchester, United Kingdom
| | - Lesley E Rhodes
- Centre for Dermatology Research, Division of Musculoskeletal and Dermatological Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, NIHR Manchester Biomedical Research Centre, Manchester, United Kingdom; Salford Royal Hospital, Northern Care Alliance NHS Foundation Trust, Manchester Academic Health Science Centre, Greater Manchester, United Kingdom
| |
Collapse
|
8
|
Scharl T, Grün B. A clustering procedure for three-way RNA sequencing data using data transformations and matrix-variate Gaussian mixture models. BMC Bioinformatics 2024; 25:90. [PMID: 38429687 PMCID: PMC10905927 DOI: 10.1186/s12859-024-05717-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 02/21/2024] [Indexed: 03/03/2024] Open
Abstract
RNA sequencing of time-course experiments results in three-way count data where the dimensions are the genes, the time points and the biological units. Clustering RNA-seq data allows to extract groups of co-expressed genes over time. After standardisation, the normalised counts of individual genes across time points and biological units have similar properties as compositional data. We propose the following procedure to suitably cluster three-way RNA-seq data: (1) pre-process the RNA-seq data by calculating the normalised expression profiles, (2) transform the data using the additive log ratio transform to map the composition in the D-part Aitchison simplex to a D - 1 -dimensional Euclidean vector, (3) cluster the transformed RNA-seq data using matrix-variate Gaussian mixture models and (4) assess the quality of the overall cluster solution and of individual clusters based on cluster separation in the transformed space using density-based silhouette information and on compactness of the cluster in the original space using cluster maps as a suitable visualisation. The proposed procedure is illustrated on RNA-seq data from fission yeast and results are also compared to an analogous two-way approach after flattening out the biological units.
Collapse
Affiliation(s)
- Theresa Scharl
- Institute of Statistics, University of Natural Resources and Life Sciences, Vienna, Austria.
| | - Bettina Grün
- Institute for Statistics and Mathematics, Vienna University of Economics and Business, Vienna, Austria
| |
Collapse
|
9
|
Hao X, Wang S, Fu Y, Liu Y, Shen H, Jiang L, McLamore ES, Shen Y. The WRKY46-MYC2 module plays a critical role in E-2-hexenal-induced anti-herbivore responses by promoting flavonoid accumulation. PLANT COMMUNICATIONS 2024; 5:100734. [PMID: 37859344 PMCID: PMC10873895 DOI: 10.1016/j.xplc.2023.100734] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Revised: 10/08/2023] [Accepted: 10/17/2023] [Indexed: 10/21/2023]
Abstract
Volatile organic compounds (VOCs) play key roles in plant-plant communication, especially in response to pest attack. E-2-hexenal is an important component of VOCs, but it is unclear whether it can induce endogenous plant resistance to insects. Here, we show that E-2-hexenal activates early signaling events in Arabidopsis (Arabidopsis thaliana) mesophyll cells, including an H2O2 burst at the plasma membrane, the directed flow of calcium ions, and an increase in cytosolic calcium concentration. Treatment of wild-type Arabidopsis plants with E-2-hexenal increases their resistance when challenged with the diamondback moth Plutella xylostella L., and this phenomenon is largely lost in the wrky46 mutant. Mechanistically, E-2-hexenal induces the expression of WRKY46 and MYC2, and the physical interaction of their encoded proteins was verified by yeast two-hybrid, firefly luciferase complementation imaging, and in vitro pull-down assays. The WRKY46-MYC2 complex directly binds to the promoter of RBOHD to promote its expression, as demonstrated by luciferase reporter, yeast one-hybrid, chromatin immunoprecipitation, and electrophoretic mobility shift assays. This module also positively regulates the expression of E-2-hexenal-induced naringenin biosynthesis genes (TT4 and CHIL) and the accumulation of total flavonoids, thereby modulating plant tolerance to insects. Together, our results highlight an important role for the WRKY46-MYC2 module in the E-2-hexenal-induced defense response of Arabidopsis, providing new insights into the mechanisms by which VOCs trigger plant defense responses.
Collapse
Affiliation(s)
- Xin Hao
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
| | - Shuyao Wang
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
| | - Yu Fu
- State Key Laboratory of Animal Biotech Breeding, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Yahui Liu
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
| | - Hongyu Shen
- University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA
| | - Libo Jiang
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China
| | - Eric S McLamore
- Department of Agricultural Sciences, Clemson University, Clemson, SC 29634, USA
| | - Yingbai Shen
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China.
| |
Collapse
|
10
|
Fang Y, Subedi S. Clustering microbiome data using mixtures of logistic normal multinomial models. Sci Rep 2023; 13:14758. [PMID: 37679485 PMCID: PMC10484970 DOI: 10.1038/s41598-023-41318-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Accepted: 08/24/2023] [Indexed: 09/09/2023] Open
Abstract
Discrete data such as counts of microbiome taxa resulting from next-generation sequencing are routinely encountered in bioinformatics. Taxa count data in microbiome studies are typically high-dimensional, over-dispersed, and can only reveal relative abundance therefore being treated as compositional. Analyzing compositional data presents many challenges because they are restricted to a simplex. In a logistic normal multinomial model, the relative abundance is mapped from a simplex to a latent variable that exists on the real Euclidean space using the additive log-ratio transformation. While a logistic normal multinomial approach brings flexibility for modeling the data, it comes with a heavy computational cost as the parameter estimation typically relies on Bayesian techniques. In this paper, we develop a novel mixture of logistic normal multinomial models for clustering microbiome data. Additionally, we utilize an efficient framework for parameter estimation using variational Gaussian approximations (VGA). Adopting a variational Gaussian approximation for the posterior of the latent variable reduces the computational overhead substantially. The proposed method is illustrated on simulated and real datasets.
Collapse
Affiliation(s)
- Yuan Fang
- School of Pharmacy and Pharmaceutical Sciences, Binghamton University, State University of New York, 4400 Vestal Parkway East, Binghamton, NY, 13902, USA
| | - Sanjeena Subedi
- School of Mathematics and Statistics, 4302 Herzberg Laboratories, Carleton University, 1125 Colonel By Drive, Ottawa, ON, K1S 5B6, Canada.
| |
Collapse
|
11
|
Rahnenführer J, De Bin R, Benner A, Ambrogi F, Lusa L, Boulesteix AL, Migliavacca E, Binder H, Michiels S, Sauerbrei W, McShane L. Statistical analysis of high-dimensional biomedical data: a gentle introduction to analytical goals, common approaches and challenges. BMC Med 2023; 21:182. [PMID: 37189125 DOI: 10.1186/s12916-023-02858-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Accepted: 04/03/2023] [Indexed: 05/17/2023] Open
Abstract
BACKGROUND In high-dimensional data (HDD) settings, the number of variables associated with each observation is very large. Prominent examples of HDD in biomedical research include omics data with a large number of variables such as many measurements across the genome, proteome, or metabolome, as well as electronic health records data that have large numbers of variables recorded for each patient. The statistical analysis of such data requires knowledge and experience, sometimes of complex methods adapted to the respective research questions. METHODS Advances in statistical methodology and machine learning methods offer new opportunities for innovative analyses of HDD, but at the same time require a deeper understanding of some fundamental statistical concepts. Topic group TG9 "High-dimensional data" of the STRATOS (STRengthening Analytical Thinking for Observational Studies) initiative provides guidance for the analysis of observational studies, addressing particular statistical challenges and opportunities for the analysis of studies involving HDD. In this overview, we discuss key aspects of HDD analysis to provide a gentle introduction for non-statisticians and for classically trained statisticians with little experience specific to HDD. RESULTS The paper is organized with respect to subtopics that are most relevant for the analysis of HDD, in particular initial data analysis, exploratory data analysis, multiple testing, and prediction. For each subtopic, main analytical goals in HDD settings are outlined. For each of these goals, basic explanations for some commonly used analysis methods are provided. Situations are identified where traditional statistical methods cannot, or should not, be used in the HDD setting, or where adequate analytic tools are still lacking. Many key references are provided. CONCLUSIONS This review aims to provide a solid statistical foundation for researchers, including statisticians and non-statisticians, who are new to research with HDD or simply want to better evaluate and understand the results of HDD analyses.
Collapse
Affiliation(s)
| | | | - Axel Benner
- Division of Biostatistics, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Federico Ambrogi
- Department of Clinical Sciences and Community Health, University of Milan, Milan, Italy
- Scientific Directorate, IRCCS Policlinico San Donato, San Donato Milanese, Italy
| | - Lara Lusa
- Department of Mathematics, Faculty of Mathematics, Natural Sciences and Information Technology, University of Primorksa, Koper, Slovenia
- Institute of Biostatistics and Medical Informatics, University of Ljubljana, Ljubljana, Slovenia
| | - Anne-Laure Boulesteix
- Institute for Medical Information Processing, Biometry and Epidemiology, Ludwig Maximilian University of Munich, Munich, Germany
| | | | - Harald Binder
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany
| | - Stefan Michiels
- Service de Biostatistique et d'Épidémiologie, Gustave Roussy, Université Paris-Saclay, Villejuif, France
- Oncostat U1018, Inserm, Université Paris-Saclay, Labeled Ligue Contre le Cancer, Villejuif, France
| | - Willi Sauerbrei
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany
| | - Lisa McShane
- Biometric Research Program, Division of Cancer Treatment and Diagnosis, National Cancer Institute, Bethesda, MD, USA.
| |
Collapse
|
12
|
Vu HT, Kaur H, Kies KR, Starks RR, Tuteja G. Identifying novel regulators of placental development using time-series transcriptome data. Life Sci Alliance 2023; 6:6/2/e202201788. [PMID: 36622342 PMCID: PMC9748866 DOI: 10.26508/lsa.202201788] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 11/21/2022] [Accepted: 11/22/2022] [Indexed: 12/15/2022] Open
Abstract
The placenta serves as a connection between the mother and the fetus during pregnancy, providing the fetus with oxygen, nutrients, and growth hormones. However, the regulatory mechanisms and dynamic gene interaction networks underlying early placental development are understudied. Here, we generated RNA-sequencing data from mouse fetal placenta at embryonic days 7.5, 8.5, and 9.5 to identify genes with timepoint-specific expression, then inferred gene interaction networks to analyze highly connected network modules. We determined that timepoint-specific gene network modules were associated with distinct developmental processes, and with similar expression profiles to specific human placental cell populations. From each module, we identified hub genes and their direct neighboring genes, which were predicted to govern placental functions. We confirmed that four novel candidate regulators identified through our analyses regulate cell migration in the HTR-8/SVneo cell line. Overall, we predicted several novel regulators of placental development expressed in specific placental cell types using network analysis of bulk RNA-sequencing data. Our findings and analysis approaches will be valuable for future studies investigating the transcriptional landscape of early development.
Collapse
Affiliation(s)
- Ha Th Vu
- Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, USA.,Bioinformatics and Computational Biology, Iowa State University, Ames, IA, USA
| | - Haninder Kaur
- Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, USA
| | - Kelby R Kies
- Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, USA.,Bioinformatics and Computational Biology, Iowa State University, Ames, IA, USA
| | - Rebekah R Starks
- Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, USA.,Bioinformatics and Computational Biology, Iowa State University, Ames, IA, USA
| | - Geetu Tuteja
- Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, USA .,Bioinformatics and Computational Biology, Iowa State University, Ames, IA, USA
| |
Collapse
|
13
|
Li Y, Rahman T, Ma T, Tang L, Tseng GC. A sparse negative binomial mixture model for clustering RNA-seq count data. Biostatistics 2022; 24:68-84. [PMID: 34363675 PMCID: PMC9766880 DOI: 10.1093/biostatistics/kxab025] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2020] [Revised: 06/03/2021] [Accepted: 06/06/2021] [Indexed: 12/16/2022] Open
Abstract
Clustering with variable selection is a challenging yet critical task for modern small-n-large-p data. Existing methods based on sparse Gaussian mixture models or sparse $K$-means provide solutions to continuous data. With the prevalence of RNA-seq technology and lack of count data modeling for clustering, the current practice is to normalize count expression data into continuous measures and apply existing models with a Gaussian assumption. In this article, we develop a negative binomial mixture model with lasso or fused lasso gene regularization to cluster samples (small $n$) with high-dimensional gene features (large $p$). A modified EM algorithm and Bayesian information criterion are used for inference and determining tuning parameters. The method is compared with existing methods using extensive simulations and two real transcriptomic applications in rat brain and breast cancer studies. The result shows the superior performance of the proposed count data model in clustering accuracy, feature selection, and biological interpretation in pathways.
Collapse
Affiliation(s)
- Yujia Li
- Department of Biostatistics, University of Pittsburgh,
Pittsburgh, PA 15261, USA
| | - Tanbin Rahman
- Department of Biostatistics, University of Pittsburgh,
Pittsburgh, PA 15261, USA
| | - Tianzhou Ma
- Department of Epidemiology and Biostatistics, University of
Maryland, College Park, MD 20742, USA
| | | | - George C Tseng
- Department of Biostatistics, University of Pittsburgh,
Pittsburgh, PA 15261, USA
| |
Collapse
|
14
|
Qiao Z, Barnes E, Tringe S, Schachtman DP, Liu P. Poisson hurdle model-based method for clustering microbiome features. Bioinformatics 2022; 39:6873739. [PMID: 36469352 PMCID: PMC9825753 DOI: 10.1093/bioinformatics/btac782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 10/13/2022] [Accepted: 12/02/2022] [Indexed: 12/12/2022] Open
Abstract
MOTIVATION High-throughput sequencing technologies have greatly facilitated microbiome research and have generated a large volume of microbiome data with the potential to answer key questions regarding microbiome assembly, structure and function. Cluster analysis aims to group features that behave similarly across treatments, and such grouping helps to highlight the functional relationships among features and may provide biological insights into microbiome networks. However, clustering microbiome data are challenging due to the sparsity and high dimensionality. RESULTS We propose a model-based clustering method based on Poisson hurdle models for sparse microbiome count data. We describe an expectation-maximization algorithm and a modified version using simulated annealing to conduct the cluster analysis. Moreover, we provide algorithms for initialization and choosing the number of clusters. Simulation results demonstrate that our proposed methods provide better clustering results than alternative methods under a variety of settings. We also apply the proposed method to a sorghum rhizosphere microbiome dataset that results in interesting biological findings. AVAILABILITY AND IMPLEMENTATION R package is freely available for download at https://cran.r-project.org/package=PHclust. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Zhili Qiao
- To whom correspondence should be addressed. or
| | - Elle Barnes
- Department of Energy, Joint Genome Institute, Berkeley, CA 94720, USA
| | - Susannah Tringe
- Department of Energy, Joint Genome Institute, Berkeley, CA 94720, USA,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Daniel P Schachtman
- Department of Agronomy and Horticulture, University of Nebraska, Lincoln, NE 68583, USA
| | - Peng Liu
- To whom correspondence should be addressed. or
| |
Collapse
|
15
|
Sun X, Chen J, Fan W, Liu S, Kamruzzaman M. Production of Reactive Oxygen Species via Nanobubble Water Improves Radish Seed Water Absorption and the Expression of Aquaporin Genes. LANGMUIR : THE ACS JOURNAL OF SURFACES AND COLLOIDS 2022; 38:11724-11731. [PMID: 36103666 DOI: 10.1021/acs.langmuir.2c01860] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Nanobubbles (NBs) stimulate seed germination; however, the mechanism of the promotion effect of NBs remains unclear. The impact of NBs on seed water absorption was investigated; we subsequently studied the genes associated with the response of radish seeds to NB water and used RNA sequencing to generate their expression profiles, especially those of aquaporin genes. NB water significantly promoted germination. The times at which 50% of the germinating seeds achieved germination (T50) for the submerged radish seeds in NB and control water were 11.6 and 17.4 h, respectively. NB water-germinated radish seeds showed a water uptake rate coefficient that was 15% higher than that of those germinated in control water. Through GO enrichment and cluster analyses, it was evident that NB water significantly increased the level of expression of the genes associated with the following activities: oxidoreductase, peroxidase, and antioxidant. Our results demonstrated that NB water increases the water uptake rate of radish seeds via two mechanisms. The NB water-produced exogenous hydroxyl radical (•OH) increases the seed coat's water permeability and enhances cell wall loosening, and NB water increases the aquaporin gene expression level of radish seeds.
Collapse
Affiliation(s)
- Xiaohong Sun
- Institute of Biotechnology, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
| | - Jingrao Chen
- Institute of Biotechnology, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
| | - Wenhong Fan
- Department of Environmental Science and Engineering, School of Space and Environment, Beihang University, Beijing 100191, China
| | - Shu Liu
- Department of Environmental Science and Engineering, School of Space and Environment, Beihang University, Beijing 100191, China
| | - Mohammed Kamruzzaman
- Department of Agricultural and Biological Engineering, University of Illinois at Urbana-Champaign, 1304 West Pennsylvania Avenue, Urbana, Illinois 61801, United States
| |
Collapse
|
16
|
Gomez-Picos P, Ovens K, Eames BF. Limb Mesoderm and Head Ectomesenchyme Both Express a Core Transcriptional Program During Chondrocyte Differentiation. Front Cell Dev Biol 2022; 10:876825. [PMID: 35784462 PMCID: PMC9247276 DOI: 10.3389/fcell.2022.876825] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Accepted: 05/26/2022] [Indexed: 11/13/2022] Open
Abstract
To explain how cartilage appeared in different parts of the vertebrate body at discrete times during evolution, we hypothesize that different embryonic populations co-opted expression of a core gene regulatory network (GRN) driving chondrocyte differentiation. To test this hypothesis, laser-capture microdissection coupled with RNA-seq was used to reveal chondrocyte transcriptomes in the developing chick humerus and ceratobranchial, which are mesoderm- and neural crest-derived, respectively. During endochondral ossification, two general types of chondrocytes differentiate. Immature chondrocytes (IMM) represent the early stages of cartilage differentiation, while mature chondrocytes (MAT) undergo additional stages of differentiation, including hypertrophy and stimulating matrix mineralization and degradation. Venn diagram analyses generally revealed a high degree of conservation between chondrocyte transcriptomes of the limb and head, including SOX9, COL2A1, and ACAN expression. Typical maturation genes, such as COL10A1, IBSP, and SPP1, were upregulated in MAT compared to IMM in both limb and head chondrocytes. Gene co-expression network (GCN) analyses of limb and head chondrocyte transcriptomes estimated the core GRN governing cartilage differentiation. Two discrete portions of the GCN contained genes that were differentially expressed in limb or head chondrocytes, but these genes were enriched for biological processes related to limb/forelimb morphogenesis or neural crest-dependent processes, respectively, perhaps simply reflecting the embryonic origin of the cells. A core GRN driving cartilage differentiation in limb and head was revealed that included typical chondrocyte differentiation and maturation markers, as well as putative novel "chondrocyte" genes. Conservation of a core transcriptional program during chondrocyte differentiation in both the limb and head suggest that the same core GRN was co-opted when cartilage appeared in different regions of the skeleton during vertebrate evolution.
Collapse
Affiliation(s)
- Patsy Gomez-Picos
- Department of Anatomy, Physiology, and Pharmacology, University of Saskatchewan, Saskatoon, SK, Canada
| | - Katie Ovens
- Department of Computer Science, University of Calgary, Calgary, AB, Canada
| | - B. Frank Eames
- Department of Anatomy, Physiology, and Pharmacology, University of Saskatchewan, Saskatoon, SK, Canada
| |
Collapse
|
17
|
Xuan H, Huang Y, Zhou L, Deng S, Wang C, Xu J, Wang H, Zhao J, Guo N, Xing H. Key Soybean Seedlings Drought-Responsive Genes and Pathways Revealed by Comparative Transcriptome Analyses of Two Cultivars. Int J Mol Sci 2022; 23:2893. [PMID: 35270036 PMCID: PMC8911164 DOI: 10.3390/ijms23052893] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Revised: 03/03/2022] [Accepted: 03/05/2022] [Indexed: 02/08/2023] Open
Abstract
Seedling drought stress is one of the most important constraints affecting soybean yield and quality. To unravel the molecular mechanisms under soybean drought tolerance, we conducted comprehensive comparative transcriptome analyses of drought-tolerant genotype Jindou 21 (JD) and drought-sensitive genotype Tianlong No.1 (N1) seedlings that had been exposed to drought treatment. A total of 6038 and 4112 differentially expressed genes (DEGs) were identified in drought-tolerant JD and drought-sensitive N1, respectively. Subsequent KEGG pathway analyses showed that numerous DEGs in JD are predominately involved in signal transduction pathways, including plant hormone signaling pathway, calcium signaling pathway, and MAPK signaling pathway. Interestingly, JA and BR plant hormone signal transduction pathways were found specifically participating in drought-tolerant JD. Meanwhile, the differentially expressed CPKs, CIPKs, MAPKs, and MAP3Ks of calcium and MAPK signaling pathway were only identified in JD. The number of DEGs involved in transcription factors (TFs) is larger in JD than that of in N1. Moreover, some differently expressed transcriptional factor genes were only identified in drought-tolerant JD, including FAR1, RAV, LSD1, EIL, and HB-PHD. In addition, this study suggested that JD could respond to drought stress by regulating the cell wall remodeling and stress-related protein genes such as EXPs, CALSs, CBPs, BBXs, and RD22s. JD is more drought tolerant than N1 owing to more DEGs being involved in multiple signal transduction pathways (JA, BR, calcium, MAPK signaling pathway), stress-related TFs, and proteins. The above valuable genes and pathways will deepen the understanding of the molecular mechanisms under drought stress in soybean.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Na Guo
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetics and Breeding for Soybean, Ministry of Agriculture, State Key Laboratory for Crop Genetics and Germplasm Enhancement, College of Agriculture, Nanjing Agricultural University, Nanjing 210095, China; (H.X.); (Y.H.); (L.Z.); (S.D.); (C.W.); (J.X.); (H.W.); (J.Z.)
| | - Han Xing
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetics and Breeding for Soybean, Ministry of Agriculture, State Key Laboratory for Crop Genetics and Germplasm Enhancement, College of Agriculture, Nanjing Agricultural University, Nanjing 210095, China; (H.X.); (Y.H.); (L.Z.); (S.D.); (C.W.); (J.X.); (H.W.); (J.Z.)
| |
Collapse
|
18
|
Osabe T, Shimizu K, Kadota K. Differential expression analysis using a model-based gene clustering algorithm for RNA-seq data. BMC Bioinformatics 2021; 22:511. [PMID: 34670485 PMCID: PMC8527798 DOI: 10.1186/s12859-021-04438-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Accepted: 10/11/2021] [Indexed: 11/10/2022] Open
Abstract
Background RNA-seq is a tool for measuring gene expression and is commonly used to identify differentially expressed genes (DEGs). Gene clustering is used to classify DEGs with similar expression patterns for the subsequent analyses of data from experiments such as time-courses or multi-group comparisons. However, gene clustering has rarely been used for analyzing simple two-group data or differential expression (DE). In this study, we report that a model-based clustering algorithm implemented in an R package, MBCluster.Seq, can also be used for DE analysis. Results The input data originally used by MBCluster.Seq is DEGs, and the proposed method (called MBCdeg) uses all genes for the analysis. The method uses posterior probabilities of genes assigned to a cluster displaying non-DEG pattern for overall gene ranking. We compared the performance of MBCdeg with conventional R packages such as edgeR, DESeq2, and TCC that are specialized for DE analysis using simulated and real data. Our results showed that MBCdeg outperformed other methods when the proportion of DEG (PDEG) was less than 50%. However, the DEG identification using MBCdeg was less consistent than with conventional methods. We compared the effects of different normalization algorithms using MBCdeg, and performed an analysis using MBCdeg in combination with a robust normalization algorithm (called DEGES) that was not implemented in MBCluster.Seq. The new analysis method showed greater stability than using the original MBCdeg with the default normalization algorithm. Conclusions MBCdeg with DEGES normalization can be used in the identification of DEGs when the PDEG is relatively low. As the method is based on gene clustering, the DE result includes information on which expression pattern the gene belongs to. The new method may be useful for the analysis of time-course and multi-group data, where the classification of expression patterns is often required. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04438-4.
Collapse
Affiliation(s)
- Takayuki Osabe
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo, 113-8657, Japan
| | - Kentaro Shimizu
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo, 113-8657, Japan.,Collaborative Research Institute for Innovative Microbiology, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo, 113-8657, Japan.,Interfaculty Initiative in Information Studies, The University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo, 113-0033, Japan
| | - Koji Kadota
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo, 113-8657, Japan. .,Collaborative Research Institute for Innovative Microbiology, The University of Tokyo, Yayoi 1-1-1, Bunkyo-ku, Tokyo, 113-8657, Japan. .,Interfaculty Initiative in Information Studies, The University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo, 113-0033, Japan.
| |
Collapse
|
19
|
Sun V, Sharpley M, Kaczor-Urbanowicz KE, Chang P, Montel-Hagen A, Lopez S, Zampieri A, Zhu Y, de Barros SC, Parekh C, Casero D, Banerjee U, Crooks GM. The Metabolic Landscape of Thymic T Cell Development In Vivo and In Vitro. Front Immunol 2021; 12:716661. [PMID: 34394122 PMCID: PMC8355594 DOI: 10.3389/fimmu.2021.716661] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2021] [Accepted: 07/12/2021] [Indexed: 12/02/2022] Open
Abstract
Although metabolic pathways have been shown to control differentiation and activation in peripheral T cells, metabolic studies on thymic T cell development are still lacking, especially in human tissue. In this study, we use transcriptomics and extracellular flux analyses to investigate the metabolic profiles of primary thymic and in vitro-derived mouse and human thymocytes. Core metabolic pathways, specifically glycolysis and oxidative phosphorylation, undergo dramatic changes between the double-negative (DN), double-positive (DP), and mature single-positive (SP) stages in murine and human thymus. Remarkably, despite the absence of the complex multicellular thymic microenvironment, in vitro murine and human T cell development recapitulated the coordinated decrease in glycolytic and oxidative phosphorylation activity between the DN and DP stages seen in primary thymus. Moreover, by inducing in vitro T cell differentiation from Rag1-/- mouse bone marrow, we show that reduced metabolic activity at the DP stage is independent of TCR rearrangement. Thus, our findings suggest that highly conserved metabolic transitions are critical for thymic T cell development.
Collapse
Affiliation(s)
- Victoria Sun
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles (UCLA), Los Angeles, CA, United States
- Molecular Biology Interdepartmental Program, UCLA, Los Angeles, CA, United States
| | - Mark Sharpley
- Department of Molecular, Cell and Developmental Biology, UCLA, Los Angeles, CA, United States
| | - Karolina E. Kaczor-Urbanowicz
- Division of Oral Biology & Medicine, School of Dentistry, UCLA, Los Angeles, CA, United States
- Institute for Quantitative and Computational Biosciences, UCLA, Los Angeles, CA, United States
| | - Patrick Chang
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles (UCLA), Los Angeles, CA, United States
- Molecular Biology Interdepartmental Program, UCLA, Los Angeles, CA, United States
| | - Amélie Montel-Hagen
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles (UCLA), Los Angeles, CA, United States
| | - Shawn Lopez
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles (UCLA), Los Angeles, CA, United States
| | - Alexandre Zampieri
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles (UCLA), Los Angeles, CA, United States
| | - Yuhua Zhu
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles (UCLA), Los Angeles, CA, United States
| | - Stéphanie C. de Barros
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles (UCLA), Los Angeles, CA, United States
| | - Chintan Parekh
- Cancer and Blood Disease Institute, Children’s Hospital Los Angeles, Los Angeles, CA, United States
- Department of Pediatrics, Keck School of Medicine, University of Southern California, Los Angeles, CA, United States
| | - David Casero
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles (UCLA), Los Angeles, CA, United States
- F. Widjaja Foundation Inflammatory Bowel and Immunobiology Research Institute, Cedars- Sinai Medical Center, Los Angeles, CA, United States
| | - Utpal Banerjee
- Molecular Biology Interdepartmental Program, UCLA, Los Angeles, CA, United States
- Department of Molecular, Cell and Developmental Biology, UCLA, Los Angeles, CA, United States
- Department of Biological Chemistry, UCLA, Los Angeles, CA, United States
- Eli and Edythe Center of Regenerative Medicine and Stem Cell Research, UCLA, Los Angeles, CA, United States
| | - Gay M. Crooks
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles (UCLA), Los Angeles, CA, United States
- Eli and Edythe Center of Regenerative Medicine and Stem Cell Research, UCLA, Los Angeles, CA, United States
- Jonsson Comprehensive Cancer Center, UCLA, Los Angeles, CA, United States
- Division of Pediatric Hematology-Oncology, Department of Pediatrics, David Geffen School of Medicine, UCLA, Los Angeles, CA, United States
| |
Collapse
|
20
|
Salisbury DA, Casero D, Zhang Z, Wang D, Kim J, Wu X, Vergnes L, Mirza AH, Leon-Mimila P, Williams KJ, Huertas-Vazquez A, Jaffrey SR, Reue K, Chen J, Sallam T. Transcriptional regulation of N 6-methyladenosine orchestrates sex-dimorphic metabolic traits. Nat Metab 2021; 3:940-953. [PMID: 34282353 PMCID: PMC8422857 DOI: 10.1038/s42255-021-00427-2] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Accepted: 06/11/2021] [Indexed: 02/06/2023]
Abstract
Males and females exhibit striking differences in the prevalence of metabolic traits including hepatic steatosis, a key driver of cardiometabolic morbidity and mortality. RNA methylation is a widespread regulatory mechanism of transcript turnover. Here, we show that presence of the RNA modification N6-methyladenosine (m6A) triages lipogenic transcripts for degradation and guards against hepatic triglyceride accumulation. In male but not female mice, this protective checkpoint stalls under lipid-rich conditions. Loss of m6A control in male livers increases hepatic triglyceride stores, leading to a more 'feminized' hepatic lipid composition. Crucially, liver-specific deletion of the m6A complex protein Mettl14 from male and female mice significantly diminishes sex-specific differences in steatosis. We further surmise that the m6A installing machinery is subject to transcriptional control by the sex-responsive BCL6-STAT5 axis in response to dietary conditions. These data show that m6A is essential for precise and synchronized control of lipogenic enzyme activity and provide insights into the molecular basis for the existence of sex-specific differences in hepatic lipid traits.
Collapse
Affiliation(s)
- David A Salisbury
- Division of Cardiology, Department of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA, USA
- Molecular Biology Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, USA
| | - David Casero
- F. Widjaja Foundation Inflammatory Bowel & Immunobiology Research Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Zhengyi Zhang
- Division of Cardiology, Department of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA, USA
| | - Dan Wang
- Division of Cardiology, Department of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA, USA
| | - Jason Kim
- Division of Cardiology, Department of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA, USA
| | - Xiaohui Wu
- Division of Cardiology, Department of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA, USA
| | - Laurent Vergnes
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
| | - Aashiq H Mirza
- Department of Pharmacology, Weill Cornell Medicine, Cornell University, New York, NY, USA
| | - Paola Leon-Mimila
- Division of Cardiology, Department of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Unidad de Genómica de Poblaciones Aplicada a la Salud, Facultad de Química, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Kevin J Williams
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA, USA
| | - Adriana Huertas-Vazquez
- Division of Cardiology, Department of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Samie R Jaffrey
- Department of Pharmacology, Weill Cornell Medicine, Cornell University, New York, NY, USA
| | - Karen Reue
- Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
| | - Jianjun Chen
- Department of Systems Biology, The Beckman Research Institute of City of Hope, Duarte, CA, USA
| | - Tamer Sallam
- Division of Cardiology, Department of Medicine, University of California, Los Angeles, Los Angeles, CA, USA.
- Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA, USA.
- Molecular Biology Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, USA.
| |
Collapse
|
21
|
Maren N, Zhao F, Aryal R, Touchell D, Liu W, Ranney T, Ashrafi H. Reproductive developmental transcriptome analysis of Tripidium ravennae (Poaceae). BMC Genomics 2021; 22:483. [PMID: 34182921 PMCID: PMC8237498 DOI: 10.1186/s12864-021-07641-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Accepted: 04/20/2021] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Tripidium ravennae is a cold-hardy, diploid species in the sugarcane complex (Poaceae subtribe Saccharinae) with considerable potential as a genetic resource for developing improved bioenergy and ornamental grasses. An improved understanding of the genetic regulation of reproductive processes (e.g., floral induction, inflorescence development, and seed development) will enable future applications of precision breeding and gene editing of floral and seed development. In particular, the ability to silence reproductive processes would allow for developing seedless forms of valuable but potentially invasive plants. The objective of this research was to characterize the gene expression environment of reproductive development in T. ravennae. RESULTS During the early phases of inflorescence development, multiple key canonical floral integrators and pathways were identified. Annotations of type II subfamily of MADS-box transcription factors, in particular, were over-represented in the GO enrichment analyses and tests for differential expression (FDR p-value < 0.05). The differential expression of floral integrators observed in the early phases of inflorescence development diminished prior to inflorescence determinacy regulation. Differential expression analysis did not identify many unique genes at mid-inflorescence development stages, though typical biological processes involved in plant growth and development expressed abundantly. The increase in inflorescence determinacy regulatory elements and putative homeotic floral development unigenes at mid-inflorescence development coincided with the expression of multiple meiosis annotations and multicellular organism developmental processes. Analysis of seed development identified multiple unigenes involved in oxidative-reductive processes. CONCLUSION Reproduction in grasses is a dynamic system involving the sequential coordination of complex gene regulatory networks and developmental processes. This research identified differentially expressed transcripts associated with floral induction, inflorescence development, and seed development in T. ravennae. These results provide insights into the molecular regulation of reproductive development and provide a foundation for future investigations and analyses, including genome annotation, functional genomics characterization, gene family evolutionary studies, comparative genomics, and precision breeding.
Collapse
Affiliation(s)
- Nathan Maren
- Department of Horticultural Science, North Carolina State University, Campus Box 7609, Raleigh, NC, 27695-7609, USA.
| | - Fangzhou Zhao
- Department of Horticultural Science, North Carolina State University, Campus Box 7609, Raleigh, NC, 27695-7609, USA
- College of Agriculture, Nanjing Agricultural University, Nanjing, 210095, China
| | - Rishi Aryal
- Department of Horticultural Science, North Carolina State University, Campus Box 7609, Raleigh, NC, 27695-7609, USA
| | - Darren Touchell
- Mountain Crop Improvement Lab, Department of Horticultural Science, Mountain Horticultural Crops Research and Extension Center, North Carolina State University, 455 Research Drive, Mills River, NC, 28759-3423, USA
| | - Wusheng Liu
- Department of Horticultural Science, North Carolina State University, Campus Box 7609, Raleigh, NC, 27695-7609, USA
| | - Thomas Ranney
- Mountain Crop Improvement Lab, Department of Horticultural Science, Mountain Horticultural Crops Research and Extension Center, North Carolina State University, 455 Research Drive, Mills River, NC, 28759-3423, USA
| | - Hamid Ashrafi
- Department of Horticultural Science, North Carolina State University, Campus Box 7609, Raleigh, NC, 27695-7609, USA.
| |
Collapse
|
22
|
Chung M, Bruno VM, Rasko DA, Cuomo CA, Muñoz JF, Livny J, Shetty AC, Mahurkar A, Dunning Hotopp JC. Best practices on the differential expression analysis of multi-species RNA-seq. Genome Biol 2021; 22:121. [PMID: 33926528 PMCID: PMC8082843 DOI: 10.1186/s13059-021-02337-8] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Accepted: 04/01/2021] [Indexed: 02/07/2023] Open
Abstract
Advances in transcriptome sequencing allow for simultaneous interrogation of differentially expressed genes from multiple species originating from a single RNA sample, termed dual or multi-species transcriptomics. Compared to single-species differential expression analysis, the design of multi-species differential expression experiments must account for the relative abundances of each organism of interest within the sample, often requiring enrichment methods and yielding differences in total read counts across samples. The analysis of multi-species transcriptomics datasets requires modifications to the alignment, quantification, and downstream analysis steps compared to the single-species analysis pipelines. We describe best practices for multi-species transcriptomics and differential gene expression.
Collapse
Affiliation(s)
- Matthew Chung
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201 USA
- Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201 USA
| | - Vincent M. Bruno
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201 USA
- Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201 USA
| | - David A. Rasko
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201 USA
- Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201 USA
| | - Christina A. Cuomo
- Infectious Disease and Microbiome Program, Broad Institute, Cambridge, MA 02142 USA
| | - José F. Muñoz
- Infectious Disease and Microbiome Program, Broad Institute, Cambridge, MA 02142 USA
| | - Jonathan Livny
- Infectious Disease and Microbiome Program, Broad Institute, Cambridge, MA 02142 USA
| | - Amol C. Shetty
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201 USA
| | - Anup Mahurkar
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201 USA
| | - Julie C. Dunning Hotopp
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201 USA
- Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201 USA
- Greenebaum Cancer Center, University of Maryland, Baltimore, MD 21201 USA
| |
Collapse
|
23
|
Santiago JC, Boylan JM, Lemieux FA, Gruppuso PA, Sanders JA, Rand DM. Mitochondrial genotype alters the impact of rapamycin on the transcriptional response to nutrients in Drosophila. BMC Genomics 2021; 22:213. [PMID: 33761878 PMCID: PMC7992956 DOI: 10.1186/s12864-021-07516-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2020] [Accepted: 03/08/2021] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND In addition to their well characterized role in cellular energy production, new evidence has revealed the involvement of mitochondria in diverse signaling pathways that regulate a broad array of cellular functions. The mitochondrial genome (mtDNA) encodes essential components of the oxidative phosphorylation (OXPHOS) pathway whose expression must be coordinated with the components transcribed from the nuclear genome. Mitochondrial dysfunction is associated with disorders including cancer and neurodegenerative diseases, yet the role of the complex interactions between the mitochondrial and nuclear genomes are poorly understood. RESULTS Using a Drosophila model in which alternative mtDNAs are present on a common nuclear background, we studied the effects of this altered mitonuclear communication on the transcriptomic response to altered nutrient status. Adult flies with the 'native' and 'disrupted' genotypes were re-fed following brief starvation, with or without exposure to rapamycin, the cognate inhibitor of the nutrient-sensing target of rapamycin (TOR). RNAseq showed that alternative mtDNA genotypes affect the temporal transcriptional response to nutrients in a rapamycin-dependent manner. Pathways most greatly affected were OXPHOS, protein metabolism and fatty acid metabolism. A distinct set of testis-specific genes was also differentially regulated in the experiment. CONCLUSIONS Many of the differentially expressed genes between alternative mitonuclear genotypes have no direct interaction with mtDNA gene products, suggesting that the mtDNA genotype contributes to retrograde signaling from mitochondria to the nucleus. The interaction of mitochondrial genotype (mtDNA) with rapamycin treatment identifies new links between mitochondria and the nutrient-sensing mTORC1 (mechanistic target of rapamycin complex 1) signaling pathway.
Collapse
Affiliation(s)
- John C. Santiago
- grid.40263.330000 0004 1936 9094Department of Molecular Biology, Cellular Biology and Biochemistry, Brown University, Providence, RI 02912 USA ,grid.40263.330000 0004 1936 9094Department Pathology & Laboratory Medicine, Brown University, Providence, RI 02912 USA
| | - Joan M. Boylan
- grid.240588.30000 0001 0557 9478Department of Pediatrics, Rhode Island Hospital, Providence, RI 02903 USA
| | - Faye A. Lemieux
- grid.40263.330000 0004 1936 9094Department of Ecology and Evolutionary Biology, Brown University, Providence, RI 02912 USA
| | - Philip A. Gruppuso
- grid.40263.330000 0004 1936 9094Department of Molecular Biology, Cellular Biology and Biochemistry, Brown University, Providence, RI 02912 USA ,grid.240588.30000 0001 0557 9478Department of Pediatrics, Rhode Island Hospital, Providence, RI 02903 USA
| | - Jennifer A. Sanders
- grid.40263.330000 0004 1936 9094Department Pathology & Laboratory Medicine, Brown University, Providence, RI 02912 USA
| | - David M. Rand
- grid.40263.330000 0004 1936 9094Department of Molecular Biology, Cellular Biology and Biochemistry, Brown University, Providence, RI 02912 USA ,grid.40263.330000 0004 1936 9094Department of Ecology and Evolutionary Biology, Brown University, Providence, RI 02912 USA
| |
Collapse
|
24
|
Yang F, Debatosh D, Song T, Zhang JH. Light Harvesting-like Protein 3 Interacts with Phytoene Synthase and Is Necessary for Carotenoid and Chlorophyll Biosynthesis in Rice. RICE (NEW YORK, N.Y.) 2021; 14:32. [PMID: 33745012 PMCID: PMC7981378 DOI: 10.1186/s12284-021-00474-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/23/2021] [Accepted: 03/10/2021] [Indexed: 06/12/2023]
Abstract
BACKGROUND Carotenoid biosynthesis is essential for the generation of photosynthetic pigments, phytohormone production, and flower color development. The light harvesting like 3 (LIL3) protein, which belongs to the light-harvesting complex protein family in photosystems, interacts with geranylgeranyl reductase (GGR) and protochlorophyllide oxidoreductase (POR) both of which are known to regulate terpenoid and chlorophyll biosynthesis, respectively, in both rice and Arabidopsis. RESULTS In our study, a CRISPR-Cas9 generated 4-bp deletion mutant oslil3 showed aberrant chloroplast development, growth defects, low fertility rates and reduced pigment contents. A comparative transcriptomic analysis of oslil3 suggested that differentially expressed genes (DEGs) involved in photosynthesis, cell wall modification, primary and secondary metabolism are differentially regulated in the mutant. Protein-protein interaction assays indicated that LIL3 interacts with phytoene synthase (PSY) and in addition the gene expression of PSY genes are regulated by LIL3. Subcellular localization of LIL3 and PSY suggested that both are thylakoid membrane anchored proteins in the chloroplast. We suggest that LIL3 directly interacts with PSY to regulate carotenoid biosynthesis. CONCLUSION This study reveals a new role of LIL3 in regulating pigment biosynthesis through interaction with the rate limiting enzyme PSY in carotenoid biosynthesis in rice presenting it as a putative target for genetic manipulation of pigment biosynthesis pathways in crop plants.
Collapse
Affiliation(s)
- Feng Yang
- Co-Innovation Center for Sustainable Forestry in Southern China, College of Biology and the Environment, Nanjing Forestry University, Nanjing, 210037, China
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, 518057, Guangdong, China
| | - Das Debatosh
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, 518057, Guangdong, China
| | - Tao Song
- Co-Innovation Center for Sustainable Forestry in Southern China, College of Biology and the Environment, Nanjing Forestry University, Nanjing, 210037, China.
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, 518057, Guangdong, China.
| | - Jian-Hua Zhang
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, 518057, Guangdong, China.
- Department of Biology, Hong Kong Baptist University and State Key Laboratory of Agrobiotechnology, The Chinese University of Hong Kong, Shatin, Hong Kong, China.
| |
Collapse
|
25
|
Lim DK, Rashid NU, Ibrahim JG. MODEL-BASED FEATURE SELECTION AND CLUSTERING OF RNA-SEQ DATA FOR UNSUPERVISED SUBTYPE DISCOVERY. Ann Appl Stat 2021; 15:481-508. [PMID: 34457104 PMCID: PMC8386505 DOI: 10.1214/20-aoas1407] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Clustering is a form of unsupervised learning that aims to uncover latent groups within data based on similarity across a set of features. A common application of this in biomedical research is in delineating novel cancer subtypes from patient gene expression data, given a set of informative genes. However, it is typically unknown a priori what genes may be informative in discriminating between clusters, and what the optimal number of clusters are. Few methods exist for performing unsupervised clustering of RNA-seq samples, and none currently adjust for between-sample global normalization factors, select cluster-discriminatory genes, or account for potential confounding variables during clustering. To address these issues, we propose the Feature Selection and Clustering of RNA-seq (FSCseq): a model-based clustering algorithm that utilizes a finite mixture of regression (FMR) model and the quadratic penalty method with a Smoothly-Clipped Absolute Deviation (SCAD) penalty. The maximization is done by a penalized Classification EM algorithm, allowing us to include normalization factors and confounders in our modeling framework. Given the fitted model, our framework allows for subtype prediction in new patients via posterior probabilities of cluster membership, even in the presence of batch effects. Based on simulations and real data analysis, we show the advantages of our method relative to competing approaches.
Collapse
Affiliation(s)
- David K Lim
- University of North Carolina at Chapel Hill, NC, USA
| | - Naim U Rashid
- University of North Carolina at Chapel Hill, NC, USA
| | | |
Collapse
|
26
|
Pabuayon ICM, Kitazumi A, Cushman KR, Singh RK, Gregorio GB, Dhatt B, Zabet-Moghaddam M, Walia H, de los Reyes BG. Novel and Transgressive Salinity Tolerance in Recombinant Inbred Lines of Rice Created by Physiological Coupling-Uncoupling and Network Rewiring Effects. FRONTIERS IN PLANT SCIENCE 2021; 12:615277. [PMID: 33708229 PMCID: PMC7940525 DOI: 10.3389/fpls.2021.615277] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 02/03/2021] [Indexed: 06/01/2023]
Abstract
The phenomenon of transgressive segregation, where a small minority of recombinants are outliers relative to the range of parental phenotypes, is commonly observed in plant breeding populations. While this phenomenon has been attributed to complementation and epistatic effects, the physiological and developmental synergism involved have not been fully illuminated by the QTL mapping approach alone, especially for stress-adaptive traits involving highly complex interactions. By systems-level profiling of the IR29 × Pokkali recombinant inbred population of rice, we addressed the hypothesis that novel salinity tolerance phenotypes are created by reconfigured physiological networks due to positive or negative coupling-uncoupling of developmental and physiological attributes of each parent. Real-time growth and hyperspectral profiling distinguished the transgressive individuals in terms of stress penalty to growth. Non-parental network signatures that led to either optimal or non-optimal integration of developmental with stress-related mechanisms were evident at the macro-physiological, biochemical, metabolic, and transcriptomic levels. Large positive net gain in super-tolerant progeny was due to ideal complementation of beneficial traits while shedding antagonistic traits. Super-sensitivity was explained by the stacking of multiple antagonistic traits and loss of major beneficial traits. The synergism uncovered by the phenomics approach in this study supports the modern views of the Omnigenic Theory, emphasizing the synergy or lack thereof between core and peripheral components. This study also supports a breeding paradigm rooted on genomic modeling from multi-dimensional genetic, physiological, and phenotypic profiles to create novel adaptive traits for new crop varieties of the 21st century.
Collapse
Affiliation(s)
- Isaiah C. M. Pabuayon
- Department of Plant and Soil Science, Texas Tech University, Lubbock, TX, United States
| | - Ai Kitazumi
- Department of Plant and Soil Science, Texas Tech University, Lubbock, TX, United States
| | - Kevin R. Cushman
- Department of Plant and Soil Science, Texas Tech University, Lubbock, TX, United States
| | | | | | - Balpreet Dhatt
- Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, NE, United States
| | - Masoud Zabet-Moghaddam
- Center for Biotechnology and Genomics, Texas Tech University, Lubbock, TX, United States
| | - Harkamal Walia
- Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, NE, United States
| | | |
Collapse
|
27
|
Abstract
Most currently available quantification tools for transcriptomics analyses have been designed for human data sets, in which full-length transcript sequences, including the untranslated regions, are well annotated. In most prokaryotic systems, full-length transcript sequences have yet to be characterized, leading to prokaryotic transcriptomics analyses being performed based on only the coding sequences. Quantification tools for RNA sequencing (RNA-Seq) analyses are often designed and tested using human transcriptomics data sets, in which full-length transcript sequences are well annotated. For prokaryotic transcriptomics experiments, full-length transcript sequences are seldom known, and coding sequences must instead be used for quantification steps in RNA-Seq analyses. However, operons confound accurate quantification of coding sequences since a single transcript does not necessarily equate to a single gene. Here, we introduce FADU (Feature Aggregate Depth Utility), a quantification tool designed specifically for prokaryotic RNA-Seq analyses. FADU assigns partial count values proportional to the length of the fragment overlapping the target feature. To assess the ability of FADU to quantify genes in prokaryotic transcriptomics analyses, we compared its performance to those of eXpress, featureCounts, HTSeq, kallisto, and Salmon across three paired-end read data sets of (i) Ehrlichia chaffeensis, (ii) Escherichia coli, and (iii) the Wolbachia endosymbiont wBm. Across each of the three data sets, we find that FADU can more accurately quantify operonic genes by deriving proportional counts for multigene fragments within operons. FADU is available at https://github.com/IGS/FADU. IMPORTANCE Most currently available quantification tools for transcriptomics analyses have been designed for human data sets, in which full-length transcript sequences, including the untranslated regions, are well annotated. In most prokaryotic systems, full-length transcript sequences have yet to be characterized, leading to prokaryotic transcriptomics analyses being performed based on only the coding sequences. In contrast to eukaryotes, prokaryotes contain polycistronic transcripts, and when genes are quantified based on coding sequences instead of transcript sequences, this leads to an increased abundance of improperly assigned ambiguous multigene fragments, specifically those mapping to multiple genes in operons. Here, we describe FADU, a quantification tool for prokaryotic RNA-Seq analyses designed to assign proportional counts with the purpose of better quantifying operonic genes while minimizing the pitfalls associated with improperly assigning fragment counts from ambiguous transcripts.
Collapse
|
28
|
Wang M, Allen GI. Integrative Generalized Convex Clustering Optimization and Feature Selection for Mixed Multi-View Data. JOURNAL OF MACHINE LEARNING RESEARCH : JMLR 2021; 22:55. [PMID: 34744522 PMCID: PMC8570363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
In mixed multi-view data, multiple sets of diverse features are measured on the same set of samples. By integrating all available data sources, we seek to discover common group structure among the samples that may be hidden in individualistic cluster analyses of a single data view. While several techniques for such integrative clustering have been explored, we propose and develop a convex formalization that enjoys strong empirical performance and inherits the mathematical properties of increasingly popular convex clustering methods. Specifically, our Integrative Generalized Convex Clustering Optimization (iGecco) method employs different convex distances, losses, or divergences for each of the different data views with a joint convex fusion penalty that leads to common groups. Additionally, integrating mixed multi-view data is often challenging when each data source is high-dimensional. To perform feature selection in such scenarios, we develop an adaptive shifted group-lasso penalty that selects features by shrinking them towards their loss-specific centers. Our so-called iGecco+ approach selects features from each data view that are best for determining the groups, often leading to improved integrative clustering. To solve our problem, we develop a new type of generalized multi-block ADMM algorithm using sub-problem approximations that more efficiently fits our model for big data sets. Through a series of numerical experiments and real data examples on text mining and genomics, we show that iGecco+ achieves superior empirical performance for high-dimensional mixed multi-view data.
Collapse
Affiliation(s)
- Minjie Wang
- Department of Statistics, Rice University, Houston, TX 77005, USA
| | - Genevera I Allen
- Departments of Electrical and Computer Engineering, Statistics, and Computer Science, Rice University, Houston, TX 77005, USA; Jan and Dan Duncan Neurological Research Institute, Baylor College of Medicine, Houston, TX 77030, USA
| |
Collapse
|
29
|
Multi-assignment clustering: Machine learning from a biological perspective. J Biotechnol 2020; 326:1-10. [PMID: 33285150 DOI: 10.1016/j.jbiotec.2020.12.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Accepted: 12/03/2020] [Indexed: 11/21/2022]
Abstract
A common approach for analyzing large-scale molecular data is to cluster objects sharing similar characteristics. This assumes that genes with highly similar expression profiles are likely participating in a common molecular process. Biological systems are extremely complex and challenging to understand, with proteins having multiple functions that sometimes need to be activated or expressed in a time-dependent manner. Thus, the strategies applied for clustering of these molecules into groups are of key importance for translation of data to biologically interpretable findings. Here we implemented a multi-assignment clustering (MAsC) approach that allows molecules to be assigned to multiple clusters, rather than single ones as in commonly used clustering techniques. When applied to high-throughput transcriptomics data, MAsC increased power of the downstream pathway analysis and allowed identification of pathways with high biological relevance to the experimental setting and the biological systems studied. Multi-assignment clustering also reduced noise in the clustering partition by excluding genes with a low correlation to all of the resulting clusters. Together, these findings suggest that our methodology facilitates translation of large-scale molecular data into biological knowledge. The method is made available as an R package on GitLab (https://gitlab.com/wolftower/masc).
Collapse
|
30
|
Subedi S, Browne RP. A family of parsimonious mixtures of multivariate Poisson‐lognormal distributions for clustering multivariate count data. Stat (Int Stat Inst) 2020. [DOI: 10.1002/sta4.310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Sanjeena Subedi
- Department of Mathematical Sciences Binghamton University Vestal 13902 New York USA
| | - Ryan P. Browne
- Department of Statistics & Actuarial Science University of Waterloo Waterloo Ontario N2L 3G1 Canada
| |
Collapse
|
31
|
Montel-Hagen A, Sun V, Casero D, Tsai S, Zampieri A, Jackson N, Li S, Lopez S, Zhu Y, Chick B, He C, de Barros SC, Seet CS, Crooks GM. In Vitro Recapitulation of Murine Thymopoiesis from Single Hematopoietic Stem Cells. Cell Rep 2020; 33:108320. [PMID: 33113379 PMCID: PMC7727762 DOI: 10.1016/j.celrep.2020.108320] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Revised: 09/01/2020] [Accepted: 10/06/2020] [Indexed: 12/16/2022] Open
Abstract
We report a serum-free, 3D murine artificial thymic organoid (M-ATO) system that mimics normal murine thymopoiesis with the production of all T cell stages, from early thymic progenitors to functional single-positive (CD8SP and CD4SP) TCRαβ and TCRγδ cells. RNA sequencing aligns M-ATO-derived populations with phenotypically identical primary thymocytes. M-ATOs initiated with Rag1-/- marrow produce the same differentiation block as seen in the endogenous thymus, and Notch signaling patterns in M-ATOs mirror primary thymopoiesis. M-ATOs initiated with defined hematopoietic stem cells (HSCs) and lymphoid progenitors from marrow and thymus generate each of the downstream differentiation stages, allowing the kinetics of T cell differentiation to be tracked. Remarkably, single HSCs deposited into each M-ATO generate the complete trajectory of T cell differentiation, producing diverse TCR repertoires across clones that largely match endogenous thymus. M-ATOs represent a highly reproducible and efficient experimental platform for the interrogation of clonal thymopoiesis from HSCs.
Collapse
Affiliation(s)
- Amélie Montel-Hagen
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles (UCLA), Los Angeles, CA, USA
| | - Victoria Sun
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles (UCLA), Los Angeles, CA, USA; Molecular Biology Interdepartmental Program, UCLA, Los Angeles, CA, USA
| | - David Casero
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles (UCLA), Los Angeles, CA, USA
| | - Steven Tsai
- Division of Hematology-Oncology, Department of Medicine, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA
| | - Alexandre Zampieri
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles (UCLA), Los Angeles, CA, USA
| | - Nicholas Jackson
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles (UCLA), Los Angeles, CA, USA
| | - Suwen Li
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles (UCLA), Los Angeles, CA, USA; Department of Molecular and Medical Pharmacology, UCLA, Los Angeles, CA, USA; Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, UCLA, Los Angeles, CA, USA
| | - Shawn Lopez
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles (UCLA), Los Angeles, CA, USA
| | - Yuhua Zhu
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles (UCLA), Los Angeles, CA, USA
| | - Brent Chick
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles (UCLA), Los Angeles, CA, USA
| | - Chongbin He
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles (UCLA), Los Angeles, CA, USA
| | - Stéphanie C de Barros
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles (UCLA), Los Angeles, CA, USA
| | - Christopher S Seet
- Division of Hematology-Oncology, Department of Medicine, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA; Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, UCLA, Los Angeles, CA, USA; Jonsson Comprehensive Cancer Center, UCLA, Los Angeles, CA, USA
| | - Gay M Crooks
- Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles (UCLA), Los Angeles, CA, USA; Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, UCLA, Los Angeles, CA, USA; Jonsson Comprehensive Cancer Center, UCLA, Los Angeles, CA, USA; Division of Pediatric Hematology-Oncology, Department of Pediatrics, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA.
| |
Collapse
|
32
|
Eghbalnia HR, Wilfinger WW, Mackey K, Chomczynski P. Coordinated analysis of exon and intron data reveals novel differential gene expression changes. Sci Rep 2020; 10:15669. [PMID: 32973253 PMCID: PMC7515875 DOI: 10.1038/s41598-020-72482-w] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2020] [Accepted: 08/24/2020] [Indexed: 12/14/2022] Open
Abstract
RNA-Seq expression analysis currently relies primarily upon exon expression data. The recognized role of introns during translation, and the presence of substantial RNA-Seq counts attributable to introns, provide the rationale for the simultaneous consideration of both exon and intron data. We describe here a method for the coordinated analysis of exon and intron data by investigating their relationship within individual genes and across samples, while taking into account changes in both variability and expression level. This coordinated analysis of exon and intron data offers strong evidence for significant differences that distinguish the profiles of the exon-only expression data from the combined exon and intron data. One advantage of our proposed method, called matched change characterization for exons and introns (MEI), is its straightforward applicability to existing archived data using small modifications to standard RNA-Seq pipelines. Using MEI, we demonstrate that when data are examined for changes in variability across control and case conditions, novel differential changes can be detected. Notably, when MEI criteria were employed in the analysis of an archived data set involving polyarthritic subjects, the number of differentially expressed genes was expanded by sevenfold. More importantly, the observed changes in exon and intron variability with statistically significant false discovery rates could be traced to specific immune pathway gene networks. The application of MEI analysis provides a strategy for incorporating the significance of exon and intron variability and further developing the role of using both exons and intron sequencing counts in studies of gene regulatory processes.
Collapse
Affiliation(s)
- Hamid R Eghbalnia
- University of Wisconsin-Madison, Madison, USA. .,University of Cincinnati, Cincinnati, USA.
| | | | - Karol Mackey
- Molecular Research Center, Inc., Cincinnati, USA
| | | |
Collapse
|
33
|
Zhou A, Xie S, Feng Y, Sun D, Liu S, Sun Z, Li M, Zhang C, Zou J. Insights Into the Albinism Mechanism for Two Distinct Color Morphs of Northern Snakehead, Channa argus Through Histological and Transcriptome Analyses. Front Genet 2020; 11:830. [PMID: 33193565 PMCID: PMC7530302 DOI: 10.3389/fgene.2020.00830] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Accepted: 07/09/2020] [Indexed: 12/20/2022] Open
Abstract
The great northern snakehead (Channa argus) is one of the most important economic and conservational fish in China. In this study, the melanocytes in the skin of two distinct color morphs C. argus were investigated and compared through employment of the microscopic analysis, hematoxylin and eosin (H&E) and Masson Fontana staining. Our results demonstrated the uneven distribution of melanocytes with extremely low density and most of them were in the state of aging or death. Meanwhile, there was no obvious pigment layer and melanocytes distribution pattern found in the albino-type (AT), while the melanocytes were evenly distributed with abundance in the bicolor-type (BT). The transcriptome analysis through Illumina HiSeq sequencing showed that a total of 34.93 Gb Clean Data was obtained, and Q30 base percentage reached 92.66%. The BT and AT northern snakeheads transcriptome data included a total of 56,039,701 and 60,410,063 clean reads (n = 3), respectively. In gene expression analyses, the sample correlation coefficients (r) were ranged between 0.92 and 1.00; the contribution of PC1 and PC2 were 50.25 and 13.73% by using PCA cluster analysis, the total number of DEGs were 1024 (559 up-regulated and 465 down-regulated), and the number of annotated DEGs was 767 (COG 172, KEGG 262, GO 288, SwissProt 548, Pfam 579 and NR 765). Additionally, 46,363 ± 873 and 44,947 ± 392 single nucleotide polymorphisms (SNPs) were compiled via genetic structure analysis, respectively. Ten key pigment-related genes were screened using qRT-PCR. And all of them revealed extremely higher expression levels in the skin of BT than those of AT. This is the first study to analyze the mechanism of albino characteristics of Channa via histology and transcriptomics, and also provide the oretical and practical support for the protection and development of germplasm resources for C. argus.
Collapse
Affiliation(s)
- Aiguo Zhou
- Joint Laboratory of Guangdong Province and Hong Kong Region on Marine Bioresource Conservation and Exploitation, College of Marine Sciences, South China Agricultural University, Guangzhou, China.,Guangdong Laboratory for Lingnan Modern Agriculture, South China Agricultural University, Guangzhou, China
| | - Shaolin Xie
- Joint Laboratory of Guangdong Province and Hong Kong Region on Marine Bioresource Conservation and Exploitation, College of Marine Sciences, South China Agricultural University, Guangzhou, China.,Guangdong Laboratory for Lingnan Modern Agriculture, South China Agricultural University, Guangzhou, China
| | - Yongyong Feng
- Joint Laboratory of Guangdong Province and Hong Kong Region on Marine Bioresource Conservation and Exploitation, College of Marine Sciences, South China Agricultural University, Guangzhou, China
| | - Di Sun
- Joint Laboratory of Guangdong Province and Hong Kong Region on Marine Bioresource Conservation and Exploitation, College of Marine Sciences, South China Agricultural University, Guangzhou, China
| | - Shulin Liu
- Joint Laboratory of Guangdong Province and Hong Kong Region on Marine Bioresource Conservation and Exploitation, College of Marine Sciences, South China Agricultural University, Guangzhou, China
| | - Zhuolin Sun
- Joint Laboratory of Guangdong Province and Hong Kong Region on Marine Bioresource Conservation and Exploitation, College of Marine Sciences, South China Agricultural University, Guangzhou, China
| | - Mingzhi Li
- Independent Researcher, Guangzhou, China
| | - Chaonan Zhang
- Joint Laboratory of Guangdong Province and Hong Kong Region on Marine Bioresource Conservation and Exploitation, College of Marine Sciences, South China Agricultural University, Guangzhou, China
| | - Jixing Zou
- Joint Laboratory of Guangdong Province and Hong Kong Region on Marine Bioresource Conservation and Exploitation, College of Marine Sciences, South China Agricultural University, Guangzhou, China.,Guangdong Laboratory for Lingnan Modern Agriculture, South China Agricultural University, Guangzhou, China
| |
Collapse
|
34
|
Sugawara S, Kanamaru Y, Sekine S, Maekawa L, Takahashi A, Yamamoto T, Watanabe K, Fujisawa T, Hattori K, Ichijo H. The mitochondrial protein PGAM5 suppresses energy consumption in brown adipocytes by repressing expression of uncoupling protein 1. J Biol Chem 2020; 295:5588-5601. [PMID: 32144202 PMCID: PMC7186182 DOI: 10.1074/jbc.ra119.011508] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Revised: 03/04/2020] [Indexed: 01/06/2023] Open
Abstract
Accumulating evidence suggests that brown adipose tissue (BAT) is a potential therapeutic target for managing obesity and related diseases. PGAM family member 5, mitochondrial serine/threonine protein phosphatase (PGAM5), is a protein phosphatase that resides in the mitochondria and regulates many biological processes, including cell death, mitophagy, and immune responses. Because BAT is a mitochondria-rich tissue, we have hypothesized that PGAM5 has a physiological function in BAT. We previously reported that PGAM5-knockout (KO) mice are resistant to severe metabolic stress. Importantly, lipid accumulation is suppressed in PGAM5-KO BAT, even under unstressed conditions, raising the possibility that PGAM5 deficiency stimulates lipid consumption. However, the mechanism underlying this observation is undetermined. Here, using an array of biochemical approaches, including quantitative RT-PCR, immunoblotting, and oxygen consumption assays, we show that PGAM5 negatively regulates energy expenditure in brown adipocytes. We found that PGAM5-KO brown adipocytes have an enhanced oxygen consumption rate and increased expression of uncoupling protein 1 (UCP1), a protein that increases energy consumption in the mitochondria. Mechanistically, we found that PGAM5 phosphatase activity and intramembrane cleavage are required for suppression of UCP1 activity. Furthermore, utilizing a genome-wide siRNA screen in HeLa cells to search for regulators of PGAM5 cleavage, we identified a set of candidate genes, including phosphatidylserine decarboxylase (PISD), which catalyzes the formation of phosphatidylethanolamine at the mitochondrial membrane. Taken together, these results indicate that PGAM5 suppresses mitochondrial energy expenditure by down-regulating UCP1 expression in brown adipocytes and that its phosphatase activity and intramembrane cleavage are required for UCP1 suppression.
Collapse
Affiliation(s)
- Sho Sugawara
- Laboratory of Cell Signaling, Graduate School of Pharmaceutical Sciences, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan
| | - Yusuke Kanamaru
- Laboratory of Cell Signaling, Graduate School of Pharmaceutical Sciences, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan
| | - Shiori Sekine
- Laboratory of Cell Signaling, Graduate School of Pharmaceutical Sciences, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan
| | - Lila Maekawa
- Laboratory of Cell Signaling, Graduate School of Pharmaceutical Sciences, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan
| | - Akinori Takahashi
- Cell Signal Unit, Okinawa Institute of Science and Technology Graduate University, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| | - Tadashi Yamamoto
- Cell Signal Unit, Okinawa Institute of Science and Technology Graduate University, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan; Laboratory for Immunogenetics, Center for Integrative Medical Sciences, RIKEN, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa 230-0045, Japan
| | - Kengo Watanabe
- Laboratory of Cell Signaling, Graduate School of Pharmaceutical Sciences, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan
| | - Takao Fujisawa
- Laboratory of Cell Signaling, Graduate School of Pharmaceutical Sciences, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan
| | - Kazuki Hattori
- Laboratory of Cell Signaling, Graduate School of Pharmaceutical Sciences, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan.
| | - Hidenori Ichijo
- Laboratory of Cell Signaling, Graduate School of Pharmaceutical Sciences, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan.
| |
Collapse
|
35
|
Erola P, Björkegren JLM, Michoel T. Model-based clustering of multi-tissue gene expression data. Bioinformatics 2020; 36:1807-1813. [PMID: 31688915 PMCID: PMC7162352 DOI: 10.1093/bioinformatics/btz805] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2018] [Revised: 09/05/2019] [Accepted: 10/31/2019] [Indexed: 02/06/2023] Open
Abstract
MOTIVATION Recently, it has become feasible to generate large-scale, multi-tissue gene expression data, where expression profiles are obtained from multiple tissues or organs sampled from dozens to hundreds of individuals. When traditional clustering methods are applied to this type of data, important information is lost, because they either require all tissues to be analyzed independently, ignoring dependencies and similarities between tissues, or to merge tissues in a single, monolithic dataset, ignoring individual characteristics of tissues. RESULTS We developed a Bayesian model-based multi-tissue clustering algorithm, revamp, which can incorporate prior information on physiological tissue similarity, and which results in a set of clusters, each consisting of a core set of genes conserved across tissues as well as differential sets of genes specific to one or more subsets of tissues. Using data from seven vascular and metabolic tissues from over 100 individuals in the STockholm Atherosclerosis Gene Expression (STAGE) study, we demonstrate that multi-tissue clusters inferred by revamp are more enriched for tissue-dependent protein-protein interactions compared to alternative approaches. We further demonstrate that revamp results in easily interpretable multi-tissue gene expression associations to key coronary artery disease processes and clinical phenotypes in the STAGE individuals. AVAILABILITY AND IMPLEMENTATION Revamp is implemented in the Lemon-Tree software, available at https://github.com/eb00/lemon-tree. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Pau Erola
- Division of Genetics and Genomics, The Roslin Institute, The University of Edinburgh, Midlothian EH25 9RG, UK
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol BS8 2BN, UK
| | - Johan L M Björkegren
- Department of Genetics and Genomic Sciences, Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Integrated Cardio Metabolic Centre (ICMC), Karolinska Institutet, Huddinge 141 57, Sweden
| | - Tom Michoel
- Division of Genetics and Genomics, The Roslin Institute, The University of Edinburgh, Midlothian EH25 9RG, UK
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen N-5020, Norway
| |
Collapse
|
36
|
Liu H, Li G, Yang X, Kuijer HN, Liang W, Zhang D. Transcriptome profiling reveals phase-specific gene expression in the developing barley inflorescence. ACTA ACUST UNITED AC 2020. [DOI: 10.1016/j.cj.2019.04.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
37
|
Jiang T, Lu Y, Duan H, Zhang W, Liu A. A model-based approach for clustering of multivariate semicontinuous data with application to dietary pattern analysis and intervention. Stat Med 2020; 39:16-25. [PMID: 31702055 DOI: 10.1002/sim.8391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2018] [Revised: 09/16/2019] [Accepted: 09/17/2019] [Indexed: 11/10/2022]
Abstract
Semicontinuous data, characterized by a sizable number of zeros and observations from a continuous distribution, are frequently encountered in health research concerning food consumptions, physical activities, medical and pharmacy claims expenditures, and many others. In analyzing such semicontinuous data, it is imperative that the excessive zeros be adequately accounted for to obtain unbiased and efficient inference. Although many methods have been proposed in the literature for the modeling and analysis of semicontinuous data, little attention has been given to clustering of semicontinuous data to identify important patterns that could be indicative of certain health outcomes or intervention effects. We propose a Bernoulli-normal mixture model for clustering of multivariate semicontinuous data and demonstrate its accuracy as compared to the well-known clustering method with the conventional normal mixture model. The proposed method is illustrated with data from a dietary intervention trial to promote healthy eating behavior among children with type 1 diabetes. In the trial, certain diabetes friendly foods (eg, total fruit, whole fruit, dark green and orange vegetables and legumes, whole grain) were only consumed by a proportion of study participants, yielding excessive zero values due to nonconsumption of the foods. Baseline foods consumptions data in the trial are used to explore preintervention dietary patterns among study participants. While the conventional normal mixture model approach fails to do so, the proposed Bernoulli-normal mixture model approach has shown to be able to identify a dietary profile that significantly differentiates the intervention effects from others, as measured by the popular healthy eating index at the end of the trial.
Collapse
Affiliation(s)
- Tao Jiang
- School of Statistics and Mathematics, Zhejiang Gongshang University, Hangzhou, China
| | - Yahui Lu
- School of Statistics and Mathematics, Zhejiang Gongshang University, Hangzhou, China
| | - Huimin Duan
- School of Statistics and Mathematics, Zhejiang Gongshang University, Hangzhou, China
| | - Wei Zhang
- Biostatistics and Bioinformatics Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, Maryland
| | - Aiyi Liu
- Biostatistics and Bioinformatics Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, Maryland
| |
Collapse
|
38
|
Lo Re O, Mazza T, Giallongo S, Sanna P, Rappa F, Vinh Luong T, Li Volti G, Drovakova A, Roskams T, Van Haele M, Tsochatzis E, Vinciguerra M. Loss of histone macroH2A1 in hepatocellular carcinoma cells promotes paracrine-mediated chemoresistance and CD4 +CD25 +FoxP3 + regulatory T cells activation. Am J Cancer Res 2020; 10:910-924. [PMID: 31903159 PMCID: PMC6929991 DOI: 10.7150/thno.35045] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2019] [Accepted: 10/06/2019] [Indexed: 12/11/2022] Open
Abstract
Rationale: Loss of histone macroH2A1 induces appearance of cancer stem cells (CSCs)-like cells in hepatocellular carcinoma (HCC). How CSCs interact with the tumor microenvironment and the adaptive immune system is unclear. Methods: We screened aggressive human HCC for macroH2A1 and CD44 CSC marker expression. We also knocked down (KD) macroH2A1 in HCC cells, and performed integrated transcriptomic and secretomic analyses. Results: Human HCC showed low macroH2A1 and high CD44 expression compared to control tissues. MacroH2A1 KD CSC-like cells transferred paracrinally their chemoresistant properties to parental HCC cells. MacroH2A1 KD conditioned media transcriptionally reprogrammed parental HCC cells activated regulatory CD4+/CD25+/FoxP3+ T cells (Tregs). Conclusions: Loss of macroH2A1 in HCC cells drives cancer stem-cell propagation and evasion from immune surveillance.
Collapse
|
39
|
Fu Y, Dong T, Tan L, Yin D, Zhang M, Zhao G, Ye M, Wu R. Identification of Shoot Differentiation-Related Genes in Populus euphratica Oliv. Genes (Basel) 2019; 10:genes10121034. [PMID: 31835855 PMCID: PMC6947848 DOI: 10.3390/genes10121034] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Revised: 11/25/2019] [Accepted: 12/09/2019] [Indexed: 12/30/2022] Open
Abstract
De novo shoot regeneration is one of the important manifestations of cell totipotency in organogenesis, which reflects a survival strategy organism evolved when facing natural selection. Compared with tissue regeneration, and somatic embryogenesis, de novo shoot regeneration denotes a shoot regeneration process directly from detatched or injured tissues of plant. Studies on plant shoot regeneration had identified key genes mediating shoot regeneration. However, knowledge was derived from Arabidopsis; the regeneration capacity is hugely distinct among species. To achieve a comprehensive understanding of the shoot regeneration mechanism from tree species, we select four genetic lines of Populus euphratica from a natural population to be sequenced at transcriptome level. On the basis of the large difference of differentiation capacity, between the highly differentiated (HD) and low differentiated (LD) groups, the analysis of differential expression identified 4920 differentially expressed genes (DEGs), which were revealed in five groups of expression patterns by clustering analysis. Enrichment showed crucial pathways involved in regulation of regeneration difference, including “plant hormone signal transduction”, “cell differentiation”, "cellular response to auxin stimulus", and “auxin-activated signaling pathway”. The expression of nine genes reported to be associated with shoot regeneration was validated using quantitative real-time PCR (qRT-PCR). For the specificity of regeneration mechanism with P. euphratica, large amount of DEGs involved in "plant-pathogen interaction", ubiquitin-26S proteosome mediated proteolysis pathway, stress-responsive DEGs, and senescence-associated DEGs were summarized to possibly account for the differentiation difference with distinct genotypes of P. euphratica. The result in this study helps screening of key regulators in mediating the shoot differentiation. The transcriptomic characteristic in P. euphratica further enhances our understanding of key processes affecting the regeneration capacity of de novo shoots among distinct species.
Collapse
Affiliation(s)
- Yaru Fu
- Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China; (Y.F.); (T.D.); (L.T.); (D.Y.); (M.Z.); (G.Z.); (R.W.)
| | - Tianyu Dong
- Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China; (Y.F.); (T.D.); (L.T.); (D.Y.); (M.Z.); (G.Z.); (R.W.)
| | - Lizhi Tan
- Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China; (Y.F.); (T.D.); (L.T.); (D.Y.); (M.Z.); (G.Z.); (R.W.)
| | - Danni Yin
- Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China; (Y.F.); (T.D.); (L.T.); (D.Y.); (M.Z.); (G.Z.); (R.W.)
| | - Miaomiao Zhang
- Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China; (Y.F.); (T.D.); (L.T.); (D.Y.); (M.Z.); (G.Z.); (R.W.)
| | - Guomiao Zhao
- Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China; (Y.F.); (T.D.); (L.T.); (D.Y.); (M.Z.); (G.Z.); (R.W.)
| | - Meixia Ye
- Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China; (Y.F.); (T.D.); (L.T.); (D.Y.); (M.Z.); (G.Z.); (R.W.)
- Correspondence: ; Tel.: +86-10-6233-7061
| | - Rongling Wu
- Center for Computational Biology, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China; (Y.F.); (T.D.); (L.T.); (D.Y.); (M.Z.); (G.Z.); (R.W.)
- Center for Statistical Genetics, The Pennsylvania State University, Hershey, PA 17033, USA
| |
Collapse
|
40
|
Silva A, Rothstein SJ, McNicholas PD, Subedi S. A multivariate Poisson-log normal mixture model for clustering transcriptome sequencing data. BMC Bioinformatics 2019; 20:394. [PMID: 31311497 PMCID: PMC6636065 DOI: 10.1186/s12859-019-2916-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2018] [Accepted: 05/28/2019] [Indexed: 12/02/2022] Open
Abstract
Background High-dimensional data of discrete and skewed nature is commonly encountered in high-throughput sequencing studies. Analyzing the network itself or the interplay between genes in this type of data continues to present many challenges. As data visualization techniques become cumbersome for higher dimensions and unconvincing when there is no clear separation between homogeneous subgroups within the data, cluster analysis provides an intuitive alternative. The aim of applying mixture model-based clustering in this context is to discover groups of co-expressed genes, which can shed light on biological functions and pathways of gene products. Results A mixture of multivariate Poisson-log normal (MPLN) model is developed for clustering of high-throughput transcriptome sequencing data. Parameter estimation is carried out using a Markov chain Monte Carlo expectation-maximization (MCMC-EM) algorithm, and information criteria are used for model selection. Conclusions The mixture of MPLN model is able to fit a wide range of correlation and overdispersion situations, and is suited for modeling multivariate count data from RNA sequencing studies. All scripts used for implementing the method can be found at https://github.com/anjalisilva/MPLNClust. Electronic supplementary material The online version of this article (10.1186/s12859-019-2916-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Anjali Silva
- Department of Mathematics and Statistics, University of Guelph, Guelph, N1G 2W1, Canada.,Department of Molecular and Cellular Biology, University of Guelph, Guelph, N1G 2W1, Ontario, Canada
| | - Steven J Rothstein
- Department of Molecular and Cellular Biology, University of Guelph, Guelph, N1G 2W1, Ontario, Canada
| | - Paul D McNicholas
- Department of Mathematics and Statistics, McMaster University, Hamilton, L8S 4K1, Ontario, Canada
| | - Sanjeena Subedi
- Department of Mathematical Sciences, Binghamton University, Binghamton, 13902, New York, USA.
| |
Collapse
|
41
|
Diéguez-Hurtado R, Kato K, Giaimo BD, Nieminen-Kelhä M, Arf H, Ferrante F, Bartkuhn M, Zimmermann T, Bixel MG, Eilken HM, Adams S, Borggrefe T, Vajkoczy P, Adams RH. Loss of the transcription factor RBPJ induces disease-promoting properties in brain pericytes. Nat Commun 2019; 10:2817. [PMID: 31249304 PMCID: PMC6597568 DOI: 10.1038/s41467-019-10643-w] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Accepted: 05/15/2019] [Indexed: 02/06/2023] Open
Abstract
Sufficient vascular supply is indispensable for brain development and function, whereas dysfunctional blood vessels are associated with human diseases such as vascular malformations, stroke or neurodegeneration. Pericytes are capillary-associated mesenchymal cells that limit vascular permeability and protect the brain by preserving blood-brain barrier integrity. Loss of pericytes has been linked to neurodegenerative changes in genetically modified mice. Here, we report that postnatal inactivation of the Rbpj gene, encoding the transcription factor RBPJ, leads to alteration of cell identity markers in brain pericytes, increases local TGFβ signalling, and triggers profound changes in endothelial behaviour. These changes, which are not mimicked by pericyte ablation, imperil vascular stability and induce the acquisition of pathological landmarks associated with cerebral cavernous malformations. In adult mice, loss of Rbpj results in bigger stroke lesions upon ischemic insult. We propose that brain pericytes can acquire deleterious properties that actively enhance vascular lesion formation and promote pathogenic processes. Pericytes are perivascular cells essential for blood-brain barrier maintenance. Here Diéguez-Hurtado et al. show that depletion of the transcription factor RBPJ in pericytes affects their molecular identity and disturbs endothelial cell behaviour, inducing the formation of vascular lesions in the brain.
Collapse
Affiliation(s)
- Rodrigo Diéguez-Hurtado
- Department of Tissue Morphogenesis, Max Planck Institute for Molecular Biomedicine, Röntgenstrasse 20, 48149, Münster, Germany.,Faculty of Medicine, University of Münster, 48149, Münster, Germany
| | - Katsuhiro Kato
- Department of Tissue Morphogenesis, Max Planck Institute for Molecular Biomedicine, Röntgenstrasse 20, 48149, Münster, Germany
| | | | - Melina Nieminen-Kelhä
- Department of Neurosurgery, Charité-Universitätsmedizin, Charitéplatz 1, Berlin, 10117, Germany.,Berlin Institute of Health, Charitéplatz 1, 10117, Berlin, Germany
| | - Hendrik Arf
- Department of Tissue Morphogenesis, Max Planck Institute for Molecular Biomedicine, Röntgenstrasse 20, 48149, Münster, Germany
| | - Francesca Ferrante
- Institute of Biochemistry, University of Giessen, Friedrichstrasse 24, 35392, Giessen, Germany
| | - Marek Bartkuhn
- Institute for Genetics, University of Giessen, Heinrich-Buff-Ring 58-62, 35392, Giessen, Germany
| | - Tobias Zimmermann
- Bioinformatics and Systems Biology, University of Giessen, Heinrich-Buff-Ring 58-62, 35392, Giessen, Germany
| | - M Gabriele Bixel
- Department of Tissue Morphogenesis, Max Planck Institute for Molecular Biomedicine, Röntgenstrasse 20, 48149, Münster, Germany
| | - Hanna M Eilken
- Department of Tissue Morphogenesis, Max Planck Institute for Molecular Biomedicine, Röntgenstrasse 20, 48149, Münster, Germany.,Bayer AG, Aprather Weg 18a, 42113, Wuppertal, Germany
| | - Susanne Adams
- Department of Tissue Morphogenesis, Max Planck Institute for Molecular Biomedicine, Röntgenstrasse 20, 48149, Münster, Germany
| | - Tilman Borggrefe
- Institute of Biochemistry, University of Giessen, Friedrichstrasse 24, 35392, Giessen, Germany
| | - Peter Vajkoczy
- Department of Neurosurgery, Charité-Universitätsmedizin, Charitéplatz 1, Berlin, 10117, Germany. .,Berlin Institute of Health, Charitéplatz 1, 10117, Berlin, Germany.
| | - Ralf H Adams
- Department of Tissue Morphogenesis, Max Planck Institute for Molecular Biomedicine, Röntgenstrasse 20, 48149, Münster, Germany. .,Faculty of Medicine, University of Münster, 48149, Münster, Germany.
| |
Collapse
|
42
|
Lema KA, Metegnier G, Quéré J, Latimier M, Youenou A, Lambert C, Fauchot J, Le Gac M. Inter- and Intra-Specific Transcriptional and Phenotypic Responses of Pseudo-nitzschia under Different Nutrient Conditions. Genome Biol Evol 2019; 11:731-747. [PMID: 30778535 PMCID: PMC6414312 DOI: 10.1093/gbe/evz030] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/01/2019] [Indexed: 12/19/2022] Open
Abstract
Untangling the functional basis of divergence between closely related species is a step toward understanding species dynamics within communities at both the evolutionary and ecological scales. We investigated cellular (i.e., growth, domoic acid production, and nutrient consumption) and molecular (transcriptomic analyses) responses to varying nutrient concentrations across several strains belonging to three species of the toxic diatom genus Pseudo-nitzschia. Three main results were obtained. First, strains from the same species displayed similar transcriptomic, but not necessarily cellular, responses to the experimental conditions. It showed the importance of considering intraspecific diversity to investigate functional divergence between species. Second, a major exception to the first finding was a strain recently isolated from the natural environment and displaying contrasting gene expression patterns related to cell motility and domoic acid production. This result illustrated the profound modifications that may occur when transferring a cell from the natural to the in vitro environment and asks for future studies to better understand the influence of culture duration and life cycle on expression patterns. Third, transcriptomic responses were more similar between the two species displaying similar ecology in situ, irrespective of the genetic distance. This was especially true for molecular responses related to TCA cycle, photosynthesis, and nitrogen metabolism. However, transcripts related to phosphate uptake were variable between species. It highlighted the importance of considering both overall genetic distance and ecological divergence to explain functional divergence between species.
Collapse
Affiliation(s)
| | - Gabriel Metegnier
- IFREMER, Dyneco Pelagos, Plouzané, France.,CNRS, Sorbonne Université, Pontificia Universidad Catolica de Chile, Universidad Austral de Chile, UMI 3614, Evolutionary Biology and Ecology of Algae, Station Biologique de Roscoff, Roscoff Cedex, France
| | | | | | | | - Christophe Lambert
- Laboratoire des Sciences de l'Environnement Marin (LEMAR), UMR 6539 CNRS UBO IRD IFREMER, Plouzané, France.,Institut Universitaire Européen de la Mer, Technopôle Brest-Iroise, Plouzané, France
| | - Juliette Fauchot
- UNICAEN, CNRS, BOREA, Normandie Univ, Caen, France.,Unité Biologie des Organismes et Ecosystèmes Aquatiques (BOREA), Muséum National d'Histoire Naturelle, Sorbonne Université, Université de Caen Normandie, Université des Antilles, CNRS, IRD, Caen, France
| | | |
Collapse
|
43
|
Whole blood transcriptomic profiles can differentiate vulnerability to chronic low back pain. PLoS One 2019; 14:e0216539. [PMID: 31095601 PMCID: PMC6522025 DOI: 10.1371/journal.pone.0216539] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Accepted: 04/23/2019] [Indexed: 01/15/2023] Open
Abstract
The mechanisms underlying the transition from acute to chronic pain remain unclear. Here, we sought to characterize the transcriptome associated with chronic low back pain as well as the transcriptome of the transition from acute to chronic low back pain. For the analysis, we compared the whole blood transcriptome of: (a) patients at the onset of low back pain who no longer had pain within 6 weeks after onset (acute) with patients who developed chronic low back pain at 6 months (chronic T5); and, (b) patients at the onset of low back pain (chronic T1) who developed chronic pain at 6 months with healthy pain-free (normal) controls. The majority of differentially expressed genes were protein coding. We illustrate a unique chronic low back pain transcriptome characterized by significant enrichment for known pain genes, extracellular matrix genes, and genes from the extended major histocompatibility complex (MHC) genomic locus. The transcriptome of the transition from acute to chronic low back pain was characterized by significant upregulation of antigen presentation pathway (MHC class I and II) genes and downregulation of mitochondrial genes associated with oxidative phosphorylation, suggesting a unique genomic signature of vulnerability to low back pain chronicity.
Collapse
|
44
|
Rau A, Maugis-Rabusseau C. Transformation and model choice for RNA-seq co-expression analysis. Brief Bioinform 2019; 19:425-436. [PMID: 28065917 DOI: 10.1093/bib/bbw128] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2016] [Indexed: 11/12/2022] Open
Abstract
Although a large number of clustering algorithms have been proposed to identify groups of co-expressed genes from microarray data, the question of if and how such methods may be applied to RNA sequencing (RNA-seq) data remains unaddressed. In this work, we investigate the use of data transformations in conjunction with Gaussian mixture models for RNA-seq co-expression analyses, as well as a penalized model selection criterion to select both an appropriate transformation and number of clusters present in the data. This approach has the advantage of accounting for per-cluster correlation structures among samples, which can be strong in RNA-seq data. In addition, it provides a rigorous statistical framework for parameter estimation, an objective assessment of data transformations and number of clusters and the possibility of performing diagnostic checks on the quality and homogeneity of the identified clusters. We analyze four varied RNA-seq data sets to illustrate the use of transformations and model selection in conjunction with Gaussian mixture models. Finally, we propose a Bioconductor package coseq (co-expression of RNA-seq data) to facilitate implementation and visualization of the recommended RNA-seq co-expression analyses.
Collapse
Affiliation(s)
- Andrea Rau
- GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350 Jouy-en-Josas, France
| | - Cathy Maugis-Rabusseau
- Institut de Mathématiques de Toulouse, INSA de Toulouse, Université de Toulouse, 31400 Toulouse, France
| |
Collapse
|
45
|
Li Q, Noel-MacDonnell JR, Koestler DC, Goode EL, Fridley BL. Subject level clustering using a negative binomial model for small transcriptomic studies. BMC Bioinformatics 2018; 19:474. [PMID: 30541426 PMCID: PMC6292049 DOI: 10.1186/s12859-018-2556-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2017] [Accepted: 12/03/2018] [Indexed: 02/06/2023] Open
Abstract
Background Unsupervised clustering represents one of the most widely applied methods in analysis of high-throughput ‘omics data. A variety of unsupervised model-based or parametric clustering methods and non-parametric clustering methods have been proposed for RNA-seq count data, most of which perform well for large samples, e.g. N ≥ 500. A common issue when analyzing limited samples of RNA-seq count data is that the data follows an over-dispersed distribution, and thus a Negative Binomial likelihood model is often used. Thus, we have developed a Negative Binomial model-based (NBMB) clustering approach for application to RNA-seq studies. Results We have developed a Negative Binomial Model-Based (NBMB) method to cluster samples using a stochastic version of the expectation-maximization algorithm. A simulation study involving various scenarios was completed to compare the performance of NBMB to Gaussian model-based or Gaussian mixture modeling (GMM). NBMB was also applied for the clustering of two RNA-seq studies; type 2 diabetes study (N = 96) and TCGA study of ovarian cancer (N = 295). Simulation results showed that NBMB outperforms GMM applied with different transformations in majority of scenarios with limited sample size. Additionally, we found that NBMB outperformed GMM for small clusters distance regardless of sample size. Increasing total number of genes with fixed proportion of differentially expressed genes does not change the outperformance of NBMB, but improves the overall performance of GMM. Analysis of type 2 diabetes and ovarian cancer tumor data with NBMB found good agreement with the reported disease subtypes and the gene expression patterns. This method is available in an R package on CRAN named NB.MClust. Conclusion Use of Negative Binomial model based clustering is advisable when clustering over dispersed RNA-seq count data. Electronic supplementary material The online version of this article (10.1186/s12859-018-2556-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Qian Li
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, 12902 Magnolia Drive, Tampa, FL, 33612, USA.,Health Informatics Institute, University of South Florida, Tampa, FL, USA
| | | | - Devin C Koestler
- Department of Biostatistics, University of Kansas Medical Center, Kansas City, KS, USA
| | - Ellen L Goode
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Brooke L Fridley
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, 12902 Magnolia Drive, Tampa, FL, 33612, USA.
| |
Collapse
|
46
|
Potential of Oryza officinalis to augment the cold tolerance genetic mechanisms of Oryza sativa by network complementation. Sci Rep 2018; 8:16346. [PMID: 30397229 PMCID: PMC6218501 DOI: 10.1038/s41598-018-34608-z] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2018] [Accepted: 10/15/2018] [Indexed: 12/22/2022] Open
Abstract
Oryza officinalis is an accessible alien donor for genetic improvement of rice. Comparison across a representative panel of Oryza species showed that the wild O. officinalis and cultivated O. sativa ssp. japonica have similar cold tolerance potentials. The possibility that either distinct or similar genetic mechanisms are involved in the low temperature responses of each species was addressed by comparing their transcriptional networks. General similarities were supported by shared transcriptomic signatures indicative of equivalent metabolic, hormonal, and defense status. However, O. officinalis has maintained an elaborate cold-responsive brassinosteroid-regulated BES1-network that appeared to have been fragmented in O. sativa. BES1-network is potentially important for integrating growth-related responses with physiological adjustments and defenses through the protection of photosynthetic machinery and maintenance of stomatal aperture, oxidative defenses, and osmotic adjustment. Equivalent physiological processes are functional in O. sativa but their genetic mechanisms are under the direct control of ABA-dependent, DREB-dependent and/or oxidative-mediated networks uncoupled to BES1. While O. officinalis and O. sativa represent long periods of speciation and domestication, their comparable cold tolerance potentials involve equivalent physiological processes but distinct genetic networks. BES1-network represents a novel attribute of O. officinalis with potential applications in diversifying or complementing other mechanisms in the cultivated germplasm.
Collapse
|
47
|
Assembling the genome of the African wild rice Oryza longistaminata by exploiting synteny in closely related Oryza species. Commun Biol 2018; 1:162. [PMID: 30320230 PMCID: PMC6173730 DOI: 10.1038/s42003-018-0171-y] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2017] [Accepted: 09/13/2018] [Indexed: 01/30/2023] Open
Abstract
The African wild rice species Oryza longistaminata has several beneficial traits compared to cultivated rice species, such as resistance to biotic stresses, clonal propagation via rhizomes, and increased biomass production. To facilitate breeding efforts and functional genomics studies, we de-novo assembled a high-quality, haploid-phased genome. Here, we present our assembly, with a total length of 351 Mb, of which 92.2% was anchored onto 12 chromosomes. We detected 34,389 genes and 38.1% of the genome consisted of repetitive content. We validated our assembly by a comparative linkage analysis and by examining well-characterized gene families. This genome assembly will be a useful resource to exploit beneficial alleles found in O. longistaminata. Our results also show that it is possible to generate a high-quality, functionally complete rice genome assembly from moderate SMRT read coverage by exploiting synteny in a closely related Oryza species. Stefan Reuscher et al. assembled the genome of an African wild rice species to facilitate breeding efforts and functional genomic studies. They used SMRT sequencing, chromosomal synteny between rice species, and a linkage map to assemble the 351 Mb genome into 12 chromosomes.
Collapse
|
48
|
A Survey of Data Mining and Deep Learning in Bioinformatics. J Med Syst 2018; 42:139. [DOI: 10.1007/s10916-018-1003-9] [Citation(s) in RCA: 81] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Accepted: 06/21/2018] [Indexed: 12/13/2022]
|
49
|
Wendt GR, Collins JNR, Pei J, Pearson MS, Bennett HM, Loukas A, Berriman M, Grishin NV, Collins JJ. Flatworm-specific transcriptional regulators promote the specification of tegumental progenitors in Schistosoma mansoni. eLife 2018; 7:e33221. [PMID: 29557781 PMCID: PMC5927768 DOI: 10.7554/elife.33221] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2017] [Accepted: 03/19/2018] [Indexed: 01/13/2023] Open
Abstract
Schistosomes infect more than 200 million people. These parasitic flatworms rely on a syncytial outer coat called the tegument to survive within the vasculature of their host. Although the tegument is pivotal for their survival, little is known about maintenance of this tissue during the decades schistosomes survive in the bloodstream. Here, we demonstrate that the tegument relies on stem cells (neoblasts) to specify fusogenic progenitors that replace tegumental cells lost to turnover. Molecular characterization of neoblasts and tegumental progenitors led to the discovery of two flatworm-specific zinc finger proteins that are essential for tegumental cell specification. These proteins are homologous to a protein essential for neoblast-driven epidermal maintenance in free-living flatworms. Therefore, we speculate that related parasites (i.e., tapeworms and flukes) employ similar strategies to control tegumental maintenance. Since parasitic flatworms infect every vertebrate species, understanding neoblast-driven tegumental maintenance could identify broad-spectrum therapeutics to fight diseases caused by these parasites.
Collapse
Affiliation(s)
- George R Wendt
- Department of PharmacologyUT Southwestern Medical CenterDallasTexas
| | - Julie NR Collins
- Department of PharmacologyUT Southwestern Medical CenterDallasTexas
| | - Jimin Pei
- Department of BiophysicsUT Southwestern Medical CenterDallasTexas
- Howard Hughes Medical InstituteUT Southwestern Medical CenterDallasTexas
| | - Mark S Pearson
- Center for Biodiscovery and Molecular Development of TherapeuticsAustralian Institute of Tropical Health and Medicine, James Cook UniversityCairnsAustralia
| | - Hayley M Bennett
- Wellcome Sanger InstituteWellcome Genome CampusHinxtonUnited Kingdom
| | - Alex Loukas
- Center for Biodiscovery and Molecular Development of TherapeuticsAustralian Institute of Tropical Health and Medicine, James Cook UniversityCairnsAustralia
| | - Matthew Berriman
- Wellcome Sanger InstituteWellcome Genome CampusHinxtonUnited Kingdom
| | - Nick V Grishin
- Department of BiophysicsUT Southwestern Medical CenterDallasTexas
- Howard Hughes Medical InstituteUT Southwestern Medical CenterDallasTexas
| | - James J Collins
- Department of PharmacologyUT Southwestern Medical CenterDallasTexas
| |
Collapse
|
50
|
Comprehensive transcriptional landscape of porcine cardiac and skeletal muscles reveals differences of aging. Oncotarget 2017; 9:1524-1541. [PMID: 29416711 PMCID: PMC5788579 DOI: 10.18632/oncotarget.23290] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2017] [Accepted: 12/08/2017] [Indexed: 12/11/2022] Open
Abstract
Aging significantly affects the cardiac muscle (CM) and skeletal muscles (SM). Since the aging process of CM and SM may be different, high throughput RNA sequencing was performed using CM and SM in different age conditions to evaluate the expression profiles of messenger RNA (mRNA), long non-coding RNA (lncRNA), micro RNA (miRNA), and circular (circRNA). Several mRNAs, lncRNAs, and miRNAs were highly expressed and consistently appeared in both ages in one of the two muscle tissues. Gene ontology (GO) annotation described that these genes were required for maintaining normal biological functions of CM and SM tissues. Furthermore, 26 mRNAs, 4 lncRNAs, 22 miRNAs, and 26 circRNAs were differentially expressed during cardiac muscle aging. Moreover, 81 mRNAs, 5 lncRNAs, 79 miRNAs, and 62 circRNAs were differentially expressed during aging of skeletal muscle. When comparing the expression profiles of CM and SM during aging, the senescence process in CM and SM was found to be fundamentally different. In addition, we assessed multi-group cooperative control relationships and constructed circRNA-miRNA-mRNA co-expression networks in muscular aging. In conclusion, our findings will contribute to the understanding of muscular aging and provide a foundation for future studies on the molecular mechanisms underlying muscular aging.
Collapse
|