1
|
Robert Kolar M, Mitra D, Kobzarenko V. Efficient discovery of frequently co-occurring mutations in a sequence database with matrix factorization. PLoS Comput Biol 2025; 21:e1012391. [PMID: 40273414 DOI: 10.1371/journal.pcbi.1012391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2024] [Accepted: 04/08/2025] [Indexed: 04/26/2025] Open
Abstract
We have developed a robust method for efficiently tracking multiple co-occurring mutations in a sequence database. Evolution often hinges on the interaction of several mutations to produce significant phenotypic changes that lead to the proliferation of a variant. However, identifying numerous simultaneous mutations across a vast database of sequences poses a significant computational challenge. Our approach leverages a matrix factorization technique to automatically and efficiently pinpoint subsets of positions where co-mutations occur, appearing in a substantial number of sequences within the database. We validated our method using SARS-CoV-2 receptor-binding domains, comprising approximately seven hundred thousand sequences of the Spike protein, demonstrating superior performance compared to a reasonably exhaustive brute-force method. Furthermore, we explore the biological significance of the identified co-mutational positions (CMPs) and their potential impact on the virus's evolution and functionality, identifying key mutations in Delta and Omicron variants. This analysis underscores the significant role of identified CMPs in understanding the evolutionary trajectory. By tracking the "birth" and "death" of CMPs, we can elucidate the persistence and impact of specific groups of mutations across different viral strains, providing valuable insights into the virus' adaptability and thus, possibly aiding vaccine design strategies.
Collapse
Affiliation(s)
- Michael Robert Kolar
- BiC Lab, Department of Electrical Engineering and Computer Science, Florida Institute of Technology, Melbourne, Florida, United States of America
| | - Debasis Mitra
- BiC Lab, Department of Electrical Engineering and Computer Science, Florida Institute of Technology, Melbourne, Florida, United States of America
| | - Valerie Kobzarenko
- BiC Lab, Department of Electrical Engineering and Computer Science, Florida Institute of Technology, Melbourne, Florida, United States of America
| |
Collapse
|
2
|
Fang J, Yu S, Wang W, Liu C, Lv X, Jin J, Han X, Zhou F, Wang Y. A tumor-infiltrating B lymphocytes -related index based on machine-learning predicts prognosis and immunotherapy response in lung adenocarcinoma. Front Immunol 2025; 16:1524120. [PMID: 40196113 PMCID: PMC11973313 DOI: 10.3389/fimmu.2025.1524120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2024] [Accepted: 03/03/2025] [Indexed: 04/09/2025] Open
Abstract
Introduction Tumor-infiltrating B lymphocytes (TILBs) play a pivotal role in shaping the immune microenvironment of tumors (TIME) and in the progression of lung adenocarcinoma (LUAD). However, there remains a scarcity of research that has thoroughly and systematically delineated the characteristics of TILBs in LUAD. Method The research employed single-cell RNA sequencing from the GSE117570 dataset to identify markers linked to TILBs. A comprehensive machine learning approach, utilizing ten distinct algorithms, facilitated the creation of a TILB-related index (BRI) across the TCGA, GSE31210, and GSE72094 datasets. We used multiple algorithms to evaluate the relationships between BRI and TIME, as well as immune therapy-related biomarkers. Additionally, we assessed the role of BRI in predicting immune therapy response in two datasets, GSE91061 and GSE126044. Result BRI functioned as an independent risk determinant in LUAD, demonstrating a robust and reliable capacity to predict overall survival rates. We observed significant differences in the scores of B cells, M2 macrophages, NK cells, and regulatory T cells between the high and low BRI score groups. Notably, BRI was found to inversely correlate with cytotoxic CD8+ T-cell infiltration (r = -0.43, p < 0.001) and positively correlate with regulatory T cells (r = 0.31, p = 0.008). We also found that patients with lower BRI were more likely to respond to immunotherapy and were associated with reduced IC50 values for standard chemotherapy and targeted therapy drugs, in contrast to higher BRI. Additionally, the BRI-based survival prediction nomogram demonstrated significant promise for clinical application in predicting the 1-, 3-, and 5-year overall survival rates among LUAD patients. Discussion Our study developed a BRI model using ten different algorithms and 101 algorithm combinations. The BRI could be a valuable tool for risk stratification, prognosis, and selection of treatment approaches.
Collapse
Affiliation(s)
- Jiale Fang
- Department of Pharmacology, Southern University of Science and Technology, Shenzhen, China
| | - Siyuan Yu
- Department of Pharmacology, Southern University of Science and Technology, Shenzhen, China
| | - Wei Wang
- Department of Pharmacology, Air Force Medical University, Xi’an, China
- Department of Pharmacy, Southern University of Science and Technology Hospital, Shenzhen, China
| | - Cheng Liu
- Department of Pharmacology, Southern University of Science and Technology, Shenzhen, China
| | - Xiaojia Lv
- Department of Pharmacology, Southern University of Science and Technology, Shenzhen, China
| | - Jiaqi Jin
- Department of Pharmacology, Southern University of Science and Technology, Shenzhen, China
| | - Xiaomin Han
- Department of Pharmacology, Southern University of Science and Technology, Shenzhen, China
- Department of Pharmacy, Southern University of Science and Technology Hospital, Shenzhen, China
- Department of Basic Medicine and Law, Baotou Medical College, Baotou, China
| | - Fang Zhou
- Department of Pharmacy, Southern University of Science and Technology Hospital, Shenzhen, China
| | - Yukun Wang
- Department of Pharmacology, Southern University of Science and Technology, Shenzhen, China
| |
Collapse
|
3
|
Tran P, Mishra P, Williams L, Moskalenko R, Sharma S, Nilsson A, Watt D, Andersson P, Bergh A, Pursell Z, Chabes A. Altered dNTP pools accelerate tumor formation in mice. Nucleic Acids Res 2024; 52:12475-12486. [PMID: 39360631 PMCID: PMC11551754 DOI: 10.1093/nar/gkae843] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2024] [Revised: 09/10/2024] [Accepted: 09/17/2024] [Indexed: 10/04/2024] Open
Abstract
Alterations in deoxyribonucleoside triphosphate (dNTP) pools have been linked to increased mutation rates and genome instability in unicellular organisms and cell cultures. However, the role of dNTP pool changes in tumor development in mammals remains unclear. In this study, we present a mouse model with a point mutation at the allosteric specificity site of ribonucleotide reductase, RRM1-Y285A. This mutation reduced ribonucleotide reductase activity, impairing the synthesis of deoxyadenosine triphosphate (dATP) and deoxyguanosine triphosphate (dGTP). Heterozygous Rrm1+/Y285A mice exhibited distinct alterations in dNTP pools across various organs, shorter lifespans and earlier tumor onset compared with wild-type controls. Mutational spectrum analysis of tumors revealed two distinct signatures, one resembling a signature extracted from a human cancer harboring a mutation of the same amino acid residue in ribonucleotide reductase, RRM1Y285C. Our findings suggest that mutations in enzymes involved in dNTP metabolism can serve as drivers of cancer development.
Collapse
Affiliation(s)
- Phong Tran
- Department of Medical Biochemistry and Biophysics, Umeå University, Linnaeus väg 6, Umeå, SE 90736, Sweden
| | - Pradeep Mishra
- Department of Medical Biochemistry and Biophysics, Umeå University, Linnaeus väg 6, Umeå, SE 90736, Sweden
| | - Leonard G Williams
- Department of Biochemistry and Molecular Biology, Tulane University School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112, USA
- Tulane Cancer Center, Tulane University School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112, USA
| | - Roman Moskalenko
- Department of Pathology, Sumy State University, Kharkivska st. 116, Sumy 40007, Ukraine
| | - Sushma Sharma
- Department of Medical Biochemistry and Biophysics, Umeå University, Linnaeus väg 6, Umeå, SE 90736, Sweden
| | - Anna Karin Nilsson
- Department of Medical Biochemistry and Biophysics, Umeå University, Linnaeus väg 6, Umeå, SE 90736, Sweden
| | - Danielle L Watt
- Department of Medical Biochemistry and Biophysics, Umeå University, Linnaeus väg 6, Umeå, SE 90736, Sweden
- School of Medicine and School of Dental Medicine, UConn Health, 300 UConn Health Blvd, Farmington, CT 06030, USA
| | - Pernilla Andersson
- Pathology Unit, Department of Medical Biosciences, Umeå University, Daniel Naezéns väg 6M, Umeå, SE 90737, Sweden
| | - Anders Bergh
- Pathology Unit, Department of Medical Biosciences, Umeå University, Daniel Naezéns väg 6M, Umeå, SE 90737, Sweden
| | - Zachary F Pursell
- Department of Biochemistry and Molecular Biology, Tulane University School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112, USA
- Tulane Cancer Center, Tulane University School of Medicine, 1430 Tulane Ave, New Orleans, LA 70112, USA
| | - Andrei Chabes
- Department of Medical Biochemistry and Biophysics, Umeå University, Linnaeus väg 6, Umeå, SE 90736, Sweden
| |
Collapse
|
4
|
Schmitt JE, Alexander-Bloch A, Seidlitz J, Raznahan A, Neale MC. The genetics of spatiotemporal variation in cortical thickness in youth. Commun Biol 2024; 7:1301. [PMID: 39390064 PMCID: PMC11467331 DOI: 10.1038/s42003-024-06956-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 09/24/2024] [Indexed: 10/12/2024] Open
Abstract
Prior studies have shown strong genetic effects on cortical thickness (CT), structural covariance, and neurodevelopmental trajectories in childhood and adolescence. However, the importance of genetic factors on the induction of spatiotemporal variation during neurodevelopment remains poorly understood. Here, we explore the genetics of maturational coupling by examining 308 MRI-derived regional CT measures in a longitudinal sample of 677 twins and family members. We find dynamic inter-regional genetic covariation in youth, with the emergence of regional subnetworks in late childhood and early adolescence. Three critical neurodevelopmental epochs in genetically-mediated maturational coupling were identified, with dramatic network strengthening near eleven years of age. These changes are associated with statistically-significant (empirical p-value <0.0001) increases in network strength as measured by average clustering coefficient and assortativity. We then identify genes from the Allen Human Brain Atlas with similar co-expression patterns to genetically-mediated structural covariation in children. This set was enriched for genes involved in potassium transport and dendrite formation. Genetically-mediated CT-CT covariance was also strongly correlated with expression patterns for genes located in cells of neuronal origin.
Collapse
Affiliation(s)
- J Eric Schmitt
- Departments of Psychiatry and Radiology, Division of Neuroradiology, Brain Behavior Laboratory, Hospital of the University of Pennsylvania, Philadelphia, PA, USA.
| | - Aaron Alexander-Bloch
- Department of Psychiatry, CHOP-Penn Brain-Gene-Development Laboratory, Hospital of the University of Pennsylvania, Philadelphia, PA, USA
| | - Jakob Seidlitz
- Department of Psychiatry, CHOP-Penn Brain-Gene-Development Laboratory, Hospital of the University of Pennsylvania, Philadelphia, PA, USA
| | - Armin Raznahan
- Developmental Neurogenomics Unit, National Institutes of Mental Health, Building 10, Room 4C110, 10 Center Drive, Bethesda, MD, USA
| | - Michael C Neale
- Departments of Psychiatry and Human and Molecular Genetics, Virginia Institute for Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Richmond, VA, USA
| |
Collapse
|
5
|
Graber JH, Hoskinson D, Liu H, Kaczmarek Michaels K, Benson PS, Maki NJ, Wilson CL, McGrath C, Puleo F, Pearson E, Kuehner JN, Moore C. Mutations in yeast Pcf11, a conserved protein essential for mRNA 3' end processing and transcription termination, elicit the Environmental Stress Response. Genetics 2024; 226:iyad199. [PMID: 37967370 PMCID: PMC10847720 DOI: 10.1093/genetics/iyad199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 10/28/2023] [Accepted: 10/31/2023] [Indexed: 11/17/2023] Open
Abstract
The Pcf11 protein is an essential subunit of the large complex that cleaves and polyadenylates eukaryotic mRNA precursor. It has also been functionally linked to gene-looping, termination of RNA Polymerase II (Pol II) transcripts, and mRNA export. We have examined a poorly characterized but conserved domain (amino acids 142-225) of the Saccharomyces cerevisiae Pcf11 and found that while it is not needed for mRNA 3' end processing or termination downstream of the poly(A) sites of protein-coding genes, its presence improves the interaction with Pol II and the use of transcription terminators near gene promoters. Analysis of genome-wide Pol II occupancy in cells with Pcf11 missing this region, as well as Pcf11 mutated in the Pol II CTD Interacting Domain, indicates that systematic changes in mRNA expression are mediated primarily at the level of transcription. Global expression analysis also shows that a general stress response, involving both activation and suppression of specific gene sets known to be regulated in response to a wide variety of stresses, is induced in the two pcf11 mutants, even though cells are grown in optimal conditions. The mutants also cause an unbalanced expression of cell wall-related genes that does not activate the Cell Wall Integrity pathway but is associated with strong caffeine sensitivity. Based on these findings, we propose that Pcf11 can modulate the expression level of specific functional groups of genes in ways that do not involve its well-characterized role in mRNA 3' end processing.
Collapse
Affiliation(s)
- Joel H Graber
- Mount Desert Island Biological Laboratory, Bar Harbor, ME 04609, USA
| | - Derick Hoskinson
- Department of Development, Molecular, and Chemical Biology and School of Graduate Biomedical Science, Tufts University School of Medicine, Boston, MA 02111, USA
| | - Huiyun Liu
- Department of Development, Molecular, and Chemical Biology and School of Graduate Biomedical Science, Tufts University School of Medicine, Boston, MA 02111, USA
| | - Katarzyna Kaczmarek Michaels
- Department of Development, Molecular, and Chemical Biology and School of Graduate Biomedical Science, Tufts University School of Medicine, Boston, MA 02111, USA
| | - Peter S Benson
- Mount Desert Island Biological Laboratory, Bar Harbor, ME 04609, USA
| | - Nathaniel J Maki
- Mount Desert Island Biological Laboratory, Bar Harbor, ME 04609, USA
| | | | - Caleb McGrath
- Department of Biology, Emmanuel College, Boston, MA 02115, USA
| | - Franco Puleo
- Department of Development, Molecular, and Chemical Biology and School of Graduate Biomedical Science, Tufts University School of Medicine, Boston, MA 02111, USA
| | - Erika Pearson
- Department of Development, Molecular, and Chemical Biology and School of Graduate Biomedical Science, Tufts University School of Medicine, Boston, MA 02111, USA
| | - Jason N Kuehner
- Department of Biology, Emmanuel College, Boston, MA 02115, USA
| | - Claire Moore
- Department of Development, Molecular, and Chemical Biology and School of Graduate Biomedical Science, Tufts University School of Medicine, Boston, MA 02111, USA
| |
Collapse
|
6
|
Tsuyuzaki K, Ishii M, Nikaido I. Sctensor detects many-to-many cell-cell interactions from single cell RNA-sequencing data. BMC Bioinformatics 2023; 24:420. [PMID: 37936079 PMCID: PMC10631077 DOI: 10.1186/s12859-023-05490-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 09/21/2023] [Indexed: 11/09/2023] Open
Abstract
BACKGROUND Complex biological systems are described as a multitude of cell-cell interactions (CCIs). Recent single-cell RNA-sequencing studies focus on CCIs based on ligand-receptor (L-R) gene co-expression but the analytical methods are not appropriate to detect many-to-many CCIs. RESULTS In this work, we propose scTensor, a novel method for extracting representative triadic relationships (or hypergraphs), which include ligand-expression, receptor-expression, and related L-R pairs. CONCLUSIONS Through extensive studies with simulated and empirical datasets, we have shown that scTensor can detect some hypergraphs that cannot be detected using conventional CCI detection methods, especially when they include many-to-many relationships. scTensor is implemented as a freely available R/Bioconductor package.
Collapse
Affiliation(s)
- Koki Tsuyuzaki
- Laboratory for Bioinformatics Research RIKEN Center for Biosystems Dynamics Research, 2-1 Hirosawa, Wako, Saitama, 351-0198, Japan.
- Japan Science and Technology Agency, PRESTO, 7 Gobancho, Chiyoda-ku, Tokyo, 102-0076, Japan.
| | - Manabu Ishii
- Laboratory for Bioinformatics Research RIKEN Center for Biosystems Dynamics Research, 2-1 Hirosawa, Wako, Saitama, 351-0198, Japan
| | - Itoshi Nikaido
- Laboratory for Bioinformatics Research RIKEN Center for Biosystems Dynamics Research, 2-1 Hirosawa, Wako, Saitama, 351-0198, Japan.
- Department of Functional Genome Informatics, Division of Biological Data Science, Medical Research Institute, Tokyo Medical and Dental University (TMDU), 1-5-45 Yushima, Bunkyo-ku, Tokyo, 113-8510, Japan.
| |
Collapse
|
7
|
Nikumbh S, Lenhard B. Identifying promoter sequence architectures via a chunking-based algorithm using non-negative matrix factorisation. PLoS Comput Biol 2023; 19:e1011491. [PMID: 37983292 PMCID: PMC10695386 DOI: 10.1371/journal.pcbi.1011491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 12/04/2023] [Accepted: 09/05/2023] [Indexed: 11/22/2023] Open
Abstract
Core promoters are stretches of DNA at the beginning of genes that contain information that facilitates the binding of transcription initiation complexes. Different functional subsets of genes have core promoters with distinct architectures and characteristic motifs. Some of these motifs inform the selection of transcription start sites (TSS). By discovering motifs with fixed distances from known TSS positions, we could in principle classify promoters into different functional groups. Due to the variability and overlap of architectures, promoter classification is a difficult task that requires new approaches. In this study, we present a new method based on non-negative matrix factorisation (NMF) and the associated software called seqArchR that clusters promoter sequences based on their motifs at near-fixed distances from a reference point, such as TSS. When combined with experimental data from CAGE, seqArchR can efficiently identify TSS-directing motifs, including known ones like TATA, DPE, and nucleosome positioning signal, as well as novel lineage-specific motifs and the function of genes associated with them. By using seqArchR on developmental time courses, we reveal how relative use of promoter architectures changes over time with stage-specific expression. seqArchR is a powerful tool for initial genome-wide classification and functional characterisation of promoters. Its use cases are more general: it can also be used to discover any motifs at near-fixed distances from a reference point, even if they are present in only a small subset of sequences.
Collapse
Affiliation(s)
- Sarvesh Nikumbh
- Computational Regulatory Genomics, MRC London Institute of Medical Sciences, London, United Kingdom
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, United Kingdom
| | - Boris Lenhard
- Computational Regulatory Genomics, MRC London Institute of Medical Sciences, London, United Kingdom
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, United Kingdom
| |
Collapse
|
8
|
Li Z, Guo M, Lin W, Huang P. Machine Learning-Based Integration Develops a Macrophage-Related Index for Predicting Prognosis and Immunotherapy Response in Lung Adenocarcinoma. Arch Med Res 2023; 54:102897. [PMID: 37865004 DOI: 10.1016/j.arcmed.2023.102897] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 08/06/2023] [Accepted: 10/06/2023] [Indexed: 10/23/2023]
Abstract
BACKGROUND Macrophages play a critical role in tumor immune microenvironment (TIME) formation and cancer progression in lung adenocarcinoma (LUAD). However, few studies have comprehensively and systematically described the characteristics of macrophages in LUAD. METHODS This study identified macrophage-related markers with single-cell RNA sequencing data from the GSE189487 dataset. An integrative machine learning-based procedure based on 10 algorithms was developed to construct a macrophage-related index (MRI) in The Cancer Genome Atlas (TCGA), GSE30219, GSE31210, and GSE72094 datasets. Several algorithms were used to evaluate the associations of MRI with TIME and immunotherapy-related biomarkers. The role of MRI in predicting the immunotherapy response was evaluated with the GSE91061 dataset. RESULTS The optimal MRI constructed by the combination of the Lasso algorithm and plsRCox was an independent risk factor in LUAD and showed a stable and powerful performance in predicting the overall survival rate of patients with LUAD. Those with low MRI scores had a higher TIME score, a higher level of immune cells, a higher immunophenoscore, and a lower Tumor Immune Dysfunction and Exclusion (TIDE) score, indicating a better response to immunotherapy. The IC50 value of common drugs for chemotherapy and target therapy with low MRI scores was higher compared to high MRI scores. Moreover, the survival prediction nomogram, developed from MRI, had good potential for clinical application in predicting the 1-, 3-, and 5-year overall survival rate of LUAD. CONCLUSION Our study constructed for the first time a consensus MRI for LUAD with 10 machine learning algorithms. The MRI could be helpful for risk stratification, prognosis, and selection of treatment approach in LUAD.
Collapse
Affiliation(s)
- Zuwei Li
- Department of Thoracic Surgery and Institute of Thoracic Oncology, West China Hospital, Sichuan University, Chengdu, China
| | - Minzhang Guo
- Department of Thoracic Surgery and Institute of Thoracic Oncology, West China Hospital, Sichuan University, Chengdu, China
| | - Wanli Lin
- Department of Thoracic Surgery, Gaozhou People's Hospital, Maoming, China
| | - Peiyuan Huang
- Department of Pharmacy, Gaozhou People's Hospital, Maoming, China.
| |
Collapse
|
9
|
Takeuchi F, Liang YQ, Shimizu-Furusawa H, Isono M, Ang MY, Mori K, Mori T, Kakazu E, Yoshio S, Kato N. Gene-regulation modules in nonalcoholic fatty liver disease revealed by single-nucleus ATAC-seq. Life Sci Alliance 2023; 6:e202301988. [PMID: 37491046 PMCID: PMC10368228 DOI: 10.26508/lsa.202301988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Revised: 07/14/2023] [Accepted: 07/14/2023] [Indexed: 07/27/2023] Open
Abstract
We investigated the progression of nonalcoholic fatty liver disease from fatty liver to steatohepatitis using single-nucleus and bulk ATAC-seq on the livers of rats fed a high-fat diet (HFD). Rats fed HFD for 4 wk developed fatty liver, and those fed HFD for 8 wk further progressed to steatohepatitis. We observed an increase in the proportion of inflammatory macrophages, consistent with the pathological progression. Utilizing machine learning, we divided global gene regulation into modules, wherein transcription factors within a module could regulate genes within the same module, reaffirming known regulatory relationships between transcription factors and biological processes. We identified core genes-central to co-expression and protein-protein interaction-for the biological processes discovered. Notably, a large part of the core genes overlapped with genes previously implicated in nonalcoholic fatty liver disease. Single-nucleus ATAC-seq, combined with data-driven statistical analysis, offers insight into in vivo global gene regulation as a combination of modules and assists in identifying core genes of relevant biological processes.
Collapse
Affiliation(s)
- Fumihiko Takeuchi
- Department of Gene Diagnostics and Therapeutics, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
- Medical Genomics Center, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
- Systems Genomics Laboratory, Baker Heart and Diabetes Institute, Melbourne, Australia
| | - Yi-Qiang Liang
- Department of Gene Diagnostics and Therapeutics, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
| | - Hana Shimizu-Furusawa
- Department of Hygiene and Public Health, School of Medicine, Teikyo University, Tokyo, Japan
| | - Masato Isono
- Department of Gene Diagnostics and Therapeutics, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
| | - Mia Yang Ang
- Department of Gene Diagnostics and Therapeutics, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
- Department of Clinical Genome Informatics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| | - Kotaro Mori
- Medical Genomics Center, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
| | - Taizo Mori
- Department of Liver Diseases, The Research Center for Hepatitis and Immunology, National Center for Global Health and Medicine, Chiba, Japan
| | - Eiji Kakazu
- Department of Liver Diseases, The Research Center for Hepatitis and Immunology, National Center for Global Health and Medicine, Chiba, Japan
| | - Sachiyo Yoshio
- Department of Liver Diseases, The Research Center for Hepatitis and Immunology, National Center for Global Health and Medicine, Chiba, Japan
| | - Norihiro Kato
- Department of Gene Diagnostics and Therapeutics, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
- Medical Genomics Center, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan
- Department of Clinical Genome Informatics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
10
|
Cui Z, Wang SG, He Y, Chen ZH, Zhang QH. DeepTPpred: A Deep Learning Approach With Matrix Factorization for Predicting Therapeutic Peptides by Integrating Length Information. IEEE J Biomed Health Inform 2023; 27:4611-4622. [PMID: 37368803 DOI: 10.1109/jbhi.2023.3290014] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2023]
Abstract
The abuse of traditional antibiotics has led to increased resistance of bacteria and viruses. Efficient therapeutic peptide prediction is critical for peptide drug discovery. However, most of the existing methods only make effective predictions for one class of therapeutic peptides. It is worth noting that currently no predictive method considers sequence length information as a distinct feature of therapeutic peptides. In this article, a novel deep learning approach with matrix factorization for predicting therapeutic peptides (DeepTPpred) by integrating length information are proposed. The matrix factorization layer can learn the potential features of the encoded sequence through the mechanism of first compression and then restoration. And the length features of the sequence of therapeutic peptides are embedded with encoded amino acid sequences. To automatically learn therapeutic peptide predictions, these latent features are input into the neural networks with self-attention mechanism. On eight therapeutic peptide datasets, DeepTPpred achieved excellent prediction results. Based on these datasets, we first integrated eight datasets to obtain a full therapeutic peptide integration dataset. Then, we obtained two functional integration datasets based on the functional similarity of the peptides. Finally, we also conduct experiments on the latest versions of the ACP and CPP datasets. Overall, the experimental results show that our work is effective for the identification of therapeutic peptides.
Collapse
|
11
|
Doi I, Deng W, Ikegami T. Spontaneous and information-induced bursting activities in honeybee hives. Sci Rep 2023; 13:11015. [PMID: 37419944 PMCID: PMC10329038 DOI: 10.1038/s41598-023-37785-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 06/27/2023] [Indexed: 07/09/2023] Open
Abstract
Social entrainment is important for functioning of beehive organization. By analyzing a dataset of approximately 1000 honeybees (Apis mellifera) tracked in 5 trials, we discovered that honeybees exhibit synchronized activity (bursting behavior) in their locomotion. These bursts occurred spontaneously, potentially as a result of intrinsic bee interactions. The empirical data and simulations demonstrate that physical contact is one of the mechanisms for these bursts. We found that a subset of honeybees within a hive which become active before the peak of each burst, and we refer to these bees as "pioneer bees." Pioneer bees are not selected randomly, but rather, are linked to foraging behavior and waggle dancing, which may help spread external information in the hive. By using transfer entropy, we found that information flows from pioneer bees to non-pioneer bees, which suggest that the bursting behavior is caused by foraging behavior and spreading the information through the hive and promoting integrated group behavior among individuals.
Collapse
Affiliation(s)
- Itsuki Doi
- Graduate School of Arts and Sciences, University of Tokyo, 3-8-1 Komaba, Meguro-ku, Tokyo, 153-8902, Japan.
| | - Weibing Deng
- Key Laboratory of Quark and Lepton Physics (MOE) and Institute of Particle Physics, Central China Normal University, Wuhan, 430079, China
| | - Takashi Ikegami
- Graduate School of Arts and Sciences, University of Tokyo, 3-8-1 Komaba, Meguro-ku, Tokyo, 153-8902, Japan
| |
Collapse
|
12
|
Guven E. Decision of the Optimal Rank of a Nonnegative Matrix Factorization Model for Gene Expression Data Sets Utilizing the Unit Invariant Knee Method: Development and Evaluation of the Elbow Method for Rank Selection. JMIR BIOINFORMATICS AND BIOTECHNOLOGY 2023; 4:e43665. [PMID: 38935969 PMCID: PMC11135234 DOI: 10.2196/43665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 02/05/2023] [Accepted: 04/28/2023] [Indexed: 06/29/2024]
Abstract
BACKGROUND There is a great need to develop a computational approach to analyze and exploit the information contained in gene expression data. The recent utilization of nonnegative matrix factorization (NMF) in computational biology has demonstrated the capability to derive essential details from a high amount of data in particular gene expression microarrays. A common problem in NMF is finding the proper number rank (r) of factors of the degraded demonstration, but no agreement exists on which technique is most appropriate to utilize for this purpose. Thus, various techniques have been suggested to select the optimal value of rank factorization (r). OBJECTIVE In this work, a new metric for rank selection is proposed based on the elbow method, which was methodically compared against the cophenetic metric. METHODS To decide the optimum number rank (r), this study focused on the unit invariant knee (UIK) method of the NMF on gene expression data sets. Since the UIK method requires an extremum distance estimator that is eventually employed for inflection and identification of a knee point, the proposed method finds the first inflection point of the curvature of the residual sum of squares of the proposed algorithms using the UIK method on gene expression data sets as a target matrix. RESULTS Computation was conducted for the UIK task using gene expression data of acute lymphoblastic leukemia and acute myeloid leukemia samples. Consequently, the distinct results of NMF were subjected to comparison on different algorithms. The proposed UIK method is easy to perform, fast, free of a priori rank value input, and does not require initial parameters that significantly influence the model's functionality. CONCLUSIONS This study demonstrates that the elbow method provides a credible prediction for both gene expression data and for precisely estimating simulated mutational processes data with known dimensions. The proposed UIK method is faster than conventional methods, including metrics utilizing the consensus matrix as a criterion for rank selection, while achieving significantly better computational efficiency without visual inspection on the curvatives. Finally, the suggested rank tuning method based on the elbow method for gene expression data is arguably theoretically superior to the cophenetic measure.
Collapse
Affiliation(s)
- Emine Guven
- Department of Biomedical Engineering, Düzce University, Düzce, Turkey
| |
Collapse
|
13
|
Shi X, Qiu X, Li A, Jiang X, Wei G, Zheng Y, Chen Q, Chen S, Hu M, Rudich Y, Zhu T. Polar Nitrated Aromatic Compounds in Urban Fine Particulate Matter: A Focus on Formation via an Aqueous-Phase Radical Mechanism. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:5160-5168. [PMID: 36940425 DOI: 10.1021/acs.est.2c07324] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Polar nitrated aromatic compounds (pNACs) are key ambient brown carbon chromophores; however, their formation mechanisms, especially in the aqueous phase, remain unclear. We developed an advanced technique for pNACs and measured 1764 compounds in atmospheric fine particulate matter sampled in urban Beijing, China. Molecular formulas were derived for 433 compounds, of which 17 were confirmed using reference standards. Potential novel species with up to four aromatic rings and a maximum of five functional groups were found. Higher concentrations were detected in the heating season, with a median of 82.6 ng m-3 for Σ17pNACs. Non-negative matrix factorization analysis indicated that primary emissions particularly coal combustion were dominant in the heating season. While in the non-heating season, aqueous-phase nitration could generate abundant pNACs with the carboxyl group, which was confirmed by their significant association with the aerosol liquid water content. Aqueous-phase formation of 3- and 5-nitrosalicylic acids instead of their isomer of 4-hydroxy-3-nitrobenzoic acid suggests the existence of an intermediate where the intramolecular hydrogen bond favors kinetics-controlled NO2• nitration. This study provides not only a promising technique for the pNAC measurement but also evidence for their atmospheric aqueous-phase formation, facilitating further evaluation of pNACs' climatic effects.
Collapse
Affiliation(s)
- Xiaodi Shi
- State Key Joint Laboratory for Environmental Simulation and Pollution Control, and College of Environmental Sciences and Engineering, Peking University, Beijing 100871, P. R. China
| | - Xinghua Qiu
- State Key Joint Laboratory for Environmental Simulation and Pollution Control, and College of Environmental Sciences and Engineering, Peking University, Beijing 100871, P. R. China
| | - Ailin Li
- State Key Joint Laboratory for Environmental Simulation and Pollution Control, and College of Environmental Sciences and Engineering, Peking University, Beijing 100871, P. R. China
| | - Xing Jiang
- State Key Joint Laboratory for Environmental Simulation and Pollution Control, and College of Environmental Sciences and Engineering, Peking University, Beijing 100871, P. R. China
| | - Gaoyuan Wei
- State Key Joint Laboratory for Environmental Simulation and Pollution Control, and College of Environmental Sciences and Engineering, Peking University, Beijing 100871, P. R. China
| | - Yan Zheng
- State Key Joint Laboratory for Environmental Simulation and Pollution Control, and College of Environmental Sciences and Engineering, Peking University, Beijing 100871, P. R. China
| | - Qi Chen
- State Key Joint Laboratory for Environmental Simulation and Pollution Control, and College of Environmental Sciences and Engineering, Peking University, Beijing 100871, P. R. China
| | - Shiyi Chen
- State Key Joint Laboratory for Environmental Simulation and Pollution Control, and College of Environmental Sciences and Engineering, Peking University, Beijing 100871, P. R. China
| | - Min Hu
- State Key Joint Laboratory for Environmental Simulation and Pollution Control, and College of Environmental Sciences and Engineering, Peking University, Beijing 100871, P. R. China
| | - Yinon Rudich
- Department of Earth and Planetary Sciences, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Tong Zhu
- State Key Joint Laboratory for Environmental Simulation and Pollution Control, and College of Environmental Sciences and Engineering, Peking University, Beijing 100871, P. R. China
| |
Collapse
|
14
|
An R, Tong Z, Liu X, Tan B, Xiong Q, Pang H, Liu Y, Xu G. Post COVID-19 pandemic recovery of intracity human mobility in Wuhan: Spatiotemporal characteristic and driving mechanism. TRAVEL BEHAVIOUR & SOCIETY 2023; 31:37-48. [PMID: 36405767 PMCID: PMC9650583 DOI: 10.1016/j.tbs.2022.11.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/12/2021] [Revised: 09/27/2022] [Accepted: 11/07/2022] [Indexed: 06/16/2023]
Abstract
After successfully inhibiting the first wave of COVID-19 transmission through a city lockdown, Wuhan implemented a series of policies to gradually lift restrictions and restore daily activities. Existing studies mainly focus on the intercity recovery under a macroscopic view. How does the intracity mobility return to normal? Is the recovery process consistent among different subareas, and what factor affects the post-pandemic recovery? To answer these questions, we sorted out policies adopted during the Wuhan resumption, and collected the long-time mobility big data in 1105 traffic analysis zones (TAZs) to construct an observation matrix (A). We then used the nonnegative matrix factorization (NMF) method to approximate A as the product of two condensed matrices (WH). The column vectors of W matrix were visualized as five typical recovery curves to reveal the temporal change. The row vectors of H matrix were visualized to identify the spatial distribution of each recovery type, and were analyzed with variables of population, GDP, land use, and key facility to explain the recovery driving mechanisms. We found that the "staggered time" policies implemented in Wuhan effectively staggered the peak mobility of several recovery types ("staggered peak"). Besides, different TAZs had heterogeneous response intensities to these policies ("staggered area") which were closely related to land uses and key facilities. The creative policies taken by Wuhan highlight the wisdom of public health crisis management, and could provide an empirical reference for the adjustment of post-pandemic intervention measures in other cities.
Collapse
Affiliation(s)
- Rui An
- School of Resource and Environmental Sciences, Wuhan University, 129 Luoyu Road, Wuhan 430079, PR China
| | - Zhaomin Tong
- School of Resource and Environmental Sciences, Wuhan University, 129 Luoyu Road, Wuhan 430079, PR China
| | - Xiaoyan Liu
- Institute of Disaster Risk Science, Faculty of Geographical Science, Beijing Normal University, Beijing 100875, PR China
| | - Bo Tan
- Wuhan Geomatics Institute, 209 Wansongyuan Road, Wuhan 430022, PR China
| | - Qiangqiang Xiong
- School of Resource and Environmental Sciences, Wuhan University, 129 Luoyu Road, Wuhan 430079, PR China
| | - Huixin Pang
- School of Resource and Environmental Sciences, Wuhan University, 129 Luoyu Road, Wuhan 430079, PR China
| | - Yaolin Liu
- School of Resource and Environmental Sciences, Wuhan University, 129 Luoyu Road, Wuhan 430079, PR China
| | - Gang Xu
- School of Resource and Environmental Sciences, Wuhan University, 129 Luoyu Road, Wuhan 430079, PR China
| |
Collapse
|
15
|
Robbe P, Ridout KE, Vavoulis DV, Dréau H, Kinnersley B, Denny N, Chubb D, Appleby N, Cutts A, Cornish AJ, Lopez-Pascua L, Clifford R, Burns A, Stamatopoulos B, Cabes M, Alsolami R, Antoniou P, Oates M, Cavalieri D, Gibson J, Prabhu AV, Schwessinger R, Jennings D, James T, Maheswari U, Duran-Ferrer M, Carninci P, Knight SJL, Månsson R, Hughes J, Davies J, Ross M, Bentley D, Strefford JC, Devereux S, Pettitt AR, Hillmen P, Caulfield MJ, Houlston RS, Martín-Subero JI, Schuh A. Whole-genome sequencing of chronic lymphocytic leukemia identifies subgroups with distinct biological and clinical features. Nat Genet 2022; 54:1675-1689. [PMID: 36333502 PMCID: PMC9649442 DOI: 10.1038/s41588-022-01211-y] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Accepted: 09/16/2022] [Indexed: 11/06/2022]
Abstract
The value of genome-wide over targeted driver analyses for predicting clinical outcomes of cancer patients is debated. Here, we report the whole-genome sequencing of 485 chronic lymphocytic leukemia patients enrolled in clinical trials as part of the United Kingdom's 100,000 Genomes Project. We identify an extended catalog of recurrent coding and noncoding genetic mutations that represents a source for future studies and provide the most complete high-resolution map of structural variants, copy number changes and global genome features including telomere length, mutational signatures and genomic complexity. We demonstrate the relationship of these features with clinical outcome and show that integration of 186 distinct recurrent genomic alterations defines five genomic subgroups that associate with response to therapy, refining conventional outcome prediction. While requiring independent validation, our findings highlight the potential of whole-genome sequencing to inform future risk stratification in chronic lymphocytic leukemia.
Collapse
Affiliation(s)
- Pauline Robbe
- Department of Oncology, University of Oxford, Oxford, UK
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Kate E Ridout
- Department of Oncology, University of Oxford, Oxford, UK
| | | | - Helene Dréau
- Department of Oncology, University of Oxford, Oxford, UK
| | - Ben Kinnersley
- Division of Genetics and Epidemiology, The Institute of Cancer Research, Sutton, UK
| | - Nicholas Denny
- Department of Medicine, Medical Research Council Molecular Haematology Unit, Medical Research Council Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Daniel Chubb
- Division of Genetics and Epidemiology, The Institute of Cancer Research, Sutton, UK
| | - Niamh Appleby
- Department of Oncology, University of Oxford, Oxford, UK
| | - Anthony Cutts
- Department of Oncology, University of Oxford, Oxford, UK
| | - Alex J Cornish
- Division of Genetics and Epidemiology, The Institute of Cancer Research, Sutton, UK
| | | | - Ruth Clifford
- Department of Haematology, University Hospital Limerick, Limerick, Ireland
- Limerick Digital Cancer Research Centre, School of Medicine,University of Limerick, Limerick, Ireland
| | - Adam Burns
- Department of Oncology, University of Oxford, Oxford, UK
| | - Basile Stamatopoulos
- Laboratory of Clinical Cell Therapy, Jules Bordet Institute, ULB Cancer Research Center (U-CRC)- Université Libre de Bruxelles (ULB), Brussels, Belgium
| | - Maite Cabes
- Oxford Molecular Diagnostics Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Trust, Oxford, UK
| | - Reem Alsolami
- Department of Medical Laboratory Technology, King Abdulaziz University, Jeddah, Saudi Arabia
| | | | | | - Doriane Cavalieri
- Department of Haematology, CHU de Clermont-Ferrand, Clermont-Ferrand, France
| | - Jane Gibson
- Cancer Sciences, Faculty of Medicine, University of Southampton, Southampton, UK
| | - Anika V Prabhu
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Ron Schwessinger
- Department of Medicine, Medical Research Council Molecular Haematology Unit, Medical Research Council Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Daisy Jennings
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | | | | | - Martí Duran-Ferrer
- Biomedical Epigenomics Group, Institut d'Investigacions Biomédiques August Pi i Sunyer (IDIBAPS), University of Barcelona, Barcelona, Spain
| | - Piero Carninci
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Human Technopole, Milan, Italy
| | - Samantha J L Knight
- Oxford University Clinical Academic Graduate School, University of Oxford Medical Sciences Division, University of Oxford, John Radcliffe Hospital, Oxford, UK
| | - Robert Månsson
- Center for Hematology and Regenerative Medicine Huddinge, Karolinska Institute, Stockholm, Sweden
| | - Jim Hughes
- Department of Medicine, Medical Research Council Molecular Haematology Unit, Medical Research Council Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - James Davies
- Department of Medicine, Medical Research Council Molecular Haematology Unit, Medical Research Council Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Mark Ross
- Illumina Cambridge Ltd., Cambridge, UK
| | | | - Jonathan C Strefford
- Cancer Genomics, Cancer Sciences, Faculty of Medicine, Group University of Southampton, Southampton, UK
| | - Stephen Devereux
- King's College Hospital, NHS Foundation Trust, London, UK
- Kings College London, London, UK
| | - Andrew R Pettitt
- Department of Molecular and Clinical Cancer Medicine, University of Liverpool, Liverpool, UK
- Clatterbridge Cancer Centre NHS Foundation Trust, Liverpool, UK
| | | | - Mark J Caulfield
- Genomics England, London, UK
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | - Richard S Houlston
- Division of Genetics and Epidemiology, The Institute of Cancer Research, Sutton, UK
| | - José I Martín-Subero
- Human Technopole, Milan, Italy
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
| | - Anna Schuh
- Department of Oncology, University of Oxford, Oxford, UK.
| |
Collapse
|
16
|
Liu Q, Hsu CY, Shyr Y. Scalable and model-free detection of spatial patterns and colocalization. Genome Res 2022; 32:1736-1745. [PMID: 36223499 PMCID: PMC9528978 DOI: 10.1101/gr.276851.122] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 08/16/2022] [Indexed: 11/24/2022]
Abstract
The expeditious growth in spatial omics technologies enables the profiling of genome-wide molecular events at molecular and single-cell resolution, highlighting a need for fast and reliable methods to characterize spatial patterns. We developed SpaGene, a model-free method to discover spatial patterns rapidly in large-scale spatial omics studies. Analyzing simulation and a variety of spatially resolved transcriptomics data showed that SpaGene is more powerful and scalable than existing methods. Spatial expression patterns identified by SpaGene reconstruct unobserved tissue structures. SpaGene also successfully discovers ligand-receptor interactions through their colocalization.
Collapse
Affiliation(s)
- Qi Liu
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee 37232, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, Tennessee 37232, USA
| | - Chih-Yuan Hsu
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee 37232, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, Tennessee 37232, USA
| | - Yu Shyr
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee 37232, USA
- Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, Tennessee 37232, USA
| |
Collapse
|
17
|
Abstract
Grouping patients into subtypes with homogeneous molecular features can guide diagnosis and therapeutic interventions. SUMO is a computational pipeline that uses nonnegative matrix factorization of patient-similarity networks to integrate continuous multi-omic datasets for molecular subtyping of a disease. Here, we present a detailed protocol to demonstrate its use in determining subtypes of lower-grade gliomas by integrating gene expression, DNA methylation, and miRNA expression data from the TCGA-LGG cohort. For complete details on the use and execution of this profile, please refer to Sienkiewicz et al. (2022).
Collapse
Affiliation(s)
- Karolina Sienkiewicz
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA 22908, USA
| | - Aakrosh Ratan
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA 22908, USA
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA 22908, USA
- University of Virginia Cancer Center, Charlottesville, VA 22908, USA
| |
Collapse
|
18
|
Rowland B, Huh R, Hou Z, Crowley C, Wen J, Shen Y, Hu M, Giusti-Rodríguez P, Sullivan PF, Li Y. THUNDER: A reference-free deconvolution method to infer cell type proportions from bulk Hi-C data. PLoS Genet 2022; 18:e1010102. [PMID: 35259165 PMCID: PMC8932604 DOI: 10.1371/journal.pgen.1010102] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Revised: 03/18/2022] [Accepted: 02/14/2022] [Indexed: 11/30/2022] Open
Abstract
Hi-C data provide population averaged estimates of three-dimensional chromatin contacts across cell types and states in bulk samples. Effective analysis of Hi-C data entails controlling for the potential confounding factor of differential cell type proportions across heterogeneous bulk samples. We propose a novel unsupervised deconvolution method for inferring cell type composition from bulk Hi-C data, the Two-step Hi-c UNsupervised DEconvolution appRoach (THUNDER). We conducted extensive simulations to test THUNDER based on combining two published single-cell Hi-C (scHi-C) datasets. THUNDER more accurately estimates the underlying cell type proportions compared to reference-free methods (e.g., TOAST, and NMF) and is more robust than reference-dependent methods (e.g. MuSiC). We further demonstrate the practical utility of THUNDER to estimate cell type proportions and identify cell-type-specific interactions in Hi-C data from adult human cortex tissue samples. THUNDER will be a useful tool in adjusting for varying cell type composition in population samples, facilitating valid and more powerful downstream analysis such as differential chromatin organization studies. Additionally, THUNDER estimated contact profiles provide a useful exploratory framework to investigate cell-type-specificity of the chromatin interactome while experimental data is still rare.
Collapse
Affiliation(s)
- Bryce Rowland
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Ruth Huh
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Zoey Hou
- Department of Engineering Sciences and Applied Mathematics, Northwestern University, Evanston, Illinois, United States of America
| | - Cheynna Crowley
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Jia Wen
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Yin Shen
- Institute for Human Genetics, University of California San Francisco, San Francisco, California, United States of America
- Department of Neurology, University of California San Francisco, San Francisco, California, United States of America
| | - Ming Hu
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, Ohio, United States of America
| | - Paola Giusti-Rodríguez
- Department of Psychiatry, University of Florida College of Medicine, Gainesville, Florida, United States of America
| | - Patrick F. Sullivan
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Department of Psychiatry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Yun Li
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| |
Collapse
|
19
|
Ye L, Chen Y, Xu H, Wang Z, Li H, Qi J, Wang J, Yao J, Liu J, Song B. Radiomics of Contrast-Enhanced Computed Tomography: A Potential Biomarker for Pretreatment Prediction of the Response to Bacillus Calmette-Guerin Immunotherapy in Non-Muscle-Invasive Bladder Cancer. Front Cell Dev Biol 2022; 10:814388. [PMID: 35281100 PMCID: PMC8914064 DOI: 10.3389/fcell.2022.814388] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2021] [Accepted: 01/10/2022] [Indexed: 11/13/2022] Open
Abstract
Background:Bacillus Calmette-Guerin (BCG) instillation is recommended postoperatively after transurethral resection of bladder cancer (TURBT) in patients with high-risk non-muscle-invasive bladder cancer (NMIBC). An accurate prediction model for the BCG response can help identify patients with NMIBC who may benefit from alternative therapy.Objective: To investigate the value of computed tomography (CT) radiomics features in predicting the response to BCG instillation among patients with primary high-risk NMIBC.Methods: Patients with pathologically confirmed high-risk NMIBC were retrospectively reviewed. Patients who underwent contrast-enhanced CT examination within one to 2 weeks before TURBT and received ≥5 BCG instillation treatments in two independent hospitals were enrolled. Patients with a routine follow-up of at least 1 year at the outpatient department were included in the final cohort. Radiomics features based on CT images were extracted from the tumor and its periphery in the training cohort, and a radiomics signature was built with recursive feature elimination. Selected features further underwent an unsupervised radiomics analysis using the newly introduced method, non-negative matrix factorization (NMF), to compute factor factorization decompositions of the radiomics matrix. Finally, a robust component, which was most associated with BCG failure in 1 year, was selected. The performance of the selected component was assessed and tested in an external validation cohort.Results: Overall, 128 patients (training cohort, n = 104; external validation cohort, n = 24) were included, including 12 BCG failures in the training cohort and 11 failures in the validation cohort each. NMF revealed five components, of which component 3 was selected for the best discrimination of BCG failure; it had an area under the curve (AUC) of .79, sensitivity of .79, and specificity of .65 in the training set. In the external validation cohort, it achieved an AUC of .68, sensitivity of .73, and specificity of .69. Survival analysis showed that patients with higher component scores had poor recurrence-free survival (RFS) in both cohorts (C-index: training cohort, .69; validation cohort, .68).Conclusion: The study suggested that radiomics components based on NMF might be a potential biomarker to predict BCG response and RFS after BCG treatment in patients with high-risk NMIBC.
Collapse
Affiliation(s)
- Lei Ye
- Department of Radiology, West China Hospital, Sichuan University, Chengdu, China
| | - Yuntian Chen
- Department of Radiology, West China Hospital, Sichuan University, Chengdu, China
| | - Hui Xu
- Department of Radiology, West China Hospital, Sichuan University, Chengdu, China
| | - Zhaoxiang Wang
- Department of Urology, Institute of Urology, West China Hospital, Sichuan University, Chengdu, China
| | | | - Jin Qi
- University of Electronic Science and Technology of China, Chengdu, China
| | - Jing Wang
- University of Electronic Science and Technology of China, Chengdu, China
| | - Jin Yao
- Department of Radiology, West China Hospital, Sichuan University, Chengdu, China
- *Correspondence: Jin Yao, ; Jiaming Liu,
| | - Jiaming Liu
- Department of Urology, Institute of Urology, West China Hospital, Sichuan University, Chengdu, China
- *Correspondence: Jin Yao, ; Jiaming Liu,
| | - Bin Song
- Department of Radiology, West China Hospital, Sichuan University, Chengdu, China
| |
Collapse
|
20
|
Ichinomiya T. Topological data analysis gives two folding paths in HP35(nle-nle), double mutant of villin headpiece subdomain. Sci Rep 2022; 12:2719. [PMID: 35177744 PMCID: PMC8854739 DOI: 10.1038/s41598-022-06682-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Accepted: 02/04/2022] [Indexed: 11/16/2022] Open
Abstract
The folding dynamics of proteins is a primary area of interest in protein science. We carried out topological data analysis (TDA) of the folding process of HP35(nle-nle), a double-mutant of the villin headpiece subdomain. Using persistent homology and non-negative matrix factorization, we reduced the dimension of protein structure and investigated the flow in the reduced space. We found this protein has two folding paths, distinguished by the pairings of inter-helix residues. Our analysis showed the excellent performance of TDA in capturing the formation of tertiary structure.
Collapse
Affiliation(s)
- Takashi Ichinomiya
- Department of Systems Biology, Gifu University School of Medicine, Yanagido 1-1, Gifu, 501-1194, Japan. .,The United Graduate School of Drug Discovery and Medical Information Sciences of Gifu University, Yanagido 1-1, Gifu, 501-1194, Japan.
| |
Collapse
|
21
|
Sienkiewicz K, Chen J, Chatrath A, Lawson JT, Sheffield NC, Zhang L, Ratan A. Detecting molecular subtypes from multi-omics datasets using SUMO. CELL REPORTS METHODS 2022; 2:100152. [PMID: 35211690 PMCID: PMC8865426 DOI: 10.1016/j.crmeth.2021.100152] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Revised: 08/27/2021] [Accepted: 12/21/2021] [Indexed: 12/31/2022]
Abstract
We present a data integration framework that uses non-negative matrix factorization of patient-similarity networks to integrate continuous multi-omics datasets for molecular subtyping. It is demonstrated to have the capability to handle missing data without using imputation and to be consistently among the best in detecting subtypes with differential prognosis and enrichment of clinical associations in a large number of cancers. When applying the approach to data from individuals with lower-grade gliomas, we identify a subtype with a significantly worse prognosis. Tumors assigned to this subtype are hypomethylated genome wide with a gain of AP-1 occupancy in demethylated distal enhancers. The tumors are also enriched for somatic chromosome 7 (chr7) gain, chr10 loss, and other molecular events that have been suggested as diagnostic markers for "IDH wild type, with molecular features of glioblastoma" by the cIMPACT-NOW consortium but have yet to be included in the World Health Organization (WHO) guidelines.
Collapse
Affiliation(s)
- Karolina Sienkiewicz
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA 22908, USA
| | - Jinyu Chen
- Department of Mathematics and Computational Biology Program, National University of Singapore, Singapore 119076, Singapore
| | - Ajay Chatrath
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA 22908, USA
| | - John T. Lawson
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA 22908, USA
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA
| | - Nathan C. Sheffield
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA 22908, USA
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA 22908, USA
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA 22908, USA
- University of Virginia Cancer Center, Charlottesville, VA 22908, USA
| | - Louxin Zhang
- Department of Mathematics and Computational Biology Program, National University of Singapore, Singapore 119076, Singapore
| | - Aakrosh Ratan
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA 22908, USA
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA 22908, USA
- University of Virginia Cancer Center, Charlottesville, VA 22908, USA
| |
Collapse
|
22
|
Bilkey N, Li H, Borodinov N, Ievlev AV, Ovchinnikova OS, Dixit R, Foston M. Correlated mechanochemical maps of Arabidopsis thaliana primary cell walls using atomic force microscope infrared spectroscopy. QUANTITATIVE PLANT BIOLOGY 2022; 3:e31. [PMID: 37077971 PMCID: PMC10095902 DOI: 10.1017/qpb.2022.20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Revised: 09/11/2022] [Accepted: 10/07/2022] [Indexed: 05/03/2023]
Abstract
Spatial heterogeneity in composition and organisation of the primary cell wall affects the mechanics of cellular morphogenesis. However, directly correlating cell wall composition, organisation and mechanics has been challenging. To overcome this barrier, we applied atomic force microscopy coupled with infrared (AFM-IR) spectroscopy to generate spatially correlated maps of chemical and mechanical properties for paraformaldehyde-fixed, intact Arabidopsis thaliana epidermal cell walls. AFM-IR spectra were deconvoluted by non-negative matrix factorisation (NMF) into a linear combination of IR spectral factors representing sets of chemical groups comprising different cell wall components. This approach enables quantification of chemical composition from IR spectral signatures and visualisation of chemical heterogeneity at nanometer resolution. Cross-correlation analysis of the spatial distribution of NMFs and mechanical properties suggests that the carbohydrate composition of cell wall junctions correlates with increased local stiffness. Together, our work establishes new methodology to use AFM-IR for the mechanochemical analysis of intact plant primary cell walls.
Collapse
Affiliation(s)
- Natasha Bilkey
- Department of Biology, Center for Engineering Mechanobiology, Washington University in St. Louis, St. Louis, Missouri63130, USA
| | - Huiyong Li
- Department of Energy, Environmental and Chemical Engineering, Center for Engineering Mechanobiology, Washington University in St. Louis, St. Louis, Missouri63130, USA
| | - Nikolay Borodinov
- Center for Nanophase Materials Science, Oak Ridge National Laboratory, Oak Ridge, Tennessee37831, USA
| | - Anton V. Ievlev
- Center for Nanophase Materials Science, Oak Ridge National Laboratory, Oak Ridge, Tennessee37831, USA
| | - Olga S. Ovchinnikova
- Center for Nanophase Materials Science, Oak Ridge National Laboratory, Oak Ridge, Tennessee37831, USA
| | - Ram Dixit
- Department of Biology, Center for Engineering Mechanobiology, Washington University in St. Louis, St. Louis, Missouri63130, USA
| | - Marcus Foston
- Department of Energy, Environmental and Chemical Engineering, Center for Engineering Mechanobiology, Washington University in St. Louis, St. Louis, Missouri63130, USA
- Author for correspondence: M. Foston, E-mail:
| |
Collapse
|
23
|
Maspero D, Angaroni F, Porro D, Piazza R, Graudenzi A, Ramazzotti D. VirMutSig: Discovery and assignment of viral mutational signatures from sequencing data. STAR Protoc 2021. [DOI: 10.1016/j.xpro.2021.100911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
|
24
|
Unsupervised methods in LC-MS data treatment: Application for potential chemotaxonomic markers search. J Pharm Biomed Anal 2021; 206:114382. [PMID: 34597842 DOI: 10.1016/j.jpba.2021.114382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2021] [Revised: 09/12/2021] [Accepted: 09/15/2021] [Indexed: 11/20/2022]
Abstract
The combination of Liquid Chromatography and Mass Spectrometry (LC-MS) is commonly used to determine and characterize biologically active compounds because of its high resolution and sensitivity. In this work we explore the interpretation of LC-MS data using multivariate statistical analysis algorithms to extract useful chemical information and identify clusters of similar samples. Samples of leaves from 19 plants belonging to the Apiaceae family were analyzed in unified LC conditions by high- and low-resolution mass spectrometry in a wide range scan mode. LC-MS data preprocessing was performed followed by statistical analysis using tensor decomposition in the form of Parallel Factor Analysis (PARAFAC); matrix factorization following tensor unfolding with principal component analysis (PCA), independent component analysis (ICA), non-negative matrix factorization (NMF); or unsupervised feature selection (UFS). The optimal number of components for each of these methods were found and results were compared using four different metrics: silhouette score, Davies-Bouldin index, computational time, number of noisy components. It was found that PCA, ICA and UFS give the best results across the majority of the criteria for both low- and high-resolution data. An algorithm for biomarker signal selection is suggested and 23 potential chemotaxonomic markers were tentatively identified using MS2 data. Dendrograms constructed by the methods were compared to the molecular phylogenic tree by calculating pixel-wise mean square error (MSE). Therefore, the suggested approach can support chemotaxonomic studies and yield valuable chemical information for biomarker discovery.
Collapse
|
25
|
Hunter MV, Moncada R, Weiss JM, Yanai I, White RM. Spatially resolved transcriptomics reveals the architecture of the tumor-microenvironment interface. Nat Commun 2021; 12:6278. [PMID: 34725363 PMCID: PMC8560802 DOI: 10.1038/s41467-021-26614-z] [Citation(s) in RCA: 136] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Accepted: 10/06/2021] [Indexed: 12/24/2022] Open
Abstract
During tumor progression, cancer cells come into contact with various non-tumor cell types, but it is unclear how tumors adapt to these new environments. Here, we integrate spatially resolved transcriptomics, single-cell RNA-seq, and single-nucleus RNA-seq to characterize tumor-microenvironment interactions at the tumor boundary. Using a zebrafish model of melanoma, we identify a distinct "interface" cell state where the tumor contacts neighboring tissues. This interface is composed of specialized tumor and microenvironment cells that upregulate a common set of cilia genes, and cilia proteins are enriched only where the tumor contacts the microenvironment. Cilia gene expression is regulated by ETS-family transcription factors, which normally act to suppress cilia genes outside of the interface. A cilia-enriched interface is conserved in human patient samples, suggesting it is a conserved feature of human melanoma. Our results demonstrate the power of spatially resolved transcriptomics in uncovering mechanisms that allow tumors to adapt to new environments.
Collapse
Affiliation(s)
- Miranda V Hunter
- Cancer Biology and Genetics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Reuben Moncada
- Institute for Computational Medicine, NYU Langone Health, New York, NY, USA
| | - Joshua M Weiss
- Cancer Biology and Genetics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Weill Cornell/Rockefeller/Sloan Kettering Tri-Institutional MD-PhD Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Itai Yanai
- Institute for Computational Medicine, NYU Langone Health, New York, NY, USA.
| | - Richard M White
- Cancer Biology and Genetics, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
| |
Collapse
|
26
|
Prasanna A, Niranjan V. MutVis: Automated framework for analysis and visualization of mutational signatures in pathogenic bacterial strains. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2021; 91:104805. [PMID: 33689914 DOI: 10.1016/j.meegid.2021.104805] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2020] [Revised: 12/17/2020] [Accepted: 03/04/2021] [Indexed: 12/01/2022]
Abstract
In recent years, mutational signature analysis has become a routine practice in cancer genomics for classification and diagnosis. Characterizing mutational signatures across species or within genomes of a bacteria helps in understanding their evolution and adaptation. However, an integrated framework for analysis and visualization of mutational signatures in bacterial genome is lacking. Hence, we aim to develop an integrated, automated, open-source and user-friendly framework called MutVis to analyze mutational signatures from bacterial whole genome next generation sequencing data. The current framework integrates various publicly available packages using Snakemake workflow management software, Python and R scripting. MutVis supports variant calling, transition (Ti) and transversion (Tv) graphical representation, generation of mutational count matrix, graphical visualization of base-pair substitution spectrum (BPSs) and mutation signatures extraction. TvTi plots provide the 6 base substitution classification for both genome and gene level. Further resolution of base pair substitution classification is provided as 96-profile BPSs plot. Mutation signatures is derived based on the characteristic pattern observed in BPSs using non-negative matrix factorization. Relative contribution of signatures is given as hierarchically clustered heatmap. This provides information on active signatures in the individual given sample and classify samples according to signature contributions. We demonstrated the MutVis framework using geographically different strains of Mycobacterium tuberculosis, downloaded from PATRIC TB-ARC Antibiotic Resistance Catalog (n = 963). The current framework can be used to study mutation biases and characteristic mutational signatures in bacterial genomes and is freely available at https://github.com/AkshathaPrasanna/MutVis.
Collapse
Affiliation(s)
- Akshatha Prasanna
- Department of Biotechnology, RV College of Engineering, Bengaluru, Karnataka 560059, India
| | - Vidya Niranjan
- Department of Biotechnology, RV College of Engineering, Bengaluru, Karnataka 560059, India..
| |
Collapse
|
27
|
Sommerkamp P, Cabezas-Wallscheid N, Trumpp A. Alternative Polyadenylation in Stem Cell Self-Renewal and Differentiation. Trends Mol Med 2021; 27:660-672. [PMID: 33985920 DOI: 10.1016/j.molmed.2021.04.006] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2021] [Revised: 04/15/2021] [Accepted: 04/19/2021] [Indexed: 12/13/2022]
Abstract
Cellular function is shaped by transcriptional and post-transcriptional mechanisms, including alternative polyadenylation (APA). By directly controlling 3'- untranslated region (UTR) length and the selection of the last exon, APA regulates up to 70% of all cellular transcripts influencing RNA stability, output, and protein isoform expression. Cell-state-dependent 3'-UTR shortening has been identified as a hallmark of cellular proliferation. Hence, quiescent/dormant stem cells are characterized by long 3'-UTRs, whereas proliferative stem/progenitor cells exhibit 3'-UTR shortening. Here, the latest studies analyzing the role of APA in regulating stem cell state, self-renewal, differentiation, and metabolism are reviewed. The new role of APA in controlling stem cell fate opens novel potential therapeutic avenues in the field of regenerative medicine.
Collapse
Affiliation(s)
- Pia Sommerkamp
- Division of Stem Cells and Cancer, German Cancer Research Center (DKFZ) and DKFZ-ZMBH Alliance, 69120 Heidelberg, Germany; Heidelberg Institute for Stem Cell Technology and Experimental Medicine (HI-STEM gGmbH), 69120 Heidelberg, Germany
| | | | - Andreas Trumpp
- Division of Stem Cells and Cancer, German Cancer Research Center (DKFZ) and DKFZ-ZMBH Alliance, 69120 Heidelberg, Germany; Heidelberg Institute for Stem Cell Technology and Experimental Medicine (HI-STEM gGmbH), 69120 Heidelberg, Germany; Faculty of Biosciences, Heidelberg University, 69117 Heidelberg, Germany; German Cancer Consortium (DKTK), 69120 Heidelberg, Germany.
| |
Collapse
|
28
|
Das KP, Hapuwitharana JC, Fowler J, Young SS. Non-negative matrix factorization: A useful method for two-way life tables. JOURNAL OF STATISTICS & MANAGEMENT SYSTEMS 2021. [DOI: 10.1080/09720510.2020.1756048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- Kumer Pial Das
- The Office of Vice President for Research, Innovation, and Economic Development, University of Louisiana at Lafayette, Louisiana, U.S.A
| | | | | | | |
Collapse
|
29
|
Aptardi predicts polyadenylation sites in sample-specific transcriptomes using high-throughput RNA sequencing and DNA sequence. Nat Commun 2021; 12:1652. [PMID: 33712618 PMCID: PMC7955126 DOI: 10.1038/s41467-021-21894-x] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 02/18/2021] [Indexed: 02/01/2023] Open
Abstract
Annotation of polyadenylation sites from short-read RNA sequencing alone is a challenging computational task. Other algorithms rooted in DNA sequence predict potential polyadenylation sites; however, in vivo expression of a particular site varies based on a myriad of conditions. Here, we introduce aptardi (alternative polyadenylation transcriptome analysis from RNA-Seq data and DNA sequence information), which leverages both DNA sequence and RNA sequencing in a machine learning paradigm to predict expressed polyadenylation sites. Specifically, as input aptardi takes DNA nucleotide sequence, genome-aligned RNA-Seq data, and an initial transcriptome. The program evaluates these initial transcripts to identify expressed polyadenylation sites in the biological sample and refines transcript 3'-ends accordingly. The average precision of the aptardi model is twice that of a standard transcriptome assembler. In particular, the recall of the aptardi model (the proportion of true polyadenylation sites detected by the algorithm) is improved by over three-fold. Also, the model-trained using the Human Brain Reference RNA commercial standard-performs well when applied to RNA-sequencing samples from different tissues and different mammalian species. Finally, aptardi's input is simple to compile and its output is easily amenable to downstream analyses such as quantitation and differential expression.
Collapse
|
30
|
Gilad G, Sason I, Sharan R. An automated approach for determining the number of components in non-negative matrix factorization with application to mutational signature learning. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2021. [DOI: 10.1088/2632-2153/abc60a] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Abstract
Non-negative matrix factorization (NMF) is a popular method for finding a low rank approximation of a matrix, thereby revealing the latent components behind it. In genomics, NMF is widely used to interpret mutation data and derive the underlying mutational processes and their activities. A key challenge in the use of NMF is determining the number of components, or rank of the factorization. Here we propose a novel method, CV2K, to choose this number automatically from data that is based on a detailed cross validation procedure combined with a parsimony consideration. We apply our method for mutational signature analysis and demonstrate its utility on both simulated and real data sets. In comparison to previous approaches, some of which involve human assessment, CV2K leads to improved predictions across a wide range of data sets.
Collapse
|
31
|
Boniface A, Dong J, Gigan S. Non-invasive focusing and imaging in scattering media with a fluorescence-based transmission matrix. Nat Commun 2020; 11:6154. [PMID: 33262335 PMCID: PMC7708489 DOI: 10.1038/s41467-020-19696-8] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Accepted: 10/08/2020] [Indexed: 11/16/2022] Open
Abstract
In biological microscopy, light scattering represents the main limitation to image at depth. Recently, a set of wavefront shaping techniques has been developed in order to manipulate coherent light in strongly disordered materials. The Transmission Matrix approach has shown its capability to inverse the effect of scattering and efficiently focus light. In practice, the matrix is usually measured using an invasive detector or low-resolution acoustic guide stars. Here, we introduce a non-invasive and all-optical strategy based on linear fluorescence to reconstruct the transmission matrices, to and from a fluorescent object placed inside a scattering medium. It consists in demixing the incoherent patterns emitted by the object using low-rank factorizations and phase retrieval algorithms. We experimentally demonstrate the efficiency of this method through robust and selective focusing. Additionally, from the same measurements, it is possible to exploit memory effect correlations to image and reconstruct extended objects. This approach opens up a new route towards imaging in scattering media with linear or non-linear contrast mechanisms. Light scattering represents the main limitation to image at depth in biological microscopy. The authors present a strategy to characterize light propagation in and out of a scattering medium based on linear fluorescence feedback and from the same measurements exploit memory effect correlations to image and reconstruct extended objects.
Collapse
Affiliation(s)
- Antoine Boniface
- Laboratoire Kastler Brossel, Sorbonne Université, École Normale Supérieure-Paris Sciences et Lettres (PSL) Research University, Centre National de la Recherche Scientifique (CNRS) UMR 8552, Collège de France, 24 rue Lhomond, 75005, Paris, France.
| | - Jonathan Dong
- Laboratoire Kastler Brossel, Sorbonne Université, École Normale Supérieure-Paris Sciences et Lettres (PSL) Research University, Centre National de la Recherche Scientifique (CNRS) UMR 8552, Collège de France, 24 rue Lhomond, 75005, Paris, France.,Laboratoire de Physique de l'École Normale Supérieure, Université Paris Sciences et Lettres (PSL), Centre National de la Recherche Scientifique (CNRS), Sorbonne Université, Université Paris-Diderot, Sorbonne Paris Cité, 24 rue Lhomond, 75005, Paris, France
| | - Sylvain Gigan
- Laboratoire Kastler Brossel, Sorbonne Université, École Normale Supérieure-Paris Sciences et Lettres (PSL) Research University, Centre National de la Recherche Scientifique (CNRS) UMR 8552, Collège de France, 24 rue Lhomond, 75005, Paris, France
| |
Collapse
|
32
|
Lee WJ, Staneva V. Compact representation of temporal processes in echosounder time series via matrix decomposition. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 148:3429. [PMID: 33379913 DOI: 10.1121/10.0002670] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 11/02/2020] [Indexed: 06/12/2023]
Abstract
The recent explosion in the availability of echosounder data from diverse ocean platforms has created unprecedented opportunities to observe the marine ecosystems at broad scales. However, the critical lack of methods capable of automatically discovering and summarizing prominent spatio-temporal echogram structures has limited the effective and wider use of these rich datasets. To address this challenge, a data-driven methodology is developed based on matrix decomposition that builds compact representation of long-term echosounder time series using intrinsic features in the data. In a two-stage approach, noisy outliers are first removed from the data by principal component pursuit, then a temporally smooth nonnegative matrix factorization is employed to automatically discover a small number of distinct daily echogram patterns, whose time-varying linear combination (activation) reconstructs the dominant echogram structures. This low-rank representation provides biological information that is more tractable and interpretable than the original data, and is suitable for visualization and systematic analysis with other ocean variables. Unlike existing methods that rely on fixed, handcrafted rules, this unsupervised machine learning approach is well-suited for extracting information from data collected from unfamiliar or rapidly changing ecosystems. This work forms the basis for constructing robust time series analytics for large-scale, acoustics-based biological observation in the ocean.
Collapse
Affiliation(s)
- Wu-Jung Lee
- Applied Physics Laboratory, University of Washington, Seattle, Washington 98105, USA
| | - Valentina Staneva
- eScience Institute, University of Washington, Seattle, Washington 98105, USA
| |
Collapse
|
33
|
Tuzhilina E, Hastie TJ, Segal MR. Principal curve approaches for inferring 3D chromatin architecture. Biostatistics 2020; 23:626-642. [PMID: 33221831 DOI: 10.1093/biostatistics/kxaa046] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 09/26/2020] [Accepted: 09/29/2020] [Indexed: 11/13/2022] Open
Abstract
Three-dimensional (3D) genome spatial organization is critical for numerous cellular processes, including transcription, while certain conformation-driven structural alterations are frequently oncogenic. Genome architecture had been notoriously difficult to elucidate, but the advent of the suite of chromatin conformation capture assays, notably Hi-C, has transformed understanding of chromatin structure and provided downstream biological insights. Although many findings have flowed from direct analysis of the pairwise proximity data produced by these assays, there is added value in generating corresponding 3D reconstructions deriving from superposing genomic features on the reconstruction. Accordingly, many methods for inferring 3D architecture from proximity data have been advanced. However, none of these approaches exploit the fact that single chromosome solutions constitute a one-dimensional (1D) curve in 3D. Rather, this aspect has either been addressed by imposition of constraints, which is both computationally burdensome and cell type specific, or ignored with contiguity imposed after the fact. Here, we target finding a 1D curve by extending principal curve methodology to the metric scaling problem. We illustrate how this approach yields a sequence of candidate solutions, indexed by an underlying smoothness or degrees-of-freedom parameter, and propose methods for selection from this sequence. We apply the methodology to Hi-C data obtained on IMR90 cells and so are positioned to evaluate reconstruction accuracy by referencing orthogonal imaging data. The results indicate the utility and reproducibility of our principal curve approach in the face of underlying structural variation.
Collapse
Affiliation(s)
- Elena Tuzhilina
- Department of Statistics, Stanford University, Stanford, CA 94305, USA and Department of Epidemiology and Biostatistics, University of California, San Francisco, CA 94143, USA
| | - Trevor J Hastie
- Department of Statistics, Stanford University, Stanford, CA 94305, USA and Department of Epidemiology and Biostatistics, University of California, San Francisco, CA 94143, USA
| | - Mark R Segal
- Department of Statistics, Stanford University, Stanford, CA 94305, USA and Department of Epidemiology and Biostatistics, University of California, San Francisco, CA 94143, USA
| |
Collapse
|
34
|
Yurchenko AA, Padioleau I, Matkarimov BT, Soulier J, Sarasin A, Nikolaev S. XPC deficiency increases risk of hematologic malignancies through mutator phenotype and characteristic mutational signature. Nat Commun 2020; 11:5834. [PMID: 33203900 PMCID: PMC7672101 DOI: 10.1038/s41467-020-19633-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2020] [Accepted: 10/07/2020] [Indexed: 12/21/2022] Open
Abstract
Recent studies demonstrated a dramatically increased risk of leukemia in patients with a rare genetic disorder, Xeroderma Pigmentosum group C (XP-C), characterized by constitutive deficiency of global genome nucleotide excision repair (GG-NER). The genetic mechanisms of non-skin cancers in XP-C patients remain unexplored. In this study, we analyze a unique collection of internal XP-C tumor genomes including 6 leukemias and 2 sarcomas. We observe a specific mutational pattern and an average of 25-fold increase of mutation rates in XP-C versus sporadic leukemia which we presume leads to its elevated incidence and early appearance. We describe a strong mutational asymmetry with respect to transcription and the direction of replication in XP-C tumors suggesting association of mutagenesis with bulky purine DNA lesions of probably endogenous origin. These findings suggest existence of a balance between formation and repair of bulky DNA lesions by GG-NER in human body cells which is disrupted in XP-C patients. Xeroderma Pigmentosum group C (XP-C) is a rare genetic disorder characterised by deficient DNA repair leading to skin and internal cancer, but the latter is not well understood molecularly. Here the authors sequence genomes of non-skin cancers from XP-C patients to unravel its mutational patterns.
Collapse
Affiliation(s)
- Andrey A Yurchenko
- INSERM U981, Gustave Roussy Cancer Campus, Université Paris Saclay, Villejuif, France
| | - Ismael Padioleau
- INSERM U981, Gustave Roussy Cancer Campus, Université Paris Saclay, Villejuif, France
| | - Bakhyt T Matkarimov
- National Laboratory Astana, Nazarbayev University, 010000, Astana, Kazakhstan
| | - Jean Soulier
- University of Paris, INSERM U944 and CNRS UMR7212, Institut de Recherche Saint-Louis, F-75010, Paris, France
| | - Alain Sarasin
- CNRS UMR9019 Genome Integrity and Cancers, Institut Gustave Roussy, Université Paris-Saclay, Villejuif, France
| | - Sergey Nikolaev
- INSERM U981, Gustave Roussy Cancer Campus, Université Paris Saclay, Villejuif, France.
| |
Collapse
|
35
|
Hu B, Ruan Y, Wei F, Qin G, Mo X, Wang X, Zou D. Identification of three glioblastoma subtypes and a six-gene prognostic risk index based on the expression of growth factors and cytokines. Am J Transl Res 2020; 12:4669-4682. [PMID: 32913540 PMCID: PMC7476164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Accepted: 07/22/2020] [Indexed: 06/11/2023]
Abstract
Glioblastoma multiforme (GBM) is the most common and invasive tumor of the central nervous system. Growth factors and cytokines (GFCKs) play a crucial role in tumor invasion. In the present study, GFCK expression profiles from GBM patients in the Chinese Glioma Genome Atlas were used to perform sample clustering with nonnegative matrix factorization. Three GBM subtypes were identified based on differences in GFCK expression, and the subtypes differed in characteristics and prognosis. A prognostic risk index (RI) comprising six GFCKs (BMP2, CCN3, GKN1, LIF, MDK, and SEMA3G) was defined using univariate Cox hazard analysis and multivariate stepwise Cox regression. The RI was validated in two independent data sets and may be independent of some known prognostic factors. Our results suggest that GBM occurs as different subtypes expressing different patterns of GFCKs and that these expression patterns can be captured in an RI that can predict prognosis.
Collapse
Affiliation(s)
- Beiquan Hu
- Department of Neurosurgery, The First Affiliated Hospital, Jinan UniversityGuangzhou 510630, Guangdong, People’s Republic of China
- Department of Neurosurgery, The Fifth Affiliated Hospital of Guangxi Medical UniversityNanning 530022, Guangxi, People’s Republic of China
| | - Yushan Ruan
- Department of Neurosurgery, The Second Affiliated Hospital of Guangxi Medical UniversityNanning 530000, Guangxi, People’s Republic of China
| | - Feng Wei
- Department of Neurosurgery, The Fifth Affiliated Hospital of Guangxi Medical UniversityNanning 530022, Guangxi, People’s Republic of China
| | - Gang Qin
- Department of Neurosurgery, The Fifth Affiliated Hospital of Guangxi Medical UniversityNanning 530022, Guangxi, People’s Republic of China
| | - Xianlun Mo
- Department of Neurosurgery, The Fifth Affiliated Hospital of Guangxi Medical UniversityNanning 530022, Guangxi, People’s Republic of China
| | - Xiangyu Wang
- Department of Neurosurgery, The First Affiliated Hospital, Jinan UniversityGuangzhou 510630, Guangdong, People’s Republic of China
| | - Donghua Zou
- Department of Neurology, The Fifth Affiliated Hospital of Guangxi Medical UniversityNanning 530022, Guangxi, People’s Republic of China
| |
Collapse
|
36
|
Race AM, Rae A, Vorng JL, Havelund R, Dexter A, Kumar N, Steven RT, Passarelli MK, Tyler BJ, Bunch J, Gilmore IS. Correlative Hyperspectral Imaging Using a Dimensionality-Reduction-Based Image Fusion Method. Anal Chem 2020; 92:10979-10988. [DOI: 10.1021/acs.analchem.9b05055] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Alan M. Race
- National Centre of Excellence in Mass Spectrometry Imaging (NiCE-MSI), National Physical Laboratory, Teddington, Middlesex TW11 0LW, U.K
| | - Alasdair Rae
- Surface Technology Group, National Physical Laboratory, Teddington, Middlesex TW11 0LW, U.K
| | - Jean-Luc Vorng
- National Centre of Excellence in Mass Spectrometry Imaging (NiCE-MSI), National Physical Laboratory, Teddington, Middlesex TW11 0LW, U.K
| | - Rasmus Havelund
- National Centre of Excellence in Mass Spectrometry Imaging (NiCE-MSI), National Physical Laboratory, Teddington, Middlesex TW11 0LW, U.K
| | - Alex Dexter
- National Centre of Excellence in Mass Spectrometry Imaging (NiCE-MSI), National Physical Laboratory, Teddington, Middlesex TW11 0LW, U.K
| | - Naresh Kumar
- Surface Technology Group, National Physical Laboratory, Teddington, Middlesex TW11 0LW, U.K
| | - Rory T. Steven
- National Centre of Excellence in Mass Spectrometry Imaging (NiCE-MSI), National Physical Laboratory, Teddington, Middlesex TW11 0LW, U.K
| | - Melissa K. Passarelli
- National Centre of Excellence in Mass Spectrometry Imaging (NiCE-MSI), National Physical Laboratory, Teddington, Middlesex TW11 0LW, U.K
| | - Bonnie J. Tyler
- Physikalisches Institut, Westfälische Wilhelms-Universität Münster, Wilhelm-Klemm-Straße 10, Münster 48149, Germany
| | - Josephine Bunch
- National Centre of Excellence in Mass Spectrometry Imaging (NiCE-MSI), National Physical Laboratory, Teddington, Middlesex TW11 0LW, U.K
- Department of Surgery and Cancer, Faculty of Medicine, Imperial College London, London SW7 2AZ, U.K
- ,The Rosalind Franklin Institute, Harwell Campus, Didcot OX11 0FA, U.K
| | - Ian S. Gilmore
- National Centre of Excellence in Mass Spectrometry Imaging (NiCE-MSI), National Physical Laboratory, Teddington, Middlesex TW11 0LW, U.K
| |
Collapse
|
37
|
Ichinomiya T, Obayashi I, Hiraoka Y. Protein-Folding Analysis Using Features Obtained by Persistent Homology. Biophys J 2020; 118:2926-2937. [PMID: 32428439 PMCID: PMC7300307 DOI: 10.1016/j.bpj.2020.04.032] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2019] [Revised: 02/16/2020] [Accepted: 04/17/2020] [Indexed: 10/25/2022] Open
Abstract
Understanding the protein-folding process is an outstanding issue in biophysics; recent developments in molecular dynamics simulation have provided insights into this phenomenon. However, the large freedom of atomic motion hinders the understanding of this process. In this study, we applied persistent homology, an emerging method to analyze topological features in a data set, to reveal protein-folding dynamics. We developed a new, to our knowledge, method to characterize the protein structure based on persistent homology and applied this method to molecular dynamics simulations of chignolin. Using principle component analysis or nonnegative matrix factorization, our analysis method revealed two stable states and one saddle state, corresponding to the native, misfolded, and transition states, respectively. We also identified an unfolded state with slow dynamics in the reduced space. Our method serves as a promising tool to understand the protein-folding process.
Collapse
Affiliation(s)
- Takashi Ichinomiya
- Gifu University School of Medicine, Gifu, Japan; The United Graduate School of Drug Discovery and Medical Information Sciences of Gifu University, Gifu, Japan.
| | - Ippei Obayashi
- Center for Advanced Intelligence Project, RIKEN, Tokyo, Japan
| | - Yasuaki Hiraoka
- WPI-ASHBi, Kyoto University Institute for Advanced Study, Kyoto University, Kyoto, Japan; Center for Advanced Intelligence Project, RIKEN, Tokyo, Japan
| |
Collapse
|
38
|
Yang Y, Shi Z, Bai R, Hu W. Heterogeneity of MSI-H gastric cancer identifies a subtype with worse survival. J Med Genet 2020; 58:12-19. [PMID: 32170001 DOI: 10.1136/jmedgenet-2019-106609] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Revised: 02/11/2020] [Accepted: 02/25/2020] [Indexed: 01/19/2023]
Abstract
BACKGROUND Microsatellite instability-high (MSI-H) tumour patients generally have a better prognosis than microsatellite-stable (MSS) ones due to the large number of non-synonymous mutations. However, an increasing number of studies have revealed that less than half of MSI-H patients gain survival benefits or symptom alleviation from immune checkpoint-blockade treatment. Thus, an in-depth inspection of heterogeneous MSI-H tumours is urgently required. METHODS Here, we used non-negative matrix factorisation (non-NMF)-based consensus clustering to define stomach adenocarcinoma (STAD) MSI-H subtypes in samples from The Cancer Genome Atlas and an Asian cohort, GSE62254. RESULTS MSI-H STAD samples are basically clustered into two subgroups (MSI-H1 and MSI-H2). Further examination of the immune landscape showed that immune suppression factors were enriched in the MSI-H1 subgroup, which may be associated with the poor prognosis in this subgroup. CONCLUSIONS Our results illustrate the genetic heterogeneity within MSI-H STADs, with important implications for cancer patient risk stratification, prognosis and treatment.
Collapse
Affiliation(s)
- Yanmei Yang
- Key Laboratory of Reproductive and Genetics, Ministry of Education, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Zhong Shi
- Department of Medical Oncology, Institute of Cancer and Basic Medicine (ICBM), Chinese Academy of Sciences, Cancer Hospital of the University of Chinese Academy of Sciences, Hangzhou, China
| | - Rui Bai
- Cancer Institute (Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education), the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Wangxiong Hu
- Cancer Institute (Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education), the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| |
Collapse
|
39
|
Kurashige H, Kaneko J, Yamashita Y, Osu R, Otaka Y, Hanakawa T, Honda M, Kawabata H. Revealing Relationships Among Cognitive Functions Using Functional Connectivity and a Large-Scale Meta-Analysis Database. Front Hum Neurosci 2020; 13:457. [PMID: 31998102 PMCID: PMC6965330 DOI: 10.3389/fnhum.2019.00457] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Accepted: 12/12/2019] [Indexed: 11/13/2022] Open
Abstract
To characterize each cognitive function per se and to understand the brain as an aggregate of those functions, it is vital to relate dozens of these functions to each other. Knowledge about the relationships among cognitive functions is informative not only for basic neuroscientific research but also for clinical applications and developments of brain-inspired artificial intelligence. In the present study, we propose an exhaustive data mining approach to reveal relationships among cognitive functions based on functional brain mapping and network analysis. We began our analysis with 109 pseudo-activation maps (cognitive function maps; CFM) that were reconstructed from a functional magnetic resonance imaging meta-analysis database, each of which corresponds to one of 109 cognitive functions such as ‘emotion,’ ‘attention,’ ‘episodic memory,’ etc. Based on the resting-state functional connectivity between the CFMs, we mapped the cognitive functions onto a two-dimensional space where the relevant functions were located close to each other, which provided a rough picture of the brain as an aggregate of cognitive functions. Then, we conducted so-called conceptual analysis of cognitive functions using clustering of voxels in each CFM connected to the other 108 CFMs with various strengths. As a result, a CFM for each cognitive function was subdivided into several parts, each of which is strongly associated with some CFMs for a subset of the other cognitive functions, which brought in sub-concepts (i.e., sub-functions) of the cognitive function. Moreover, we conducted network analysis for the network whose nodes were parcels derived from whole-brain parcellation based on the whole-brain voxel-to-CFM resting-state functional connectivities. Since each parcel is characterized by associations with the 109 cognitive functions, network analyses using them are expected to inform about relationships between cognitive and network characteristics. Indeed, we found that informational diversities of interaction between parcels and densities of local connectivity were dependent on the kinds of associated functions. In addition, we identified the homogeneous and inhomogeneous network communities about the associated functions. Altogether, we suggested the effectiveness of our approach in which we fused the large-scale meta-analysis of functional brain mapping with the methods of network neuroscience to investigate the relationships among cognitive functions.
Collapse
Affiliation(s)
- Hiroki Kurashige
- Institute of Innovative Science and Technology, Tokai University, Tokyo, Japan.,National Institute of Neuroscience, National Center of Neurology and Psychiatry, Tokyo, Japan
| | - Jun Kaneko
- Institute of Innovative Science and Technology, Tokai University, Tokyo, Japan
| | - Yuichi Yamashita
- National Institute of Neuroscience, National Center of Neurology and Psychiatry, Tokyo, Japan
| | - Rieko Osu
- Faculty of Human Sciences, Waseda University, Tokyo, Japan
| | - Yohei Otaka
- Department of Rehabilitation Medicine I, School of Medicine, Fujita Health University, Aichi, Japan.,Department of Rehabilitation Medicine, Tokyo Bay Rehabilitation Hospital, Chiba, Japan
| | - Takashi Hanakawa
- Integrative Brain Imaging Center, National Center of Neurology and Psychiatry, Tokyo, Japan
| | - Manabu Honda
- National Institute of Neuroscience, National Center of Neurology and Psychiatry, Tokyo, Japan
| | | |
Collapse
|
40
|
Kompa B, Coker B. Learning a Latent Space of Highly Multidimensional Cancer Data. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2020; 25:379-390. [PMID: 31797612 PMCID: PMC6934353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
We introduce a Unified Disentanglement Network (UFDN) trained on The Cancer Genome Atlas (TCGA), which we refer to as UFDN-TCGA. We demonstrate that UFDN-TCGA learns a biologically relevant, low-dimensional latent space of high-dimensional gene expression data by applying our network to two classification tasks of cancer status and cancer type. UFDN-TCGA performs comparably to random forest methods. The UFDN allows for continuous, partial interpolation between distinct cancer types. Furthermore, we perform an analysis of differentially expressed genes between skin cutaneous melanoma (SKCM) samples and the same samples interpolated into glioblastoma (GBM). We demonstrate that our interpolations consist of relevant metagenes that recapitulate known glioblastoma mechanisms.
Collapse
Affiliation(s)
- Benjamin Kompa
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA,
| | | |
Collapse
|
41
|
Xicola RM, Clark JR, Carroll T, Alvikas J, Marwaha P, Regan MR, Lopez-Giraldez F, Choi J, Emmadi R, Alagiozian-Angelova V, Kupfer SS, Ellis NA, Llor X. Implication of DNA repair genes in Lynch-like syndrome. Fam Cancer 2019; 18:331-342. [PMID: 30989425 DOI: 10.1007/s10689-019-00128-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Many colorectal cancers (CRCs) that exhibit microsatellite instability (MSI) are not explained by MLH1 promoter methylation or germline mutations in mismatch repair (MMR) genes, which cause Lynch syndrome (LS). Instead, these Lynch-like syndrome (LLS) patients have somatic mutations in MMR genes. However, many of these patients are young and have relatives with cancer, suggesting a hereditary entity. We performed germline sequence analysis in LLS patients and determined their tumor's mutational profiles using FFPE DNA. Six hundred and fifty-four consecutive CRC patients were screened for suspected LS using MSI and absence of MLH1 methylation. Suspected LS cases were exome sequenced to identify germline and somatic mutations. Single nucleotide variants were used to characterize mutational signatures. We identified 23 suspected LS cases. Germline sequence analysis of 16 available samples identified five cases with LS mutations and 11 cases without LS mutations, LLS. Most LLS tumors had a combination of somatic MMR gene mutation and loss of heterozygosity. LLS patients were relatively young and had excess first-degree relatives with cancer. Four of the 11 LLS patients had rare likely pathogenic variants in genes that maintain genome integrity. Moreover, tumors from this group had a distinct mutational signature compared to tumors from LLS patients lacking germline mutations in these genes. In summary, more than a third of the LLS patients studied had germline mutations in genes that maintain genome integrity and their tumors had a distinct mutational signature. The possibility of hereditary factors in LLS warrants further studies so counseling can be properly informed.
Collapse
Affiliation(s)
- Rosa M Xicola
- Department of Internal Medicine and Cancer Center, Yale University School of Medicine, P. O. Box 208019, 333 Cedar Street/LMP 1080, New Haven, CT, 06520-8019, USA
| | - Julia R Clark
- Department of Medicine and Cancer Center, University of Illinois at Chicago, 1020N CSB, Chicago, IL, 60612, USA
| | - Timothy Carroll
- Department of Medicine and Cancer Center, University of Illinois at Chicago, 1020N CSB, Chicago, IL, 60612, USA
| | - Jurgis Alvikas
- Department of Medicine and Cancer Center, University of Illinois at Chicago, 1020N CSB, Chicago, IL, 60612, USA
| | - Priti Marwaha
- Department of Medicine and Cancer Center, University of Illinois at Chicago, 1020N CSB, Chicago, IL, 60612, USA
| | - Maureen R Regan
- Department of Medicine and Cancer Center, University of Illinois at Chicago, 1020N CSB, Chicago, IL, 60612, USA
| | - Francesc Lopez-Giraldez
- Yale Center for Genome Analysis, Yale University, 830 West Campus Drive, Orange, CT, 06477, USA
| | - Jungmin Choi
- Department of Genetics and Yale Center for Genome Analysis, Yale University School of Medicine, 830 West Campus Drive, Orange, CT, 06477, USA
| | - Rajyasree Emmadi
- Department of Pathology, University of Illinois at Chicago, 840 S. Wood St., Suite 130 CSN, Chicago, IL, 60612, USA
| | | | - Sonia S Kupfer
- Center for Clinical Cancer Genetics, The University of Chicago, 900 East 57th Street, Chicago, IL, 60637, USA
| | - Nathan A Ellis
- Department of Cellular and Molecular Medicine, University of Arizona, 1515 N. Campbell Ave., Tucson, AZ, 85724-5024, USA
| | - Xavier Llor
- Department of Internal Medicine and Cancer Center, Yale University School of Medicine, P. O. Box 208019, 333 Cedar Street/LMP 1080, New Haven, CT, 06520-8019, USA.
| |
Collapse
|
42
|
Liu D, Davila-Velderrain J, Zhang Z, Kellis M. Integrative construction of regulatory region networks in 127 human reference epigenomes by matrix factorization. Nucleic Acids Res 2019; 47:7235-7246. [PMID: 31265076 PMCID: PMC6698807 DOI: 10.1093/nar/gkz538] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2018] [Revised: 04/19/2019] [Accepted: 06/09/2019] [Indexed: 01/14/2023] Open
Abstract
Despite large experimental and computational efforts aiming to dissect the mechanisms underlying disease risk, mapping cis-regulatory elements to target genes remains a challenge. Here, we introduce a matrix factorization framework to integrate physical and functional interaction data of genomic segments. The framework was used to predict a regulatory network of chromatin interaction edges linking more than 20 000 promoters and 1.8 million enhancers across 127 human reference epigenomes, including edges that are present in any of the input datasets. Our network integrates functional evidence of correlated activity patterns from epigenomic data and physical evidence of chromatin interactions. An important contribution of this work is the representation of heterogeneous data with different qualities as networks. We show that the unbiased integration of independent data sources suggestive of regulatory interactions produces meaningful associations supported by existing functional and physical evidence, correlating with expected independent biological features.
Collapse
Affiliation(s)
- Dianbo Liu
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,Division of Computational Biology, School of Life Sciences, University of Dundee, Dundee, DD1 5HL, Scotland, UK
| | - Jose Davila-Velderrain
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Zhizhuo Zhang
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Manolis Kellis
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| |
Collapse
|
43
|
A Spatiotemporal Constraint Non-Negative Matrix Factorization Model to Discover Intra-Urban Mobility Patterns from Taxi Trips. SUSTAINABILITY 2019. [DOI: 10.3390/su11154214] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Taxi services provide an urban transport option to citizens. Massive taxi trajectories contain rich information for understanding human travel activities, which are essential to sustainable urban mobility and transportation. The origin and destination (O-D) pairs of urban taxi trips can reveal the spatiotemporal patterns of human mobility and then offer fundamental information to interpret and reform formal, functional, and perceptual regions of cities. Matrices are one of the most effective models to represent taxi trajectories and O-D trips. Among matrix representations, non-negative matrix factorization (NMF) gives meaningful interpretations of complex latent relationships. However, the independence assumption for observations is violated by spatial and temporal autocorrelation in taxi flows, which is not compensated in classical NMF models. In order to discover human intra-urban mobility patterns, a novel spatiotemporal constraint NMF (STC-NMF) model that explicitly solves spatial and temporal dependencies is proposed in this paper. It factorizes taxi flow matrices in both spatial and temporal aspects, thus revealing inherent spatiotemporal patterns. With three-month taxi trajectories harvested in Beijing, China, the STC-NMF model is employed to investigate taxi travel patterns and their spatial interaction modes. As the results, four departure patterns, three arrival patterns, and eight spatial interaction patterns during weekdays and weekends are discovered. Moreover, it is found that intensive movements within certain time windows are significantly related to region functionalities and the spatial interaction flows exhibit an obvious distance decay tendency. The outcome of the proposed model is more consistent with the inherent spatiotemporal characteristics of human intra-urban movements. The knowledge gained in this research would be useful to taxi services and transportation management for promoting sustainable urban development.
Collapse
|
44
|
Zeng Z, Vo AH, Mao C, Clare SE, Khan SA, Luo Y. Cancer classification and pathway discovery using non-negative matrix factorization. J Biomed Inform 2019; 96:103247. [PMID: 31271844 PMCID: PMC6697569 DOI: 10.1016/j.jbi.2019.103247] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Revised: 04/23/2019] [Accepted: 07/01/2019] [Indexed: 02/08/2023]
Abstract
OBJECTIVES Extracting genetic information from a full range of sequencing data is important for understanding disease. We propose a novel method to effectively explore the landscape of genetic mutations and aggregate them to predict cancer type. DESIGN We applied non-smooth non-negative matrix factorization (nsNMF) and support vector machine (SVM) to utilize the full range of sequencing data, aiming to better aggregate genetic mutations and improve their power to predict disease type. More specifically, we introduce a novel classifier to distinguish cancer types using somatic mutations obtained from whole-exome sequencing data. Mutations were identified from multiple cancers and scored using SIFT, PP2, and CADD, and collapsed at the individual gene level. nsNMF was then applied to reduce dimensionality and obtain coefficient and basis matrices. A feature matrix was derived from the obtained matrices to train a classifier for cancer type classification with the SVM model. RESULTS We have demonstrated that the classifier was able to distinguish four cancer types with reasonable accuracy. In five-fold cross-validations using mutation counts as features, the average prediction accuracy was 80% (SEM = 0.1%), significantly outperforming baselines and outperforming models using mutation scores as features. CONCLUSION Using the factor matrices derived from the nsNMF, we identified multiple genes and pathways that are significantly associated with each cancer type. This study presents a generic and complete pipeline to study the associations between somatic mutations and cancers. The proposed method can be adapted to other studies for disease status classification and pathway discovery.
Collapse
Affiliation(s)
- Zexian Zeng
- Department of Preventive Medicine, Northwestern University, Feinberg School of Medicine, Chicago, IL, USA
| | - Andy H Vo
- Committee on Developmental Biology and Regenerative Medicine, The University of Chicago, Chicago, IL, USA
| | - Chengsheng Mao
- Department of Preventive Medicine, Northwestern University, Feinberg School of Medicine, Chicago, IL, USA
| | - Susan E Clare
- Department of Surgery, Northwestern University, Feinberg School of Medicine, Chicago, IL, USA.
| | - Seema A Khan
- Department of Surgery, Northwestern University, Feinberg School of Medicine, Chicago, IL, USA.
| | - Yuan Luo
- Department of Preventive Medicine, Northwestern University, Feinberg School of Medicine, Chicago, IL, USA.
| |
Collapse
|
45
|
Hu W, Yang Y, Qi L, Chen J, Ge W, Zheng S. Subtyping of microsatellite instability-high colorectal cancer. Cell Commun Signal 2019; 17:79. [PMID: 31331345 PMCID: PMC6647262 DOI: 10.1186/s12964-019-0397-4] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2019] [Accepted: 07/15/2019] [Indexed: 12/12/2022] Open
Abstract
Background Patients with microsatellite instability-high (MSI-H) colorectal cancer (CRC) generally have a better prognosis than patients with microsatellite stable (MSS) CRC. However, some MSI-H CRC patients do not gain overall survival benefits from immune checkpoint-blockade treatment. In other words, heterogeneity within the subgroup of MSI-H tumors remains poorly understood. Thus, an in-depth molecular characterization of MSI-H tumors is urgently required. Methods Here, we use nonnegative matrix factorization (NMF)-based consensus clustering to define CRC MSI-H subtypes in The Cancer Genome Atlas and a French multicenter cohort GSE39582. CIBERSORT was used to calculate the proportions of 22 lymphocytes in tumor tissue in MSI-H subtypes. Results MSI-H CRC samples basically clustered into two subgroups (MSI-H1 and MSI-H2). MSI-H1 was characterized by a lower BRAF mutational status, higher frequency of chromosomal instability, global hypomethylation, and worse survival than MSI-H2. Further examination of the immune landscape showed that macrophages of the M2 phenotype were enriched in MSI-H1, which may be associated with poor prognosis in this subgroup. Conclusions Our results illustrate the genetic heterogeneity in MSI-H CRCs and macrophages may serve as good targets for anticancer therapy in MSI-H1. Graphical abstract ![]()
Electronic supplementary material The online version of this article (10.1186/s12964-019-0397-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Wangxiong Hu
- Cancer Institute (Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education), The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang, 310009, Hangzhou, China.
| | - Yanmei Yang
- Key Laboratory of Reproductive and Genetics, Ministry of Education, Women's Hospital, Zhejiang University School of Medicine, Zhejiang, 310006, Hangzhou, China
| | - Lina Qi
- Cancer Institute (Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education), The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang, 310009, Hangzhou, China
| | - Jiani Chen
- Cancer Institute (Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education), The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang, 310009, Hangzhou, China
| | - Weiting Ge
- Cancer Institute (Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education), The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang, 310009, Hangzhou, China
| | - Shu Zheng
- Cancer Institute (Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education), The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang, 310009, Hangzhou, China.
| |
Collapse
|
46
|
Baez-Ortega A, Gori K. Computational approaches for discovery of mutational signatures in cancer. Brief Bioinform 2019; 20:77-88. [PMID: 28968631 DOI: 10.1093/bib/bbx082] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2017] [Indexed: 01/07/2023] Open
Abstract
The accumulation of somatic mutations in a genome is the result of the activity of one or more mutagenic processes, each of which leaves its own imprint. The study of these DNA fingerprints, termed mutational signatures, holds important potential for furthering our understanding of the causes and evolution of cancer, and can provide insights of relevance for cancer prevention and treatment. In this review, we focus our attention on the mathematical models and computational techniques that have driven recent advances in the field.
Collapse
Affiliation(s)
| | - Kevin Gori
- Transmissible Cancer Group, University of Cambridge
| |
Collapse
|
47
|
Liou JY, Ma H, Wenzel M, Zhao M, Baird-Daniel E, Smith EH, Daniel A, Emerson R, Yuste R, Schwartz TH, Schevon CA. Role of inhibitory control in modulating focal seizure spread. Brain 2019; 141:2083-2097. [PMID: 29757347 DOI: 10.1093/brain/awy116] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2017] [Accepted: 03/04/2018] [Indexed: 11/12/2022] Open
Abstract
Focal seizure propagation is classically thought to be spatially contiguous. However, distribution of seizures through a large-scale epileptic network has been theorized. Here, we used a multielectrode array, wide field calcium imaging, and two-photon calcium imaging to study focal seizure propagation pathways in an acute rodent neocortical 4-aminopyridine model. Although ictal neuronal bursts did not propagate beyond a 2-3-mm region, they were associated with hemisphere-wide field potential fluctuations and parvalbumin-positive interneuron activity outside the seizure focus. While bicuculline surface application enhanced contiguous seizure propagation, focal bicuculline microinjection at sites distant to the 4-aminopyridine focus resulted in epileptic network formation with maximal activity at the two foci. Our study suggests that both classical and epileptic network propagation can arise from localized inhibition defects, and that the network appearance can arise in the context of normal brain structure without requirement for pathological connectivity changes between sites.
Collapse
Affiliation(s)
- Jyun-You Liou
- Department of Physiology and Cellular Biophysics, Columbia University, New York, NY, USA
| | - Hongtao Ma
- Department of Neurological Surgery, Feil Family Brain and Mind Research Institute, Sackler Brain and Spine Institute, Weill Cornell Medical College, New York, NY, USA
| | - Michael Wenzel
- Neurotechnology Center, Department of Biological Sciences, Columbia University, New York, NY, USA; Department of Neuroscience, Columbia University, New York, NY, USA
| | - Mingrui Zhao
- Department of Neurological Surgery, Feil Family Brain and Mind Research Institute, Sackler Brain and Spine Institute, Weill Cornell Medical College, New York, NY, USA
| | - Eliza Baird-Daniel
- Department of Neurological Surgery, Feil Family Brain and Mind Research Institute, Sackler Brain and Spine Institute, Weill Cornell Medical College, New York, NY, USA
| | - Elliot H Smith
- Department of Neurological Surgery, Columbia University Medical Center, New York, NY, USA
| | - Andy Daniel
- Department of Neurological Surgery, Feil Family Brain and Mind Research Institute, Sackler Brain and Spine Institute, Weill Cornell Medical College, New York, NY, USA
| | - Ronald Emerson
- Hospital for Special Surgery, Weill Cornell Medical College, 535 East 70th Street, New York, NY, USA
| | - Rafael Yuste
- Neurotechnology Center, Department of Biological Sciences, Columbia University, New York, NY, USA; Department of Neuroscience, Columbia University, New York, NY, USA
| | - Theodore H Schwartz
- Department of Neurological Surgery, Feil Family Brain and Mind Research Institute, Sackler Brain and Spine Institute, Weill Cornell Medical College, New York, NY, USA
| | - Catherine A Schevon
- Department of Neurology, Columbia University Medical Center, New York, New York, USA
| |
Collapse
|
48
|
Phalen H, Coffman BA, Ghuman A, Sejdić E, Salisbury DF. Non-negative Matrix Factorization Reveals Resting-State Cortical Alpha Network Abnormalities in the First-Episode Schizophrenia Spectrum. BIOLOGICAL PSYCHIATRY: COGNITIVE NEUROSCIENCE AND NEUROIMAGING 2019; 5:961-970. [PMID: 31451387 DOI: 10.1016/j.bpsc.2019.06.010] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2019] [Revised: 06/18/2019] [Accepted: 06/18/2019] [Indexed: 01/08/2023]
Abstract
BACKGROUND Little is known about neural oscillatory dynamics in first-episode psychosis. Pathophysiology of functional connectivity can be measured through network activity of alpha oscillations, reflecting long-range communication between distal brain regions. METHODS Resting magnetoencephalographic activity was collected from 31 individuals with first-episode schizophrenia spectrum psychosis and 22 healthy control individuals. Activity was projected to the realistic cortical surface, based on structural magnetic resonance imaging. The first principal component of activity in 40 Brodmann areas per hemisphere was Hilbert transformed within the alpha range. Non-negative matrix factorization was applied to single-trial alpha phase-locking values from all subjects to determine alpha networks. Within networks, energy and entropy were compared. RESULTS Four cortical alpha networks were pathological in individuals with first-episode schizophrenia spectrum psychosis. The networks involved the bilateral anterior and posterior cingulate; left auditory, medial temporal, and cingulate cortex; right inferior frontal gyrus and widespread areas; and right posterior parietal cortex and widespread areas. Energy and entropy were associated with the Positive and Negative Syndrome Scale total and thought disorder factors for the first three networks. In addition, the left posterior temporal network was associated with positive and negative factors, and the right inferior frontal network was associated with the positive factor. CONCLUSIONS Machine learning network analysis of resting alpha-band neural activity identified several aberrant networks in individuals with first-episode schizophrenia spectrum psychosis, including the left temporal, right inferior frontal, right posterior parietal, and bilateral cingulate cortices. Abnormal long-range alpha communication is evident at the first presentation for psychosis and may provide clues about mechanisms of dysconnectivity in psychosis and novel targets for noninvasive brain stimulation.
Collapse
Affiliation(s)
- Henry Phalen
- Department of Bioengineering, University of Pittsburgh, Pittsburgh, Pennsylvania
| | - Brian A Coffman
- Clinical Neurophysiology Research Laboratory, Western Psychiatric Institute and Clinic, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania
| | - Avniel Ghuman
- Laboratory of Cognitive Neurodynamics, Department of Neurological Surgery, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania
| | - Ervin Sejdić
- Department of Electrical and Computer Engineering, University of Pittsburgh, Pittsburgh, Pennsylvania
| | - Dean F Salisbury
- Clinical Neurophysiology Research Laboratory, Western Psychiatric Institute and Clinic, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania.
| |
Collapse
|
49
|
Zhu S, Wu X, Fu H, Ye C, Chen M, Jiang Z, Ji G. Modeling of Genome-Wide Polyadenylation Signals in Xenopus tropicalis. Front Genet 2019; 10:647. [PMID: 31333724 PMCID: PMC6616101 DOI: 10.3389/fgene.2019.00647] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2019] [Accepted: 06/18/2019] [Indexed: 12/22/2022] Open
Abstract
Alternative polyadenylation (APA) is an important post-transcriptional modification event to process messenger RNA (mRNA) for transcriptional termination, transport, and translation. In the present study, we characterized poly(A) signals in Xenopus tropicalis using 70,918 highly confident poly(A) sites derived from 16,511 protein-coding genes to understand their roles in the regulation of embryo development and gender difference. We examined potential factors, including the gene length, the number of introns in a gene, and the intron length, that may affect the prevalence of APA. We observed 12 prominent poly(A) signal patterns, which accounted for approximately 92% of total APA sites in Xenopus tropicalis. Among them, three patterns are specific to X. tropicalis, so they are absent in other animals such as humans or mice. We catalogued APA sites based on their genomic regions and developed a bioinformatics pipeline to identify over-represented signal patterns for each class. Then the schema of cis elements for APA sites in each genomic region was proposed. More importantly, APA usage is dramatically dynamic in embryos along five developmental stages and well-coordinated with the maternal-to-zygotic transition event. We used an entropy-based method to identify developmental stage-specific APA sites and identified significant signal patterns around specific sites and constitutive sites. We found that the APA frequency in different genomic regions varies with developmental stages and that those sites located in intron or coding sequence regions contribute most to the dynamics of gene expression during developmental stages. This study deciphers the characteristics and poly(A) signal patterns for both canonical APA sites and non-canonical APA sites across different developmental stages and gender dimorphisms in X. tropicalis, providing new insights into the dynamic regulation of distal and proximal APA.
Collapse
Affiliation(s)
- Sheng Zhu
- Department of Automation, Xiamen University, Xiamen, China.,National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China
| | - Xiaohui Wu
- Department of Automation, Xiamen University, Xiamen, China.,National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China.,Innovation Center for Cell Signaling Network, Xiamen University, Xiamen, China
| | - Hongjuan Fu
- Department of Automation, Xiamen University, Xiamen, China
| | - Congting Ye
- National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China.,Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, China
| | - Moliang Chen
- Department of Automation, Xiamen University, Xiamen, China
| | - Zhihua Jiang
- Department of Animal Sciences and Center for Reproductive Biology, Washington State University, Pullman, WA, United States
| | - Guoli Ji
- Department of Automation, Xiamen University, Xiamen, China.,National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, China.,Innovation Center for Cell Signaling Network, Xiamen University, Xiamen, China
| |
Collapse
|
50
|
Bodelon C, Killian JK, Sampson JN, Anderson WF, Matsuno R, Brinton LA, Lissowska J, Anglesio MS, Bowtell DDL, Doherty JA, Ramus SJ, Talhouk A, Sherman ME, Wentzensen N. Molecular Classification of Epithelial Ovarian Cancer Based on Methylation Profiling: Evidence for Survival Heterogeneity. Clin Cancer Res 2019; 25:5937-5946. [PMID: 31142506 DOI: 10.1158/1078-0432.ccr-18-3720] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2018] [Revised: 03/18/2019] [Accepted: 05/23/2019] [Indexed: 12/12/2022]
Abstract
PURPOSE Ovarian cancer is a heterogeneous disease that can be divided into multiple subtypes with variable etiology, pathogenesis, and prognosis. We analyzed DNA methylation profiling data to identify biologic subgroups of ovarian cancer and study their relationship with histologic subtypes, copy number variation, RNA expression data, and outcomes. EXPERIMENTAL DESIGN A total of 162 paraffin-embedded ovarian epithelial tumor tissues, including the five major epithelial ovarian tumor subtypes (high- and low-grade serous, endometrioid, mucinous, and clear cell) and tumors of low malignant potential were selected from two different sources: The Polish Ovarian Cancer study, and the Surveillance, Epidemiology, and End Results Residual Tissue Repository (SEER RTR). Analyses were restricted to Caucasian women. Methylation profiling was conducted using the Illumina 450K methylation array. For 45 tumors array copy number data were available. NanoString gene expression data for 39 genes were available for 61 high-grade serous carcinomas (HGSC). RESULTS Consensus nonnegative matrix factorization clustering of the 1,000 most variable CpG sites showed four major clusters among all epithelial ovarian cancers. We observed statistically significant differences in survival (log-rank test, P = 9.1 × 10-7) and genomic instability across these clusters. Within HGSC, clustering showed three subgroups with survival differences (log-rank test, P = 0.002). Comparing models with and without methylation subgroups in addition to previously identified gene expression subtypes suggested that the methylation subgroups added significant survival information (P = 0.007). CONCLUSIONS DNA methylation profiling of ovarian cancer identified novel molecular subgroups that had significant survival difference and provided insights into the molecular underpinnings of ovarian cancer.See related commentary by Ishak et al., p. 5729.
Collapse
Affiliation(s)
- Clara Bodelon
- Division of Cancer Epidemiology and Genetics, NCI, NIH, Bethesda, Maryland.
| | - J Keith Killian
- Center for Cancer Research (CCR), NCI, NIH, Bethesda, Maryland
| | - Joshua N Sampson
- Division of Cancer Epidemiology and Genetics, NCI, NIH, Bethesda, Maryland
| | - William F Anderson
- Division of Cancer Epidemiology and Genetics, NCI, NIH, Bethesda, Maryland
| | - Rayna Matsuno
- Foundation Medicine Inc., Cambridge, Massachusetts.,University of California, San Diego, California
| | - Louise A Brinton
- Division of Cancer Epidemiology and Genetics, NCI, NIH, Bethesda, Maryland
| | - Jolanta Lissowska
- M. Sklodowska-Curie Memorial Cancer Center and Institute of Oncology, Warsaw, Poland
| | - Michael S Anglesio
- Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, Canada.,Department of Obstetrics and Gynaecology, University of British Columbia, Vancouver, Canada
| | - David D L Bowtell
- The Kinghorn Cancer Center, Garvan Institute of Medical Research, Sydney, Australia.,Peter MacCallum Cancer Center, Melbourne, Australia
| | - Jennifer A Doherty
- Department of Population Health Sciences, Huntsman Cancer Institute, University of Utah, Salt Lake City, Utah
| | - Susan J Ramus
- The Kinghorn Cancer Center, Garvan Institute of Medical Research, Sydney, Australia.,School of Women's and Children's Health, University of New South Wales, Sydney, Australia
| | - Aline Talhouk
- Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, Canada
| | - Mark E Sherman
- Division of Cancer Epidemiology and Genetics, NCI, NIH, Bethesda, Maryland.,Mayo Clinic, Jacksonville, Florida
| | - Nicolas Wentzensen
- Division of Cancer Epidemiology and Genetics, NCI, NIH, Bethesda, Maryland
| |
Collapse
|