1
|
Singular value thresholding two-stage matrix completion for drug sensitivity discovery. Comput Biol Chem 2024; 110:108071. [PMID: 38718497 DOI: 10.1016/j.compbiolchem.2024.108071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2023] [Revised: 04/06/2024] [Accepted: 04/11/2024] [Indexed: 05/27/2024]
Abstract
Incomplete data presents significant challenges in drug sensitivity analysis, especially in critical areas like oncology, where precision is paramount. Our study introduces an innovative imputation method designed specifically for low-rank matrices, addressing the crucial challenge of data completion in anticancer drug sensitivity testing. Our method unfolds in two main stages: Initially, the singular value thresholding algorithm is employed for preliminary matrix completion, establishing a solid foundation for subsequent steps. Then, the matrix rows are segmented into distinct blocks based on hierarchical clustering of correlation coefficients, applying singular value thresholding to the largest block, which has been proved to possess the largest entropy. This is followed by a refined data restoration process, where the reconstructed largest block is integrated into the initial matrix completion to achieve the final matrix completion. Compared to other methods, our approach not only improves the accuracy of data restoration but also ensures the integrity and reliability of the imputed values, establishing it as a robust tool for future drug sensitivity analysis.
Collapse
|
2
|
Predicting rectal cancer prognosis from histopathological images and clinical information using multi-modal deep learning. Front Oncol 2024; 14:1353446. [PMID: 38690169 PMCID: PMC11060749 DOI: 10.3389/fonc.2024.1353446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Accepted: 04/01/2024] [Indexed: 05/02/2024] Open
Abstract
Objective The objective of this study was to provide a multi-modal deep learning framework for forecasting the survival of rectal cancer patients by utilizing both digital pathological images data and non-imaging clinical data. Materials and methods The research included patients diagnosed with rectal cancer by pathological confirmation from January 2015 to December 2016. Patients were allocated to training and testing sets in a randomized manner, with a ratio of 4:1. The tissue microarrays (TMAs) and clinical indicators were obtained. Subsequently, we selected distinct deep learning models to individually forecast patient survival. We conducted a scanning procedure on the TMAs in order to transform them into digital pathology pictures. Additionally, we performed pre-processing on the clinical data of the patients. Subsequently, we selected distinct deep learning algorithms to conduct survival prediction analysis using patients' pathological images and clinical data, respectively. Results A total of 292 patients with rectal cancer were randomly allocated into two groups: a training set consisting of 234 cases, and a testing set consisting of 58 instances. Initially, we make direct predictions about the survival status by using pre-processed Hematoxylin and Eosin (H&E) pathological images of rectal cancer. We utilized the ResNest model to extract data from histopathological images of patients, resulting in a survival status prediction with an AUC (Area Under the Curve) of 0.797. Furthermore, we employ a multi-head attention fusion (MHAF) model to combine image features and clinical features in order to accurately forecast the survival rate of rectal cancer patients. The findings of our experiment show that the multi-modal structure works better than directly predicting from histopathological images. It achieves an AUC of 0.837 in predicting overall survival (OS). Conclusions Our study highlights the potential of multi-modal deep learning models in predicting survival status from histopathological images and clinical information, thus offering valuable insights for clinical applications.
Collapse
|
3
|
Effects of different KRAS mutants and Ki67 expression on diagnosis and prognosis in lung adenocarcinoma. Sci Rep 2024; 14:4085. [PMID: 38374309 PMCID: PMC10876986 DOI: 10.1038/s41598-023-48307-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Accepted: 11/24/2023] [Indexed: 02/21/2024] Open
Abstract
Lung adenocarcinoma (LUAD) is a prevalent form of non-small cell lung cancer with a rising incidence in recent years. Understanding the mutation characteristics of LUAD is crucial for effective treatment and prediction of this disease. Among the various mutations observed in LUAD, KRAS mutations are particularly common. Different subtypes of KRAS mutations can activate the Ras signaling pathway to varying degrees, potentially influencing the pathogenesis and prognosis of LUAD. This study aims to investigate the relationship between different KRAS mutation subtypes and the pathogenesis and prognosis of LUAD. A total of 63 clinical samples of LUAD were collected for this study. The samples were analyzed using targeted gene sequencing panels to obtain sequencing data. To complement the dataset, additional clinical and sequencing data were obtained from TCGA and MSK. The analysis revealed significantly higher Ki67 immunohistochemical scores in patients with missense mutations compared to controls. Moreover, the expression level of KRAS was found to be significantly correlated with Ki67 expression. Enrichment analysis indicated that KRAS missense mutations activated the SWEET_LUNG_CANCER_KRAS_DN and CREIGHTON_ENDOCRINE_THERAPY_RESISTANCE_2 pathways. Additionally, patients with KRAS missense mutations and high Ki67 IHC scores exhibited significantly higher tumor mutational burden levels compared to other groups, which suggests they are more likely to be responsive to ICIs. Based on the data from MSK and TCGA, it was observed that patients with KRAS missense mutations had shorter survival compared to controls, and Ki67 expression level could more accurately predict patient prognosis. In conclusion, when utilizing KRAS mutations as biomarkers for the treatment and prediction of LUAD, it is important to consider the specific KRAS mutant subtypes and Ki67 expression levels. These findings contribute to a better understanding of LUAD and have implications for personalized therapeutic approaches in the management of this disease.
Collapse
|
4
|
Co-expression of IL-21-Enhanced NKG2D CAR-NK cell therapy for lung cancer. BMC Cancer 2024; 24:119. [PMID: 38263004 PMCID: PMC10807083 DOI: 10.1186/s12885-023-11806-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 12/28/2023] [Indexed: 01/25/2024] Open
Abstract
BACKGROUND Adoptive cell therapy has achieved great success in treating hematological malignancies. However, the production of chimeric antigen receptor T (CAR-T) cell therapy still faces various difficulties. Natural killer (NK)-92 is a continuously expandable cell line and provides a promising alternative for patient's own immune cells. METHODS We established CAR-NK cells by co-expressing natural killer group 2 member D (NKG2D) and IL-21, and evaluated the efficacy of NKG2D-IL-21 CAR-NK cells in treating lung cancer in vitro and in vivo. RESULTS Our data suggested that the expression of IL-21 effectively increased the cytotoxicity of NKG2D CAR-NK cells against lung cancer cells in a dose-dependent manner and suppressed tumor growth in vitro and in vivo. In addition, the proliferation of NKG2D-IL-21 CAR-NK cells were enhanced while the apoptosis and exhaustion of these cells were suppressed. Mechanistically, IL-21-mediated NKG2D CAR-NK cells function by activating AKT signaling pathway. CONCLUSION Our findings provide a novel option for treating lung cancer using NKG2D-IL-21 CAR-NK cell therapy.
Collapse
|
5
|
Evaluation of vital genes correlated with CD8 + T cell infiltration as prognostic biomarkers in stomach adenocarcinoma. BMC Gastroenterol 2023; 23:399. [PMID: 37978443 PMCID: PMC10656896 DOI: 10.1186/s12876-023-03003-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 10/17/2023] [Indexed: 11/19/2023] Open
Abstract
BACKGROUND Infiltration of CD8 + T cells in the tumor microenvironment is correlated with better prognosis in various malignancies. Our study aimed to investigate vital genes correlated with CD8 + T cell infiltration in stomach adenocarcinoma (STAD) and develop a new prognostic model. METHODS Using the STAD dataset, differentially expressed genes (DEGs) were analyzed, and co-expression networks were constructed. Combined with the CIBERSORT algorithm, the most relevant module of WGCNA with CD8 + T cell infiltration was selected for subsequent analysis. The vital genes were screened out by univariate regression analysis to establish the risk score model. The expression of the viral genes was verified by lasso regression analysis and in vitro experiments. RESULTS Four CD8 + T cell infiltration-related genes (CIDEC, EPS8L3, MUC13, and PLEKHS1) were correlated with the prognosis of STAD. Based on these genes, a risk score model was established. We found that the risk score could well predict the prognosis of STAD, and the risk score was positively correlated with CD8 + T cell infiltration. The validation results of the gene expression were consistent with TCGA. Furthermore, the risk score was significantly higher in tumor tissues. The high-risk group had poorer overall survival (OS) in each subgroup. CONCLUSIONS Our study constructed a new risk score model for STAD prognosis, which may provide a new perspective to explore the tumor immune microenvironment mechanism in STAD.
Collapse
|
6
|
ITRPCA: a new model for computational drug repositioning based on improved tensor robust principal component analysis. Front Genet 2023; 14:1271311. [PMID: 37795241 PMCID: PMC10545866 DOI: 10.3389/fgene.2023.1271311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Accepted: 08/23/2023] [Indexed: 10/06/2023] Open
Abstract
Background: Drug repositioning is considered a promising drug development strategy with the goal of discovering new uses for existing drugs. Compared with the experimental screening for drug discovery, computational drug repositioning offers lower cost and higher efficiency and, hence, has become a hot issue in bioinformatics. However, there are sparse samples, multi-source information, and even some noises, which makes it difficult to accurately identify potential drug-associated indications. Methods: In this article, we propose a new scheme with improved tensor robust principal component analysis (ITRPCA) in multi-source data to predict promising drug-disease associations. First, we use a weighted k-nearest neighbor (WKNN) approach to increase the overall density of the drug-disease association matrix that will assist in prediction. Second, a drug tensor with five frontal slices and a disease tensor with two frontal slices are constructed using multi-similarity matrices and an updated association matrix. The two target tensors naturally integrate multiple sources of data from the drug-side aspect and the disease-side aspect, respectively. Third, ITRPCA is employed to isolate the low-rank tensor and noise information in the tensor. In this step, an additional range constraint is incorporated to ensure that all the predicted entry values of a low-rank tensor are within the specific interval. Finally, we focus on identifying promising drug indications by analyzing drug-disease association pairs derived from the low-rank drug and low-rank disease tensors. Results: We evaluate the effectiveness of the ITRPCA method by comparing it with five prominent existing drug repositioning methods. This evaluation is carried out using 10-fold cross-validation and independent testing experiments. Our numerical results show that ITRPCA not only yields higher prediction accuracy but also exhibits remarkable computational efficiency. Furthermore, case studies demonstrate the practical effectiveness of our method.
Collapse
|
7
|
Revealing the roles of TLR7, a nucleic acid sensor for COVID-19 in pan-cancer. BIOSAFETY AND HEALTH 2023:S2590-0536(23)00054-X. [PMID: 37362864 PMCID: PMC10167782 DOI: 10.1016/j.bsheal.2023.05.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 05/04/2023] [Accepted: 05/04/2023] [Indexed: 06/28/2023] Open
Abstract
Recent studies suggested that cancer was a risk factor for coronavirus disease 2019 (COVID-19). Toll-like receptor 7 (TLR7), a severe acute respiratory syndrome 2 (SARS-CoV-2) virus's nucleic acid sensor, was discovered to be aberrantly expressed in many types of cancers. However, its expression pattern across cancers and association with COVID-19 (or its causing virus SARS-CoV-2) has not been systematically studied. In this study, we proposed a computational framework to comprehensively study the roles of TLR7 in COVID-19 and pan-cancers at genetic, gene expression, protein, epigenetic, and single-cell levels. We applied the computational framework in a few databases, including The Cancer Genome Atlas (TCGA), The Genotype-Tissue Expression (GTEx), Cancer Cell Line Encyclopedia (CCLE), Human Protein Atlas (HPA), lung gene expression data of mice infected with SARS-CoV-2, and the like. As a result, TLR7 expression was found to be higher in the lung of mice infected with SARS-CoV-2 than that in the control group. The analysis in the Opentargets database also confirmed the association between TLR7 and COVID-19. There are also a few exciting findings in cancers. First, the most common type of TLR7 was "Missense" at the genomic level. Second, TLR7 mRNA expression was significantly up-regulated in 6 cancer types and down-regulated in 6 cancer types compared to normal tissues, further validated in the HPA database at the protein level. The genes significantly co-expressed with TLR7 were mainly enriched in the toll-like receptor signaling pathway, endolysosome, and signaling pattern recognition receptor activity. In addition, the abnormal TLR7 expression was associated with mismatch repair (MMR), microsatellite instability (MSI), and tumor mutational burden (TMB) in various cancers. Mined by the ESTIMATE algorithm, the expression of TLR7 was also closely linked to various immune infiltration patterns in pan-cancer, and TLR7 was mainly enriched in macrophages, as revealed by single-cell RNA sequencing. Third, abnormal expression of TLR7 could predict the survival of Brain Lower Grade Glioma (LGG), Lung adenocarcinoma (LUAD), Skin Cutaneous Melanoma (SKCM), Stomach adenocarcinoma (STAD), and Testicular Germ Cell Tumors (TGCT) patients, respectively. Finally, TLR7 expressions were very sensitive to a few targeted drugs, such as Alectinib and Imiquimod. In conclusion, TLR7 might be essential in the pathogenesis of COVID-19 and cancers.
Collapse
|
8
|
Application of oncolytic virus in tumor therapy. J Med Virol 2023; 95:e28729. [PMID: 37185868 DOI: 10.1002/jmv.28729] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Revised: 03/30/2023] [Accepted: 04/03/2023] [Indexed: 05/17/2023]
Abstract
Oncolytic viruses (OVs) can selectively kill tumor cells without affecting normal cells, as well as activate the innate and adaptive immune systems in patients. Thus, they have been considered as a promising measure for safe and effective cancer treatment. Recently, a few genetically engineered OVs have been developed to further improve the effect of tumor elimination by expressing specific immune regulatory factors and thus enhance the body's antitumor immunity. In addition, the combined therapies of OVs and other immunotherapies have been applied in clinical. Although there are many studies on this hot topic, a comprehensive review is missing on illustrating the mechanisms of tumor clearance by OVs and how to modify engineered OVs to further enhance their antitumor effects. In this study, we provided a review on the mechanisms of immune regulatory factors in OVs. In addition, we reviewed the combined therapies of OVs with other therapies including radiotherapy and CAR-T or TCR-T cell therapy. The review is useful in further generalize the usage of OV in cancer treatment.
Collapse
|
9
|
Comprehensive analysis of the correlation of the pan-cancer gene HAUS5 with prognosis and immune infiltration in liver cancer. Sci Rep 2023; 13:2409. [PMID: 36765148 PMCID: PMC9918732 DOI: 10.1038/s41598-023-28653-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 01/23/2023] [Indexed: 02/12/2023] Open
Abstract
Liver hepatocellular carcinoma (LIHC) is one of the most common malignancies and places a heavy burden on patients worldwide. HAUS augmin-like complex subunit 5 (HAUS5) is involved in the occurrence and development of various cancers. However, the functional role and significance of HAUS5 in LIHC remain unclear. The Cancer Genome Atlas (TCGA), Genotype-Tissue Expression (GTEx), Cancer Cell Line Encyclopedia (CCLE) and Gene Expression Omnibus (GEO) databases were used to analyze the mRNA expression of HAUS5. The value of HAUS5 in predicting LIHC prognosis and the relationship between HAUS5 and clinicopathological features were assessed by the Kaplan-Meier plotter and UALCAN databases. Functional enrichment analyses and nomogram prediction model construction were performed with the R packages. The LinkedOmics database was searched to reveal co-expressed genes associated with HAUS5. The relationship between HAUS5 expression and immune infiltration was explored by searching the TISIDB database and single-sample gene set enrichment analysis (ssGSEA). The Clinical Proteomic Tumor Analysis Consortium (CPTAC) and the Human Protein Atlas (HPA) databases were used to evaluate HAUS5 protein expression. Finally, the effect of HAUS5 on the proliferation of hepatoma cells was verified by CCK-8, colony formation and EdU assays. HAUS5 is aberrantly expressed and associated with a poor prognosis in most tumors, including LIHC. The expression of HAUS5 is significantly correlated with clinicopathological indicators in patients with LIHC. Functional enrichment analysis showed that HAUS5 was closely related to DNA replication, cell cycle and p53 signaling pathway. HAUS5 may serve as an independent risk factor for LIHC prognosis. The nomogram based on HAUS5 had area under the curve (AUC) values of 0.74 and 0.77 for predicting the 3-year and 5-year overall survival (OS) of LIHC patients. Immune correlation analysis showed that HAUS5 was significantly associated with immune infiltration. Finally, the results of in vitro experiments showed that when HAUS5 was knocked down, the proliferation of hepatoma cells was significantly decreased. The pan-oncogene HAUS5 is a positive regulator of LIHC progression and is closely associated with a poor prognosis in LIHC. Moreover, HAUS5 is involved in immune infiltration in LIHC. HAUS5 may be a new prognostic marker and therapeutic target for LIHC patients.
Collapse
|
10
|
Arachidonate lipoxygenases 5 is a novel prognostic biomarker and correlates with high tumor immune infiltration in low-grade glioma. Front Genet 2023; 14:1027690. [PMID: 36777735 PMCID: PMC9911666 DOI: 10.3389/fgene.2023.1027690] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 01/16/2023] [Indexed: 01/28/2023] Open
Abstract
Objective: To investigate the prognostic value of arachidonate lipoxygenases 5 (ALOX5) expression and methylation, and explore the immune functions of arachidonate lipoxygenases 5 expression in low-grade glioma (LGG). Materials and Methods: Using efficient bioinformatics approaches, the differential expression of arachidonate lipoxygenases 5 and the association of its expression with clinicopathological characteristics were evaluated. Then, we analyzed the prognostic significance of arachidonate lipoxygenases 5 expression and its methylation level followed by immune cell infiltration analysis. The functional enrichment analysis was conducted to determine the possible regulatory pathways of arachidonate lipoxygenases 5 in low-grade glioma. Finally, the drug sensitivity analysis was performed to explore the correlation between arachidonate lipoxygenases 5 expression and chemotherapeutic drugs. Results: arachidonate lipoxygenases 5 mRNA expression was increased in low-grade glioma and its expression had a notable relation with age and subtype (p < 0.05). The elevated mRNA level of arachidonate lipoxygenases 5 could independently predict the disease-specific survival (DSS), overall survival (OS), and progression-free interval (PFI) (p < 0.05). Besides, arachidonate lipoxygenases 5 expression was negatively correlated with its methylation level and the arachidonate lipoxygenases 5 hypomethylation led to a worse prognosis (p < 0.05). The arachidonate lipoxygenases 5 expression also showed a positive connection with immune cells, while low-grade glioma patients with higher immune cell infiltration had poor survival probability (p < 0.05). Further, arachidonate lipoxygenases 5 might be involved in immune- and inflammation-related pathways. Importantly, arachidonate lipoxygenases 5 expression was negatively related to drug sensitivity. Conclusion: arachidonate lipoxygenases 5 might be a promising biomarker, and it probably occupies a vital role in immune cell infiltration in low-grade glioma.
Collapse
|
11
|
Screening potential lncRNA biomarkers for breast cancer and colorectal cancer combining random walk and logistic matrix factorization. Front Genet 2023; 13:1023615. [PMID: 36744179 PMCID: PMC9895102 DOI: 10.3389/fgene.2022.1023615] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2022] [Accepted: 10/10/2022] [Indexed: 01/21/2023] Open
Abstract
Breast cancer and colorectal cancer are two of the most common malignant tumors worldwide. They cause the leading causes of cancer mortality. Many researches have demonstrated that long noncoding RNAs (lncRNAs) have close linkages with the occurrence and development of the two cancers. Therefore, it is essential to design an effective way to identify potential lncRNA biomarkers for them. In this study, we developed a computational method (LDA-RWLMF) by integrating random walk with restart and Logistic Matrix Factorization to investigate the roles of lncRNA biomarkers in the prognosis and diagnosis of the two cancers. We first fuse disease semantic and Gaussian association profile similarities and lncRNA functional and Gaussian association profile similarities. Second, we design a negative selection algorithm to extract negative LncRNA-Disease Associations (LDA) based on random walk. Third, we develop a logistic matrix factorization model to predict possible LDAs. We compare our proposed LDA-RWLMF method with four classical LDA prediction methods, that is, LNCSIM1, LNCSIM2, ILNCSIM, and IDSSIM. The results from 5-fold cross validation on the MNDR dataset show that LDA-RWLMF computes the best AUC value of 0.9312, outperforming the above four LDA prediction methods. Finally, we rank all lncRNA biomarkers for the two cancers after determining the performance of LDA-RWLMF, respectively. We find that 48 and 50 lncRNAs have the highest association scores with breast cancer and colorectal cancer among all lncRNAs known to associate with them on the MNDR dataset, respectively. We predict that lncRNAs HULC and HAR1A could be separately potential biomarkers for breast cancer and colorectal cancer and need to biomedical experimental validation.
Collapse
|
12
|
MNNMDA: Predicting human microbe-disease association via a method to minimize matrix nuclear norm. Comput Struct Biotechnol J 2023; 21:1414-1423. [PMID: 36824227 PMCID: PMC9941872 DOI: 10.1016/j.csbj.2022.12.053] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 12/29/2022] [Accepted: 12/30/2022] [Indexed: 01/03/2023] Open
Abstract
Identifying the potential associations between microbes and diseases is the first step for revealing the pathological mechanisms of microbe-associated diseases. However, traditional culture-based microbial experiments are expensive and time-consuming. Thus, it is critical to prioritize disease-associated microbes by computational methods for further experimental validation. In this study, we proposed a novel method called MNNMDA, to predict microbe-disease associations (MDAs) by applying a Matrix Nuclear Norm method into known microbe and disease data. Specifically, we first calculated Gaussian interaction profile kernel similarity and functional similarity for diseases and microbes. Then we constructed a heterogeneous information network by combining the integrated disease similarity network, the integrated microbe similarity network and the known microbe-disease bipartite network. Finally, we formulated the microbe-disease association prediction problem as a low-rank matrix completion problem, which was solved by minimizing the nuclear norm of a matrix with a few regularization terms. We tested the performances of MNNMDA in three datasets including HMDAD, Disbiome, and Combined Data with small, medium and large sizes respectively. We also compared MNNMDA with 5 state-of-the-art methods including KATZHMDA, LRLSHMDA, NTSHMDA, GATMDA, and KGNMDA, respectively. MNNMDA achieved area under the ROC curves (AUROC) of 0.9536 and 0.9364 respectively on HDMAD and Disbiome, better than the AUCs of compared methods under the 5-fold cross-validation for all microbe-disease associations. It also obtained a relatively good performance with AUROC 0.8858 in the combined data. In addition, MNNMDA was also better than other methods in area under precision and recall curve (AUPR) under the 5-fold cross-validation for all associations, and in both AUROC and AUPR under the 5-fold cross-validation for diseases and the 5-fold cross-validation for microbes. Finally, the case studies on colon cancer and inflammatory bowel disease (IBD) also validated the effectiveness of MNNMDA. In conclusion, MNNMDA is an effective method in predicting microbe-disease associations. Availability The codes and data for this paper are freely available at Github https://github.com/Haiyan-Liu666/MNNMDA.
Collapse
|
13
|
Cancer omic data based explainable AI drug recommendation inference: A traceability perspective for explainability. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
14
|
Graph neural network and multi-data heterogeneous networks for microbe-disease prediction. Front Microbiol 2022; 13:1077111. [PMID: 36620040 PMCID: PMC9814480 DOI: 10.3389/fmicb.2022.1077111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Accepted: 11/30/2022] [Indexed: 12/24/2022] Open
Abstract
The research on microbe association networks is greatly significant for understanding the pathogenic mechanism of microbes and promoting the application of microbes in precision medicine. In this paper, we studied the prediction of microbe-disease associations based on multi-data biological network and graph neural network algorithm. The HMDAD database provided a dataset that included 39 diseases, 292 microbes, and 450 known microbe-disease associations. We proposed a Microbe-Disease Heterogeneous Network according to the microbe similarity network, disease similarity network, and known microbe-disease associations. Furthermore, we integrated the network into the graph convolutional neural network algorithm and developed the GCNN4Micro-Dis model to predict microbe-disease associations. Finally, the performance of the GCNN4Micro-Dis model was evaluated via 5-fold cross-validation. We randomly divided all known microbe-disease association data into five groups. The results showed that the average AUC value and standard deviation were 0.8954 ± 0.0030. Our model had good predictive power and can help identify new microbe-disease associations. In addition, we compared GCNN4Micro-Dis with three advanced methods to predict microbe-disease associations, KATZHMDA, BiRWHMDA, and LRLSHMDA. The results showed that our method had better prediction performance than the other three methods. Furthermore, we selected breast cancer as a case study and found the top 12 microbes related to breast cancer from the intestinal flora of patients, which further verified the model's accuracy.
Collapse
|
15
|
Drug response prediction using graph representation learning and Laplacian feature selection. BMC Bioinformatics 2022; 23:532. [PMID: 36494630 PMCID: PMC9733001 DOI: 10.1186/s12859-022-05080-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2022] [Accepted: 11/22/2022] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Knowing the responses of a patient to drugs is essential to make personalized medicine practical. Since the current clinical drug response experiments are time-consuming and expensive, utilizing human genomic information and drug molecular characteristics to predict drug responses is of urgent importance. Although a variety of computational drug response prediction methods have been proposed, their effectiveness is still not satisfying. RESULTS In this study, we propose a method called LGRDRP (Learning Graph Representation for Drug Response Prediction) to predict cell line-drug responses. At first, LGRDRP constructs a heterogeneous network integrating multiple kinds of information: cell line miRNA expression profiles, drug chemical structure similarity, gene-gene interaction, cell line-gene interaction and known cell line-drug responses. Then, for each cell line, learning graph representation and Laplacian feature selection are combined to obtain network topology features related to the cell line. The learning graph representation method learns network topology structure features, and the Laplacian feature selection method further selects out some most important ones from them. Finally, LGRDRP trains an SVM model to predict drug responses based on the selected features of the known cell line-drug responses. Our five-fold cross-validation results show that LGRDRP is significantly superior to the art-of-the-state methods in the measures of the average area under the receiver operating characteristics curve, the average area under the precision-recall curve and the recall rate of top-k predicted sensitive cell lines. CONCLUSIONS Our results demonstrated that the usage of multiple types of information about cell lines and drugs, the learning graph representation method, and the Laplacian feature selection is useful to the improvement of performance in predicting drug responses. We believe that such an approach would be easily extended to similar problems such as miRNA-disease relationship inference.
Collapse
|
16
|
Drug repositioning for SARS-CoV-2 by Gaussian kernel similarity bilinear matrix factorization. Front Microbiol 2022; 13:1062281. [PMID: 36545200 PMCID: PMC9762482 DOI: 10.3389/fmicb.2022.1062281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 11/21/2022] [Indexed: 12/12/2022] Open
Abstract
Coronavirus disease 2019 (COVID-19), a disease caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), is currently spreading rapidly around the world. Since SARS-CoV-2 seriously threatens human life and health as well as the development of the world economy, it is very urgent to identify effective drugs against this virus. However, traditional methods to develop new drugs are costly and time-consuming, which makes drug repositioning a promising exploration direction for this purpose. In this study, we collected known antiviral drugs to form five virus-drug association datasets, and then explored drug repositioning for SARS-CoV-2 by Gaussian kernel similarity bilinear matrix factorization (VDA-GKSBMF). By the 5-fold cross-validation, we found that VDA-GKSBMF has an area under curve (AUC) value of 0.8851, 0.8594, 0.8807, 0.8824, and 0.8804, respectively, on the five datasets, which are higher than those of other state-of-art algorithms in four datasets. Based on known virus-drug association data, we used VDA-GKSBMF to prioritize the top-k candidate antiviral drugs that are most likely to be effective against SARS-CoV-2. We confirmed that the top-10 drugs can be molecularly docked with virus spikes protein/human ACE2 by AutoDock on five datasets. Among them, four antiviral drugs ribavirin, remdesivir, oseltamivir, and zidovudine have been under clinical trials or supported in recent literatures. The results suggest that VDA-GKSBMF is an effective algorithm for identifying potential antiviral drugs against SARS-CoV-2.
Collapse
|
17
|
Editorial: Machine learning-based methods for RNA data analysis—Volume II. Front Genet 2022; 13:1010089. [DOI: 10.3389/fgene.2022.1010089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Accepted: 09/20/2022] [Indexed: 12/02/2022] Open
|
18
|
DGHNE: network enhancement-based method in identifying disease-causing genes through a heterogeneous biomedical network. Brief Bioinform 2022; 23:6712302. [PMID: 36151744 DOI: 10.1093/bib/bbac405] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 08/01/2022] [Accepted: 08/21/2022] [Indexed: 12/14/2022] Open
Abstract
The identification of disease-causing genes is critical for mechanistic understanding of disease etiology and clinical manipulation in disease prevention and treatment. Yet the existing approaches in tackling this question are inadequate in accuracy and efficiency, demanding computational methods with higher identification power. Here, we proposed a new method called DGHNE to identify disease-causing genes through a heterogeneous biomedical network empowered by network enhancement. First, a disease-disease association network was constructed by the cosine similarity scores between phenotype annotation vectors of diseases, and a new heterogeneous biomedical network was constructed by using disease-gene associations to connect the disease-disease network and gene-gene network. Then, the heterogeneous biomedical network was further enhanced by using network embedding based on the Gaussian random projection. Finally, network propagation was used to identify candidate genes in the enhanced network. We applied DGHNE together with five other methods into the most updated disease-gene association database termed DisGeNet. Compared with all other methods, DGHNE displayed the highest area under the receiver operating characteristic curve and the precision-recall curve, as well as the highest precision and recall, in both the global 5-fold cross-validation and predicting new disease-gene associations. We further performed DGHNE in identifying the candidate causal genes of Parkinson's disease and diabetes mellitus, and the genes connecting hyperglycemia and diabetes mellitus. In all cases, the predicted causing genes were enriched in disease-associated gene ontology terms and Kyoto Encyclopedia of Genes and Genomes pathways, and the gene-disease associations were highly evidenced by independent experimental studies.
Collapse
|
19
|
ICSDA: a multi-modal deep learning model to predict breast cancer recurrence and metastasis risk by integrating pathological, clinical and gene expression data. Brief Bioinform 2022; 23:6761046. [PMID: 36242564 DOI: 10.1093/bib/bbac448] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 07/18/2022] [Accepted: 07/18/2022] [Indexed: 12/14/2022] Open
Abstract
Breast cancer patients often have recurrence and metastasis after surgery. Predicting the risk of recurrence and metastasis for a breast cancer patient is essential for the development of precision treatment. In this study, we proposed a novel multi-modal deep learning prediction model by integrating hematoxylin & eosin (H&E)-stained histopathological images, clinical information and gene expression data. Specifically, we segmented tumor regions in H&E into image blocks (256 × 256 pixels) and encoded each image block into a 1D feature vector using a deep neural network. Then, the attention module scored each area of the H&E-stained images and combined image features with clinical and gene expression data to predict the risk of recurrence and metastasis for each patient. To test the model, we downloaded all 196 breast cancer samples from the Cancer Genome Atlas with clinical, gene expression and H&E information simultaneously available. The samples were then divided into the training and testing sets with a ratio of 7: 3, in which the distributions of the samples were kept between the two datasets by hierarchical sampling. The multi-modal model achieved an area-under-the-curve value of 0.75 on the testing set better than those based solely on H&E image, sequencing data and clinical data, respectively. This study might have clinical significance in identifying high-risk breast cancer patients, who may benefit from postoperative adjuvant treatment.
Collapse
|
20
|
Predicting colorectal cancer tumor mutational burden from histopathological images and clinical information using multi-modal deep learning. Bioinformatics 2022; 38:5108-5115. [PMID: 36130268 DOI: 10.1093/bioinformatics/btac641] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Revised: 08/31/2022] [Accepted: 09/20/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Tumor mutational burden (TMB) is an indicator of the efficacy and prognosis of immune checkpoint therapy in colorectal cancer (CRC). In general, patients with higher TMB values are more likely to benefit from immunotherapy. Though whole-exome sequencing is considered the gold standard for determining TMB, it is difficult to be applied in clinical practice due to its high cost. There are also a few DNA panel-based methods to estimate TMB; however, their detection cost is also high, and the associated wet-lab experiments usually take days, which emphasize the need for faster and cheaper alternatives. RESULTS In this study, we propose a multi-modal deep learning model based on a residual network (ResNet) and multi-modal compact bilinear pooling to predict TMB status (i.e. TMB high (TMB_H) or TMB low(TMB_L)) directly from histopathological images and clinical data. We applied the model to CRC data from The Cancer Genome Atlas and compared it with four other popular methods, namely, ResNet18, ResNet50, VGG19 and AlexNet. We tested different TMB thresholds, namely, percentiles of 10%, 14.3%, 15%, 16.3%, 20%, 30% and 50%, to differentiate TMB_H and TMB_L.For the percentile of 14.3% (i.e. TMB value 20) and ResNet18, our model achieved an area under the receiver operating characteristic curve of 0.817 after 5-fold cross-validation, which was better than that of other compared models. In addition, we also found that TMB values were significantly associated with the tumor stage and N and M stages. Our study shows that deep learning models can predict TMB status from histopathological images and clinical information only, which is worth clinical application.
Collapse
|
21
|
NeRD: a multichannel neural network to predict cellular response of drugs by integrating multidimensional data. BMC Med 2022; 20:368. [PMID: 36244991 PMCID: PMC9575288 DOI: 10.1186/s12916-022-02549-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/02/2022] [Accepted: 09/01/2022] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND Considering the heterogeneity of tumors, it is a key issue in precision medicine to predict the drug response of each individual. The accumulation of various types of drug informatics and multi-omics data facilitates the development of efficient models for drug response prediction. However, the selection of high-quality data sources and the design of suitable methods remain a challenge. METHODS In this paper, we design NeRD, a multidimensional data integration model based on the PRISM drug response database, to predict the cellular response of drugs. Four feature extractors, including drug structure extractor (DSE), molecular fingerprint extractor (MFE), miRNA expression extractor (mEE), and copy number extractor (CNE), are designed for different types and dimensions of data. A fully connected network is used to fuse all features and make predictions. RESULTS Experimental results demonstrate the effective integration of the global and local structural features of drugs, as well as the features of cell lines from different omics data. For all metrics tested on the PRISM database, NeRD surpassed previous approaches. We also verified that NeRD has strong reliability in the prediction results of new samples. Moreover, unlike other algorithms, when the amount of training data was reduced, NeRD maintained stable performance. CONCLUSIONS NeRD's feature fusion provides a new idea for drug response prediction, which is of great significance for precise cancer treatment.
Collapse
|
22
|
MAGEA11 as a STAD Prognostic Biomarker Associated with Immune Infiltration. Diagnostics (Basel) 2022; 12:diagnostics12102506. [PMID: 36292195 PMCID: PMC9600629 DOI: 10.3390/diagnostics12102506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 09/23/2022] [Accepted: 10/11/2022] [Indexed: 11/17/2022] Open
Abstract
Expression of MAGE family member A11 (MAGEA11) is upregulated in different tumors. However, in gastric cancer, the prognostic significance of MAGEA11 and its relationship with immune infiltration remain largely unknown. The expression of MAGEA11 in pan-cancer and the receiver operating characteristic (ROC) and survival impact of gastric cancer were evaluated by The Cancer Genome Atlas (TCGA). Whether MAGEA11 was an independent risk factor was assessed by Cox analysis. Nomograms were constructed from MAGEA11 and clinical variables. Gene functional pathway enrichment was obtained based on MAGEA11 differential analysis. The relationship between MAGEA11 and immune infiltration was determined by the Tumor Immunity Estimation Resource (TIMER) and the Tumor Immune System Interaction Database (TISIDB). Finally, MAGEA11-sensitive drugs were predicted based on the CellMiner database. The results showed that the expression of MAGEA11 mRNA in gastric cancer tissues was significantly higher than that in normal tissues. The ROC curve indicated an AUC value of 0.667. Survival analysis showed that patients with high MAGEA11 had poor prognosis (HR = 1.43, p = 0.034). In correlation analysis, MAGEA11 mRNA expression was found to be associated with tumor purity and immune invasion. Finally, drug sensitivity analysis found that the expression of MAGEA11 was correlated with seven drugs. Our study found that upregulated MAGEA11 in gastric cancer was significantly associated with lower survival and invasion by immune infiltration. It is suggested that MAGEA11 may be a potential biomarker and immunotherapy target for gastric cancer.
Collapse
|
23
|
A Novel Algorithm for Feature Selection Using Penalized Regression with Applications to Single-Cell RNA Sequencing Data. BIOLOGY 2022; 11:biology11101495. [PMID: 36290397 PMCID: PMC9598401 DOI: 10.3390/biology11101495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Revised: 09/21/2022] [Accepted: 09/30/2022] [Indexed: 11/05/2022]
Abstract
With the emergence of single-cell RNA sequencing (scRNA-seq) technology, scientists are able to examine gene expression at single-cell resolution. Analysis of scRNA-seq data has its own challenges, which stem from its high dimensionality. The method of machine learning comes with the potential of gene (feature) selection from the high-dimensional scRNA-seq data. Even though there exist multiple machine learning methods that appear to be suitable for feature selection, such as penalized regression, there is no rigorous comparison of their performances across data sets, where each poses its own challenges. Therefore, in this paper, we analyzed and compared multiple penalized regression methods for scRNA-seq data. Given the scRNA-seq data sets we analyzed, the results show that sparse group lasso (SGL) outperforms the other six methods (ridge, lasso, elastic net, drop lasso, group lasso, and big lasso) using the metrics area under the receiver operating curve (AUC) and computation time. Building on these findings, we proposed a new algorithm for feature selection using penalized regression methods. The proposed algorithm works by selecting a small subset of genes and applying SGL to select the differentially expressed genes in scRNA-seq data. By using hierarchical clustering to group genes, the proposed method bypasses the need for domain-specific knowledge for gene grouping information. In addition, the proposed algorithm provided consistently better AUC for the data sets used.
Collapse
|
24
|
A bioinformatics framework to identify the biomarkers and potential drugs for the treatment of colorectal cancer. Front Genet 2022; 13:1017539. [PMID: 36238159 PMCID: PMC9551025 DOI: 10.3389/fgene.2022.1017539] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 09/08/2022] [Indexed: 11/13/2022] Open
Abstract
Colorectal cancer (CRC), a common malignant tumor, is one of the main causes of death in cancer patients in the world. Therefore, it is critical to understand the molecular mechanism of CRC and identify its diagnostic and prognostic biomarkers. The purpose of this study is to reveal the genes involved in the development of CRC and to predict drug candidates that may help treat CRC through bioinformatics analyses. Two independent CRC gene expression datasets including The Cancer Genome Atlas (TCGA) database and GSE104836 were used in this study. Differentially expressed genes (DEGs) were analyzed separately on the two datasets, and intersected for further analyses. 249 drug candidates for CRC were identified according to the intersected DEGs and the Crowd Extracted Expression of Differential Signatures (CREEDS) database. In addition, hub genes were analyzed using Cytoscape according to the DEGs, and survival analysis results showed that one of the hub genes, TIMP1 was related to the prognosis of CRC patients. Thus, we further focused on drugs that could reverse the expression level of TIMP1. Eight potential drugs with documentary evidence and two new drugs that could reverse the expression of TIMP1 were found among the 249 drugs. In conclusion, we successfully identified potential biomarkers for CRC and achieved drug repurposing using bioinformatics methods. Further exploration is needed to understand the molecular mechanisms of these identified genes and drugs/small molecules in the occurrence, development and treatment of CRC.
Collapse
|
25
|
Identifying potential microRNA biomarkers for colon cancer and colorectal cancer through bound nuclear norm regularization. Front Genet 2022; 13:980437. [PMID: 36313468 PMCID: PMC9614659 DOI: 10.3389/fgene.2022.980437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 08/01/2022] [Indexed: 11/17/2022] Open
Abstract
Colon cancer and colorectal cancer are two common cancer-related deaths worldwide. Identification of potential biomarkers for the two cancers can help us to evaluate their initiation, progression and therapeutic response. In this study, we propose a new microRNA-disease association identification method, BNNRMDA, to discover potential microRNA biomarkers for the two cancers. BNNRMDA better combines disease semantic similarity and Gaussian Association Profile Kernel (GAPK) similarity, microRNA function similarity and GAPK similarity, and the bound nuclear norm regularization model. Compared to other five classical microRNA-disease association identification methods (MIDPE, MIDP, RLSMDA, GRNMF, AND LPLNS), BNNRMDA obtains the highest AUC of 0.9071, demonstrating its strong microRNA-disease association identification performance. BNNRMDA is applied to discover possible microRNA biomarkers for colon cancer and colorectal cancer. The results show that all 73 known microRNAs associated with colon cancer in the HMDD database have the highest association scores with colon cancer and are ranked as top 73. Among 137 known microRNAs associated with colorectal cancer in the HMDD database, 129 microRNAs have the highest association scores with colorectal cancer and are ranked as top 129. In addition, we predict that hsa-miR-103a could be a potential biomarker of colon cancer and hsa-mir-193b and hsa-mir-7days could be potential biomarkers of colorectal cancer.
Collapse
|
26
|
Predicting recurrence and metastasis risk of endometrial carcinoma via prognostic signatures identified from multi-omics data. Front Oncol 2022; 12:982452. [PMID: 36059678 PMCID: PMC9438970 DOI: 10.3389/fonc.2022.982452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 08/03/2022] [Indexed: 11/13/2022] Open
Abstract
ObjectivesEndometrial carcinoma (EC) is one of the three major gynecological malignancies, in which 15% - 20% patients will have recurrence and metastasis. Though there are many studies on the prognosis on this cancer, the performances of existing models evaluating the risk of its recurrence and metastasis are yet to be improved. In addition, a comprehensive multi-omics analyses on the prognostic signatures of EC are on demand. In this study, we aimed to construct a relatively stable and reliable model for predicting recurrence and metastasis of EC. This will help determine the risk level of patients and choose appropriate adjuvant therapy, thereby avoiding improper treatment, and improving the prognosis of patients.MethodsThe mRNA, microRNA (miRNA), long non-coding RNA (lncRNA), copy number variation (CNV) data and clinical information of patients with EC were downloaded from The Cancer Genome Atlas (TCGA). Differential expression analyses were performed between the recurrence or metastasis group and the non-recurrence/metastasis group. Then, we screened potential prognostic markers from the four kinds of omics data respectively and established prediction models using three classifiers.ResultsWe achieved differential expressed mRNAs, lncRNAs, miRNAs and CNVs between the two groups. According to feature selection scores by the random forest algorithm, 275 CNV features, 50 lncRNA features, 150 miRNA features and 150 mRNA features were selected, respectively. And the prediction model constructed by the features of lncRNA data using random forest method showed the best performance, with an area under the curve of 0.763, and an accuracy of 0.819 under 10-fold cross-validation.ConclusionWe developed a computational model using omics information, which is able to predicting recurrence and metastasis risk of EC accurately.
Collapse
|
27
|
Discussion of tumor mutation burden as an indicator to predict efficacy of immune checkpoint inhibitors: A case report. Front Oncol 2022; 12:939022. [PMID: 35992799 PMCID: PMC9381827 DOI: 10.3389/fonc.2022.939022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2022] [Accepted: 07/08/2022] [Indexed: 12/29/2022] Open
Abstract
There are many treatment options for advanced lung cancer, among which immunotherapy has developed rapidly and benefited a lot of patients. However, immunotherapy can only benefit a subgroup of patients, and how to select patients suitable for this therapy is critical. Tumor mutation burden (TMB) is one of the important reference indicators for immune checkpoint inhibitors (ICIs). However, there are many factors influencing the usage of this indicator, which will lead to considerable consequences if not treated well. In this study, we performed a case study on a male advanced lung squamous cell carcinoma patient of age 83. The patient suffered from “cough and sputum”, and did chest CT scans on 24 October 2018, which showed “a mass-like mass in the anterior segment of the right lung upper lobe, about 38mm×28mm”. He was treated with systemic chemotherapy; however, the tumor was still under progression. Although PD-L1 was not tested in gene testing, he had a TMB value of 10.26 mutations/Mb with a quantile value 88.63%. Thus, “toripalimab injection” was added as immunotherapy and the size of the lesion decreased. In summary, we adopted a clinical case as the basis to explore the value and significance of TMB in immunotherapy in this study. We hope that more predictive molecular markers will be discovered, which will bring more treatment methods for advanced lung cancer.
Collapse
|
28
|
Finding Lung-Cancer-Related lncRNAs Based on Laplacian Regularized Least Squares With Unbalanced Bi-Random Walk. Front Genet 2022; 13:933009. [PMID: 35938010 PMCID: PMC9355720 DOI: 10.3389/fgene.2022.933009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Accepted: 06/03/2022] [Indexed: 11/13/2022] Open
Abstract
Lung cancer is one of the leading causes of cancer-related deaths. Thus, it is important to find its biomarkers. Furthermore, there is an increasing number of studies reporting that long noncoding RNAs (lncRNAs) demonstrate dense linkages with multiple human complex diseases. Inferring new lncRNA-disease associations help to identify potential biomarkers for lung cancer and further understand its pathogenesis, design new drugs, and formulate individualized therapeutic options for lung cancer patients. This study developed a computational method (LDA-RLSURW) by integrating Laplacian regularized least squares and unbalanced bi-random walk to discover possible lncRNA biomarkers for lung cancer. First, the lncRNA and disease similarities were computed. Second, unbalanced bi-random walk was, respectively, applied to the lncRNA and disease networks to score associations between diseases and lncRNAs. Third, Laplacian regularized least squares were further used to compute the association probability between each lncRNA-disease pair based on the computed random walk scores. LDA-RLSURW was compared using 10 classical LDA prediction methods, and the best AUC value of 0.9027 on the lncRNADisease database was obtained. We found the top 30 lncRNAs associated with lung cancers and inferred that lncRNAs TUG1, PTENP1, and UCA1 may be biomarkers of lung neoplasms, non-small–cell lung cancer, and LUAD, respectively.
Collapse
|
29
|
Lung Cancer Stage Prediction Using Multi-Omics Data. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:2279044. [PMID: 35880092 PMCID: PMC9308511 DOI: 10.1155/2022/2279044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 06/27/2022] [Indexed: 12/24/2022]
Abstract
Lung cancer is one of the leading causes of cancer death. Patients with early-stage lung cancer can be treated by surgery, while patients in the middle and late stages need chemotherapy or radiotherapy. Therefore, accurate staging of lung cancer is crucial for doctors to formulate accurate treatment plans for patients. In this paper, the random forest algorithm is used as the lung cancer stage prediction model, and the accuracy of lung cancer stage prediction is discussed in the microbiome, transcriptome, microbe, and transcriptome fusion groups, and the accuracy of the model is measured by indicators such as ACC, recall, and precision. The results showed that the prediction accuracy of microbial combinatorial transcriptome fusion analysis was the highest, reaching 0.809. The study reveals the role of multimodal data and fusion algorithm in accurately diagnosing lung cancer stage, which could aid doctors in clinics.
Collapse
|
30
|
D3K: The Dissimilarity-Density-Dynamic Radius K-means Clustering Algorithm for scRNA-Seq Data. Front Genet 2022; 13:912711. [PMID: 35846121 PMCID: PMC9284269 DOI: 10.3389/fgene.2022.912711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Accepted: 04/25/2022] [Indexed: 12/02/2022] Open
Abstract
A single-cell sequencing data set has always been a challenge for clustering because of its high dimension and multi-noise points. The traditional K-means algorithm is not suitable for this type of data. Therefore, this study proposes a Dissimilarity-Density-Dynamic Radius-K-means clustering algorithm. The algorithm adds the dynamic radius parameter to the calculation. It flexibly adjusts the active radius according to the data characteristics, which can eliminate the influence of noise points and optimize the clustering results. At the same time, the algorithm calculates the weight through the dissimilarity density of the data set, the average contrast of candidate clusters, and the dissimilarity of candidate clusters. It obtains a set of high-quality initial center points, which solves the randomness of the K-means algorithm in selecting the center points. Finally, compared with similar algorithms, this algorithm shows a better clustering effect on single-cell data. Each clustering index is higher than other single-cell clustering algorithms, which overcomes the shortcomings of the traditional K-means algorithm.
Collapse
|
31
|
Informative SNP Selection Based on a Fuzzy Clustering and Improved Binary Particle Swarm Optimization Algorithm. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:3837579. [PMID: 35756402 PMCID: PMC9225903 DOI: 10.1155/2022/3837579] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Revised: 04/14/2022] [Accepted: 04/30/2022] [Indexed: 12/04/2022]
Abstract
Single-nucleotide polymorphism (SNP) involves the replacement of a single nucleotide in a deoxyribonucleic acid (DNA) sequence and is often linked to the development of specific diseases. Although current genotyping methods can tag SNP loci within biological samples to provide accurate genetic information for a disease associated, they have limited prediction accuracy. Furthermore, they are complex to perform and may result in the prediction of an excessive number of tag SNP loci, which may not always be associated with the disease. Therefore in this manuscript, we aimed to evaluate the impact of a newly optimized fuzzy clustering and binary particle swarm optimization algorithm (FCBPSO) on the accuracy and running time of informative SNP selection. Fuzzy clustering and FCBPSO were first applied to identify the equivalence relation and the candidate tag SNP set to reduce the redundancy between loci. The FCBPSO algorithm was then optimized and used to obtain the final tag SNP set. The prediction performance and running time of the newly developed model were compared with other traditional methods, including NMC, SPSO, and MCMR. The prediction accuracy of the FCBPSO algorithm was always higher than that of the other algorithms especially as the number of tag SNPs increased. However, when the number of tag SNPs was low, the prediction accuracy of FCBPSO was slightly lower than that of MCMR (add prediction accuracy values for each algorithm). However, the running time of the FCBPSO algorithm was always lower than that of MCMR. FCBPSO not only reduced the size and dimension of the optimization problem but also simplified the training of the prediction model. This improved the prediction accuracy of the model and reduced the running time when compared with other traditional methods.
Collapse
|
32
|
Predicting Tumor Mutational Burden From Lung Adenocarcinoma Histopathological Images Using Deep Learning. Front Oncol 2022; 12:927426. [PMID: 35756617 PMCID: PMC9213738 DOI: 10.3389/fonc.2022.927426] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Accepted: 05/16/2022] [Indexed: 11/25/2022] Open
Abstract
Tumor mutation burden (TMB) is an important biomarker for tumor immunotherapy. It plays an important role in the clinical treatment process, but the gold standard measurement of TMB is based on whole exome sequencing (WES). WES cannot be done in most hospitals due to its high cost, long turnaround times and operational complexity. To seek out a better method to evaluate TMB, we divided the patients with lung adenocarcinoma (LUAD) in TCGA into two groups according to the TMB value, then analyzed the differences of clinical characteristics and gene expression between the two groups. We further explored the possibility of using histopathological images to predict TMB status, and developed a deep learning model to predict TMB based on histopathological images of LUAD. In the 5-fold cross-validation, the area under the receiver operating characteristic (ROC) curve (AUC) of the model was 0.64. This study showed that it is possible to use deep learning to predict genomic features from histopathological images, though the prediction accuracy was relatively low. The study opens up a new way to explore the relationship between genes and phenotypes.
Collapse
|
33
|
A Framework to Predict the Molecular Classification and Prognosis of Breast Cancer Patients and Characterize the Landscape of Immune Cell Infiltration. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:4635806. [PMID: 35720039 PMCID: PMC9201713 DOI: 10.1155/2022/4635806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/28/2021] [Revised: 03/25/2022] [Accepted: 05/16/2022] [Indexed: 11/27/2022]
Abstract
It is known that all current cancer therapies can only benefit a limited proportion of patients; thus, molecular classification and prognosis evaluation are critical for correctly classifying breast cancer patients and selecting the best treatment strategy. These processes usually involve the disclosure of molecular information like mutation, expression, and immune microenvironment of a breast cancer patient, which are not been fully studied until now. Therefore, there is an urgent clinical need to identify potential markers to enhance molecular classification, precision prognosis, and therapy stratification for breast cancer patients. In this study, we explored the gene expression profiles of 1,721 breast cancer patients through CIBERSORT and ESTIMATE algorithms; then, we obtained a comprehensive intratumoral immune landscape. The immune cell infiltration (ICI) patterns of breast cancer were classified into 3 separate subtypes according to the infiltration levels of 22 immune cells. The differentially expressed genes between these subtypes were further identified, and ICI scores were calculated to assess the immune landscape of BRCA patients. Importantly, we demonstrated that ICI scores correlate with patients' survival, tumor mutation burden, neoantigens, and sensitivity to specific drugs. Based on these ICI scores, we were able to predict the prognosis of patients and their response to immunotherapy. Together, these findings provide a realistic scenario to stratify breast cancer patients for precision medicine.
Collapse
|
34
|
In Vivo Modeling of Human Breast Cancer Using Cell Line and Patient-Derived Xenografts. J Mammary Gland Biol Neoplasia 2022; 27:211-230. [PMID: 35697909 PMCID: PMC9433358 DOI: 10.1007/s10911-022-09520-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Accepted: 05/19/2022] [Indexed: 11/24/2022] Open
Abstract
Historically, human breast cancer has been modeled largely in vitro using long-established cell lines primarily in two-dimensional culture, but also in three-dimensional cultures of varying cellular and molecular complexities. A subset of cell line models has also been used in vivo as cell line-derived xenografts (CDX). While outstanding for conducting detailed molecular analysis of regulatory mechanisms that may function in vivo, results of drug response studies using long-established cell lines have largely failed to translate clinically. In an attempt to address this shortcoming, many laboratories have succeeded in developing clinically annotated patient-derived xenograft (PDX) models of human cancers, including breast, in a variety of host systems. While immunocompromised mice are the predominant host, the immunocompromised rat and pig, zebrafish, as well as the chicken egg chorioallantoic membrane (CAM) have also emerged as potential host platforms to help address perceived shortcomings of immunocompromised mice. With any modeling platform, the two main issues to be resolved are criteria for "credentialing" the models as valid models to represent human cancer, and utility with respect to the ability to generate clinically relevant translational research data. Such data are beginning to emerge, particularly with the activities of PDX consortia such as the NCI PDXNet Program, EuroPDX, and the International Breast Cancer Consortium, as well as a host of pharmaceutical companies and contract research organizations (CRO). This review focuses primarily on these important aspects of PDX-related research, with a focus on breast cancer.
Collapse
|
35
|
Computational drug repositioning using similarity constrained weight regularization matrix factorization: A case of COVID-19. J Cell Mol Med 2022; 26:3772-3782. [PMID: 35644992 PMCID: PMC9258716 DOI: 10.1111/jcmm.17412] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 02/03/2022] [Accepted: 05/11/2022] [Indexed: 02/06/2023] Open
Abstract
Amid the COVID‐19 crisis, we put sizeable efforts to collect a high number of experimentally validated drug–virus association entries from literature by text mining and built a human drug–virus association database. To the best of our knowledge, it is the largest publicly available drug–virus database so far. Next, we develop a novel weight regularization matrix factorization approach, termed WRMF, for in silico drug repurposing by integrating three networks: the known drug–virus association network, the drug–drug chemical structure similarity network, and the virus–virus genomic sequencing similarity network. Specifically, WRMF adds a weight to each training sample for reducing the influence of negative samples (i.e. the drug–virus association is unassociated). A comparison on the curated drug–virus database shows that WRMF performs better than a few state‐of‐the‐art methods. In addition, we selected the other two different public datasets (i.e. Cdataset and HMDD V2.0) to assess WRMF's performance. The case study also demonstrated the accuracy and reliability of WRMF to infer potential drugs for the novel virus. In summary, we offer a useful tool including a novel drug–virus association database and a powerful method WRMF to repurpose potential drugs for new viruses.
Collapse
|
36
|
Identification of Novel Immune Ferropotosis-Related Genes Associated With Clinical and Prognostic Features in Gastric Cancer. Front Oncol 2022; 12:904304. [PMID: 35664744 PMCID: PMC9157572 DOI: 10.3389/fonc.2022.904304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Accepted: 04/19/2022] [Indexed: 12/08/2022] Open
Abstract
Background Gastric cancer (GC) is the fifth commonest cancer and the third commonest reason of death causing by cancer worldwide. Currently, tumor immunology and ferropotosis develop rapidly that has made gastric cancer be treated in new directions. So, finding the potential targets and prognostic biomarkers for immunotherapy combined with ferropotosis is urgent. Methods By mining TCGA, immune-related genes, ferropotosis-related genes and immune-ferropotosis-related differentially expressed genes (IFR-DEGs) were identified. The independent prognostic value of IFR-DEGs was determined by differential expression analysis, prognostic analysis, and univariate and lasso regression analysis. Then, based on the prognostic risk model, the correlation between IFR-DEGs and immune scores, immune checkpoints were evaluated. Besides, we predicted the response of high and low risk groups to drugs. Results A 15-gene prognostic feature was constructed. The high-risk group had a poorer prognosis than the low-risk group. High-risk group had higher level of Treg immune cell infiltration compared with that in the low-risk group, and the tumor purity, immune checkpoint PD-1 and CTLA4, and immunity in the high-risk group were higher than those in the low-risk group. These results indicate that immune ferropotosis-related genes migh be potential predictors of STAD's response to ICI immunotherapy biomarkers. In addition, the response of small molecule drugs such as Nilotini, Sunitinib, Imatinib, etc. for high and low risk groups was predicted. Conclusion IFRSig can be regarded as an independent prognostic feature and may estimate OS and clinical treatment response in patients with STAD. IFRSig also has important correlation with immune microenvironment. A new understanding of the immune-ferropotosis-related genes during the occurrence and development of STAD is provided in this study.
Collapse
|
37
|
Plasma Exosomes Transfer miR-885-3p Targeting the AKT/NFκB Signaling Pathway to Improve the Sensitivity of Intravenous Glucocorticoid Therapy Against Graves Ophthalmopathy. Front Immunol 2022; 13:819680. [PMID: 35265076 PMCID: PMC8900193 DOI: 10.3389/fimmu.2022.819680] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Accepted: 01/20/2022] [Indexed: 12/13/2022] Open
Abstract
Graves ophthalmopathy (GO), a manifestation of Graves' disease, is an organ-specific autoimmune disease. Intravenous glucocorticoid therapy (ivGCs) is the first-line treatment for moderate-to-severe and active GO. However, ivGCs is only effective in 70%-80% of GO patients. Insensitive patients who choose 12-week ivGCs not only were delayed in treatment but also took the risk of adverse reactions of glucocorticoids. At present, there is still a lack of effective indicators to predict the therapeutic effect of ivGCs. Therefore, the purpose of this study is to find biomarkers that can determine the sensitivity of ivGCs before the formulation of treatment, and to clarify the mechanism of its regulation of ivGCs sensitivity. This study first characterized the miRNA profiles of plasma exosomes by miRNA sequencing to identify miRNAs differentially expressed between GO patients with significant improvement (SI) and non-significant improvement (NSI) after ivGCs treatment. Subsequently, we analyzed the function of the predicted target genes of differential miRNAs. According to the function of the target genes, we screened 10 differentially expressed miRNAs. An expanded cohort verification showed that compared with NSI patients, mir-885-3p was upregulated and mir-4474-3p and mir-615-3p were downregulated in the exosomes of SI patients. Based on statistical difference and miRNA function, mir-885-3p was selected for follow-up study. The in vitro functional analysis of exosomes mir-885-3p showed that exosomes from SI patients (SI-exo) could transfer mir-885-3p to orbital fibroblasts (OFs), upregulate the GRE luciferase reporter gene plasmid activity and the level of glucocorticoid receptor (GR), downregulate the level of inflammatory factors, and improve the glucocorticoid sensitivity of OFs. Moreover, these effects can be inhibited by the corresponding miR inhibitor. In addition, we found that high levels of mir-885-3p could inhibit the AKT/NFκB signaling pathway, upregulate the GRE plasmid activity and GR level, and downregulate the level of inflammatory factors of OFs. Moreover, the improvement of glucocorticoid sensitivity by mir-885-3p transmitted by SI-exo can also be inhibited by the AKT/NFκB agonist. Finally, through the in vivo experiment of the GO mouse model, we further determined the relationship between exosomes' mir-885-3p sequence, AKT/NFκB signaling pathway, and glucocorticoid sensitivity. As a conclusion, plasma exosomes deliver mir-885-3p and inhibit the AKT/NFκB signaling pathway to improve the glucocorticoid sensitivity of OFs. Exosome mir-885-3p can be used as a biomarker to determine the sensitivity of ivGCs in GO patients.
Collapse
|
38
|
HAUS5 Is A Potential Prognostic Biomarker With Functional Significance in Breast Cancer. Front Oncol 2022; 12:829777. [PMID: 35280773 PMCID: PMC8913513 DOI: 10.3389/fonc.2022.829777] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 02/04/2022] [Indexed: 11/13/2022] Open
Abstract
Background Breast cancer (BRCA) has become the most frequently appearing, lethal, and aggressive cancer with increasing morbidity and mortality. Previously, it was discovered that the HAUS5 protein is involved in centrosome integrity, spindle assembly, and the completion of the cytoplasmic division process during mitosis. By encouraging chromosome misdivision and aneuploidy, HAUS5 has the potential to cause cancer. The significance of HAUS5 in BRCA and the relationship between its expression and clinical outcomes or immune infiltration remains unclear. Methods Pan-cancer was analyzed by TIMER2 web and the expression differential of HAUS5 was discovered. The prognostic value of HAUS5 for BRCA was evaluated with KM plotter and confirmed with Gene Expression Omnibus (GEO) dataset. Following that, we looked at the relationship between the high and low expression groups of HAUS5 and breast cancer clinical indications. Signaling pathways linked to HAUS5 expression were discovered using Gene Set Enrichment Analysis (GSEA). The relative immune cell infiltrations of each sample were assessed using the CIBERSORT algorithm and ESTIMATE method. We evaluated the Tumor Mutation Burden (TMB) value between the two sets of samples with high and low HAUS5 expression, as well as the differences in gene mutations between the two groups. The proliferation changes of BRCA cells after knockdown of HAUS5 were evaluated by fluorescence cell counting and colony formation assay. Result HAUS5 is strongly expressed in most malignancies, and distinct associations exist between HAUS5 and prognosis in BRCA patients. Upregulated HAUS5 was associated with poor clinicopathological characteristics such as tumor T stage, ER, PR, and HER2 status. mitotic prometaphase, primary immunodeficiency, DNA replication, cell cycle related signaling pathways were all enriched in the presence of elevated HAUS5 expression, according to GSEA analysis. The BRCA microenvironment’s core gene, HAUS5, was shown to be related with invading immune cell subtypes and tumor cell stemness. TMB in the HAUS5-low expression group was significantly higher than that in the high expression group. The mutation frequency of 15 genes was substantially different in the high expression group compared to the low expression group. BRCA cells’ capacity to proliferate was decreased when HAUS5 was knocked down. Conclusion These findings show that HAUS5 is a positive regulator of BRCA progression that contributes to BRCA cells proliferation. As a result, HAUS5 might be a novel prognostic indicator and therapeutic target for BRCA patients.
Collapse
|
39
|
Circulating Tumor Cell Identification Based on Deep Learning. Front Oncol 2022; 12:843879. [PMID: 35252012 PMCID: PMC8889528 DOI: 10.3389/fonc.2022.843879] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Accepted: 01/21/2022] [Indexed: 12/18/2022] Open
Abstract
As a major reason for tumor metastasis, circulating tumor cell (CTC) is one of the critical biomarkers for cancer diagnosis and prognosis. On the one hand, CTC count is closely related to the prognosis of tumor patients; on the other hand, as a simple blood test with the advantages of safety, low cost and repeatability, CTC test has an important reference value in determining clinical results and studying the mechanism of drug resistance. However, the determination of CTC usually requires a big effort from pathologist and is also error-prone due to inexperience and fatigue. In this study, we developed a novel convolutional neural network (CNN) method to automatically detect CTCs in patients’ peripheral blood based on immunofluorescence in situ hybridization (imFISH) images. We collected the peripheral blood of 776 patients from Chifeng Municipal Hospital in China, and then used Cyttel to delete leukocytes and enrich CTCs. CTCs were identified by imFISH with CD45+, DAPI+ immunofluorescence staining and chromosome 8 centromeric probe (CEP8+). The sensitivity and specificity based on traditional CNN prediction were 95.3% and 91.7% respectively, and the sensitivity and specificity based on transfer learning were 97.2% and 94.0% respectively. The traditional CNN model and transfer learning method introduced in this paper can detect CTCs with high sensitivity, which has a certain clinical reference value for judging prognosis and diagnosing metastasis.
Collapse
|
40
|
PARP Inhibitor Upregulates PD-L1 Expression and Provides a New Combination Therapy in Pancreatic Cancer. Front Immunol 2022; 12:762989. [PMID: 34975854 PMCID: PMC8718453 DOI: 10.3389/fimmu.2021.762989] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Accepted: 11/19/2021] [Indexed: 12/15/2022] Open
Abstract
Despite recent improvements in treatment modalities, pancreatic cancer remains a highly lethal tumor with mortality rate increasing every year. Poly (ADP-ribose) polymerase (PARP) inhibitors are now used in pancreatic cancer as a breakthrough in targeted therapy. This study focused on whether PARP inhibitors (PARPis) can affect programmed death ligand-1 (PD-L1) expression in pancreatic cancer and whether immune checkpoint inhibitors of PD-L1/programmed death 1 (PD-1) can enhance the anti-tumor effects of PARPis. Here we found that PARPi, pamiparib, up-regulated PD-L1 expression on the surface of pancreatic cancer cells in vitro and in vivo. Mechanistically, pamiparib induced PD-L1 expression via JAK2/STAT3 pathway, at least partially, in pancreatic cancer. Importantly, pamiparib attenuated tumor growth; while co-administration of pamiparib with PD-L1 blockers significantly improved the therapeutic efficacy in vivo compared with monotherapy. Combination therapy resulted in an altered tumor immune microenvironment with a significant increase in windiness of CD8+ T cells, suggesting a potential role of CD8+ T cells in the combination therapy. Together, this study provides evidence for the clinical application of PARPis with anti-PD-L1/PD-1 drugs in the treatment of pancreatic cancer.
Collapse
|
41
|
A weighted bilinear neural collaborative filtering approach for drug repositioning. Brief Bioinform 2022; 23:6510159. [PMID: 35039838 DOI: 10.1093/bib/bbab581] [Citation(s) in RCA: 52] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Revised: 11/25/2021] [Accepted: 12/19/2021] [Indexed: 02/07/2023] Open
Abstract
Drug repositioning is an efficient and promising strategy for traditional drug discovery and development. Many research efforts are focused on utilizing deep-learning approaches based on a heterogeneous network for modeling complex drug-disease associations. Similar to traditional latent factor models, which directly factorize drug-disease associations, they assume the neighbors are independent of each other in the network and thus tend to be ineffective to capture localized information. In this study, we propose a novel neighborhood and neighborhood interaction-based neural collaborative filtering approach (called DRWBNCF) to infer novel potential drugs for diseases. Specifically, we first construct three networks, including the known drug-disease association network, the drug-drug similarity and disease-disease similarity networks (using the nearest neighbors). To take the advantage of localized information in the three networks, we then design an integration component by proposing a new weighted bilinear graph convolution operation to integrate the information of the known drug-disease association, the drug's and disease's neighborhood and neighborhood interactions into a unified representation. Lastly, we introduce a prediction component, which utilizes the multi-layer perceptron optimized by the α-balanced focal loss function and graph regularization to model the complex drug-disease associations. Benchmarking comparisons on three datasets verified the effectiveness of DRWBNCF for drug repositioning. Importantly, the unknown drug-disease associations predicted by DRWBNCF were validated against clinical trials and three authoritative databases and we listed several new DRWBNCF-predicted potential drugs for breast cancer (e.g. valrubicin and teniposide) and small cell lung cancer (e.g. valrubicin and cytarabine).
Collapse
|
42
|
An overview of machine learning methods for monotherapy drug response prediction. Brief Bioinform 2022; 23:bbab408. [PMID: 34619752 PMCID: PMC8769705 DOI: 10.1093/bib/bbab408] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Revised: 08/25/2021] [Accepted: 09/06/2021] [Indexed: 12/11/2022] Open
Abstract
For an increasing number of preclinical samples, both detailed molecular profiles and their responses to various drugs are becoming available. Efforts to understand, and predict, drug responses in a data-driven manner have led to a proliferation of machine learning (ML) methods, with the longer term ambition of predicting clinical drug responses. Here, we provide a uniquely wide and deep systematic review of the rapidly evolving literature on monotherapy drug response prediction, with a systematic characterization and classification that comprises more than 70 ML methods in 13 subclasses, their input and output data types, modes of evaluation, and code and software availability. ML experts are provided with a fundamental understanding of the biological problem, and how ML methods are configured for it. Biologists and biomedical researchers are introduced to the basic principles of applicable ML methods, and their application to the problem of drug response prediction. We also provide systematic overviews of commonly used data sources used for training and evaluation methods.
Collapse
|
43
|
Bioinformatics Research on Drug Sensitivity Prediction. Front Pharmacol 2021; 12:799712. [PMID: 34955863 PMCID: PMC8696280 DOI: 10.3389/fphar.2021.799712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Accepted: 11/18/2021] [Indexed: 11/28/2022] Open
Abstract
Modeling-based anti-cancer drug sensitivity prediction has been extensively studied in recent years. While most drug sensitivity prediction models only use gene expression data, the remarkable impacts of gene mutation, methylation, and copy number variation on drug sensitivity are neglected. Drug sensitivity prediction can both help protect patients from some adverse drug reactions and improve the efficacy of treatment. Genomics data are extremely useful for drug sensitivity prediction task. This article reviews the role of drug sensitivity prediction, describes a variety of methods for predicting drug sensitivity. Moreover, the research significance of drug sensitivity prediction, as well as existing problems are well discussed.
Collapse
|
44
|
Research on the Computational Prediction of Essential Genes. Front Cell Dev Biol 2021; 9:803608. [PMID: 34938741 PMCID: PMC8685449 DOI: 10.3389/fcell.2021.803608] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Accepted: 11/22/2021] [Indexed: 11/19/2022] Open
Abstract
Genes, the nucleotide sequences that encode a polypeptide chain or functional RNA, are the basic genetic unit controlling biological traits. They are the guarantee of the basic structures and functions in organisms, and they store information related to biological factors and processes such as blood type, gestation, growth, and apoptosis. The environment and genetics jointly affect important physiological processes such as reproduction, cell division, and protein synthesis. Genes are related to a wide range of phenomena including growth, decline, illness, aging, and death. During the evolution of organisms, there is a class of genes that exist in a conserved form in multiple species. These genes are often located on the dominant strand of DNA and tend to have higher expression levels. The protein encoded by it usually either performs very important functions or is responsible for maintaining and repairing these essential functions. Such genes are called persistent genes. Among them, the irreplaceable part of the body’s life activities is the essential gene. For example, when starch is the only source of energy, the genes related to starch digestion are essential genes. Without them, the organism will die because it cannot obtain enough energy to maintain basic functions. The function of the proteins encoded by these genes is thought to be fundamental to life. Nowadays, DNA can be extracted from blood, saliva, or tissue cells for genetic testing, and detailed genetic information can be obtained using the most advanced scientific instruments and technologies. The information gained from genetic testing is useful to assess the potential risks of disease, and to help determine the prognosis and development of diseases. Such information is also useful for developing personalized medication and providing targeted health guidance to improve the quality of life. Therefore, it is of great theoretical and practical significance to identify important and essential genes. In this paper, the research status of essential genes and the essential genome database of bacteria are reviewed, the computational prediction method of essential genes based on communication coding theory is expounded, and the significance and practical application value of essential genes are discussed.
Collapse
|
45
|
KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest. Front Genet 2021; 12:811158. [PMID: 34912382 PMCID: PMC8667860 DOI: 10.3389/fgene.2021.811158] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Accepted: 11/15/2021] [Indexed: 02/04/2023] Open
Abstract
DNA-binding protein (DBP) is a protein with a special DNA binding domain that is associated with many important molecular biological mechanisms. Rapid development of computational methods has made it possible to predict DBP on a large scale; however, existing methods do not fully integrate DBP-related features, resulting in rough prediction results. In this article, we develop a DNA-binding protein identification method called KK-DBP. To improve prediction accuracy, we propose a feature extraction method that fuses multiple PSSM features. The experimental results show a prediction accuracy on the independent test dataset PDB186 of 81.22%, which is the highest of all existing methods.
Collapse
|
46
|
Application of Sparse Representation in Bioinformatics. Front Genet 2021; 12:810875. [PMID: 34976030 PMCID: PMC8715914 DOI: 10.3389/fgene.2021.810875] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Accepted: 12/01/2021] [Indexed: 11/15/2022] Open
Abstract
Inspired by L1-norm minimization methods, such as basis pursuit, compressed sensing, and Lasso feature selection, in recent years, sparse representation shows up as a novel and potent data processing method and displays powerful superiority. Researchers have not only extended the sparse representation of a signal to image presentation, but also applied the sparsity of vectors to that of matrices. Moreover, sparse representation has been applied to pattern recognition with good results. Because of its multiple advantages, such as insensitivity to noise, strong robustness, less sensitivity to selected features, and no “overfitting” phenomenon, the application of sparse representation in bioinformatics should be studied further. This article reviews the development of sparse representation, and explains its applications in bioinformatics, namely the use of low-rank representation matrices to identify and study cancer molecules, low-rank sparse representations to analyze and process gene expression profiles, and an introduction to related cancers and gene expression profile database.
Collapse
|
47
|
Evaluating the Risk of Breast Cancer Recurrence and Metastasis After Adjuvant Tamoxifen Therapy by Integrating Polymorphisms in Cytochrome P450 Genes and Clinicopathological Characteristics. Front Oncol 2021; 11:738222. [PMID: 34868931 PMCID: PMC8639703 DOI: 10.3389/fonc.2021.738222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Accepted: 10/25/2021] [Indexed: 11/13/2022] Open
Abstract
Tamoxifen (TAM) is the most commonly used adjuvant endocrine drug for hormone receptor-positive (HR+) breast cancer patients. However, how to accurately evaluate the risk of breast cancer recurrence and metastasis after adjuvant TAM therapy is still a major concern. In recent years, many studies have shown that the clinical outcomes of TAM-treated breast cancer patients are influenced by the activity of some cytochrome P450 (CYP) enzymes that catalyze the formation of active TAM metabolites like endoxifen and 4-hydroxytamoxifen. In this study, we aimed to first develop and validate an algorithm combining polymorphisms in CYP genes and clinicopathological signatures to identify a subpopulation of breast cancer patients who might benefit most from TAM adjuvant therapy and meanwhile evaluate major risk factors related to TAM resistance. Specifically, a total of 256 patients with invasive breast cancer who received adjuvant endocrine therapy were selected. The genotypes at 10 loci from three TAM metabolism-related CYP genes were detected by time-of-flight mass spectrometry and multiplex long PCR. Combining the 10 loci with nine clinicopathological characteristics, we obtained 19 important features whose association with cancer recurrence was assessed by importance score via random forests. After that, a logistic regression model was trained to calculate TAM risk-of-recurrence score (TAM RORs), which is adopted to assess a patient's risk of recurrence after TAM treatment. The sensitivity and specificity of the model in an independent test cohort were 86.67% and 64.56%, respectively. This study showed that breast cancer patients with high TAM RORs were less sensitive to TAM treatment and manifested more invasive characteristics, whereas those with low TAM RORs were highly sensitive to TAM treatment, and their conditions were stable during the follow-up period. There were some risk factors that had a significant effect on the efficacy of TAM. They were tissue classification (tumor Grade < 2 vs. Grade ≥ 2, p = 2.2e-16), the number of lymph node metastases (Node-Negative vs. Node < 4, p = 5.3e-07; Node < 4 vs. Node ≥ 4, p = 0.003; Node-Negative vs. Node ≥ 4, p = 7.2e-15), and the expression levels of estrogen receptor (ER) and progesterone receptor (PR) (ER < 50% vs. ER ≥ 50%, p = 1.3e-12; PR < 50% vs. PR ≥ 50%, p = 2.6e-08). The really remarkable thing is that different genotypes of CYP2D6*10(C188T) show significant differences in prediction function (CYP2D6*10 CC vs. TT, p < 0.019; CYP2D6*10 CT vs. TT, p < 0.037). There are more than 50% Chinese who have CYP2D6*10 mutation. So the genotype of CYP2D6*10(C188T) should be tested before TAM therapy.
Collapse
|
48
|
Agent Repurposing for the Treatment of Advanced Stage Diffuse Large B-Cell Lymphoma Based on Gene Expression and Network Perturbation Analysis. Front Genet 2021; 12:756784. [PMID: 34721544 PMCID: PMC8551569 DOI: 10.3389/fgene.2021.756784] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Accepted: 09/24/2021] [Indexed: 12/18/2022] Open
Abstract
Over 50% of diffuse large B-cell lymphoma (DLBCL) patients are diagnosed at an advanced stage. Although there are a few therapeutic strategies for DLBCL, most of them are more effective in limited-stage cancer patients. The prognosis of patients with advanced-stage DLBCL is usually poor with frequent recurrence and metastasis. In this study, we aimed to identify gene expression and network differences between limited- and advanced-stage DLBCL patients, with the goal of identifying potential agents that could be used to relieve the severity of DLBCL. Specifically, RNA sequencing data of DLBCL patients at different clinical stages were collected from the cancer genome atlas (TCGA). Differentially expressed genes were identified using DESeq2, and then, weighted gene correlation network analysis (WGCNA) and differential module analysis were performed to find variations between different stages. In addition, important genes were extracted by key driver analysis, and potential agents for DLBCL were identified according to gene-expression perturbations and the Crowd Extracted Expression of Differential Signatures (CREEDS) drug signature database. As a result, 20 up-regulated and 73 down-regulated genes were identified and 79 gene co-expression modules were found using WGCNA, among which, the thistle1 module was highly related to the clinical stage of DLBCL. KEGG pathway and GO enrichment analyses of genes in the thistle1 module indicated that DLBCL progression was mainly related to the NOD-like receptor signaling pathway, neutrophil activation, secretory granule membrane, and carboxylic acid binding. A total of 47 key drivers were identified through key driver analysis with 11 up-regulated key driver genes and 36 down-regulated key diver genes in advanced-stage DLBCL patients. Five genes (MMP1, RAB6C, ACCSL, RGS21 and MOCOS) appeared as hub genes, being closely related to the occurrence and development of DLBCL. Finally, both differentially expressed genes and key driver genes were subjected to CREEDS analysis, and 10 potential agents were predicted to have the potential for application in advanced-stage DLBCL patients. In conclusion, we propose a novel pipeline to utilize perturbed gene-expression signatures during DLBCL progression for identifying agents, and we successfully utilized this approach to generate a list of promising compounds.
Collapse
|
49
|
Bioinformatics Analysis Reveals MCM3 as an Important Prognostic Marker in Cervical Cancer. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2021; 2021:8494260. [PMID: 34671420 PMCID: PMC8523256 DOI: 10.1155/2021/8494260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Revised: 09/14/2021] [Accepted: 09/18/2021] [Indexed: 12/09/2022]
Abstract
The minichromosome maintenance complex 3 (MCM3) is essential for the regulation of DNA replication and cell cycle progression. However, the expression and prognostic values of MCM3 in cervical cancer (CC) have not been well-studied. Herein, we investigated the expression patterns and survival data of MCM3 in cervical cancer patients from the ONCOMINE, GEPIA, Human Protein Atlas, UALCAN, Kaplan-Meier Plotter, and LinkedOmics databases. The expression level of MCM3 is negatively correlated with advanced tumor stage and metastatic status. Specifically, MCM3 is significantly differentially expressed between patients in stage 1 and stage 3 cervical cancer with p value 0.0138. Similarly, the p values between stage 1 and stage 4 cervical cancer, between stage 2 and stage 3, and between stage 2 and stage 4 are 0.00089, 0.0244, and 0.00197, respectively. Not only that, cervical cancer patients with high mRNA expression of MCM3 may indicate longer overall survival but indicate shorter relapse-free survival. PRIM2 and MCM6 are positively correlated genes of MCM3. Bioinformatics analysis revealed that MCM3 might be considered a biological indicator for prognostic evaluation of cervical cancer. However, it is currently limited to bioinformatics analysis, and more clinical tissue specimens and cell experiments are needed to further explore the role of MCM3 in the occurrence and progression of cervical cancer.
Collapse
|
50
|
Detection of BRCA1/2 Mutation and Analysis of Clinicopathological Characteristics in 141 Cases of Ovarian Cancer. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2021; 2021:4854282. [PMID: 34721658 PMCID: PMC8554521 DOI: 10.1155/2021/4854282] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Accepted: 09/28/2021] [Indexed: 12/04/2022]
Abstract
Breast cancer susceptibility genes 1 and 2 (BRCA1 and BRCA2) are known biomarkers for hereditary ovarian cancer (OC). However, a comprehensive association study between BRCA1/2 mutation spectrum and clinicopathological characteristics in Chinese ovarian cancer patients has not been performed yet to our best knowledge. To fill in this gap, we collected BRCA1/2 sequencing data and clinical information of 141 OC patients from Fujian Cancer Hospital between April 2018 and March 2020. The clinical information includes the age of onset, FIGO staging, pathological types, serum 125 detection level, lymph node metastasis, distant metastasis, the expression of Ki67, and disease history of the patient and his/her family. We then studied their associations by software SciPy 1.0. As a result, we detected pathogenic and potentially pathogenic BRCA1/2 mutations in 27 out of 141 patients (19.15%). Among the 27 patients with mutations, the major type of mutation was frameshift, which was observed in 12 patients (44.4%). Most of the mutation sites were distributed on exons 10 and 11, accounting for 48.1% (13/27) and 22.2% (6/27), respectively. In terms of histological classification, high-grade serous adenocarcinoma accounted for 79.43% of the 141 samples. The BRCA1/2 mutation group was all high-grade serous adenocarcinoma, accounting for 24.1% (27/112) of this group. The incidence of pathogenic mutation in BRCA1 and BRCA2 was 15.7% (19/112) and 7.27% (8/112), respectively. Univariate analysis showed that there was no significant difference between patients with BRCA1/2 mutation and others in age-of-onset, FIGO stage, pathological types, serum CA125 level, lymph node metastasis, the expression of Ki67, and personal and family disease history. However, there are significant differences between patients with BRCA1/2 mutation and others in distant metastasis rate (P < 0.002). In addition, the BRCA1/2 mutation rate in 141 ovarian cancer patients was similar to those reported in other studies in China. Nearly one-quarter of high-grade serous carcinomas had BRCA1/2 mutations. In conclusion, our study indicated that patients with BRCA1/2 mutations were more likely to undergo distant metastasis, and BRCA1/2 mutation detection should be performed for patients with high-grade serous adenocarcinoma to guide the selection of clinical treatment options.
Collapse
|