Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Ward RM, Schmieder R, Highnam G, Mittelman D. Big data challenges and opportunities in high-throughput sequencing. ACTA ACUST UNITED AC 2014. [DOI: 10.4161/sysb.24470] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]

For:	Ward RM, Schmieder R, Highnam G, Mittelman D. Big data challenges and opportunities in high-throughput sequencing. ACTA ACUST UNITED AC 2014. [DOI: 10.4161/sysb.24470] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]

Number

Cited by Other Article(s)

Quddusi DM, Bajcinca N. Identification of genomic biomarkers and their pathway crosstalks for deciphering mechanistic links in glioblastoma. IET Syst Biol 2023;17:143-161. [PMID: 37277696 PMCID: PMC10439498 DOI: 10.1049/syb2.12066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Revised: 04/22/2023] [Accepted: 05/03/2023] [Indexed: 06/07/2023] Open

Abstract

Glioblastoma is a grade IV pernicious neoplasm occurring in the supratentorial region of brain. As its causes are largely unknown, it is essential to understand its dynamics at the molecular level. This necessitates the identification of better diagnostic and prognostic molecular candidates. Blood-based liquid biopsies are emerging as a novel tool for cancer biomarker discovery, guiding the treatment and improving its early detection based on their tumour origin. There exist previous studies focusing on the identification of tumour-based biomarkers for glioblastoma. However, these biomarkers inadequately represent the underlying pathological state and incompletely illustrate the tumour because of non-recursive nature of this approach to monitor the disease. Also, contrary to the tumour biopsies, liquid biopsies are non-invasive and can be performed at any interval during the disease span to surveil the disease. Therefore, in this study, a unique dataset of blood-based liquid biopsies obtained primarily from tumour-educated blood platelets (TEP) is utilised. This RNA-seq data from ArrayExpress is acquired comprising human cohort with 39 glioblastoma subjects and 43 healthy subjects. Canonical and machine learning approaches are applied for identification of the genomic biomarkers for glioblastoma and their crosstalks. In our study, 97 genes appeared enriched in 7 oncogenic pathways (RAF-MAPK, P53, PRC2-EZH2, YAP conserved, MEK-MAPK, ErbB2 and STK33 signalling pathways) using GSEA, out of which 17 have been identified participating actively in crosstalks. Using PCA, 42 genes are found enriched in 7 pathways (cytoplasmic ribosomal proteins, translation factors, electron transport chain, ribosome, Huntington's disease, primary immunodeficiency pathways, and interferon type I signalling pathway) harbouring tumour when altered, out of which 25 actively participate in crosstalks. All the 14 pathways foster well-known cancer hallmarks and the identified DEGs can serve as genomic biomarkers, not only for the diagnosis and prognosis of Glioblastoma but also in providing a molecular foothold for oncogenic decision making in order to fathom the disease dynamics. Moreover, SNP analysis for the identified DEGs is performed to investigate their roles in disease dynamics in an elaborated manner. These results suggest that TEPs are capable of providing disease insights just like tumour cells with an advantage of being extracted anytime during the course of disease in order to monitor it.

Collapse

Parastar H, Tauler R. Big (Bio)Chemical Data Mining Using Chemometric Methods: A Need for Chemists. Angew Chem Int Ed Engl 2022. [DOI: 10.1002/ange.201801134] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Jain R, Xu W. Dynamic model updating (DMU) approach for statistical learning model building with missing data. BMC Bioinformatics 2021;22:221. [PMID: 33926384 PMCID: PMC8086098 DOI: 10.1186/s12859-021-04138-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 04/19/2021] [Indexed: 11/17/2022] Open

Abstract

Background

Developing statistical and machine learning methods on studies with missing information is a ubiquitous challenge in real-world biological research. The strategy in literature relies on either removing the samples with missing values like complete case analysis (CCA) or imputing the information in the samples with missing values like predictive mean matching (PMM) such as MICE. Some limitations of these strategies are information loss and closeness of the imputed values with the missing values. Further, in scenarios with piecemeal medical data, these strategies have to wait to complete the data collection process to provide a complete dataset for statistical models.

Method and results

This study proposes a dynamic model updating (DMU) approach, a different strategy to develop statistical models with missing data. DMU uses only the information available in the dataset to prepare the statistical models. DMU segments the original dataset into small complete datasets. The study uses hierarchical clustering to segment the original dataset into small complete datasets followed by Bayesian regression on each of the small complete datasets. Predictor estimates are updated using the posterior estimates from each dataset. The performance of DMU is evaluated by using both simulated data and real studies and show better results or at par with other approaches like CCA and PMM.

Conclusion

DMU approach provides an alternative to the existing approaches of information elimination and imputation in processing the datasets with missing values. While the study applied the approach for continuous cross-sectional data, the approach can be applied to longitudinal, categorical and time-to-event biological data.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12859-021-04138-z.

Collapse

Garlaschi S, Fochesato A, Tovo A. Upscaling Statistical Patterns from Reduced Storage in Social and Life Science Big Datasets. ENTROPY (BASEL, SWITZERLAND) 2020;22:E1084. [PMID: 33286853 PMCID: PMC7597173 DOI: 10.3390/e22101084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Revised: 09/17/2020] [Accepted: 09/23/2020] [Indexed: 11/16/2022]

Bartoszewski R, Sikorski AF. Editorial focus: understanding off-target effects as the key to successful RNAi therapy. Cell Mol Biol Lett 2019;24:69. [PMID: 31867046 PMCID: PMC6902517 DOI: 10.1186/s11658-019-0196-3] [Citation(s) in RCA: 73] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Accepted: 12/03/2019] [Indexed: 12/21/2022] Open

Singh A, Müller B, Fuxelius HH, Schnürer A. AcetoBase: a functional gene repository and database for formyltetrahydrofolate synthetase sequences. Database (Oxford) 2019;2019:baz142. [PMID: 31832668 PMCID: PMC6908459 DOI: 10.1093/database/baz142] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Revised: 11/01/2019] [Accepted: 11/14/2019] [Indexed: 01/01/2023]

Alnasir JJ, Shanahan HP. The application of Hadoop in structural bioinformatics. Brief Bioinform 2018;21:96-105. [PMID: 30462158 DOI: 10.1093/bib/bby106] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2018] [Revised: 09/20/2018] [Accepted: 10/05/2018] [Indexed: 11/13/2022] Open

Artificial intelligence used in genome analysis studies. EUROBIOTECH JOURNAL 2018. [DOI: 10.2478/ebtj-2018-0012] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Tauler R, Parastar H. Big (Bio)Chemical Data Mining Using Chemometric Methods: A Need for Chemists. Angew Chem Int Ed Engl 2018;61:e201801134. [DOI: 10.1002/anie.201801134] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2018] [Indexed: 11/08/2022]

Park J, Gabbard JL. Factors that affect scientists' knowledge sharing behavior in health and life sciences research communities: Differences between explicit and implicit knowledge. COMPUTERS IN HUMAN BEHAVIOR 2018. [DOI: 10.1016/j.chb.2017.09.017] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Engineered Nucleases and Trinucleotide Repeat Diseases. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016. [DOI: 10.1007/978-1-4939-3509-3_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]

An analytical framework for optimizing variant discovery from personal genomes. Nat Commun 2015;6:6275. [PMID: 25711446 PMCID: PMC4351570 DOI: 10.1038/ncomms7275] [Citation(s) in RCA: 57] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Accepted: 01/13/2015] [Indexed: 12/30/2022] Open

Genomics data curation roles, skills and perception of data quality. LIBRARY & INFORMATION SCIENCE RESEARCH 2015. [DOI: 10.1016/j.lisr.2014.08.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Kosseim P, Dove ES, Baggaley C, Meslin EM, Cate FH, Kaye J, Harris JR, Knoppers BM. Building a data sharing model for global genomic research. Genome Biol 2014;15:430. [PMID: 25221857 PMCID: PMC4282015 DOI: 10.1186/s13059-014-0430-2] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open