Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Total Articles: 29 (from Reference Citation Analysis)
Article PDFs: 11
Cited by > 0: 24
Searched Name: data normalization

Sullivan GJ, Barquist L, Cain AK. A method to correct for local alterations in DNA copy number that bias functional genomics assays applied to antibiotic-treated bacteria. mSystems 2024;9:e0066523. [PMID: 38470252; PMCID: PMC11019837; DOI: 10.1128/msystems.00665-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/07/2023] [Accepted: 02/13/2024] [Indexed: 03/13/2024] Open

Abstract

Functional genomics techniques, such as transposon insertion sequencing and RNA-sequencing, are key to studying relative differences in bacterial mutant fitness or gene expression under selective conditions. However, certain stress conditions, mutations, or antibiotics can directly interfere with DNA synthesis, resulting in systematic changes in local DNA copy numbers along the chromosome. This can lead to artifacts in sequencing-based functional genomics data when comparing antibiotic treatment to an unstressed control. Further, relative differences in gene-wise read counts may result from alterations in chromosomal replication dynamics, rather than selection or direct gene regulation. We term this artifact "chromosomal location bias" and implement a principled statistical approach to correct it by calculating local normalization factors along the chromosome. These normalization factors are then directly incorporated into statistical analyses using standard RNA-sequencing analysis methods without modifying the read counts themselves, preserving important information about the mean-variance relationship in the data. We illustrate the utility of this approach by generating and analyzing a ciprofloxacin-treated transposon insertion sequencing data set in Escherichia coli as a case study. We show that ciprofloxacin treatment generates chromosomal location bias in the resulting data, and we further demonstrate that failing to correct for this bias leads to false predictions of mutant drug sensitivity as measured by minimum inhibitory concentrations. We have developed an R package and user-friendly graphical Shiny application, ChromoCorrect, that detects and corrects for chromosomal bias in read count data, enabling the application of functional genomics technologies to the study of antibiotic stress.

IMPORTANCE

Altered gene dosage due to changes in DNA replication has been observed under a variety of stresses with a variety of experimental techniques. However, the implications of changes in gene dosage for sequencing-based functional genomics assays are rarely considered. We present a statistically principled approach to correcting for the effect of changes in gene dosage, enabling testing for differences in the fitness effects or regulation of individual genes in the presence of confounding differences in DNA copy number. We show that failing to correct for these effects can lead to incorrect predictions of resistance phenotype when applying functional genomics assays to investigate antibiotic stress, and we provide a user-friendly application to detect and correct for changes in DNA copy number.
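The idea of local normalization factors along the chromosome can be illustrated with a small sketch. This is not the ChromoCorrect implementation (which is an R package): it assumes a simple running median over neighbouring genes as the smoother, wraps the window around the ends because the E. coli chromosome is circular, and uses made-up log fold changes. The function name and the `half_window` parameter are hypothetical.

```python
from statistics import median


def local_normalization_factors(log_fc, half_window=1):
    """Running median of log fold changes over neighbouring genes.

    log_fc is ordered by chromosomal position; the window wraps around
    the ends to mimic a circular chromosome (an assumption of this sketch).
    """
    n = len(log_fc)
    factors = []
    for i in range(n):
        window = [log_fc[(i + k) % n] for k in range(-half_window, half_window + 1)]
        factors.append(median(window))
    return factors


# Made-up log fold changes: the first half of the "chromosome" is shifted
# up by about 1 (a copy-number effect), with one outlier gene in each half.
log_fc = [1.0, 1.0, 4.0, 1.0, 0.0, 0.0, -3.0, 0.0]
factors = local_normalization_factors(log_fc)

# Subtracting the local trend is only for illustration; the abstract describes
# passing the factors to the statistical model and leaving the counts unmodified.
residuals = [x - f for x, f in zip(log_fc, factors)]
print(factors)
print(residuals)
```

After removing the local trend, only the two outlier genes stand out, whereas on the raw values every gene in the shifted region would appear changed.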

Liu J, Kreimer A, Li WV. Differential variability analysis of single-cell gene expression data. Brief Bioinform 2023;24:bbad294. [PMID: 37598422; PMCID: PMC10516347; DOI: 10.1093/bib/bbad294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/10/2023] [Revised: 07/18/2023] [Accepted: 07/29/2023] [Indexed: 08/22/2023] Open

Zhang Y, Fan S, Wohlgemuth G, Fiehn O. Denoising Autoencoder Normalization for Large-Scale Untargeted Metabolomics by Gas Chromatography-Mass Spectrometry. Metabolites 2023;13:944. [PMID: 37623887; PMCID: PMC10456436; DOI: 10.3390/metabo13080944] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 07/13/2023] [Revised: 07/31/2023] [Accepted: 08/08/2023] [Indexed: 08/26/2023] Open

Abstract

Large-scale metabolomics assays are widely used in epidemiology for biomarker discovery and risk assessments. However, systematic errors introduced by instrumental signal drifting pose a big challenge in large-scale assays, especially for derivatization-based gas chromatography-mass spectrometry (GC-MS). Here, we compare the results of different normalization methods for a study with more than 4000 human plasma samples involved in a type 2 diabetes cohort study, in addition to 413 pooled quality control (QC) samples, 413 commercial pooled plasma samples, and a set of 25 stable isotope-labeled internal standards used for every sample. Data acquisition was conducted across 1.2 years, including seven column changes. In total, 413 pooled QC (training) and 413 BioIVT samples (validation) were used for normalization comparisons. Surprisingly, neither internal standards nor sum-based normalizations yielded median precision of less than 30% across all 563 metabolite annotations. While the machine-learning-based SERRF algorithm gave 19% median precision based on the pooled quality control samples, external cross-validation with BioIVT plasma pools yielded a median 34% relative standard deviation (RSD). We developed a new method: systematic error reduction by denoising autoencoder (SERDA). SERDA lowered the median standard deviations of the training QC samples down to 16% RSD, yielding an overall error of 19% RSD when applied to the independent BioIVT validation QC samples. This is the largest study on GC-MS metabolomics ever reported, demonstrating that technical errors can be normalized and handled effectively for this assay. SERDA was further validated on two additional large-scale GC-MS-based human plasma metabolomics studies, confirming the superior performance of SERDA over SERRF or sum normalizations.
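The precision figures quoted in this abstract are median relative standard deviations (RSD) across metabolites, measured on repeated QC samples. A minimal sketch of that metric follows, assuming the usual definition RSD = sample standard deviation / mean × 100. The metabolite names and intensities are invented for illustration and are not data from the study.

```python
from statistics import mean, median, stdev


def rsd_percent(values):
    """Relative standard deviation (sample SD / mean) as a percentage."""
    m = mean(values)
    if m == 0:
        raise ValueError("RSD is undefined for a zero mean")
    return 100.0 * stdev(values) / m


def median_rsd(qc_table):
    """Median RSD across metabolites; qc_table maps metabolite -> QC intensities."""
    return median(rsd_percent(v) for v in qc_table.values())


# Hypothetical pooled-QC intensities for three metabolites (made-up numbers).
qc_intensities = {
    "metabolite_a": [90.0, 100.0, 110.0],   # RSD 10%
    "metabolite_b": [80.0, 100.0, 120.0],   # RSD 20%
    "metabolite_c": [95.0, 100.0, 105.0],   # RSD 5%
}
print(round(median_rsd(qc_intensities), 1))
```

Comparing this value before and after a normalization step, on QC samples that were not used to fit the normalization, is the kind of external check the abstract describes with the BioIVT pools.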

Leacy E, Batten I, Sanelli L, McElheron M, Brady G, Little MA, Khouri H. Optimal LC-MS metabolomic profiling reveals emergent changes to monocyte metabolism in response to lipopolysaccharide. Front Immunol 2023;14:1116760. [PMID: 37033938; PMCID: PMC10077522; DOI: 10.3389/fimmu.2023.1116760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/05/2022] [Accepted: 03/03/2023] [Indexed: 04/11/2023] Open

Abstract

Introduction

Immunometabolism examines the links between immune cell function and metabolism. Dysregulation of immune cell metabolism is now an established feature of innate immune cell activation. Advances in liquid chromatography mass spectrometry (LC-MS) technologies have allowed discovery of unique insights into cellular metabolomics. Here we have studied and compared different sample preparation techniques and data normalisation methods described in the literature when applied to metabolomic profiling of human monocytes.

Methods

Primary monocytes stimulated with lipopolysaccharide (LPS) for four hours were used as a study model. Monocytes (n=24) were freshly isolated from whole blood and stimulated with LPS for four hours. A methanol-based extraction protocol was developed and metabolomic profiling was carried out using a Hydrophilic Interaction Liquid Chromatography (HILIC) LC-MS method. Data analysis pipelines used both targeted and untargeted approaches, and over 40 different data normalisation techniques to account for technical and biological variation were examined. Cytokine levels in supernatants were measured by ELISA.

Results

This method provided broad coverage of the monocyte metabolome. The most efficient and consistent normalisation method was measurement of residual protein in the metabolite fraction, which was further validated and optimised using a commercial kit. Alterations to the monocyte metabolome in response to LPS can be detected as early as four hours post stimulation. Broad and profound changes in monocyte metabolism were seen, in line with increased cytokine production. Elevated levels of amino acids and Krebs cycle metabolites were noted and decreases in aspartate and β-alanine are also reported for the first time. In the untargeted analysis, 154 metabolite entities were significantly altered compared to unstimulated cells. Pathway analysis revealed the most prominent changes occurred to (phospho-) inositol metabolism, glycolysis, and the pentose phosphate pathway.

Discussion

These data report the emergent changes to monocyte metabolism in response to LPS, in line with reports from later time points. A number of these metabolites are reported to alter inflammatory gene expression, which may facilitate the increases in cytokine production. Further validation is needed to confirm the link between metabolic activation and upregulation of inflammatory responses.

Shi Z, Li H, Zhang W, Chen Y, Zeng C, Kang X, Xu X, Xia Z, Qing B, Yuan Y, Song G, Caldana C, Hu J, Willmitzer L, Li Y. A Comprehensive Mass Spectrometry-Based Workflow for Clinical Metabolomics Cohort Studies. Metabolites 2022;12. [PMID: 36557207; DOI: 10.3390/metabo12121168] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 09/20/2022] [Revised: 11/14/2022] [Accepted: 11/16/2022] [Indexed: 11/27/2022] Open

Zheng H, Zhao H, Zhang X, Liang Z, He Q. Systematic Identification and Validation of Suitable Reference Genes for the Normalization of Gene Expression in Prunella vulgaris under Different Organs and Spike Development Stages. Genes (Basel) 2022;13:1947. [PMID: 36360184; PMCID: PMC9689956; DOI: 10.3390/genes13111947] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 09/15/2022] [Revised: 10/19/2022] [Accepted: 10/24/2022] [Indexed: 08/01/2023] Open

Rodriguez J, Gomez-Cano L, Grotewold E, de Leon N. Normalizing and Correcting Variable and Complex LC-MS Metabolomic Data with the R Package pseudoDrift. Metabolites 2022;12:435. [PMID: 35629939; PMCID: PMC9144304; DOI: 10.3390/metabo12050435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/28/2022] [Revised: 05/09/2022] [Accepted: 05/10/2022] [Indexed: 01/27/2023] Open

Hirsch SM, Chapman CJ, Frost DM, Beach TAC. Comparison of 5 Normalization Methods for Knee Joint Moments in the Single-Leg Squat. J Appl Biomech 2022;:1-10. [PMID: 35042188; DOI: 10.1123/jab.2021-0143] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 04/27/2021] [Revised: 10/13/2021] [Accepted: 12/01/2021] [Indexed: 11/18/2022]

Kim YJ, Kim KG. Detection and Weak Segmentation of Masses in Gray-Scale Breast Mammogram Images Using Deep Learning. Yonsei Med J 2022;63:S63-S73. [PMID: 35040607; PMCID: PMC8790585; DOI: 10.3349/ymj.2022.63.s63] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/10/2021] [Revised: 11/10/2021] [Accepted: 11/11/2021] [Indexed: 11/27/2022] Open

Kubinski R, Djamen-Kepaou JY, Zhanabaev T, Hernandez-Garcia A, Bauer S, Hildebrand F, Korcsmaros T, Karam S, Jantchou P, Kafi K, Martin RD. Benchmark of Data Processing Methods and Machine Learning Models for Gut Microbiome-Based Diagnosis of Inflammatory Bowel Disease. Front Genet 2022;13:784397. [PMID: 35251123; PMCID: PMC8895431; DOI: 10.3389/fgene.2022.784397] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Received: 09/27/2021] [Accepted: 01/13/2022] [Indexed: 12/14/2022] Open

Abstract

Patients with inflammatory bowel disease (IBD) wait months and undergo numerous invasive procedures between the initial appearance of symptoms and receiving a diagnosis. In order to reduce time until diagnosis and improve patient wellbeing, machine learning algorithms capable of diagnosing IBD from the gut microbiome's composition are currently being explored. To date, these models have had limited clinical application due to decreased performance when applied to a new cohort of patient samples. Various methods have been developed to analyze microbiome data which may improve the generalizability of machine learning IBD diagnostic tests. With an abundance of methods, there is a need to benchmark the performance and generalizability of various machine learning pipelines (from data processing to training a machine learning model) for microbiome-based IBD diagnostic tools. We collected fifteen 16S rRNA microbiome datasets (7,707 samples) from North America to benchmark combinations of gut microbiome features, data normalization and transformation methods, batch effect correction methods, and machine learning models. Pipeline generalizability to new cohorts of patients was evaluated with two binary classification metrics following leave-one-dataset-out (LODO) cross-validation, where all samples from one study were left out of the training set and used as the test set. We demonstrate that taxonomic features processed with a compositional transformation method and batch effect correction with the naive zero-centering method attain the best classification performance. In addition, machine learning models that identify non-linear decision boundaries between labels are more generalizable than those that are linearly constrained. Lastly, we illustrate the importance of generating a curated training dataset to ensure similar performance across patient demographics. These findings will help improve the generalizability of machine learning models as we move towards non-invasive diagnostic and disease management tools for patients with IBD.
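Two steps named in this abstract, zero-centering as a batch effect correction and leave-one-dataset-out splitting, can be sketched as follows. This is an illustration under stated assumptions, not the authors' pipeline: zero-centering is taken to mean subtracting each dataset's per-feature mean from its own samples, the held-out dataset is centered with its own means, and the dataset identifiers and feature values are invented.

```python
from collections import defaultdict


def zero_center_by_dataset(samples):
    """Subtract each dataset's per-feature mean from its own samples.

    samples: list of (dataset_id, feature_vector) pairs.
    """
    sums, counts = {}, defaultdict(int)
    for ds, vec in samples:
        if ds not in sums:
            sums[ds] = [0.0] * len(vec)
        sums[ds] = [s + x for s, x in zip(sums[ds], vec)]
        counts[ds] += 1
    means = {ds: [s / counts[ds] for s in sums[ds]] for ds in sums}
    return [(ds, [x - m for x, m in zip(vec, means[ds])]) for ds, vec in samples]


def lodo_splits(samples):
    """Yield (held_out_id, train, test), holding out one whole dataset per fold."""
    for held_out in sorted({ds for ds, _ in samples}):
        train = [s for s in samples if s[0] != held_out]
        test = [s for s in samples if s[0] == held_out]
        yield held_out, train, test


# Hypothetical feature vectors from two studies with different baselines.
samples = [
    ("study1", [1.0, 3.0]),
    ("study1", [3.0, 5.0]),
    ("study2", [10.0, 20.0]),
    ("study2", [14.0, 20.0]),
]
centered = zero_center_by_dataset(samples)
folds = list(lodo_splits(centered))
for held_out, train, test in folds:
    print(held_out, len(train), len(test))
```

Splitting by whole dataset rather than by sample is what makes the evaluation reflect performance on a new cohort: no sample from the held-out study ever appears in the training fold.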

Ivanova L, Rangel-Huerta OD, Tartor H, Gjessing MC, Dahle MK, Uhlig S. Fish Skin and Gill Mucus: A Source of Metabolites for Non-Invasive Health Monitoring and Research. Metabolites 2021;12:28. [PMID: 35050150; PMCID: PMC8781917; DOI: 10.3390/metabo12010028] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 11/24/2021] [Revised: 12/16/2021] [Accepted: 12/25/2021] [Indexed: 11/28/2022] Open

Herrmann HA, Rusz M, Baier D, Jakupec MA, Keppler BK, Berger W, Koellensperger G, Zanghellini J. Thermodynamic Genome-Scale Metabolic Modeling of Metallodrug Resistance in Colorectal Cancer. Cancers (Basel) 2021;13:4130. [PMID: 34439283; PMCID: PMC8391396; DOI: 10.3390/cancers13164130] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 06/24/2021] [Revised: 07/23/2021] [Accepted: 08/03/2021] [Indexed: 12/11/2022] Open

Abstract

Simple Summary

Cancer, but also its treatment, can lead to a reprogramming of cellular metabolism. These changes are observable in metabolite abundances, which can be unbiasedly measured via mass spectrometry metabolomics. However, even when the metabolome changes strongly, a (mechanistic) interpretation is difficult as metabolite levels do not necessarily directly correspond to pathway activities. Here we measure the changes of the cellular metabolome in colorectal cancer cell lines sensitive and resistant to the ruthenium-based drug BOLD-100/KP1339 and the platinum-based drug oxaliplatin. We map these changes onto a cancer-specific genome-scale metabolic model, which allows us not only to compute intracellular flux distributions, but also to disentangle drug-specific effects from growth differences from differences in metabolic adaptations due to resistance. Specifically, we find that resistance to BOLD-100/KP1339 induces more extensive reprogramming than oxaliplatin, especially with respect to fatty acid and amino acid metabolism.

Abstract

Background: Mass spectrometry-based metabolomics approaches provide an immense opportunity to enhance our understanding of the mechanisms that underpin the cellular reprogramming of cancers. Accurate comparative metabolic profiling of heterogeneous conditions, however, is still a challenge. Methods: Measuring both intracellular and extracellular metabolite concentrations, we constrain four instances of a thermodynamic genome-scale metabolic model of the HCT116 colorectal carcinoma cell line to compare the metabolic flux profiles of cells that are either sensitive or resistant to ruthenium- or platinum-based treatments with BOLD-100/KP1339 and oxaliplatin, respectively. Results: Normalizing according to growth rate and normalizing resistant cells according to their respective sensitive controls, we are able to dissect metabolic responses specific to the drug and to the resistance states. We find the normalization steps to be crucial in the interpretation of the metabolomics data and show that the metabolic reprogramming in resistant cells is limited to a select number of pathways. Conclusions: Here, we elucidate the key importance of normalization steps in the interpretation of metabolomics data, allowing us to uncover drug-specific metabolic reprogramming during acquired metal-drug resistance.

Haering M, Habermann BH. RNfuzzyApp: an R shiny RNA-seq data analysis app for visualisation, differential expression analysis, time-series clustering and enrichment analysis. F1000Res 2021;10:654. [PMID: 35186266; PMCID: PMC8825645; DOI: 10.12688/f1000research.54533.1] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Accepted: 11/04/2021] [Indexed: 09/23/2023] Open

Ni A, Qin LX. Performance evaluation of transcriptomics data normalization for survival risk prediction. Brief Bioinform 2021;22:6317608. [PMID: 34245143; PMCID: PMC8575026; DOI: 10.1093/bib/bbab257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/08/2021] [Revised: 05/20/2021] [Accepted: 06/17/2021] [Indexed: 11/13/2022] Open

Zhao Z, Zhou H, Nie Z, Wang X, Luo B, Yi Z, Li X, Hu X, Yang T. Appropriate Reference Genes for RT-qPCR Normalization in Various Organs of Anemone flaccida Fr. Schmidt at Different Growing Stages. Genes (Basel) 2021;12:459. [PMID: 33807101; DOI: 10.3390/genes12030459] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 02/19/2021] [Revised: 03/12/2021] [Accepted: 03/17/2021] [Indexed: 11/17/2022] Open

Ampavathi A, Saradhi TV. Multi disease-prediction framework using hybrid deep learning: an optimal prediction model. Comput Methods Biomech Biomed Engin 2021;24:1146-1168. [PMID: 33427480; DOI: 10.1080/10255842.2020.1869726] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 01/06/2023]

Abstract

Big data and its approaches are generally helpful for healthcare and biomedical sectors for predicting the disease. For trivial symptoms, the difficulty is to meet the doctors at any time in the hospital. Thus, big data provides essential data regarding the diseases on the basis of the patient's symptoms. For several medical organizations, disease prediction is important for making the best feasible health care decisions. Conversely, the conventional medical care model offers input as structured that requires more accurate and consistent prediction. This paper is planned to develop the multi-disease prediction using the improvised deep learning concept. Here, the different datasets pertain to "Diabetes, Hepatitis, lung cancer, liver tumor, heart disease, Parkinson's disease, and Alzheimer's disease", from the benchmark UCI repository is gathered for conducting the experiment. The proposed model involves three phases (a) Data normalization (b) Weighted normalized feature extraction, and (c) prediction. Initially, the dataset is normalized in order to make the attribute's range at a certain level. Further, weighted feature extraction is performed, in which a weight function is multiplied with each attribute value for making large scale deviation. Here, the weight function is optimized using the combination of two meta-heuristic algorithms termed as Jaya Algorithm-based Multi-Verse Optimization algorithm (JA-MVO). The optimally extracted features are subjected to the hybrid deep learning algorithms like "Deep Belief Network (DBN) and Recurrent Neural Network (RNN)". As a modification to hybrid deep learning architecture, the weight of both DBN and RNN is optimized using the same hybrid optimization algorithm. Further, the comparative evaluation of the proposed prediction over the existing models certifies its effectiveness through various performance measures.
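The first two phases described in this abstract, normalizing each attribute to a fixed range and then multiplying each attribute by a weight, can be sketched as below. The abstract does not state which normalization is used, so min-max scaling to [0, 1] is an assumption here. The weights are fixed placeholder values, whereas the paper optimizes them with its JA-MVO algorithm, which is not reproduced.

```python
def min_max_normalize(rows):
    """Rescale each column of rows to [0, 1]; constant columns map to 0.0."""
    columns = list(zip(*rows))
    lows = [min(c) for c in columns]
    spans = [max(c) - min(c) for c in columns]
    return [
        [(x - lo) / span if span else 0.0 for x, lo, span in zip(row, lows, spans)]
        for row in rows
    ]


def apply_weights(rows, weights):
    """Multiply every attribute value by the weight of its column."""
    return [[x * w for x, w in zip(row, weights)] for row in rows]


# Hypothetical two-attribute dataset with very different ranges.
rows = [[1.0, 10.0], [2.0, 30.0], [3.0, 20.0]]
normalized = min_max_normalize(rows)

# Placeholder weights; in the paper these come from an optimizer.
weighted = apply_weights(normalized, [2.0, 0.5])
print(normalized)
print(weighted)
```

Normalizing first puts the attributes on a common scale, so that the weights alone determine how much each attribute contributes downstream.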

Philips A, Nowis K, Stelmaszczuk M, Jackowiak P, Podkowiński J, Handschuh L, Figlerowicz M. Expression Landscape of circRNAs in Arabidopsis thaliana Seedlings and Adult Tissues. Front Plant Sci 2020;11:576581. [PMID: 33014000; PMCID: PMC7511659; DOI: 10.3389/fpls.2020.576581] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 06/26/2020] [Accepted: 08/25/2020] [Indexed: 05/27/2023]

Abstract

RNA-seq is currently the only method that can provide a comprehensive landscape of circular RNA (circRNAs) in the whole organism and its particular organs. Recent years have brought an increasing number of RNA-seq-based reports on plant circRNAs. Notably, the picture they revealed is questionable and depends on the applied circRNA identification and quantification techniques. In consequence, little is known about the biogenesis and functions of circRNAs in plants. In this work, we tested two experimental and six bioinformatics procedures of circRNA analysis to determine the optimal approach for studying the profiles of circRNAs in Arabidopsis thaliana. Then using the optimized strategy, we determined the accumulation of circular and corresponding linear transcripts in plant seedlings and organs. We observed that only a small fraction of circRNAs was reproducibly generated. Among them, two groups of circRNAs were discovered: ubiquitous and organ-specific. The highest number of circRNAs with significantly increased accumulation in comparison to other organs/seedlings was found in roots. The circRNAs in seedlings, leaves and flowers originated mainly from genes involved in photosynthesis and the response to stimulus. The levels of circular and linear transcripts were not correlated. Although RNase R treatment enriches the analyzed RNA samples in circular transcripts, it may also have a negative impact on the stability of some of the circRNAs. We also showed that the normalization of NGS data by the library size is not proper for circRNAs quantification. Alternatively, we proposed four other normalization types whose accuracy was confirmed by ddPCR. Moreover, we provided a comprehensive characterization of circRNAs in A. thaliana organs and in seedlings. Our analyses revealed that plant circRNAs are formed in both stochastic and controlled processes. The latter are less frequent and likely engage circRNA-specific mechanisms. Only a few circRNAs were organ-specific. 
The lack of correlation between the accumulation of linear and circular transcripts indicated that their biogenesis depends on different mechanisms.

Benedetti E, Gerstner N, Pučić-Baković M, Keser T, Reiding KR, Ruhaak LR, Štambuk T, Selman MH, Rudan I, Polašek O, Hayward C, Beekman M, Slagboom E, Wuhrer M, Dunlop MG, Lauc G, Krumsiek J. Systematic Evaluation of Normalization Methods for Glycomics Data Based on Performance of Network Inference. Metabolites 2020;10:E271. [PMID: 32630764; PMCID: PMC7408386; DOI: 10.3390/metabo10070271] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 04/21/2020] [Revised: 05/29/2020] [Accepted: 06/04/2020] [Indexed: 01/15/2023] Open

Affiliation(s)

Elisa Benedetti: Department of Physiology and Biophysics, Institute for Computational Biomedicine, Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY 10022, USA; Institute of Computational Biology, Helmholtz Zentrum München - German Research Center for Environmental Health, 85764 Neuherberg, Germany
Nathalie Gerstner: Institute of Computational Biology, Helmholtz Zentrum München - German Research Center for Environmental Health, 85764 Neuherberg, Germany; Max Planck Institute for Psychiatry, 80804 Munich, Germany
Maja Pučić-Baković: Genos Glycoscience Research Laboratory, 10000 Zagreb, Croatia
Toma Keser: Faculty of Pharmacy and Biochemistry, University of Zagreb, 10000 Zagreb, Croatia
Karli R. Reiding: Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, University of Utrecht, 3584 CH Utrecht, The Netherlands; Center for Proteomics and Metabolomics, Leiden University Medical Center, 2333 ZC Leiden, The Netherlands
L. Renee Ruhaak: Center for Proteomics and Metabolomics, Leiden University Medical Center, 2333 ZC Leiden, The Netherlands; Department of Clinical Chemistry and Laboratory Medicine, Leiden University Medical Center, 2333 ZC Leiden, The Netherlands
Tamara Štambuk: Faculty of Pharmacy and Biochemistry, University of Zagreb, 10000 Zagreb, Croatia
Maurice H.J. Selman: Biomolecular Mass Spectrometry and Proteomics, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, University of Utrecht, 3584 CH Utrecht, The Netherlands
Igor Rudan: Usher Institute of Population Health Sciences and Informatics, University of Edinburgh, Edinburgh EH8 9AG, UK
Ozren Polašek: Medical School, University of Split, 21000 Split, Croatia; Gen-Info Ltd., 10000 Zagreb, Croatia
Caroline Hayward: Medical Research Council Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh EH4 2XU, UK
Marian Beekman: Section of Molecular Epidemiology, Leiden University Medical Center, 2333 ZC Leiden, The Netherlands
Eline Slagboom: Section of Molecular Epidemiology, Leiden University Medical Center, 2333 ZC Leiden, The Netherlands
Manfred Wuhrer: Center for Proteomics and Metabolomics, Leiden University Medical Center, 2333 ZC Leiden, The Netherlands
Malcolm G. Dunlop: Colon Cancer Genetics Group, Institute of Genetics and Molecular Medicine, University of Edinburgh and Medical Research Council Human Genetics Unit, Edinburgh EH8 9YL, UK
Gordan Lauc: Genos Glycoscience Research Laboratory, 10000 Zagreb, Croatia; Faculty of Pharmacy and Biochemistry, University of Zagreb, 10000 Zagreb, Croatia
Jan Krumsiek: Department of Physiology and Biophysics, Institute for Computational Biomedicine, Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY 10022, USA; Institute of Computational Biology, Helmholtz Zentrum München - German Research Center for Environmental Health, 85764 Neuherberg, Germany

Liu F, Singhal K, Matney R, Acharya S, Akdis CA, Nadeau KC, Chien AS, Leib RD. Enhancing Data Reliability in TOMAHAQ for Large-Scale Protein Quantification. Proteomics 2020;20:e1900105. [PMID: 32032464; DOI: 10.1002/pmic.201900105] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 06/13/2019] [Revised: 01/19/2020] [Indexed: 11/10/2022]

Li F, Rao G, Du J, Xiang Y, Zhang Y, Selek S, Hamilton JE, Xu H, Tao C. Ontological representation-oriented term normalization and standardization of the Research Domain Criteria. Health Informatics J 2019;26:726-737. [PMID: 30843449; DOI: 10.1177/1460458219832059] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 01/20/2023]

Krasnov GS, Kudryavtseva AV, Snezhkina AV, Lakunina VA, Beniaminov AD, Melnikova NV, Dmitriev AA. Pan-Cancer Analysis of TCGA Data Revealed Promising Reference Genes for qPCR Normalization. Front Genet 2019;10:97. [PMID: 30881377; PMCID: PMC6406071; DOI: 10.3389/fgene.2019.00097] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Received: 10/31/2018] [Accepted: 01/29/2019] [Indexed: 11/20/2022] Open

Abstract

Quantitative PCR (qPCR) remains the most widely used technique for gene expression evaluation. Obtaining reliable data using this method requires reference genes (RGs) with stable mRNA levels under experimental conditions. This issue is especially crucial in cancer studies because each tumor has a unique molecular portrait. The Cancer Genome Atlas (TCGA) project provides RNA-Seq data for thousands of samples corresponding to dozens of cancers and presents the basis for assessment of the suitability of genes as reference ones for qPCR data normalization. Using TCGA RNA-Seq data and the previously developed CrossHub tool, we evaluated the mRNA level of 32 traditionally used RGs in 12 cancer types, including those of lung, breast, prostate, kidney, and colon. We developed an 11-component scoring system for the assessment of gene expression stability. Among the 32 genes, PUM1 was one of the most stably expressed in the majority of examined cancers, whereas GAPDH, which is widely used as an RG, showed significant mRNA level alterations in more than half of cases. For each of the 12 cancer types, we suggested a pair of genes that are the most suitable for use as reference ones. These genes are characterized by high expression stability and absence of correlation between their mRNA levels. Next, the scoring system was expanded with several features of a gene: mutation rate, number of transcript isoforms and pseudogenes, participation in cancer-related processes on the basis of Gene Ontology, and mentions in PubMed-indexed articles. All the genes covered by RNA-Seq data in TCGA were analyzed using the expanded scoring system that allowed us to reveal novel promising RGs for each examined cancer type and identify several "universal" pan-cancer RG candidates, including SF3A1, CIAO1, and SFRS4. The choice of RGs is the basis for precise gene expression evaluation by qPCR. Here, we suggested optimal pairs of traditionally used RGs for 12 cancer types and identified novel promising RGs that demonstrate high expression stability and other features of reliable and convenient RGs (high expression level, low mutation rate, non-involvement in cancer-related processes, single transcript isoform, and absence of pseudogenes).
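The two criteria the abstract names for a reference-gene pair (high expression stability and absence of correlation between mRNA levels) can be illustrated with a small Python sketch. This is not the authors' 11-component scoring system or the CrossHub tool: the function name `pick_reference_pair`, the genes-by-samples log-expression layout, and the simple additive score are all assumptions made only for illustration.

```python
from itertools import combinations

import numpy as np


def pick_reference_pair(expr, genes):
    """Pick the gene pair with the lowest illustrative score.

    expr  : genes x samples matrix of log-scale expression values.
    genes : list of gene names, one per row of expr.

    Hypothetical score for a pair (i, j):
        sd_i + sd_j + |Pearson r(i, j)|
    Low standard deviation stands in for "stability"; low absolute
    correlation stands in for "absence of correlation".
    Rows with zero variance give NaN correlations and should be
    filtered out before calling this function.
    """
    expr = np.asarray(expr, dtype=float)
    sd = expr.std(axis=1, ddof=1)   # per-gene sample standard deviation
    corr = np.corrcoef(expr)        # gene x gene Pearson correlation matrix

    def score(pair):
        i, j = pair
        return sd[i] + sd[j] + abs(corr[i, j])

    i, j = min(combinations(range(len(genes)), 2), key=score)
    return genes[i], genes[j]


# Toy data: A and B are stable and uncorrelated, C varies strongly.
toy_expr = [
    [10.0, 10.1, 9.9, 10.0, 10.1, 9.9],  # A
    [8.1, 8.1, 8.1, 7.9, 7.9, 7.9],      # B
    [5.0, 9.0, 6.0, 10.0, 4.0, 8.0],     # C
]
toy_pair = pick_reference_pair(toy_expr, ["A", "B", "C"])
```

In practice the two terms would need to be put on comparable scales and weighted; the equal weighting here is a placeholder rather than a recommendation.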

Taylor SC, Nadeau K, Abbasi M, Lachance C, Nguyen M, Fenrich J. The Ultimate qPCR Experiment: Producing Publication Quality, Reproducible Data the First Time. Trends Biotechnol 2019;37:761-774. [PMID: 30654913 DOI: 10.1016/j.tibtech.2018.12.002] [Citation(s) in RCA: 352] [Impact Index Per Article: 70.4] [Received: 10/01/2018] [Revised: 11/30/2018] [Accepted: 12/07/2018] [Indexed: 12/20/2022]

Zacharias HU, Altenbuchinger M, Gronwald W. Statistical Analysis of NMR Metabolic Fingerprints: Established Methods and Recent Advances. Metabolites 2018;8:E47. [PMID: 30154338 PMCID: PMC6161311 DOI: 10.3390/metabo8030047] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Received: 07/02/2018] [Revised: 08/01/2018] [Accepted: 08/18/2018] [Indexed: 01/02/2023] Open

Hochrein J, Zacharias HU, Taruttis F, Samol C, Engelmann JC, Spang R, Oefner PJ, Gronwald W. Data Normalization of (1)H NMR Metabolite Fingerprinting Data Sets in the Presence of Unbalanced Metabolite Regulation. J Proteome Res 2015;14:3217-28. [PMID: 26147738 DOI: 10.1021/acs.jproteome.5b00192] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Indexed: 01/06/2023]

Vigelsø A, Dybboe R, Hansen CN, Dela F, Helge JW, Guadalupe Grau A. GAPDH and β-actin protein decreases with aging, making Stain-Free technology a superior loading control in Western blotting of human skeletal muscle. J Appl Physiol (1985) 2014;118:386-94. [PMID: 25429098 DOI: 10.1152/japplphysiol.00840.2014] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.9] [Indexed: 02/02/2023] Open

Mangat CS, Bharat A, Gehrke SS, Brown ED. Rank ordering plate data facilitates data visualization and normalization in high-throughput screening. J Biomol Screen 2014;19:1314-20. [PMID: 24828052 DOI: 10.1177/1087057114534298] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Indexed: 11/16/2022]

Fromer M, Purcell SM. Using XHMM Software to Detect Copy Number Variation in Whole-Exome Sequencing Data. Curr Protoc Hum Genet 2014;81:7.23.1-7.23.21. [PMID: 24763994 PMCID: PMC4065038 DOI: 10.1002/0471142905.hg0723s81] [Citation(s) in RCA: 92] [Impact Index Per Article: 9.2] [Indexed: 01/06/2023]

Zauber H, Schüler V, Schulze W. Systematic evaluation of reference protein normalization in proteomic experiments. Front Plant Sci 2013;4:25. [PMID: 23450762 PMCID: PMC3583035 DOI: 10.3389/fpls.2013.00025] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Received: 12/18/2012] [Accepted: 02/04/2013] [Indexed: 06/01/2023]

Kohl SM, Klein MS, Hochrein J, Oefner PJ, Spang R, Gronwald W. State-of-the art data normalization methods improve NMR-based metabolomic analysis. Metabolomics 2012;8:146-160. [PMID: 22593726 PMCID: PMC3337420 DOI: 10.1007/s11306-011-0350-z] [Citation(s) in RCA: 141] [Impact Index Per Article: 11.8] [Received: 04/05/2011] [Accepted: 08/01/2011] [Indexed: 12/20/2022]

Abstract

Extracting biomedical information from large metabolomic datasets by multivariate data analysis is of considerable complexity. Common challenges include, among others, screening for differentially produced metabolites, estimation of fold changes, and sample classification. Prior to these analysis steps, it is important to minimize contributions from unwanted biases and experimental variance. This is the goal of data preprocessing. In this work, different data normalization methods were compared systematically employing two different datasets generated by means of nuclear magnetic resonance (NMR) spectroscopy. To this end, two different types of normalization methods were used, one aiming to remove unwanted sample-to-sample variation while the other adjusts the variance of the different metabolites by variable scaling and variance stabilization methods. The impact of all methods tested on sample classification was evaluated on urinary NMR fingerprints obtained from healthy volunteers and patients suffering from autosomal dominant polycystic kidney disease (ADPKD). Performance in terms of screening for differentially produced metabolites was investigated on a dataset following a Latin-square design, where varied amounts of 8 different metabolites were spiked into a human urine matrix while keeping the total spike-in amount constant. In addition, specific tests were conducted to systematically investigate the influence of the different preprocessing methods on the structure of the analyzed data. In conclusion, preprocessing methods originally developed for DNA microarray analysis, in particular, Quantile and Cubic-Spline Normalization, performed best in reducing bias, accurately detecting fold changes, and classifying samples. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-011-0350-z) contains supplementary material, which is available to authorized users.
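For readers unfamiliar with the Quantile Normalization mentioned in the abstract above, a minimal Python sketch of the general technique follows. It is a generic textbook version, not the code used in the cited study: the function name `quantile_normalize` and the features-by-samples layout are assumptions, and tied values are not averaged as some implementations do.

```python
import numpy as np


def quantile_normalize(X):
    """Force every sample (column) to share the same value distribution.

    X : features x samples matrix (e.g. spectral bins x urine samples).

    Steps:
      1. sort each column,
      2. average the k-th smallest values across columns to get a
         reference distribution,
      3. write the reference values back in each column's original
         rank order.
    Ties are broken by sort order rather than averaged (a simplification).
    """
    X = np.asarray(X, dtype=float)
    order = np.argsort(X, axis=0)                     # row indices that sort each column
    sorted_vals = np.take_along_axis(X, order, axis=0)
    reference = sorted_vals.mean(axis=1)              # mean of k-th smallest values
    out = np.empty_like(X)
    np.put_along_axis(out, order, reference[:, None], axis=0)
    return out


# Toy matrix: 4 features measured in 3 samples.
toy = np.array([
    [5.0, 4.0, 3.0],
    [2.0, 1.0, 4.0],
    [3.0, 4.0, 6.0],
    [4.0, 2.0, 8.0],
])
toy_qn = quantile_normalize(toy)
```

After normalization, every column of `toy_qn` contains the same set of values, while the within-sample ranking of features is preserved.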
