1
|
Simonetti S, Iuliani M, Stellato M, Cavaliere S, Vincenzi B, Tonini G, Santini D, Pantano F. Extensive plasma proteomic profiling revealed receptor activator of nuclear factor kappa-Β ligand (RANKL) as emerging biomarker of nivolumab clinical benefit in patients with metastatic renal cell carcinoma. J Immunother Cancer 2022; 10:jitc-2022-005136. [PMID: 36104102 PMCID: PMC9476128 DOI: 10.1136/jitc-2022-005136] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/16/2022] [Indexed: 11/08/2022] Open
Abstract
Background The advent of immune checkpoint inhibitors (ICIs) have led to a paradigm change in the management of metastatic renal cell carcinoma (mRCC), nevertheless, the benefit of treatment is confined to a limited proportion of patients. Therefore, the identification of predictive biomarkers for response to ICIs represents an unmet clinical need. Here, we performed a large-scale plasma proteomic profile of patients with mRCC, treated with nivolumab, to identify soluble molecules potentially associated with clinical benefit. Methods We analyzed the levels of 507 soluble molecules in the pretreatment plasma of 16 patients with mRCC (discovery set) who received nivolumab therapy as a single agent. The ELISA assay was performed to confirm the protein level of candidate biomarkers associated to clinical benefit in 15 patients with mRCC (validation set). Survival curves of complete cohort were estimated by the Kaplan-Meier method and compared with the log-rank test. Results Out of 507 screened molecules, 135 factors were selected as expressed above background and 12 of them were significantly overexpressed in patients who did not benefit from treatment (non-responders (NR)) compared with responders (R) group. After multiplicity adjustment, receptor activator of nuclear factor kappa-Β ligand (RANKL) was the only molecule that retained the statistical significance (false discovery rate: 0.023). RANKL overexpression in NR patients was confirmed both in discovery (median NR: 528 pg/mL vs median R: 288 pg/mL, p=0.011) and validation set (median NR: 440 pg/mL vs median R: 253 pg/mL, p<0.001). Considering the complete cohort of patients (discovery+validation set), significantly higher RANKL levels were found in patients who primarily progressed from treatment compared with those who had a partial response (p=0.003) or stable disease (p=0.006). Moreover, patients with low RANKL levels had significant improvements in progression-free survival (median 14.0 months vs 3.4 months, p=0.004) and overall survival (median not reached vs 30.1 months, p=0.003). Conclusions Our exploratory study suggests RANKL as a novel independent biomarker of response and survival in patients with mRCC treated with nivolumab.
Collapse
Affiliation(s)
- Sonia Simonetti
- Medical Oncology Department, Campus Bio-Medico University, Roma, Italy
| | - Michele Iuliani
- Medical Oncology Department, Campus Bio-Medico University, Roma, Italy
| | - Marco Stellato
- Medical Oncology Department, Campus Bio-Medico University, Roma, Italy .,Department of Medical Oncology, Fondazione IRCCS Istituto Nazionale dei Tumori, Milano, Italy
| | - Silvia Cavaliere
- Medical Oncology Department, Campus Bio-Medico University, Roma, Italy
| | - Bruno Vincenzi
- Medical Oncology Department, Campus Bio-Medico University, Roma, Italy
| | - Giuseppe Tonini
- Medical Oncology Department, Campus Bio-Medico University, Roma, Italy
| | - Daniele Santini
- Medical Oncology Department, Campus Bio-Medico University, Roma, Italy.,UOC Oncologia Universitaria, Sapienza University of Rome - Polo Pontino, Latina, Italy
| | - Francesco Pantano
- Medical Oncology Department, Campus Bio-Medico University, Roma, Italy
| |
Collapse
|
2
|
Salamun V, Rizzo M, Lovrecic L, Hocevar K, Papler Burnik T, Janez A, Jensterle M, Vrtacnik Bokal E, Peterlin B, Maver A. The Endometrial Transcriptome of Metabolic and Inflammatory Pathways During the Window of Implantation Is Deranged in Infertile Obese Polycystic Ovarian Syndrome Women. Metab Syndr Relat Disord 2022; 20:384-394. [PMID: 35834645 DOI: 10.1089/met.2021.0149] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Introduction and Aim: Obese women with polycystic ovarian syndrome (PCOS) have a reduced rate of spontaneous conception even when their cycles are ovulatory. Endometrial receptivity is an important factor for poor implantation and increased miscarriage rates. Mechanisms in which both pathologies modify the endometrium are not fully clarified. The aim of our study was to compare the endometrial transcriptomic profiles between infertile obese PCOS (O-PCOS) women and infertile normal weight subjects during the window of implantation in ovulatory menstrual cycles. Methods: We conducted a prospective transcriptomic analysis of the endometrium using RNA sequencing. In this way, potential endometrial mechanisms leading to the poor reproductive outcome in O-PCOS patients could be characterized. Endometrial samples during days 21-23 of the menstrual cycle were collected from infertile O-PCOS women (n = 11) and normal weight controls (n = 10). Subgroups were defined according to the ovulatory/anovulatory status in the natural cycles, and O-PCOS women were grouped into the O-PCOS ovulatory (O-PCOS-ovul) subgroup. RNA isolation, sequencing with library reparation, and subsequent RNAseq data analysis were performed. Results: Infertile O-PCOS patients had 610 differentially expressed genes (DEGs), after adjustment for multiple comparisons with normal weight infertile controls, related to obesity (MXRA5 and ECM1), PCOS (ADAMTS19 and SLC18A2), and metabolism (VNN1 and PC). In the ovulatory subgroup, no DEGs were found, but significant differences in canonical pathways and the upstream regulator were revealed. According to functional and upstream analyses of ovulatory subgroup comparisons, the most important biological processes were related to inflammation (TNFR1 signaling), insulin signaling (insulin receptor signaling and PI3/AKT), fatty acid metabolism (stearate biosynthesis I and palmitate biosynthesis I), and lipotoxicity (unfolded protein response pathway). Conclusions: We demonstrated that endometrial transcription in ovulatory O-PCOS patients is deranged in comparison with the control ovulatory endometrium. The most important pathways of differentiation include metabolism and inflammation. These processes could also represent potential mechanisms for poor embryo implantation, which prevent the development of a successful pregnancy. ClinicalTrials.gov ID: NCT03353948.
Collapse
Affiliation(s)
- Vesna Salamun
- Division of Obstetrics and Gynecology, Department of Human Reproduction, University Medical Centre Ljubljana, Ljubljana, Slovenia.,Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Manfredi Rizzo
- Division of Endocrinology, Diabetes, and Metabolism, University of South Carolina School of Medicine, Columbia, South Carolina, USA.,Department of Laboratory Medicine, DIBIMIS, University of Palermo, Italy
| | - Luca Lovrecic
- Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia.,Clinical Institute of Medical Genetics, University Medical Centre Ljubljana, Ljubljana, Slovenia
| | - Keli Hocevar
- Clinical Institute of Medical Genetics, University Medical Centre Ljubljana, Ljubljana, Slovenia
| | - Tanja Papler Burnik
- Division of Obstetrics and Gynecology, Department of Human Reproduction, University Medical Centre Ljubljana, Ljubljana, Slovenia.,Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Andrej Janez
- Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia.,Department of Endocrinology, Diabetes and Metabolic Diseases, University Medical Centre Ljubljana, Ljubljana, Slovenia
| | - Mojca Jensterle
- Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia.,Department of Endocrinology, Diabetes and Metabolic Diseases, University Medical Centre Ljubljana, Ljubljana, Slovenia
| | - Eda Vrtacnik Bokal
- Division of Obstetrics and Gynecology, Department of Human Reproduction, University Medical Centre Ljubljana, Ljubljana, Slovenia.,Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Borut Peterlin
- Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia.,Clinical Institute of Medical Genetics, University Medical Centre Ljubljana, Ljubljana, Slovenia
| | - Ales Maver
- Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia.,Clinical Institute of Medical Genetics, University Medical Centre Ljubljana, Ljubljana, Slovenia
| |
Collapse
|
3
|
Large−Scale Profiling of Extracellular Vesicles Identified miR−625−5p as a Novel Biomarker of Immunotherapy Response in Advanced Non−Small−Cell Lung Cancer Patients. Cancers (Basel) 2022; 14:cancers14102435. [PMID: 35626040 PMCID: PMC9139420 DOI: 10.3390/cancers14102435] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 05/09/2022] [Accepted: 05/12/2022] [Indexed: 02/04/2023] Open
Abstract
Simple Summary Immune checkpoint inhibitors (ICIs) have revolutionized the treatment of advanced non-small-cell lung cancer (NSCLC) leading to substantial improvement in survival time and quality of life. Nevertheless, the clinical benefit of treatment is still limited to a minority of patients, reflecting the need to identify novel noninvasive biomarkers to improve patient selection. Currently available markers such as PD-L1 expression have important limitations. In this study, we focused on extracellular vesicles (EV)-associated miRNAs produced by cancer cells and their microenvironment that can be easily detected in blood. In particular, after a large-scale screening of 799 EV-miRNAs, we identified EV-miR-625-5p as a novel independent biomarker of response and survival in ICI-treated NSCLC patients, in particular in patients with PD-L1 expression ≥ 50%. EV-miR-625-5p integrated with PDL-1 test could allow the clinician to identify in advance patients that would benefit from ICIs. Abstract Immune checkpoint inhibitors (ICIs) are largely used in the treatment of patients with advanced non-small-cell lung cancer (NSCLC). Novel biomarkers that provide biological information that could be useful for clinical management are needed. In this respect, extracellular vesicles (EV)-associated microRNAs (miRNAs) that are the principal vehicle of intercellular communication may be important sources of biomarkers. We analyzed the levels of 799 EV-miRNAs in the pretreatment plasma of 88 advanced NSCLC patients who received anti-PD-1 therapy as single agent. After data normalization, we used a two-step approach to identify candidate biomarkers associated to both objective response (OR) by RECIST and longer overall survival (OS). Univariate and multivariate analyses including known clinicopathologic variables and new findings were performed. In our cohort, 24/88 (27.3%) patients showed OR by RECIST. Median OS in the whole cohort was 11.5 months. In total, 196 EV-miRNAs out 799 were selected as expressed above background. After multiplicity adjustment, abundance of EV-miR-625-5p was found to be correlated with PD-L1 expression and significantly associated to OR by RECIST (p = 0.0366) and OS (p = 0.0031). In multivariate analysis, PD-L1 staining and EV-miR-625-5p levels were constantly associated to OR and OS. Finally, we showed that EV-miR-625-5p levels could discriminate patients with longer survival, in particular in the class expressing PD-L1 ≥50%. EV-miRNAs represent a source of relevant biomarkers. EV-miR-625-5p is an independent biomarker of response and survival in ICI-treated NSCLC patients, in particular in patients with PD-L1 expression ≥50%.
Collapse
|
4
|
Pig Genomics and Genetics. Genes (Basel) 2021; 12:genes12111692. [PMID: 34828298 PMCID: PMC8623580 DOI: 10.3390/genes12111692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Accepted: 10/21/2021] [Indexed: 11/23/2022] Open
|
5
|
Huang Y, Oldham JM, Ma SF, Unterman A, Liao SY, Barros AJ, Bonham CA, Kim JS, Vij R, Adegunsoye A, Strek ME, Molyneaux PL, Maher TM, Herazo-Maya JD, Kaminski N, Moore BB, Martinez FJ, Noth I. Blood Transcriptomics Predicts Progression of Pulmonary Fibrosis and Associated Natural Killer Cells. Am J Respir Crit Care Med 2021; 204:197-208. [PMID: 33689671 DOI: 10.1164/rccm.202008-3093oc] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Rationale: Disease activity in idiopathic pulmonary fibrosis (IPF) remains highly variable, poorly understood, and difficult to predict. Objectives: To identify a predictor using short-term longitudinal changes in gene expression that forecasts future FVC decline and to characterize involved pathways and cell types. Methods: Seventy-four patients from COMET (Correlating Outcomes with Biochemical Markers to Estimate Time-Progression in IPF) cohort were dichotomized as progressors (≥10% FVC decline) or stable. Blood gene-expression changes within individuals were calculated between baseline and 4 months and regressed with future FVC status, allowing determination of expression variations, sample size, and statistical power. Pathway analyses were conducted to predict downstream effects and identify new targets. An FVC predictor for progression was constructed in COMET and validated using independent cohorts. Peripheral blood mononuclear single-cell RNA-sequencing data from healthy control subjects were used as references to characterize cell type compositions from bulk peripheral blood mononuclear RNA-sequencing data that were associated with FVC decline. Measurements and Main Results: The longitudinal model reduced gene-expression variations within stable and progressor groups, resulting in increased statistical power when compared with a cross-sectional model. The FVC predictor for progression anticipated patients with future FVC decline with 78% sensitivity and 86% specificity across independent IPF cohorts. Pattern recognition receptor pathways and mTOR pathways were downregulated and upregulated, respectively. Cellular deconvolution using single-cell RNA-sequencing data identified natural killer cells as significantly correlated with progression. Conclusions: Serial transcriptomic change predicts future FVC decline. An analysis of cell types involved in the progressor signature supports the novel involvement of natural killer cells in IPF progression.
Collapse
Affiliation(s)
- Yong Huang
- Division of Pulmonary and Critical Care Medicine, The University of Virginia, Charlottesville, Virginia
| | - Justin M Oldham
- Division of Pulmonary, Critical Care, and Sleep Medicine, The University of California at Davis, Sacramento, California
| | - Shwu-Fan Ma
- Division of Pulmonary and Critical Care Medicine, The University of Virginia, Charlottesville, Virginia
| | - Avraham Unterman
- Section of Pulmonary, Critical Care, and Sleep Medicine, Department of Medicine, Yale School of Medicine, Yale University, New Haven, Connecticut
| | - Shu-Yi Liao
- Department of Medicine, National Jewish Health, Denver, Colorado
| | - Andrew J Barros
- Division of Pulmonary and Critical Care Medicine, The University of Virginia, Charlottesville, Virginia
| | - Catherine A Bonham
- Division of Pulmonary and Critical Care Medicine, The University of Virginia, Charlottesville, Virginia
| | - John S Kim
- Division of Pulmonary and Critical Care Medicine, The University of Virginia, Charlottesville, Virginia
| | - Rekha Vij
- Section of Pulmonary and Critical Care Medicine and
| | - Ayodeji Adegunsoye
- Section of Pulmonary and Critical Care Medicine and.,Department of Human Genetics, Genetics, Genomic and Systems Biology, University of Chicago, Chicago, Illinois
| | - Mary E Strek
- Section of Pulmonary and Critical Care Medicine and
| | - Philip L Molyneaux
- National Heart and Lung Institute, Imperial College, London, United Kingdom.,Royal Brompton Hospital, London, United Kingdom
| | - Toby M Maher
- National Heart and Lung Institute, Imperial College, London, United Kingdom.,Royal Brompton Hospital, London, United Kingdom.,Division of Pulmonary, Critical Care and Sleep Medicine, Hastings Center for Pulmonary Research, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Jose D Herazo-Maya
- Division of Pulmonary, Critical Care, and Sleep Medicine, Tampa General Hospital, University of South Florida, Tampa, Florida
| | - Naftali Kaminski
- Section of Pulmonary, Critical Care, and Sleep Medicine, Department of Medicine, Yale School of Medicine, Yale University, New Haven, Connecticut
| | - Bethany B Moore
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, Michigan; and
| | - Fernando J Martinez
- Internal Medicine, Weill Cornell Medical College, Cornell University, New York, New York
| | - Imre Noth
- Division of Pulmonary and Critical Care Medicine, The University of Virginia, Charlottesville, Virginia
| |
Collapse
|
6
|
Wasinger VC, Curnoe D, Boel C, Machin N, Goh HM. The Molecular Floodgates of Stress-Induced Senescence Reveal Translation, Signalling and Protein Activity Central to the Post-Mortem Proteome. Int J Mol Sci 2020; 21:ijms21176422. [PMID: 32899302 PMCID: PMC7504133 DOI: 10.3390/ijms21176422] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 08/27/2020] [Accepted: 08/31/2020] [Indexed: 11/16/2022] Open
Abstract
The transitioning of cells during the systemic demise of an organism is poorly understood. Here, we present evidence that organismal death is accompanied by a common and sequential molecular flood of stress-induced events that propagate the senescence phenotype, and this phenotype is preserved in the proteome after death. We demonstrate activation of “death” pathways involvement in diseases of ageing, with biochemical mechanisms mapping onto neurological damage, embryonic development, the inflammatory response, cardiac disease and ultimately cancer with increased significance. There is sufficient bioavailability of the building blocks required to support the continued translation, energy, and functional catalytic activity of proteins. Significant abundance changes occur in 1258 proteins across 1 to 720 h post-mortem of the 12-week-old mouse mandible. Protein abundance increases concord with enzyme activity, while mitochondrial dysfunction is evident with metabolic reprogramming. This study reveals differences in protein abundances which are akin to states of stress-induced premature senescence (SIPS). The control of these pathways is significant for a large number of biological scenarios. Understanding how these pathways function during the process of cellular death holds promise in generating novel solutions capable of overcoming disease complications, maintaining organ transplant viability and could influence the findings of proteomics through “deep-time” of individuals with no historically recorded cause of death.
Collapse
Affiliation(s)
- Valerie C. Wasinger
- Bioanalytical Mass Spectrometry Facility, Mark Wainwright Analytical Centre, University of New South Wales Sydney, Kensington, NSW 2052, Australia
- Palaeontology, Geobiology and Earth Archives Research Centre, University of New South Wales Sydney, Kensington, NSW 2052, Australia; (C.B.); (N.M.); (H.M.G.)
- Correspondence: (V.C.W.); (D.C.)
| | - Darren Curnoe
- Palaeontology, Geobiology and Earth Archives Research Centre, University of New South Wales Sydney, Kensington, NSW 2052, Australia; (C.B.); (N.M.); (H.M.G.)
- ARC Centre of Excellence for Australian Biodiversity and Heritage, University of New South Wales Sydney, Kensington, NSW 2052, Australia
- Correspondence: (V.C.W.); (D.C.)
| | - Ceridwen Boel
- Palaeontology, Geobiology and Earth Archives Research Centre, University of New South Wales Sydney, Kensington, NSW 2052, Australia; (C.B.); (N.M.); (H.M.G.)
- ARC Centre of Excellence for Australian Biodiversity and Heritage, University of New South Wales Sydney, Kensington, NSW 2052, Australia
| | - Naomi Machin
- Palaeontology, Geobiology and Earth Archives Research Centre, University of New South Wales Sydney, Kensington, NSW 2052, Australia; (C.B.); (N.M.); (H.M.G.)
| | - Hsiao Mei Goh
- Palaeontology, Geobiology and Earth Archives Research Centre, University of New South Wales Sydney, Kensington, NSW 2052, Australia; (C.B.); (N.M.); (H.M.G.)
- ARC Centre of Excellence for Australian Biodiversity and Heritage, University of New South Wales Sydney, Kensington, NSW 2052, Australia
- Centre for Global Archaeological Research, University Sains Malaysia, Penang 11800, Malaysia
| |
Collapse
|
7
|
Izmirlian G. Strong consistency and asymptotic normality for quantities related to the Benjamini–Hochberg false discovery rate procedure. Stat Probab Lett 2020. [DOI: 10.1016/j.spl.2020.108713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
8
|
Zhou Y, Xu C, Zhu W, He H, Zhang L, Tang B, Zeng Y, Tian Q, Deng HW. Long Noncoding RNA Analyses for Osteoporosis Risk in Caucasian Women. Calcif Tissue Int 2019; 105:183-192. [PMID: 31073748 PMCID: PMC6712977 DOI: 10.1007/s00223-019-00555-8] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/23/2018] [Accepted: 04/16/2019] [Indexed: 02/07/2023]
Abstract
INTRODUCTION Osteoporosis is a prevalent bone metabolic disease characterized by bone fragility. As a key pathophysiological mechanism, the disease is caused by excessive bone resorption (by osteoclasts) over bone formation (by osteoblasts). Peripheral blood monocytes (PBMs) is a major systemic cell model for bone metabolism by serving as progenitors of osteoclasts and producing cytokines important for osteoclastogenesis. Protein-coding genes for osteoporosis have been widely studied by mRNA analyses of PBMs in high versus low hip bone mineral density (BMD) subjects. However, long noncoding RNAs (lncRNAs), which account for a large proportion of human transcriptome, have seldom been studied. METHODS In this study, microarray analyses of monocytes were performed using Affymetrix exon 1.0 ST arrays in 73 Caucasian females (age: 47-56). LncRNA profile was generated by re-annotating exon array for lncRNAs detection, which yielded 12,007 lncRNAs mapped to the human genome. RESULTS 575 lncRNAs were differentially expressed between the two groups. In the high BMD subjects, 309 lncRNAs were upregulated and 266 lncRNAs were downregulated (nominally significant, raw p-value < 0.05). To investigate the relationship between mRNAs and lncRNAs, we used two approaches to predict the target genes of lncRNAs and found that 26 candidate lncRNAs might regulate mRNA expression. The majority of these lncRNAs were further validated to be potentially correlated with BMD by GWAS analysis. CONCLUSION Overall, our findings for the first time reported the lncRNAs profiles for osteoporosis and suggested the potential regulatory mechanism of lncRNAs on protein-coding genes in bone metabolism.
Collapse
Affiliation(s)
- Yu Zhou
- Center of Genomics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA
- Department of Cell and Molecular Biology, Tulane University, New Orleans, LA, 70118, USA
| | - Chao Xu
- Center of Genomics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA
- Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA
| | - Wei Zhu
- Center of Genomics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA
- Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA
| | - Hao He
- Center of Genomics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA
- Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA
| | - Lan Zhang
- Center of Genomics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA
- Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA
| | - Beisha Tang
- School of Basic Medical Science, National Clinical Research Center for Geriatric Diseases, Xiangya Hospital, Central South University, Changsha, 410078, Hunan, China
| | - Yong Zeng
- Center of Genomics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA
- Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA
| | - Qing Tian
- Center of Genomics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA
- Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA
| | - Hong-Wen Deng
- Center of Genomics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA.
- Department of Biostatistics and Bioinformatics, Tulane University, New Orleans, LA, 70112, USA.
- School of Basic Medical Science, National Clinical Research Center for Geriatric Diseases, Xiangya Hospital, Central South University, Changsha, 410078, Hunan, China.
- Center for Bioinformatics and Genomics, Department of Biostatistics and Bioinformatics, School of Public Health and Tropical Medicine, Tulane University, 1440 Canal St., RM 1619F, New Orleans, LA, 70112, USA.
| |
Collapse
|
9
|
Alleva R, Tognù A, Tomasetti M, Benassi MS, Pazzaglia L, van Oven H, Viganò E, De Simone N, Pacini I, Giannone S, Gagic S, Borghi R, Picone S, Borghi B. Effect of different anaesthetic techniques on gene expression profiles in patients who underwent hip arthroplasty. PLoS One 2019; 14:e0219113. [PMID: 31344051 PMCID: PMC6657832 DOI: 10.1371/journal.pone.0219113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Accepted: 06/15/2019] [Indexed: 11/18/2022] Open
Abstract
Objectives To investigate the modulation of genes whose expression level is indicative of stress and toxicity following exposure to three anaesthesia techniques, general anaesthesia (GA), regional anaesthesia (RA), or integrated anaesthesia (IA). Methods Patients scheduled for hip arthroplasty receiving GA, RA and IA were enrolled at Rizzoli Orthopaedic Institute of Bologna, Italy and the expression of genes involved in toxicology were evaluated in peripheral blood mononuclear cells (PBMCs) collected before (T0), immediately after surgery (T1), and on the third day (T2) after surgery in association with biochemical parameters. Results All three anaesthesia methods proved safe and reliable in terms of pain relief and patient recovery. Gene ontology analysis revealed that GA and mainly IA were associated with deregulation of DNA repair system and stress-responsive genes, which was observed even after 3-days from anaesthesia. Conversely, RA was not associated with substantial changes in gene expression. Conclusions Based on the gene expression analysis, RA technique showed the smallest toxicological effect in hip arthroplasty. Trial registration ClinicalTrials.gov number NCT03585647.
Collapse
Affiliation(s)
- Renata Alleva
- Department of Biomedical and Neuromotor Sciences, University of Bologna, Bologna, Italy
- * E-mail:
| | - Andrea Tognù
- Department of Anaesthesia and Postoperative Intensive Care, Rizzoli Orthopaedic Institute, Bologna, Italy
| | - Marco Tomasetti
- Department of Clinical and Molecular Sciences, Polytechnic University of Marche, Ancona, Italy
| | - Maria Serena Benassi
- Laboratory of Experimental Oncology, Rizzoli Orthopaedic Institute, Bologna, Italy
| | - Laura Pazzaglia
- Laboratory of Experimental Oncology, Rizzoli Orthopaedic Institute, Bologna, Italy
| | - Hanna van Oven
- Department of Anaesthesia and Postoperative Intensive Care, Rizzoli Orthopaedic Institute, Bologna, Italy
| | - Ettore Viganò
- Department of Anaesthesia and Postoperative Intensive Care, Rizzoli Orthopaedic Institute, Bologna, Italy
| | - Nicola De Simone
- Department of Anaesthesia and Postoperative Intensive Care, Rizzoli Orthopaedic Institute, Bologna, Italy
| | - Ilaria Pacini
- Department of Anaesthesia and Postoperative Intensive Care, Rizzoli Orthopaedic Institute, Bologna, Italy
| | - Sandra Giannone
- Department of Anaesthesia and Postoperative Intensive Care, Rizzoli Orthopaedic Institute, Bologna, Italy
| | - Sanjin Gagic
- Department of Biomedical and Neuromotor Sciences, University of Bologna, Bologna, Italy
- Research Unit of Anaesthesia and Pain Therapy, Rizzoli Orthopaedic Institute, Bologna, Italy
| | - Raffaele Borghi
- Department of Anaesthesia and Postoperative Intensive Care, Rizzoli Orthopaedic Institute, Bologna, Italy
| | - Sara Picone
- Department of Biomedical and Neuromotor Sciences, University of Bologna, Bologna, Italy
- Research Unit of Anaesthesia and Pain Therapy, Rizzoli Orthopaedic Institute, Bologna, Italy
| | - Battista Borghi
- Department of Biomedical and Neuromotor Sciences, University of Bologna, Bologna, Italy
- Research Unit of Anaesthesia and Pain Therapy, Rizzoli Orthopaedic Institute, Bologna, Italy
| |
Collapse
|
10
|
Mukherjee S, Laiakis EC, Fornace AJ, Amundson SA. Impact of inflammatory signaling on radiation biodosimetry: mouse model of inflammatory bowel disease. BMC Genomics 2019; 20:329. [PMID: 31046668 PMCID: PMC6498469 DOI: 10.1186/s12864-019-5689-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Accepted: 04/11/2019] [Indexed: 12/19/2022] Open
Abstract
Background Ionizing Radiation (IR) is a known pro-inflammatory agent and in the process of development of biomarkers for radiation biodosimetry, a chronic inflammatory disease condition could act as a confounding factor. Hence, it is important to develop radiation signatures that can distinguish between IR-induced inflammatory responses and pre-existing disease. In this study, we compared the gene expression response of a genetically modified mouse model of inflammatory bowel disease (Il10−/−) with that of a normal wild-type mouse to potentially develop transcriptomics-based biodosimetry markers that can predict radiation exposure in individuals regardless of pre-existing inflammatory condition. Results Wild-type (WT) and Il10−/− mice were exposed to whole body irradiation of 7 Gy X-rays. Gene expression responses were studied using high throughput whole genome microarrays in peripheral blood 24 h post-irradiation. Analysis resulted in identification of 1962 and 1844 genes differentially expressed (p < 0.001, FDR < 10%) after radiation exposure in Il10−/− and WT mice respectively. A set of 155 genes was also identified as differentially expressed between WT and Il10−/− mice at the baseline pre-irradiation level. Gene ontology analysis revealed that the 155 baseline differentially expressed genes were mainly involved in inflammatory response, glutathione metabolism and collagen deposition. Analysis of radiation responsive genes revealed that innate immune response and p53 signaling processes were strongly associated with up-regulated genes, whereas B-cell development process was found to be significant amongst downregulated genes in the two genotypes. However, specific immune response pathways like MHC based antigen presentation, interferon signaling and hepatic fibrosis were associated with radiation responsive genes in Il10−/− mice but not WT mice. Further analysis using the IPA prediction tool revealed significant differences in the predicted activation status of T-cell mediated signaling as well as regulators of inflammation between WT and Il10−/− after irradiation. Conclusions Using a mouse model we established that an inflammatory disease condition could affect the expression of many radiation responsive genes. Nevertheless, we identified a panel of genes that, regardless of disease condition, could predict radiation exposure. Our results highlight the need for consideration of pre-existing conditions in the population in the process of development of reliable biodosimetry markers. Electronic supplementary material The online version of this article (10.1186/s12864-019-5689-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Sanjay Mukherjee
- Center for Radiological Research, Columbia University Irving Medical Center, New York, NY, 10032, USA.
| | - Evagelia C Laiakis
- Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC, 20057, USA.,Department of Biochemistry and Molecular & Cell Biology, Georgetown University, Washington, DC, 20057, USA
| | - Albert J Fornace
- Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC, 20057, USA.,Department of Biochemistry and Molecular & Cell Biology, Georgetown University, Washington, DC, 20057, USA
| | - Sally A Amundson
- Center for Radiological Research, Columbia University Irving Medical Center, New York, NY, 10032, USA
| |
Collapse
|
11
|
Meerson A, Najjar A, Saad E, Sbeit W, Barhoum M, Assy N. Sex Differences in Plasma MicroRNA Biomarkers of Early and Complicated Diabetes Mellitus in Israeli Arab and Jewish Patients. Noncoding RNA 2019; 5:E32. [PMID: 30959814 PMCID: PMC6631160 DOI: 10.3390/ncrna5020032] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Revised: 03/29/2019] [Accepted: 04/04/2019] [Indexed: 12/15/2022] Open
Abstract
MicroRNAs play functional roles in the etiology of type 2 diabetes mellitus (T2DM) and complications, and extracellular microRNAs have attracted interest as potential biomarkers of these conditions. We aimed to identify a set of plasma microRNAs, which could serve as biomarkers of T2DM and complications in a mixed Israeli Arab/Jewish patient sample. Subjects included 30 healthy volunteers, 29 early-stage T2DM patients, and 29 late-stage T2DM patients with renal and/or vascular complications. RNA was isolated from plasma, and the levels of 12 candidate microRNAs were measured by quantitative reverse transcription and polymerase chain reaction (qRT-PCR). MicroRNA levels were compared between the groups and correlated to clinical measurements, followed by stepwise regression analysis and discriminant analysis. Plasma miR-486-3p and miR-423 were respectively up- and down-regulated in T2DM patients compared to healthy controls. MiR-28-3p and miR-423 were up-regulated in patients with complicated T2DM compared to early T2DM, while miR-486-3p was down-regulated. Combined, four microRNAs (miR-146a-5p, miR-16-2-3p, miR-126-5p, and miR-30d) could distinguish early from complicated T2DM with 77% accuracy and 79% sensitivity. In male patients only, the same microRNAs, with the addition of miR-423, could distinguish early from complicated T2DM with 83.3% accuracy. Furthermore, plasma microRNA levels showed significant correlations with clinical measurements, and these differed between men and women. Additionally, miR-183-5p levels differed significantly between the ethnic groups. Our study identified a panel of specific plasma microRNAs which can serve as biomarkers of T2DM and its complications and emphasizes the importance of sex differences in their clinical application.
Collapse
Affiliation(s)
- Ari Meerson
- MIGAL Galilee Research Institute, Kiryat Shmona 1101602, Israel.
- Tel Hai Academic College, Upper Galilee 1220800, Israel.
| | - Azwar Najjar
- Department of Internal Medicine A, Galilee Medical Center, Nahariya, Israel.
| | - Elias Saad
- Department of Internal Medicine A, Galilee Medical Center, Nahariya, Israel.
| | - Wisam Sbeit
- Department of Gastroenterology, Galilee Medical Center, Nahariya, Israel.
| | | | - Nimer Assy
- Department of Internal Medicine A, Galilee Medical Center, Nahariya, Israel.
- The Azrieli Faculty of Medicine, Bar Ilan University, Safed, Israel.
| |
Collapse
|
12
|
Zhao Y, Dantony E, Roy P. Optimism Bias Correction in Omics Studies with Big Data: Assessment of Penalized Methods on Simulated Data. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2019; 23:207-213. [PMID: 30794050 DOI: 10.1089/omi.2018.0191] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Big Data generated by omics technologies require simultaneous analyses of large numbers of variables. This leads to complex model selection and parameter estimates that show optimism bias. This study on simulated data sets examined optimism-bias correction by penalty regression methods in case-control studies that involve clinical and omics variables. Least absolute shrinkage and selection operator (LASSO)-based methods (LASSO-penalized logistic regression, adaptive LASSO, and regularized LASSO for selection + ridge regression) were evaluated using power, the false positive rate (FPR), false discovery rate (FDR), and by estimated versus theoretical parameter comparisons. The "ordinary" LASSO overcorrects the optimism bias. The adaptive LASSO with LASSO estimation of the weights was unable to provide a sufficient correction. Importantly, the adaptive LASSO with ridge estimation of the weights showed the best parameter estimation. The regularized LASSO selection showed a slight optimism bias that decreased with the increase in the training set size. The optimism bias decreased with the increase of the number of variables selected among truly differentially expressed variables; however, power, FPR, and FDR were correlated. A compromise between model selection and estimation accuracy should be found. These results might prove useful because Big Data analyses are becoming commonplace in omics/multiomics studies in integrative biology, precision medicine, and planetary health.
Collapse
Affiliation(s)
- Yubing Zhao
- 1 Service de Biostatistique-Bioinformatique, Pôle Santé Publique, Hospices Civils de Lyon, Lyon, France.,2 Université de Lyon, Lyon, France.,3 Université Claude Bernard (Lyon 1), Villeurbanne, France.,4 CNRS UMR 5558, Laboratoire de Biométrie et Biologie Évolutive, Équipe Biostatistique Santé, Villeurbanne, France
| | - Emmanuelle Dantony
- 1 Service de Biostatistique-Bioinformatique, Pôle Santé Publique, Hospices Civils de Lyon, Lyon, France.,2 Université de Lyon, Lyon, France.,3 Université Claude Bernard (Lyon 1), Villeurbanne, France.,4 CNRS UMR 5558, Laboratoire de Biométrie et Biologie Évolutive, Équipe Biostatistique Santé, Villeurbanne, France
| | - Pascal Roy
- 1 Service de Biostatistique-Bioinformatique, Pôle Santé Publique, Hospices Civils de Lyon, Lyon, France.,2 Université de Lyon, Lyon, France.,3 Université Claude Bernard (Lyon 1), Villeurbanne, France.,4 CNRS UMR 5558, Laboratoire de Biométrie et Biologie Évolutive, Équipe Biostatistique Santé, Villeurbanne, France
| |
Collapse
|
13
|
Courcoulas AP, Stefater MA, Shirley E, Gourash WF, Stylopoulos N. The Feasibility of Examining the Effects of Gastric Bypass Surgery on Intestinal Metabolism: Prospective, Longitudinal Mechanistic Clinical Trial. JMIR Res Protoc 2019; 8:e12459. [PMID: 30679147 PMCID: PMC6483060 DOI: 10.2196/12459] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Revised: 12/05/2018] [Accepted: 12/09/2018] [Indexed: 01/01/2023] Open
Abstract
Background Bariatric surgery, especially Roux-en-Y gastric bypass (RYGB), is the best treatment for severe obesity and its complications including type 2 diabetes mellitus (T2DM). Understanding the mechanisms responsible for the beneficial metabolic effects will help to engineer ways to improve the procedure or produce these effects without surgery. Objective The aim is to present data on recruitment and feasibility of a translational study designed to collect intestinal samples before and after bariatric surgery. The goal of biobanking is to allow future studies to test the hypothesis that the mechanism of action of RYGB involves specific changes in the postsurgical short- and long-term metabolism and morphology of the jejunum (Roux limb). Specifically, to test whether the intestine enhances its metabolism and activity after RYGB and increases its fuel utilization, we designed a prospective, longitudinal study, which involved the recruitment of candidates for RYGB with and without T2DM. We describe the tissue bank that we have generated, and our experience, hoping to further facilitate the performance of longitudinal mechanistic studies in human patients undergoing bariatric surgery and especially those involving post-RYGB intestinal biology. Methods We conducted a trial to characterize the effects of RYGB on intestinal metabolism. Intestinal tissue samples were collected from the jejunum at surgery, 1, 6, and 12 months postoperatively for the analysis of intestinal gene expression and metabolomic and morphologic changes. The target number of patients who completed at least the 6-month follow-up was 26, and we included a 20% attrition rate, increasing the total number to 32. Results To enroll 26 patients, we had to approach 79 potential participants. A total of 37 agreed to participate and started the study; 33, 30, and 26 active participants completed their 1-month, 6-month, and 12-month studies, respectively. Three participants withdrew, and 30 participants are still active. Altruism and interest in research were the most common reasons for participation. Important factors for feasibility and successful retention included (1) large volume case flow, (2) inclusion and exclusion criteria broad enough to capture a large segment of the patient population but narrow enough to ensure the completion of study aims and protection of safety concerns, (3) accurate assessment of willingness and motivation to participate in a study, (4) seamless integration of the recruitment process into normal clinical flow, (5) financial reimbursement and nonfinancial rewards and gestures of appreciation, and (6) nonburdensome follow-up visits and measures and reasonable time allotted. Conclusions Human translational studies of the intestinal mechanisms of metabolic and weight changes after bariatric surgery are important and feasible. A tissue bank with unique samples has been established that could be used by investigators in many research fields, further enabling mechanistic studies on the effects of bariatric surgery. Trial Registration ClinicalTrials.gov NCT02710370; https://clinicaltrials.gov/ct2/show/NCT02710370 (Archived by WebCite at http://www.webcitation.org/75HrQT8Dl)
Collapse
Affiliation(s)
| | | | - Eleanor Shirley
- University of Pittsburgh Medical Center, Pittsburgh, PA, United States
| | - William F Gourash
- University of Pittsburgh Medical Center, Pittsburgh, PA, United States
| | | |
Collapse
|
14
|
Lin CW, Liao SG, Liu P, Park YS, Lee MLT, Tseng GC. RNASeqDesign: A framework for RNA-Seq genome-wide power calculation and study design issues. J R Stat Soc Ser C Appl Stat 2018; 68:683-704. [PMID: 33692596 DOI: 10.1111/rssc.12330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Massively parallel sequencing (a.k.a. next-generation sequencing, NGS) technology has emerged as a powerful tool in characterizing genomic profiles. Among many NGS applications, RNA sequencing (RNA-Seq) has gradually become a standard tool for global transcriptomic monitoring. Although the cost of NGS experiments has dropped constantly, the high sequencing cost and bioinformatic complexity are still obstacles for many biomedical projects. Unlike earlier fluorescence-based technologies such as microarray, modelling of NGS data should consider discrete count data. In addition to sample size, sequencing depth also directly relates to the experimental cost. Consequently, given total budget and pre-specified unit experimental cost, the study design issue in RNA-Seq is conceptually a more complex multi-dimensional constrained optimization problem rather than one-dimensional sample size calculation in traditional hypothesis setting. In this paper, we propose a statistical framework, namely "RNASeqDesign", to utilize pilot data for power calculation and study design of RNA-Seq experiments. The approach is based on mixture model fitting of p-value distribution from pilot data and a parametric bootstrap procedure based on approximated Wald test statistics to infer genome-wide power for optimal sample size and sequencing depth. We further illustrate five practical study design tasks for practitioners. We perform simulations and three real applications to evaluate the performance and compare to existing methods.
Collapse
Affiliation(s)
- Chien-Wei Lin
- Division of Biostatistics, Medical College of Wisconsin, Milwaukee, WI 53226
| | - Serena G Liao
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA 15261
| | - Peng Liu
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA 15261
| | - Yong Seok Park
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA 15261
| | - Mei-Ling Ting Lee
- Department of Epidemiology and Biostatistics, University of Maryland, College Park, MD 20742
| | - George C Tseng
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA 15261
| |
Collapse
|
15
|
Volk M, Maver A, Hodžić A, Lovrečić L, Peterlin B. Transcriptome Profiling Uncovers Potential Common Mechanisms in Fetal Trisomies 18 and 21. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2018; 21:565-570. [PMID: 29049012 PMCID: PMC5655413 DOI: 10.1089/omi.2017.0123] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Human trisomies have recently been investigated using transcriptomics approaches to identify the gene expression (GE) signatures characteristic of each of these specific aneuploidy conditions. We hypothesized that the viability of cells with gross genomic imbalances might be associated with the activation of resilience mechanisms that are common to different trisomies and that are reflected by specific shared GE patterns. We report in this article our microarray GE analyses of amniocytes from fetuses with viable trisomy conditions, trisomy 21 or trisomy 18, to detect such common expression signatures. Comparative analysis of significantly differentially expressed genes in trisomies 18 and 21 revealed six dysregulated genes common to both: OTUD5, ADAMTSL1, TADA2A, PPID, PIAS2, and MAPRE2. These genes are involved in ubiquitination, protein folding, cell proliferation, and apoptosis. Pathway-based enrichment analyses demonstrated that both trisomies showed dysregulation of the PI3K/AKT pathway, cell cycle G2/M DNA damage checkpoint regulation, and cell death and survival, as well as inhibition of the upstream regulator TP53. Our data collectively suggest that trisomies 18 and 21 share common functional GE signatures, implying that common mechanisms of resilience might be activated in aneuploid cells to resist large genomic imbalances. To the best of our knowledge, this is the first study to use global GE profiling data to identify potential common mechanisms in fetal trisomies. Studies of other trisomies using transcriptomics and multiomics approaches might further clarify mechanisms activated in trisomy syndromes.
Collapse
Affiliation(s)
- Marija Volk
- Clinical Institute of Medical Genetics, University Medical Centre Ljubljana , Ljubljana, Slovenia
| | - Aleš Maver
- Clinical Institute of Medical Genetics, University Medical Centre Ljubljana , Ljubljana, Slovenia
| | - Alenka Hodžić
- Clinical Institute of Medical Genetics, University Medical Centre Ljubljana , Ljubljana, Slovenia
| | - Luca Lovrečić
- Clinical Institute of Medical Genetics, University Medical Centre Ljubljana , Ljubljana, Slovenia
| | - Borut Peterlin
- Clinical Institute of Medical Genetics, University Medical Centre Ljubljana , Ljubljana, Slovenia
| |
Collapse
|
16
|
Abstract
High-throughput sequencing assays have become an increasingly common part of biological research across multiple fields. Even as the resulting sequences pile up in public databases, it is not always obvious how to make use of these data sets. Functional genomics offers approaches to integrate these "big" data into our understanding of rheumatic diseases. This review aims to provide a primer on thinking about big data from functional genomics in the context of rheumatology, using examples from the field's literature as well as the author's own work to illustrate the execution of functional genomics research. Study design is crucial to ensure the right samples are used to address the question of interest. In addition, sequencing assays produce a variety of data types, from gene expression to 3D chromatin structure and single-cell technologies, that can be integrated into a model of the underlying gene regulatory networks. The best approach for this analysis uses the scientific process: bioinformatic methods should be used in an iterative, hypothesis-driven manner to uncover the disease mechanism. Finally, the future of functional genomics will see big data fully integrated into rheumatology, leading to computationally trained researchers and interactive databases. The goal of this review is not to provide a manual, but to enhance the familiarity of readers with functional genomic approaches and provide a better sense of the challenges and possibilities.
Collapse
Affiliation(s)
- Deborah R Winter
- Department of Medicine, Division of Rheumatology, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA.
| |
Collapse
|
17
|
Rudqvist N, Laiakis EC, Ghandhi SA, Kumar S, Knotts JD, Chowdhury M, Fornace AJ, Amundson SA. Global Gene Expression Response in Mouse Models of DNA Repair Deficiency after Gamma Irradiation. Radiat Res 2018; 189:337-344. [PMID: 29351057 DOI: 10.1667/rr14862.1] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
In the event of an improvised nuclear device or "dirty bomb" in a highly populated area, potentially hundreds of thousands of people will require screening to ensure that exposed individuals receive appropriate treatment. For this reason, there is a need to develop tools for high-throughput radiation biodosimetry. Gene expression represents an emerging approach to biodosimetry and could potentially provide an estimate of both absorbed dose and individual radiation-induced injury. Since approximately 2-4% of humans are thought to be radiosensitive, and would suffer greater radiological injury at a given dose than members of the general population, it is of interest to explore the potential impact of such sensitivity on the biodosimetric gene expression signatures being developed. In this study, we used wild-type mice and genetically engineered mouse models deficient in two DNA repair pathways that can contribute to radiation sensitivity to estimate the maximum effect of differences in radiosensitivity. We compared gene expression in response to a roughly equitoxic (LD50/30) dose of gamma rays in wild-type C57BL/6 (8 Gy) and DNA double-strand break repair-deficient Atm-/- (4 Gy) and Prkdcscid (3 Gy) mutants of C57BL/6. Overall, 780 genes were significantly differentially expressed in wild-type mice one day postirradiation, 232 in Atm-/- and 269 in Prkdcscid. Upstream regulators including TP53 and NFκB were predicted to be activated by radiation exposure in the wild-type mice, but not in either of the DNA repair-deficient mutant strains. There was also a significant muting of the apparent inflammatory response triggered by radiation in both mutant strains. These differences impacted the ability of gene expression signatures developed in wild-type mice to detect potentially fatal radiation exposure in the DNA repair-deficient mice, with the greatest impact on Atm-/- mice. However, the inclusion of mutant mice in gene selection vastly improved performance of the classifiers.
Collapse
Affiliation(s)
- Nils Rudqvist
- a Center for Radiological Research, Columbia University Medical Center, Columbia University, New York, New York
| | - Evagelia C Laiakis
- b Department of Biochemistry and Molecular and Cellular Biology, Georgetown University, Washington, DC
| | - Shanaz A Ghandhi
- a Center for Radiological Research, Columbia University Medical Center, Columbia University, New York, New York
| | - Suresh Kumar
- a Center for Radiological Research, Columbia University Medical Center, Columbia University, New York, New York
| | - Jeffrey D Knotts
- a Center for Radiological Research, Columbia University Medical Center, Columbia University, New York, New York
| | - Mashkura Chowdhury
- a Center for Radiological Research, Columbia University Medical Center, Columbia University, New York, New York
| | - Albert J Fornace
- b Department of Biochemistry and Molecular and Cellular Biology, Georgetown University, Washington, DC
| | - Sally A Amundson
- a Center for Radiological Research, Columbia University Medical Center, Columbia University, New York, New York
| |
Collapse
|
18
|
Oral mucosa tissue gene expression profiling before, during, and after radiation therapy for tonsil squamous cell carcinoma. PLoS One 2018; 13:e0190709. [PMID: 29338018 PMCID: PMC5770028 DOI: 10.1371/journal.pone.0190709] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2017] [Accepted: 12/19/2017] [Indexed: 12/19/2022] Open
Abstract
Background Radiation-therapy (RT) induces mucositis, a clinically challenging condition with limited prophylactic interventions and no predictive tests. In this pilot study, we applied global gene-expression analysis on serial human oral mucosa tissue and blood cells from patients with tonsil squamous cell cancer (TSCC) to identify genes involved in mucositis pathogenesis. Methods and findings Eight patients with TSCC each provided consecutive buccal biopsies and blood cells before, after 7 days of RT treatment, and 20 days following RT. We monitored clinical mucositis and performed gene-expression analysis on tissue samples. We obtained control tissue from nine healthy individuals. After RT, expression was upregulated in apoptosis inducer and inhibitor genes, EDA2R and MDM2, and in POLH, a DNA-repair polymerase. Expression was downregulated in six members of the histone cluster family, e.g., HIST1H3B. Gene expression related to proliferation and differentiation was altered, including MKI67 (downregulated), which encodes the Ki-67-proliferation marker, and KRT16 (upregulated), which encodes keratin16. These alterations were not associated with the clinical mucositis grade. However, the expression of LY6G6C, which encodes a surface immunoregulatory protein, was upregulated before treatment in three cases of clinical none/mild mucositis, but not in four cases of ulcerative mucositis. Conclusion RT caused molecular changes related to apoptosis, DNA-damage, DNA-repair, and proliferation without a correlation to the severity of clinical mucositis. LY6G6C may be a potential protective biomarker for ulcerative mucositis. Based on these results, our study model of consecutive human biopsies will be useful in designing a prospective clinical validation trial to characterize molecular mucositis and identify predictive biomarkers.
Collapse
|
19
|
Marcussen M, Bødker JS, Christensen HS, Johansen P, Nielsen S, Christiansen I, Bergmann OJ, Bøgsted M, Dybkær K, Vyberg M, Johnsen HE. Molecular Characteristics of High-Dose Melphalan Associated Oral Mucositis in Patients with Multiple Myeloma: A Gene Expression Study on Human Mucosa. PLoS One 2017; 12:e0169286. [PMID: 28052121 PMCID: PMC5215401 DOI: 10.1371/journal.pone.0169286] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2016] [Accepted: 12/14/2016] [Indexed: 11/19/2022] Open
Abstract
Background Toxicity of the oral and gastrointestinal mucosa induced by high-dose melphalan is a clinical challenge with no documented prophylactic interventions or predictive tests. The aim of this study was to describe molecular changes in human oral mucosa and to identify biomarkers correlated with the grade of clinical mucositis. Methods and Findings Ten patients with multiple myeloma (MM) were included. For each patient, we acquired three buccal biopsies, one before, one at 2 days, and one at 20 days after high-dose melphalan administration. We also acquired buccal biopsies from 10 healthy individuals that served as controls. We analyzed the biopsies for global gene expression and performed an immunohistochemical analysis to determine HLA-DRB5 expression. We evaluated associations between clinical mucositis and gene expression profiles. Compared to gene expression levels before and 20 days after therapy, at two days after melphalan treatment, we found gene regulation in the p53 and TNF pathways (MDM2, INPPD5, TIGAR), which favored anti-apoptotic defense, and upregulation of immunoregulatory genes (TREM2, LAMP3) in mucosal dendritic cells. This upregulation was independent of clinical mucositis. HLA-DRB1 and HLA-DRB5 (surface receptors on dendritic cells) were expressed at low levels in all patients with MM, in the subgroup of patients with ulcerative mucositis (UM), and in controls; in contrast, the subgroup with low-grade mucositis (NM) displayed 5–6 fold increases in HLA-DRB1 and HLA-DRB5 expression in the first two biopsies, independent of melphalan treatment. Moreover, different splice variants of HLA-DRB1 were expressed in the UM and NM subgroups. Conclusions Our results revealed that, among patients with MM, immunoregulatory genes and genes involved in defense against apoptosis were affected immediately after melphalan administration, independent of the presence of clinical mucositis. Furthermore, our results suggested that the expression levels of HLA-DRB1 and HLA-DRB5 may serve as potential predictive biomarkers for mucositis severity.
Collapse
Affiliation(s)
- Mette Marcussen
- Department of Clinical Medicine, Aalborg University, Sdr. Skovvej 15, Aalborg, Denmark
- Clinical Cancer Research Center, Aalborg University Hospital, Sdr. Skovvej 15, Aalborg, Denmark
- Department of Haematology, Aalborg University Hospital, Hobrovej 18–22, Aalborg, Denmark
- * E-mail:
| | - Julie Støve Bødker
- Clinical Cancer Research Center, Aalborg University Hospital, Sdr. Skovvej 15, Aalborg, Denmark
- Department of Haematology, Aalborg University Hospital, Hobrovej 18–22, Aalborg, Denmark
| | - Heidi Søgaard Christensen
- Clinical Cancer Research Center, Aalborg University Hospital, Sdr. Skovvej 15, Aalborg, Denmark
- Department of Haematology, Aalborg University Hospital, Hobrovej 18–22, Aalborg, Denmark
| | - Preben Johansen
- Department of Pathology, Aalborg University Hospital, Ladegaardsgade 3, Denmark
| | - Søren Nielsen
- Department of Pathology, Aalborg University Hospital, Ladegaardsgade 3, Denmark
| | - Ilse Christiansen
- Department of Haematology, Aalborg University Hospital, Hobrovej 18–22, Aalborg, Denmark
| | - Olav Jonas Bergmann
- School of Dentistry, Faculty of Health Science, Aarhus University; Vennelyst Boulevard 9, Aarhus, Denmark
| | - Martin Bøgsted
- Department of Clinical Medicine, Aalborg University, Sdr. Skovvej 15, Aalborg, Denmark
- Clinical Cancer Research Center, Aalborg University Hospital, Sdr. Skovvej 15, Aalborg, Denmark
- Department of Haematology, Aalborg University Hospital, Hobrovej 18–22, Aalborg, Denmark
| | - Karen Dybkær
- Department of Clinical Medicine, Aalborg University, Sdr. Skovvej 15, Aalborg, Denmark
- Clinical Cancer Research Center, Aalborg University Hospital, Sdr. Skovvej 15, Aalborg, Denmark
- Department of Haematology, Aalborg University Hospital, Hobrovej 18–22, Aalborg, Denmark
| | - Mogens Vyberg
- Department of Clinical Medicine, Aalborg University, Sdr. Skovvej 15, Aalborg, Denmark
- Clinical Cancer Research Center, Aalborg University Hospital, Sdr. Skovvej 15, Aalborg, Denmark
- Department of Pathology, Aalborg University Hospital, Ladegaardsgade 3, Denmark
| | - Hans Erik Johnsen
- Department of Clinical Medicine, Aalborg University, Sdr. Skovvej 15, Aalborg, Denmark
- Clinical Cancer Research Center, Aalborg University Hospital, Sdr. Skovvej 15, Aalborg, Denmark
- Department of Haematology, Aalborg University Hospital, Hobrovej 18–22, Aalborg, Denmark
| |
Collapse
|
20
|
Arvind P, Jayashree S, Jambunathan S, Nair J, Kakkar VV. Understanding gene expression in coronary artery disease through global profiling, network analysis and independent validation of key candidate genes. J Genet 2016; 94:601-10. [PMID: 26690514 DOI: 10.1007/s12041-015-0548-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Molecular mechanism underlying the patho-physiology of coronary artery disease (CAD) is complex. We used global expression profiling combined with analysis of biological network to dissect out potential genes and pathways associated with CAD in a representative case-control Asian Indian cohort. We initially performed blood transcriptomics profiling in 20 subjects, including 10 CAD patients and 10 healthy controls on the Agilent microarray platform. Data was analysed with Gene Spring Gx12.5, followed by network analysis using David v 6.7 and Reactome databases. The most significant differentially expressed genes from microarray were independently validated by real time PCR in 97 cases and 97 controls. A total of 190 gene transcripts showed significant differential expression (fold change>2,P<0.05) between the cases and the controls of which 142 genes were upregulated and 48 genes were downregulated. Genes associated with inflammation, immune response, cell regulation, proliferation and apoptotic pathways were enriched, while inflammatory and immune response genes were displayed as hubs in the network, having greater number of interactions with the neighbouring genes. Expression of EGR1/2/3, IL8, CXCL1, PTGS2, CD69, IFNG, FASLG, CCL4, CDC42, DDX58, NFKBID and NR4A2 genes were independently validated; EGR1/2/3 and IL8 showed >8-fold higher expression in cases relative to the controls implying their important role in CAD. In conclusion, global gene expression profiling combined with network analysis can help in identifying key genes and pathways for CAD.
Collapse
Affiliation(s)
- Prathima Arvind
- Mary and Garry Weston Functional Genomics Unit, Thrombosis Research Institute, Bengaluru 560 099, India.
| | | | | | | | | |
Collapse
|
21
|
Saccenti E, Timmerman ME. Approaches to Sample Size Determination for Multivariate Data: Applications to PCA and PLS-DA of Omics Data. J Proteome Res 2016; 15:2379-93. [PMID: 27322847 DOI: 10.1021/acs.jproteome.5b01029] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Sample size determination is a fundamental step in the design of experiments. Methods for sample size determination are abundant for univariate analysis methods, but scarce in the multivariate case. Omics data are multivariate in nature and are commonly investigated using multivariate statistical methods, such as principal component analysis (PCA) and partial least-squares discriminant analysis (PLS-DA). No simple approaches to sample size determination exist for PCA and PLS-DA. In this paper we will introduce important concepts and offer strategies for (minimally) required sample size estimation when planning experiments to be analyzed using PCA and/or PLS-DA.
Collapse
Affiliation(s)
- Edoardo Saccenti
- Laboratory of Systems and Synthetic Biology, Wageningen University and Research Center , Dreijenplein 10, 6703 HB, Wageningen, The Netherlands
| | - Marieke E Timmerman
- Department Psychometrics & Statistics, University of Groningen , Grote Kruissstraat 2/1, 9712 TS, Groningen, The Netherlands
| |
Collapse
|
22
|
Gadbury GL, Page GP, Edwards J, Kayo T, Prolla TA, Weindruch R, Permana PA, Mountz JD, Allison DB. Power and sample size estimation in high dimensional biology. Stat Methods Med Res 2016. [DOI: 10.1191/0962280204sm369ra] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Genomic scientists often test thousands of hypotheses in a single experiment. One example is a microarray experiment that seeks to determine differential gene expression among experimental groups. Planning such experiments involves a determination of sample size that will allow meaningful interpretations. Traditional power analysis methods may not be well suited to this task when thousands of hypotheses are tested in a discovery oriented basic research. We introduce the concept of expected discovery rate (EDR) and an approach that combines parametric mixture modelling with parametric bootstrapping to estimate the sample size needed for a desired accuracy of results. While the examples included are derived from microarray studies, the methods, herein, are ‘extraparadigmatic’ in the approach to study design and are applicable to most high dimensional biological situations. Pilot data from three different microarray experiments are used to extrapolate EDR as well as the related false discovery rate at different sample sizes and thresholds.
Collapse
Affiliation(s)
- Gary L Gadbury
- Department of Mathematics and Statistics, University of Missouri - Rolla, MO, USA
| | - Grier P Page
- USDA ARS, Department of Agronomy, Iowa State University, Ames, IA, USA
| | - Jode Edwards
- USDA ARS, Department of Agronomy, Iowa State University, Ames, IA, USA
| | - Tsuyoshi Kayo
- Wisconsin Regional Primate Research Center, Madison, WI, USA
| | - Tomas A Prolla
- Department of Genetics and Medical Genetics, University of Wisconsin, Madison, WI, USA
| | - Richard Weindruch
- Department of Medicine, University of Wisconsin and The Geriatric Research, Education, and Clinical Center, William S Middleton VA Hospital, Madison, WI, USA
| | - Paska A Permana
- Phoenix Epidemiology and Clinical Research Branch, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Phoenix, AZ, USA
| | - John D Mountz
- The Birmingham Veterans Administration Medical Center, University of Alabama at Birmingham, Birmingham, AL, USA
| | - David B Allison
- Department of Biostatistics, Section on Statistical Genetics, and Clinical Nutrition Research Center, University of Alabama at Birmingham, Birmingham, AL, USA,
| |
Collapse
|
23
|
Effective gene expression data generation framework based on multi-model approach. Artif Intell Med 2016; 70:41-61. [PMID: 27431036 DOI: 10.1016/j.artmed.2016.05.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2015] [Accepted: 05/27/2016] [Indexed: 11/20/2022]
Abstract
OBJECTIVE Overcome the lack of enough samples in gene expression data sets having thousands of genes but a small number of samples challenging the computational methods using them. METHODS AND MATERIAL This paper introduces a multi-model artificial gene expression data generation framework where different gene regulatory network (GRN) models contribute to the final set of samples based on the characteristics of their underlying paradigms. In the first stage, we build different GRN models, and sample data from each of them separately. Then, we pool the generated samples into a rich set of gene expression samples, and finally try to select the best of the generated samples based on a multi-objective selection method measuring the quality of the generated samples from three different aspects such as compatibility, diversity and coverage. We use four alternative GRN models, namely, ordinary differential equations, probabilistic Boolean networks, multi-objective genetic algorithm and hierarchical Markov model. RESULTS We conducted a comprehensive set of experiments based on both real-life biological and synthetic gene expression data sets. We show that our multi-objective sample selection mechanism effectively combines samples from different models having up to 95% compatibility, 10% diversity and 50% coverage. We show that the samples generated by our framework has up to 1.5x higher compatibility, 2x higher diversity and 2x higher coverage than the samples generated by the individual models that the multi-model framework uses. Moreover, the results show that the GRNs inferred from the samples generated by our framework can have 2.4x higher precision, 12x higher recall, and 5.4x higher f-measure values than the GRNs inferred from the original gene expression samples. CONCLUSIONS Therefore, we show that, we can significantly improve the quality of generated gene expression samples by integrating different computational models into one unified framework without dealing with complex internal details of each individual model. Moreover, the rich set of artificial gene expression samples is able to capture some biological relations that can even not be captured by the original gene expression data set.
Collapse
|
24
|
Intuitive web-based experimental design for high-throughput biomedical data. BIOMED RESEARCH INTERNATIONAL 2015; 2015:958302. [PMID: 25954760 PMCID: PMC4411450 DOI: 10.1155/2015/958302] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/26/2014] [Accepted: 03/09/2015] [Indexed: 11/25/2022]
Abstract
Big data bioinformatics aims at drawing biological conclusions from huge and complex biological datasets. Added value from the analysis of big data, however, is only possible if the data is accompanied by accurate metadata annotation. Particularly in high-throughput experiments intelligent approaches are needed to keep track of the experimental design, including the conditions that are studied as well as information that might be interesting for failure analysis or further experiments in the future. In addition to the management of this information, means for an integrated design and interfaces for structured data annotation are urgently needed by researchers. Here, we propose a factor-based experimental design approach that enables scientists to easily create large-scale experiments with the help of a web-based system. We present a novel implementation of a web-based interface allowing the collection of arbitrary metadata. To exchange and edit information we provide a spreadsheet-based, humanly readable format. Subsequently, sample sheets with identifiers and metainformation for data generation facilities can be created. Data files created after measurement of the samples can be uploaded to a datastore, where they are automatically linked to the previously created experimental design model.
Collapse
|
25
|
Guo Y, Zhao S, Li CI, Sheng Q, Shyr Y. RNAseqPS: A Web Tool for Estimating Sample Size and Power for RNAseq Experiment. Cancer Inform 2014; 13:1-5. [PMID: 25374457 PMCID: PMC4213196 DOI: 10.4137/cin.s17688] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2014] [Revised: 07/18/2014] [Accepted: 07/22/2014] [Indexed: 12/05/2022] Open
Abstract
Sample size and power determination is the first step in the experimental design of a successful study. Sample size and power calculation is required for applications for National Institutes of Health (NIH) funding. Sample size and power calculation is well established for traditional biological studies such as mouse model, genome wide association study (GWAS), and microarray studies. Recent developments in high-throughput sequencing technology have allowed RNAseq to replace microarray as the technology of choice for high-throughput gene expression profiling. However, the sample size and power analysis of RNAseq technology is an underdeveloped area. Here, we present RNAseqPS, an advanced online RNAseq power and sample size calculation tool based on the Poisson and negative binomial distributions. RNAseqPS was built using the Shiny package in R. It provides an interactive graphical user interface that allows the users to easily conduct sample size and power analysis for RNAseq experimental design. RNAseqPS can be accessed directly at http://cqs.mc.vanderbilt.edu/shiny/RNAseqPS/.
Collapse
Affiliation(s)
- Yan Guo
- Center for Quantitative Sciences, Vanderbilt University, Nashville, TN, USA
| | - Shilin Zhao
- Center for Quantitative Sciences, Vanderbilt University, Nashville, TN, USA
| | - Chung-I Li
- Department of Statistics, National Cheng Kung University, Tainan, Taiwan
| | - Quanhu Sheng
- Center for Quantitative Sciences, Vanderbilt University, Nashville, TN, USA
| | - Yu Shyr
- Center for Quantitative Sciences, Vanderbilt University, Nashville, TN, USA
| |
Collapse
|
26
|
Bar HY, Booth JG, Wells MT. A Bivariate Model for Simultaneous Testing in Bioinformatics Data. J Am Stat Assoc 2014. [DOI: 10.1080/01621459.2014.884502] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
27
|
Kussmann M, Morine MJ, Hager J, Sonderegger B, Kaput J. Perspective: a systems approach to diabetes research. Front Genet 2013; 4:205. [PMID: 24187547 PMCID: PMC3807566 DOI: 10.3389/fgene.2013.00205] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2013] [Accepted: 09/24/2013] [Indexed: 12/17/2022] Open
Abstract
We review here the status of human type 2 diabetes studies from a genetic, epidemiological, and clinical (intervention) perspective. Most studies limit analyses to one or a few omic technologies providing data of components of physiological processes. Since all chronic diseases are multifactorial and arise from complex interactions between genetic makeup and environment, type 2 diabetes mellitus (T2DM) is a collection of sub-phenotypes resulting in high fasting glucose. The underlying gene–environment interactions that produce these classes of T2DM are imperfectly characterized. Based on assessments of the complexity of T2DM, we propose a systems biology approach to advance the understanding of origin, onset, development, prevention, and treatment of this complex disease. This systems-based strategy is based on new study design principles and the integrated application of omics technologies: we pursue longitudinal studies in which each subject is analyzed at both homeostasis and after (healthy and safe) challenges. Each enrolled subject functions thereby as their own case and control and this design avoids assigning the subjects a priori to case and control groups based on limited phenotyping. Analyses at different time points along this longitudinal investigation are performed with a comprehensive set of omics platforms. These data sets are generated in a biological context, rather than biochemical compound class-driven manner, which we term “systems omics.”
Collapse
Affiliation(s)
- Martin Kussmann
- Nestlé Institute of Health Sciences SA Lausanne, Switzerland ; Faculty of Life Sciences, Ecole Polytechnique Fédérale Lausanne, Switzerland ; Faculty of Science, Aarhus University Aarhus, Denmark
| | | | | | | | | |
Collapse
|
28
|
Abstract
Exact analytic expressions are developed for the average power of the Benjamini and Hochberg false discovery control procedure. The result is based on explicit computation of the joint probability distribution of the total number of rejections and the number of false rejections, and expressed in terms of the cumulative distribution functions of the p-values of the hypotheses. An example of analytic evaluation of the average power is given. The result is confirmed by numerical experiments and applied to a meta-analysis of three clinical studies in mammography.
Collapse
|
29
|
Abstract
Microarray is a technology to screen a large number of genes to discover those differentially expressed between clinical subtypes or different conditions of human diseases. Gene discovery using microarray data requires adjustment for the large-scale multiplicity of candidate genes. The family-wise error rate (FWER) has been widely chosen as a global type I error rate adjusting for the multiplicity. Typically in microarray data, the expression levels of different genes are correlated because of coexpressing genes and the common experimental conditions shared by the genes on each array. To accurately control the FWER, the statistical testing procedure should appropriately reflect the dependency among the genes. Permutation methods have been used for accurate control of the FWER in analyzing microarray data. It is important to calculate the required sample size at the design stage of a new (confirmatory) microarray study. Because of the high dimensionality and complexity of the correlation structure in microarray data, however, there have been no sample size calculation methods accurately reflecting the true correlation structure of real microarray data. We propose sample size and power calculation methods that are useful when pilot data are available to design a confirmatory experiment. If no pilot data are available, we recommend a two-stage sample size recalculation based on our proposed method using the first stage data as pilot data. The calculated sample sizes are shown to accurately maintain the power through simulations. A real data example is taken to illustrate the proposed method.
Collapse
Affiliation(s)
- Sin-Ho Jung
- Department of Biostatistics and Bioinformatics, Duke University, Durham, NC 27710, USA
| | | |
Collapse
|
30
|
Abstract
In high-throughput experiments, the sample size is typically chosen informally. Most formal sample-size calculations depend critically on prior knowledge. We propose a sequential strategy that, by updating knowledge when new data are available, depends less critically on prior assumptions. Experiments are stopped or continued based on the potential benefits in obtaining additional data. The underlying decision-theoretic framework guarantees the design to proceed in a coherent fashion. We propose intuitively appealing, easy-to-implement utility functions. As in most sequential design problems, an exact solution is prohibitive. We propose a simulation-based approximation that uses decision boundaries. We apply the method to RNA-seq, microarray, and reverse-phase protein array studies and show its potential advantages. The approach has been added to the Bioconductor package gaga.
Collapse
Affiliation(s)
- David Rossell
- Biostatistics and Bioinformatics Unit, Institute for Research in Biomedicine of Barcelona, Barcelona 08028, Spain.
| | | |
Collapse
|
31
|
Templin T, Young EF, Smilenov LB. Proton radiation-induced miRNA signatures in mouse blood: characterization and comparison with 56Fe-ion and gamma radiation. Int J Radiat Biol 2012; 88:531-9. [PMID: 22551419 DOI: 10.3109/09553002.2012.690549] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
PURPOSE Previously, we showed that microRNA (miRNA) signatures derived from the peripheral blood of mice are highly specific for both radiation energy (γ-rays or high linear energy transfer [LET] (56)Fe ions) and radiation dose. Here, we investigate to what extent miRNA expression signatures derived from mouse blood can be used as biomarkers for exposure to 600 MeV proton radiation. MATERIALS AND METHODS We exposed mice to 600 MeV protons, using doses of 0.5 or 1.0 Gy, isolated total RNA at 6 h or 24 h after irradiation, and used quantitative real-time polymerase chain reaction (PCR) to determine the changes in miRNA expression. RESULTS A total of 26 miRNA were differentially expressed after proton irradiation, in either one (77%) or multiple conditions (23%). Statistical classifiers based on proton, γ, and (56)Fe-ion miRNA expression signatures predicted radiation type and proton dose with accuracies of 81% and 88%, respectively. Importantly, gene ontology analysis for proton-irradiated cells shows that genes targeted by radiation-induced miRNA are involved in biological processes and molecular functions similar to those controlled by miRNA in γ ray- and (56)Fe-irradiated cells. CONCLUSIONS Mouse blood miRNA signatures induced by proton, γ, or (56)Fe irradiation are radiation type- and dose-specific. These findings underline the complexity of the miRNA-mediated radiation response.
Collapse
Affiliation(s)
- Thomas Templin
- Center for Radiological Research, Columbia University Medical Center, New York, NY, USA
| | | | | |
Collapse
|
32
|
Yao C, Li H, Shen X, He Z, He L, Guo Z. Reproducibility and concordance of differential DNA methylation and gene expression in cancer. PLoS One 2012; 7:e29686. [PMID: 22235325 PMCID: PMC3250460 DOI: 10.1371/journal.pone.0029686] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2011] [Accepted: 12/01/2011] [Indexed: 12/11/2022] Open
Abstract
Background Hundreds of genes with differential DNA methylation of promoters have been identified for various cancers. However, the reproducibility of differential DNA methylation discoveries for cancer and the relationship between DNA methylation and aberrant gene expression have not been systematically analysed. Methodology/Principal Findings Using array data for seven types of cancers, we first evaluated the effects of experimental batches on differential DNA methylation detection. Second, we compared the directions of DNA methylation changes detected from different datasets for the same cancer. Third, we evaluated the concordance between methylation and gene expression changes. Finally, we compared DNA methylation changes in different cancers. For a given cancer, the directions of methylation and expression changes detected from different datasets, excluding potential batch effects, were highly consistent. In different cancers, DNA hypermethylation was highly inversely correlated with the down-regulation of gene expression, whereas hypomethylation was only weakly correlated with the up-regulation of genes. Finally, we found that genes commonly hypomethylated in different cancers primarily performed functions associated with chronic inflammation, such as ‘keratinization’, ‘chemotaxis’ and ‘immune response’. Conclusions Batch effects could greatly affect the discovery of DNA methylation biomarkers. For a particular cancer, both differential DNA methylation and gene expression can be reproducibly detected from different studies with no batch effects. While DNA hypermethylation is significantly linked to gene down-regulation, hypomethylation is only weakly correlated with gene up-regulation and is likely to be linked to chronic inflammation.
Collapse
Affiliation(s)
- Chen Yao
- Bioinformatics Centre and Key Laboratory for NeuroInfomation of the Education Ministry of China, School of Life Science, University of Electronic Science and Technology of China, Chengdu, China
| | - Hongdong Li
- Bioinformatics Centre and Key Laboratory for NeuroInfomation of the Education Ministry of China, School of Life Science, University of Electronic Science and Technology of China, Chengdu, China
| | - Xiaopei Shen
- Bioinformatics Centre and Key Laboratory for NeuroInfomation of the Education Ministry of China, School of Life Science, University of Electronic Science and Technology of China, Chengdu, China
| | - Zheng He
- Bioinformatics Centre and Key Laboratory for NeuroInfomation of the Education Ministry of China, School of Life Science, University of Electronic Science and Technology of China, Chengdu, China
| | - Lang He
- Bioinformatics Centre and Key Laboratory for NeuroInfomation of the Education Ministry of China, School of Life Science, University of Electronic Science and Technology of China, Chengdu, China
| | - Zheng Guo
- Bioinformatics Centre and Key Laboratory for NeuroInfomation of the Education Ministry of China, School of Life Science, University of Electronic Science and Technology of China, Chengdu, China
- Colleges of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- * E-mail:
| |
Collapse
|
33
|
Sample Size Calculation for Controlling False Discovery Proportion. JOURNAL OF PROBABILITY AND STATISTICS 2012. [DOI: 10.1155/2012/817948] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The false discovery proportion (FDP), the proportion of incorrect rejections among all rejections, is a direct measure of abundance of false positive findings in multiple testing. Many methods have been proposed to control FDP, but they are too conservative to be useful for power analysis. Study designs for controlling the mean of FDP, which is false discovery rate, have been commonly used. However, there has been little attempt to design study with direct FDP control to achieve certain level of efficiency. We provide a sample size calculation method using the variance formula of the FDP under weak-dependence assumptions to achieve the desired overall power. The relationship between design parameters and sample size is explored. The adequacy of the procedure is assessed by simulation. We illustrate the method using estimated correlations from a prostate cancer dataset.
Collapse
|
34
|
Owzar K, Barry WT, Jung SH. Statistical considerations for analysis of microarray experiments. Clin Transl Sci 2011; 4:466-77. [PMID: 22212230 DOI: 10.1111/j.1752-8062.2011.00309.x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Microarray technologies enable the simultaneous interrogation of expressions from thousands of genes from a biospecimen sample taken from a patient. This large set of expressions generates a genetic profile of the patient that may be used to identify potential prognostic or predictive genes or genetic models for clinical outcomes. The aim of this article is to provide a broad overview of some of the major statistical considerations for the design and analysis of microarrays experiments conducted as correlative science studies to clinical trials. An emphasis will be placed on how the lack of understanding and improper use of statistical concepts and methods will lead to noise discovery and misinterpretation of experimental results.
Collapse
Affiliation(s)
- Kouros Owzar
- Department of Biostatistics and Bioinformatics, Duke University CALGB Statistical Center, Duke University, Durham, North Carolina, USA
| | | | | |
Collapse
|
35
|
Abou-El-Ardat K, Monsieurs P, Anastasov N, Atkinson M, Derradji H, De Meyer T, Bekaert S, Van Criekinge W, Baatout S. Low dose irradiation of thyroid cells reveals a unique transcriptomic and epigenetic signature in RET/PTC-positive cells. Mutat Res 2011; 731:27-40. [PMID: 22027090 DOI: 10.1016/j.mrfmmm.2011.10.006] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2011] [Revised: 09/20/2011] [Accepted: 10/13/2011] [Indexed: 11/28/2022]
Abstract
The high doses of radiation received in the wake of the Chernobyl incident and the atomic bombing of Hiroshima and Nagasaki have been linked to the increased appearance of thyroid cancer in the children living in the vicinity of the site. However, the data gathered on the effect of low doses of radiation on the thyroid remain limited. We have examined the genome wide transcriptional response of a culture of TPC-1 human cell line of papillary thyroid carcinoma origin with a RET/PTC1 translocation to various doses (0.0625, 0.5, and 4Gy) of X-rays and compared it to response of thyroids with a RET/PTC3 translocation and against wild-type mouse thyroids irradiated with the same doses using Affymetrix microarrays. We have found considerable overlap at a high dose of 4Gy in both RET/PTC-positive systems but no common genes at 62.5mGy. In addition, the response of RET/PTC-positive system at all doses was distinct from the response of wild-type thyroids with both systems signaling down different pathways. Analysis of the response of microRNAs in TPC-1 cells revealed a radiation-responsive signature of microRNAs in addition to dose-responsive microRNAs. Our results point to the fact that a low dose of X-rays seems to have a significant proliferative effect on normal thyroids. This observation should be studied further as opposed to its effect on RET/PTC-positive thyroids which was subtle, anti-proliferative and system-dependent.
Collapse
|
36
|
Reproducible cancer biomarker discovery in SELDI-TOF MS using different pre-processing algorithms. PLoS One 2011; 6:e26294. [PMID: 22022591 PMCID: PMC3194809 DOI: 10.1371/journal.pone.0026294] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2011] [Accepted: 09/24/2011] [Indexed: 12/14/2022] Open
Abstract
Background There has been much interest in differentiating diseased and normal samples using biomarkers derived from mass spectrometry (MS) studies. However, biomarker identification for specific diseases has been hindered by irreproducibility. Specifically, a peak profile extracted from a dataset for biomarker identification depends on a data pre-processing algorithm. Until now, no widely accepted agreement has been reached. Results In this paper, we investigated the consistency of biomarker identification using differentially expressed (DE) peaks from peak profiles produced by three widely used average spectrum-dependent pre-processing algorithms based on SELDI-TOF MS data for prostate and breast cancers. Our results revealed two important factors that affect the consistency of DE peak identification using different algorithms. One factor is that some DE peaks selected from one peak profile were not detected as peaks in other profiles, and the second factor is that the statistical power of identifying DE peaks in large peak profiles with many peaks may be low due to the large scale of the tests and small number of samples. Furthermore, we demonstrated that the DE peak detection power in large profiles could be improved by the stratified false discovery rate (FDR) control approach and that the reproducibility of DE peak detection could thereby be increased. Conclusions Comparing and evaluating pre-processing algorithms in terms of reproducibility can elucidate the relationship among different algorithms and also help in selecting a pre-processing algorithm. The DE peaks selected from small peak profiles with few peaks for a dataset tend to be reproducibly detected in large peak profiles, which suggests that a suitable pre-processing algorithm should be able to produce peaks sufficient for identifying useful and reproducible biomarkers.
Collapse
|
37
|
Abstract
In recent years, the capability of synthetic biology to design large genetic circuits has dramatically increased due to rapid advances in DNA synthesis technology and development of tools for large-scale assembly of DNA fragments. Large genetic circuits require more components (parts), especially regulators such as transcription factors, sigma factors, and viral RNA polymerases to provide increased regulatory capability, and also devices such as sensors, receivers, and signaling molecules. All these parts may have a potential impact upon the host that needs to be considered when designing and fabricating circuits. DNA microarrays are a well-established technique for global monitoring of gene expression and therefore are an ideal tool for systematically assessing the impact of expressing parts of genetic circuits in host cells. Knowledge of part impact on the host enables the user to design circuits from libraries of parts taking into account their potential impact and also to possibly modify the host to better tolerate stresses induced by the engineered circuit. In this chapter, we present the complete methodology of performing microarrays from choice of array platform, experimental design, preparing samples for array hybridization, and associated data analysis including preprocessing, normalization, clustering, identifying significantly differentially expressed genes, and interpreting the data based on known biology. With these methodologies, we also include lists of bioinformatic resources and tools for performing data analysis. The aim of this chapter is to provide the reader with the information necessary to be able to systematically catalog the impact of genetic parts on the host and also to optimize the operation of fully engineered genetic circuits.
Collapse
Affiliation(s)
- Virgil A Rhodius
- Department of Microbiology and Immunology, University of California at San Francisco, San Francisco, California, USA
| | | |
Collapse
|
38
|
Tan EH, Goh C, Lim WT, Soo KC, Khoo ML, Tan T, Tan DSW, Ang MK, Ng QS, Tan PH, Lim A, Hwang J, Teng YHF, Lim TH, Tan SH, Baskaran N, Hui KM. Gefitinib, cisplatin, and concurrent radiotherapy for locally advanced head and neck cancer: EGFR FISH, protein expression, and mutational status are not predictive biomarkers. Ann Oncol 2011; 23:1010-6. [PMID: 21768327 DOI: 10.1093/annonc/mdr327] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
BACKGROUND Gefitinib was demonstrated to be synergistic with cisplatin and radiotherapy (RT) in in vitro studies. Biomarkers predictive of response to gefitinib in squamous cell head and neck cancer is still lacking. METHODS Thirty-one patients with locally advanced and easily accessible primary tumor sites for biopsies were recruited. Gefitinib was started 3 weeks before the start of cisplatin/concurrent radiotherapy (CTRT) and continued during the CTRT phase and thereafter for 4 months as consolidation phase. Two baselines and a repeat tumor sample were taken after 2 weeks of gefitinib alone to study its impact on tumor gene expression. Epidermal growth factor receptor (EGFR) protein expression, FISH and mutational status, and matrix metallopeptidase 11 (MMP11) protein expression were correlated with response and survival outcome. RESULTS The overall response rate to gefitinib alone was 9.7%. The survival outcome is as follows: median disease free 1.3 years, median survival time 2.4 years, 3-year disease free 42.9%, and 3-year overall survival 48.4%. EGFR FISH, protein expression, and mutational status did not predict for response nor survival outcome of patients. Although MMP11 overexpression did not predict for response, it predicted significantly for a poorer survival outcome. CONCLUSIONS Gefitinib can be combined safely with cisplatin/RT. More studies are needed to uncover predictive biomarkers of benefit to gefitinib.
Collapse
Affiliation(s)
- E-H Tan
- Department of Medical Oncology, National Cancer Centre, Singapore.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Abstract
The next-generation sequencing technologies are being rapidly applied in biological research. Tens of millions of short sequences generated in a single experiment provide us enormous information on genome composition, genetic variants, gene expression levels and protein binding sites depending on the applications. Various methods are being developed for analyzing the data generated by these technologies. However, the relevant experimental design issues have rarely been discussed. In this review, we use RNA-seq as an example to bring this topic into focus and to discuss experimental design and validation issues pertaining to next-generation sequencing in the quantification of transcripts.
Collapse
Affiliation(s)
- Zhide Fang
- Biostatistics Program, School of Public Health, Louisiana State University Health Sciences Center, New Orleans, USA
| | | |
Collapse
|
40
|
Templin T, Amundson SA, Brenner DJ, Smilenov LB. Whole mouse blood microRNA as biomarkers for exposure to γ-rays and (56)Fe ion. Int J Radiat Biol 2011; 87:653-62. [PMID: 21271940 DOI: 10.3109/09553002.2010.549537] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
PURPOSE Biomarkers of ionising radiation exposure are useful in a variety of scenarios, such as medical diagnostic imaging, occupational exposures, and spaceflight. This study investigates to what extent microRNA (miRNA) expression signatures in mouse peripheral blood can be used as biomarkers for exposures to radiation with low and high linear energy transfers. MATERIALS AND METHODS Mice were irradiated with doses of 0.5, 1.5, or 5.0 Gy γ-rays (dose rate of 0.0136 Gy/s) or with doses of 0.1 or 0.5 Gy (56)Fe ions (dose rate of 0.00208 Gy/s). Total RNA was isolated from whole blood at 6 h or 24 h after irradiation. Three animals per irradiation condition were used. Differentially expressed miRNA were determined by means of quantitative real-time polymerase chain reaction. RESULTS miRNA expression signatures were radiation type-specific and dose- and time-dependent. The differentially expressed miRNA were expressed in either one condition (71%) or multiple conditions (29%). Classifiers based on the differentially expressed miRNA predicted radiation type or dose with accuracies between 75% and 100%. Gene-ontology analyses show that miRNA induced by irradiation are involved in the control of several biological processes, such as mRNA transcription regulation, nucleic-acid metabolism, and development. CONCLUSION miRNA signatures induced by ionising radiation in mouse blood are radiation type- and radiation dose-specific. These findings underline the complexity of the radiation response and the importance of miRNA in it.
Collapse
Affiliation(s)
- Thomas Templin
- Center for Radiological Research, Columbia University Medical Center, New York, NY, USA
| | | | | | | |
Collapse
|
41
|
Coppola G. Designing, performing, and interpreting a microarray-based gene expression study. Methods Mol Biol 2011; 793:417-39. [PMID: 21913117 DOI: 10.1007/978-1-61779-328-8_28] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Microarray-based assays have significantly expanded their scope and range of applications over the last 10 years, and--at least for gene expression--can be considered mainstream applications. High-throughput, microarray-based gene expression studies have proven particularly useful in the study of neurodegenerative diseases, for which they have provided key insights in understanding disease pathogenesis, regional and cellular specificity, and identification of therapeutic targets. Even though many experimental steps are currently performed in specialized core facilities, the key steps of a microarray study--experimental design, and data analysis and interpretation--are performed by the primary investigator. Knowledge of the issues related to these key steps is essential to properly perform and interpret a microarray experiment and constitutes the main focus of the present chapter. The basic analytical steps are covered, and annotated R code for the analysis of a published dataset is provided.
Collapse
Affiliation(s)
- Giovanni Coppola
- Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, #1524 Gonda Neuroscience and Genetics Research Center, University of California Los Angeles, Los Angeles, CA, USA.
| |
Collapse
|
42
|
Belzeaux R, Formisano-Tréziny C, Loundou A, Boyer L, Gabert J, Samuelian JC, Féron F, Naudin J, Ibrahim EC. Clinical variations modulate patterns of gene expression and define blood biomarkers in major depression. J Psychiatr Res 2010; 44:1205-13. [PMID: 20471034 DOI: 10.1016/j.jpsychires.2010.04.011] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/01/2009] [Revised: 04/08/2010] [Accepted: 04/09/2010] [Indexed: 12/12/2022]
Abstract
The aim of the study is to compare the expression level of candidate genes between patients suffering from a severe major depressive episode (MDE) and controls, and also among patients during MDE evolution. After a comprehensive review of the biological data related to mood disorders, we initiated a hypothesis-driven exploration of candidate mRNAs. Using RT-qPCR, we analyzed peripheral blood mononuclear cells (PBMCs) mRNA obtained from a homogeneous population of 11 patients who suffered from severe melancholic MDE. To assess the evolution of MDE, we analyzed PBMC mRNAs that were collected on Day 1 and 8 weeks later. Data from these patient samples were analyzed in comparison to age- and sex-matched healthy controls. Among 40 candidate genes consistently transcribed in PBMCs, 10 were differentially expressed in at least one comparison. We found that variations of mRNA levels for NRG1, SORT1 and TPH1 were interesting state-dependent biological markers of the disease. We also observed that variations in other mRNA expression were associated with treatment efficacy or clinical improvement (CREB1, HDAC5, HSPA2, HTR1B, HTR2A, and SLC6A4/5HTT). Significantly, 5HTT exhibited a strong correlation with clinical score evolution. We also found a state-independent marker, IL10. Moreover, the analysis of 2 separate MDEs concerning a same patient revealed comparable results for the expression of CREB1, HSPA2, HTR1B, NRG1 and TPH1. Overall, our results indicate that PBMCs obtained at different time points during MDE progression represent a promising avenue to discover biological markers for depression.
Collapse
Affiliation(s)
- Raoul Belzeaux
- NICN-CNRS UMR 6184, Faculté de Médecine Nord-IFR Jean Roche, 51 Bd Pierre Dramard, 13344 Marseille Cedex 15, France
| | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Chuchana P, Holzmuller P, Vezilier F, Berthier D, Chantal I, Severac D, Lemesre JL, Cuny G, Nirdé P, Bucheton B. Intertwining threshold settings, biological data and database knowledge to optimize the selection of differentially expressed genes from microarray. PLoS One 2010; 5:e13518. [PMID: 20976008 PMCID: PMC2958130 DOI: 10.1371/journal.pone.0013518] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2010] [Accepted: 09/29/2010] [Indexed: 11/29/2022] Open
Abstract
Background Many tools used to analyze microarrays in different conditions have been described. However, the integration of deregulated genes within coherent metabolic pathways is lacking. Currently no objective selection criterion based on biological functions exists to determine a threshold demonstrating that a gene is indeed differentially expressed. Methodology/Principal Findings To improve transcriptomic analysis of microarrays, we propose a new statistical approach that takes into account biological parameters. We present an iterative method to optimise the selection of differentially expressed genes in two experimental conditions. The stringency level of gene selection was associated simultaneously with the p-value of expression variation and the occurrence rate parameter associated with the percentage of donors whose transcriptomic profile is similar. Our method intertwines stringency level settings, biological data and a knowledge database to highlight molecular interactions using networks and pathways. Analysis performed during iterations helped us to select the optimal threshold required for the most pertinent selection of differentially expressed genes. Conclusions/Significance We have applied this approach to the well documented mechanism of human macrophage response to lipopolysaccharide stimulation. We thus verified that our method was able to determine with the highest degree of accuracy the best threshold for selecting genes that are truly differentially expressed.
Collapse
|
44
|
Crosslin DR, Qin X, Hauser ER. Assessment of LD matrix measures for the analysis of biological pathway association. Stat Appl Genet Mol Biol 2010; 9:Article35. [PMID: 20887274 PMCID: PMC2979315 DOI: 10.2202/1544-6115.1561] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Complex diseases will have multiple functional sites, and it will be invaluable to understand the cross-locus interaction in terms of linkage disequilibrium (LD) between those sites (epistasis) in addition to the haplotype-LD effects. We investigated the statistical properties of a class of matrix-based statistics to assess this epistasis. These statistical methods include two LD contrast tests (Zaykin et al., 2006) and partial least squares regression (Wang et al., 2008). To estimate Type 1 error rates and power, we simulated multiple two-variant disease models using the SIMLA software package. SIMLA allows for the joint action of up to two disease genes in the simulated data with all possible multiplicative interaction effects between them. Our goal was to detect an interaction between multiple disease-causing variants by means of their linkage disequilibrium (LD) patterns with other markers. We measured the effects of marginal disease effect size, haplotype LD, disease prevalence and minor allele frequency have on cross-locus interaction (epistasis). In the setting of strong allele effects and strong interaction, the correlation between the two disease genes was weak (r=0.2). In a complex system with multiple correlations (both marginal and interaction), it was difficult to determine the source of a significant result. Despite these complications, the partial least squares and modified LD contrast methods maintained adequate power to detect the epistatic effects; however, for many of the analyses we often could not separate interaction from a strong marginal effect. While we did not exhaust the entire parameter space of possible models, we do provide guidance on the effects that population parameters have on cross-locus interaction.
Collapse
|
45
|
Detecting Differentially Expressed Genes: Minimizing Burden of Testing and Maximizing Number of Discoveries. Ann Epidemiol 2010; 20:766-71. [DOI: 10.1016/j.annepidem.2010.04.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2009] [Revised: 04/02/2010] [Accepted: 04/05/2010] [Indexed: 11/19/2022]
|
46
|
Xu WW, Carter CJ. Parallel multiplicity and error discovery rate (EDR) in microarray experiments. BMC Bioinformatics 2010; 11:465. [PMID: 20846437 PMCID: PMC2955048 DOI: 10.1186/1471-2105-11-465] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2009] [Accepted: 09/16/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In microarray gene expression profiling experiments, differentially expressed genes (DEGs) are detected from among tens of thousands of genes on an array using statistical tests. It is important to control the number of false positives or errors that are present in the resultant DEG list. To date, more than 20 different multiple test methods have been reported that compute overall Type I error rates in microarray experiments. However, these methods share the following dilemma: they have low power in cases where only a small number of DEGs exist among a large number of total genes on the array. RESULTS This study contrasts parallel multiplicity of objectively related tests against the traditional simultaneousness of subjectively related tests and proposes a new assessment called the Error Discovery Rate (EDR) for evaluating multiple test comparisons in microarray experiments. Parallel multiple tests use only the negative genes that parallel the positive genes to control the error rate; while simultaneous multiple tests use the total unchanged gene number for error estimates. Here, we demonstrate that the EDR method exhibits improved performance over other methods in specificity and sensitivity in testing expression data sets with sequence digital expression confirmation, in examining simulation data, as well as for three experimental data sets that vary in the proportion of DEGs. The EDR method overcomes a common problem of previous multiple test procedures, namely that the Type I error rate detection power is low when the total gene number used is large but the DEG number is small. CONCLUSIONS Microarrays are extensively used to address many research questions. However, there is potential to improve the sensitivity and specificity of microarray data analysis by developing improved multiple test comparisons. This study proposes a new view of multiplicity in microarray experiments and the EDR provides an alternative multiple test method for Type I error control in microarray experiments.
Collapse
Affiliation(s)
- Wayne Wenzhong Xu
- Minnesota Supercomputing Institute for Advanced Computational Research, University of Minnesota, Minneapolis, MN 55455, USA.
| | | |
Collapse
|
47
|
Guo Y, Graber A, McBurney RN, Balasubramanian R. Sample size and statistical power considerations in high-dimensionality data settings: a comparative study of classification algorithms. BMC Bioinformatics 2010; 11:447. [PMID: 20815881 PMCID: PMC2942858 DOI: 10.1186/1471-2105-11-447] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2010] [Accepted: 09/03/2010] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND data generated using 'omics' technologies are characterized by high dimensionality, where the number of features measured per subject vastly exceeds the number of subjects in the study. In this paper, we consider issues relevant in the design of biomedical studies in which the goal is the discovery of a subset of features and an associated algorithm that can predict a binary outcome, such as disease status. We compare the performance of four commonly used classifiers (K-Nearest Neighbors, Prediction Analysis for Microarrays, Random Forests and Support Vector Machines) in high-dimensionality data settings. We evaluate the effects of varying levels of signal-to-noise ratio in the dataset, imbalance in class distribution and choice of metric for quantifying performance of the classifier. To guide study design, we present a summary of the key characteristics of 'omics' data profiled in several human or animal model experiments utilizing high-content mass spectrometry and multiplexed immunoassay based techniques. RESULTS the analysis of data from seven 'omics' studies revealed that the average magnitude of effect size observed in human studies was markedly lower when compared to that in animal studies. The data measured in human studies were characterized by higher biological variation and the presence of outliers. The results from simulation studies indicated that the classifier Prediction Analysis for Microarrays (PAM) had the highest power when the class conditional feature distributions were Gaussian and outcome distributions were balanced. Random Forests was optimal when feature distributions were skewed and when class distributions were unbalanced. We provide a free open-source R statistical software library (MVpower) that implements the simulation strategy proposed in this paper. CONCLUSION no single classifier had optimal performance under all settings. Simulation studies provide useful guidance for the design of biomedical studies involving high-dimensionality data.
Collapse
Affiliation(s)
- Yu Guo
- BG Medicine, Inc., 610 Lincoln St., Waltham, MA 02451, USA
| | - Armin Graber
- Institute for Bioinformatics and Translational Research, UMIT, Eduard Wallnoefer Zentrum 1, 6060 Hall in Tyrol, Austria
| | - Robert N McBurney
- Optimal Medicine Ltd., Warwick Enterprise Park, Wellesbourne, Warwick CV35 9EF, UK
| | - Raji Balasubramanian
- Division of Biostatistics and Epidemiology, University of Massachusetts - Amherst, 715 North Pleasant Street, Amherst, MA 01003, USA
| |
Collapse
|
48
|
Rosa GJM, Steibel JP, Tempelman RJ. Reassessing design and analysis of two-colour microarray experiments using mixed effects models. Comp Funct Genomics 2010; 6:123-31. [PMID: 18629220 PMCID: PMC2447516 DOI: 10.1002/cfg.464] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2005] [Accepted: 02/03/2005] [Indexed: 11/08/2022] Open
Abstract
Gene expression microarray studies have led to interesting experimental design and statistical analysis challenges. The comparison of expression profiles across populations is one of the most common objectives of microarray experiments. In this manuscript we review some issues regarding design and statistical analysis for two-colour microarray platforms using mixed linear models, with special attention directed towards the different hierarchical levels of replication and the consequent effect on the use of appropriate error terms for comparing experimental groups. We examine the traditional analysis of variance (ANOVA) models proposed for microarray data and their extensions to hierarchically replicated experiments. In addition, we discuss a mixed model methodology for power and efficiency calculations of different microarray experimental designs.
Collapse
Affiliation(s)
- Guilherme J M Rosa
- Department of Animal Science, Michigan State University, East Lansing, MI 48824-1225, USA.
| | | | | |
Collapse
|
49
|
Scheinin I, Ferreira JA, Knuutila S, Meijer GA, van de Wiel MA, Ylstra B. CGHpower: exploring sample size calculations for chromosomal copy number experiments. BMC Bioinformatics 2010; 11:331. [PMID: 20565750 PMCID: PMC2911457 DOI: 10.1186/1471-2105-11-331] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2009] [Accepted: 06/17/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Determining a suitable sample size is an important step in the planning of microarray experiments. Increasing the number of arrays gives more statistical power, but adds to the total cost of the experiment. Several approaches for sample size determination have been developed for expression array studies, but so far none has been proposed for array comparative genomic hybridization (aCGH). RESULTS Here we explore power calculations for aCGH experiments comparing two groups. In a pilot experiment CGHpower estimates the biological diversity between groups and provides a statistical framework for estimating average power as a function of sample size. As the method requires pilot data, it can be used either in the planning stage of larger studies or in estimating the power achieved in past experiments. CONCLUSIONS The proposed method relies on certain assumptions. According to our evaluation with public and simulated data sets, they do not always hold true. Violation of the assumptions typically leads to unreliable sample size estimates. Despite its limitations, this method is, at least to our knowledge, the only one currently available for performing sample size calculations in the context of aCGH. Moreover, the implementation of the method provides diagnostic plots that allow critical assessment of the assumptions on which it is based and hence on the feasibility and reliability of the sample size calculations in each case.The CGHpower web application and the program outputs from evaluation data sets can be freely accessed at http://www.cangem.org/cghpower/
Collapse
Affiliation(s)
- Ilari Scheinin
- Department of Pathology, VU University Medical Center, Amsterdam, The Netherlands
| | | | | | | | | | | |
Collapse
|
50
|
Roy P, Truntzer C, Maucort-Boulch D, Jouve T, Molinari N. Protein mass spectra data analysis for clinical biomarker discovery: a global review. Brief Bioinform 2010; 12:176-86. [PMID: 20534688 DOI: 10.1093/bib/bbq019] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
The identification of new diagnostic or prognostic biomarkers is one of the main aims of clinical cancer research. In recent years there has been a growing interest in using high throughput technologies for the detection of such biomarkers. In particular, mass spectrometry appears as an exciting tool with great potential. However, to extract any benefit from the massive potential of clinical proteomic studies, appropriate methods, improvement and validation are required. To better understand the key statistical points involved with such studies, this review presents the main data analysis steps of protein mass spectra data analysis, from the pre-processing of the data to the identification and validation of biomarkers.
Collapse
Affiliation(s)
- Pascal Roy
- Hospices Civils de Lyon, Service de Biostatistique, Lyon, F-69003, France
| | | | | | | | | |
Collapse
|