1
|
Sengupta A, Naresh G, Mishra A, Parashar D, Narad P. Proteome analysis using machine learning approaches and its applications to diseases. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2021; 127:161-216. [PMID: 34340767 DOI: 10.1016/bs.apcsb.2021.02.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
With the tremendous developments in the fields of biological and medical technologies, huge amounts of data are generated in the form of genomic data, images in medical databases or as data on protein sequences, and so on. Analyzing this data through different tools sheds light on the particulars of the disease and our body's reactions to it, thus, aiding our understanding of the human health. Most useful of these tools is artificial intelligence and deep learning (DL). The artificially created neural networks in DL algorithms help extract viable data from the datasets, and further, to recognize patters in these complex datasets. Therefore, as a part of machine learning, DL helps us face all the various challenges that come forth during protein prediction, protein identification and their quantification. Proteomics is the study of such proteins, their structures, features, properties and so on. As a form of data science, Proteomics has helped us progress excellently in the field of genomics technologies. One of the major techniques used in proteomics studies is mass spectrometry (MS). However, MS is efficient with analysis of large datasets only with the added help of informatics approaches for data analysis and interpretation; these mainly include machine learning and deep learning algorithms. In this chapter, we will discuss in detail the applications of deep learning and various algorithms of machine learning in proteomics.
Collapse
Affiliation(s)
- Abhishek Sengupta
- Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - G Naresh
- Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Astha Mishra
- Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Diksha Parashar
- Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Priyanka Narad
- Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India.
| |
Collapse
|
2
|
Proteomics and Metabolomics Approaches towards a Functional Insight onto AUTISM Spectrum Disorders: Phenotype Stratification and Biomarker Discovery. Int J Mol Sci 2020; 21:ijms21176274. [PMID: 32872562 PMCID: PMC7504551 DOI: 10.3390/ijms21176274] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Revised: 08/25/2020] [Accepted: 08/27/2020] [Indexed: 12/19/2022] Open
Abstract
Autism spectrum disorders (ASDs) are neurodevelopmental disorders characterized by behavioral alterations and currently affect about 1% of children. Significant genetic factors and mechanisms underline the causation of ASD. Indeed, many affected individuals are diagnosed with chromosomal abnormalities, submicroscopic deletions or duplications, single-gene disorders or variants. However, a range of metabolic abnormalities has been highlighted in many patients, by identifying biofluid metabolome and proteome profiles potentially usable as ASD biomarkers. Indeed, next-generation sequencing and other omics platforms, including proteomics and metabolomics, have uncovered early age disease biomarkers which may lead to novel diagnostic tools and treatment targets that may vary from patient to patient depending on the specific genomic and other omics findings. The progressive identification of new proteins and metabolites acting as biomarker candidates, combined with patient genetic and clinical data and environmental factors, including microbiota, would bring us towards advanced clinical decision support systems (CDSSs) assisted by machine learning models for advanced ASD-personalized medicine. Herein, we will discuss novel computational solutions to evaluate new proteome and metabolome ASD biomarker candidates, in terms of their recurrence in the reviewed literature and laboratory medicine feasibility. Moreover, the way to exploit CDSS, performed by artificial intelligence, is presented as an effective tool to integrate omics data to electronic health/medical records (EHR/EMR), hopefully acting as added value in the near future for the clinical management of ASD.
Collapse
|
3
|
Pelin M, Stocco G, Florio C, Sosa S, Tubaro A. In Vitro Cell Sensitivity to Palytoxin Correlates with High Gene Expression of the Na +/K +-ATPase β2 Subunit Isoform. Int J Mol Sci 2020; 21:5833. [PMID: 32823835 PMCID: PMC7461505 DOI: 10.3390/ijms21165833] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Revised: 08/03/2020] [Accepted: 08/13/2020] [Indexed: 01/13/2023] Open
Abstract
The marine polyether palytoxin (PLTX) is one of the most toxic natural compounds, and is involved in human poisonings after oral, inhalation, skin and/or ocular exposure. Epidemiological and molecular evidence suggest different inter-individual sensitivities to its toxic effects, possibly related to genetic-dependent differences in the expression of Na+/K+-ATPase, its molecular target. To identify Na+/K+-ATPase subunits, isoforms correlated with in vitro PLTX cytotoxic potency, sensitivity parameters (EC50: PLTX concentration reducing cell viability by 50%; Emax: maximum effect induced by the highest toxin concentration; 10-7 M) were assessed in 60 healthy donors' monocytes by the MTT (methylthiazolyl tetrazolium) assay. Sensitivity parameters, not correlated with donors' demographic variables (gender, age and blood group), demonstrated a high inter-individual variability (median EC50 = 2.7 × 10-10 M, interquartile range: 0.4-13.2 × 10-10 M; median Emax = 92.0%, interquartile range: 87.5-94.4%). Spearman's analysis showed significant positive correlations between the β2-encoding ATP1B2 gene expression and Emax values (rho = 0.30; p = 0.025) and between Emax and the ATP1B2/ATP1B3 expression ratio (rho = 0.38; p = 0.004), as well as a significant negative correlation between Emax and the ATP1B1/ATP1B2 expression ratio (rho = -0.30; p = 0.026). This toxicogenetic study represents the first approach to define genetic risk factors that may influence the onset of adverse effects in human PLTX poisonings, suggesting that individuals with high gene expression pattern of the Na+/K+-ATPase β2 subunit (alone or as β2/β1 and/or β2/β3 ratio) could be highly sensitive to PLTX toxic effects.
Collapse
Affiliation(s)
| | | | | | | | - Aurelia Tubaro
- Department of Life Sciences, University of Trieste, 34127 Trieste, Italy; (M.P.); (G.S.); (C.F.); (S.S.)
| |
Collapse
|
4
|
Azaspiracids Increase Mitochondrial Dehydrogenases Activity in Hepatocytes: Involvement of Potassium and Chloride Ions. Mar Drugs 2019; 17:md17050276. [PMID: 31072021 PMCID: PMC6562809 DOI: 10.3390/md17050276] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2019] [Revised: 04/29/2019] [Accepted: 05/06/2019] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Azaspiracids (AZAs) are marine toxins that are produced by Azadinium and Amphidoma dinoflagellates that can contaminate edible shellfish inducing a foodborne poisoning in humans, which is characterized by gastrointestinal symptoms. Among these, AZA1, -2, and -3 are regulated in the European Union, being the most important in terms of occurrence and toxicity. In vivo studies in mice showed that, in addition to gastrointestinal effects, AZA1 induces liver alterations that are visible as a swollen organ, with the presence of hepatocellular fat droplets and vacuoles. Hence, an in vitro study was carried out to investigate the effects of AZA1, -2, and -3 on liver cells, using human non-tumor IHH hepatocytes. RESULTS The exposure of IHH cells to AZA1, -2, or -3 (5 × 10-12-1 × 10-7 M) for 24 h did not affect the cell viability and proliferation (Sulforhodamine B assay and 3H-Thymidine incorporation assay), but they induced a significant concentration-dependent increase of mitochondrial dehydrogenases activity (MTT reduction assay). This effect depends on the activity of mitochondrial electron transport chain complex I and II, being counteracted by rotenone and tenoyl trifluoroacetone, respectively. Furthermore, AZAs-increased mitochondrial dehydrogenase activity was almost totally suppressed in the K+-, Cl--, and Na+-free media and sensitive to the specific inhibitors of KATP and hERG potassium channels, Na+/K+, ATPase, and cystic fibrosis transmembrane conductance regulator (CFTR) chloride channels. CONCLUSIONS These results suggest that AZA mitochondrial effects in hepatocytes derive from an imbalance of intracellular levels of K+ and, in particular, Cl- ions, as demonstrated by the selective reduction of toxin effects by CFTR chloride channel inhibition.
Collapse
|
5
|
Kolker E, Özdemir V, Kolker E. How Healthcare Can Refocus on Its Super-Customers (Patients, n =1) and Customers (Doctors and Nurses) by Leveraging Lessons from Amazon, Uber, and Watson. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2017; 20:329-33. [PMID: 27310474 DOI: 10.1089/omi.2016.0077] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Healthcare is transforming with data-intensive omics technologies and Big Data. The "revolution" has already happened in technology, but the bottlenecks have shifted to the social domain: Who can be empowered by Big Data? Who are the users and customers? In this review and innovation field analysis, we introduce the idea of a "super-customer" versus "customer" and relate both to 21st century healthcare. A "super-customer" in healthcare is the patient, sample size of n = 1, while "customers" are the providers of healthcare (e.g., doctors and nurses). The super-customers have been patients, enabled by unprecedented social practices, such as the ability to track one's physical activities, personal genomics, patient advocacy for greater autonomy, and self-governance, to name but a few. In contrast, the originally intended customers-providers, doctors, and nurses-have relatively lagged behind. With patients as super-customers, there are valuable lessons to be learned from industry examples, such as Amazon and Uber. To offer superior quality service, healthcare organizations have to refocus on the needs, pains, and aspirations of their super-customers by enabling the customers. We propose a strategic solution to this end: the PPT-DAM (People-Process-Technology empowered by Data, Analytics, and Metrics) approach. When applied together with the classic Experiment-Execute-Evaluate iterative methodology, we suggest PPT-DAM is an extremely powerful approach to deliver quality health services to super-customers and customers. As an example, we describe the PPT-DAM implementation by the Benchmarking Improvement Program at the Seattle Children's Hospital. Finally, we forecast that cognitive systems in general and IBM Watson in particular, if properly implemented, can bring transformative and sustainable capabilities in healthcare far beyond the current ones.
Collapse
Affiliation(s)
| | - Vural Özdemir
- 2 Faculty of Communications and the Office of the President, International Technology and Innovation Policy, Gaziantep University , Gaziantep, Turkey .,3 Target Technology Transfer Office (TTO) , Gaziantep Technopark, Gaziantep, Turkey .,4 Amrita School of Biotechnology, Amrita Vishwa Vidyapeetham (Amrita University) , Kerala, India .,5 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Eugene Kolker
- 5 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington.,6 CDO Analytics, Seattle Children's Hospital (SCH) , Seattle, Washington.,7 Department of Biomedical Informatics and Medical Education, University of Washington , Seattle, Washington.,8 Department of Pediatrics, School of Medicine, University of Washington , Seattle, Washington.,9 Department of Chemistry and Chemical Biology, College of Science, Northeastern University , Boston, Massachusetts
| |
Collapse
|
6
|
Higdon R, Earl RK, Stanberry L, Hudac CM, Montague E, Stewart E, Janko I, Choiniere J, Broomall W, Kolker N, Bernier RA, Kolker E. The promise of multi-omics and clinical data integration to identify and target personalized healthcare approaches in autism spectrum disorders. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2016; 19:197-208. [PMID: 25831060 DOI: 10.1089/omi.2015.0020] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Complex diseases are caused by a combination of genetic and environmental factors, creating a difficult challenge for diagnosis and defining subtypes. This review article describes how distinct disease subtypes can be identified through integration and analysis of clinical and multi-omics data. A broad shift toward molecular subtyping of disease using genetic and omics data has yielded successful results in cancer and other complex diseases. To determine molecular subtypes, patients are first classified by applying clustering methods to different types of omics data, then these results are integrated with clinical data to characterize distinct disease subtypes. An example of this molecular-data-first approach is in research on Autism Spectrum Disorder (ASD), a spectrum of social communication disorders marked by tremendous etiological and phenotypic heterogeneity. In the case of ASD, omics data such as exome sequences and gene and protein expression data are combined with clinical data such as psychometric testing and imaging to enable subtype identification. Novel ASD subtypes have been proposed, such as CHD8, using this molecular subtyping approach. Broader use of molecular subtyping in complex disease research is impeded by data heterogeneity, diversity of standards, and ineffective analysis tools. The future of molecular subtyping for ASD and other complex diseases calls for an integrated resource to identify disease mechanisms, classify new patients, and inform effective treatment options. This in turn will empower and accelerate precision medicine and personalized healthcare.
Collapse
Affiliation(s)
- Roger Higdon
- 1 Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Perez-Riverol Y, Alpi E, Wang R, Hermjakob H, Vizcaíno JA. Making proteomics data accessible and reusable: current state of proteomics databases and repositories. Proteomics 2015; 15:930-49. [PMID: 25158685 PMCID: PMC4409848 DOI: 10.1002/pmic.201400302] [Citation(s) in RCA: 141] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2014] [Revised: 08/06/2014] [Accepted: 08/22/2014] [Indexed: 01/10/2023]
Abstract
Compared to other data-intensive disciplines such as genomics, public deposition and storage of MS-based proteomics, data are still less developed due to, among other reasons, the inherent complexity of the data and the variety of data types and experimental workflows. In order to address this need, several public repositories for MS proteomics experiments have been developed, each with different purposes in mind. The most established resources are the Global Proteome Machine Database (GPMDB), PeptideAtlas, and the PRIDE database. Additionally, there are other useful (in many cases recently developed) resources such as ProteomicsDB, Mass Spectrometry Interactive Virtual Environment (MassIVE), Chorus, MaxQB, PeptideAtlas SRM Experiment Library (PASSEL), Model Organism Protein Expression Database (MOPED), and the Human Proteinpedia. In addition, the ProteomeXchange consortium has been recently developed to enable better integration of public repositories and the coordinated sharing of proteomics information, maximizing its benefit to the scientific community. Here, we will review each of the major proteomics resources independently and some tools that enable the integration, mining and reuse of the data. We will also discuss some of the major challenges and current pitfalls in the integration and sharing of the data.
Collapse
Affiliation(s)
- Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | | | | | | | | |
Collapse
|
8
|
Kolker E, Janko I, Montague E, Higdon R, Stewart E, Choiniere J, Lai A, Eckert M, Broomall W, Kolker N. Finding Text-Supported Gene-to-Disease Co-appearances with MOPED-Digger. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2015; 19:754-6. [DOI: 10.1089/omi.2015.0151] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Affiliation(s)
- Eugene Kolker
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- CDO Analytics, , Seattle Children's Hospital, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA), Seattle, Washington
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Departments of Biomedical Informatics and Medical Education and Pediatrics, School of Medicine, University of Washington, Seattle, Washington
| | - Imre Janko
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- CDO Analytics, , Seattle Children's Hospital, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA), Seattle, Washington
| | - Elizabeth Montague
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- CDO Analytics, , Seattle Children's Hospital, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA), Seattle, Washington
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
| | - Roger Higdon
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- CDO Analytics, , Seattle Children's Hospital, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA), Seattle, Washington
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
| | - Elizabeth Stewart
- CDO Analytics, , Seattle Children's Hospital, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA), Seattle, Washington
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
| | - John Choiniere
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA), Seattle, Washington
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
| | - Aaron Lai
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- CDO Analytics, , Seattle Children's Hospital, Seattle, Washington
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- University of Pennsylvania, School of Arts and Sciences, Philadelphia, Pennsylvania
| | - Mary Eckert
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- CDO Analytics, , Seattle Children's Hospital, Seattle, Washington
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Northeastern University, College of Science, Boston, Massachusetts
| | - William Broomall
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- CDO Analytics, , Seattle Children's Hospital, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA), Seattle, Washington
| | - Natali Kolker
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- CDO Analytics, , Seattle Children's Hospital, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA), Seattle, Washington
| |
Collapse
|
9
|
Heywood WE, Camuzeaux S, Doykov I, Patel N, Preece RL, Footitt E, Cleary M, Clayton P, Grunewald S, Abulhoul L, Chakrapani A, Sebire NJ, Hindmarsh P, de Koning TJ, Heales S, Burke D, Gissen P, Mills K. Proteomic Discovery and Development of a Multiplexed Targeted MRM-LC-MS/MS Assay for Urine Biomarkers of Extracellular Matrix Disruption in Mucopolysaccharidoses I, II, and VI. Anal Chem 2015; 87:12238-44. [PMID: 26537538 DOI: 10.1021/acs.analchem.5b03232] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
The mucopolysaccharidoses (MPS) are lysosomal storage disorders that result from defects in the catabolism of glycosaminoglycans. Impaired muscle, bone, and connective tissue are typical clinical features of MPS due to disruption of the extracellular matrix. Markers of MPS disease pathology are needed to determine disease severity and monitor effects of existing and emerging new treatments on disease mechanisms. Urine samples from a small cohort of MPS-I, -II, and -VI patients (n = 12) were analyzed using label-free quantative proteomics. Fifty-three proteins including many associated with extracellular matrix organization were differently expressed. A targeted multiplexed peptide MRM LC-MS/MS assay was used on a larger validation cohort of patient samples (MPS-I n = 18, MPS-II n = 12, MPS-VI n = 6, control n = 20). MPS-I and -II groups were further subdivided according to disease severity. None of the markers assessed were altered significantly in the mild disease groups compared to controls. β-galactosidase, a lysosomal protein, was elevated 3.6-5.7-fold significantly (p < 0.05) in all disease groups apart from mild MPS-I and -II. Collagen type Iα, fatty-acid-binding-protein 5, nidogen-1, cartilage oligomeric matrix protein, and insulin-like growth factor binding protein 7 concentrations were elevated in severe MPS I and II groups. Cartilage oligomeric matrix protein, insulin-like growth factor binding protein 7, and β-galactosidase were able to distinguish the severe neurological form of MPS-II from the milder non-neurological form. Protein Heg1 was significantly raised only in MPS-VI. This work describes the discovery of new biomarkers of MPS that represent disease pathology and allows the stratification of MPS-II patients according to disease severity.
Collapse
Affiliation(s)
- Wendy E Heywood
- Centre for Translational Omics, UCL Institute of Child Health , 30 Guilford Street, London, WC1N 1EH United Kingdom
| | - Stephane Camuzeaux
- Centre for Translational Omics, UCL Institute of Child Health , 30 Guilford Street, London, WC1N 1EH United Kingdom
| | - Ivan Doykov
- Centre for Translational Omics, UCL Institute of Child Health , 30 Guilford Street, London, WC1N 1EH United Kingdom
| | - Nina Patel
- Centre for Translational Omics, UCL Institute of Child Health , 30 Guilford Street, London, WC1N 1EH United Kingdom
| | - Rhian-Lauren Preece
- Centre for Translational Omics, UCL Institute of Child Health , 30 Guilford Street, London, WC1N 1EH United Kingdom
| | - Emma Footitt
- Centre for Inborn Errors of Metabolism, Great Ormond Street Hospital , Great Ormond Street, London, WC1N 3JH, United Kingdom
| | - Maureen Cleary
- Centre for Inborn Errors of Metabolism, Great Ormond Street Hospital , Great Ormond Street, London, WC1N 3JH, United Kingdom
| | - Peter Clayton
- Centre for Translational Omics, UCL Institute of Child Health , 30 Guilford Street, London, WC1N 1EH United Kingdom
| | - Stephanie Grunewald
- Centre for Inborn Errors of Metabolism, Great Ormond Street Hospital , Great Ormond Street, London, WC1N 3JH, United Kingdom
| | - Lara Abulhoul
- Centre for Inborn Errors of Metabolism, Great Ormond Street Hospital , Great Ormond Street, London, WC1N 3JH, United Kingdom
| | - Anupam Chakrapani
- Centre for Inborn Errors of Metabolism, Great Ormond Street Hospital , Great Ormond Street, London, WC1N 3JH, United Kingdom
| | - Neil J Sebire
- Centre for Inborn Errors of Metabolism, Great Ormond Street Hospital , Great Ormond Street, London, WC1N 3JH, United Kingdom
| | - Peter Hindmarsh
- Centre for Translational Omics, UCL Institute of Child Health , 30 Guilford Street, London, WC1N 1EH United Kingdom
| | - Tom J de Koning
- University of Groningen , University Medical Center Groningen, Departments of Genetics and Neurology, P.O. Box 30.001, 9700 RB Groningen, Netherlands
| | - Simon Heales
- Centre for Inborn Errors of Metabolism, Great Ormond Street Hospital , Great Ormond Street, London, WC1N 3JH, United Kingdom.,Enzyme and Metabolic Unit, Chemical Pathology, Great Ormond Street Hospital for Children , Great Ormond Street, London, WC1N 3JH, United Kingdom
| | - Derek Burke
- Enzyme and Metabolic Unit, Chemical Pathology, Great Ormond Street Hospital for Children , Great Ormond Street, London, WC1N 3JH, United Kingdom
| | - Paul Gissen
- Centre for Translational Omics, UCL Institute of Child Health , 30 Guilford Street, London, WC1N 1EH United Kingdom.,Centre for Inborn Errors of Metabolism, Great Ormond Street Hospital , Great Ormond Street, London, WC1N 3JH, United Kingdom
| | - Kevin Mills
- Centre for Translational Omics, UCL Institute of Child Health , 30 Guilford Street, London, WC1N 1EH United Kingdom
| |
Collapse
|
10
|
Higdon R, Kolker E. Can "normal" protein expression ranges be estimated with high-throughput proteomics? J Proteome Res 2015; 14:2398-407. [PMID: 25877823 DOI: 10.1021/acs.jproteome.5b00176] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Although biological science discovery often involves comparing conditions to a normal state, in proteomics little is actually known about normal. Two Human Proteome studies featured in Nature offer new insights into protein expression and an opportunity to assess how high-throughput proteomics measures normal protein ranges. We use data from these studies to estimate technical and biological variability in protein expression and compare them to other expression data sets from normal tissue. Results show that measured protein expression across same-tissue replicates vary by ±4- to 10-fold for most proteins. Coefficients of variation (CV) for protein expression measurements range from 62% to 117% across different tissue experiments; however, adjusting for technical variation reduced this variability by as much as 50%. In addition, the CV could also be reduced by limiting comparisons to proteins with at least 3 or more unique peptide identifications as the CV was on average 33% lower than for proteins with 2 or fewer peptide identifications. We also selected 13 housekeeping proteins and genes that were expressed across all tissues with low variability to determine their utility as a reference set for normalization and comparative purposes. These results present the first step toward estimating normal protein ranges by determining the variability in expression measurements through combining publicly available data. They support an approach that combines standard protocols with replicates of normal tissues to estimate normal protein ranges for large numbers of proteins and tissues. This would be a tremendous resource for normal cellular physiology and comparisons of proteomics studies.
Collapse
Affiliation(s)
- Roger Higdon
- †Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington 98101, United States.,‡CDO Analytics, Seattle Children's Hospital, Seattle, Washington 98101, United States
| | - Eugene Kolker
- †Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington 98101, United States.,‡CDO Analytics, Seattle Children's Hospital, Seattle, Washington 98101, United States.,§Departments of Biomedical Informatics and Medical Education and Pediatrics, University of Washington, Seattle, Washington 98195, United States.,∥Department of Chemistry and Chemical Biology, Northeastern University, Boston, Massachusetts 02115, United States
| |
Collapse
|
11
|
Li J, Liu D, Mou Z, Ihedioha OC, Blanchard A, Jia P, Myal Y, Uzonna JE. Deficiency of prolactin-inducible protein leads to impaired Th1 immune response and susceptibility to Leishmania major in mice. Eur J Immunol 2015; 45:1082-91. [PMID: 25594453 DOI: 10.1002/eji.201445078] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2014] [Revised: 11/21/2014] [Accepted: 01/14/2015] [Indexed: 12/13/2022]
Abstract
Although the strategic production of prolactin-inducible protein (PIP) at several ports of pathogen entry into the body suggests it might play a role in host defense, no study has directly implicated it in immunity against any infectious agent. Here, we show for the first time that PIP deficiency is associated with reduced numbers of CD4(+) T cells in peripheral lymphoid tissues and impaired CD4(+) Th1-cell differentiation in vitro. In vivo, CD4(+) T cells from OVA-immunized, PIP-deficient mice showed significantly impaired proliferation and IFN-γ production following in vitro restimulation. Furthermore, PIP-deficient mice were highly susceptible to Leishmani major infection and failed to control lesion progression and parasite proliferation. This susceptibility was associated with impaired NO production and leishmanicidal activity of PIP KO macrophages following IFN-γ and LPS stimulation. Collectively, our findings implicate PIP as an important regulator of CD4(+) Th1-cell-mediated immunity.
Collapse
Affiliation(s)
- Jintao Li
- Department of Immunology, College of Medicine, University of Manitoba, Winnipeg, MB, Canada; Institute of Tropical Medicine, Third Military Medical University, Chongqing, China
| | | | | | | | | | | | | | | |
Collapse
|
12
|
Morgan PG, Higdon R, Kolker N, Bauman AT, Ilkayeva O, Newgard CB, Kolker E, Steele LM, Sedensky MM. Comparison of proteomic and metabolomic profiles of mutants of the mitochondrial respiratory chain in Caenorhabditis elegans. Mitochondrion 2014; 20:95-102. [PMID: 25530493 DOI: 10.1016/j.mito.2014.12.004] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2014] [Revised: 09/10/2014] [Accepted: 12/10/2014] [Indexed: 01/06/2023]
Abstract
Single-gene mutations that disrupt mitochondrial respiratory chain function in Caenorhabditis elegans change patterns of protein expression and metabolites. Our goal was to develop useful molecular fingerprints employing adaptable techniques to recognize mitochondrial defects in the electron transport chain. We analyzed mutations affecting complex I, complex II, or ubiquinone synthesis and discovered overarching patterns in the response of C. elegans to mitochondrial dysfunction across all of the mutations studied. These patterns are in KEGG pathways conserved from C. elegans to mammals, verifying that the nematode can serve as a model for mammalian disease. In addition, specific differences exist between mutants that may be useful in diagnosing specific mitochondrial diseases in patients.
Collapse
Affiliation(s)
- P G Morgan
- Department of Anesthesiology and Pain Medicine, University of Washington, USA; Center for Developmental Therapeutics, Seattle Children's Research Institute, USA.
| | - R Higdon
- Bioinformatics and High-throughput Analysis Laboratory, USA; High-throughput Analysis Core, Seattle Children's Research Institute, USA; Data-Enabled Life Sciences Alliance (DELSA Global), USA
| | - N Kolker
- High-throughput Analysis Core, Seattle Children's Research Institute, USA; Data-Enabled Life Sciences Alliance (DELSA Global), USA
| | - A T Bauman
- Bioinformatics and High-throughput Analysis Laboratory, USA
| | - O Ilkayeva
- Sarah W. Stedman Nutrition and Metabolism Center & Duke Molecular Physiology Institute, Department of Pharmacology and Cancer Biology, Duke University Medical Center, Durham, NC, USA; Sarah W. Stedman Nutrition and Metabolism Center & Duke Molecular Physiology Institute, Department of Medicine, Duke University Medical Center, Durham, NC, USA
| | - C B Newgard
- Sarah W. Stedman Nutrition and Metabolism Center & Duke Molecular Physiology Institute, Department of Pharmacology and Cancer Biology, Duke University Medical Center, Durham, NC, USA; Sarah W. Stedman Nutrition and Metabolism Center & Duke Molecular Physiology Institute, Department of Medicine, Duke University Medical Center, Durham, NC, USA
| | - E Kolker
- Bioinformatics and High-throughput Analysis Laboratory, USA; High-throughput Analysis Core, Seattle Children's Research Institute, USA; Data-Enabled Life Sciences Alliance (DELSA Global), USA; Department of Biomedical Informatics & Medical Education, University of Washington, Seattle, WA, USA; Department of Pediatrics, University of Washington, Seattle, WA, USA; Department of Chemistry and Chemical Biology, College of Science, Northeastern University, Boston, MA 02115, USA
| | - L M Steele
- Center for Developmental Therapeutics, Seattle Children's Research Institute, USA
| | - M M Sedensky
- Department of Anesthesiology and Pain Medicine, University of Washington, USA; Center for Developmental Therapeutics, Seattle Children's Research Institute, USA
| |
Collapse
|
13
|
Willoughby C, Bird CL, Coles SJ, Frey JG. Creating Context for the Experiment Record. User-Defined Metadata: Investigations into Metadata Usage in the LabTrove ELN. J Chem Inf Model 2014; 54:3268-83. [PMID: 25405258 DOI: 10.1021/ci500469f] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Cerys Willoughby
- Chemistry, University of Southampton, Southampton SO17 1BJ, United Kingdom
| | - Colin L. Bird
- Chemistry, University of Southampton, Southampton SO17 1BJ, United Kingdom
| | - Simon J. Coles
- Chemistry, University of Southampton, Southampton SO17 1BJ, United Kingdom
| | - Jeremy G. Frey
- Chemistry, University of Southampton, Southampton SO17 1BJ, United Kingdom
| |
Collapse
|
14
|
Montague E, Janko I, Stanberry L, Lee E, Choiniere J, Anderson N, Stewart E, Broomall W, Higdon R, Kolker N, Kolker E. Beyond protein expression, MOPED goes multi-omics. Nucleic Acids Res 2014; 43:D1145-51. [PMID: 25404128 PMCID: PMC4383969 DOI: 10.1093/nar/gku1175] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
MOPED (Multi-Omics Profiling Expression Database; http://moped.proteinspire.org) has transitioned from solely a protein expression database to a multi-omics resource for human and model organisms. Through a web-based interface, MOPED presents consistently processed data for gene, protein and pathway expression. To improve data quality, consistency and use, MOPED includes metadata detailing experimental design and analysis methods. The multi-omics data are integrated through direct links between genes and proteins and further connected to pathways and experiments. MOPED now contains over 5 million records, information for approximately 75 000 genes and 50 000 proteins from four organisms (human, mouse, worm, yeast). These records correspond to 670 unique combinations of experiment, condition, localization and tissue. MOPED includes the following new features: pathway expression, Pathway Details pages, experimental metadata checklists, experiment summary statistics and more advanced searching tools. Advanced searching enables querying for genes, proteins, experiments, pathways and keywords of interest. The system is enhanced with visualizations for comparing across different data types. In the future MOPED will expand the number of organisms, increase integration with pathways and provide connections to disease.
Collapse
Affiliation(s)
- Elizabeth Montague
- Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, WA, USA 98101 High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, WA, USA 98101 CDO Analytics, Seattle Children's, Seattle, WA, USA 98101 Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, WA, USA 98101
| | - Imre Janko
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, WA, USA 98101 CDO Analytics, Seattle Children's, Seattle, WA, USA 98101 Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, WA, USA 98101
| | - Larissa Stanberry
- Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, WA, USA 98101 High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, WA, USA 98101 CDO Analytics, Seattle Children's, Seattle, WA, USA 98101 Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, WA, USA 98101
| | - Elaine Lee
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, WA, USA 98101 CDO Analytics, Seattle Children's, Seattle, WA, USA 98101 Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, WA, USA 98101
| | - John Choiniere
- Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, WA, USA 98101 High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, WA, USA 98101 Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, WA, USA 98101
| | - Nathaniel Anderson
- Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, WA, USA 98101 High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, WA, USA 98101 Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, WA, USA 98101
| | - Elizabeth Stewart
- Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, WA, USA 98101 Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, WA, USA 98101
| | - William Broomall
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, WA, USA 98101 CDO Analytics, Seattle Children's, Seattle, WA, USA 98101 Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, WA, USA 98101
| | - Roger Higdon
- Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, WA, USA 98101 High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, WA, USA 98101 CDO Analytics, Seattle Children's, Seattle, WA, USA 98101 Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, WA, USA 98101
| | - Natali Kolker
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, WA, USA 98101 CDO Analytics, Seattle Children's, Seattle, WA, USA 98101 Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, WA, USA 98101
| | - Eugene Kolker
- Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, WA, USA 98101 High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, WA, USA 98101 CDO Analytics, Seattle Children's, Seattle, WA, USA 98101 Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, WA, USA 98101 Departments of Biomedical Informatics and Medical Education and Pediatrics, University of Washington, Seattle, WA, USA 98109 Department of Chemistry and Chemical Biology, College of Science, Northeastern University, Boston, MA 02115
| |
Collapse
|
15
|
Titz B, Elamin A, Martin F, Schneider T, Dijon S, Ivanov NV, Hoeng J, Peitsch MC. Proteomics for systems toxicology. Comput Struct Biotechnol J 2014; 11:73-90. [PMID: 25379146 PMCID: PMC4212285 DOI: 10.1016/j.csbj.2014.08.004] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Current toxicology studies frequently lack measurements at molecular resolution to enable a more mechanism-based and predictive toxicological assessment. Recently, a systems toxicology assessment framework has been proposed, which combines conventional toxicological assessment strategies with system-wide measurement methods and computational analysis approaches from the field of systems biology. Proteomic measurements are an integral component of this integrative strategy because protein alterations closely mirror biological effects, such as biological stress responses or global tissue alterations. Here, we provide an overview of the technical foundations and highlight select applications of proteomics for systems toxicology studies. With a focus on mass spectrometry-based proteomics, we summarize the experimental methods for quantitative proteomics and describe the computational approaches used to derive biological/mechanistic insights from these datasets. To illustrate how proteomics has been successfully employed to address mechanistic questions in toxicology, we summarized several case studies. Overall, we provide the technical and conceptual foundation for the integration of proteomic measurements in a more comprehensive systems toxicology assessment framework. We conclude that, owing to the critical importance of protein-level measurements and recent technological advances, proteomics will be an integral part of integrative systems toxicology approaches in the future.
Collapse
|
16
|
Montague E, Stanberry L, Higdon R, Janko I, Lee E, Anderson N, Choiniere J, Stewart E, Yandl G, Broomall W, Kolker N, Kolker E. MOPED 2.5--an integrated multi-omics resource: multi-omics profiling expression database now includes transcriptomics data. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2014; 18:335-43. [PMID: 24910945 PMCID: PMC4048574 DOI: 10.1089/omi.2014.0061] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Multi-omics data-driven scientific discovery crucially rests on high-throughput technologies and data sharing. Currently, data are scattered across single omics repositories, stored in varying raw and processed formats, and are often accompanied by limited or no metadata. The Multi-Omics Profiling Expression Database (MOPED, http://moped.proteinspire.org ) version 2.5 is a freely accessible multi-omics expression database. Continual improvement and expansion of MOPED is driven by feedback from the Life Sciences Community. In order to meet the emergent need for an integrated multi-omics data resource, MOPED 2.5 now includes gene relative expression data in addition to protein absolute and relative expression data from over 250 large-scale experiments. To facilitate accurate integration of experiments and increase reproducibility, MOPED provides extensive metadata through the Data-Enabled Life Sciences Alliance (DELSA Global, http://delsaglobal.org ) metadata checklist. MOPED 2.5 has greatly increased the number of proteomics absolute and relative expression records to over 500,000, in addition to adding more than four million transcriptomics relative expression records. MOPED has an intuitive user interface with tabs for querying different types of omics expression data and new tools for data visualization. Summary information including expression data, pathway mappings, and direct connection between proteins and genes can be viewed on Protein and Gene Details pages. These connections in MOPED provide a context for multi-omics expression data exploration. Researchers are encouraged to submit omics data which will be consistently processed into expression summaries. MOPED as a multi-omics data resource is a pivotal public database, interdisciplinary knowledge resource, and platform for multi-omics understanding.
Collapse
Affiliation(s)
- Elizabeth Montague
- Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington
- High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Larissa Stanberry
- Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington
- High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Roger Higdon
- Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington
- High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Imre Janko
- High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Elaine Lee
- High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Nathaniel Anderson
- Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington
- High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - John Choiniere
- Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington
- High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Elizabeth Stewart
- Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Gregory Yandl
- Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - William Broomall
- High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Natali Kolker
- High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Eugene Kolker
- Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington
- High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Departments of Biomedical Informatics and Medical Education and Pediatrics, University of Washington, Seattle, Washington
| |
Collapse
|
17
|
Kolker E, Stewart E. OMICS studies: How about metadata checklist and data publications? J Proteome Res 2014; 13:1783-4. [PMID: 24494788 DOI: 10.1021/pr4011662] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Data fully utilized by the community resources promote progress rather than repetition. Effective data sharing can accelerate the transition from data to actionable knowledge, yet barriers to data sharing remain, both technological and procedural. The DELSA community has tackled the sharing barrier by creating a multi-omics metadata checklist for the life sciences. The checklist and associated data publication examples are now jointly published in Big Data and OMICS: A Journal of Integrative Biology. The checklist will enable diverse datasets to be easily harmonized and reused for richer analyses. It will facilitate data deposits, stand alone as a data publication, and grant appropriate credit to researchers. We invite the broader life sciences community to test the checklist for feedback and improvements.
Collapse
Affiliation(s)
- Eugene Kolker
- Seattle Children's Research Insitute , 1900 Ninth Avenue, Seattle, Washington 98101, United States
| | | |
Collapse
|
18
|
Kolker E, Özdemir V, Martens L, Hancock W, Anderson G, Anderson N, Aynacioglu S, Baranova A, Campagna SR, Chen R, Choiniere J, Dearth SP, Feng WC, Ferguson L, Fox G, Frishman D, Grossman R, Heath A, Higdon R, Hutz MH, Janko I, Jiang L, Joshi S, Kel A, Kemnitz JW, Kohane IS, Kolker N, Lancet D, Lee E, Li W, Lisitsa A, Llerena A, MacNealy-Koch C, Marshall JC, Masuzzo P, May A, Mias G, Monroe M, Montague E, Mooney S, Nesvizhskii A, Noronha S, Omenn G, Rajasimha H, Ramamoorthy P, Sheehan J, Smarr L, Smith CV, Smith T, Snyder M, Rapole S, Srivastava S, Stanberry L, Stewart E, Toppo S, Uetz P, Verheggen K, Voy BH, Warnich L, Wilhelm SW, Yandl G. Toward more transparent and reproducible omics studies through a common metadata checklist and data publications. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2014; 18:10-4. [PMID: 24456465 PMCID: PMC3903324 DOI: 10.1089/omi.2013.0149] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Biological processes are fundamentally driven by complex interactions between biomolecules. Integrated high-throughput omics studies enable multifaceted views of cells, organisms, or their communities. With the advent of new post-genomics technologies, omics studies are becoming increasingly prevalent; yet the full impact of these studies can only be realized through data harmonization, sharing, meta-analysis, and integrated research. These essential steps require consistent generation, capture, and distribution of metadata. To ensure transparency, facilitate data harmonization, and maximize reproducibility and usability of life sciences studies, we propose a simple common omics metadata checklist. The proposed checklist is built on the rich ontologies and standards already in use by the life sciences community. The checklist will serve as a common denominator to guide experimental design, capture important parameters, and be used as a standard format for stand-alone data publications. The omics metadata checklist and data publications will create efficient linkages between omics data and knowledge-based life sciences innovation and, importantly, allow for appropriate attribution to data generators and infrastructure science builders in the post-genomics era. We ask that the life sciences community test the proposed omics metadata checklist and data publications and provide feedback for their use and improvement.
Collapse
Affiliation(s)
- Eugene Kolker
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Vural Özdemir
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Office of the President, Gaziantep University, International Affairs and Global Development Strategy
- Faculty of Communications, Universite Bulvarı, Kilis Yolu, Turkey
| | - Lennart Martens
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Medical Protein Research, Vlaams Instituut voor Biotechnologie, Ghent, Belgium
- Department of Biochemistry, Ghent University; Ghent, Belgium
| | - William Hancock
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Chemistry, Barnett Institute, Northeastern University, Boston, Massachusetts
| | - Gordon Anderson
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Fundamental and Computational Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington
| | - Nathaniel Anderson
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Sukru Aynacioglu
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Pharmacology, Gaziantep University, Gaziantep, Turkey
| | - Ancha Baranova
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- School of Systems Biology, George Mason University, Manassas, Virginia
| | - Shawn R. Campagna
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Chemistry, University of Tennessee Knoxville, Knoxville, Tennessee
| | - Rui Chen
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Genetics, Stanford University, Stanford, California
| | - John Choiniere
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Stephen P. Dearth
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Chemistry, University of Tennessee Knoxville, Knoxville, Tennessee
| | - Wu-Chun Feng
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia
- Department of Electrical and Computer Engineering, Virginia Tech, Blacksburg, Virginia
- Department of SyNeRGy Laboratory, Virginia Tech, Blacksburg, Virginia
| | - Lynnette Ferguson
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Nutrition, Auckland Cancer Society Research Centre, University of Auckland, Auckland, New Zealand
| | - Geoffrey Fox
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- School of Informatics and Computing, Indiana University, Bloomington, Indiana
| | - Dmitrij Frishman
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Technische Universitat Munchen, Wissenshaftzentrum Weihenstephan, Freising, Germany
| | - Robert Grossman
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Institute for Genomics and Systems Biology, University of Chicago, Chicago, Illinois
- Department of Medicine, University of Chicago, Chicago, Illinois
| | - Allison Heath
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Institute for Genomics and Systems Biology, University of Chicago, Chicago, Illinois
- Knapp Center for Biomedical Discovery, University of Chicago, Chicago, Illinois
| | - Roger Higdon
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Mara H. Hutz
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Departamento de Genetica, Instituto de Biociencias, Federal University of Rio Grande do Sul, Porto Alegre, Brazil
| | - Imre Janko
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
| | - Lihua Jiang
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Genetics, Stanford University, Stanford, California
| | - Sanjay Joshi
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Life Sciences, EMC, Hopkinton, Massachusetts
| | - Alexander Kel
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- GeneXplain GmbH, Wolfenbüttel, Germany
| | - Joseph W. Kemnitz
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Cell and Regenerative Biology, University of Wisconsin-Madison, Madison, Wisconsin
- Wisconsin National Primate Research Center, University of Wisconsin-Madison, Madison, Wisconsin
| | - Isaac S. Kohane
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Pediatrics and Health Sciences Technology, Children's Hospital and Harvard Medical School, Boston, Massachusetts
- HMS Center for Biomedical Informatics, Countway Library of Medicine, Boston, Massachusetts
| | - Natali Kolker
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
| | - Doron Lancet
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Molecular Genetics, Crown Human Genome Center, Weizmann Institute of Science, Rehovot, Israel
| | - Elaine Lee
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- High-Throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington
| | - Weizhong Li
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Center for Research in Biological Systems, University of California, San Diego, La Jolla, California
| | - Andrey Lisitsa
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Russian Human Proteome Organization (RHUPO), Moscow, Russia
- Institute of Biomedical Chemistry, Moscow, Russia
| | - Adrian Llerena
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Clinical Research Center, Extremadura University Hospital and Medical School, Badajoz, Spain
| | - Courtney MacNealy-Koch
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Jean-Claude Marshall
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Center for Translational Research, Catholic Health Initiatives, Towson, Maryland
| | - Paola Masuzzo
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Medical Protein Research, Vlaams Instituut voor Biotechnologie, Ghent, Belgium
- Department of Biochemistry, Ghent University; Ghent, Belgium
| | - Amanda May
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Chemistry, University of Tennessee Knoxville, Knoxville, Tennessee
| | - George Mias
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Genetics, Stanford University, Stanford, California
| | - Matthew Monroe
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington
| | - Elizabeth Montague
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Sean Mooney
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- The Buck Institute for Research on Aging, Novato, California
| | - Alexey Nesvizhskii
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Pathology, University of Michigan, Ann Arbor, Michigan
- Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan
| | - Santosh Noronha
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Chemical Engineering, Indian Institute of Technology Bombay, Powai, Mumbai, India
| | - Gilbert Omenn
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Center for Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor Michigan
- Department of Molecular Medicine & Genetics and Human Genetics, University of Michigan, Ann Arbor Michigan
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor Michigan
- School of Public Health, University of Michigan, Ann Arbor Michigan
| | - Harsha Rajasimha
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Jeeva Informatics Solutions LLC, Derwood, Maryland
| | - Preveen Ramamoorthy
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Molecular Diagnostics Department, National Jewish Health, Denver, Colorado
| | - Jerry Sheehan
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- California Institute for Telecommunications and Information Technology, University of California-San Diego, La Jolla, California
| | - Larry Smarr
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- California Institute for Telecommunications and Information Technology, University of California-San Diego, La Jolla, California
| | - Charles V. Smith
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington
| | - Todd Smith
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Digital World Biology, Seattle, Washington
| | - Michael Snyder
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Genetics, Stanford University, Stanford, California
- Stanford Center for Genomics and Personalized Medicine, Stanford University, Stanford, California
| | - Srikanth Rapole
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Proteomics Laboratory, National Centre for Cell Science, University of Pune, Pune, India
| | - Sanjeeva Srivastava
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Proteomics Laboratory, Indian Institute of Technology Bombay, Mumbai, India
| | - Larissa Stanberry
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Predictive Analytics, Seattle Children's, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Elizabeth Stewart
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| | - Stefano Toppo
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Molecular Medicine, University of Padova, Padova, Italy
| | - Peter Uetz
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Center for the Study of Biological Complexity (CSBC), Virginia Commonwealth University, Richmond, Virginia
| | - Kenneth Verheggen
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Medical Protein Research, Vlaams Instituut voor Biotechnologie, Ghent, Belgium
- Department of Biochemistry, Ghent University; Ghent, Belgium
| | - Brynn H. Voy
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Animal Science, University of Tennessee Institute of Agriculture, Knoxville, Tennessee
| | - Louise Warnich
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Genetics, Faculty of AgriSciences, University of Stellenbosch, Stellenbosch, South Africa
| | - Steven W. Wilhelm
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
- Department of Microbiology, University of Tennessee-Knoxville, Knoxville, Tennessee
| | - Gregory Yandl
- Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington
- Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
| |
Collapse
|