1
|
Rabotnick MH, Ehlinger J, Haidari A, Goodrich JM. Prenatal exposures to endocrine disrupting chemicals: The role of multi-omics in understanding toxicity. Mol Cell Endocrinol 2023; 578:112046. [PMID: 37598796 PMCID: PMC10592024 DOI: 10.1016/j.mce.2023.112046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 08/11/2023] [Accepted: 08/16/2023] [Indexed: 08/22/2023]
Abstract
Endocrine disrupting chemicals (EDCs) are a diverse group of toxicants detected in populations globally. Prenatal EDC exposures impact birth and childhood outcomes. EDCs work through persistent changes at the molecular, cellular, and organ level. Molecular and biochemical signals or 'omics' can be measured at various functional levels - including the epigenome, transcriptome, proteome, metabolome, and the microbiome. In this narrative review, we introduce each omics and give examples of associations with prenatal EDC exposures. There is substantial research on epigenomic modifications in offspring exposed to EDCs during gestation, and a growing number of studies evaluating the transcriptome, proteome, metabolome, or microbiome in response to these exposures. Multi-omics, integrating data across omics layers, may improve understanding of disrupted function pathways related to early life exposures. We highlight several data integration methods to consider in multi-omics studies. Information from multi-omics can improve understanding of the biological processes and mechanisms underlying prenatal EDC toxicity.
Collapse
Affiliation(s)
- Margaret H Rabotnick
- Department of Environmental Health Sciences, University of Michigan School of Public Health, 1415 Washington Heights, Ann Arbor, MI, 48109, USA
| | - Jessa Ehlinger
- Department of Environmental Health Sciences, University of Michigan School of Public Health, 1415 Washington Heights, Ann Arbor, MI, 48109, USA
| | - Ariana Haidari
- Department of Environmental Health Sciences, University of Michigan School of Public Health, 1415 Washington Heights, Ann Arbor, MI, 48109, USA
| | - Jaclyn M Goodrich
- Department of Environmental Health Sciences, University of Michigan School of Public Health, 1415 Washington Heights, Ann Arbor, MI, 48109, USA.
| |
Collapse
|
2
|
Advances in High Throughput Proteomics Profiling in Establishing Potential Biomarkers for Gastrointestinal Cancer. Cells 2022; 11:cells11060973. [PMID: 35326424 PMCID: PMC8946849 DOI: 10.3390/cells11060973] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2021] [Revised: 03/05/2022] [Accepted: 03/08/2022] [Indexed: 12/24/2022] Open
Abstract
Gastrointestinal cancers (GICs) remain the most diagnosed cancers and accounted for the highest cancer-related death globally. The prognosis and treatment outcomes of many GICs are poor because most of the cases are diagnosed in advanced metastatic stages. This is primarily attributed to the deficiency of effective and reliable early diagnostic biomarkers. The existing biomarkers for GICs diagnosis exhibited inadequate specificity and sensitivity. To improve the early diagnosis of GICs, biomarkers with higher specificity and sensitivity are warranted. Proteomics study and its functional analysis focus on elucidating physiological and biological functions of unknown or annotated proteins and deciphering cellular mechanisms at molecular levels. In addition, quantitative analysis of translational proteomics is a promising approach in enhancing the early identification and proper management of GICs. In this review, we focus on the advances in mass spectrometry along with the quantitative and functional analysis of proteomics data that contributes to the establishment of biomarkers for GICs including, colorectal, gastric, hepatocellular, pancreatic, and esophageal cancer. We also discuss the future challenges in the validation of proteomics-based biomarkers for their translation into clinics.
Collapse
|
3
|
Mahony C, O’Ryan C. Convergent Canonical Pathways in Autism Spectrum Disorder from Proteomic, Transcriptomic and DNA Methylation Data. Int J Mol Sci 2021; 22:ijms221910757. [PMID: 34639097 PMCID: PMC8509728 DOI: 10.3390/ijms221910757] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Revised: 09/27/2021] [Accepted: 10/01/2021] [Indexed: 12/20/2022] Open
Abstract
Autism Spectrum Disorder (ASD) is a complex neurodevelopmental disorder with extensive genetic and aetiological heterogeneity. While the underlying molecular mechanisms involved remain unclear, significant progress has been facilitated by recent advances in high-throughput transcriptomic, epigenomic and proteomic technologies. Here, we review recently published ASD proteomic data and compare proteomic functional enrichment signatures with those of transcriptomic and epigenomic data. We identify canonical pathways that are consistently implicated in ASD molecular data and find an enrichment of pathways involved in mitochondrial metabolism and neurogenesis. We identify a subset of differentially expressed proteins that are supported by ASD transcriptomic and DNA methylation data. Furthermore, these differentially expressed proteins are enriched for disease phenotype pathways associated with ASD aetiology. These proteins converge on protein–protein interaction networks that regulate cell proliferation and differentiation, metabolism, and inflammation, which demonstrates a link between canonical pathways, biological processes and the ASD phenotype. This review highlights how proteomics can uncover potential molecular mechanisms to explain a link between mitochondrial dysfunction and neurodevelopmental pathology.
Collapse
|
4
|
Jain A, Singh HB, Das S. Deciphering plant-microbe crosstalk through proteomics studies. Microbiol Res 2020; 242:126590. [PMID: 33022544 DOI: 10.1016/j.micres.2020.126590] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2020] [Revised: 08/21/2020] [Accepted: 08/21/2020] [Indexed: 11/25/2022]
Abstract
Proteomic approaches are being used to elucidate a better discretion of interactions occurring between host, pathogen, and/or beneficial microorganisms at the molecular level. Application of proteomic techniques, unravel pathogenicity, stress-related, and antioxidant proteins expressed amid plant-microbe interactions and good information have been generated. It is being perceived that a fine regulation of protein expression takes place for effective pathogen recognition, induction of resistance, and maintenance of host integrity. However, our knowledge of molecular plant-microbe interactions is still incomplete and inconsequential. This review aims to provide insight into numerous ways used for proteomic investigation including peptide/protein identification, separation, and quantification during host defense response. Here, we highlight the current progress in proteomics of defense responses elicited by bacterial, fungal, and viral pathogens in plants along with which the proteome level changes induced by beneficial microorganisms are also discussed.
Collapse
Affiliation(s)
- Akansha Jain
- Division of Plant Biology, Bose Institute Centenary Campus, P 1/12, CIT Scheme, VII-M, Kankurgachi, Kolkata, 700054, West Bengal, India.
| | - Harikesh Bahadur Singh
- Department of Mycology and Plant Pathology, Institute of Agricultural Sciences, Banaras Hindu University, Varanasi, 221005, India.
| | - Sampa Das
- Division of Plant Biology, Bose Institute Centenary Campus, P 1/12, CIT Scheme, VII-M, Kankurgachi, Kolkata, 700054, West Bengal, India.
| |
Collapse
|
5
|
Metaproteomics analysis reveals the adaptation process for the chicken gut microbiota. Appl Environ Microbiol 2013; 80:478-85. [PMID: 24212578 DOI: 10.1128/aem.02472-13] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
The animal gastrointestinal tract houses a large microbial community, the gut microbiota, that confers many benefits to its host, such as protection from pathogens and provision of essential metabolites. Metagenomic approaches have defined the chicken fecal microbiota in other studies, but here, we wished to assess the correlation between the metagenome and the bacterial proteome in order to better understand the healthy chicken gut microbiota. Here, we performed high-throughput sequencing of 16S rRNA gene amplicons and metaproteomics analysis of fecal samples to determine microbial gut composition and protein expression. 16 rRNA gene sequencing analysis identified Clostridiales, Bacteroidaceae, and Lactobacillaceae species as the most abundant species in the gut. For metaproteomics analysis, peptides were generated by using the Fasp method and subsequently fractionated by strong anion exchanges. Metaproteomics analysis identified 3,673 proteins. Among the most frequently identified proteins, 380 proteins belonged to Lactobacillus spp., 155 belonged to Clostridium spp., and 66 belonged to Streptococcus spp. The most frequently identified proteins were heat shock chaperones, including 349 GroEL proteins, from many bacterial species, whereas the most abundant enzymes were pyruvate kinases, as judged by the number of peptides identified per protein (spectral counting). Gene ontology and KEGG pathway analyses revealed the functions and locations of the identified proteins. The findings of both metaproteomics and 16S rRNA sequencing analyses are discussed.
Collapse
|
6
|
Du P, Li T, Wang X. Recent progress in predicting protein sub-subcellular locations. Expert Rev Proteomics 2011; 8:391-404. [PMID: 21679119 DOI: 10.1586/epr.11.20] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
In the last two decades, the number of the known protein sequences increased very rapidly. However, a knowledge of protein function only exists for a small portion of these sequences. Since the experimental approaches for determining protein functions are costly and time consuming, in silico methods have been introduced to bridge the gap between knowledge of protein sequences and their functions. Knowing the subcellular location of a protein is considered to be a critical step in understanding its biological functions. Many efforts have been undertaken to predict the protein subcellular locations in silico. With the accumulation of available data, the substructures of some subcellular organelles, such as the cell nucleus, mitochondria and chloroplasts, have been taken into consideration by several studies in recent years. These studies create a new research topic, namely 'protein sub-subcellular location prediction', which goes one level deeper than classic protein subcellular location prediction.
Collapse
Affiliation(s)
- Pufeng Du
- School of Computer Science and Technology, Tianjin University, Tianjin 300072, China
| | | | | |
Collapse
|
7
|
Capone G, Novello G, Fasano C, Trost B, Bickis M, Kusalik A, Kanduc D. The oligodeoxynucleotide sequences corresponding to never-expressed peptide motifs are mainly located in the non-coding strand. BMC Bioinformatics 2010; 11:383. [PMID: 20646284 PMCID: PMC2919516 DOI: 10.1186/1471-2105-11-383] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2010] [Accepted: 07/20/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND We study the usage of specific peptide platforms in protein composition. Using the pentapeptide as a unit of length, we find that in the universal proteome many pentapeptides are heavily repeated (even thousands of times), whereas some are quite rare, and a small number do not appear at all. To understand the physico-chemical-biological basis underlying peptide usage at the proteomic level, in this study we analyse the energetic costs for the synthesis of rare and never-expressed versus frequent pentapeptides. In addition, we explore residue bulkiness, hydrophobicity, and codon number as factors able to modulate specific peptide frequencies. Then, the possible influence of amino acid composition is investigated in zero- and high-frequency pentapeptide sets by analysing the frequencies of the corresponding inverse-sequence pentapeptides. As a final step, we analyse the pentadecamer oligodeoxynucleotide sequences corresponding to the never-expressed pentapeptides. RESULTS We find that only DNA context-dependent constraints (such as oligodeoxynucleotide sequence location in the minus strand, introns, pseudogenes, frameshifts, etc.) provide a coherent mechanistic platform to explain the occurrence of never-expressed versus frequent pentapeptides in the protein world. CONCLUSIONS This study is of importance in cell biology. Indeed, the rarity (or lack of expression) of specific 5-mer peptide modules implies the rarity (or lack of expression) of the corresponding n-mer peptide sequences (with n < 5), so possibly modulating protein compositional trends. Moreover the data might further our understanding of the role exerted by rare pentapeptide modules as critical biological effectors in protein-protein interactions.
Collapse
Affiliation(s)
- Giovanni Capone
- Department of Biochemistry and Molecular Biology "Ernesto Quagliariello", University of Bari, Bari, Italy
| | - Giuseppe Novello
- Department of Biochemistry and Molecular Biology "Ernesto Quagliariello", University of Bari, Bari, Italy
| | - Candida Fasano
- Department of Biochemistry and Molecular Biology "Ernesto Quagliariello", University of Bari, Bari, Italy
| | - Brett Trost
- Department of Computer Science, University of Saskatchewan, Saskatoon, Canada
| | - Mik Bickis
- Department of Mathematics and Statistics, University of Saskatchewan, Saskatoon, Canada
| | - Anthony Kusalik
- Department of Computer Science, University of Saskatchewan, Saskatoon, Canada
| | - Darja Kanduc
- Department of Biochemistry and Molecular Biology "Ernesto Quagliariello", University of Bari, Bari, Italy
| |
Collapse
|
8
|
Protein Bioinformatics Infrastructure for the Integration and Analysis of Multiple High-Throughput "omics" Data. Adv Bioinformatics 2010:423589. [PMID: 20369061 PMCID: PMC2847380 DOI: 10.1155/2010/423589] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2009] [Accepted: 01/05/2010] [Indexed: 12/26/2022] Open
Abstract
High-throughput “omics” technologies bring new opportunities for biological and biomedical researchers to ask complex questions and gain new scientific insights. However, the voluminous, complex, and context-dependent data being maintained in heterogeneous and distributed environments plus the lack of well-defined data standard and standardized nomenclature imposes a major challenge which requires advanced computational methods and bioinformatics infrastructures for integration, mining, visualization, and comparative analysis to facilitate data-driven hypothesis generation and biological knowledge discovery. In this paper, we present the challenges in high-throughput “omics” data integration and analysis, introduce a protein-centric approach for systems integration of large and heterogeneous high-throughput “omics” data including microarray, mass spectrometry, protein sequence, protein structure, and protein interaction data, and use scientific case study to illustrate how one can use varied “omics” data from different laboratories to make useful connections that could lead to new biological knowledge.
Collapse
|
9
|
Suzek BE, Huang H, McGarvey P, Mazumder R, Wu CH. UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics 2007; 23:1282-8. [PMID: 17379688 DOI: 10.1093/bioinformatics/btm098] [Citation(s) in RCA: 877] [Impact Index Per Article: 51.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Redundant protein sequences in biological databases hinder sequence similarity searches and make interpretation of search results difficult. Clustering of protein sequence space based on sequence similarity helps organize all sequences into manageable datasets and reduces sampling bias and overrepresentation of sequences. RESULTS The UniRef (UniProt Reference Clusters) provide clustered sets of sequences from the UniProt Knowledgebase (UniProtKB) and selected UniProt Archive records to obtain complete coverage of sequence space at several resolutions while hiding redundant sequences. Currently covering >4 million source sequences, the UniRef100 database combines identical sequences and subfragments from any source organism into a single UniRef entry. UniRef90 and UniRef50 are built by clustering UniRef100 sequences at the 90 or 50% sequence identity levels. UniRef100, UniRef90 and UniRef50 yield a database size reduction of approximately 10, 40 and 70%, respectively, from the source sequence set. The reduced redundancy increases the speed of similarity searches and improves detection of distant relationships. UniRef entries contain summary cluster and membership information, including the sequence of a representative protein, member count and common taxonomy of the cluster, the accession numbers of all the merged entries and links to rich functional annotation in UniProtKB to facilitate biological discovery. UniRef has already been applied to broad research areas ranging from genome annotation to proteomics data analysis. AVAILABILITY UniRef is updated biweekly and is available for online search and retrieval at http://www.uniprot.org, as well as for download at ftp://ftp.uniprot.org/pub/databases/uniprot/uniref. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Baris E Suzek
- Protein Information Resource, Department of Biochemistry and Molecular & Cellular Biology, Georgetown University Medical Center, Washington, DC 20007, USA.
| | | | | | | | | |
Collapse
|