1
|
Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster. PLoS One 2023; 18:e0288023. [PMID: 37556452 PMCID: PMC10411809 DOI: 10.1371/journal.pone.0288023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Accepted: 06/18/2023] [Indexed: 08/11/2023] Open
Abstract
Computational prediction of absolute essential genes using machine learning has gained wide attention in recent years. However, essential genes are mostly conditional and not absolute. Experimental techniques provide a reliable approach of identifying conditionally essential genes; however, experimental methods are laborious, time and resource consuming, hence computational techniques have been used to complement the experimental methods. Computational techniques such as supervised machine learning, or flux balance analysis are grossly limited due to the unavailability of required data for training the model or simulating the conditions for gene essentiality. This study developed a heuristic-enabled active machine learning method based on a light gradient boosting model to predict essential immune response and embryonic developmental genes in Drosophila melanogaster. We proposed a new sampling selection technique and introduced a heuristic function which replaces the human component in traditional active learning models. The heuristic function dynamically selects the unlabelled samples to improve the performance of the classifier in the next iteration. Testing the proposed model with four benchmark datasets, the proposed model showed superior performance when compared to traditional active learning models (random sampling and uncertainty sampling). Applying the model to identify conditionally essential genes, four novel essential immune response genes and a list of 48 novel genes that are essential in embryonic developmental condition were identified. We performed functional enrichment analysis of the predicted genes to elucidate their biological processes and the result evidence our predictions. Immune response and embryonic development related processes were significantly enriched in the essential immune response and embryonic developmental genes, respectively. Finally, we propose the predicted essential genes for future experimental studies and use of the developed tool accessible at http://heal.covenantuniversity.edu.ng for conditional essentiality predictions.
Collapse
|
2
|
Proteome-wide quantitative analysis of redox cysteine availability in the Drosophila melanogaster eye reveals oxidation of phototransduction machinery during blue light exposure and age. Redox Biol 2023; 63:102723. [PMID: 37146512 DOI: 10.1016/j.redox.2023.102723] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Revised: 04/20/2023] [Accepted: 04/26/2023] [Indexed: 05/07/2023] Open
Abstract
The retina is one of the highest oxygen-consuming tissues because visual transduction and light signaling processes require large amounts of ATP. Thus, because of the high energy demand, oxygen-rich environment, and tissue transparency, the eye is susceptible to excess production of reactive oxygen species (ROS) resulting in oxidative stress. Oxidative stress in the eye is associated with the development and progression of ocular diseases including cataracts, glaucoma, age-related macular degeneration, and diabetic retinopathy. ROS can modify and damage cellular proteins, but can also be involved in redox signaling. In particular, the thiol groups of cysteines can undergo reversible or irreversible oxidative post-translational modifications (PTMs). Identifying the redox-sensitive cysteines on a proteome-wide scale provides insight into those proteins that act as redox sensors or become irreversibly damaged upon exposure to oxidative stress. In this study, we profiled the redox proteome of the Drosophila eye under prolonged, high intensity blue light exposure and age using iodoacetamide isobaric label sixplex reagents (iodo-TMT) to identify changes in cysteine availability. Although redox metabolite analysis of the major antioxidant, glutathione, revealed similar ratios of its oxidized and reduced form in aged or light-stressed eyes, we observed different changes in the redox proteome under these conditions. Both conditions resulted in significant oxidation of proteins involved in phototransduction and photoreceptor maintenance but affected distinct targets and cysteine residues. Moreover, redox changes induced by blue light exposure were accompanied by a large reduction in light sensitivity that did not arise from a reduction in the photopigment level, suggesting that the redox-sensitive cysteines we identified in the phototransduction machinery might contribute to light adaptation. Our data provide a comprehensive description of the redox proteome of Drosophila eye tissue under light stress and aging and suggest how redox signaling might contribute to light adaptation in response to acute light stress.
Collapse
|
3
|
In Depth Exploration of the Alternative Proteome of Drosophila melanogaster. Front Cell Dev Biol 2022; 10:901351. [PMID: 35721519 PMCID: PMC9204603 DOI: 10.3389/fcell.2022.901351] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Accepted: 04/25/2022] [Indexed: 12/13/2022] Open
Abstract
Recent studies have shown that hundreds of small proteins were occulted when protein-coding genes were annotated. These proteins, called alternative proteins, have failed to be annotated notably due to the short length of their open reading frame (less than 100 codons) or the enforced rule establishing that messenger RNAs (mRNAs) are monocistronic. Several alternative proteins were shown to be biologically active molecules and seem to be involved in a wide range of biological functions. However, genome-wide exploration of the alternative proteome is still limited to a few species. In the present article, we describe a deep peptidomics workflow which enabled the identification of 401 alternative proteins in Drosophila melanogaster. Subcellular localization, protein domains, and short linear motifs were predicted for 235 of the alternative proteins identified and point toward specific functions of these small proteins. Several alternative proteins had approximated abundances higher than their canonical counterparts, suggesting that these alternative proteins are actually the main products of their corresponding genes. Finally, we observed 14 alternative proteins with developmentally regulated expression patterns and 10 induced upon the heat-shock treatment of embryos, demonstrating stage or stress-specific production of alternative proteins.
Collapse
|
4
|
Dynamic transcriptome analysis of Bombyx mori embryonic development. INSECT SCIENCE 2022; 29:344-362. [PMID: 34388292 DOI: 10.1111/1744-7917.12934] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 04/22/2021] [Accepted: 04/26/2021] [Indexed: 06/13/2023]
Abstract
Bombyx mori has been extensively studied but the gene expression control of its embryonic development is unclear. In this study, we performed transcriptome profiling of six stages of B. mori embryonic development using RNA sequencing (RNA-seq). A total of 12 894 transcripts were obtained from the embryos. Of these, 12 456 transcripts were shared among the six stages, namely, fertilized egg, blastoderm, germ-band, organogenesis, reversal period, and youth period stages. There were 111, 48, 41, 54, 77, and 107 transcripts specifically expressed during the six stages, respectively. By analyzing weighted gene correlation networks and differently expressed genes, we found that during embryonic development, many genes related to DNA replication, transcription, protein synthesis, and epigenetic modifications were upregulated in the early embryos. Genes of cuticle proteins, chitin synthesis-related proteins, and neuropeptides were more abundant in the late embryos. Although pathways of juvenile hormone and the ecdysteroid 20-hydroxyecdysone, and transcription factors were expressed throughout the embryonic development stages, more regulatory pathways were highly expressed around the organogenesis stage, suggesting more gene expression for organogenesis. The results of RNA-seq were confirmed by quantitative real-time polymerase chain reaction of 16 genes of different pathways. Nucleic acid methylation and seven sites in histone H3 modifications were confirmed by dot blot and western blot. This study increases the understanding of the molecular mechanisms of the embryonic developmental process and information on the regulation of B. mori development.
Collapse
|
5
|
Systematic Identification of Microproteins during the Development of Drosophila melanogaster. J Proteome Res 2022; 21:1114-1123. [PMID: 35227063 DOI: 10.1021/acs.jproteome.2c00004] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Short open reading frame-encoded peptides (SEPs) are microproteins with less than 100 amino acids that play an essential role in the growth and development of organisms. There are plenty of short open reading frames in Drosophila melanogaster that potentially code polypeptides. We chose 11 time points during the life cycle of Drosophila to investigate microproteins, particularly those related to development. Finally, we identified a total of 410 microproteins, of which 27 were noncoding RNA-encoded proteins. Of the 410 microproteins, 74 were expressed in all stages from embryo to adults, whereas 300 microproteins were only found in one or two time points. Approximately, one-third of the microproteins were not reported previously and 44 were obtained from de novo sequencing, validated by synthetic peptides. These microproteins are related to the main bioprocesses of growth and development, such as multicellular organism reproduction, postmating behavior, and oviposition. Over half of the microproteins have predicted functional domains and are conserved across species, suggesting that these microproteins have critical functions in fly development. This work enriches the D. melanogaster proteome and provides a significant data resource for growth and development research.
Collapse
|
6
|
Label-free quantitative proteomics analysis of jujube ( Ziziphus jujuba Mill.) during different growth stages. RSC Adv 2021; 11:22106-22119. [PMID: 35480818 PMCID: PMC9034241 DOI: 10.1039/d1ra02989d] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2021] [Accepted: 06/15/2021] [Indexed: 01/08/2023] Open
Abstract
Chinese jujube (Zizyphus jujuba Mill.), a member of the Rhamnaceae family with favorable nutritional and flavor quality, exhibited characteristic climacteric changes during its fruit growth stage. Therefore, fruit samples were harvested at four developmental stages on days 55 (young fruits), 76 (white-mature fruits), 96 (half-red fruits), and 116 (full-red fruits) after flowering (DAF). This study then investigated those four growth stage changes of the jujube proteome using label-free quantification proteomics. The results identified 4762 proteins in the samples, of which 3757 proteins were quantified. Compared with former stages, the stages examined were designated as "76 vs. 55 DAF" group, "96 vs. 76 DAF" group, and "116 vs. 96 DAF" group. Gene Ontology (GO) and KEGG annotation and enrichment analysis of the differentially expressed proteins (DEPs) showed that 76 vs. 55 DAF group pathways represented amino sugar, nucleotide sugar, ascorbate, and aldarate metabolic pathways. These pathways were associated with cell division and resistance. In the study, the jujube fruit puffing slowed down and attained a stable growth stage in the 76 vs. 55 DAF group. However, fatty acid biosynthesis and phenylalanine metabolism was mainly enriched in the 96 vs. 76 DAF group. Fatty acids are precursors of aromatic substances and fat-soluble pigments in fruit. The upregulation of differential proteins at this stage indicates that aromatic compounds were synthesized in large quantities at this stage and that fruit would enter the ripening stage. During the ripening stage, 55 DEPs were identified to be involved in photosynthesis and flavonoid biosynthesis in the 116 vs. 96 DAF group. Also, the fruit entered the mature stage, which showed that flavonoids were produced in large quantities. Furthermore, the color of jujube turned red, and photosynthesis was significantly reduced. Hence, a link was established between protein profiles and growth phenotypes, which will help improve our understanding of jujube fruit growth at the proteomic level.
Collapse
|
7
|
Precise Temporal Regulation of Post-transcriptional Repressors Is Required for an Orderly Drosophila Maternal-to-Zygotic Transition. Cell Rep 2021; 31:107783. [PMID: 32579915 PMCID: PMC7372737 DOI: 10.1016/j.celrep.2020.107783] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2020] [Revised: 05/06/2020] [Accepted: 05/28/2020] [Indexed: 12/12/2022] Open
Abstract
In animal embryos, the maternal-to-zygotic transition (MZT) hands developmental control from maternal to zygotic gene products. We show that the maternal proteome represents more than half of the protein-coding capacity of Drosophila melanogaster’s genome, and that 2% of this proteome is rapidly degraded during the MZT. Cleared proteins include the post-transcriptional repressors Cup, Trailer hitch (TRAL), Maternal expression at 31B (ME31B), and Smaug (SMG). Although the ubiquitin-proteasome system is necessary for clearance of these repressors, distinct E3 ligase complexes target them: the C-terminal to Lis1 Homology (CTLH) complex targets Cup, TRAL, and ME31B for degradation early in the MZT and the Skp/Cullin/F-box-containing (SCF) complex targets SMG at the end of the MZT. Deleting the C-terminal 233 amino acids of SMG abrogates F-box protein interaction and confers immunity to degradation. Persistent SMG downregulates zygotic re-expression of mRNAs whose maternal contribution is degraded by SMG. Thus, clearance of SMG permits an orderly MZT. Cao et al. show that 2% of the proteome is degraded in early Drosophila embryos, including a repressive ribonucleoprotein complex. Two E3 ubiquitin ligases separately act on distinct components of this complex to phase their clearance. Failure to degrade a key component, the Smaug RNA-binding protein, disrupts an orderly maternal-to-zygotic transition.
Collapse
|
8
|
Chemical Embryology Redux: Metabolic Control of Development. Trends Genet 2020; 36:577-586. [PMID: 32532533 PMCID: PMC10947471 DOI: 10.1016/j.tig.2020.05.007] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Revised: 05/14/2020] [Accepted: 05/18/2020] [Indexed: 11/16/2022]
Abstract
New studies of metabolic reactions and networks in embryos are making important additions to regulatory models of development, so far dominated by genes and signals. Metabolic control of development is not a new idea and can be traced back to Joseph Needham's 'Chemical Embryology', published in the 1930s. Even though Needham's ideas fell by the wayside with the advent of genetic studies of embryogenesis, they demonstrated that embryos provide convenient models for addressing fundamental questions in biochemistry and are now experiencing a comeback, enabled by the powerful merger of detailed mechanistic studies and systems-level techniques. Here we review recent results from studies that quantified the energy budget of embryogenesis in Drosophila and started to untangle the intricate connections between core anabolic processes and developmental transitions. Dynamic coordination of metabolic, genetic, and signaling networks appears to be essential for seamless progression of development.
Collapse
|
9
|
Multi-level and lineage-specific interactomes of the Hox transcription factor Ubx contribute to its functional specificity. Nat Commun 2020; 11:1388. [PMID: 32170121 PMCID: PMC7069958 DOI: 10.1038/s41467-020-15223-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Accepted: 02/21/2020] [Indexed: 12/21/2022] Open
Abstract
Transcription factors (TFs) control cell fates by precisely orchestrating gene expression. However, how individual TFs promote transcriptional diversity remains unclear. Here, we use the Hox TF Ultrabithorax (Ubx) as a model to explore how a single TF specifies multiple cell types. Using proximity-dependent Biotin IDentification in Drosophila, we identify Ubx interactomes in three embryonic tissues. We find that Ubx interacts with largely non-overlapping sets of proteins with few having tissue-specific RNA expression. Instead most interactors are active in many cell types, controlling gene expression from chromatin regulation to the initiation of translation. Genetic interaction assays in vivo confirm that they act strictly lineage- and process-specific. Thus, functional specificity of Ubx seems to play out at several regulatory levels and to result from the controlled restriction of the interaction potential by the cellular environment. Thereby, it challenges long-standing assumptions such as differential RNA expression as determinant for protein complexes. Many transcription factors regulate gene expression in a lineage- and process-specific manner, despite being expressed in several cell types. Here, the authors show that the Hox transcription factor Ubx has lineage-specific interactomes, which contribute to its cell context-dependent functions.
Collapse
|
10
|
SILAC-based quantitative proteomic analysis of Drosophila gastrula stage embryos mutant for fibroblast growth factor signalling. Fly (Austin) 2019; 14:10-28. [PMID: 31873056 DOI: 10.1080/19336934.2019.1705118] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Quantitative proteomic analyses in combination with genetics provide powerful tools in developmental cell signalling research. Drosophila melanogaster is one of the most widely used genetic models for studying development and disease. Here we combined quantitative proteomics with genetic selection to determine changes in the proteome upon depletion of Heartless (Htl) Fibroblast-Growth Factor (FGF) receptor signalling in Drosophila embryos at the gastrula stage. We present a robust, single generation SILAC (stable isotope labelling with amino acids in cell culture) protocol for labelling proteins in early embryos. For the selection of homozygously mutant embryos at the pre-gastrula stage, we developed an independent genetic marker. Our analyses detected quantitative changes in the global proteome of htl mutant embryos during gastrulation. We identified distinct classes of downregulated and upregulated proteins, and network analyses indicate functionally related groups of proteins in each class. In addition, we identified changes in the abundance of phosphopeptides. In summary, our quantitative proteomic analysis reveals global changes in metabolic, nucleoplasmic, cytoskeletal and transport proteins in htl mutant embryos.
Collapse
|
11
|
Comparison of Drosophila melanogaster Embryo and Adult Proteome by SWATH-MS Reveals Differential Regulation of Protein Synthesis, Degradation Machinery, and Metabolism Modules. J Proteome Res 2019; 18:2525-2534. [DOI: 10.1021/acs.jproteome.9b00076] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
12
|
Quantifying post-transcriptional regulation in the development of Drosophila melanogaster. Nat Commun 2018; 9:4970. [PMID: 30478415 PMCID: PMC6255845 DOI: 10.1038/s41467-018-07455-9] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2018] [Accepted: 10/30/2018] [Indexed: 11/10/2022] Open
Abstract
Even though proteins are produced from mRNA, the correlation between mRNA levels and protein abundances is moderate in most studies, occasionally attributed to complex post-transcriptional regulation. To address this, we generate a paired transcriptome/proteome time course dataset with 14 time points during Drosophila embryogenesis. Despite a limited mRNA-protein correlation (ρ = 0.54), mathematical models describing protein translation and degradation explain 84% of protein time-courses based on the measured mRNA dynamics without assuming complex post transcriptional regulation, and allow for classification of most proteins into four distinct regulatory scenarios. By performing an in-depth characterization of the putatively post-transcriptionally regulated genes, we postulate that the RNA-binding protein Hrb98DE is involved in post-transcriptional control of sugar metabolism in early embryogenesis and partially validate this hypothesis using Hrb98DE knockdown. In summary, we present a systems biology framework for the identification of post-transcriptional gene regulation from large-scale, time-resolved transcriptome and proteome data.
Collapse
|
13
|
Ribosomal flavours: an acquired taste for specific mRNAs? Biochem Soc Trans 2018; 46:1529-1539. [PMID: 30420413 DOI: 10.1042/bst20180160] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2018] [Revised: 09/14/2018] [Accepted: 09/17/2018] [Indexed: 12/20/2022]
Abstract
The regulation of translation is critical in almost every aspect of gene expression. Nonetheless, the ribosome is historically viewed as a passive player in this process. However, evidence is accumulating to suggest that variations in the ribosome can have an important influence on which mRNAs are translated. Scope for variation is provided via multiple avenues, including heterogeneity at the level of both ribosomal proteins and ribosomal RNAs and their covalent modifications. Together, these variations provide the potential for hundreds, if not thousands, of flavours of ribosome, each of which could have idiosyncratic preferences for the translation of certain messenger RNAs. Indeed, perturbations to this heterogeneity appear to affect specific subsets of transcripts and manifest as cell-type-specific diseases. This review provides a historical perspective of the ribosomal code hypothesis, before outlining the various sources of heterogeneity, their regulation and functional consequences for the cell.
Collapse
|
14
|
Spectral Libraries for SWATH-MS Assays for Drosophila melanogaster and Solanum lycopersicum. Proteomics 2018; 17. [PMID: 28922568 DOI: 10.1002/pmic.201700216] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2017] [Revised: 08/02/2017] [Indexed: 12/21/2022]
Abstract
Quantitative proteomics methods have emerged as powerful tools for measuring protein expression changes at the proteome level. Using MS-based approaches, it is now possible to routinely quantify thousands of proteins. However, prefractionation of the samples at the protein or peptide level is usually necessary to go deep into the proteome, increasing both MS analysis time and technical variability. Recently, a new MS acquisition method named SWATH is introduced with the potential to provide good coverage of the proteome as well as a good measurement precision without prior sample fractionation. In contrast to shotgun-based MS however, a library containing experimental acquired spectra is necessary for the bioinformatics analysis of SWATH data. In this study, spectral libraries for two widely used models are built to study crop ripening or animal embryogenesis, Solanum lycopersicum (tomato) and Drosophila melanogaster, respectively. The spectral libraries comprise fragments for 5197 and 6040 proteins for S. lycopersicum and D. melanogaster, respectively, and allow reproducible quantification for thousands of peptides per MS analysis. The spectral libraries and all MS data are available in the MassIVE repository with the dataset identifiers MSV000081074 and MSV000081075 and the PRIDE repository with the dataset identifiers PXD006493 and PXD006495.
Collapse
|
15
|
Comparison of protein expression between human livers and the hepatic cell lines HepG2, Hep3B, and Huh7 using SWATH and MRM-HR proteomics: Focusing on drug-metabolizing enzymes. Drug Metab Pharmacokinet 2018; 33:133-140. [PMID: 29610054 DOI: 10.1016/j.dmpk.2018.03.003] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2018] [Revised: 02/07/2018] [Accepted: 02/13/2018] [Indexed: 12/25/2022]
Abstract
Human hepatic cell lines are widely used as an in vitro model for the study of drug metabolism and liver toxicity. However, the validity of this model is still a subject of debate because the expressions of various proteins in the cell lines, including drug-metabolizing enzymes (DMEs), can differ significantly from those in human livers. In the present study, we first conducted an untargeted proteomics analysis of the microsomes of the cell lines HepG2, Hep3B, and Huh7, and compared them to human livers using a sequential window acquisition of all theoretical mass spectra (SWATH) method. Furthermore, high-resolution multiple reaction monitoring (MRM-HR), a targeted proteomic approach, was utilized to compare the expressions of pre-selected DMEs between human livers and the cell lines. In general, the SWATH quantifications were in good agreement with the MRM-HR analysis. Over 3000 protein groups were quantified in the cells and human livers, and the proteome profiles of human livers significantly differed from the cell lines. Among the 101 DMEs quantified with MRM-HR, most were expressed at substantially lower levels in the cell lines. Thus, appropriate caution must be exercised when using these cell lines for the study of hepatic drug metabolism and toxicity.
Collapse
|
16
|
In-depth characterization of the tomato fruit pericarp proteome. Proteomics 2017; 17. [PMID: 27957804 DOI: 10.1002/pmic.201600406] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2016] [Revised: 11/24/2016] [Accepted: 12/07/2016] [Indexed: 12/19/2022]
Abstract
Since the genome of Solanum lycopersicum L. was published in 2012, some studies have explored its proteome although with a limited depth. In this work, we present an extended characterization of the proteome of the tomato pericarp at its ripe red stage. Fractionation of tryptic peptides generated from pericarp proteins by off-line high-pH reverse-phase phase chromatography in combination with LC-MS/MS analysis on a Fisher Scientific Q Exactive and a Sciex Triple-TOF 6600 resulted in the identification of 8588 proteins with a 1% FDR both at the peptide and protein levels. Proteins were mapped through GO and KEGG databases and a large number of the identified proteins were associated with cytoplasmic organelles and metabolic pathways categories. These results constitute one of the most extensive proteome datasets of tomato so far and provide an experimental confirmation of the existence of a high number of theoretically predicted proteins. All MS data are available in the ProteomeXchange repository with the dataset identifiers PXD004947 and PXD004932.
Collapse
|
17
|
The developmental proteome of Drosophila melanogaster. Genome Res 2017; 27:1273-1285. [PMID: 28381612 PMCID: PMC5495078 DOI: 10.1101/gr.213694.116] [Citation(s) in RCA: 84] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2016] [Accepted: 03/30/2017] [Indexed: 01/12/2023]
Abstract
Drosophila melanogaster is a widely used genetic model organism in developmental biology. While this model organism has been intensively studied at the RNA level, a comprehensive proteomic study covering the complete life cycle is still missing. Here, we apply label-free quantitative proteomics to explore proteome remodeling across Drosophila’s life cycle, resulting in 7952 proteins, and provide a high temporal-resolved embryogenesis proteome of 5458 proteins. Our proteome data enabled us to monitor isoform-specific expression of 34 genes during development, to identify the pseudogene Cyp9f3Ψ as a protein-coding gene, and to obtain evidence of 268 small proteins. Moreover, the comparison with available transcriptomic data uncovered examples of poor correlation between mRNA and protein, underscoring the importance of proteomics to study developmental progression. Data integration of our embryogenesis proteome with tissue-specific data revealed spatial and temporal information for further functional studies of yet uncharacterized proteins. Overall, our high resolution proteomes provide a powerful resource and can be explored in detail in our interactive web interface.
Collapse
|
18
|
SWATH-MS as a tool for biomarker discovery: From basic research to clinical applications. Proteomics 2017; 17. [DOI: 10.1002/pmic.201600278] [Citation(s) in RCA: 102] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Revised: 01/05/2017] [Accepted: 01/23/2017] [Indexed: 12/16/2022]
|
19
|
SWATH-MS dataset of heat-shock treated Drosophila melanogaster embryos. Data Brief 2016; 9:991-995. [PMID: 27900350 PMCID: PMC5123040 DOI: 10.1016/j.dib.2016.11.028] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2016] [Revised: 10/15/2016] [Accepted: 11/10/2016] [Indexed: 11/16/2022] Open
Abstract
Data independent acquisition (DIA) has emerged as a promising mass spectrometry based approach, combining the advantages of shotgun and targeted proteomics. Here we applied a DIA approach (termed SWATH) to monitor the dynamics of the Drosophila melanogaster embryonic proteome upon heat-shock treatment. Embryos were incubated for 0.5, 1 or 3 h at 37 °C to induce heat-shock or maintained at 25 °C. The present dataset contains SWATH files acquired on a Sciex Triple-TOF 6600. A spectral library built in-house was used to analyse these data and led to the quantification of more than 2500 proteins at every timepoint. The files presented here are permanent digital maps and can be reanalysed to search for new questions. The data have been deposited with the ProteomeXchange Consortium with the dataset identifier PRIDE: PXD004753.
Collapse
|
20
|
SWATH-MS data of Drosophila melanogaster proteome dynamics during embryogenesis. Data Brief 2016; 9:771-775. [PMID: 27844044 PMCID: PMC5097952 DOI: 10.1016/j.dib.2016.10.009] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2016] [Revised: 09/25/2016] [Accepted: 10/18/2016] [Indexed: 11/11/2022] Open
Abstract
Embryogenesis is one of the most important processes in the life of an animal. During this dynamic process, progressive cell division and cellular differentiation are accompanied by significant changes in protein expression at the level of the proteome. However, very few studies to date have described the dynamics of the proteome during the early development of an embryo in any organism. In this dataset, we monitor changes in protein expression across a timecourse of more than 20 h of Drosophila melanogaster embryonic development. Mass-spectrometry data were produced using a SWATH acquisition mode on a Sciex Triple-TOF 6600. A spectral library built in-house was used to analyse these data and more than 1950 proteins were quantified at each embryonic timepoint. The files presented here are a permanent digital map and can be reanalysed to test against new hypotheses. The data have been deposited with the ProteomeXchange Consortium with the dataset identifier PRIDE: PXD0031078.
Collapse
|