1
|
Plasma proteomics for prediction of subclinical coronary artery calcifications in primary prevention. Am Heart J 2024; 271:55-67. [PMID: 38325523 DOI: 10.1016/j.ahj.2024.01.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 01/30/2024] [Indexed: 02/09/2024]
Abstract
BACKGROUND AND AIMS Recent developments in high-throughput proteomic technologies enable the discovery of novel biomarkers of coronary atherosclerosis. The aims of this study were to test if plasma protein subsets could detect coronary artery calcifications (CAC) in asymptomatic individuals and if they add predictive value beyond traditional risk factors. METHODS Using proximity extension assays, 1,342 plasma proteins were measured in 1,827 individuals from the Impaired Glucose Tolerance and Microbiota (IGTM) study and 883 individuals from the Swedish Cardiopulmonary BioImage Study (SCAPIS) aged 50-64 years without history of ischaemic heart disease and with CAC assessed by computed tomography. After data-driven feature selection, extreme gradient boosting machine learning models were trained on the IGTM cohort to predict the presence of CAC using combinations of proteins and traditional risk factors. The trained models were validated in SCAPIS. RESULTS The best plasma protein subset (44 proteins) predicted CAC with an area under the curve (AUC) of 0.691 in the validation cohort. However, this was not better than prediction by traditional risk factors alone (AUC = 0.710, P = .17). Adding proteins to traditional risk factors did not improve the predictions (AUC = 0.705, P = .6). Most of these 44 proteins were highly correlated with traditional risk factors. CONCLUSIONS A plasma protein subset that could predict the presence of subclinical CAC was identified but it did not outperform nor improve a model based on traditional risk factors. Thus, support for this targeted proteomics platform to predict subclinical CAC beyond traditional risk factors was not found.
Collapse
|
2
|
Absolute Quantification of Pan-Cancer Plasma Proteomes Reveals Unique Signature in Multiple Myeloma. Cancers (Basel) 2023; 15:4764. [PMID: 37835457 PMCID: PMC10571728 DOI: 10.3390/cancers15194764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2023] [Revised: 08/31/2023] [Accepted: 09/20/2023] [Indexed: 10/15/2023] Open
Abstract
Mass spectrometry based on data-independent acquisition (DIA) has developed into a powerful quantitative tool with a variety of implications, including precision medicine. Combined with stable isotope recombinant protein standards, this strategy provides confident protein identification and precise quantification on an absolute scale. Here, we describe a comprehensive targeted proteomics approach to profile a pan-cancer cohort consisting of 1800 blood plasma samples representing 15 different cancer types. We successfully performed an absolute quantification of 253 proteins in multiplex. The assay had low intra-assay variability with a coefficient of variation below 20% (CV = 17.2%) for a total of 1013 peptides quantified across almost two thousand injections. This study identified a potential biomarker panel of seven protein targets for the diagnosis of multiple myeloma patients using differential expression analysis and machine learning. The combination of markers, including the complement C1 complex, JCHAIN, and CD5L, resulted in a prediction model with an AUC of 0.96 for the identification of multiple myeloma patients across various cancer patients. All these proteins are known to interact with immunoglobulins.
Collapse
|
3
|
Systematic transcriptional analysis of human cell lines for gene expression landscape and tumor representation. Nat Commun 2023; 14:5417. [PMID: 37669926 PMCID: PMC10480497 DOI: 10.1038/s41467-023-41132-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Accepted: 08/24/2023] [Indexed: 09/07/2023] Open
Abstract
Cell lines are valuable resources as model for human biology and translational medicine. It is thus important to explore the concordance between the expression in various cell lines vis-à-vis human native and disease tissues. In this study, we investigate the expression of all human protein-coding genes in more than 1,000 human cell lines representing 27 cancer types by a genome-wide transcriptomics analysis. The cell line gene expression is compared with the corresponding profiles in various tissues, organs, single-cell types and cancers. Here, we present the expression for each cell line and give guidance for the most appropriate cell line for a given experimental study. In addition, we explore the cancer-related pathway and cytokine activity of the cell lines to aid human biology studies and drug development projects. All data are presented in an open access cell line section of the Human Protein Atlas to facilitate the exploration of all human protein-coding genes across these cell lines.
Collapse
|
4
|
The development of blood protein profiles in extremely preterm infants follows a stereotypic evolution pattern. COMMUNICATIONS MEDICINE 2023; 3:107. [PMID: 37532738 PMCID: PMC10397184 DOI: 10.1038/s43856-023-00338-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 07/25/2023] [Indexed: 08/04/2023] Open
Abstract
BACKGROUND Preterm birth is the leading cause of neonatal mortality and morbidity. Early diagnosis and interventions are critical to improving the clinical outcomes of extremely premature infants. Blood protein profiling during the first months of life in preterm infants can shed light on the role of early extrauterine development and provide an increased understanding of maturation after extremely preterm birth and the underlying mechanisms of prematurity-related disorders. METHODS We have investigated the blood protein profiles during the first months of life in preterm infants on the role of early extrauterine development. The blood protein levels were analyzed using next generation blood profiling on 1335 serum samples, collected longitudinally at nine time points from birth to full-term from 182 extremely preterm infants. RESULTS The protein analysis reveals evident predestined serum evolution patterns common for all included infants. The majority of the variations in blood protein expression are associated with the postnatal age of the preterm infants rather than any other factors. There is a uniform protein pattern on postnatal day 1 and after 30 weeks postmenstrual age (PMA), independent of gestational age (GA). However, during the first month of life, GA had a significant impact on protein variability. CONCLUSIONS The unified pattern of protein development for all included infants suggests an age-dependent stereotypic development of blood proteins after birth. This knowledge should be considered in neonatal settings and might alter the clinical approach within neonatology, where PMA is today the most dominant age variable.
Collapse
|
5
|
Next generation pan-cancer blood proteome profiling using proximity extension assay. Nat Commun 2023; 14:4308. [PMID: 37463882 DOI: 10.1038/s41467-023-39765-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Accepted: 06/27/2023] [Indexed: 07/20/2023] Open
Abstract
A comprehensive characterization of blood proteome profiles in cancer patients can contribute to a better understanding of the disease etiology, resulting in earlier diagnosis, risk stratification and better monitoring of the different cancer subtypes. Here, we describe the use of next generation protein profiling to explore the proteome signature in blood across patients representing many of the major cancer types. Plasma profiles of 1463 proteins from more than 1400 cancer patients are measured in minute amounts of blood collected at the time of diagnosis and before treatment. An open access Disease Blood Atlas resource allows the exploration of the individual protein profiles in blood collected from the individual cancer patients. We also present studies in which classification models based on machine learning have been used for the identification of a set of proteins associated with each of the analyzed cancers. The implication for cancer precision medicine of next generation plasma profiling is discussed.
Collapse
|
6
|
Author Correction: Endothelial cell heterogeneity and microglia regulons revealed by a pig cell landscape at single-cell level. Nat Commun 2022; 13:6748. [DOI: 10.1038/s41467-022-34498-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
|
7
|
Dynamics of the normal gut microbiota: A longitudinal one-year population study in Sweden. Cell Host Microbe 2022; 30:726-739.e3. [PMID: 35349787 DOI: 10.1016/j.chom.2022.03.002] [Citation(s) in RCA: 50] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Revised: 01/17/2022] [Accepted: 03/03/2022] [Indexed: 02/07/2023]
Abstract
Temporal dynamics of the gut microbiota potentially limit the identification of microbial features associated with health status. Here, we used whole-genome metagenomic and 16S rRNA gene sequencing to characterize the intra- and inter-individual variations of gut microbiota composition and functional potential of a disease-free Swedish population (n = 75) over one year. We found that 23% of the total compositional variance was explained by intra-individual variation. The degree of intra-individual compositional variability was negatively associated with the abundance of Faecalibacterium prausnitzii (a butyrate producer) and two Bifidobacterium species. By contrast, the abundance of facultative anaerobes and aerotolerant bacteria such as Escherichia coli and Lactobacillus acidophilus varied extensively, independent of compositional stability. The contribution of intra-individual variance to the total variance was greater for functional pathways than for microbial species. Thus, reliable quantification of microbial features requires repeated samples to address the issue of intra-individual variations of the gut microbiota.
Collapse
|
8
|
Abstract
BACKGROUND There is a need for functional genome-wide annotation of the protein-coding genes to get a deeper understanding of mammalian biology. Here, a new annotation strategy is introduced based on dimensionality reduction and density-based clustering of whole-body co-expression patterns. This strategy has been used to explore the gene expression landscape in pig, and we present a whole-body map of all protein-coding genes in all major pig tissues and organs. RESULTS An open-access pig expression map ( www.rnaatlas.org ) is presented based on the expression of 350 samples across 98 well-defined pig tissues divided into 44 tissue groups. A new UMAP-based classification scheme is introduced, in which all protein-coding genes are stratified into tissue expression clusters based on body-wide expression profiles. The distribution and tissue specificity of all 22,342 protein-coding pig genes are presented. CONCLUSIONS Here, we present a new genome-wide annotation strategy based on dimensionality reduction and density-based clustering. A genome-wide resource of the transcriptome map across all major tissues and organs in pig is presented, and the data is available as an open-access resource ( www.rnaatlas.org ), including a comparison to the expression of human orthologs.
Collapse
|
9
|
Blood protein profiles related to preterm birth and retinopathy of prematurity. Pediatr Res 2022; 91:937-946. [PMID: 33895781 PMCID: PMC9064798 DOI: 10.1038/s41390-021-01528-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 02/25/2021] [Accepted: 03/30/2021] [Indexed: 02/02/2023]
Abstract
BACKGROUND Nearly one in ten children is born preterm. The degree of immaturity is a determinant of the infant's health. Extremely preterm infants have higher morbidity and mortality than term infants. One disease affecting extremely preterm infants is retinopathy of prematurity (ROP), a multifactorial neurovascular disease that can lead to retinal detachment and blindness. The advances in omics technology have opened up possibilities to study protein expressions thoroughly with clinical accuracy, here used to increase the understanding of protein expression in relation to immaturity and ROP. METHODS Longitudinal serum protein profiles the first months after birth in 14 extremely preterm infants were integrated with perinatal and ROP data. In total, 448 unique protein targets were analyzed using Proximity Extension Assays. RESULTS We found 20 serum proteins associated with gestational age and/or ROP functioning within mainly angiogenesis, hematopoiesis, bone regulation, immune function, and lipid metabolism. Infants with severe ROP had persistent lower levels of several identified proteins during the first postnatal months. CONCLUSIONS The study contributes to the understanding of the relationship between longitudinal serum protein levels and immaturity and abnormal retinal neurovascular development. This is essential for understanding pathophysiological mechanisms and to optimize diagnosis, treatment and prevention for ROP. IMPACT Longitudinal protein profiles of 14 extremely preterm infants were analyzed using a novel multiplex protein analysis platform combined with perinatal data. Proteins associated with gestational age at birth and the neurovascular disease ROP were identified. Among infants with ROP, longitudinal levels of the identified proteins remained largely unchanged during the first postnatal months. The main functions of the proteins identified were angiogenesis, hematopoiesis, immune function, bone regulation, lipid metabolism, and central nervous system development. The study contributes to the understanding of longitudinal serum protein patterns related to gestational age and their association with abnormal retinal neuro-vascular development.
Collapse
|
10
|
iNetModels 2.0: an interactive visualization and database of multi-omics data. Nucleic Acids Res 2021; 49:W271-W276. [PMID: 33849075 PMCID: PMC8262747 DOI: 10.1093/nar/gkab254] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Revised: 03/10/2021] [Accepted: 03/29/2021] [Indexed: 12/11/2022] Open
Abstract
It is essential to reveal the associations between various omics data for a comprehensive understanding of the altered biological process in human wellness and disease. To date, very few studies have focused on collecting and exhibiting multi-omics associations in a single database. Here, we present iNetModels, an interactive database and visualization platform of Multi-Omics Biological Networks (MOBNs). This platform describes the associations between the clinical chemistry, anthropometric parameters, plasma proteomics, plasma metabolomics, as well as metagenomics for oral and gut microbiome obtained from the same individuals. Moreover, iNetModels includes tissue- and cancer-specific Gene Co-expression Networks (GCNs) for exploring the connections between the specific genes. This platform allows the user to interactively explore a single feature's association with other omics data and customize its particular context (e.g. male/female specific). The users can also register their data for sharing and visualization of the MOBNs and GCNs. Moreover, iNetModels allows users who do not have a bioinformatics background to facilitate human wellness and disease research. iNetModels can be accessed freely at https://inetmodels.com without any limitation.
Collapse
|
11
|
iNetModels 2.0: an interactive visualization and database of multi-omics data. Nucleic Acids Res 2021; 49:W271-W276. [PMID: 33849075 DOI: 10.1101/2021.11.10.468051] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Revised: 03/10/2021] [Accepted: 03/29/2021] [Indexed: 05/20/2023] Open
Abstract
It is essential to reveal the associations between various omics data for a comprehensive understanding of the altered biological process in human wellness and disease. To date, very few studies have focused on collecting and exhibiting multi-omics associations in a single database. Here, we present iNetModels, an interactive database and visualization platform of Multi-Omics Biological Networks (MOBNs). This platform describes the associations between the clinical chemistry, anthropometric parameters, plasma proteomics, plasma metabolomics, as well as metagenomics for oral and gut microbiome obtained from the same individuals. Moreover, iNetModels includes tissue- and cancer-specific Gene Co-expression Networks (GCNs) for exploring the connections between the specific genes. This platform allows the user to interactively explore a single feature's association with other omics data and customize its particular context (e.g. male/female specific). The users can also register their data for sharing and visualization of the MOBNs and GCNs. Moreover, iNetModels allows users who do not have a bioinformatics background to facilitate human wellness and disease research. iNetModels can be accessed freely at https://inetmodels.com without any limitation.
Collapse
|
12
|
A single-cell type transcriptomics map of human tissues. SCIENCE ADVANCES 2021; 7:7/31/eabh2169. [PMID: 34321199 PMCID: PMC8318366 DOI: 10.1126/sciadv.abh2169] [Citation(s) in RCA: 529] [Impact Index Per Article: 176.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Accepted: 06/11/2021] [Indexed: 05/09/2023]
Abstract
Advances in molecular profiling have opened up the possibility to map the expression of genes in cells, tissues, and organs in the human body. Here, we combined single-cell transcriptomics analysis with spatial antibody-based protein profiling to create a high-resolution single-cell type map of human tissues. An open access atlas has been launched to allow researchers to explore the expression of human protein-coding genes in 192 individual cell type clusters. An expression specificity classification was performed to determine the number of genes elevated in each cell type, allowing comparisons with bulk transcriptomics data. The analysis highlights distinct expression clusters corresponding to cell types sharing similar functions, both within the same organs and between organs.
Collapse
|
13
|
Next generation plasma proteome profiling to monitor health and disease. Nat Commun 2021; 12:2493. [PMID: 33941778 PMCID: PMC8093230 DOI: 10.1038/s41467-021-22767-z] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 03/09/2021] [Indexed: 12/24/2022] Open
Abstract
The need for precision medicine approaches to monitor health and disease makes it important to develop sensitive and accurate assays for proteome profiles in blood. Here, we describe an approach for plasma profiling based on proximity extension assay combined with next generation sequencing. First, we analyze the variability of plasma profiles between and within healthy individuals in a longitudinal wellness study, including the influence of genetic variations on plasma levels. Second, we follow patients newly diagnosed with type 2 diabetes before and during therapeutic intervention using plasma proteome profiling. The studies show that healthy individuals have a unique and stable proteome profile and indicate that a panel of proteins could potentially be used for early diagnosis of diabetes, including stratification of patients with regards to response to metformin treatment. Although validation in larger cohorts is needed, the analysis demonstrates the usefulness of comprehensive plasma profiling for precision medicine efforts.
Collapse
|
14
|
Inflammation and Apolipoproteins Are Potential Biomarkers for Stratification of Cutaneous Melanoma Patients for Immunotherapy and Targeted Therapy. Cancer Res 2021; 81:2545-2555. [PMID: 33574091 DOI: 10.1158/0008-5472.can-20-2000] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Revised: 09/22/2020] [Accepted: 02/09/2021] [Indexed: 11/16/2022]
Abstract
Malignant cutaneous melanoma is one of the most common cancers in young adults. During the last decade, targeted and immunotherapies have significantly increased the overall survival of patients with malignant cutaneous melanoma. Nevertheless, disease progression is common, and a lack of predictive biomarkers of patient response to therapy hinders individualized treatment strategies. To address this issue, we performed a longitudinal study using an unbiased proteomics approach to identify and quantify proteins in plasma both before and during treatment from 109 patients treated with either targeted or immunotherapy. Linear modeling and machine learning approaches identified 43 potential prognostic and predictive biomarkers. A reverse correlation between apolipoproteins and proteins related to inflammation was observed. In the immunotherapy group, patients with low pretreatment expression of apolipoproteins and high expression of inflammation markers had shorter progression-free survival. Similarly, increased expression of LDHB during treatment elicited a significant impact on response to immunotherapy. Overall, we identified potential common and treatment-specific biomarkers in malignant cutaneous melanoma, paving the way for clinical use of these biomarkers following validation on a larger cohort. SIGNIFICANCE: This study identifies a potential biomarker panel that could improve the selection of therapy for patients with cutaneous melanoma.
Collapse
|
15
|
Human Immune System Variation during 1 Year. Cell Rep 2021; 32:107923. [PMID: 32697987 DOI: 10.1016/j.celrep.2020.107923] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2019] [Revised: 03/19/2020] [Accepted: 06/26/2020] [Indexed: 12/22/2022] Open
Abstract
The human immune system varies extensively between individuals, but variation within individuals over time has not been well characterized. Systems-level analyses allow for simultaneous quantification of many interacting immune system components and the inference of global regulatory principles. Here, we present a longitudinal, systems-level analysis in 99 healthy adults 50 to 65 years of age and sampled every third month for 1 year. We describe the structure of interindividual variation and characterize extreme phenotypes along a principal curve. From coordinated measurement fluctuations, we infer relationships between 115 immune cell populations and 750 plasma proteins constituting the blood immune system. While most individuals have stable immune systems, the degree of longitudinal variability is an individual feature. The most variable individuals, in the absence of overt infections, exhibited differences in markers of metabolic health suggestive of a possible link between metabolic and immunologic homeostatic regulation.
Collapse
|
16
|
Dramatic changes in blood protein levels during the first week of life in extremely preterm infants. Pediatr Res 2021; 89:604-612. [PMID: 32330929 DOI: 10.1038/s41390-020-0912-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/26/2020] [Revised: 03/27/2020] [Accepted: 04/09/2020] [Indexed: 01/23/2023]
Abstract
BACKGROUND Preterm birth and its complications are the primary cause of death among children under the age of 5. Among the survivors, morbidity both perinatally and later in life is common. The dawn of novel technical platforms for comprehensive and sensitive analysis of protein profiles in blood has opened up new possibilities to study both health and disease with significant clinical accuracy, here used to study the preterm infant and the physiological changes of the transition from intrauterine to extrauterine life. METHODS We have performed in-depth analysis of the protein profiles of 14 extremely preterm infants using longitudinal sampling. Medical variables were integrated with extensive profiling of 448 unique protein targets. RESULTS The preterm infants have a distinct unified protein profile in blood directly at birth regardless of clinical background; however, the pattern changed profoundly postnatally, expressing more diverse profiles only 1 week later and further on up to term-equivalent age. Clusters of proteins depending on temporal trend were identified. CONCLUSION The protein profiles and the temporal trends here described will contribute to the understanding of the physiological changes in the intrauterine-extrauterine transition, which is essential to adjust early-in-life interventions to prone a normal development in the vulnerable preterm infants. IMPACT We have performed longitudinal and in-depth analysis of the protein profiles of 14 extremely preterm infants using a novel multiplex protein analysis platform. The preterm infants had a distinct unified protein profile in blood directly at birth regardless of clinical background. The pattern changed dramatically postnatally, expressing more diverse profiles only 1 week later and further on up to term-equivalent age. Certain clusters of proteins were identified depending on their temporal trend, including several liver and immune proteins. The study contributes to the understanding of the physiological changes in the intrauterine-extrauterine transition.
Collapse
|
17
|
Integration of molecular profiles in a longitudinal wellness profiling cohort. Nat Commun 2020; 11:4487. [PMID: 32900998 PMCID: PMC7479148 DOI: 10.1038/s41467-020-18148-7] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2020] [Accepted: 08/03/2020] [Indexed: 12/19/2022] Open
Abstract
An important aspect of precision medicine is to probe the stability in molecular profiles among healthy individuals over time. Here, we sample a longitudinal wellness cohort with 100 healthy individuals and analyze blood molecular profiles including proteomics, transcriptomics, lipidomics, metabolomics, autoantibodies and immune cell profiling, complemented with gut microbiota composition and routine clinical chemistry. Overall, our results show high variation between individuals across different molecular readouts, while the intra-individual baseline variation is low. The analyses show that each individual has a unique and stable plasma protein profile throughout the study period and that many individuals also show distinct profiles with regards to the other omics datasets, with strong underlying connections between the blood proteome and the clinical chemistry parameters. In conclusion, the results support an individual-based definition of health and show that comprehensive omics profiling in a longitudinal manner is a path forward for precision medicine. An important aspect of precision medicine is to probe the stability in molecular profiles among healthy individuals over time. Here, the authors sample a longitudinal wellness cohort and analyse blood molecular profiles as well as gut microbiota composition.
Collapse
|
18
|
Longitudinal Plasma Protein Profiling Using Targeted Proteomics and Recombinant Protein Standards. J Proteome Res 2020; 19:4815-4825. [DOI: 10.1021/acs.jproteome.0c00194] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
|
19
|
Facets of individual-specific health signatures determined from longitudinal plasma proteome profiling. EBioMedicine 2020; 57:102854. [PMID: 32629387 PMCID: PMC7334812 DOI: 10.1016/j.ebiom.2020.102854] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2020] [Revised: 06/01/2020] [Accepted: 06/09/2020] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Precision medicine approaches aim to tackle diseases on an individual level through molecular profiling. Despite the growing knowledge about diseases and the reported diversity of molecular phenotypes, the descriptions of human health on an individual level have been far less elaborate. METHODS To provide insights into the longitudinal protein signatures of well-being, we profiled blood plasma collected over one year from 101 clinically healthy individuals using multiplexed antibody assays. After applying an antibody validation scheme, we utilized > 700 protein profiles for in-depth analyses of the individuals' short-term health trajectories. FINDINGS We found signatures of circulating proteomes to be highly individual-specific. Considering technical and longitudinal variability, we observed that 49% of the protein profiles were stable over one year. We also identified eight networks of proteins in which 11-242 proteins covaried over time. For each participant, there were unique protein profiles of which some could be explained by associations to genetic variants. INTERPRETATION This observational and non-interventional study identifyed noticeable diversity among clinically healthy subjects, and facets of individual-specific signatures emerged by monitoring the variability of the circulating proteomes over time. To enable more personal hence precise assessments of health states, longitudinal profiling of circulating proteomes can provide a valuable component for precision medicine approaches. FUNDING This work was supported by the Erling Persson Foundation, the Swedish Heart and Lung Foundation, the Knut and Alice Wallenberg Foundation, Science for Life Laboratory, and the Swedish Research Council.
Collapse
|
20
|
Whole-genome sequence association analysis of blood proteins in a longitudinal wellness cohort. Genome Med 2020; 12:53. [PMID: 32576278 PMCID: PMC7310558 DOI: 10.1186/s13073-020-00755-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2020] [Accepted: 06/11/2020] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND The human plasma proteome is important for many biological processes and targets for diagnostics and therapy. It is therefore of great interest to understand the interplay of genetic and environmental factors to determine the specific protein levels in individuals and to gain a deeper insight of the importance of genetic architecture related to the individual variability of plasma levels of proteins during adult life. METHODS We have combined whole-genome sequencing, multiplex plasma protein profiling, and extensive clinical phenotyping in a longitudinal 2-year wellness study of 101 healthy individuals with repeated sampling. Analyses of genetic and non-genetic associations related to the variability of blood levels of proteins in these individuals were performed. RESULTS The analyses showed that each individual has a unique protein profile, and we report on the intra-individual as well as inter-individual variation for 794 plasma proteins. A genome-wide association study (GWAS) using 7.3 million genetic variants identified by whole-genome sequencing revealed 144 independent variants across 107 proteins that showed strong association (P < 6 × 10-11) between genetics and the inter-individual variability on protein levels. Many proteins not reported before were identified (67 out of 107) with individual plasma level affected by genetics. Our longitudinal analysis further demonstrates that these levels are stable during the 2-year study period. The variability of protein profiles as a consequence of environmental factors was also analyzed with focus on the effects of weight loss and infections. CONCLUSIONS We show that the adult blood levels of many proteins are determined at birth by genetics, which is important for efforts aimed to understand the relationship between plasma proteome profiles and human biology and disease.
Collapse
|
21
|
An atlas of the protein-coding genes in the human, pig, and mouse brain. Science 2020; 367:367/6482/eaay5947. [PMID: 32139519 DOI: 10.1126/science.aay5947] [Citation(s) in RCA: 429] [Impact Index Per Article: 107.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Accepted: 02/06/2020] [Indexed: 12/12/2022]
Abstract
The brain, with its diverse physiology and intricate cellular organization, is the most complex organ of the mammalian body. To expand our basic understanding of the neurobiology of the brain and its diseases, we performed a comprehensive molecular dissection of 10 major brain regions and multiple subregions using a variety of transcriptomics methods and antibody-based mapping. This analysis was carried out in the human, pig, and mouse brain to allow the identification of regional expression profiles, as well as to study similarities and differences in expression levels between the three species. The resulting data have been made available in an open-access Brain Atlas resource, part of the Human Protein Atlas, to allow exploration and comparison of the expression of individual protein-coding genes in various parts of the mammalian brain.
Collapse
|
22
|
A genome-wide transcriptomic analysis of protein-coding genes in human blood cells. Science 2019; 366:366/6472/eaax9198. [DOI: 10.1126/science.aax9198] [Citation(s) in RCA: 183] [Impact Index Per Article: 36.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Accepted: 11/06/2019] [Indexed: 12/14/2022]
Abstract
Blood is the predominant source for molecular analyses in humans, both in clinical and research settings. It is the target for many therapeutic strategies, emphasizing the need for comprehensive molecular maps of the cells constituting human blood. In this study, we performed a genome-wide transcriptomic analysis of protein-coding genes in sorted blood immune cell populations to characterize the expression levels of each individual gene across the blood cell types. All data are presented in an interactive, open-access Blood Atlas as part of the Human Protein Atlas and are integrated with expression profiles across all major tissues to provide spatial classification of all protein-coding genes. This allows for a genome-wide exploration of the expression profiles across human immune cell populations and all major human tissues and organs.
Collapse
|
23
|
Abstract
The proteins secreted by human cells (collectively referred to as the secretome) are important not only for the basic understanding of human biology but also for the identification of potential targets for future diagnostics and therapies. Here, we present a comprehensive analysis of proteins predicted to be secreted in human cells, which provides information about their final localization in the human body, including the proteins actively secreted to peripheral blood. The analysis suggests that a large number of the proteins of the secretome are not secreted out of the cell, but instead are retained intracellularly, whereas another large group of proteins were identified that are predicted to be retained locally at the tissue of expression and not secreted into the blood. Proteins detected in the human blood by mass spectrometry-based proteomics and antibody-based immunoassays are also presented with estimates of their concentrations in the blood. The results are presented in an updated version 19 of the Human Protein Atlas in which each gene encoding a secretome protein is annotated to provide an open-access knowledge resource of the human secretome, including body-wide expression data, spatial localization data down to the single-cell and subcellular levels, and data about the presence of proteins that are detectable in the blood.
Collapse
|
24
|
Introducing the HPA Brain Atlas, a brain-centric sub atlas with regional expression maps of the human, mouse and pig brain. IBRO Rep 2019. [DOI: 10.1016/j.ibror.2019.07.1639] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022] Open
|
25
|
Cell Type-Specific Expression of Testis Elevated Genes Based on Transcriptomics and Antibody-Based Proteomics. J Proteome Res 2019; 18:4215-4230. [PMID: 31429579 DOI: 10.1021/acs.jproteome.9b00351] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
One of the most complex organs in the human body is the testis, where spermatogenesis takes place. This physiological process involves thousands of genes and proteins that are activated and repressed, making testis the organ with the highest number of tissue-specific genes. However, the function of a large proportion of the corresponding proteins remains unknown and testis harbors many missing proteins (MPs), defined as products of protein-coding genes that lack experimental mass spectrometry evidence. Here, an integrated omics approach was used for exploring the cell type-specific protein expression of genes with an elevated expression in testis. By combining genome-wide transcriptomics analysis with immunohistochemistry, more than 500 proteins with distinct testicular protein expression patterns were identified, and these were selected for in-depth characterization of their in situ expression in eight different testicular cell types. The cell type-specific protein expression patterns allowed us to identify six distinct clusters of expression at different stages of spermatogenesis. The analysis highlighted numerous poorly characterized proteins in each of these clusters whose expression overlapped with that of known proteins involved in spermatogenesis, including 85 proteins with an unknown function and 60 proteins that previously have been classified as MPs. Furthermore, we were able to characterize the in situ distribution of several proteins that previously lacked spatial information and cell type-specific expression within the testis. The testis elevated expression levels both at the RNA and protein levels suggest that these proteins are related to testis-specific functions. In summary, the study demonstrates the power of combining genome-wide transcriptomics analysis with antibody-based protein profiling to explore the cell type-specific expression of both well-known proteins and MPs. The analyzed proteins constitute important targets for further testis-specific research in male reproductive disorders.
Collapse
|
26
|
Systematic Development of Sandwich Immunoassays for the Plasma Secretome. Proteomics 2019; 19:e1900008. [PMID: 31278833 DOI: 10.1002/pmic.201900008] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Revised: 06/17/2019] [Indexed: 12/15/2022]
Abstract
The plasma proteome offers a clinically useful window into human health. Recent advances from highly multiplexed assays now call for appropriate pipelines to validate individual candidates. Here, a workflow is developed to build dual binder sandwich immunoassays (SIA) and for proteins predicted to be secreted into plasma. Utilizing suspension bead arrays, ≈1800 unique antibody pairs are first screened against 209 proteins with recombinant proteins as well as EDTA plasma. Employing 624 unique antibodies, dilution-dependent curves in plasma and concentration-dependent curves of full-length proteins for 102 (49%) of the targets are obtained. For 22 protein assays, the longitudinal, interindividual, and technical performance is determined in a set of plasma samples collected from 18 healthy subjects every third month over 1 year. Finally, 14 of these assays are compared with with SIAs composed of other binders, proximity extension assays, and affinity-free targeted mass spectrometry. The workflow provides a multiplexed approach to screen for SIA pairs that suggests using at least three antibodies per target. This design is applicable for a wider range of targets of the plasma proteome, and the assays can be applied for discovery but also to validate emerging candidates derived from other platforms.
Collapse
|
27
|
Abstract
In the era towards precision medicine, we here present the individual specific autoantibody signatures of 193 healthy individuals. The self-reactive IgG signatures are stable over time in a way that each individual profile is recognized in longitudinal sampling. The IgG autoantibody reactivity towards an antigen array comprising 335 protein fragments, representing 204 human proteins with potential relevance to autoimmune disorders, was measured in longitudinal plasma samples from 193 healthy individuals. This analysis resulted in unique autoantibody barcodes for each individual that were maintained over one year's time. The reactivity profiles, or signatures, are person specific in regards to the number of reactivities and antigen specificity. Two independent data sets were consistent in that each healthy individual displayed reactivity towards 0-16 antigens, with a median of six. Subsequently, four selected individuals were profiled on in-house produced high-density protein arrays containing 23,000 protein fragments representing 14,000 unique protein coding genes. Based on a unique, broad and deep longitudinal profiling of autoantibody reactivities, our results demonstrate a unique autoreactive profile in each analyzed healthy individual. The need and interest for broad-ranged and high-resolution molecular profiling of healthy individuals is rising. We have here generated and assessed an initial perspective on the global distribution of the self-reactive IgG repertoire in healthy individuals, by investigating 193 well-characterized healthy individuals. Highlights A unique longitudinal profiling of autoantibody repertoires in healthy individuals Autoantibody profiles are highly individual and stable over time All individuals display IgG binding to human protein fragments The specificity of disease associated autoantigens needs to be thoroughly characterized The identification of a small set of highly reactive autoantigens Importance of stringent antigen and sample specific cut-offs for defining reactivity.
Collapse
|
28
|
A systems-approach reveals human nestin is an endothelial-enriched, angiogenesis-independent intermediate filament protein. Sci Rep 2018; 8:14668. [PMID: 30279450 PMCID: PMC6168570 DOI: 10.1038/s41598-018-32859-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2018] [Accepted: 09/12/2018] [Indexed: 12/18/2022] Open
Abstract
The intermediate filament protein nestin is expressed during embryonic development, but considered largely restricted to areas of regeneration in the adult. Here, we perform a body-wide transcriptome and protein-profiling analysis to reveal that nestin is constitutively, and highly-selectively, expressed in adult human endothelial cells (EC), independent of proliferative status. Correspondingly, we demonstrate that it is not a marker for tumour EC in multiple malignancy types. Imaging of EC from different vascular beds reveals nestin subcellular distribution is shear-modulated. siRNA inhibition of nestin increases EC proliferation, and nestin expression is reduced in atherosclerotic plaque neovessels. eQTL analysis reveals an association between SNPs linked to cardiovascular disease and reduced aortic EC nestin mRNA expression. Our study challenges the dogma that nestin is a marker of proliferation, and provides insight into its regulation and function in EC. Furthermore, our systems-based approach can be applied to investigate body-wide expression profiles of any candidate protein.
Collapse
|
29
|
A pathology atlas of the human cancer transcriptome. Science 2017; 357:357/6352/eaan2507. [PMID: 28818916 DOI: 10.1126/science.aan2507] [Citation(s) in RCA: 2042] [Impact Index Per Article: 291.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2017] [Revised: 06/02/2017] [Accepted: 07/14/2017] [Indexed: 12/11/2022]
Abstract
Cancer is one of the leading causes of death, and there is great interest in understanding the underlying molecular mechanisms involved in the pathogenesis and progression of individual tumors. We used systems-level approaches to analyze the genome-wide transcriptome of the protein-coding genes of 17 major cancer types with respect to clinical outcome. A general pattern emerged: Shorter patient survival was associated with up-regulation of genes involved in cell growth and with down-regulation of genes involved in cellular differentiation. Using genome-scale metabolic models, we show that cancer patients have widespread metabolic heterogeneity, highlighting the need for precise and personalized medicine for cancer treatment. All data are presented in an interactive open-access database (www.proteinatlas.org/pathology) to allow genome-wide exploration of the impact of individual proteins on clinical outcomes.
Collapse
|
30
|
Abstract
Resolving the spatial distribution of the human proteome at a subcellular level can greatly increase our understanding of human biology and disease. Here we present a comprehensive image-based map of subcellular protein distribution, the Cell Atlas, built by integrating transcriptomics and antibody-based immunofluorescence microscopy with validation by mass spectrometry. Mapping the in situ localization of 12,003 human proteins at a single-cell level to 30 subcellular structures enabled the definition of the proteomes of 13 major organelles. Exploration of the proteomes revealed single-cell variations in abundance or spatial distribution and localization of about half of the proteins to multiple compartments. This subcellular map can be used to refine existing protein-protein interaction networks and provides an important resource to deconvolute the highly complex architecture of the human cell.
Collapse
|
31
|
The Human Adrenal Gland Proteome Defined by Transcriptomics and Antibody-Based Profiling. Endocrinology 2017; 158:239-251. [PMID: 27901589 DOI: 10.1210/en.2016-1758] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/14/2016] [Accepted: 11/29/2016] [Indexed: 12/22/2022]
Abstract
The adrenal gland is a composite endocrine organ with vital functions that include the synthesis and release of glucocorticoids and catecholamines. To define the molecular landscape that underlies the specific functions of the adrenal gland, we combined a genome-wide transcriptomics approach using messenger RNA sequencing of human tissues with immunohistochemistry-based protein profiling on tissue microarrays. Approximately two-thirds of all putative protein coding genes were expressed in the adrenal gland, and the analysis identified 253 genes with an elevated pattern of expression in the adrenal gland, with only 37 genes showing a markedly greater expression level (more than fivefold) in the adrenal gland compared with 31 other normal human tissue types analyzed. The analyses allowed for an assessment of the relative expression levels for well-known proteins involved in adrenal gland function but also identified previously poorly characterized proteins in the adrenal cortex, such as the FERM (4.1 protein, ezrin, radixin, moesin) domain containing 5 and the nephroblastoma overexpressed (NOV) protein homolog. We have provided a global analysis of the adrenal gland transcriptome and proteome, with a comprehensive list of genes with elevated expression in the adrenal gland and spatial information with examples of protein expression patterns for corresponding proteins. These genes and proteins constitute important starting points for an improved understanding of the normal function and pathophysiology of the adrenal glands.
Collapse
|
32
|
Analysis of Body-wide Unfractionated Tissue Data to Identify a Core Human Endothelial Transcriptome. Cell Syst 2016; 3:287-301.e3. [PMID: 27641958 DOI: 10.1016/j.cels.2016.08.001] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2015] [Revised: 05/23/2016] [Accepted: 08/03/2016] [Indexed: 12/11/2022]
Abstract
Endothelial cells line blood vessels and regulate hemostasis, inflammation, and blood pressure. Proteins critical for these specialized functions tend to be predominantly expressed in endothelial cells across vascular beds. Here, we present a systems approach to identify a panel of human endothelial-enriched genes using global, body-wide transcriptomics data from 124 tissue samples from 32 organs. We identified known and unknown endothelial-enriched gene transcripts and used antibody-based profiling to confirm expression across vascular beds. The majority of identified transcripts could be detected in cultured endothelial cells from various vascular beds, and we observed maintenance of relative expression in early passage cells. In summary, we describe a widely applicable method to determine cell-type-specific transcriptome profiles in a whole-organism context, based on differential abundance across tissues. We identify potential vascular drug targets or endothelial biomarkers and highlight candidates for functional studies to increase understanding of the endothelium in health and disease.
Collapse
|
33
|
Profiling cancer testis antigens in non-small-cell lung cancer. JCI Insight 2016; 1:e86837. [PMID: 27699219 PMCID: PMC5033889 DOI: 10.1172/jci.insight.86837] [Citation(s) in RCA: 76] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2016] [Accepted: 05/26/2016] [Indexed: 12/31/2022] Open
Abstract
Cancer testis antigens (CTAs) are of clinical interest as biomarkers and present valuable targets for immunotherapy. To comprehensively characterize the CTA landscape of non-small-cell lung cancer (NSCLC), we compared RNAseq data from 199 NSCLC tissues to the normal transcriptome of 142 samples from 32 different normal organs. Of 232 CTAs currently annotated in the Caner Testis Database (CTdatabase), 96 were confirmed in NSCLC. To obtain an unbiased CTA profile of NSCLC, we applied stringent criteria on our RNAseq data set and defined 90 genes as CTAs, of which 55 genes were not annotated in the CTdatabase, thus representing potential new CTAs. Cluster analysis revealed that CTA expression is histology dependent and concurrent expression is common. IHC confirmed tissue-specific protein expression of selected new CTAs (TKTL1, TGIF2LX, VCX, and CXORF67). Furthermore, methylation was identified as a regulatory mechanism of CTA expression based on independent data from The Cancer Genome Atlas. The proposed prognostic impact of CTAs in lung cancer was not confirmed, neither in our RNAseq cohort nor in an independent meta-analysis of 1,117 NSCLC cases. In summary, we defined a set of 90 reliable CTAs, including information on protein expression, methylation, and survival association. The detailed RNAseq catalog can guide biomarker studies and efforts to identify targets for immunotherapeutic strategies.
Collapse
|
34
|
The Human Endometrium-Specific Proteome Defined by Transcriptomics and Antibody-Based Profiling. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2015; 19:659-68. [DOI: 10.1089/omi.2015.0115] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
|
35
|
Complementing tissue characterization by integrating transcriptome profiling from the Human Protein Atlas and from the FANTOM5 consortium. Nucleic Acids Res 2015; 43:6787-98. [PMID: 26117540 PMCID: PMC4538815 DOI: 10.1093/nar/gkv608] [Citation(s) in RCA: 76] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Revised: 05/28/2015] [Accepted: 05/29/2015] [Indexed: 12/20/2022] Open
Abstract
Understanding the normal state of human tissue transcriptome profiles is essential for recognizing tissue disease states and identifying disease markers. Recently, the Human Protein Atlas and the FANTOM5 consortium have each published extensive transcriptome data for human samples using Illumina-sequenced RNA-Seq and Heliscope-sequenced CAGE. Here, we report on the first large-scale complex tissue transcriptome comparison between full-length versus 5'-capped mRNA sequencing data. Overall gene expression correlation was high between the 22 corresponding tissues analyzed (R > 0.8). For genes ubiquitously expressed across all tissues, the two data sets showed high genome-wide correlation (91% agreement), with differences observed for a small number of individual genes indicating the need to update their gene models. Among the identified single-tissue enriched genes, up to 75% showed consensus of 7-fold enrichment in the same tissue in both methods, while another 17% exhibited multiple tissue enrichment and/or high expression variety in the other data set, likely dependent on the cell type proportions included in each tissue sample. Our results show that RNA-Seq and CAGE tissue transcriptome data sets are highly complementary for improving gene model annotations and highlight biological complexities within tissue transcriptomes. Furthermore, integration with image-based protein expression data is highly advantageous for understanding expression specificities for many genes.
Collapse
|
36
|
The human cardiac and skeletal muscle proteomes defined by transcriptomics and antibody-based profiling. BMC Genomics 2015; 16:475. [PMID: 26109061 PMCID: PMC4479346 DOI: 10.1186/s12864-015-1686-y] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2014] [Accepted: 06/05/2015] [Indexed: 11/29/2022] Open
Abstract
Background To understand cardiac and skeletal muscle function, it is important to define and explore their molecular constituents and also to identify similarities and differences in the gene expression in these two different striated muscle tissues. Here, we have investigated the genes and proteins with elevated expression in cardiac and skeletal muscle in relation to all other major human tissues and organs using a global transcriptomics analysis complemented with antibody-based profiling to localize the corresponding proteins on a single cell level. Results Our study identified a comprehensive list of genes expressed in cardiac and skeletal muscle. The genes with elevated expression were further stratified according to their global expression pattern across the human body as well as their precise localization in the muscle tissues. The functions of the proteins encoded by the elevated genes are well in line with the physiological functions of cardiac and skeletal muscle, such as contraction, ion transport, regulation of membrane potential and actomyosin structure organization. A large fraction of the transcripts in both cardiac and skeletal muscle correspond to mitochondrial proteins involved in energy metabolism, which demonstrates the extreme specialization of these muscle tissues to provide energy for contraction. Conclusions Our results provide a comprehensive list of genes and proteins elevated in striated muscles. A number of proteins not previously characterized in cardiac and skeletal muscle were identified and localized to specific cellular subcompartments. These proteins represent an interesting starting point for further functional analysis of their role in muscle biology and disease. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1686-y) contains supplementary material, which is available to authorized users.
Collapse
|
37
|
Defining the Human Brain Proteome Using Transcriptomics and Antibody-Based Profiling with a Focus on the Cerebral Cortex. PLoS One 2015; 10:e0130028. [PMID: 26076492 PMCID: PMC4468152 DOI: 10.1371/journal.pone.0130028] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2015] [Accepted: 05/15/2015] [Indexed: 01/25/2023] Open
Abstract
The mammalian brain is a complex organ composed of many specialized cells, harboring sets of both common, widely distributed, as well as specialized and discretely localized proteins. Here we focus on the human brain, utilizing transcriptomics and public available Human Protein Atlas (HPA) data to analyze brain-enriched (frontal cortex) polyadenylated messenger RNA and long non-coding RNA and generate a genome-wide draft of global and cellular expression patterns of the brain. Based on transcriptomics analysis of altogether 27 tissues, we have estimated that approximately 3% (n=571) of all protein coding genes and 13% (n=87) of the long non-coding genes expressed in the human brain are enriched, having at least five times higher expression levels in brain as compared to any of the other analyzed peripheral tissues. Based on gene ontology analysis and detailed annotation using antibody-based tissue micro array analysis of the corresponding proteins, we found the majority of brain-enriched protein coding genes to be expressed in astrocytes, oligodendrocytes or in neurons with molecular properties linked to synaptic transmission and brain development. Detailed analysis of the transcripts and the genetic landscape of brain-enriched coding and non-coding genes revealed brain-enriched splice variants. Several clusters of neighboring brain-enriched genes were also identified, suggesting regulation of gene expression on the chromatin level. This multi-angle approach uncovered the brain-enriched transcriptome and linked genes to cell types and functions, providing novel insights into the molecular foundation of this highly specialized organ.
Collapse
|
38
|
Abstract
Resolving the molecular details of proteome variation in the different tissues and organs of the human body will greatly increase our knowledge of human biology and disease. Here, we present a map of the human tissue proteome based on an integrated omics approach that involves quantitative transcriptomics at the tissue and organ level, combined with tissue microarray-based immunohistochemistry, to achieve spatial localization of proteins down to the single-cell level. Our tissue-based analysis detected more than 90% of the putative protein-coding genes. We used this approach to explore the human secretome, the membrane proteome, the druggable proteome, the cancer proteome, and the metabolic functions in 32 different tissues and organs. All the data are integrated in an interactive Web-based database that allows exploration of individual proteins, as well as navigation of global expression patterns, in all major tissues and organs in the human body.
Collapse
|
39
|
Abstract
Resolving the molecular details of proteome variation in the different tissues and organs of the human body will greatly increase our knowledge of human biology and disease. Here, we present a map of the human tissue proteome based on an integrated omics approach that involves quantitative transcriptomics at the tissue and organ level, combined with tissue microarray-based immunohistochemistry, to achieve spatial localization of proteins down to the single-cell level. Our tissue-based analysis detected more than 90% of the putative protein-coding genes. We used this approach to explore the human secretome, the membrane proteome, the druggable proteome, the cancer proteome, and the metabolic functions in 32 different tissues and organs. All the data are integrated in an interactive Web-based database that allows exploration of individual proteins, as well as navigation of global expression patterns, in all major tissues and organs in the human body.
Collapse
|
40
|
|
41
|
The kidney transcriptome and proteome defined by transcriptomics and antibody-based profiling. PLoS One 2014; 9:e116125. [PMID: 25551756 PMCID: PMC4281243 DOI: 10.1371/journal.pone.0116125] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2014] [Accepted: 12/02/2014] [Indexed: 12/22/2022] Open
Abstract
To understand renal functions and disease, it is important to define the molecular constituents of the various compartments of the kidney. Here, we used comparative transcriptomic analysis of all major organs and tissues in the human body, in combination with kidney tissue micro array based immunohistochemistry, to generate a comprehensive description of the kidney-specific transcriptome and proteome. A special emphasis was placed on the identification of genes and proteins that were elevated in specific kidney subcompartments. Our analysis identified close to 400 genes that had elevated expression in the kidney, as compared to the other analysed tissues, and these were further subdivided, depending on expression levels, into tissue enriched, group enriched or tissue enhanced. Immunohistochemistry allowed us to identify proteins with distinct localisation to the glomeruli (n = 11), proximal tubules (n = 120), distal tubules (n = 9) or collecting ducts (n = 8). Among the identified kidney elevated transcripts, we found several proteins not previously characterised or identified as elevated in kidney. This description of the kidney specific transcriptome and proteome provides a resource for basic and clinical research to facilitate studies to understand kidney biology and disease.
Collapse
|
42
|
The transcriptomic and proteomic landscapes of bone marrow and secondary lymphoid tissues. PLoS One 2014; 9:e115911. [PMID: 25541736 PMCID: PMC4277406 DOI: 10.1371/journal.pone.0115911] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2014] [Accepted: 11/28/2014] [Indexed: 01/05/2023] Open
Abstract
Background The sequencing of the human genome has opened doors for global gene expression profiling, and the immense amount of data will lay an important ground for future studies of normal and diseased tissues. The Human Protein Atlas project aims to systematically map the human gene and protein expression landscape in a multitude of normal healthy tissues as well as cancers, enabling the characterization of both housekeeping genes and genes that display a tissue-specific expression pattern. This article focuses on identifying and describing genes with an elevated expression in four lymphohematopoietic tissue types (bone marrow, lymph node, spleen and appendix), based on the Human Protein Atlas-strategy that combines high throughput transcriptomics with affinity-based proteomics. Results An enriched or enhanced expression in one or more of the lymphohematopoietic tissues, compared to other tissue-types, was seen for 693 out of 20,050 genes, and the highest levels of expression were found in bone marrow for neutrophilic and erythrocytic genes. A majority of these genes were found to constitute well-characterized genes with known functions in lymphatic or hematopoietic cells, while others are not previously studied, as exemplified by C19ORF59. Conclusions In this paper we present a strategy of combining next generation RNA-sequencing with in situ affinity-based proteomics in order to identify and describe new gene targets for further research on lymphatic or hematopoietic cells and tissues. The results constitute lists of genes with enriched or enhanced expression in the four lymphohematopoietic tissues, exemplified also on protein level with immunohistochemical images.
Collapse
|
43
|
Expression of human skin-specific genes defined by transcriptomics and antibody-based profiling. J Histochem Cytochem 2014; 63:129-41. [PMID: 25411189 DOI: 10.1369/0022155414562646] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
To increase our understanding of skin, it is important to define the molecular constituents of the cell types and epidermal layers that signify normal skin. We have combined a genome-wide transcriptomics analysis, using deep sequencing of mRNA from skin biopsies, with immunohistochemistry-based protein profiling to characterize the landscape of gene and protein expression in normal human skin. The transcriptomics and protein expression data of skin were compared to 26 (RNA) and 44 (protein) other normal tissue types. All 20,050 putative protein-coding genes were classified into categories based on patterns of expression. We found that 417 genes showed elevated expression in skin, with 106 genes expressed at least five-fold higher than that in other tissues. The 106 genes categorized as skin enriched encoded for well-known proteins involved in epidermal differentiation and proteins with unknown functions and expression patterns in skin, including the C1orf68 protein, which showed the highest relative enrichment in skin. In conclusion, we have applied a genome-wide analysis to identify the human skin-specific proteome and map the precise localization of the corresponding proteins in different compartments of the skin, to facilitate further functional studies to explore the molecular repertoire of normal skin and to identify biomarkers related to various skin diseases.
Collapse
|
44
|
Defining the human gallbladder proteome by transcriptomics and affinity proteomics. Proteomics 2014; 14:2498-507. [PMID: 25175928 DOI: 10.1002/pmic.201400201] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2014] [Revised: 07/30/2014] [Accepted: 08/27/2014] [Indexed: 12/12/2022]
Abstract
Global protein analysis of human gallbladder tissue is vital for identification of molecular regulators and effectors of its physiological activity. Here, we employed a genome-wide deep RNA sequencing analysis in 28 human tissues to identify the genes overrepresented in the gallbladder and complemented it with antibody-based immunohistochemistry in 48 human tissues. We characterized human gallbladder proteins and identified 140 gallbladder-specific proteins with an elevated expression in the gallbladder as compared to the other analyzed tissues. Five genes were categorized as enriched, with at least fivefold higher levels in gallbladder, 60 genes were categorized as group enriched with elevated transcript levels in gallbladder shared with at least one other tissue and 75 genes were categorized as enhanced with higher expression than the average expression in other tissues. We explored the localization of the genes within the gallbladder through cell-type specific antibody-based protein profiling and the subcellular localization of the genes through immunofluorescent-based profiling. Finally, we revealed the biological processes and metabolic functions carried out by these genes through the use of GO, KEGG Pathway, and HMR2.0 that is compilation of the human metabolic reactions. We demonstrated the results of the combined analysis of the transcriptomics and affinity proteomics.
Collapse
|
45
|
Defining the human adipose tissue proteome to reveal metabolic alterations in obesity. J Proteome Res 2014; 13:5106-19. [PMID: 25219818 DOI: 10.1021/pr500586e] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
White adipose tissue (WAT) has a major role in the progression of obesity. Here, we combined data from RNA-Seq and antibody-based immunohistochemistry to describe the normal physiology of human WAT obtained from three female subjects and explored WAT-specific genes by comparing WAT to 26 other major human tissues. Using the protein evidence in WAT, we validated the content of a genome-scale metabolic model for adipocytes. We employed this high-quality model for the analysis of subcutaneous adipose tissue (SAT) gene expression data obtained from subjects included in the Swedish Obese Subjects Sib Pair study to reveal molecular differences between lean and obese individuals. We integrated SAT gene expression and plasma metabolomics data, investigated the contribution of the metabolic differences in the mitochondria of SAT to the occurrence of obesity, and eventually identified cytosolic branched-chain amino acid (BCAA) transaminase 1 as a potential target that can be used for drug development. We observed decreased glutaminolysis and alterations in the BCAAs metabolism in SAT of obese subjects compared to lean subjects. We also provided mechanistic explanations for the changes in the plasma level of BCAAs, glutamate, pyruvate, and α-ketoglutarate in obese subjects. Finally, we validated a subset of our model-based predictions in 20 SAT samples obtained from 10 lean and 10 obese male and female subjects.
Collapse
|
46
|
The lung-specific proteome defined by integration of transcriptomics and antibody-based profiling. FASEB J 2014; 28:5184-96. [PMID: 25169055 DOI: 10.1096/fj.14-254862] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The combined action of multiple cell types is essential for the physiological function of the lung, and increased awareness of the molecular constituents characterizing each cell type is likely to advance the understanding of lung biology and disease. In the current study, we used genome-wide RNA sequencing of normal lung parenchyma and 26 additional tissue types, combined with antibody-based protein profiling, to localize the expression to specific cell types. Altogether, 221 genes were found to be elevated in the lung compared with their expression in other analyzed tissues. Among the gene products were several well-known markers, but also several proteins previously not described in the context of the lung. To link the lung-specific molecular repertoire to human disease, survival associations of pneumocyte-specific genes were assessed by using transcriptomics data from 7 non-small-cell lung cancer (NSCLC) cohorts. Transcript levels of 10 genes (SFTPB, SFTPC, SFTPD, SLC34A2, LAMP3, CACNA2D2, AGER, EMP2, NKX2-1, and NAPSA) were significantly associated with survival in the adenocarcinoma subgroup, thus qualifying as promising biomarker candidates. In summary, based on an integrated omics approach, we identified genes with elevated expression in lung and localized corresponding protein expression to different cell types. As biomarker candidates, these proteins may represent intriguing starting points for further exploration in health and disease.
Collapse
|
47
|
The lung-specific proteome defined by integration of transcriptomics and antibody-based profiling. FASEB J 2014. [PMID: 25169055 DOI: 10.1096/fj.14- 254862] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The combined action of multiple cell types is essential for the physiological function of the lung, and increased awareness of the molecular constituents characterizing each cell type is likely to advance the understanding of lung biology and disease. In the current study, we used genome-wide RNA sequencing of normal lung parenchyma and 26 additional tissue types, combined with antibody-based protein profiling, to localize the expression to specific cell types. Altogether, 221 genes were found to be elevated in the lung compared with their expression in other analyzed tissues. Among the gene products were several well-known markers, but also several proteins previously not described in the context of the lung. To link the lung-specific molecular repertoire to human disease, survival associations of pneumocyte-specific genes were assessed by using transcriptomics data from 7 non-small-cell lung cancer (NSCLC) cohorts. Transcript levels of 10 genes (SFTPB, SFTPC, SFTPD, SLC34A2, LAMP3, CACNA2D2, AGER, EMP2, NKX2-1, and NAPSA) were significantly associated with survival in the adenocarcinoma subgroup, thus qualifying as promising biomarker candidates. In summary, based on an integrated omics approach, we identified genes with elevated expression in lung and localized corresponding protein expression to different cell types. As biomarker candidates, these proteins may represent intriguing starting points for further exploration in health and disease.
Collapse
|
48
|
Abstract
Human liver physiology and the genetic etiology of the liver diseases can potentially be elucidated through the identification of proteins with enriched expression in the liver. Here, we combined data from RNA sequencing (RNA-Seq) and antibody-based immunohistochemistry across all major human tissues to explore the human liver proteome with enriched expression, as well as the cell type-enriched expression in hepatocyte and bile duct cells. We identified in total 477 protein-coding genes with elevated expression in the liver: 179 genes have higher expression as compared to all the other analyzed tissues; 164 genes have elevated transcript levels in the liver shared with at least one other tissue type; and an additional 134 genes have a mild level of increased expression in the liver. We identified the precise localization of these proteins through antibody-based protein profiling and the subcellular localization of these proteins through immunofluorescent-based profiling. We also identified the biological processes and metabolic functions associated with these proteins, investigated their contribution in the occurrence of liver diseases, and identified potential targets for their treatment. Our study demonstrates the use of RNA-Seq and antibody-based immunohistochemistry for characterizing the human liver proteome, as well as the use of tissue-specific proteins in identification of novel drug targets and discovery of biomarkers.-Kampf, C., Mardinoglu, A., Fagerberg, L., Hallström, B. M., Edlund, K., Lundberg, E., Pontén, F., Nielsen, J., Uhlen, M. The human liver-specific proteome defined by transcriptomics and antibody-based profiling.
Collapse
|
49
|
RNA- and antibody-based profiling of the human proteome with focus on chromosome 19. J Proteome Res 2014; 13:2019-27. [PMID: 24579871 DOI: 10.1021/pr401156g] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
An important part of the Human Proteome Project is to characterize the protein complement of the genome with antibody-based profiling. Within the framework of this effort, a new version 12 of the Human Protein Atlas ( www.proteinatlas.org ) has been launched, including transcriptomics data for 27 tissues and 44 cell lines to complement the protein expression data from antibody-based profiling. Besides the extensive addition of transcriptomics data, the Human Protein Atlas now contains antibody-based protein profiles for 82% of the 20 329 putative protein-coding genes. The comprehensive data resulting from RNA-seq analysis and antibody-based profiling performed within the Human Protein Atlas as well as information from UniProt were used to generate evidence summary scores for each of the 20 329 genes, of which 94% now have experimental evidence at least at transcript level. The evidence scores for all individual genes are displayed with regards to both RNA- and antibody-based protein profiles, including chromosome-centric visualizations. An analysis of the human chromosome 19 shows that ∼43% of the genes are expressed at the transcript level in all 27 tissues analyzed, suggesting a "house-keeping" function, while 12% of the genes show a more tissue-specific pattern with enriched expression in one of the analyzed tissues only.
Collapse
|
50
|
The human testis-specific proteome defined by transcriptomics and antibody-based profiling. Mol Hum Reprod 2014; 20:476-88. [PMID: 24598113 DOI: 10.1093/molehr/gau018] [Citation(s) in RCA: 138] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The testis' function is to produce haploid germ cells necessary for reproduction. Here we have combined a genome-wide transcriptomics analysis with immunohistochemistry-based protein profiling to characterize the molecular components of the testis. Deep sequencing (RNA-Seq) of normal human testicular tissue from seven individuals was performed and compared with 26 other normal human tissue types. All 20 050 putative human genes were classified into categories based on expression patterns. The analysis shows that testis is the tissue with the most tissue-specific genes by far. More than 1000 genes show a testis-enriched expression pattern in testis when compared with all other analyzed tissues. Highly testis enriched genes were further characterized with respect to protein localization within the testis, such as spermatogonia, spermatocytes, spermatids, sperm, Sertoli cells and Leydig cells. Here we present an immunohistochemistry-based analysis, showing the localization of corresponding proteins in different cell types and various stages of spermatogenesis, for 62 genes expressed at >50-fold higher levels in testis when compared with other tissues. A large fraction of these genes were unexpectedly expressed in early stages of spermatogenesis. In conclusion, we have applied a genome-wide analysis to identify the human testis-specific proteome using transcriptomics and antibody-based protein profiling, providing lists of genes expressed in a tissue-enriched manner in the testis. The majority of these genes and proteins were previously poorly characterised in terms of localization and function, and our list provides an important starting point to increase our molecular understanding of human reproductive biology and disease.
Collapse
|