1
|
Tanudisastro HA, Cuomo ASE, Weisburd B, Welland M, Spenceley E, Franklin M, Xue A, Bowen B, Wing K, Tang O, Gray M, Reis ALM, Margoliash J, Kurtas NE, Pullin JM, Lee AS, Brand H, Harper M, Bobowik K, Silk M, Marshall J, Bakiris V, Madala BS, Uren C, Bartie C, Senabouth A, Dashnow H, Fearnley L, Martin-Trujillo A, Dolzhenko E, Qiao Z, Grieve SM, Nguyen T, Ben-David E, Chen L, Farh KKH, Talkowski M, Alexander SI, Siggs OM, Gruenschloss L, Nicholas HR, Piscionere J, Simons C, Wallace C, Gymrek M, Deveson IW, Hewitt AW, Figtree GA, de Lange KM, Powell JE, MacArthur DG. Polymorphic tandem repeats influence cell type-specific gene expression across the human immune landscape. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.11.02.621562. [PMID: 40291654 PMCID: PMC12026411 DOI: 10.1101/2024.11.02.621562] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/30/2025]
Abstract
Tandem repeats (TRs) - highly polymorphic, repetitive sequences dispersed across the human genome - are crucial regulators of gene expression and diverse biological processes, but have remained underexplored relative to other classes of genetic variation due to historical challenges in their accurate calling and analysis. Here, we leverage whole genome and single-cell RNA sequencing from over 5.4 million blood-derived cells from 1,925 individuals to explore the impact of variation in over 1.7 million polymorphic TR loci on blood cell type-specific gene expression. We identify over 62,000 single-cell expression quantitative trait TR loci (sc-eTRs), 16.6% of which are specific to one of 28 distinct immune cell types. Further fine-mapping uncovers 4,283 sc-eTRs as candidate causal drivers of gene expression in 13.6% of genes tested genome-wide. We show through colocalization that TRs are likely mediators of genetic associations with immune-mediated and hematological traits in over 700 genes, and further identify novel TRs warranting investigation in rare disease cohorts. TRs are critical, yet long-overlooked, contributors to cell type-specific gene expression, with implications for understanding rare disease pathogenesis and the genetic architecture of complex traits.
Collapse
|
2
|
Hu JX, Yang Y, Xu YY, Shen HB. GraphLoc: a graph neural network model for predicting protein subcellular localization from immunohistochemistry images. Bioinformatics 2022; 38:4941-4948. [DOI: 10.1093/bioinformatics/btac634] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 09/07/2022] [Accepted: 09/15/2022] [Indexed: 11/14/2022] Open
Abstract
Abstract
Motivation
Recognition of protein subcellular distribution patterns and identification of location biomarker proteins in cancer tissues are important for understanding protein functions and related diseases. Immunohistochemical (IHC) images enable visualizing the distribution of proteins at the tissue level, providing an important resource for the protein localization studies. In the past decades, several image-based protein subcellular location prediction methods have been developed, but the prediction accuracies still have much space to improve due to the complexity of protein patterns resulting from multi-label proteins and variation of location patterns across cell types or states.
Results
Here, we propose a multi-label multi-instance model based on deep graph convolutional neural networks, GraphLoc, to recognize protein subcellular location patterns. GraphLoc builds a graph of multiple IHC images for one protein, learns protein-level representations by graph convolutions, and predicts multi-label information by a dynamic threshold method. Our results show that GraphLoc is a promising model for image-based protein subcellular location prediction with model interpretability. Furthermore, we apply GraphLoc to the identification of candidate location biomarkers and potential members for protein networks. A large portion of the predicted results have supporting evidence from the existing literatures and the new candidates also provide guidance for further experimental screening.
Availability
The dataset and code are available at: www.csbio.sjtu.edu.cn/bioinf/GraphLoc.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jin-Xian Hu
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing , Ministry of Education of China, Shanghai 200240, China
| | - Yang Yang
- Shanghai Jiao Tong University Department of Computer Science and Engineering, Center for Brain-Like Computing and Machine Intelligence, , Shanghai 200240, China
| | - Ying-Ying Xu
- Southern Medical University School of Biomedical Engineering and Guangdong Provincial Key Laboratory of Medical Image Processing, , Guangzhou 510515, China
- Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University , Guangzhou 510515, China
| | - Hong-Bin Shen
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing , Ministry of Education of China, Shanghai 200240, China
| |
Collapse
|
3
|
Samaržija I. Post-Translational Modifications That Drive Prostate Cancer Progression. Biomolecules 2021; 11:247. [PMID: 33572160 PMCID: PMC7915076 DOI: 10.3390/biom11020247] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Revised: 02/04/2021] [Accepted: 02/06/2021] [Indexed: 02/07/2023] Open
Abstract
While a protein primary structure is determined by genetic code, its specific functional form is mostly achieved in a dynamic interplay that includes actions of many enzymes involved in post-translational modifications. This versatile repertoire is widely used by cells to direct their response to external stimuli, regulate transcription and protein localization and to keep proteostasis. Herein, post-translational modifications with evident potency to drive prostate cancer are explored. A comprehensive list of proteome-wide and single protein post-translational modifications and their involvement in phenotypic outcomes is presented. Specifically, the data on phosphorylation, glycosylation, ubiquitination, SUMOylation, acetylation, and lipidation in prostate cancer and the enzymes involved are collected. This type of knowledge is especially valuable in cases when cancer cells do not differ in the expression or mutational status of a protein, but its differential activity is regulated on the level of post-translational modifications. Since their driving roles in prostate cancer, post-translational modifications are widely studied in attempts to advance prostate cancer treatment. Current strategies that exploit the potential of post-translational modifications in prostate cancer therapy are presented.
Collapse
Affiliation(s)
- Ivana Samaržija
- Laboratory for Epigenomics, Division of Molecular Medicine, Ruđer Bošković Institute, 10000 Zagreb, Croatia
| |
Collapse
|