1
Balasooriya GI, Wee TL, Spector DL. A sub-set of guanine- and cytosine-rich genes are actively transcribed at the nuclear Lamin B1 region. bioRxiv 2025:2023.10.28.564411. [PMID: 37961255 PMCID: PMC10634887 DOI: 10.1101/2023.10.28.564411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/15/2023]
Abstract
Chromatin organization in the mammalian cell nucleus plays a vital role in the regulation of gene expression. The lamina-associated domain at the inner nuclear membrane has been shown to harbor heterochromatin, while the nuclear interior has been shown to contain most of the euchromatin. Here, we show that a sub-set of actively transcribing genes, marked by RNA Pol II pSer2, are associated with Lamin B1 at the inner nuclear envelope in mouse embryonic stem cells (mESCs) and the number of genes proportionally increases upon in vitro differentiation of mESC to olfactory precursor cells. These nuclear periphery-associated actively transcribing genes primarily represent housekeeping genes, and their gene bodies are significantly enriched with guanine and cytosine compared to genes actively transcribed at the nuclear interior. We found the promoters of these genes to also be significantly enriched with guanine and to be predominantly regulated by zinc finger protein transcription factors. We provide evidence supporting the emerging notion that the Lamin B1 region is not solely transcriptionally silent.
2
Li C, Yang Z, Xu X, Meng L, Liu S, Yang D. Conserved and specific gene expression patterns in the embryonic development of tardigrades. Evol Dev 2024; 26:e12476. [PMID: 38654704 DOI: 10.1111/ede.12476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/06/2023] [Revised: 02/18/2024] [Accepted: 04/03/2024] [Indexed: 04/26/2024]
Abstract
Tardigrades, commonly known as water bears, are enigmatic organisms characterized by their remarkable resilience to extreme environments despite their simple and compact body structure. To date, there is still much to understand about their evolutionary and developmental features contributing to their special body plan and abilities. This research provides preliminary insights on the conserved and specific gene expression patterns during embryonic development of water bears, focusing on the species Hypsibius exemplaris. The developmental dynamic expression analysis of the genes with various evolutionary age grades indicated that the mid-conserved stage of H. exemplaris corresponds to the period of ganglia and midgut development, with the late embryonic stage showing a transition from non-conserved to conserved state. Additionally, a comparison with Drosophila melanogaster highlighted the absence of certain pathway nodes in development-related pathways, such as Maml and Hairless, which are respectively the transcriptional co-activator and co-repressor of NOTCH regulated genes. We also employed Weighted Gene Co-expression Network Analysis (WGCNA) to investigate the expression patterns of tardigrade-specific genes during embryo development. Our findings indicated that the module containing the highest proportion of tardigrade-specific genes (TSGs) exhibits high expression levels before the mid-conserved stage, potentially playing a role in glutathione and lipid metabolism. These functions may be associated with ecdysone synthesis and storage cell formation, which is unique to tardigrades.
Affiliation(s)
- Chaoran Li
- State Key Laboratory of Medical Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China
- Zhixiang Yang
- School of Life Sciences, Hebei University, Baoding, China
- Xiaofang Xu
- State Key Laboratory of Medical Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China
- Lingling Meng
- School of Life Sciences, Hebei University, Baoding, China
- Shihao Liu
- State Key Laboratory of Medical Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China
- Dong Yang
- State Key Laboratory of Medical Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China
3
Lan L, Leng L, Liu W, Ren Y, Reeve W, Fu X, Wu Z, Zhang X. The haplotype-resolved telomere-to-telomere carnation (Dianthus caryophyllus) genome reveals the correlation between genome architecture and gene expression. Horticulture Research 2024; 11:uhad244. [PMID: 38225981 PMCID: PMC10788775 DOI: 10.1093/hr/uhad244] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 06/26/2023] [Accepted: 11/12/2023] [Indexed: 01/17/2024]
Abstract
Carnation (Dianthus caryophyllus) is one of the most valuable commercial flowers, due to its richness of color and form, and its excellent storage and vase life. The diverse demands of the market require faster breeding in carnations. A full understanding of carnations is therefore required to guide the direction of breeding. Hence, we assembled the haplotype-resolved gap-free carnation genome of the variety 'Baltico', which is the most common white standard variety worldwide. Based on high-depth HiFi, ultra-long nanopore, and Hi-C sequencing data, we assembled the telomere-to-telomere (T2T) genome to be 564 479 117 and 568 266 215 bp for the two haplotypes Hap1 and Hap2, respectively. This T2T genome exhibited great improvement in genome assembly and annotation results compared with the former version. The improvements were seen when different approaches to evaluation were used. Our T2T genome first informs the analysis of the telomere and centromere region, enabling us to speculate about specific centromere characteristics that cannot be identified by high-order repeats in carnations. We analyzed allele-specific expression in three tissues and the relationship between genome architecture and gene expression in the haplotypes. This demonstrated that the length of the genes, coding sequences, and introns, the exon numbers and the transposable element insertions correlate with gene expression ratios and levels. The insertions of transposable elements repress expression in gene regulatory networks in carnation. This gap-free finished T2T carnation genome provides a valuable resource to illustrate the genome characteristics and for functional genomics analysis in further studies and molecular breeding.
Affiliation(s)
- Lan Lan
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
- College of Science, Health, Engineering and Education, Murdoch University, Murdoch 6150, Western Australia, Australia
- Kunpeng Institute of Modern Agriculture at Foshan, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
- Luhong Leng
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
- Kunpeng Institute of Modern Agriculture at Foshan, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
- Weichao Liu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
- Kunpeng Institute of Modern Agriculture at Foshan, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
- Key Laboratory of Horticultural Plant Biology, College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
- Yonglin Ren
- College of Science, Health, Engineering and Education, Murdoch University, Murdoch 6150, Western Australia, Australia
- Wayne Reeve
- College of Science, Health, Engineering and Education, Murdoch University, Murdoch 6150, Western Australia, Australia
- Xiaopeng Fu
- Key Laboratory of Horticultural Plant Biology, College of Horticulture and Forestry Sciences, Huazhong Agricultural University, Wuhan, 430070, China
- Zhiqiang Wu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
- Kunpeng Institute of Modern Agriculture at Foshan, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
- Xiaoni Zhang
- Kunpeng Institute of Modern Agriculture at Foshan, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
4
Wang D, Yang D, Yang L, Diao L, Zhang Y, Li Y, Wang H, Ren J, Cheng L, Tan Q, Zhang R, Han X, Zhang X, Wang B, Li D, Chen M, Hermjakob H, Li Y, LaBaer J, Zhou Z, Yu X. Human Autoantigen Atlas: Searching for the Hallmarks of Autoantigens. J Proteome Res 2023. [PMID: 37183442 DOI: 10.1021/acs.jproteome.2c00799] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Indexed: 05/16/2023]
Abstract
Understanding autoimmunity to endogenous proteins is crucial in diagnosing and treating autoimmune diseases. In this work, we developed a user-friendly AAgAtlas portal (http://biokb.ncpsb.org.cn/aagatlas_portal/index.php#), which can be used to search for 8045 non-redundant autoantigens (AAgs) and 47 post-translationally modified AAgs against 1073 human diseases that are prioritized by a credential score developed by multisource evidence. Using AAgAtlas, the immunogenic properties of human AAgs were systematically elucidated according to their genetic, biophysical, cytological, expression profile, and evolutionary characteristics. The results indicated that human AAgs are evolutionally conserved in protein sequence and enriched in three hydrophilic and polar amino acid residues (K, D, and E) that are located at the protein surface. AAgs are enriched in proteins that are involved in nucleic acid binding, transferase, and the cytoskeleton. Genome, transcriptome, and proteome analyses further indicated that AAb production is associated with gene variance and abnormal protein expression related to the pathological activities of different tumors. Collectively, our data outline the hallmarks of human AAgs that facilitate the understanding of humoral autoimmunity and the identification of biomarkers of human diseases.
Affiliation(s)
- Dan Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
- Dong Yang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
- Liuhui Yang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
- Lihong Diao
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
- Yuqi Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
- Yang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
- Hongye Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
- Jing Ren
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100730, China
- Linlin Cheng
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100730, China
- Qiaoyun Tan
- Cancer Center, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
- Ran Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
- Xiaohong Han
- Clinical Pharmacology Research Center, Peking Union Medical College Hospital, State Key Laboratory of Complex Severe and Rare Diseases, NMPA Key Laboratory for Clinical Research and Evaluation of Drug, Beijing Key Laboratory of Clinical PK & PD Investigation for Innovative Drugs, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100730, China
- Xiaohan Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
- College of Medicine and Integrated Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
- Bingwei Wang
- College of Medicine and Integrated Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
- Dong Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
- Meng Chen
- National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100021, China
- Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Yongzhe Li
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100730, China
- Joshua LaBaer
- The Virginia G. Piper Center for Personalized Diagnostics, Biodesign Institute, Arizona State University, Tempe, Arizona 85287, United States
- Zhou Zhou
- Department of Laboratory Medicine, National Center for Cardiovascular Diseases and Fuwai Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100037, China
- Xiaobo Yu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
5
Wang H, Yang Z, Yang D. Approaches for the Identification of Intrinsically Disordered Protein Domains. Methods Mol Biol 2023; 2581:403-412. [PMID: 36413333 DOI: 10.1007/978-1-0716-2784-6_28] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/16/2023]
Abstract
Intrinsically disordered protein domains are those with high disorder proportion or a consecutive disordered region. They have no stable spatial structure but play an important role in the regulation of complex cellular functions and contribute to the increasing organism complexity during evolution. Here, we describe the approaches to predict intrinsic disorder values of residues in proteins and methods to identify the intrinsically disordered domains.
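The two criteria named in this abstract (a high disorder proportion, or a consecutive disordered region) can be illustrated with a minimal sketch. It assumes per-residue disorder scores in the range 0 to 1 are already available from some predictor. The function names and every cutoff (`threshold`, `min_proportion`, `min_run`) are hypothetical values chosen for illustration and are not taken from the chapter.

```python
from typing import Sequence


def disorder_proportion(scores: Sequence[float], threshold: float = 0.5) -> float:
    """Fraction of residues whose disorder score is at or above the threshold."""
    if not scores:
        return 0.0
    return sum(s >= threshold for s in scores) / len(scores)


def longest_disordered_run(scores: Sequence[float], threshold: float = 0.5) -> int:
    """Length of the longest stretch of consecutive disordered residues."""
    longest = current = 0
    for s in scores:
        # Extend the current run on a disordered residue, otherwise reset it.
        current = current + 1 if s >= threshold else 0
        longest = max(longest, current)
    return longest


def is_disordered_domain(
    scores: Sequence[float],
    threshold: float = 0.5,       # hypothetical per-residue cutoff
    min_proportion: float = 0.5,  # hypothetical "high disorder proportion" cutoff
    min_run: int = 20,            # hypothetical "consecutive region" length cutoff
) -> bool:
    """Flag a domain if either criterion from the abstract is met."""
    return (
        disorder_proportion(scores, threshold) >= min_proportion
        or longest_disordered_run(scores, threshold) >= min_run
    )
```

For a 20-residue domain with five disordered residues at the start, `is_disordered_domain` returns `False` under these default cutoffs and `True` if `min_run` is lowered to 5, which shows how strongly the classification depends on the chosen thresholds.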
Affiliation(s)
- Huqiang Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China
- Zhixiang Yang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China
- Dong Yang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, China.
6
Metivier JC, Chain FJJ. Diversity in Expression Biases of Lineage-Specific Genes During Development and Anhydrobiosis Among Tardigrade Species. Evol Bioinform Online 2022; 18:11769343221140277. [PMID: 36578471 PMCID: PMC9791283 DOI: 10.1177/11769343221140277] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 08/05/2022] [Accepted: 10/27/2022] [Indexed: 12/24/2022] [Open Access]
Abstract
Lineage-specific genes can contribute to the emergence and evolution of novel traits and adaptations. Tardigrades are animals that have adapted to tolerate extreme conditions by undergoing a form of cryptobiosis called anhydrobiosis, a physical transformation to an inactive desiccated state. While studies to understand the genetics underlying the interspecies diversity in anhydrobiotic transitions have identified tardigrade-specific genes and family expansions involved in this process, the contributions of species-specific genes to the variation in tardigrade development and cryptobiosis are less clear. We used previously published transcriptomes throughout development and anhydrobiosis (5 embryonic stages, 7 juvenile stages, active adults, and tun adults) to assess the transcriptional biases of different classes of genes between 2 tardigrade species, Hypsibius exemplaris and Ramazzottius varieornatus. We also used the transcriptomes of 2 other tardigrades, Echiniscoides sigismundi and Richtersius coronifer, and data from 3 non-tardigrade species (Adineta vaga, Drosophila melanogaster, and Caenorhabditis elegans) to help identify lineage-specific genes. We found that lineage-specific genes have generally low and narrow expression but are enriched among biased genes in different stages of development depending on the species. Biased genes tend to be specific to early and late development, but there is little overlap in functional enrichment of biased genes between species. Gene expansions in the 2 tardigrades also involve families with different functions despite homologous genes being expressed during anhydrobiosis in both species. Our results demonstrate the interspecific variation in transcriptional contributions and biases of lineage-specific genes during development and anhydrobiosis in 2 tardigrades.
Affiliation(s)
- Frédéric J J Chain
- Frédéric J J Chain, Department of Biological Sciences, University of Massachusetts Lowell, One University Ave, Lowell, MA 01854, USA.
7
Sugasawa T, Komine R, Manevich L, Tamai S, Takekoshi K, Kanki Y. Gene Expression Profile Provides Novel Insights of Fasting-Refeeding Response in Zebrafish Skeletal Muscle. Nutrients 2022; 14:nu14112239. [PMID: 35684038 PMCID: PMC9182819 DOI: 10.3390/nu14112239] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/02/2022] [Revised: 05/24/2022] [Accepted: 05/25/2022] [Indexed: 02/05/2023] [Open Access]
Abstract
Recently, fasting has been spotlighted from a healthcare perspective. However, the detailed biological mechanisms and significance by which the effects of fasting confer health benefits are not yet clear. Due to certain advantages of the zebrafish as a vertebrate model, it is widely utilized in biological studies. However, the biological responses to nutrient metabolism within zebrafish skeletal muscles have not yet been amply reported. Therefore, we aimed to reveal a gene expression profile in zebrafish skeletal muscles in response to fasting-refeeding. Accordingly, mRNA-sequencing and bioinformatics analysis were performed to examine comprehensive gene expression changes in skeletal muscle tissues during fasting-refeeding. Our results produced a novel set of nutrition-related genes under a fasting-refeeding protocol. Moreover, we found that five genes were dramatically upregulated in each fasting (for 24 h) and refeeding (after 3 h), exhibiting a rapid response to the provided conditional changes. The assessment of the gene length revealed that the gene set whose expression was elevated only after 3 h of refeeding had a shorter length, suggesting that nutrition-related gene function is associated with gene length. Taken together, our results from the bioinformatics analyses provide new insights into biological mechanisms induced by fasting-refeeding conditions within zebrafish skeletal muscle.
Affiliation(s)
- Takehito Sugasawa
- Laboratory of Clinical Examination and Sports Medicine, Department of Clinical Medicine, Faculty of Medicine, University of Tsukuba, 1-1-1 Tennodai, Tsukuba 305-8577, Ibaraki, Japan; (T.S.); (S.T.)
- Department of Sports Medicine Analysis, Open Facility Network Office, Organization for Open Facility Initiatives, University of Tsukuba, 1-1-1 Tennodai, Tsukuba 305-8577, Ibaraki, Japan;
- Ritsuko Komine
- Department of Sports Medicine Analysis, Open Facility Network Office, Organization for Open Facility Initiatives, University of Tsukuba, 1-1-1 Tennodai, Tsukuba 305-8577, Ibaraki, Japan;
- Doctoral Program in Sports Medicine, Graduate School of Comprehensive Human Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba 305-8577, Ibaraki, Japan
- Lev Manevich
- Experimental Pathology, Faculty of Medicine, University of Tsukuba, 1-1-1 Tennodai, Tsukuba 305-8577, Ibaraki, Japan;
- Doctoral Program in Biomedical Sciences, Graduate School of Comprehensive Human Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba 305-8577, Ibaraki, Japan
- Shinsuke Tamai
- Laboratory of Clinical Examination and Sports Medicine, Department of Clinical Medicine, Faculty of Medicine, University of Tsukuba, 1-1-1 Tennodai, Tsukuba 305-8577, Ibaraki, Japan; (T.S.); (S.T.)
- Department of Sport Science and Research, Japan Institute of Sports Sciences, 3-15-1 Nishigaoka, Kita-ku, Tokyo 115-0056, Japan
- Kazuhiro Takekoshi
- Laboratory of Clinical Examination and Sports Medicine, Department of Clinical Medicine, Faculty of Medicine, University of Tsukuba, 1-1-1 Tennodai, Tsukuba 305-8577, Ibaraki, Japan; (T.S.); (S.T.)
- Correspondence: (K.T.); (Y.K.); Tel.: +81-29-853-3209 (K.T. & Y.K.)
- Yasuharu Kanki
- Laboratory of Clinical Examination and Sports Medicine, Department of Clinical Medicine, Faculty of Medicine, University of Tsukuba, 1-1-1 Tennodai, Tsukuba 305-8577, Ibaraki, Japan; (T.S.); (S.T.)
- Department of Sports Medicine Analysis, Open Facility Network Office, Organization for Open Facility Initiatives, University of Tsukuba, 1-1-1 Tennodai, Tsukuba 305-8577, Ibaraki, Japan;
- Correspondence: (K.T.); (Y.K.); Tel.: +81-29-853-3209 (K.T. & Y.K.)
8
Wang H, Zhong H, Gao C, Zang J, Yang D. The Distinct Properties of the Consecutive Disordered Regions Inside or Outside Protein Domains and Their Functional Significance. Int J Mol Sci 2021; 22:ijms221910677. [PMID: 34639018 PMCID: PMC8508753 DOI: 10.3390/ijms221910677] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 09/02/2021] [Revised: 09/26/2021] [Accepted: 09/29/2021] [Indexed: 11/17/2022] [Open Access]
Abstract
The consecutive disordered regions (CDRs) are the basis for the formation of intrinsically disordered proteins, which contribute to various biological functions and increasing organism complexity. Previous studies have revealed that CDRs may be present inside or outside protein domains, but a comprehensive analysis of the property differences between these two types of CDRs and the proteins containing them is lacking. In this study, we investigated this issue from three viewpoints. Firstly, we found that in-domain CDRs are more hydrophilic and stable but have less stickiness and fewer post-translational modification sites compared with out-domain CDRs. Secondly, at the protein level, we found that proteins with only in-domain CDRs originated late, evolved rapidly, and had weak functional constraints, compared with the other two types of CDR-containing proteins. Proteins with only in-domain CDRs tend to be expressed spatiotemporal specifically, but they tend to have higher abundance and are more stable. Thirdly, we screened the CDR-containing protein domains that have a strong correlation with organism complexity. The CDR-containing domains tend to be evolutionarily young, or they changed from a domain without CDR to a CDR-containing domain during evolution. These results provide valuable new insights about the evolution and function of CDRs and protein domains.
Affiliation(s)
- Dong Yang
- Correspondence: ; Tel.: +86-10-61777051
9
Lopes I, Altab G, Raina P, de Magalhães JP. Gene Size Matters: An Analysis of Gene Length in the Human Genome. Front Genet 2021; 12:559998. [PMID: 33643374 PMCID: PMC7905317 DOI: 10.3389/fgene.2021.559998] [Citation(s) in RCA: 84] [Impact Index Per Article: 21.0] [Received: 05/07/2020] [Accepted: 01/06/2021] [Indexed: 12/23/2022] [Open Access]
Abstract
While it is expected for gene length to be associated with factors such as intron number and evolutionary conservation, we are yet to understand the connections between gene length and function in the human genome. In this study, we show that, as expected, there is a strong positive correlation between gene length, transcript length, and protein size as well as a correlation with the number of genetic variants and introns. Among tissue-specific genes, we find that the longest transcripts tend to be expressed in the blood vessels, nerves, thyroid, cervix uteri, and the brain, while the smallest transcripts tend to be expressed in the pancreas, skin, stomach, vagina, and testis. We report, as shown previously, that natural selection suppresses changes for genes with longer transcripts and promotes changes for genes with smaller transcripts. We also observe that genes with longer transcripts tend to have a higher number of co-expressed genes and protein-protein interactions, as well as more associated publications. In the functional analysis, we show that bigger transcripts are often associated with neuronal development, while smaller transcripts tend to play roles in skin development and in the immune system. Furthermore, pathways related to cancer, neurons, and heart diseases tend to have genes with longer transcripts, with smaller transcripts being present in pathways related to immune responses and neurodegenerative diseases. Based on our results, we hypothesize that longer genes tend to be associated with functions that are important in the early development stages, while smaller genes tend to play a role in functions that are important throughout the whole life, like the immune system, which requires fast responses.
Affiliation(s)
- João Pedro de Magalhães
- Integrative Genomics of Ageing Group, Institute of Ageing and Chronic Disease, University of Liverpool, Liverpool, United Kingdom
10
Gao C, Ma C, Wang H, Zhong H, Zang J, Zhong R, He F, Yang D. Intrinsic disorder in protein domains contributes to both organism complexity and clade-specific functions. Sci Rep 2021; 11:2985. [PMID: 33542394 PMCID: PMC7862400 DOI: 10.1038/s41598-021-82656-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Received: 10/09/2020] [Accepted: 01/22/2021] [Indexed: 11/09/2022] [Open Access]
Abstract
Interestingly, some protein domains are intrinsically disordered (abbreviated as IDD), and the disorder degree of same domains may differ in different contexts. However, the evolutionary causes and biological significance of these phenomena are unclear. Here, we address these issues by genome-wide analyses of the evolutionary and functional features of IDDs in 1,870 species across the three superkingdoms. As the result, there is a significant positive correlation between the proportion of IDDs and organism complexity with some interesting exceptions. These phenomena may be due to the high disorder of clade-specific domains and the different disorder degrees of the domains shared in different clades. The functions of IDDs are clade-specific and the higher proportion of post-translational modification sites may contribute to their complex functions. Compared with metazoans, fungi have more IDDs with a consecutive disorder region but a low disorder ratio, which reflects their different functional requirements. As for disorder variation, it’s greater for domains among different proteins than those within the same proteins. Some clade-specific ‘no-variation’ or ‘high-variation’ domains are involved in clade-specific functions. In sum, intrinsic domain disorder is related to both the organism complexity and clade-specific functions. These results deepen the understanding of the evolution and function of IDDs.
Affiliation(s)
- Chao Gao
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, 38 Science Park Road, Changping District, Beijing, 102206, China
- Chong Ma
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, 38 Science Park Road, Changping District, Beijing, 102206, China
- Beijing Key Laboratory of Environmental and Viral Oncology, College of Life Science and Bioengineering, Beijing University of Technology, Beijing, 100124, China
- Huqiang Wang
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, 38 Science Park Road, Changping District, Beijing, 102206, China
- Haolin Zhong
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, 38 Science Park Road, Changping District, Beijing, 102206, China
| | - Jiayin Zang
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, 38 Science Park Road, Changping District, Beijing, 102206, China
| | - Rugang Zhong
- Beijing Key Laboratory of Environmental and Viral Oncology, College of Life Science and Bioengineering, Beijing University of Technology, Beijing, 100124, China
| | - Fuchu He
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, 38 Science Park Road, Changping District, Beijing, 102206, China.
| | - Dong Yang
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, 38 Science Park Road, Changping District, Beijing, 102206, China.
| |
|
11
|
Shen P, Xu A, Hou Y, Wang H, Gao C, He F, Yang D. Conserved paradoxical relationships among the evolutionary, structural and expressional features of KRAB zinc-finger proteins reveal their special functional characteristics. BMC Mol Cell Biol 2021; 22:7. [PMID: 33482715 PMCID: PMC7821633 DOI: 10.1186/s12860-021-00346-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 09/20/2020] [Accepted: 01/13/2021] [Indexed: 12/03/2022]
Abstract
Background: One striking feature of the large KRAB domain-containing zinc finger protein (KZFP) family is its rapid evolution, leading to hundreds of member genes with various origination times in a given mammalian genome. However, a comprehensive genome-wide and across-taxa analysis of the structural and expressional features of KZFPs with different origination times is lacking. This type of analysis will provide valuable clues about the functional characteristics of this special family.
Results: In this study, we found several conserved paradoxical phenomena about this issue. 1) Ordinary young domains/proteins tend to be disordered, but most KRAB domains are completely structured in 64 representative species across the superclass Sarcopterygii, and most KZFPs are also highly structured, indicating their rigid and unique structural and functional characteristics; as exceptions, old-zinc-finger-containing KZFPs have relatively disordered KRAB domains and linker regions, contributing to diverse interacting partners and functions. 2) In general, young or highly structured proteins tend to be spatiotemporally specific and have low abundance. However, by integrated analysis of 29 RNA-seq datasets, including 725 samples across early embryonic development, embryonic stem cell differentiation, embryonic and adult organs, and tissues in 7 mammals, we found that KZFPs tend to be expressed ubiquitously with medium abundance regardless of evolutionary age and degree of structural disorder, indicating the wide functional requirements of KZFPs in various states. 3) Clustering and correlation analyses reveal differential expression patterns across different spatiotemporal states, suggesting that the specifically highly expressed KZFPs may play important roles in the corresponding states. In particular, some young-zinc-finger-containing KZFPs are highly expressed in early embryonic development and in ESC differentiation into endoderm or mesoderm. Co-expression analysis revealed that young-zinc-finger-containing KZFPs are significantly enriched in five co-expression modules. Among them, one module, including 13 young-zinc-finger-containing KZFPs, showed an ‘early-high and late-low’ expression pattern. Further functional analysis revealed that they may function in early embryonic development and ESC differentiation by participating in cell cycle-related processes.
Conclusions: This study shows the conserved and special structural and expressional features of KZFPs, providing new clues about their functional characteristics and potential causes of their rapid evolution.
Supplementary Information: The online version contains supplementary material available at 10.1186/s12860-021-00346-w.
Affiliation(s)
- Pan Shen
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206, China
- Aishi Xu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206, China; Animal Sciences College of Jilin University, Changchun, 130062, China
- Yushan Hou
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206, China
- Huqiang Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206, China
- Chao Gao
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206, China
- Fuchu He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206, China
- Dong Yang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206, China
|
12
|
Coronado-Zamora M, Salvador-Martínez I, Castellano D, Barbadilla A, Salazar-Ciudad I. Adaptation and Conservation throughout the Drosophila melanogaster Life-Cycle. Genome Biol Evol 2019; 11:1463-1482. [PMID: 31028390 PMCID: PMC6535812 DOI: 10.1093/gbe/evz086] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Accepted: 04/16/2019] [Indexed: 01/09/2023]
Abstract
Previous studies of the evolution of genes expressed at different life-cycle stages of Drosophila melanogaster have not been able to disentangle adaptive from nonadaptive substitutions when using nonsynonymous sites. Here, we overcome this limitation by combining whole-genome polymorphism data from D. melanogaster and divergence data between D. melanogaster and Drosophila yakuba. For the set of genes expressed at different life-cycle stages of D. melanogaster, as reported in modENCODE, we estimate the ratio of substitutions relative to polymorphism between nonsynonymous and synonymous sites (α), and α is then decomposed into the ratios of adaptive (ωa) and nonadaptive (ωna) substitutions to synonymous substitutions. We find that the genes expressed in mid- and late-embryonic development are the most conserved, whereas those expressed in early development and postembryonic stages are the least conserved. Importantly, we find that low conservation in early development is due to high rates of nonadaptive substitutions (high ωna), whereas in postembryonic stages it is due, instead, to high rates of adaptive substitutions (high ωa). By using estimates of different genomic features (codon bias, average intron length, exon number, recombination rate, among others), we also find that genes expressed in mid- and late-embryonic development show the most complex architecture: they are larger, have more exons, more transcripts, and longer introns. In addition, these genes are broadly expressed among all stages. We suggest that all these genomic features are related to the conservation of mid- and late-embryonic development. Globally, our study supports the hourglass pattern of conservation and adaptation over the life-cycle.
Affiliation(s)
- Marta Coronado-Zamora
- Genomics, Bioinformatics and Evolution, Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
- Irepan Salvador-Martínez
- Evo-Devo Helsinki Community, Centre of Excellence in Experimental and Computational Developmental Biology, Institute of Biotechnology, University of Helsinki, Finland; Department of Genetics, Evolution and Environment, University College London, United Kingdom
- Antonio Barbadilla
- Genomics, Bioinformatics and Evolution, Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
- Isaac Salazar-Ciudad
- Genomics, Bioinformatics and Evolution, Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain; Evo-Devo Helsinki Community, Centre of Excellence in Experimental and Computational Developmental Biology, Institute of Biotechnology, University of Helsinki, Finland; Centre de Recerca Matemàtica, Cerdanyola del Vallès, Spain
|