1
|
Sumera, Anwer F, Waseem M, Fatima A, Malik N, Ali A, Zahid S. Molecular Docking and Molecular Dynamics Studies Reveal Secretory Proteins as Novel Targets of Temozolomide in Glioblastoma Multiforme. Molecules 2022; 27:7198. [PMID: 36364024 PMCID: PMC9653723 DOI: 10.3390/molecules27217198] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 09/24/2022] [Accepted: 09/29/2022] [Indexed: 10/13/2023] Open
Abstract
Glioblastoma multiforme (GBM) is a tumor of glial origin and is the most malignant, aggressive and prevalent type, with the highest mortality rate in adult brain cancer. Surgical resection of the tumor followed by Temozolomide (TMZ) therapy is currently available, but the development of resistance to TMZ is a common limiting factor in effective treatment. The present study investigated the potential interactions of TMZ with several secretory proteins involved in various molecular and cellular processes in GBM. Automated docking studies were performed using AutoDock 4.2, which showed an encouraging binding affinity of TMZ towards all targeted proteins, with the strongest interaction and binding affinity with GDF1 and SLIT1, followed by NPTX1, CREG2 and SERPINI, among the selected proteins. Molecular dynamics (MD) simulations of protein-ligand complexes were performed via CABS-flex V2.0 and the iMOD server to evaluate the root-mean-square fluctuations (RMSFs) and measure protein stability, respectively. The results showed that docked models were more flexible and stable with TMZ, suggesting that it may be able to target putative proteins implicated in gliomagenesis that may impact radioresistance. However, additional in vitro and in vivo investigations can ascertain the potential of the selected proteins to serve as novel targets for TMZ for GBM treatment.
Collapse
Affiliation(s)
- Sumera
- Neurobiology Research Laboratory, Department of Healthcare Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Islamabad 44000, Pakistan
| | - Farha Anwer
- Integrative Biology Laboratory, Department of Industrial Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Islamabad 44000, Pakistan
| | - Maaz Waseem
- Integrative Biology Laboratory, Department of Industrial Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Islamabad 44000, Pakistan
| | - Areeba Fatima
- Neurobiology Research Laboratory, Department of Healthcare Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Islamabad 44000, Pakistan
| | - Nishat Malik
- Neurobiology Research Laboratory, Department of Healthcare Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Islamabad 44000, Pakistan
| | - Amjad Ali
- Integrative Biology Laboratory, Department of Industrial Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Islamabad 44000, Pakistan
| | - Saadia Zahid
- Neurobiology Research Laboratory, Department of Healthcare Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Islamabad 44000, Pakistan
| |
Collapse
|
2
|
Gorniak-Walas M, Nizinska K, Lukasiuk K. Cloning and Functional Analysis of Rat Tweety-Homolog 1 Gene Promoter. Neurochem Res 2021; 46:2463-2472. [PMID: 34173119 PMCID: PMC8302521 DOI: 10.1007/s11064-021-03374-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Revised: 06/06/2021] [Accepted: 06/08/2021] [Indexed: 11/06/2022]
Abstract
Tweety-homolog 1 protein (Ttyh1) is abundantly expressed in neurons in the healthy brain, and its expression is induced under pathological conditions. In hippocampal neurons in vitro, Ttyh1 was implicated in the regulation of primary neuron morphology. However, the mechanisms that underlie transcriptional regulation of the Ttyh1 gene in neurons remain elusive. The present study sought to identify the promoter of the Ttyh1 gene and functionally characterize cis-regulatory elements that are potentially involved in the transcriptional regulation of Ttyh1 expression in rat dissociated hippocampal neurons in vitro. We cloned a 592 bp rat Ttyh1 promoter sequence and designed deletion constructs of the transcription factors specificity protein 1 (Sp1), E2F transcription factor 3 (E2f3), and achaete-scute homolog 1 (Ascl1) that were fused upstream of a luciferase reporter gene in pGL4.10[luc2]. The luciferase reporter gene assay showed the possible involvement of Ascl1, Sp1, and responsive cis-regulatory elements in Ttyh1 expression. These findings provide novel information about Ttyh1 gene regulation in neurons.
Collapse
Affiliation(s)
- Malgorzata Gorniak-Walas
- Laboratory of Epileptogenesis, Nencki Institute of Experimental Biology, Polish Academy of Sciences, 3 Pasteur Street, 02-093, Warsaw, Poland
| | - Karolina Nizinska
- Laboratory of Epileptogenesis, Nencki Institute of Experimental Biology, Polish Academy of Sciences, 3 Pasteur Street, 02-093, Warsaw, Poland
| | - Katarzyna Lukasiuk
- Laboratory of Epileptogenesis, Nencki Institute of Experimental Biology, Polish Academy of Sciences, 3 Pasteur Street, 02-093, Warsaw, Poland.
| |
Collapse
|
3
|
Cardiac risk stratification in cancer patients: A longitudinal patient-patient network analysis. PLoS Med 2021; 18:e1003736. [PMID: 34339408 PMCID: PMC8366997 DOI: 10.1371/journal.pmed.1003736] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 08/16/2021] [Accepted: 07/15/2021] [Indexed: 01/03/2023] Open
Abstract
BACKGROUND Cardiovascular disease is a leading cause of death in general population and the second leading cause of mortality and morbidity in cancer survivors after recurrent malignancy in the United States. The growing awareness of cancer therapy-related cardiac dysfunction (CTRCD) has led to an emerging field of cardio-oncology; yet, there is limited knowledge on how to predict which patients will experience adverse cardiac outcomes. We aimed to perform unbiased cardiac risk stratification for cancer patients using our large-scale, institutional electronic medical records. METHODS AND FINDINGS We built a large longitudinal (up to 22 years' follow-up from March 1997 to January 2019) cardio-oncology cohort having 4,632 cancer patients in Cleveland Clinic with 5 diagnosed cardiac outcomes: atrial fibrillation, coronary artery disease, heart failure, myocardial infarction, and stroke. The entire population includes 84% white Americans and 11% black Americans, and 59% females versus 41% males, with median age of 63 (interquartile range [IQR]: 54 to 71) years old. We utilized a topology-based K-means clustering approach for unbiased patient-patient network analyses of data from general demographics, echocardiogram (over 25,000), lab testing, and cardiac factors (cardiac). We performed hazard ratio (HR) and Kaplan-Meier analyses to identify clinically actionable variables. All confounding factors were adjusted by Cox regression models. We performed random-split and time-split training-test validation for our model. We identified 4 clinically relevant subgroups that are significantly correlated with incidence of cardiac outcomes and mortality. Among the 4 subgroups, subgroup I (n = 625) has the highest risk of de novo CTRCD (28%) with an HR of 3.05 (95% confidence interval (CI) 2.51 to 3.72). Patients in subgroup IV (n = 1,250) had the worst survival probability (HR 4.32, 95% CI 3.82 to 4.88). From longitudinal patient-patient network analyses, the patients in subgroup I had a higher percentage of de novo CTRCD and a worse mortality within 5 years after the initiation of cancer therapies compared to long-time exposure (6 to 20 years). Using clinical variable network analyses, we identified that serum levels of NT-proB-type Natriuretic Peptide (NT-proBNP) and Troponin T are significantly correlated with patient's mortality (NT-proBNP > 900 pg/mL versus NT-proBNP = 0 to 125 pg/mL, HR = 2.95, 95% CI 2.28 to 3.82, p < 0.001; Troponin T > 0.05 μg/L versus Troponin T ≤ 0.01 μg/L, HR = 2.08, 95% CI 1.83 to 2.34, p < 0.001). Study limitations include lack of independent cardio-oncology cohorts from different healthcare systems to evaluate the generalizability of the models. Meanwhile, the confounding factors, such as multiple medication usages, may influence the findings. CONCLUSIONS In this study, we demonstrated that the patient-patient network clustering methodology is clinically intuitive, and it allows more rapid identification of cancer survivors that are at greater risk of cardiac dysfunction. We believed that this study holds great promise for identifying novel cardiac risk subgroups and clinically actionable variables for the development of precision cardio-oncology.
Collapse
|
4
|
iTRAQ comparison of proteomic profiles of endometrial receptivity. J Proteomics 2019; 203:103381. [PMID: 31102758 DOI: 10.1016/j.jprot.2019.103381] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2018] [Revised: 04/17/2019] [Accepted: 05/06/2019] [Indexed: 01/01/2023]
Abstract
Endometrial receptivity is a limiting step in human reproduction. A disruption in the development of endometrial receptivity is responsible for recurrent implantation failures (RIF) of endometrial origin. To understand the molecular mechanisms behind the endometrial receptivity process, we used the isobaric tag for relative and absolute quantitation (iTRAQ) method to compare three different endometrial statuses: fertile women, intrauterine device (IUD) carriers, and RIF patients. Overall, iTRAQ allowed identified 1889 non-redundant proteins. Of these, 188 were differentially expressed proteins (DEP) (p-value < .05). Pairwise comparisons revealed 133 significant DEP in fertile vs. IUD carriers and 158 DEP in RIF vs. IUD carriers. However, no DEP were identified between fertile and RIF patients. Western blot validation of three DEP involved in endometrial receptivity (plastin 2, lactotransferrin, and lysozyme) confirmed our iTRAQ results. Moreover, functional KEGG enrichment revealed that complement and coagulation cascades and peroxisome were the two most significant pathways for the RIF vs. IUD comparison and ribosome and spliceosome for the fertile vs. IUD comparison, as possible important pathways involved in the endometrial receptivity acquisition. The lack of DEP between fertile and RIF patient endometria suggest that idiopathic RIF may not have an endometrial origin, with other as-yet-unknown factors involved. SIGNIFICANCE: A pilot study where a comparison of the endometrial protein profile from women with different endometrial receptive grade (fertile women, IUD carriers and RIF patients) during the same period of time (overlapping with the window of implantation) of a hormone replacement therapy was performed using a high-throughput proteomic technique. This approach lead us to better understand the molecular mechanisms undergoing endometrial receptivity, a time-limiting step to achieve pregnancy in humans. Moreover, the number of samples per group (10 Fertile women, 10 IUD carriers and 8 RIF patients) according to the methodology here employed (8plex iTRAQ), give more robustness to our results. Our findings confirm that an IUD introduces numerous changes in the endometrial protein profile when compared to fertile and RIF endometria, revealing some key proteins involved in endometrial receptivity. Finding no significant differences between Fertile and RIF patient endometria could suggest that other as-yet-unknown factors could be involved in the etiology of idiopathic RIF.
Collapse
|
5
|
Tweety-Homolog 1 Drives Brain Colonization of Gliomas. J Neurosci 2017; 37:6837-6850. [PMID: 28607172 DOI: 10.1523/jneurosci.3532-16.2017] [Citation(s) in RCA: 97] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2016] [Revised: 03/16/2017] [Accepted: 04/19/2017] [Indexed: 01/25/2023] Open
Abstract
Early and progressive colonization of the healthy brain is one hallmark of diffuse gliomas, including glioblastomas. We recently discovered ultralong (>10 to hundreds of microns) membrane protrusions [tumor microtubes (TMs)] extended by glioma cells. TMs have been associated with the capacity of glioma cells to effectively invade the brain and proliferate. Moreover, TMs are also used by some tumor cells to interconnect to one large, resistant multicellular network. Here, we performed a correlative gene-expression microarray and in vivo imaging analysis, and identified novel molecular candidates for TM formation and function. Interestingly, these genes were previously linked to normal CNS development. One of the genes scoring highest in tests related to the outgrowth of TMs was tweety-homolog 1 (TTYH1), which was highly expressed in a fraction of TMs in mice and patients. Ttyh1 was confirmed to be a potent regulator of normal TM morphology and of TM-mediated tumor-cell invasion and proliferation. Glioma cells with one or two TMs were mainly responsible for effective brain colonization, and Ttyh1 downregulation particularly affected this cellular subtype, resulting in reduced tumor progression and prolonged survival of mice. The remaining Ttyh1-deficient tumor cells, however, had more interconnecting TMs, which were associated with increased radioresistance in those small tumors. These findings imply a cellular and molecular heterogeneity in gliomas regarding formation and function of distinct TM subtypes, with multiple parallels to neuronal development, and suggest that Ttyh1 might be a promising target to specifically reduce TM-associated brain colonization by glioma cells in patients.SIGNIFICANCE STATEMENT In this report, we identify tweety-homolog 1 (Ttyh1), a membrane protein linked to neuronal development, as a potent driver of tumor microtube (TM)-mediated brain colonization by glioma cells. Targeting of Ttyh1 effectively inhibited the formation of invasive TMs and glioma growth, but increased network formation by intercellular TMs, suggesting a functional and molecular heterogeneity of the recently discovered TMs with potential implications for future TM-targeting strategies.
Collapse
|
6
|
Burks DJ, Azad RK. Identification and Network-Enabled Characterization of Auxin Response Factor Genes in Medicago truncatula. FRONTIERS IN PLANT SCIENCE 2016; 7:1857. [PMID: 28018393 PMCID: PMC5145899 DOI: 10.3389/fpls.2016.01857] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/13/2016] [Accepted: 11/25/2016] [Indexed: 05/26/2023]
Abstract
The Auxin Response Factor (ARF) family of transcription factors is an important regulator of environmental response and symbiotic nodulation in the legume Medicago truncatula. While previous studies have identified members of this family, a recent spurt in gene expression data coupled with genome update and reannotation calls for a reassessment of the prevalence of ARF genes and their interaction networks in M. truncatula. We performed a comprehensive analysis of the M. truncatula genome and transcriptome that entailed search for novel ARF genes and the co-expression networks. Our investigation revealed 8 novel M. truncatula ARF (MtARF) genes, of the total 22 identified, and uncovered novel gene co-expression networks as well. Furthermore, the topological clustering and single enrichment analysis of several network models revealed the roles of individual members of the MtARF family in nitrogen regulation, nodule initiation, and post-embryonic development through a specialized protein packaging and secretory pathway. In summary, this study not just shines new light on an important gene family, but also provides a guideline for identification of new members of gene families and their functional characterization through network analyses.
Collapse
Affiliation(s)
- David J. Burks
- Department of Biological Sciences, University of North TexasDenton, TX, USA
| | - Rajeev K. Azad
- Department of Biological Sciences, University of North TexasDenton, TX, USA
- Department of Mathematics, University of North TexasDenton, TX, USA
| |
Collapse
|
7
|
Beiki H, Nejati-Javaremi A, Pakdel A, Masoudi-Nejad A, Hu ZL, Reecy JM. Large-scale gene co-expression network as a source of functional annotation for cattle genes. BMC Genomics 2016; 17:846. [PMID: 27806696 PMCID: PMC5094014 DOI: 10.1186/s12864-016-3176-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2016] [Accepted: 10/18/2016] [Indexed: 11/15/2022] Open
Abstract
Background Genome sequencing and subsequent gene annotation of genomes has led to the elucidation of many genes, but in vertebrates the actual number of protein coding genes are very consistent across species (~20,000). Seven years after sequencing the cattle genome, there are still genes that have limited annotation and the function of many genes are still not understood, or partly understood at best. Based on the assumption that genes with similar patterns of expression across a vast array of tissues and experimental conditions are likely to encode proteins with related functions or participate within a given pathway, we constructed a genome-wide Cattle Gene Co-expression Network (CGCN) using 72 microarray datasets that contained a total of 1470 Affymetrix Genechip Bovine Genome Arrays that were retrieved from either NCBI GEO or EBI ArrayExpress. Results The total of 16,607 probe sets, which represented 11,397 genes, with unique Entrez ID were consolidated into 32 co-expression modules that contained between 29 and 2569 probe sets. All of the identified modules showed strong functional enrichment for gene ontology (GO) terms and Reactome pathways. For example, modules with important biological functions such as response to virus, response to bacteria, energy metabolism, cell signaling and cell cycle have been identified. Moreover, gene co-expression networks using “guilt-by-association” principle have been used to predict the potential function of 132 genes with no functional annotation. Four unknown Hub genes were identified in modules highly enriched for GO terms related to leukocyte activation (LOC509513), RNA processing (LOC100848208), nucleic acid metabolic process (LOC100850151) and organic-acid metabolic process (MGC137211). Such highly connected genes should be investigated more closely as they likely to have key regulatory roles. Conclusions We have demonstrated that the CGCN and its corresponding regulons provides rich information for experimental biologists to design experiments, interpret experimental results, and develop novel hypothesis on gene function in this poorly annotated genome. The network is publicly accessible at http://www.animalgenome.org/cgi-bin/host/reecylab/d. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3176-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Hamid Beiki
- Department of Animal Science, University College of Agriculture and Natural Resources, University of Tehran, Karaj, 31587-11167, Iran.,Department of Animal Science, Iowa State University, Ames, IA, 50011, USA
| | - Ardeshir Nejati-Javaremi
- Department of Animal Science, University College of Agriculture and Natural Resources, University of Tehran, Karaj, 31587-11167, Iran.
| | - Abbas Pakdel
- Department of Animal Science, College of Agriculture, Isfahan University of Technology, Isfahan, 84156-83111, Iran
| | - Ali Masoudi-Nejad
- Laboratory of Systems Biology and Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, 31587-11167, Iran
| | - Zhi-Liang Hu
- Department of Animal Science, Iowa State University, Ames, IA, 50011, USA
| | - James M Reecy
- Department of Animal Science, Iowa State University, Ames, IA, 50011, USA
| |
Collapse
|
8
|
AbdulHameed MDM, Ippolito DL, Stallings JD, Wallqvist A. Mining kidney toxicogenomic data by using gene co-expression modules. BMC Genomics 2016; 17:790. [PMID: 27724849 PMCID: PMC5057266 DOI: 10.1186/s12864-016-3143-y] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2015] [Accepted: 09/29/2016] [Indexed: 12/15/2022] Open
Abstract
Background Acute kidney injury (AKI) caused by drug and toxicant ingestion is a serious clinical condition associated with high mortality rates. We currently lack detailed knowledge of the underlying molecular mechanisms and biological networks associated with AKI. In this study, we carried out gene co-expression analyses using DrugMatrix—a large toxicogenomics database with gene expression data from rats exposed to diverse chemicals—and identified gene modules associated with kidney injury to probe the molecular-level details of this disease. Results We generated a comprehensive set of gene co-expression modules by using the Iterative Signature Algorithm and found distinct clusters of modules that shared genes and were associated with similar chemical exposure conditions. We identified two module clusters that showed specificity for kidney injury in that they 1) were activated by chemical exposures causing kidney injury, 2) were not activated by other chemical exposures, and 3) contained known AKI-relevant genes such as Havcr1, Clu, and Tff3. We used the genes in these AKI-relevant module clusters to develop a signature of 30 genes that could assess the potential of a chemical to cause kidney injury well before injury actually occurs. We integrated AKI-relevant module cluster genes with protein-protein interaction networks and identified the involvement of immunoproteasomes in AKI. To identify biological networks and processes linked to Havcr1, we determined genes within the modules that frequently co-express with Havcr1, including Cd44, Plk2, Mdm2, Hnmt, Macrod1, and Gtpbp4. We verified this procedure by showing that randomized data did not identify Havcr1 co-expression genes and that excluding up to 10 % of the data caused only minimal degradation of the gene set. Finally, by using an external dataset from a rat kidney ischemic study, we showed that the frequently co-expressed genes of Havcr1 behaved similarly in a model of non-chemically induced kidney injury. Conclusions Our study demonstrated that co-expression modules and co-expressed genes contain rich information for generating novel biomarker hypotheses and constructing mechanism-based molecular networks associated with kidney injury. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3143-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Mohamed Diwan M AbdulHameed
- Department of Defense Biotechnology High Performance Computing Software Applications Institute, Telemedicine and Advanced Technology Research Center, U.S. Army Medical Research and Materiel Command, 504 Scott Street, Fort Detrick, MD, 21702, USA
| | - Danielle L Ippolito
- U.S. Army Center for Environmental Health Research, 568 Doughten Drive, Fort Detrick, MD, 21702, USA
| | - Jonathan D Stallings
- U.S. Army Center for Environmental Health Research, 568 Doughten Drive, Fort Detrick, MD, 21702, USA
| | - Anders Wallqvist
- Department of Defense Biotechnology High Performance Computing Software Applications Institute, Telemedicine and Advanced Technology Research Center, U.S. Army Medical Research and Materiel Command, 504 Scott Street, Fort Detrick, MD, 21702, USA.
| |
Collapse
|
9
|
Li L, Wurtele ES. The QQS orphan gene of Arabidopsis modulates carbon and nitrogen allocation in soybean. PLANT BIOTECHNOLOGY JOURNAL 2015; 13:177-87. [PMID: 25146936 PMCID: PMC4345402 DOI: 10.1111/pbi.12238] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/20/2014] [Revised: 06/30/2014] [Accepted: 07/03/2014] [Indexed: 05/19/2023]
Abstract
The genome of each species contains as high as 8% of genes that are uniquely present in that species. Little is known about the functional significance of these so-called species specific or orphan genes. The Arabidopsis thaliana gene Qua-Quine Starch (QQS) is species specific. Here, we show that altering QQS expression in Arabidopsis affects carbon partitioning to both starch and protein. We hypothesized QQS may be conserved in a feature other than primary sequence, and as such could function to impact composition in another species. To test the potential of QQS in affecting composition in an ectopic species, we introduced QQS into soybean. Soybean T1 lines expressing QQS have up to 80% decreased leaf starch and up to 60% increased leaf protein; T4 generation seeds from field-grown plants contain up to 13% less oil, while protein is increased by up to 18%. These data broaden the concept of QQS as a modulator of carbon and nitrogen allocation, and demonstrate that this species-specific gene can affect the seed composition of an agronomic species thought to have diverged from Arabidopsis 100 million years ago.
Collapse
Affiliation(s)
- Ling Li
- Department of Genetics, Development and Cell Biology, Iowa State UniversityAmes, IA, USA
| | - Eve Syrkin Wurtele
- Department of Genetics, Development and Cell Biology, Iowa State UniversityAmes, IA, USA
| |
Collapse
|
10
|
Arendsee ZW, Li L, Wurtele ES. Coming of age: orphan genes in plants. TRENDS IN PLANT SCIENCE 2014; 19:698-708. [PMID: 25151064 DOI: 10.1016/j.tplants.2014.07.003] [Citation(s) in RCA: 94] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/17/2014] [Revised: 06/27/2014] [Accepted: 07/17/2014] [Indexed: 05/19/2023]
Abstract
Sizable minorities of protein-coding genes from every sequenced eukaryotic and prokaryotic genome are unique to the species. These so-called ‘orphan genes’ may evolve de novo from non-coding sequence or be derived from older coding material. They are often associated with environmental stress responses and species-specific traits or regulatory patterns. However, difficulties in studying genes where comparative analysis is impossible, and a bias towards broadly conserved genes, have resulted in underappreciation of their importance. We review here the identification, possible origins, evolutionary trends, and functions of orphans with an emphasis on their role in plant biology. We exemplify several evolutionary trends with an analysis of Arabidopsis thaliana and present QQS as a model orphan gene.
Collapse
|
11
|
Lehne B, Schlitt T. Breaking free from the chains of pathway annotation: de novo pathway discovery for the analysis of disease processes. Pharmacogenomics 2013; 13:1967-78. [PMID: 23215889 DOI: 10.2217/pgs.12.170] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
Interpreting the biological implications of high-throughput experiments such as gene-expression studies, genome-wide association studies and large-scale sequencing studies is not trivial. Gene-set and pathway analyses are useful tools to support the interpretation of such experiments, but rely on curated pathways or gene sets. The recent development of de novo pathway discovery methods aims to overcome this limitation. This article provides an overview of the methods currently available and reviews the advantages and challenges of this approach. In detail, it highlights the particular issues of de novo pathway discovery based on genome-wide association studies data, for which multiple different strategies have been proposed.
Collapse
Affiliation(s)
- Benjamin Lehne
- Bioinformatics Group, Department of Medical & Molecular Genetics, 8th Floor Tower Wing Guy's Hospital, London SE1 9RT, UK
| | | |
Collapse
|
12
|
Hur M, Campbell AA, Almeida-de-Macedo M, Li L, Ransom N, Jose A, Crispin M, Nikolau BJ, Wurtele ES. A global approach to analysis and interpretation of metabolic data for plant natural product discovery. Nat Prod Rep 2013; 30:565-83. [PMID: 23447050 PMCID: PMC3629923 DOI: 10.1039/c3np20111b] [Citation(s) in RCA: 83] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Discovering molecular components and their functionality is key to the development of hypotheses concerning the organization and regulation of metabolic networks. The iterative experimental testing of such hypotheses is the trajectory that can ultimately enable accurate computational modelling and prediction of metabolic outcomes. This information can be particularly important for understanding the biology of natural products, whose metabolism itself is often only poorly defined. Here, we describe factors that must be in place to optimize the use of metabolomics in predictive biology. A key to achieving this vision is a collection of accurate time-resolved and spatially defined metabolite abundance data and associated metadata. One formidable challenge associated with metabolite profiling is the complexity and analytical limits associated with comprehensively determining the metabolome of an organism. Further, for metabolomics data to be efficiently used by the research community, it must be curated in publicly available metabolomics databases. Such databases require clear, consistent formats, easy access to data and metadata, data download, and accessible computational tools to integrate genome system-scale datasets. Although transcriptomics and proteomics integrate the linear predictive power of the genome, the metabolome represents the nonlinear, final biochemical products of the genome, which results from the intricate system(s) that regulate genome expression. For example, the relationship of metabolomics data to the metabolic network is confounded by redundant connections between metabolites and gene-products. However, connections among metabolites are predictable through the rules of chemistry. Therefore, enhancing the ability to integrate the metabolome with anchor-points in the transcriptome and proteome will enhance the predictive power of genomics data. We detail a public database repository for metabolomics, tools and approaches for statistical analysis of metabolomics data, and methods for integrating these datasets with transcriptomic data to create hypotheses concerning specialized metabolisms that generate the diversity in natural product chemistry. We discuss the importance of close collaborations among biologists, chemists, computer scientists and statisticians throughout the development of such integrated metabolism-centric databases and software.
Collapse
Affiliation(s)
- Manhoi Hur
- Human Computer Interactions and Department of Genetics Development and Cell Biology, 2624 Howe Hall, Iowa State University, Ames, IA 50011, USA. Fax: +1 515 294 0803; Tel: +1 515 708 3232;
| | - Alexis Ann Campbell
- Biochemistry, Biophysics and Molecular Biology and Center for Biorenewable Chemicals and Center for Metabolic Biology, 3254 Molecular Biology Building, Iowa State University, Ames, IA 50010, USA. Fax: +1 515 294 9423; Tel: +1 515 294 0453;
| | - Marcia Almeida-de-Macedo
- Department of Genetics Development and Cell Biology, 2624 Howe Hall, Iowa State University, Ames, IA 50011, USA. Fax: +1 515 294 5530; Tel: +1 515 294 3738;
| | - Ling Li
- Department of Genetics Development and Cell Biology, 443 Bessey Hall Iowa State University, Ames, IA 50011, USA. Fax: +1 515 294 1337; Tel: +1 515 294 6236;
| | - Nick Ransom
- Department of Genetics Development and Cell Biology, 2624 Howe Hall, Iowa State University, Ames, IA 50011, USA. Fax: +1 515 294 0803; Tel: +1 515 708 3232;
| | - Adarsh Jose
- Bioinformatics and Computational Biology, Center for Biorenewable Chemicals, Iowa State University, Ames, IA 50010, USA. Fax: +1 515 294 1269; Tel: +1 515 230 3429;
| | - Matt Crispin
- Department of Genetics Development and Cell Biology, 443 Bessey Hall Iowa State University, Ames, IA 50011, USA. Fax: +1 515 294 1337; Tel: +1 515 294 6236;
| | - Basil J. Nikolau
- Biochemistry, Biophysics and Molecular Biology and Center for Biorenewable Chemicals and Center for Metabolic Biology, 3254 Molecular Biology Building, Iowa State University, Ames, IA 50010, USA. Fax: +1 515 294 9423; Tel: +1 515 294 0453;
| | - Eve Syrkin Wurtele
- Department of Genetics, Development and Cell Biology, Center for Metabolic Biology, and Center for Biorenewable Chemicals, 2624D Howe Hall, Iowa State University, Ames, IA 50011, USA. Fax: +1 515 294 0803; Tel: +1 515 708 3232;
| |
Collapse
|
13
|
Zhang L, Berleant D, Wang Y, Li L, Cook D, Wurtele ES. BirdsEyeView (BEV): graphical overviews of experimental data. BMC Bioinformatics 2012; 13 Suppl 15:S11. [PMID: 23046276 PMCID: PMC3439726 DOI: 10.1186/1471-2105-13-s15-s11] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background Analyzing global experimental data can be tedious and time-consuming. Thus, helping biologists see results as quickly and easily as possible can facilitate biological research, and is the purpose of the software we describe. Results We present BirdsEyeView, a software system for visualizing experimental transcriptomic data using different views that users can switch among and compare. BirdsEyeView graphically maps data to three views: Cellular Map (currently a plant cell), Pathway Tree with dynamic mapping, and Gene Ontology http://www.geneontology.org Biological Processes and Molecular Functions. By displaying color-coded values for transcript levels across different views, BirdsEyeView can assist users in developing hypotheses about their experiment results. Conclusions BirdsEyeView is a software system available as a Java Webstart package for visualizing transcriptomic data in the context of different biological views to assist biologists in investigating experimental results. BirdsEyeView can be obtained from http://metnetdb.org/MetNet_BirdsEyeView.htm.
Collapse
Affiliation(s)
- Lifeng Zhang
- Department of Electrical and Computer Engineering, Iowa State University, Ames, Iowa 50011, USA
| | | | | | | | | | | |
Collapse
|