1
|
Parvanova I, Borziak K, Finkelstein J. Towards an Open-Source Platform to Harmonize and Share Research Results from COVID-19 Interventional and Observational Studies. AMIA Jt Summits Transl Sci Proc 2023; 2023:157-166. [PMID: 37350901 PMCID: PMC10283106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Subscribe] [Scholar Register] [Indexed: 06/24/2023]
Abstract
As the SARS-CoV-2 virus continues to remain a universal threat on a global scale, a large number of COVID-19 clinical trials and observational studies are being conducted and published. Currently, 9,202 COVID-19 clinical trials have been registered on ClinicalTrials.gov and 293,187 COVID-19 articles were indexed in PubMed. To fully capitalize on the voluminous number of publications reporting COVID-19 interventional and observational studies, their results should be freely accessible via an open-source harmonized shared resource. We introduced ReMeDy (https://remedy.mssm.edu/), an intelligent integrative informatics platform aimed to harmonize and cross-link diverse COVID-19 trial outcomes and observational data. We tested the potential of the platform by uploading 52 COVID-19 clinical trials and 48 COVID-19 observational retrospective studies. ReMeDy was validated based on its capability to store and organize diverse data. The next steps include developing a crowdsourcing functionality coupled with automated outcome extraction using natural language processing.
Collapse
Affiliation(s)
| | - Kirill Borziak
- Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | | |
Collapse
|
2
|
Borziak K, Finkelstein J. X-linked genetic risk factors that promote autoimmunity and dampen remyelination are associated with multiple sclerosis susceptibility. Mult Scler Relat Disord 2022; 66:104065. [PMID: 35905688 DOI: 10.1016/j.msard.2022.104065] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 04/14/2022] [Accepted: 07/17/2022] [Indexed: 01/21/2023]
Abstract
BACKGROUND Multiple sclerosis (MS) is a chronic neurodegenerative disease, which has a strong genetic component and is more prevalent in women. MS is caused by an autoimmunity initiated inflammatory response which leads to axon demyelination, followed by axon loss, plaque formation and neurodegeneration. The goal of this article was to explore X-linked genetic factors that are associated with MS susceptibility. METHODS Using UK Biobank microarray, we analyzed the prevalence of alleles on the X chromosome to identify variants potentially involved in MS. Overall, 488,225 patients across 18,857 markers were analyzed using PLINK. RESULTS Our results identify 20 SNPs that are significantly more abundant in persons with MS. The genes associated with these SNPs belong to immunity (LAMP2, AVPR2, MTMR8, F8, BCOR, PORCN, and ELF4) and remyelination (NSDHL, HS6ST2, RBM10, TAZ, and AR) pathways that are potentially of great significance for understanding the onset and progression of multiple sclerosis. We further identified a significant 20-fold increase in incidence of MS cases in women with co-occurrences of SNPs associated with myelination and immunity functions. CONCLUSIONS Our analysis provides novel insights into the roles of X-linked genes in the onset and presentation of multiple sclerosis, identifying 20 SNPs in 14 genes involved primarily in immunity and myelination functions that are significantly more abundant in persons with MS. Our co-occurrence analysis suggests that concurrent disruption of both myelination and immune systems significantly increases the risk of MS onset in women.
Collapse
Affiliation(s)
- Kirill Borziak
- Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029 United States.
| | - Joseph Finkelstein
- Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029 United States
| |
Collapse
|
3
|
Borziak K, Finkelstein J. Gene Expression Markers of Prognostic Importance for Prostate Cancer Risk in Patients with Benign Prostate Hyperplasia. Annu Int Conf IEEE Eng Med Biol Soc 2022; 2022:73-76. [PMID: 36086411 DOI: 10.1109/embc48229.2022.9871422] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Comparative analyses utilizing publicly available big data have the potential to generate novel hypotheses and knowledge. However, this approach is underutilized in the realm of cancer research, particularly for prostate cancer. While the general progression of prostate cancer is now well understood, how individual cell types transition from healthy, to pre-cancerous, to cancerous cell types, remains to be further elucidated. To address this, we re-analyzed two publicly available single-cell RNA-seq datasets of prostate cancer and benign prostate hyperplasia cell types. The differential expression analysis of 15,505 epithelial cell profiles across 18,638 genes revealed 791 genes that were up regulated in prostate cancer epithelial cells. Here we report six markers that show significant upregulation in prostate cancer cells relative to BPH epithelial cells: HPN (5.62X), RAC3 (3.51X), CD24 (2.18X), HOXC6 (1.77X), AGR2 (1.71X), and IGFBP2 (1.28X). In particular, the significant differential expression of AGR2 further supports its clinical relevance in supplementing prostate-specific antigen screening for detecting prostate cancer. These findings have the potential to further advance our knowledge of genes governing the development of cancer in prostate epithelial cells. Clinical Relevance- Our results establish the importance of 6 prostate cancer markers (HPN, RAC3, CD24, HOXC6, AGR2, and IGFBP3) in distinguishing between prostate cancer epithelial cells and benign prostate hyperplasia epithelial cells.
Collapse
|
4
|
Finkelstein J, Guarino J, Huo X, Borziak K, Parvanova I. Exploring Determinants of Longevity of Biomedical Databases. Stud Health Technol Inform 2022; 290:135-139. [PMID: 35672986 DOI: 10.3233/shti220047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The maintenance of biomedical databases requires ongoing and systematic efforts in keeping them up-to-date which may affect long-term sustainability. Since research has become more reliant on publicly available biomedical data collections, it is important to understand factors affecting their longevity. The aim of this article was to explore potential determinants of biomedical database longevity. To build an analytical dataset, we used Database journal that have been created as an open access platform for presenting biological databases. A stratified analysis of all the original databases published in Database journal between 2009 and 2016 was conducted depending on their accessibility status. Overall, 35% of 518 analyzed databases were not accessible in 2020. We showed that databases with higher citation counts from institutions with higher scientific output were significantly more likely to be currently accessible. Databases from researchers with higher h-index were more likely to be accessible. Further investigation is warranted to identify factors affecting longevity of high impact databases.
Collapse
Affiliation(s)
- Joseph Finkelstein
- Center for Biomedical and Population Health Informatics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Jennifer Guarino
- Center for Biomedical and Population Health Informatics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Xingyue Huo
- Center for Biomedical and Population Health Informatics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Kirill Borziak
- Center for Biomedical and Population Health Informatics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Irena Parvanova
- Center for Biomedical and Population Health Informatics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| |
Collapse
|
5
|
Abstract
Big data reanalysis has the potential to generate novel comparative analyses which aim to generate novel hypotheses and knowledge. However, this approach is underutilized in the realm of cancer research, particularly for cancer stem cells (CSCs). CSCs are a rare subset of tumor cells, which dedifferentiate from healthy adult cells, and have the potential for self-renewal and treatment resistance. This analysis utilizes two publically available single-cell RNA-seq datasets of liver cancer and adult liver cell types to demonstrate how reanalysis of big data can lead to valuable new discoveries. We identify 519 differentially expressed genes between liver CSCs and healthy liver cell types. Here we report the potential novel liver CSC dedifferentiation factor, Msh Homeobox 2, which was significantly upregulated in liver CSCs by 1.36 fold (p-value < 1E-10). These findings have the potential to further advance our knowledge of genes governing the formation of CSCs.
Collapse
Affiliation(s)
- Kirill Borziak
- Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, New York 10029 USA
| | - Joseph Finkelstein
- Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, New York 10029 USA
| |
Collapse
|
6
|
Parvanova I, Borziak K, Poltinnikov A, Finkelstein J. A Web-based Platform to Share Harmonized Results from COVID-19 Clinical Studies. AMIA Annu Symp Proc 2022; 2022:836-845. [PMID: 37128442 PMCID: PMC10148277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
COVID-19 has caused a worldwide pandemic, accompanied by a high number of deaths and hospitalizations. Multiple preventative vaccines and variety of COVID-19 treatments have been developed and explored. This large volume of scientific work led to an extensive number of COVID-19 publications, which resulted in the necessity to standardize, store, share, and investigate research results in a harmonized manner. Attempts to standardize and share COVID-19 research data have been lacking. The purpose of the ReMeDy platform is to provide an intelligent informatics solution of integrating diverse COVID-19 trial outcomes and omics data across COVID-19 research studies. To test the platform, we utilized 48 COVID-19 observational retrospective studies. The robustness of the platform was validated through the ability to efficiently organize the diverse data elements. Next steps include expanding our database through the inclusion of all published COVID-19 studies. ReMeDy is located at https://remedy.mssm.edu/.
Collapse
Affiliation(s)
- Irena Parvanova
- Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Kirill Borziak
- Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Anar Poltinnikov
- Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | | |
Collapse
|
7
|
Parvanova I, Borziak K, Guarino J, Finkelstein J. A Platform for Integrating and Sharing Cancer Stem Cell Data. Annu Int Conf IEEE Eng Med Biol Soc 2021; 2021:2320-2325. [PMID: 34891750 DOI: 10.1109/embc46164.2021.9630083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Advancements in cancer research and treatment have highlighted the need for standardization and sharing of cancer stem cell (CSC) data to facilitate research transparency and to promote collaboration within the scientific community. Although previous applications have attempted to gather and disseminate these data, currently no platform organizes the heterogeneous CSC information into a harmonized project-based framework. The aim of our platform, ReMeDy, is to provide an intelligent informatics solution integrating diverse CSC characteristics, outcomes information, and omics data across clinical, preclinical and in vitro studies. These heterogeneous data streams are organized within a multi-modular framework, subjected to a stringent validation by using standardized ontologies, and stored in a searchable format. To test usefulness of our approach for capturing diverse data related to CSCs, we integrated data from 52 publicly-available CSC projects. We validated the robustness of the platform, by efficiently organizing diverse data elements, and demonstrated its potential for promoting future knowledge discovery driven by aggregation of published data. Next steps include expanding number of uploaded CSC projects and developing additional data visualization tools. The platform is accessible through https://remedy.mssm.edu/.
Collapse
|
8
|
Borziak K, Parvanova I, Finkelstein J. ReMeDy: a platform for integrating and sharing published stem cell research data with a focus on iPSC trials. Database (Oxford) 2021; 2021:baab038. [PMID: 34156448 PMCID: PMC8218701 DOI: 10.1093/database/baab038] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Revised: 05/25/2021] [Accepted: 05/28/2021] [Indexed: 12/28/2022]
Abstract
ABSTRACT Recent regenerative medicine studies have emphasized the need for increased standardization, harmonization and sharing of information related to stem cell product characterization, to help drive these innovative interventions toward public availability and to increase collaboration in the scientific community. Although numerous attempts and numerous databases have been made to manage these data, a platform that incorporates all the heterogeneous data collected from stem cell projects into a harmonized project-based framework is still lacking. The aim of the database, which is described in this study, is to provide an intelligent informatics solution that integrates comprehensive characterization of diverse stem cell product characteristics with research subject and project outcome information. In the resulting platform, heterogeneous data are validated using predefined ontologies and stored in a relational database, to ensure data quality and ease of access. Testing was performed using 51 published, publically available induced pluripotent stem cell projects conducted in clinical, preclinical and in-vitro evaluations. Future aims of this project include further increasing the database size to include all published stem cell trials and develop additional data visualization tools to improve usability. Our testing demonstrated the robustness of the proposed platform, by seamlessly harmonizing diverse common data elements, and the potential of this platform for driving knowledge generation from the aggregation and harmonization of these diverse data. DATABASE URL https://remedy.mssm.edu/.
Collapse
Affiliation(s)
- Kirill Borziak
- Center for Biomedical and Population Health Informatics, Icahn School of Medicine at Mount Sinai, 1425 Madison Avenue, Icahn L2-36, New York, NY 10029, USA
| | - Irena Parvanova
- Center for Biomedical and Population Health Informatics, Icahn School of Medicine at Mount Sinai, 1425 Madison Avenue, Icahn L2-36, New York, NY 10029, USA
| | - Joseph Finkelstein
- Center for Biomedical and Population Health Informatics, Icahn School of Medicine at Mount Sinai, 1425 Madison Avenue, Icahn L2-36, New York, NY 10029, USA
| |
Collapse
|
9
|
Borziak K, Finkelstein J. Identification of Liver Cancer Stem Cell Stemness Markers Using a Comparative Analysis of Public Data Sets. Stem Cells Cloning 2021; 14:9-17. [PMID: 34168465 PMCID: PMC8216768 DOI: 10.2147/sccaa.s307043] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Accepted: 05/28/2021] [Indexed: 11/23/2022]
Abstract
Purpose Comparative reanalysis of single-cell transcriptomics data to gain useful novel insights into cancer stem cells (CSCs), which are a rare subset of cells within tumors, characterized by their capability to self-renew and differentiate, and their role in tumorigenicity. Patients and Methods This project utilized publically available liver single-cell RNA-seq datasets of liver cancer and liver progenitor cell types to demonstrate how shared large amounts of data can generate new and valuable information. The data were analyzed using EdgeR differential expression analysis, with focus on a set of 34 known stemness markers. Results We showed that the expression of stemness markers SOX9, KRT19, KRT7, and CD24, and Yamanaka factors Oct4 and SOX2 in CSCs was significantly elevated relative to progenitor cell types, potentially explaining their increased differentiation and replication potential. Conclusion These results help to further document the complementary expression changes that give CSCs their distinct phenotypic profile. Our findings have potential significance to advance our knowledge of the important genes relevant to CSCs.
Collapse
Affiliation(s)
- Kirill Borziak
- Center for Biomedical and Population Health Informatics, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Joseph Finkelstein
- Center for Biomedical and Population Health Informatics, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| |
Collapse
|
10
|
Borziak K, Finkelstein J. Comparative Analysis of Public Data Sets to Identify Stemness Markers That Differentiate Liver Cancer Stem Cells. Stud Health Technol Inform 2021; 281:818-819. [PMID: 34042693 DOI: 10.3233/shti210290] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Cancer stem cells (CSCs) represent an important field in translational medicine for generating novel cancer treatments. To identify important stemness markers in liver CSCs that potentially explain their resistance to treatment, we analyzed 10865 single-cell RNA-seq samples across 42684 coding and non-coding genes. Our results show that CSCs have significantly increased expression of two Yamanaka factors (Oct4, 2.14X and SOX2, 1.13X) and three stemness factors (CD44, 3.25X; KRT7, 2.2X; SOX9, 1.71X), relative to liver progenitor cells. Our study demonstrates the potential power of harnessing shared big data for driving translational medicine for novel hypothesis generation.
Collapse
Affiliation(s)
- Kirill Borziak
- Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | | |
Collapse
|
11
|
McDonough-Goldstein CE, Borziak K, Pitnick S, Dorus S. Drosophila female reproductive tract gene expression reveals coordinated mating responses and rapidly evolving tissue-specific genes. G3 (Bethesda) 2021; 11:jkab020. [PMID: 33890615 PMCID: PMC8063083 DOI: 10.1093/g3journal/jkab020] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Accepted: 12/29/2020] [Indexed: 12/13/2022]
Abstract
Sexual reproduction in internally fertilizing species requires complex coordination between female and male reproductive systems and among the diverse tissues of the female reproductive tract (FRT). Here, we report a comprehensive, tissue-specific investigation of Drosophila melanogaster FRT gene expression before and after mating. We identified expression profiles that distinguished each tissue, including major differences between tissues with glandular or primarily nonglandular epithelium. All tissues were enriched for distinct sets of genes possessing secretion signals that exhibited accelerated evolution, as might be expected for genes participating in molecular interactions between the sexes within the FRT extracellular environment. Despite robust transcriptional differences between tissues, postmating responses were dominated by coordinated transient changes indicative of an integrated systems-level functional response. This comprehensive characterization of gene expression throughout the FRT identifies putative female contributions to postcopulatory events critical to reproduction and potentially reproductive isolation, as well as the putative targets of sexual selection and conflict.
Collapse
Affiliation(s)
| | - Kirill Borziak
- Center for Reproductive Evolution, Biology Department, Syracuse University, Syracuse, NY, USA
| | - Scott Pitnick
- Center for Reproductive Evolution, Biology Department, Syracuse University, Syracuse, NY, USA
| | - Steve Dorus
- Center for Reproductive Evolution, Biology Department, Syracuse University, Syracuse, NY, USA
| |
Collapse
|
12
|
Johnson SL, Borziak K, Kleffmann T, Rosengrave P, Dorus S, Gemmell NJ. Ovarian fluid proteome variation associates with sperm swimming speed in an externally fertilizing fish. J Evol Biol 2020; 33:1783-1794. [PMID: 33034086 PMCID: PMC7719593 DOI: 10.1111/jeb.13717] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2020] [Revised: 09/02/2020] [Accepted: 09/30/2020] [Indexed: 12/27/2022]
Abstract
Sperm velocity is a key trait that predicts the outcome of sperm competition. By promoting or impeding sperm velocity, females can control fertilization via postcopulatory cryptic female choice. In Chinook salmon, ovarian fluid (OF), which surrounds the ova, mediates sperm velocity according to male and female identity, biasing the outcome of sperm competition towards males with faster sperm. Past investigations have revealed proteome variation in OF, but the specific components of OF that differentially mediate sperm velocity have yet to be characterized. Here we use quantitative proteomics to investigate whether OF protein composition explains variation in sperm velocity and fertilization success. We found that OF proteomes from six females robustly clustered into two groups and that these groups are distinguished by the abundance of a restricted set of proteins significantly associated with sperm velocity. Exposure of sperm to OF from females in group I had faster sperm compared to sperm exposed to the OF of group II females. Overall, OF proteins that distinguished between these groups were enriched for vitellogenin and calcium ion interactions. Our findings suggest that these proteins may form the functional basis for cryptic female choice via the biochemical and physiological mediation of sperm velocity.
Collapse
Affiliation(s)
- Sheri L. Johnson
- Department of Zoology, University of Otago, Dunedin, New Zealand
| | - Kirill Borziak
- Biology Department, Center for Reproductive Evolution, Syracuse University, Syracuse, NY, USA
| | - Torsten Kleffmann
- Department of Biochemistry, Centre for Protein Research, University of Otago, Dunedin, New Zealand
| | - Patrice Rosengrave
- Department of Anatomy, University of Otago, Dunedin, New Zealand
- AgResearch, Biocontrol and Biosecurity, Christchurch, New Zealand
| | - Steve Dorus
- Biology Department, Center for Reproductive Evolution, Syracuse University, Syracuse, NY, USA
| | - Neil J. Gemmell
- Department of Anatomy, University of Otago, Dunedin, New Zealand
| |
Collapse
|
13
|
Rowe M, Whittington E, Borziak K, Ravinet M, Eroukhmanoff F, Sætre GP, Dorus S. Molecular Diversification of the Seminal Fluid Proteome in a Recently Diverged Passerine Species Pair. Mol Biol Evol 2020; 37:488-506. [PMID: 31665510 PMCID: PMC6993853 DOI: 10.1093/molbev/msz235] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Seminal fluid proteins (SFPs) mediate an array of postmating reproductive processes that influence fertilization and fertility. As such, it is widely held that SFPs may contribute to postmating, prezygotic reproductive barriers between closely related taxa. We investigated seminal fluid (SF) diversification in a recently diverged passerine species pair (Passer domesticus and Passer hispaniolensis) using a combination of proteomic and comparative evolutionary genomic approaches. First, we characterized and compared the SF proteome of the two species, revealing consistencies with known aspects of SFP biology and function in other taxa, including the presence and diversification of proteins involved in immunity and sperm maturation. Second, using whole-genome resequencing data, we assessed patterns of genomic differentiation between house and Spanish sparrows. These analyses detected divergent selection on immunity-related SF genes and positive selective sweeps in regions containing a number of SF genes that also exhibited protein abundance diversification between species. Finally, we analyzed the molecular evolution of SFPs across 11 passerine species and found a significantly higher rate of positive selection in SFPs compared with the rest of the genome, as well as significant enrichments for functional pathways related to immunity in the set of positively selected SF genes. Our results suggest that selection on immunity pathways is an important determinant of passerine SF composition and evolution. Assessing the role of immunity genes in speciation in other recently diverged taxa should be prioritized given the potential role for immunity-related proteins in reproductive incompatibilities in Passer sparrows.
Collapse
Affiliation(s)
- Melissah Rowe
- Natural History Museum, University of Oslo, Oslo, Norway.,Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, Norway.,Department of Animal Ecology, Netherlands Institute of Ecology (NIOO-KNAW), Wageningen, The Netherlands
| | - Emma Whittington
- Center for Reproductive Evolution, Department of Biology, Syracuse University, Syracuse, NY
| | - Kirill Borziak
- Center for Reproductive Evolution, Department of Biology, Syracuse University, Syracuse, NY
| | - Mark Ravinet
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, Norway
| | - Fabrice Eroukhmanoff
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, Norway
| | - Glenn-Peter Sætre
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, Norway
| | - Steve Dorus
- Center for Reproductive Evolution, Department of Biology, Syracuse University, Syracuse, NY
| |
Collapse
|
14
|
Borziak K, Qi T, Evangelista JE, Clarke DJB, Ma'ayan A, Finkelstein J. Towards Intelligent Integration and Sharing of Stem Cell Research Data. Stud Health Technol Inform 2020; 272:334-337. [PMID: 32604670 DOI: 10.3233/shti200563] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Advancements in regenerative medicine have brought to the fore the need for increased standardization and sharing of stem cell product characterization to help drive these innovative interventions toward public availability. Although numerous attempts have been made to store this data, there is still a lack of a platform that incorporates heterogeneous stem cell information into a harmonized project-based framework. The aim of this project was to introduce and pilot-test an intelligent informatics solution which integrates diverse stem cell product characteristics with study subject and omics information. In the resulting platform, heterogeneous data is validated using predefined ontologies and stored in a NoSQL repository. Pilot-testing was performed on nine sponsored stem cell projects conducting preclinical and intervention evaluations. The pilot-testing demonstrated the robustness of the proposed platform, by seamlessly harmonizing diverse common data elements, and the potential of this platform for driving knowledge generation from the aggregation of this shared data.
Collapse
Affiliation(s)
- Kirill Borziak
- Icahn School of Medicine at Mount Sinai, New York, NY 10029 USA
| | - Tianye Qi
- Icahn School of Medicine at Mount Sinai, New York, NY 10029 USA
| | | | | | - Avi Ma'ayan
- Icahn School of Medicine at Mount Sinai, New York, NY 10029 USA
| | | |
Collapse
|
15
|
Alvarez-Fernandez A, Borziak K, McDonald GC, Dorus S, Pizzari T. Female novelty and male status dynamically modulate ejaculate expenditure and seminal fluid proteome over successive matings in red junglefowl. Sci Rep 2019; 9:5852. [PMID: 30971704 PMCID: PMC6458113 DOI: 10.1038/s41598-019-41336-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2018] [Accepted: 03/04/2019] [Indexed: 11/16/2022] Open
Abstract
Theory predicts that males will strategically invest in ejaculates according to the value of mating opportunities. While strategic sperm allocation has been studied extensively, little is known about concomitant changes in seminal fluid (SF) and its molecular composition, despite increasing evidence that SF proteins (SFPs) are fundamental in fertility and sperm competition. Here, we show that in male red junglefowl, Gallus gallus, along with changes in sperm numbers and SF investment, SF composition changed dynamically over successive matings with a first female, immediately followed by mating with a second, sexually novel female. The SF proteome exhibited a pattern of both protein depletion and enrichment over successive matings, including progressive increases in immunity and plasma proteins. Ejaculates allocated to the second female had distinct proteomic profiles, where depletion of many SFPs was compensated by increased investment in others. This response was partly modulated by male social status: when mating with the second, novel female, subdominants (but not dominants) preferentially invested in SFPs associated with sperm composition, which may reflect status-specific differences in mating rates, sperm maturation and sperm competition. Global proteomic SF analysis thus reveals that successive matings trigger rapid, dynamic SFP changes driven by a combination of depletion and strategic allocation.
Collapse
Affiliation(s)
| | - Kirill Borziak
- Center for Reproductive Evolution, Syracuse University, 107 College Place, Syracuse, NY, 13244, USA
| | - Grant C McDonald
- Edward Grey Institute, Department of Zoology, University of Oxford, Oxford, OX1 3SZ, UK
| | - Steve Dorus
- Center for Reproductive Evolution, Syracuse University, 107 College Place, Syracuse, NY, 13244, USA.
| | - Tommaso Pizzari
- Edward Grey Institute, Department of Zoology, University of Oxford, Oxford, OX1 3SZ, UK.
| |
Collapse
|
16
|
Degner EC, Ahmed-Braimah YH, Borziak K, Wolfner MF, Harrington LC, Dorus S. Proteins, Transcripts, and Genetic Architecture of Seminal Fluid and Sperm in the Mosquito Aedes aegypti. Mol Cell Proteomics 2019; 18:S6-S22. [PMID: 30552291 PMCID: PMC6427228 DOI: 10.1074/mcp.ra118.001067] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2018] [Revised: 11/29/2018] [Indexed: 11/06/2022] Open
Abstract
The yellow fever mosquito, Aedes aegypti,, transmits several viruses causative of serious diseases, including dengue, Zika, and chikungunya. Some proposed efforts to control this vector involve manipulating reproduction to suppress wild populations or to replace them with disease-resistant mosquitoes. The design of such strategies requires an intimate knowledge of reproductive processes, yet our basic understanding of reproductive genetics in this vector remains largely incomplete. To accelerate future investigations, we have comprehensively catalogued sperm and seminal fluid proteins (SFPs) transferred to females in the ejaculate using tandem mass spectrometry. By excluding female-derived proteins using an isotopic labeling approach, we identified 870 sperm proteins and 280 SFPs. Functional composition analysis revealed parallels with known aspects of sperm biology and SFP function in other insects. To corroborate our proteome characterization, we also generated transcriptomes for testes and the male accessory glands-the primary contributors to Ae. aegypti, sperm and seminal fluid, respectively. Differential gene expression of accessory glands from virgin and mated males suggests that transcripts encoding proteins involved in protein translation are upregulated post-mating. Several SFP transcripts were also modulated after mating, but >90% remained unchanged. Finally, a significant enrichment of SFPs was observed on chromosome 1, which harbors the male sex determining locus in this species. Our study provides a comprehensive proteomic and transcriptomic characterization of ejaculate production and composition and thus provides a foundation for future investigations of Ae. aegypti, reproductive biology, from functional analysis of individual proteins to broader examination of reproductive processes.
Collapse
Affiliation(s)
- Ethan C Degner
- From the ‡Department of Entomology, Cornell University, Ithaca, New York
| | | | - Kirill Borziak
- Center for Reproductive Evolution, Syracuse University, Syracuse, New York
| | - Mariana F Wolfner
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York;.
| | - Laura C Harrington
- From the ‡Department of Entomology, Cornell University, Ithaca, New York;.
| | - Steve Dorus
- Center for Reproductive Evolution, Syracuse University, Syracuse, New York.
| |
Collapse
|
17
|
Ow MC, Borziak K, Nichitean AM, Dorus S, Hall SE. Early experiences mediate distinct adult gene expression and reproductive programs in Caenorhabditis elegans. PLoS Genet 2018; 14:e1007219. [PMID: 29447162 PMCID: PMC5831748 DOI: 10.1371/journal.pgen.1007219] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2017] [Revised: 02/28/2018] [Accepted: 01/24/2018] [Indexed: 01/16/2023] Open
Abstract
Environmental stress during early development in animals can have profound effects on adult phenotypes via programmed changes in gene expression. Using the nematode C. elegans, we demonstrated previously that adults retain a cellular memory of their developmental experience that is manifested by differences in gene expression and life history traits; however, the sophistication of this system in response to different environmental stresses, and how it dictates phenotypic plasticity in adults that contribute to increased fitness in response to distinct environmental challenges, was unknown. Using transcriptional profiling, we show here that C. elegans adults indeed retain distinct cellular memories of different environmental conditions. We identified approximately 500 genes in adults that entered dauer due to starvation that exhibit significant opposite (“seesaw”) transcriptional phenotypes compared to adults that entered dauer due to crowding, and are distinct from animals that bypassed dauer. Moreover, we show that two-thirds of the genes in the genome experience a 2-fold or greater seesaw trend in gene expression, and based upon the direction of change, are enriched in large, tightly linked regions on different chromosomes. Importantly, these transcriptional programs correspond to significant changes in brood size depending on the experienced stress. In addition, we demonstrate that while the observed seesaw gene expression changes occur in both somatic and germline tissue, only starvation-induced changes require a functional GLP-4 protein necessary for germline development, and both programs require the Argonaute CSR-1. Thus, our results suggest that signaling between the soma and the germ line can generate phenotypic plasticity as a result of early environmental experience, and likely contribute to increased fitness in adverse conditions and the evolution of the C. elegans genome. Environmental stress during early development in animals can have profound effects on adult behavior and physiology due to programmed changes in gene expression. However, whether different stresses result in distinct changes in traits that allow stressed animals to better survive and reproduce in future adverse conditions is largely unknown. Using the animal model system, C. elegans, we show that adults that experienced starvation exhibit opposite (“seesaw”) genome-wide gene expression changes compared to adults that experienced crowding, and are distinct from animals that experienced favorable conditions. Genes that are similarly up- or downregulated due to either starvation or crowding are located in clusters on the same chromosomes. Importantly, these gene expression changes of differently-stressed animals result in corresponding changes in progeny number, a life history trait of evolutionary significance. These distinct gene expression programs require different signaling pathways that communicate across somatic and germline tissue types. Thus, different environmental stresses experienced early in development induce distinct signaling mechanisms to result in changes in gene expression and reproduction in adults, and likely contribute to increased survival in future adverse conditions.
Collapse
Affiliation(s)
- Maria C. Ow
- Department of Biology, Syracuse University, Syracuse, NY, United States of America
| | - Kirill Borziak
- Department of Biology, Syracuse University, Syracuse, NY, United States of America
- Center for Reproductive Evolution, Department of Biology, Syracuse University, Syracuse, NY, United States of America
| | | | - Steve Dorus
- Department of Biology, Syracuse University, Syracuse, NY, United States of America
- Center for Reproductive Evolution, Department of Biology, Syracuse University, Syracuse, NY, United States of America
| | - Sarah E. Hall
- Department of Biology, Syracuse University, Syracuse, NY, United States of America
- * E-mail:
| |
Collapse
|
18
|
Whittington E, Forsythe D, Borziak K, Karr TL, Walters JR, Dorus S. Contrasting patterns of evolutionary constraint and novelty revealed by comparative sperm proteomic analysis in Lepidoptera. BMC Genomics 2017; 18:931. [PMID: 29197336 PMCID: PMC5712127 DOI: 10.1186/s12864-017-4293-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Accepted: 11/13/2017] [Indexed: 12/25/2022] Open
Abstract
Background Rapid evolution is a hallmark of reproductive genetic systems and arises through the combined processes of sequence divergence, gene gain and loss, and changes in gene and protein expression. While studies aiming to disentangle the molecular ramifications of these processes are progressing, we still know little about the genetic basis of evolutionary transitions in reproductive systems. Here we conduct the first comparative analysis of sperm proteomes in Lepidoptera, a group that exhibits dichotomous spermatogenesis, in which males produce a functional fertilization-competent sperm (eupyrene) and an incompetent sperm morph lacking nuclear DNA (apyrene). Through the integrated application of evolutionary proteomics and genomics, we characterize the genomic patterns potentially associated with the origination and evolution of this unique spermatogenic process and assess the importance of genetic novelty in Lepidopteran sperm biology. Results Comparison of the newly characterized Monarch butterfly (Danaus plexippus) sperm proteome to those of the Carolina sphinx moth (Manduca sexta) and the fruit fly (Drosophila melanogaster) demonstrated conservation at the level of protein abundance and post-translational modification within Lepidoptera. In contrast, comparative genomic analyses across insects reveals significant divergence at two levels that differentiate the genetic architecture of sperm in Lepidoptera from other insects. First, a significant reduction in orthology among Monarch sperm genes relative to the remainder of the genome in non-Lepidopteran insect species was observed. Second, a substantial number of sperm proteins were found to be specific to Lepidoptera, in that they lack detectable homology to the genomes of more distantly related insects. Lastly, the functional importance of Lepidoptera specific sperm proteins is broadly supported by their increased abundance relative to proteins conserved across insects. Conclusions Our results identify a burst of genetic novelty amongst sperm proteins that may be associated with the origin of heteromorphic spermatogenesis in ancestral Lepidoptera and/or the subsequent evolution of this system. This pattern of genomic diversification is distinct from the remainder of the genome and thus suggests that this transition has had a marked impact on lepidopteran genome evolution. The identification of abundant sperm proteins unique to Lepidoptera, including proteins distinct between specific lineages, will accelerate future functional studies aiming to understand the developmental origin of dichotomous spermatogenesis and the functional diversification of the fertilization incompetent apyrene sperm morph. Electronic supplementary material The online version of this article (10.1186/s12864-017-4293-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Emma Whittington
- Center for Reproductive Evolution, Department of Biology, Syracuse University, Syracuse, NY, USA
| | - Desiree Forsythe
- Science Education and Society, University of Rhode Island, Kingston, RI, USA
| | - Kirill Borziak
- Center for Reproductive Evolution, Department of Biology, Syracuse University, Syracuse, NY, USA
| | - Timothy L Karr
- Ecology and Evolutionary Biology, Kansas University, Lawrence, KS, USA
| | - James R Walters
- Department of Genomics and Genetic Resources, Kyoto Institute of Technology. Saga Ippon-cho, Ukyo-ku, Kyoto, Japan
| | - Steve Dorus
- Center for Reproductive Evolution, Department of Biology, Syracuse University, Syracuse, NY, USA.
| |
Collapse
|
19
|
Abstract
Sexual selection is the pervasive force underlying the dramatic divergence of sperm form and function. Although it has been demonstrated that testis gene expression evolves rapidly, exploration of the proteomic basis of sperm diversity is in its infancy. We have employed a whole-cell proteomics approach to characterize sperm divergence among closely related Mus species that experience different sperm competition regimes and exhibit pronounced variation in sperm energetics, motility and fertilization capacity. Interspecific comparisons revealed significant abundance differences amongst proteins involved in fertilization capacity, including those that govern sperm-zona pellucida interactions, axoneme components and metabolic proteins. Ancestral reconstruction of relative testis size suggests that the reduction of zona pellucida binding proteins and heavy-chain dyneins was associated with a relaxation in sperm competition in the M. musculus lineage. Additionally, the decreased reliance on ATP derived from glycolysis in high sperm competition species was reflected in abundance decreases in glycolytic proteins of the principle piece in M. spretus and M. spicilegus. Comparison of protein abundance and stage-specific testis expression revealed a significant correlation during spermatid development when dynamic morphological changes occur. Proteins underlying sperm diversification were also more likely to be subject to translational repression, suggesting that sperm composition is influenced by the evolution of translation control mechanisms. The identification of functionally coherent classes of proteins relating to sperm competition highlights the utility of evolutionary proteomic analyses and reveals that both intensified and relaxed sperm competition can have a pronounced impact on the molecular composition of the male gamete.
Collapse
Affiliation(s)
- Alberto Vicens
- Reproductive Biology and Evolution Group, Department of Biodiversity and Biological Evolution, Museo Nacional de Ciencias Naturales (CSIC), Madrid, Spain
| | - Kirill Borziak
- Department of Biology, Syracuse University, Syracuse, NY
| | - Timothy L Karr
- Department of Genomics and Genetic Resources, Kyoto Institute of Technology, Kyoto, Japan
| | - Eduardo R S Roldan
- Reproductive Biology and Evolution Group, Department of Biodiversity and Biological Evolution, Museo Nacional de Ciencias Naturales (CSIC), Madrid, Spain
| | - Steve Dorus
- Department of Biology, Syracuse University, Syracuse, NY
| |
Collapse
|
20
|
Borziak K, Álvarez-Fernández A, L Karr T, Pizzari T, Dorus S. The Seminal fluid proteome of the polyandrous Red junglefowl offers insights into the molecular basis of fertility, reproductive ageing and domestication. Sci Rep 2016; 6:35864. [PMID: 27804984 PMCID: PMC5090203 DOI: 10.1038/srep35864] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2016] [Accepted: 10/06/2016] [Indexed: 02/03/2023] Open
Abstract
Seminal fluid proteins (SFPs) are emerging as fundamental contributors to sexual selection given their role in post-mating reproductive events, particularly in polyandrous species where the ejaculates of different males compete for fertilisation. SFP identification however remains taxonomically limited and little is known about avian SFPs, despite extensive work on sexual selection in birds. We characterize the SF proteome of the polyandrous Red junglefowl, Gallus gallus, the wild species that gave rise to the domestic chicken. We identify 1,141 SFPs, including proteins involved in immunity and antimicrobial defences, sperm maturation, and fertilisation, revealing a functionally complex SF proteome. This includes a predominant contribution of blood plasma proteins that is conserved with human SF. By comparing the proteome of young and old males with fast or slow sperm velocity in a balanced design, we identify proteins associated with ageing and sperm velocity, and show that old males that retain high sperm velocity have distinct proteome characteristics. SFP comparisons with domestic chickens revealed both qualitative and quantitative differences likely associated with domestication and artificial selection. Collectively, these results shed light onto the functional complexity of avian SF, and provide a platform for molecular studies of fertility, reproductive ageing, and domestication.
Collapse
Affiliation(s)
- Kirill Borziak
- Center for Reproductive Evolution, Department of Biology, Syracuse University, US
| | | | - Timothy L Karr
- Drosophila Genetic Resource Center, Kyoto Institute of Technology, Saga Ippongi-cho, Ukyo-ku, Kyoto 616-8354, Japan
| | - Tommaso Pizzari
- Edward Grey Institute, Department of Zoology, University of Oxford, UK
| | - Steve Dorus
- Center for Reproductive Evolution, Department of Biology, Syracuse University, US
| |
Collapse
|
21
|
Whittington E, Zhao Q, Borziak K, Walters JR, Dorus S. Characterisation of the Manduca sexta sperm proteome: Genetic novelty underlying sperm composition in Lepidoptera. Insect Biochem Mol Biol 2015; 62:183-193. [PMID: 25731083 DOI: 10.1016/j.ibmb.2015.02.011] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2014] [Revised: 02/13/2015] [Accepted: 02/18/2015] [Indexed: 06/04/2023]
Abstract
The application of mass spectrometry based proteomics to sperm biology has greatly accelerated progress in understanding the molecular composition and function of spermatozoa. To date, these approaches have been largely restricted to model organisms, all of which produce a single sperm morph capable of oocyte fertilisation. Here we apply high-throughput mass spectrometry proteomic analysis to characterise sperm composition in Manduca sexta, the tobacco hornworm moth, which produce heteromorphic sperm, including one fertilisation competent (eupyrene) and one incompetent (apyrene) sperm type. This resulted in the high confidence identification of 896 proteins from a co-mixed sample of both sperm types, of which 167 are encoded by genes with strict one-to-one orthology in Drosophila melanogaster. Importantly, over half (55.1%) of these orthologous proteins have previously been identified in the D. melanogaster sperm proteome and exhibit significant conservation in quantitative protein abundance in sperm between the two species. Despite the complex nature of gene expression across spermatogenic stages, a significant correlation was also observed between sperm protein abundance and testis gene expression. Lepidopteran-specific sperm proteins (e.g., proteins with no homology to proteins in non-Lepidopteran taxa) were present in significantly greater abundance on average than those with homology outside the Lepidoptera. Given the disproportionate production of apyrene sperm (96% of all mature sperm in Manduca) relative to eupyrene sperm, these evolutionarily novel and highly abundant proteins are candidates for possessing apyrene-specific functions. Lastly, comparative genomic analyses of testis-expressed, ovary-expressed and sperm genes identified a concentration of novel sperm proteins shared amongst Lepidoptera of potential relevance to the evolutionary origin of heteromorphic spermatogenesis. As the first published Lepidopteran sperm proteome, this whole-cell proteomic characterisation will facilitate future evolutionary genetic and developmental studies of heteromorphic sperm production and parasperm function. Furthermore, the analyses presented here provide useful annotation information regarding sex-biased gene expression, novel Lepidopteran genes and gene function in the male gamete to complement the newly sequenced and annotated Manduca genome.
Collapse
Affiliation(s)
| | - Qian Zhao
- Department of Biology, Syracuse University, Syracuse, NY, USA
| | - Kirill Borziak
- Department of Biology, Syracuse University, Syracuse, NY, USA
| | - James R Walters
- Ecology and Evolutionary Biology, Kansas University, Lawrence, KS, USA
| | - Steve Dorus
- Department of Biology, Syracuse University, Syracuse, NY, USA.
| |
Collapse
|
22
|
Borziak K, Posner MG, Upadhyay A, Danson MJ, Bagby S, Dorus S. Comparative genomic analysis reveals 2-oxoacid dehydrogenase complex lipoylation correlation with aerobiosis in archaea. PLoS One 2014; 9:e87063. [PMID: 24489835 PMCID: PMC3904984 DOI: 10.1371/journal.pone.0087063] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2013] [Accepted: 12/18/2013] [Indexed: 02/04/2023] Open
Abstract
Metagenomic analyses have advanced our understanding of ecological microbial diversity, but to what extent can metagenomic data be used to predict the metabolic capacity of difficult-to-study organisms and their abiotic environmental interactions? We tackle this question, using a comparative genomic approach, by considering the molecular basis of aerobiosis within archaea. Lipoylation, the covalent attachment of lipoic acid to 2-oxoacid dehydrogenase multienzyme complexes (OADHCs), is essential for metabolism in aerobic bacteria and eukarya. Lipoylation is catalysed either by lipoate protein ligase (LplA), which in archaea is typically encoded by two genes (LplA-N and LplA-C), or by a lipoyl(octanoyl) transferase (LipB or LipM) plus a lipoic acid synthetase (LipA). Does the genomic presence of lipoylation and OADHC genes across archaea from diverse habitats correlate with aerobiosis? First, analyses of 11,826 biotin protein ligase (BPL)-LplA-LipB transferase family members and 147 archaeal genomes identified 85 species with lipoylation capabilities and provided support for multiple ancestral acquisitions of lipoylation pathways during archaeal evolution. Second, with the exception of the Sulfolobales order, the majority of species possessing lipoylation systems exclusively retain LplA, or either LipB or LipM, consistent with archaeal genome streamlining. Third, obligate anaerobic archaea display widespread loss of lipoylation and OADHC genes. Conversely, a high level of correspondence is observed between aerobiosis and the presence of LplA/LipB/LipM, LipA and OADHC E2, consistent with the role of lipoylation in aerobic metabolism. This correspondence between OADHC lipoylation capacity and aerobiosis indicates that genomic pathway profiling in archaea is informative and that well characterized pathways may be predictive in relation to abiotic conditions in difficult-to-study extremophiles. Given the highly variable retention of gene repertoires across the archaea, the extension of comparative genomic pathway profiling to broader metabolic and homeostasis networks should be useful in revealing characteristics from metagenomic datasets related to adaptations to diverse environments.
Collapse
Affiliation(s)
- Kirill Borziak
- Department of Biology, Syracuse University, Syracuse, New York, United States of America
| | - Mareike G. Posner
- Department of Biology & Biochemistry, University of Bath, Claverton Down, United Kingdom
| | - Abhishek Upadhyay
- Department of Biology & Biochemistry, University of Bath, Claverton Down, United Kingdom
| | - Michael J. Danson
- Department of Biology & Biochemistry, University of Bath, Claverton Down, United Kingdom
- Centre for Extremophile Research, University of Bath, Claverton Down, United Kingdom
| | - Stefan Bagby
- Department of Biology & Biochemistry, University of Bath, Claverton Down, United Kingdom
- * E-mail: (SB); (SD)
| | - Steve Dorus
- Department of Biology, Syracuse University, Syracuse, New York, United States of America
- * E-mail: (SB); (SD)
| |
Collapse
|
23
|
Borziak K, Zhulin IB. FIST: a sensory domain for diverse signal transduction pathways in prokaryotes and ubiquitin signaling in eukaryotes. ACTA ACUST UNITED AC 2007; 23:2518-21. [PMID: 17855421 PMCID: PMC5067152 DOI: 10.1093/bioinformatics/btm384] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
MOTIVATION Sensory domains that are conserved among Bacteria, Archaea and Eucarya are important detectors of common signals detected by living cells. Due to their high sequence divergence, sensory domains are difficult to identify. We systematically look for novel sensory domains using sensitive profile-based searches initiated with regions of signal transduction proteins where no known domains can be identified by current domain models. RESULTS Using profile searches followed by multiple sequence alignment, structure prediction and domain architecture analysis, we have identified a novel sensory domain termed FIST, which is present in signal transduction proteins from Bacteria, Archaea and Eucarya. Chromosomal proximity of FIST-encoding genes to those coding for proteins involved in amino acid metabolism and transport suggest that FIST domains bind small ligands, such as amino acids.
Collapse
|