1
|
Amniouel S, Yalamanchili K, Sankararaman S, Jafri MS. Evaluating Ovarian Cancer Chemotherapy Response Using Gene Expression Data and Machine Learning. BIOMEDINFORMATICS 2024; 4:1396-1424. [PMID: 39149564 PMCID: PMC11326537 DOI: 10.3390/biomedinformatics4020077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]
Abstract
Background Ovarian cancer (OC) is the most lethal gynecological cancer in the United States. Among the different types of OC, serous ovarian cancer (SOC) stands out as the most prevalent. Transcriptomics techniques generate extensive gene expression data, yet only a few of these genes are relevant to clinical diagnosis. Methods Methods for feature selection (FS) address the challenges of high dimensionality in extensive datasets. This study proposes a computational framework that applies FS techniques to identify genes highly associated with platinum-based chemotherapy response on SOC patients. Using SOC datasets from the Gene Expression Omnibus (GEO) database, LASSO and varSelRF FS methods were employed. Machine learning classification algorithms such as random forest (RF) and support vector machine (SVM) were also used to evaluate the performance of the models. Results The proposed framework has identified biomarkers panels with 9 and 10 genes that are highly correlated with platinum-paclitaxel and platinum-only response in SOC patients, respectively. The predictive models have been trained using the identified gene signatures and accuracy of above 90% was achieved. Conclusions In this study, we propose that applying multiple feature selection methods not only effectively reduces the number of identified biomarkers, enhancing their biological relevance, but also corroborates the efficacy of drug response prediction models in cancer treatment.
Collapse
Affiliation(s)
- Soukaina Amniouel
- School of System Biology, George Mason University, Fairfax, VA 22030, USA
| | - Keertana Yalamanchili
- School of System Biology, George Mason University, Fairfax, VA 22030, USA
- School of Engineering, Brown University, Providence, RI 02912, USA
| | - Sreenidhi Sankararaman
- School of System Biology, George Mason University, Fairfax, VA 22030, USA
- Department of Biomedical Engineering, The John Hopkins University, Baltimore, MD 21218, USA
| | - Mohsin Saleet Jafri
- School of System Biology, George Mason University, Fairfax, VA 22030, USA
- Center for Biomedical Engineering and Technology, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| |
Collapse
|
2
|
McCarthy FM, Jones TEM, Kwitek AE, Smith CL, Vize PD, Westerfield M, Bruford EA. The case for standardizing gene nomenclature in vertebrates. Nature 2023; 614:E31-E32. [PMID: 36792746 PMCID: PMC9931569 DOI: 10.1038/s41586-022-05633-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Accepted: 12/06/2022] [Indexed: 02/17/2023]
Affiliation(s)
- Fiona M McCarthy
- The Chicken Gene Nomenclature Committee (CGNC), School of Animal and Comparative Biomedical Sciences, University of Arizona, Tucson, AZ, USA
| | - Tamsin E M Jones
- HUGO Gene Nomenclature Committee (HGNC), European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Anne E Kwitek
- Rat Genome Database, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Cynthia L Smith
- Mouse Genome Database, The Jackson Laboratory, Bar Harbor, ME, USA
| | - Peter D Vize
- Xenbase, Departments of Biological Sciences and Computer Science, University of Calgary, Calgary, Alberta, Canada
| | - Monte Westerfield
- ZFIN, Institute of Neuroscience, University of Oregon, Eugene, OR, USA
| | - Elspeth A Bruford
- HUGO Gene Nomenclature Committee (HGNC), European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK.
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge, UK.
| |
Collapse
|
3
|
Braschi B, Bruford EA, Cavanagh AT, Neuman SD, Bashirullah A. The bridge-like lipid transfer protein (BLTP) gene group: introducing new nomenclature based on structural homology indicating shared function. Hum Genomics 2022; 16:66. [PMID: 36461115 PMCID: PMC9719229 DOI: 10.1186/s40246-022-00439-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Accepted: 11/22/2022] [Indexed: 12/03/2022] Open
Abstract
The HUGO Gene Nomenclature Committee assigns unique symbols and names to human genes. The use of approved nomenclature enables effective communication between researchers, and there are multiple examples of how the usage of unapproved alias symbols can lead to confusion. We discuss here a recent nomenclature update (May 2022) for a set of genes that encode proteins with a shared repeating β-groove domain. Some of the proteins encoded by genes in this group have already been shown to function as lipid transporters. By working with researchers in the field, we have been able to introduce a new root symbol (BLTP, which stands for "bridge-like lipid transfer protein") for this domain-based gene group. This new nomenclature not only reflects the shared domain in these proteins, but also takes into consideration the mounting evidence of a shared lipid transport function.
Collapse
Affiliation(s)
- Bryony Braschi
- HUGO Gene Nomenclature Committee, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridgeshire, CB10 1SD, UK.
| | - Elspeth A Bruford
- HUGO Gene Nomenclature Committee, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridgeshire, CB10 1SD, UK
- Department of Haematology, School of Clinical Medicine, University of Cambridge, Cambridge, Cambridgeshire, CB2 0AW, UK
| | - Amy T Cavanagh
- Division of Pharmaceutical Sciences, University of Wisconsin-Madison, Madison, WI, 53705-2222, USA
| | - Sarah D Neuman
- Division of Pharmaceutical Sciences, University of Wisconsin-Madison, Madison, WI, 53705-2222, USA
| | - Arash Bashirullah
- Division of Pharmaceutical Sciences, University of Wisconsin-Madison, Madison, WI, 53705-2222, USA
| |
Collapse
|
4
|
Bruford EA, Braschi B, Haim-Vilmovsky L, Jones TEM, Seal RL, Tweedie S. The importance of being the HGNC. Hum Genomics 2022; 16:58. [PMID: 36380364 PMCID: PMC9664783 DOI: 10.1186/s40246-022-00432-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Accepted: 11/07/2022] [Indexed: 11/16/2022] Open
Abstract
The HUGO Gene Nomenclature Committee (HGNC) has been providing standardized symbols and names for human genes since the late 1970s. As funding agencies change their priorities, finding financial support for critical biomedical resources such as the HGNC becomes ever more challenging. In this article, we outline the key roles the HGNC currently plays in aiding communication and the need for these activities to be maintained.
Collapse
Affiliation(s)
- Elspeth A. Bruford
- grid.5335.00000000121885934Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge, CB2 0PT UK ,grid.52788.300000 0004 0427 7672HUGO Gene Nomenclature Committee, European Molecular Biology Laboratory, EMBL-EBI, Wellcome Genome Campus, Hinxton, CB10 1SD UK
| | - Bryony Braschi
- grid.52788.300000 0004 0427 7672HUGO Gene Nomenclature Committee, European Molecular Biology Laboratory, EMBL-EBI, Wellcome Genome Campus, Hinxton, CB10 1SD UK
| | - Liora Haim-Vilmovsky
- grid.52788.300000 0004 0427 7672HUGO Gene Nomenclature Committee, European Molecular Biology Laboratory, EMBL-EBI, Wellcome Genome Campus, Hinxton, CB10 1SD UK
| | - Tamsin E. M. Jones
- grid.52788.300000 0004 0427 7672HUGO Gene Nomenclature Committee, European Molecular Biology Laboratory, EMBL-EBI, Wellcome Genome Campus, Hinxton, CB10 1SD UK
| | - Ruth L. Seal
- grid.5335.00000000121885934Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge, CB2 0PT UK ,grid.52788.300000 0004 0427 7672HUGO Gene Nomenclature Committee, European Molecular Biology Laboratory, EMBL-EBI, Wellcome Genome Campus, Hinxton, CB10 1SD UK
| | - Susan Tweedie
- grid.52788.300000 0004 0427 7672HUGO Gene Nomenclature Committee, European Molecular Biology Laboratory, EMBL-EBI, Wellcome Genome Campus, Hinxton, CB10 1SD UK
| |
Collapse
|
5
|
Seal RL, Braschi B, Gray K, Jones TEM, Tweedie S, Haim-Vilmovsky L, Bruford EA. Genenames.org: the HGNC resources in 2023. Nucleic Acids Res 2022; 51:D1003-D1009. [PMID: 36243972 PMCID: PMC9825485 DOI: 10.1093/nar/gkac888] [Citation(s) in RCA: 215] [Impact Index Per Article: 71.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 09/28/2022] [Accepted: 10/03/2022] [Indexed: 01/30/2023] Open
Abstract
The HUGO Gene Nomenclature Committee (HGNC) assigns unique symbols and names to human genes. The HGNC database (www.genenames.org) currently contains over 43 000 approved gene symbols, over 19 200 of which are assigned to protein-coding genes, 14 000 to pseudogenes and nearly 9000 to non-coding RNA genes. The public website, www.genenames.org, displays all approved nomenclature within Symbol Reports that contain data curated by HGNC nomenclature advisors and links to related genomic, clinical, and proteomic information. Here, we describe updates to our resource, including improvements to our search facility and new download features.
Collapse
Affiliation(s)
- Ruth L Seal
- To whom correspondence should be addressed. Tel: +44 1223 494444; Fax: +44 1223 494446;
| | - Bryony Braschi
- HUGO Gene Nomenclature Committee, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Kristian Gray
- HUGO Gene Nomenclature Committee, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK,Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge CB2 0PT, UK
| | - Tamsin E M Jones
- HUGO Gene Nomenclature Committee, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Susan Tweedie
- HUGO Gene Nomenclature Committee, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Liora Haim-Vilmovsky
- HUGO Gene Nomenclature Committee, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Elspeth A Bruford
- HUGO Gene Nomenclature Committee, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK,Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge CB2 0PT, UK
| |
Collapse
|
6
|
Sarafidis M, Lambrou GI, Zoumpourlis V, Koutsouris D. An Integrated Bioinformatics Analysis towards the Identification of Diagnostic, Prognostic, and Predictive Key Biomarkers for Urinary Bladder Cancer. Cancers (Basel) 2022; 14:cancers14143358. [PMID: 35884419 PMCID: PMC9319344 DOI: 10.3390/cancers14143358] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 07/03/2022] [Accepted: 07/06/2022] [Indexed: 02/04/2023] Open
Abstract
Simple Summary Bladder cancer is evidently a challenge as far as its prognosis and treatment are concerned. The investigation of potential biomarkers and therapeutic targets is indispensable and still in progress. Most studies attempt to identify differential signatures between distinct molecular tumor subtypes. Therefore, keeping in mind the heterogeneity of urinary bladder tumors, we attempted to identify a consensus gene-related signature between the common expression profile of bladder cancer and control samples. In the quest for substantive features, we were able to identify key hub genes, whose signatures could hold diagnostic, prognostic, or therapeutic significance, but, primarily, could contribute to a better understanding of urinary bladder cancer biology. Abstract Bladder cancer (BCa) is one of the most prevalent cancers worldwide and accounts for high morbidity and mortality. This study intended to elucidate potential key biomarkers related to the occurrence, development, and prognosis of BCa through an integrated bioinformatics analysis. In this context, a systematic meta-analysis, integrating 18 microarray gene expression datasets from the GEO repository into a merged meta-dataset, identified 815 robust differentially expressed genes (DEGs). The key hub genes resulted from DEG-based protein–protein interaction and weighted gene co-expression network analyses were screened for their differential expression in urine and blood plasma samples of BCa patients. Subsequently, they were tested for their prognostic value, and a three-gene signature model, including COL3A1, FOXM1, and PLK4, was built. In addition, they were tested for their predictive value regarding muscle-invasive BCa patients’ response to neoadjuvant chemotherapy. A six-gene signature model, including ANXA5, CD44, NCAM1, SPP1, CDCA8, and KIF14, was developed. In conclusion, this study identified nine key biomarker genes, namely ANXA5, CDT1, COL3A1, SPP1, VEGFA, CDCA8, HJURP, TOP2A, and COL6A1, which were differentially expressed in urine or blood of BCa patients, held a prognostic or predictive value, and were immunohistochemically validated. These biomarkers may be of significance as prognostic and therapeutic targets for BCa.
Collapse
Affiliation(s)
- Michail Sarafidis
- Biomedical Engineering Laboratory, School of Electrical and Computer Engineering, National Technical University of Athens, 9 Iroon Polytechniou Str., 15780 Athens, Greece;
- Correspondence: ; Tel.: +30-210-772-2430
| | - George I. Lambrou
- Choremeio Research Laboratory, First Department of Pediatrics, National and Kapodistrian University of Athens, 8 Thivon & Levadeias Str., 11527 Athens, Greece;
- University Research Institute of Maternal and Child Health and Precision Medicine, National and Kapodistrian University of Athens, 8 Thivon & Levadeias Str., 11527 Athens, Greece
| | - Vassilis Zoumpourlis
- Biomedical Applications Unit, Institute of Chemical Biology, National Hellenic Research Foundation, 48 Vas. Konstantinou Ave., 11635 Athens, Greece;
| | - Dimitrios Koutsouris
- Biomedical Engineering Laboratory, School of Electrical and Computer Engineering, National Technical University of Athens, 9 Iroon Polytechniou Str., 15780 Athens, Greece;
| |
Collapse
|
7
|
Bruford E. Comment on Herring et al. The Use of "Retardation" in FRAXA, FMRP, FMR1 and Other Designations. Cells 2022, 11, 1044. Cells 2022; 11:cells11121937. [PMID: 35741066 PMCID: PMC9221639 DOI: 10.3390/cells11121937] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 06/14/2022] [Indexed: 02/04/2023] Open
Abstract
This commentary is written in response to the recent article from Herring et al., discussing the eradication of the offensive term "retardation" from gene nomenclature. We discuss the work of the HUGO (Human Genome Organisation) Gene Nomenclature Committee (HGNC) and outline the steps already taken to remove this term from our gene names. We also highlight the latest nomenclature changes made as a result of discussions with the authors and agreement with the European Fragile X Network.
Collapse
Affiliation(s)
- Elspeth Bruford
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge CB2 0PT, UK
- HUGO Gene Nomenclature Committee, EMBL-EBI, European Molecular Biology Laboratory, Wellcome Genome Campus, Hinxton CB10 1SD, UK;
- Correspondence:
| | | |
Collapse
|