1
|
Laajala TD, Sreekanth V, Soupir AC, Creed JH, Halkola AS, Calboli FCF, Singaravelu K, Orman MV, Colin-Leitzinger C, Gerke T, Fridley BL, Tyekucheva S, Costello JC. A harmonized resource of integrated prostate cancer clinical, -omic, and signature features. Sci Data 2023; 10:430. [PMID: 37407670 PMCID: PMC10322899 DOI: 10.1038/s41597-023-02335-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Accepted: 06/27/2023] [Indexed: 07/07/2023] Open
Abstract
Genomic and transcriptomic data have been generated across a wide range of prostate cancer (PCa) study cohorts. These data can be used to better characterize the molecular features associated with clinical outcomes and to test hypotheses across multiple, independent patient cohorts. In addition, derived features, such as estimates of cell composition, risk scores, and androgen receptor (AR) scores, can be used to develop novel hypotheses leveraging existing multi-omic datasets. The full potential of such data is yet to be realized as independent datasets exist in different repositories, have been processed using different pipelines, and derived and clinical features are often not provided or not standardized. Here, we present the curatedPCaData R package, a harmonized data resource representing >2900 primary tumor, >200 normal tissue, and >500 metastatic PCa samples across 19 datasets processed using standardized pipelines with updated gene annotations. We show that meta-analysis across harmonized studies has great potential for robust and clinically meaningful insights. curatedPCaData is an open and accessible community resource with code made available for reproducibility.
Collapse
Affiliation(s)
- Teemu D Laajala
- Department of Mathematics and Statistics, University of Turku, Turku, Finland.
- Department of Pharmacology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
| | - Varsha Sreekanth
- Department of Pharmacology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Alex C Soupir
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, FL, USA
| | - Jordan H Creed
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, FL, USA
| | - Anni S Halkola
- Department of Mathematics and Statistics, University of Turku, Turku, Finland
| | - Federico C F Calboli
- Department of Mathematics and Statistics, University of Turku, Turku, Finland
- Natural Resources Institute Finland (Luke), F-31600, Jokioinen, Finland
| | | | - Michael V Orman
- Department of Pharmacology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | | | - Travis Gerke
- Department of Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL, USA
| | - Brooke L Fridley
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, FL, USA
| | - Svitlana Tyekucheva
- Department of Data Science, Dana-Farber Cancer Institute; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
| | - James C Costello
- Department of Pharmacology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
- University of Colorado Cancer Center, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
| |
Collapse
|
2
|
Borziak K, Finkelstein J. Gene Expression Markers of Prognostic Importance for Prostate Cancer Risk in Patients with Benign Prostate Hyperplasia. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2022; 2022:73-76. [PMID: 36086411 DOI: 10.1109/embc48229.2022.9871422] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Comparative analyses utilizing publicly available big data have the potential to generate novel hypotheses and knowledge. However, this approach is underutilized in the realm of cancer research, particularly for prostate cancer. While the general progression of prostate cancer is now well understood, how individual cell types transition from healthy, to pre-cancerous, to cancerous cell types, remains to be further elucidated. To address this, we re-analyzed two publicly available single-cell RNA-seq datasets of prostate cancer and benign prostate hyperplasia cell types. The differential expression analysis of 15,505 epithelial cell profiles across 18,638 genes revealed 791 genes that were up regulated in prostate cancer epithelial cells. Here we report six markers that show significant upregulation in prostate cancer cells relative to BPH epithelial cells: HPN (5.62X), RAC3 (3.51X), CD24 (2.18X), HOXC6 (1.77X), AGR2 (1.71X), and IGFBP2 (1.28X). In particular, the significant differential expression of AGR2 further supports its clinical relevance in supplementing prostate-specific antigen screening for detecting prostate cancer. These findings have the potential to further advance our knowledge of genes governing the development of cancer in prostate epithelial cells. Clinical Relevance- Our results establish the importance of 6 prostate cancer markers (HPN, RAC3, CD24, HOXC6, AGR2, and IGFBP3) in distinguishing between prostate cancer epithelial cells and benign prostate hyperplasia epithelial cells.
Collapse
|
3
|
Díaz de la Guardia-Bolívar E, Barrios-Rodríguez R, Zwir I, Jiménez-Moleón JJ, Del Val C. Identification of novel prostate cancer genes in patients stratified by Gleason classification: role of antitumoral genes. Int J Cancer 2022; 151:255-264. [PMID: 35234293 PMCID: PMC9311191 DOI: 10.1002/ijc.33988] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Revised: 02/03/2022] [Accepted: 02/16/2022] [Indexed: 12/24/2022]
Abstract
Prostate cancer (PCa) is a tumor with a great heterogeneity, both at a molecular and clinical level. Despite its global good prognosis, cases can vary from indolent to lethal metastatic and scientific efforts are aimed to discern those with worse outcomes. Current prognostic markers, as Gleason score, fall short when it comes to distinguishing these cases. Identification of new early biomarkers to enable a better PCa distinction and classification remains a challenge. In order to identify new genes implicated in PCa progression we conducted several differential gene expression analyses over paired samples comparing primary PCa tissue against healthy prostatic tissue of PCa patients. The results obtained show that this approach is a serious alternative to overcome patient heterogeneity. We were able to identify 250 genes whose expression varies along with tissue differentiation—healthy to tumor tissue, 161 of these genes are described here for the first time to be related to PCa. The further manual curation of these genes allowed to annotate 39 genes with antitumoral activity, 22 of them described for the first time to be related to PCa proliferation and metastasis. These findings could be replicated in different cohorts for most genes. Results obtained considering paired differential expression, functional annotation and replication results point to: CGREF1, UNC5A, C16orf74, LGR6, IGSF1, QPRT and CA14 as possible new early markers in PCa. These genes may prevent the progression of the disease and their expression should be studied in patients with different outcomes.
Collapse
Affiliation(s)
- Elisa Díaz de la Guardia-Bolívar
- Department of Computer Science and Artificial Intelligence, Andalusian Research Institute in Data Science and Computational Intelligence (DaSCI), University of Granada, Granada, Spain
| | - Rocío Barrios-Rodríguez
- Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain.,Instituto de Investigación Biosanitaria ibs.GRANADA, Complejo Hospitales Universitarios de Granada/Universidad de Granada, Granada, Spain.,Universidad de Granada, Departamento de Medicina Preventiva y Salud Pública, Granada, España
| | - Igor Zwir
- Department of Computer Science and Artificial Intelligence, Andalusian Research Institute in Data Science and Computational Intelligence (DaSCI), University of Granada, Granada, Spain.,Instituto de Investigación Biosanitaria ibs.GRANADA, Complejo Hospitales Universitarios de Granada/Universidad de Granada, Granada, Spain
| | - José Juan Jiménez-Moleón
- Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain.,Instituto de Investigación Biosanitaria ibs.GRANADA, Complejo Hospitales Universitarios de Granada/Universidad de Granada, Granada, Spain.,Universidad de Granada, Departamento de Medicina Preventiva y Salud Pública, Granada, España
| | - Coral Del Val
- Department of Computer Science and Artificial Intelligence, Andalusian Research Institute in Data Science and Computational Intelligence (DaSCI), University of Granada, Granada, Spain.,Instituto de Investigación Biosanitaria ibs.GRANADA, Complejo Hospitales Universitarios de Granada/Universidad de Granada, Granada, Spain
| |
Collapse
|
4
|
Gumaei A, Sammouda R, Al-Rakhami M, AlSalman H, El-Zaart A. Feature selection with ensemble learning for prostate cancer diagnosis from microarray gene expression. Health Informatics J 2021; 27:1460458221989402. [PMID: 33570011 DOI: 10.1177/1460458221989402] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Cancer diagnosis using machine learning algorithms is one of the main topics of research in computer-based medical science. Prostate cancer is considered one of the reasons that are leading to deaths worldwide. Data analysis of gene expression from microarray using machine learning and soft computing algorithms is a useful tool for detecting prostate cancer in medical diagnosis. Even though traditional machine learning methods have been successfully applied for detecting prostate cancer, the large number of attributes with a small sample size of microarray data is still a challenge that limits their ability for effective medical diagnosis. Selecting a subset of relevant features from all features and choosing an appropriate machine learning method can exploit the information of microarray data to improve the accuracy rate of detection. In this paper, we propose to use a correlation feature selection (CFS) method with random committee (RC) ensemble learning to detect prostate cancer from microarray data of gene expression. A set of experiments are conducted on a public benchmark dataset using 10-fold cross-validation technique to evaluate the proposed approach. The experimental results revealed that the proposed approach attains 95.098% accuracy, which is higher than related work methods on the same dataset.
Collapse
Affiliation(s)
- Abdu Gumaei
- Research Chair of Pervasive and Mobile Computing, King Saud University, Saudi Arabia.,Taiz University, Yemen
| | | | - Mabrook Al-Rakhami
- Research Chair of Pervasive and Mobile Computing, King Saud University, Saudi Arabia
| | | | | |
Collapse
|
5
|
Moses DA. Editorial for “Prostate Cancer Risk Stratification in Men With a Clinical Suspicion of Prostate Cancer Using a Unique Biparametric MRI and Expression of 11 Genes in Apparently Benign Tissue: Evaluation Using Machine‐Learning Techniques”. J Magn Reson Imaging 2020; 51:1554-1555. [DOI: 10.1002/jmri.27135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Accepted: 03/03/2020] [Indexed: 11/09/2022] Open
Affiliation(s)
- Daniel A. Moses
- Department of RadiologyPrince of Wales Hospital Randwick New South Wales Australia
- School of Biomedical Engineering, Faculty of EngineeringUniversity of New South Wales Kensington New South Wales Australia
| |
Collapse
|
6
|
Ram PK, Kuila P. Feature selection from microarray data : Genetic algorithm based approach. JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES 2020. [DOI: 10.1080/02522667.2019.1703260] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Affiliation(s)
- Pintu Kumar Ram
- Department of Computer Science & Engineering, National Institute of Technology Sikkim, Ravangla 737139, Sikkim, India
| | - Pratyay Kuila
- Department of Computer Science & Engineering, National Institute of Technology Sikkim, Ravangla 737139, Sikkim, India
| |
Collapse
|
7
|
Ram PK, Kuila P. Feature selection from microarray data : Genetic algorithm based approach. JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES 2019. [DOI: 10.1080/02522667.2019.1703260 10.1080/02522667.2019.1703260] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/16/2023]
Affiliation(s)
- Pintu Kumar Ram
- Department of Computer Science & Engineering, National Institute of Technology Sikkim, Ravangla 737139, Sikkim, India
| | - Pratyay Kuila
- Department of Computer Science & Engineering, National Institute of Technology Sikkim, Ravangla 737139, Sikkim, India
| |
Collapse
|
8
|
Rogers CR, Rovito MJ, Hussein M, Obidike OJ, Pratt R, Alexander M, Berge JM, Dall'Era M, Nix JW, Warlick C. Attitudes Toward Genomic Testing and Prostate Cancer Research Among Black Men. Am J Prev Med 2018; 55:S103-S111. [PMID: 30670195 PMCID: PMC6352989 DOI: 10.1016/j.amepre.2018.05.028] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/18/2018] [Revised: 03/18/2018] [Accepted: 05/24/2018] [Indexed: 11/16/2022]
Abstract
INTRODUCTION Black men are diagnosed with prostate cancer at nearly twice the rate of white men and are underrepresented in prostate cancer research, including validation studies of new clinical tools (e.g., genomic testing). Because healthcare system mistrust has contributed to these disparities for centuries, black men may be less inclined to pursue novel testing, and identification of facilitators to their participation in prostate cancer research studies remains warranted. METHODS A community-engaged approach involving a partnership with a community organization was used to conduct seven focus groups in Minnesota, Alabama, and California to explore black men's attitudes toward prostate cancer research participation and genomic testing for prostate cancer. Data were collected and analyzed from April 2015 to April 2017. RESULTS Identified genomic testing barriers included a lack of terminology understanding, healthcare system mistrust, reluctance to seek medical care, and unfavorable attitudes toward research. Facilitators included family history, value of prevention, and the desire for health education. Lack of prostate cancer knowledge, prostate-specific antigen testing confusion, healthcare system distrust, and misuse of personal health information were barriers to research study participation. Some black men were motivated to participate in research if it was seen as constructive and transparent. CONCLUSIONS Disparities for black men can both motivate and disincentivize participation depending upon a positive or negative view of research. Confusion over prostate cancer clinical care has fueled some mistrust among black men affecting both clinical care and research participation. With increased education, health literacy, and assurances of research integrity and transparency, black men may be more willing to participate in prostate cancer testing and research. SUPPLEMENT INFORMATION This article is part of a supplement entitled African American Men's Health: Research, Practice, and Policy Implications, which is sponsored by the National Institutes of Health.
Collapse
Affiliation(s)
- Charles R Rogers
- Department of Family and Preventive Medicine, University of Utah School of Medicine, Salt Lake City, Utah.
| | - Michael J Rovito
- Department of Health Professions, University of Central Florida, Orlando, Florida
| | - Musse Hussein
- Department of Family Medicine and Community Health, University of Minnesota Medical School, Minneapolis, Minnesota
| | | | - Rebekah Pratt
- Department of Family Medicine and Community Health, University of Minnesota Medical School, Minneapolis, Minnesota
| | - Mark Alexander
- Health and Wellness Committee, 100 Black Men of America, Inc., Oakland, California
| | - Jerica M Berge
- Department of Family Medicine and Community Health, University of Minnesota Medical School, Minneapolis, Minnesota
| | - Marc Dall'Era
- Department of Urology, University of California, Davis, Sacramento, California
| | - Jeffrey W Nix
- Department of Urology, University of Alabama at Birmingham, Birmingham, Alabama
| | - Christopher Warlick
- Department of Urology, University of Minnesota Medical School, Minneapolis, Minnesota
| |
Collapse
|
9
|
Genetic algorithm based cancerous gene identification from microarray data using ensemble of filter methods. Med Biol Eng Comput 2018; 57:159-176. [DOI: 10.1007/s11517-018-1874-4] [Citation(s) in RCA: 68] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2018] [Accepted: 07/12/2018] [Indexed: 12/25/2022]
|