Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Dudoit S, Fridlyand J, Speed TP. Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data. J Am Stat Assoc 2002. [DOI: 10.1198/016214502753479248] [Citation(s) in RCA: 1691] [Impact Index Per Article: 73.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

For:	Dudoit S, Fridlyand J, Speed TP. Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data. J Am Stat Assoc 2002. [DOI: 10.1198/016214502753479248] [Citation(s) in RCA: 1691] [Impact Index Per Article: 73.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Number

Cited by Other Article(s)

551

Oh JH, Gao J. Fast kernel discriminant analysis for classification of liver cancer mass spectra. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011;8:1522-1534. [PMID: 20479503 DOI: 10.1109/tcbb.2010.42] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]

552

Zheng S, Liu W. An experimental comparison of gene selection by Lasso and Dantzig selector for cancer classification. Comput Biol Med 2011;41:1033-40. [DOI: 10.1016/j.compbiomed.2011.08.011] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2011] [Revised: 08/29/2011] [Accepted: 08/30/2011] [Indexed: 01/28/2023]

553

McLachlan GJ, Rathnayake SI. Testing for Group Structure in High-Dimensional Data. J Biopharm Stat 2011;21:1113-25. [DOI: 10.1080/10543406.2011.608342] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]

554

Vu TN, Valkenborg D, Smets K, Verwaest KA, Dommisse R, Lemière F, Verschoren A, Goethals B, Laukens K. An integrated workflow for robust alignment and simplified quantitative analysis of NMR spectrometry data. BMC Bioinformatics 2011;12:405. [PMID: 22014236 PMCID: PMC3217056 DOI: 10.1186/1471-2105-12-405] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2011] [Accepted: 10/20/2011] [Indexed: 11/24/2022] Open

Abstract

Background

Nuclear magnetic resonance spectroscopy (NMR) is a powerful technique to reveal and compare quantitative metabolic profiles of biological tissues. However, chemical and physical sample variations make the analysis of the data challenging, and typically require the application of a number of preprocessing steps prior to data interpretation. For example, noise reduction, normalization, baseline correction, peak picking, spectrum alignment and statistical analysis are indispensable components in any NMR analysis pipeline.

Results

We introduce a novel suite of informatics tools for the quantitative analysis of NMR metabolomic profile data. The core of the processing cascade is a novel peak alignment algorithm, called hierarchical Cluster-based Peak Alignment (CluPA). The algorithm aligns a target spectrum to the reference spectrum in a top-down fashion by building a hierarchical cluster tree from peak lists of reference and target spectra and then dividing the spectra into smaller segments based on the most distant clusters of the tree. To reduce the computational time to estimate the spectral misalignment, the method makes use of Fast Fourier Transformation (FFT) cross-correlation. Since the method returns a high-quality alignment, we can propose a simple methodology to study the variability of the NMR spectra. For each aligned NMR data point the ratio of the between-group and within-group sum of squares (BW-ratio) is calculated to quantify the difference in variability between and within predefined groups of NMR spectra. This differential analysis is related to the calculation of the F-statistic or a one-way ANOVA, but without distributional assumptions. Statistical inference based on the BW-ratio is achieved by bootstrapping the null distribution from the experimental data.

Conclusions

The workflow performance was evaluated using a previously published dataset. Correlation maps, spectral and grey scale plots show clear improvements in comparison to other methods, and the down-to-earth quantitative analysis works well for the CluPA-aligned spectra. The whole workflow is embedded into a modular and statistically sound framework that is implemented as an R package called "speaq" ("spectrum alignment and quantitation"), which is freely available from http://code.google.com/p/speaq/.

Collapse

555

Kim SK, Yun SJ, Kim J, Lee OJ, Bae SC, Kim WJ. Identification of gene expression signature modulated by nicotinamide in a mouse bladder cancer model. PLoS One 2011;6:e26131. [PMID: 22028816 PMCID: PMC3189956 DOI: 10.1371/journal.pone.0026131] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2011] [Accepted: 09/20/2011] [Indexed: 12/20/2022] Open

Abstract

BACKGROUND

Urinary bladder cancer is often a result of exposure to chemical carcinogens such as cigarette smoking. Because of histological similarity, chemically-induced rodent cancer model was largely used for human bladder cancer studies. Previous investigations have suggested that nicotinamide, water-soluble vitamin B3, may play a key role in cancer prevention through its activities in cellular repair. However, to date, evidence towards identifying the genetic alterations of nicotinamide in cancer prevention has not been provided. Here, we search for the molecular signatures of cancer prevention by nicotinamide using a N-butyl-N-(4-hydroxybutyl)-nitrosamine (BBN)-induced urinary bladder cancer model in mice.

METHODOLOGY/PRINCIPAL FINDINGS

Via microarray gene expression profiling of 20 mice and 233 human bladder samples, we performed various statistical analyses and immunohistochemical staining for validation. The expression patterns of 893 genes associated with nicotinamide activity in cancer prevention were identified by microarray data analysis. Gene network analyses of these 893 genes revealed that the Myc and its associated genes may be the most important regulator of bladder cancer prevention, and the gene expression signature correlated well with protein expression data. Comparison of gene expression between human and mouse revealed that BBN-induced mouse bladder cancers exhibited gene expression profiles that were more similar to those of invasive human bladder cancers than to those of non-invasive human bladder cancers.

CONCLUSIONS/SIGNIFICANCE

This study demonstrates that nicotinamide plays an important role as a chemo-preventive and therapeutic agent in bladder cancer through the regulation of the Myc oncogenic signature. Nicotinamide may represent a promising therapeutic modality in patients with muscle-invasive bladder cancer.

Collapse

556

Wang X, Simon R. Microarray-based cancer prediction using single genes. BMC Bioinformatics 2011;12:391. [PMID: 21982331 PMCID: PMC3228540 DOI: 10.1186/1471-2105-12-391] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2011] [Accepted: 10/07/2011] [Indexed: 11/23/2022] Open

557

Lai Y, Wu B, Zhao H. A permutation test approach to the choice of sizekfor the nearest neighbors classifier. J Appl Stat 2011. [DOI: 10.1080/02664763.2010.547565] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

558

Huang GH, Wang SM, Hsu CC. Optimization-Based Model Fitting for Latent Class and Latent Profile Analyses. PSYCHOMETRIKA 2011;76:584-611. [PMID: 27519682 DOI: 10.1007/s11336-011-9227-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/10/2010] [Revised: 03/21/2011] [Indexed: 06/06/2023]

559

Aoshima M, Yata K. Two-Stage Procedures for High-Dimensional Data. Seq Anal 2011. [DOI: 10.1080/07474946.2011.619088] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]

560

Tong DL, Schierz AC. Hybrid genetic algorithm-neural network: Feature extraction for unpreprocessed microarray data. Artif Intell Med 2011;53:47-56. [DOI: 10.1016/j.artmed.2011.06.008] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2010] [Revised: 05/11/2011] [Accepted: 06/26/2011] [Indexed: 12/22/2022]

561

Tsai YS, Aguan K, Pal NR, Chung IF. Identification of single- and multiple-class specific signature genes from gene expression profiles by group marker index. PLoS One 2011;6:e24259. [PMID: 21909426 PMCID: PMC3164723 DOI: 10.1371/journal.pone.0024259] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2011] [Accepted: 08/06/2011] [Indexed: 01/06/2023] Open

Abstract

Informative genes from microarray data can be used to construct prediction model and investigate biological mechanisms. Differentially expressed genes, the main targets of most gene selection methods, can be classified as single- and multiple-class specific signature genes. Here, we present a novel gene selection algorithm based on a Group Marker Index (GMI), which is intuitive, of low-computational complexity, and efficient in identification of both types of genes. Most gene selection methods identify only single-class specific signature genes and cannot identify multiple-class specific signature genes easily. Our algorithm can detect de novo certain conditions of multiple-class specificity of a gene and makes use of a novel non-parametric indicator to assess the discrimination ability between classes. Our method is effective even when the sample size is small as well as when the class sizes are significantly different. To compare the effectiveness and robustness we formulate an intuitive template-based method and use four well-known datasets. We demonstrate that our algorithm outperforms the template-based method in difficult cases with unbalanced distribution. Moreover, the multiple-class specific genes are good biomarkers and play important roles in biological pathways. Our literature survey supports that the proposed method identifies unique multiple-class specific marker genes (not reported earlier to be related to cancer) in the Central Nervous System data. It also discovers unique biomarkers indicating the intrinsic difference between subtypes of lung cancer. We also associate the pathway information with the multiple-class specific signature genes and cross-reference to published studies. We find that the identified genes participate in the pathways directly involved in cancer development in leukemia data. Our method gives a promising way to find genes that can involve in pathways of multiple diseases and hence opens up the possibility of using an existing drug on other diseases as well as designing a single drug for multiple diseases.

Collapse

562

Zheng CH, Zhang L, Ng TY, Shiu SCK, Huang DS. Metasample-based sparse representation for tumor classification. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011;8:1273-1282. [PMID: 21282864 DOI: 10.1109/tcbb.2011.20] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]

563

Unimodal transform of variables selected by interval segmentation purity for classification tree modeling of high-dimensional microarray data. Talanta 2011;85:1689-94. [DOI: 10.1016/j.talanta.2011.06.076] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2011] [Revised: 06/28/2011] [Accepted: 06/30/2011] [Indexed: 11/20/2022]

564

Strategy to find molecular signatures in a small series of rare cancers: validation for radiation-induced breast and thyroid tumors. PLoS One 2011;6:e23581. [PMID: 21853153 PMCID: PMC3154936 DOI: 10.1371/journal.pone.0023581] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2010] [Accepted: 07/21/2011] [Indexed: 11/28/2022] Open

565

Wang H, van der Laan MJ. Dimension reduction with gene expression data using targeted variable importance measurement. BMC Bioinformatics 2011;12:312. [PMID: 21849016 PMCID: PMC3166941 DOI: 10.1186/1471-2105-12-312] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2011] [Accepted: 07/29/2011] [Indexed: 11/10/2022] Open

566

Zheng CH, Ng TY, Zhang L, Shiu CK, Wang HQ. Tumor classification based on non-negative matrix factorization using gene expression data. IEEE Trans Nanobioscience 2011;10:86-93. [PMID: 21742573 DOI: 10.1109/tnb.2011.2144998] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

567

Peters T, Bulger DW, Loi TH, Yang JYH, Ma D. Two-step cross-entropy feature selection for microarrays—power through complementarity. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011;8:1148-1151. [PMID: 21321369 DOI: 10.1109/tcbb.2011.30] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]

568

Zhao X, Cheung LWK. Multiclass kernel-imbedded Gaussian processes for microarray data analysis. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011;8:1041-1053. [PMID: 20805625 DOI: 10.1109/tcbb.2010.85] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]

569

Drake JI, Bogaard HJ, Mizuno S, Clifton B, Xie B, Gao Y, Dumur CI, Fawcett P, Voelkel NF, Natarajan R. Molecular signature of a right heart failure program in chronic severe pulmonary hypertension. Am J Respir Cell Mol Biol 2011;45:1239-47. [PMID: 21719795 DOI: 10.1165/rcmb.2010-0412oc] [Citation(s) in RCA: 169] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open

570

Novak D, Mihelj M, Ziherl J, Olenšek A, Munih M. Psychophysiological measurements in a biocooperative feedback loop for upper extremity rehabilitation. IEEE Trans Neural Syst Rehabil Eng 2011;19:400-10. [PMID: 21708507 DOI: 10.1109/tnsre.2011.2160357] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

571

Lê Cao KA, Boitard S, Besse P. Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems. BMC Bioinformatics 2011;12:253. [PMID: 21693065 PMCID: PMC3133555 DOI: 10.1186/1471-2105-12-253] [Citation(s) in RCA: 601] [Impact Index Per Article: 42.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2010] [Accepted: 06/22/2011] [Indexed: 11/24/2022] Open

572

Feiping Nie, Dong Xu, Xuelong Li, Shiming Xiang. Semisupervised Dimensionality Reduction and Classification Through Virtual Label Regression. ACTA ACUST UNITED AC 2011;41:675-85. [DOI: 10.1109/tsmcb.2010.2085433] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

573

Park SY, Liu Y. Robust penalized logistic regression with truncated loss functions. CAN J STAT 2011;39:300-323. [PMID: 22162902 DOI: 10.1002/cjs.10105] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

574

Cheng Q, Zhou H, Cheng J. The Fisher-Markov selector: fast selecting maximally separable feature subset for multiclass classification with applications to high-dimensional data. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2011;33:1217-1233. [PMID: 21493968 DOI: 10.1109/tpami.2010.195] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]

Abstract

Selecting features for multiclass classification is a critically important task for pattern recognition and machine learning applications. Especially challenging is selecting an optimal subset of features from high-dimensional data, which typically have many more variables than observations and contain significant noise, missing components, or outliers. Existing methods either cannot handle high-dimensional data efficiently or scalably, or can only obtain local optimum instead of global optimum. Toward the selection of the globally optimal subset of features efficiently, we introduce a new selector--which we call the Fisher-Markov selector--to identify those features that are the most useful in describing essential differences among the possible groups. In particular, in this paper we present a way to represent essential discriminating characteristics together with the sparsity as an optimization objective. With properly identified measures for the sparseness and discriminativeness in possibly high-dimensional settings, we take a systematic approach for optimizing the measures to choose the best feature subset. We use Markov random field optimization techniques to solve the formulated objective functions for simultaneous feature selection. Our results are noncombinatorial, and they can achieve the exact global optimum of the objective function for some special kernels. The method is fast; in particular, it can be linear in the number of features and quadratic in the number of observations. We apply our procedure to a variety of real-world data, including mid--dimensional optical handwritten digit data set and high-dimensional microarray gene expression data sets. The effectiveness of our method is confirmed by experimental results. In pattern recognition and from a model selection viewpoint, our procedure says that it is possible to select the most discriminating subset of variables by solving a very simple unconstrained objective function which in fact can be obtained with an explicit expression.

Collapse

575

Saintigny P, Zhang L, Fan YH, El-Naggar AK, Papadimitrakopoulou VA, Feng L, Lee JJ, Kim ES, Ki Hong W, Mao L. Gene expression profiling predicts the development of oral cancer. Cancer Prev Res (Phila) 2011;4:218-29. [PMID: 21292635 DOI: 10.1158/1940-6207.capr-10-0155] [Citation(s) in RCA: 106] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

576

Ashton-Chess J, Cervino AC. Development of commercial gene-expression-based signatures: review of the scientific strategies. Per Med 2011;8:253-269. [PMID: 29783527 DOI: 10.2217/pme.10.84] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

577

Fisher TJ, Sun X. Improved Stein-type shrinkage estimators for the high-dimensional multivariate normal covariance matrix. Comput Stat Data Anal 2011. [DOI: 10.1016/j.csda.2010.12.006] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

578

Ghorai S, Mukherjee A, Sengupta S, Dutta PK. Cancer classification from gene expression data by NPPC ensemble. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011;8:659-671. [PMID: 20479504 DOI: 10.1109/tcbb.2010.36] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]

579

Taylor SL, Kim K. A jackknife and voting classifier approach to feature selection and classification. Cancer Inform 2011;10:133-47. [PMID: 21584263 PMCID: PMC3091410 DOI: 10.4137/cin.s7111] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open

580

Mueller RS, Dill BD, Pan C, Belnap CP, Thomas BC, VerBerkmoes NC, Hettich RL, Banfield JF. Proteome changes in the initial bacterial colonist during ecological succession in an acid mine drainage biofilm community. Environ Microbiol 2011;13:2279-92. [PMID: 21518216 DOI: 10.1111/j.1462-2920.2011.02486.x] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]

581

Shen Y, Lin Z, Zhu J. Penalized Independence Rule for Testing High-Dimensional Hypotheses. COMMUN STAT-THEOR M 2011. [DOI: 10.1080/03610926.2010.484160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

582

Lee C, Nkounkou B, Huang CH. Comparison of LDA and SPRT on Clinical Dataset Classifications. BIOMEDICAL INFORMATICS INSIGHTS 2011;4:1-7. [PMID: 21949476 PMCID: PMC3178328 DOI: 10.4137/bii.s6935] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]

583

Huang S, Tong T, Zhao H. Bias-corrected diagonal discriminant rules for high-dimensional classification. Biometrics 2011;66:1096-106. [PMID: 20222939 DOI: 10.1111/j.1541-0420.2010.01395.x] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

584

Identification by random forest method of HLA class I amino acid substitutions associated with lower survival at day 100 in unrelated donor hematopoietic cell transplantation. Bone Marrow Transplant 2011;47:217-26. [PMID: 21441965 PMCID: PMC3128239 DOI: 10.1038/bmt.2011.56] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

585

Derivation of cancer diagnostic and prognostic signatures from gene expression data. Bioanalysis 2011;2:855-62. [PMID: 21083217 DOI: 10.4155/bio.10.35] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open

586

NIKULIN VLADIMIR, HUANG TIANHSIANG, MCLACHLAN GEOFFREYJ. CLASSIFICATION OF HIGH-DIMENSIONAL MICROARRAY DATA WITH A TWO-STEP PROCEDURE VIA A WILCOXON CRITERION AND MULTILAYER PERCEPTRON. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS 2011. [DOI: 10.1142/s1469026811002969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

587

Chakraborty S, Guo R. A Bayesian hybrid Huberized support vector machine and its applications in high-dimensional medical data. Comput Stat Data Anal 2011. [DOI: 10.1016/j.csda.2010.09.024] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

588

Tapia E, Ornella L, Bulacio P, Angelone L. Multiclass classification of microarray data samples with a reduced number of genes. BMC Bioinformatics 2011;12:59. [PMID: 21342522 PMCID: PMC3056725 DOI: 10.1186/1471-2105-12-59] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2010] [Accepted: 02/22/2011] [Indexed: 01/05/2023] Open

589

Yao C, Zhang M, Zou J, Li H, Wang D, Zhu J, Guo Z. Functional modules with disease discrimination abilities for various cancers. SCIENCE CHINA-LIFE SCIENCES 2011;54:189-93. [PMID: 21318490 DOI: 10.1007/s11427-010-4129-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2009] [Accepted: 09/22/2009] [Indexed: 12/13/2022]

590

De Bin R, Risso D. A novel approach to the clustering of microarray data via nonparametric density estimation. BMC Bioinformatics 2011;12:49. [PMID: 21303507 PMCID: PMC3042915 DOI: 10.1186/1471-2105-12-49] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2010] [Accepted: 02/08/2011] [Indexed: 11/21/2022] Open

591

Dagliyan O, Uney-Yuksektepe F, Kavakli IH, Turkay M. Optimization based tumor classification from microarray gene expression data. PLoS One 2011;6:e14579. [PMID: 21326602 PMCID: PMC3033885 DOI: 10.1371/journal.pone.0014579] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2010] [Accepted: 12/23/2010] [Indexed: 11/20/2022] Open

Abstract

BACKGROUND

An important use of data obtained from microarray measurements is the classification of tumor types with respect to genes that are either up or down regulated in specific cancer types. A number of algorithms have been proposed to obtain such classifications. These algorithms usually require parameter optimization to obtain accurate results depending on the type of data. Additionally, it is highly critical to find an optimal set of markers among those up or down regulated genes that can be clinically utilized to build assays for the diagnosis or to follow progression of specific cancer types. In this paper, we employ a mixed integer programming based classification algorithm named hyper-box enclosure method (HBE) for the classification of some cancer types with a minimal set of predictor genes. This optimization based method which is a user friendly and efficient classifier may allow the clinicians to diagnose and follow progression of certain cancer types.

METHODOLOGY/PRINCIPAL FINDINGS

We apply HBE algorithm to some well known data sets such as leukemia, prostate cancer, diffuse large B-cell lymphoma (DLBCL), small round blue cell tumors (SRBCT) to find some predictor genes that can be utilized for diagnosis and prognosis in a robust manner with a high accuracy. Our approach does not require any modification or parameter optimization for each data set. Additionally, information gain attribute evaluator, relief attribute evaluator and correlation-based feature selection methods are employed for the gene selection. The results are compared with those from other studies and biological roles of selected genes in corresponding cancer type are described.

CONCLUSIONS/SIGNIFICANCE

The performance of our algorithm overall was better than the other algorithms reported in the literature and classifiers found in WEKA data-mining package. Since it does not require a parameter optimization and it performs consistently very high prediction rate on different type of data sets, HBE method is an effective and consistent tool for cancer type prediction with a small number of gene markers.

Collapse

592

Mining gene expression profiles: an integrated implementation of kernel principal component analysis and singular value decomposition. GENOMICS PROTEOMICS & BIOINFORMATICS 2011;8:200-10. [PMID: 20970748 PMCID: PMC5054124 DOI: 10.1016/s1672-0229(10)60022-8] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

593

Zheng CH, Chong YW, Wang HQ. Gene selection using independent variable group analysis for tumor classification. Neural Comput Appl 2011. [DOI: 10.1007/s00521-010-0513-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

594

Chuang LY, Yang CH, Li JC, Yang CH. A hybrid BPSO-CGA approach for gene selection and classification of microarray data. J Comput Biol 2011;19:68-82. [PMID: 21210743 DOI: 10.1089/cmb.2010.0064] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

595

Bayesian semi-supervised learning with support vector machine. ACTA ACUST UNITED AC 2011. [DOI: 10.1016/j.stamet.2009.09.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]

596

Mao KZ, Tang W. Recursive Mahalanobis separability measure for gene subset selection. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011;8:266-272. [PMID: 20479500 DOI: 10.1109/tcbb.2010.43] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]

597

A novel hybrid feature selection method for microarray data analysis. Appl Soft Comput 2011. [DOI: 10.1016/j.asoc.2009.11.010] [Citation(s) in RCA: 119] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

598

Top Scoring Pair Decision Tree for Gene Expression Data Analysis. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2011;696:27-35. [DOI: 10.1007/978-1-4419-7046-6_3] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]

599

Wang Q, Li HD, Xu QS, Liang YZ. Noise incorporated subwindow permutation analysis for informative gene selection using support vector machines. Analyst 2011;136:1456-63. [DOI: 10.1039/c0an00667j] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

600

Pinto da Costa JF, Alonso H, Roque L. A weighted principal component analysis and its application to gene expression data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011;8:246-252. [PMID: 21071812 DOI: 10.1109/tcbb.2009.61] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]