Macedo DC, Ishikawa ECM, Santos CB, Matos SN, Borges HB, Francisco AC. Proposed method for dimensionality reduction based on framework in gene expression domain. Genet Mol Res 2014; 13:10582-91. [PMID: 25511043 DOI: 10.4238/2014.december.12.21]
[Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 11/03/2022]
Abstract
An excessive number of attributes can hinder the search for patterns and the extraction of useful knowledge, because it degrades the learning performance of algorithms in both speed and success rate. Dimensionality reduction methods are therefore an important alternative; however, existing methods do not address the reduction of attributes for a specific domain. This article presents a method, based on domain framework concepts, for reducing attributes within a domain. The input to the method is a set of databases related to a domain, and the main process is the identification of common and variable attributes, followed by the reduction of attributes in the original database. The proposed method was applied in the gene expression domain, using three databases. The method can be used to analyze the most relevant attributes in a specific domain, giving greater confidence in models built for a data mining task. A previously known data mining method, attribute selection, was also applied to the three databases to compare the results. Analysis of the results using cross-validation showed that applying the methods improved success rates compared with the databases containing the full set of attributes.
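The step of identifying common and variable attributes can be illustrated with a minimal Python sketch. It rests on an assumption that the abstract does not confirm: "common" is read here as attributes present in every database of the domain, and "variable" as attributes present in only some of them. The function names, database names, and gene labels are hypothetical and are not taken from the article.

```python
from typing import Dict, List, Set, Tuple

Row = Dict[str, float]


def split_common_variable(
    attrs_by_db: Dict[str, Set[str]]
) -> Tuple[Set[str], Set[str]]:
    """Split attribute names into (common, variable) across databases.

    Assumption: common = present in every database,
    variable = present in at least one database but not all.
    """
    attr_sets = list(attrs_by_db.values())
    if not attr_sets:
        return set(), set()
    common = set.intersection(*attr_sets)
    variable = set.union(*attr_sets) - common
    return common, variable


def reduce_rows(rows: List[Row], keep: Set[str]) -> List[Row]:
    """Project each row onto the retained attributes only."""
    return [{k: v for k, v in row.items() if k in keep} for row in rows]


# Hypothetical toy data: three databases from one domain.
attrs_by_db = {
    "db_a": {"gene1", "gene2", "gene3"},
    "db_b": {"gene1", "gene2", "gene4"},
    "db_c": {"gene1", "gene2", "gene5"},
}
common, variable = split_common_variable(attrs_by_db)

# Reduce one hypothetical database to the common attributes.
reduced = reduce_rows([{"gene1": 0.5, "gene2": 1.2, "gene3": 0.1}], common)
```

Which attributes to retain after the split (common only, or common plus a selected subset of the variable ones) is a design choice that the abstract leaves open; the sketch keeps only the common ones for simplicity.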