Total Articles

4
(from Reference Citation Analysis)

Searched Name

Enaitz Ezpeleta

Vélez de Mendizabal I, Basto-Fernandes V, Ezpeleta E, Méndez JR, Gómez-Meire S, Zurutuza U. Multi-objective evolutionary optimization for dimensionality reduction of texts represented by synsets. PeerJ Comput Sci 2023;9:e1240. [PMID: 37346554 PMCID: PMC10280406 DOI: 10.7717/peerj-cs.1240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/20/2022] [Accepted: 01/13/2023] [Indexed: 06/23/2023]

Abstract

Despite new developments in machine learning classification techniques, improving the accuracy of spam filtering remains difficult because linguistic phenomena limit its effectiveness. In particular, we highlight polysemy, synonymy, the usage of hypernyms/hyponyms, and the presence of irrelevant/confusing words. These problems should be solved at the pre-processing stage to avoid using inconsistent information when building classification models. Previous studies have suggested that synset-based representation strategies can solve synonymy and polysemy problems. In addition, hyponymy/hypernymy relationships can be exploited to implement dimensionality reduction strategies. These strategies can unify textual terms to model the intentions of the document without losing any information (e.g., bringing together the synsets "viagra", "cialis", "levitra" and others representing similar drugs under "virility drug", which is a hypernym of all of them). These feature reduction schemes are known as lossless strategies because the information is not removed but only generalised. However, in some types of text classification problems (such as spam filtering) it may not be worthwhile to keep all the information, and it may be preferable to let dimensionality reduction algorithms discard information that is irrelevant or confusing. In this work, we formulate feature reduction as a multi-objective optimisation problem to be solved using a Multi-Objective Evolutionary Algorithm (MOEA). With minor modifications, our algorithm can implement lossless (using only semantic-based synset grouping), low-loss (discarding irrelevant information and using semantic-based synset grouping) or lossy (discarding only irrelevant information) strategies.
The contribution of this study is two-fold: (i) to introduce different dimensionality reduction methods (lossless, low-loss and lossy) as an optimisation problem that can be solved using a MOEA and (ii) to provide an experimental comparison of lossless and low-loss schemes for text representation. The results obtained support the usefulness of the low-loss method for improving the efficiency of classifiers.
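The three strategies named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the hypernym map is a hypothetical toy dictionary rather than WordNet data, the set of irrelevant features is supplied by hand, and the two objectives (feature count and error rate, both minimised) are assumptions, since the abstract does not list the objectives used. Only the Pareto-dominance criterion that a MOEA relies on is shown, not the evolutionary search itself.

```python
from collections import Counter

# Hypothetical toy hypernym map (an assumption for illustration, not WordNet data).
HYPERNYM = {
    "viagra": "virility drug",
    "cialis": "virility drug",
    "levitra": "virility drug",
}

def generalise(doc, hypernym=HYPERNYM):
    """Lossless-style step: fold each synset count into its hypernym."""
    out = Counter()
    for synset, count in doc.items():
        out[hypernym.get(synset, synset)] += count
    return dict(out)

def discard(doc, irrelevant):
    """Lossy-style step: drop features judged irrelevant."""
    return {s: c for s, c in doc.items() if s not in irrelevant}

def dominates(a, b):
    """True if objective vector a Pareto-dominates b (all objectives minimised)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def pareto_front(candidates):
    """Keep the candidates that no other candidate dominates."""
    return [c for c in candidates if not any(dominates(o, c) for o in candidates)]

# A document as synset counts (made-up values).
doc = {"viagra": 2, "levitra": 1, "offer": 3, "the": 5}
lossless = generalise(doc)                    # grouping only
lossy = discard(doc, {"the"})                 # discarding only
low_loss = generalise(discard(doc, {"the"}))  # discarding, then grouping

# Assumed objectives: (number of features, error rate); the values are made up.
front = pareto_front([(4, 0.10), (3, 0.10), (2, 0.15), (3, 0.20)])
```

In this toy run the lossless step keeps the total count of the three drug synsets under a single feature, while the low-loss variant additionally removes the stop word before grouping.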

de Mendizabal IV, Basto-Fernandes V, Ezpeleta E, Méndez JR, Zurutuza U. SDRS: A new lossless dimensionality reduction for text corpora. Inf Process Manag 2020. [DOI: 10.1016/j.ipm.2020.102249] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 11/24/2022]

Mar J, Gorostiza A, Ibarrondo O, Cernuda C, Arrospide A, Iruin Á, Larrañaga I, Tainta M, Ezpeleta E, Alberdi A. Validation of Random Forest Machine Learning Models to Predict Dementia-Related Neuropsychiatric Symptoms in Real-World Data. J Alzheimers Dis 2020;77:855-864. [PMID: 32741825 PMCID: PMC7592688 DOI: 10.3233/jad-200345] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Accepted: 06/24/2020] [Indexed: 12/14/2022]

Affiliation(s)

Javier Mar: Basque Health Service (Osakidetza), Debagoiena Integrated Healthcare Organisation, Research Unit, Arrasate-Mondragón, Guipúzcoa, Spain; Kronikgune Institute for Health Service Research, Barakaldo, Spain; Biodonostia Health Research Institute, Donostia-San Sebastián, Guipúzcoa, Spain; Health Services Research on Chronic Patients Network (REDISSEC), Bilbao, Vizcaya, Spain
Ania Gorostiza: Basque Health Service (Osakidetza), Debagoiena Integrated Healthcare Organisation, Research Unit, Arrasate-Mondragón, Guipúzcoa, Spain; Kronikgune Institute for Health Service Research, Barakaldo, Spain
Oliver Ibarrondo: Basque Health Service (Osakidetza), Debagoiena Integrated Healthcare Organisation, Research Unit, Arrasate-Mondragón, Guipúzcoa, Spain; Kronikgune Institute for Health Service Research, Barakaldo, Spain; Biodonostia Health Research Institute, Donostia-San Sebastián, Guipúzcoa, Spain
Carlos Cernuda: Mondragon Unibertsitatea, Faculty of Engineering, Electronics and Computing Department, Arrasate-Mondragon, Gipuzkoa, Spain
Arantzazu Arrospide: Basque Health Service (Osakidetza), Debagoiena Integrated Healthcare Organisation, Research Unit, Arrasate-Mondragón, Guipúzcoa, Spain; Kronikgune Institute for Health Service Research, Barakaldo, Spain; Biodonostia Health Research Institute, Donostia-San Sebastián, Guipúzcoa, Spain; Health Services Research on Chronic Patients Network (REDISSEC), Bilbao, Vizcaya, Spain
Álvaro Iruin: Biodonostia Health Research Institute, Donostia-San Sebastián, Guipúzcoa, Spain; Basque Health Service (Osakidetza), Gipuzkoa Mental Health Network, Donostia-San Sebastián, Guipúzcoa, Spain
Igor Larrañaga: Basque Health Service (Osakidetza), Debagoiena Integrated Healthcare Organisation, Research Unit, Arrasate-Mondragón, Guipúzcoa, Spain; Kronikgune Institute for Health Service Research, Barakaldo, Spain
Mikel Tainta: Kronikgune Institute for Health Service Research, Barakaldo, Spain; Department of Neurology, Basque Health Service (Osakidetza), Goierri-Urola Garaia Integrated Healthcare Organisation, Zumarraga, Guipúzcoa, Spain; Fundación CITA-Alzheimer Fundazioa, Donostia-San Sebastián, Guipúzcoa, Spain
Enaitz Ezpeleta: Mondragon Unibertsitatea, Faculty of Engineering, Electronics and Computing Department, Arrasate-Mondragon, Gipuzkoa, Spain
Ane Alberdi: Mondragon Unibertsitatea, Faculty of Engineering, Electronics and Computing Department, Arrasate-Mondragon, Gipuzkoa, Spain

Ezpeleta E, Garitano I, Zurutuza U, Hidalgo JMG. Short Messages Spam Filtering Combining Personality Recognition and Sentiment Analysis. Int J Uncertain Fuzz 2017. [DOI: 10.1142/s0218488517400177] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Indexed: 11/18/2022]