1
|
Abstract
Recently, outlier detection has widespread applications in different areas. The task is to identify outliers in the dataset and extract potential information. The existing outlier detection algorithms mainly do not solve the problems of parameter selection and high computational cost, which leaves enough room for further improvements. To solve the above problems, our paper proposes a parameter-free outlier detection algorithm based on dataset optimization method. Firstly, we propose a dataset optimization method (DOM), which initializes the original dataset in which density is greater than a specific threshold. In this method, we propose the concepts of partition function (P) and threshold function (T). Secondly, we establish a parameter-free outlier detection method. Similarly, we propose the concept of the number of residual neighbors, as the number of residual neighbors and the size of data clusters are used as the basis of outlier detection to obtain a more accurate outlier set. Finally, extensive experiments are carried out on a variety of datasets and experimental results show that our method performs well in terms of the efficiency of outlier detection and time complexity.
Collapse
|
2
|
|
3
|
Wan J, Tang S, Zhang Y, Li J, Wu P, Hoi SC. HDIdx: High-dimensional indexing for efficient approximate nearest neighbor search. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2015.11.104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
4
|
|
5
|
TMR: Towards an efficient semantic-based heterogeneous transportation media big data retrieval. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2015.06.101] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
6
|
Chen J, Yan Tang Y, Philip Chen C, Fang B, Shang Z, Lin Y. NNMap: A method to construct a good embedding for nearest neighbor classification. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2014.11.014] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|