Noise Detection and Learning Based on Current Information

Authors

  • Damaris Pascual-González, Universidad de Oriente
  • Fernando Daniel Vázquez Mesa
  • Jorge Luis Toro Pozo

DOI:

https://doi.org/10.13053/cys-18-1-1593

Keywords:

Cleansing noise, data streams, semi-supervised learning, concept drift.

Abstract

Noise-cleaning methods are important in classification tasks and in situations that call for semi-supervised learning, because well-labeled samples (prototypes) are needed to classify new patterns. In this work, we present a new algorithm for detecting noise in data streams that takes into account changes in concepts over time (concept drift). The algorithm is based on neighborhood criteria, and its application involves the construction of a training set. In our experiments we used both synthetic and real databases; the latter were taken from the UCI repository. The results support our proposal for noise detection in data streams and classification processes.
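
The abstract does not specify the algorithm, so the following is only a minimal illustrative sketch of the general idea of neighborhood-based noise detection over recent stream data. It is not the authors' method. It assumes a Wilson-style editing rule (a sample is flagged when its label disagrees with the majority label of its k nearest neighbors) and a fixed-size sliding window as a simple stand-in for handling concept drift. The names `knn_noise_flags`, `clean_recent`, `window_size`, and `k` are hypothetical.

```python
import math
from collections import Counter, deque


def knn_noise_flags(window, k=3):
    """Flag each (features, label) pair whose label disagrees with the
    majority label of its k nearest neighbours in the same window.

    Assumption: Euclidean distance and simple majority voting; the paper
    may use a different neighbourhood criterion.
    """
    flags = []
    for i, (x, y) in enumerate(window):
        # Collect (distance, label) for every other sample in the window.
        others = [
            (math.dist(x, other_x), other_y)
            for j, (other_x, other_y) in enumerate(window)
            if j != i
        ]
        # Keep the k closest samples, ordered by distance only.
        neighbours = sorted(others, key=lambda pair: pair[0])[:k]
        if not neighbours:
            flags.append(False)  # a lone sample cannot be judged
            continue
        votes = Counter(label for _, label in neighbours)
        majority_label, _ = votes.most_common(1)[0]
        flags.append(majority_label != y)
    return flags


def clean_recent(stream, window_size=50, k=3):
    """Keep only the most recent `window_size` samples of the stream and
    return those not flagged as noise.

    Assumption: discarding old samples is used here as a crude way to
    follow concept drift; it is not taken from the paper.
    """
    window = deque(maxlen=window_size)  # old samples fall out automatically
    for sample in stream:
        window.append(sample)
    recent = list(window)
    flags = knn_noise_flags(recent, k)
    return [sample for sample, noisy in zip(recent, flags) if not noisy]
```

In this sketch, an odd `k` avoids ties in two-class problems, and a smaller `window_size` reacts faster to drift at the cost of having fewer neighbors to vote with.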

Author Biography

Damaris Pascual-González, Universidad de Oriente

Facultad de Ciencias Económicas y Empresariales, Universidad de Oriente

Published

2014-04-03