Data Stream Classification based on an Associative Classifier
DOI:
https://doi.org/10.13053/cys-28-2-4737Keywords:
Data stream classification, associative classifier, concept-driftAbstract
Currently, the diversity of sources generating data in a massive online manner cause data streams to become part of many real work applications. Learning from a data stream is a very challenging task dueto the non-stationary nature of this type of data. Characteristics such as infinite length, concept drift, concept evolution, and recurrent concepts are the most common problems that need to be addressed by data stream learning algorithms. In this work an algorithm for data stream classification based on an associative classifier is presented. This proposal combines a clustering algorithm and the Na¨ıve Associative Classifier for Online Data (NACOD) to address this problem. A set of micro-clusters (MCs), a data structure that summarize the information of the current data, is used instead of storing the whole data. The MCs are continually updated with the arriving data, either to create new MCs or to update existing ones. The added MCs helps to deal with concept drift. To assess the performance of the proposed model, experiments were carried out on3 data sets commonly used to evaluate data stream classification algorithms: KDD Cup 1999, Forest Cover Type and Staglog (Shuttle). Our model achieved higher accuracies than those achieved with algorithms such as data stream version of Na¨ıve Bayes and Hoeffding Tree, the average accuracies achieved were for KDDDcup 1999: 100%, Statlog (Shuttle): 99.01% and Forest Cover Type 70.44%.Downloads
Published
2024-06-12
Issue
Section
Articles
License
Hereby I transfer exclusively to the Journal "Computación y Sistemas", published by the Computing Research Center (CIC-IPN),the Copyright of the aforementioned paper. I also accept that these
rights will not be transferred to any other publication, in any other format, language or other existing means of developing.I certify that the paper has not been previously disclosed or simultaneously submitted to any other publication, and that it does not contain material whose publication would violate the Copyright or other proprietary rights of any person, company or institution. I certify that I have the permission from the institution or company where I work or study to publish this work.The representative author accepts the responsibility for the publicationof this paper on behalf of each and every one of the authors.
This transfer is subject to the following conditions:- The authors retain all ownership rights (such as patent rights) of this work, except for the publishing rights transferred to the CIC, through this document.
- Authors retain the right to publish the work in whole or in part in any book they are the authors or publishers. They can also make use of this work in conferences, courses, personal web pages, and so on.
- Authors may include working as part of his thesis, for non-profit distribution only.