Affiliation:
1. School of Computer Science and Engineering, North Minzu University, Yinchuan, China
Abstract
Most data stream ensemble classification algorithms use supervised learning. This method needs to use a large number of labeled data to train the classifier, and the cost of obtaining labeled data is very high. Therefore, the semi supervised learning algorithm using labeled data and unlabeled data to train the classifier becomes more and more popular. This article is the first to review data stream ensemble classification methods from the perspectives of supervised learning and semi-supervised learning. Firstly, basic classifiers such as decision trees, neural networks, and support vector machines are introduced from the perspective of supervised learning and semi-supervised learning. Secondly, the key technologies in data stream ensemble classification are explained from the two aspects of incremental and online. Finally, the majority voting and weight voting are explained in the ensemble strategies. The different ensemble methods are summarized and the classic algorithms are quantitatively analyzed. Further research directions are given, including the handling of concept drift under supervised and semi-supervised learning, the study of homogeneous ensemble and heterogeneous ensemble, and the classification of data stream ensemble under unsupervised learning.
Subject
Artificial Intelligence,General Engineering,Statistics and Probability
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献