A streaming ensemble algorithm (SEA) for large-scale classification
Citations
2,374 citations
1,621 citations
Cites background from "A streaming ensemble algorithm (SEA..."
...[7, 16, 24]) is based on maintaining an ensemble of models capable of capturing various states of the data....
[...]
1,403 citations
Cites background from "A streaming ensemble algorithm (SEA..."
...Much work has been done on modeling [1], querying [2, 14, 18], and mining data streams; for instance, several papers have been published on classification [7, 21, 27], regression analysis [5], and clustering [19]....
[...]
987 citations
Cites background from "A streaming ensemble algorithm (SEA..."
...…and Kubat, 1993, 1996; Wang et al., 2003), decision trees, including their incremental versions (Harries and Sammut, 1998; Hulten et al., 2001; Street and Kim, 2001; Kolter and Maloof, 2003; Stanley, 2003; Wang et al., 2003), Naïve Bayes (Kolter and Maloof, 2003; Wang et al., 2003), SVMs…...
[...]
...This is caused by the fact that data in many current data processing systems is organized in the form of a data stream rather than a static data repository, reflecting the natural flow of data (Street and Kim, 2001; Wang et al., 2003; Hulten and Spencer, 2003)....
[...]
...Another popular benchmark problem is represented by a moving hyperplane (Hulten et al., 2001; Street and Kim, 2001; Kolter and Maloof, 2003; Wang et al., 2003)....
[...]
...…page access data (Hulten et al., 2001), the Text Retrieval Conference (TREC) data (Lanquillon, 1999; Klinkenberg, 2004), credit card fraud data (Wang et al., 2003), breast cancer, anonymous Web browsing, and US Census Bureau data (Street and Kim, 2001), and e-mail data (Cunningham et al., 2003)....
[...]
...Street and Kim (2001) and Wang et al. (2003) suggest that simply dividing the data into sequential chunks of fixed size and building an ensemble on those chunks may be effective for handling concept drift....
[...]
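The chunk-based idea in the excerpt above can be illustrated with a minimal sketch. It is not the procedure from Street and Kim (2001) or Wang et al. (2003): the base learner (`CentroidClassifier`), the class name `ChunkEnsemble`, the parameters `chunk_size` and `max_members`, and the drop-the-oldest replacement rule are all assumptions made for illustration. The sketch only shows the general pattern of training one model per fixed-size chunk and combining the models by majority vote.

```python
from collections import Counter, deque


class CentroidClassifier:
    """Hypothetical stand-in base learner: predicts the nearest class centroid."""

    def fit(self, X, y):
        sums, counts = {}, Counter(y)
        for xi, yi in zip(X, y):
            acc = sums.setdefault(yi, [0.0] * len(xi))
            for j, v in enumerate(xi):
                acc[j] += v
        # Mean feature vector per class.
        self.centroids = {c: [s / counts[c] for s in sums[c]] for c in sums}
        return self

    def predict_one(self, x):
        def sqdist(c):
            return sum((a - b) ** 2 for a, b in zip(x, self.centroids[c]))
        return min(self.centroids, key=sqdist)


class ChunkEnsemble:
    """Trains one model per fixed-size chunk; keeps only the newest models.

    Assumption: the oldest member is dropped when the ensemble is full.
    Published methods use other replacement or weighting rules.
    """

    def __init__(self, chunk_size=4, max_members=3):
        self.chunk_size = chunk_size
        self.members = deque(maxlen=max_members)  # oldest falls off automatically
        self._X, self._y = [], []

    def learn_one(self, x, y):
        self._X.append(x)
        self._y.append(y)
        if len(self._X) == self.chunk_size:
            # Chunk is complete: fit a new member and start a fresh buffer.
            self.members.append(CentroidClassifier().fit(self._X, self._y))
            self._X, self._y = [], []

    def predict_one(self, x):
        if not self.members:
            return None  # nothing learned yet
        votes = Counter(m.predict_one(x) for m in self.members)
        return votes.most_common(1)[0][0]
```

Because old members are eventually displaced by models trained on recent chunks, the vote follows the stream when the labeling rule changes, which is the intuition behind using such ensembles under concept drift.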
960 citations
References
21,674 citations
[...]
16,118 citations
12,940 citations
"A streaming ensemble algorithm (SEA..." refers to methods in this paper
...The first and third data sets are publicly available from the UCI machine learning repository [2]....
[...]
7,601 citations
"A streaming ensemble algorithm (SEA..." refers to methods in this paper
...Boosting [20], and its variants such as AdaBoost [10] and Arcing [5], uses a weighted resampling technique, creating a series of classifiers in which later individuals focus on classifying the more difficult points....
[...]
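The weighted resampling described in the excerpt above can be sketched as follows. This is the AdaBoost weight update as it is commonly presented for binary classification, not code taken from the cited papers; the function names `adaboost_reweight` and `resample` are hypothetical. After each round, misclassified points gain weight, so the next classifier's training sample concentrates on the harder points.

```python
import math
import random


def adaboost_reweight(weights, correct):
    """One round of the standard binary AdaBoost weight update.

    weights: current example weights, assumed to sum to 1.
    correct: booleans, True where the current classifier was right.
    Returns (new_weights, alpha), where alpha is the classifier's vote weight.
    """
    eps = sum(w for w, ok in zip(weights, correct) if not ok)  # weighted error
    if not 0.0 < eps < 0.5:
        raise ValueError("weighted error must lie strictly between 0 and 0.5")
    alpha = 0.5 * math.log((1.0 - eps) / eps)
    # Raise the weight of mistakes, lower the weight of correct examples.
    raw = [w * math.exp(alpha if not ok else -alpha)
           for w, ok in zip(weights, correct)]
    total = sum(raw)
    return [w / total for w in raw], alpha


def resample(data, weights, rng):
    """Draw a training set of the same size, favoring high-weight examples."""
    return rng.choices(data, weights=weights, k=len(data))


# Usage: four equally weighted points, the last one misclassified.
new_weights, alpha = adaboost_reweight([0.25] * 4, [True, True, True, False])
next_sample = resample(["a", "b", "c", "d"], new_weights, random.Random(0))
```

A property of this update is that the misclassified examples carry exactly half of the total weight after normalization, so the next classifier cannot do well by repeating the previous one's mistakes.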