Nowadays cloud computing has become a major trend that enterprises and research organizations are pursuing with increasing zest. A potentially important application area for clouds is data analytics. In our previous publication, we introduced a novel cloud infrastructure, the CloudMiner, which facilitates data mining on massive scientific data. By providing a cloud platform which hosts data mining cloud services following the Software as a Service (SaaS) paradigm, CloudMiner offers the capability for realizing cloud-based data mining tasks upon traditional distributed databases and other dataset types. However, little attention has been paid to the issue of data stream management on the cloud so far. We have noticed the fact that some features of the cloud meet very well the requirements of data stream management. Consequently, we developed an innovative software framework, called the StreamMiner, which is introduced in this paper. It serves as an extension to the CloudMiner for facilitating, in particular, real-world data stream management and analysis using cloud services. In addition, we also introduce our tentative implementation of the framework. Finally, we present and discuss the first experimental performance results achieved with the first StreamMiner prototype.
History
Chapter number
18
Pagination
206-217
ISSN
0302-9743
eISSN
1611-3349
ISBN-13
9783642246500
ISBN-10
3642246508
Language
eng
Publication classification
B1 Book chapter
Copyright notice
2011, Springer-Verlag Berlin Heidelberg
Extent
52
Editor/Contributor(s)
Xiang Y, Cuzzocrea A, Hobbs M, Zhou W
Publisher
Springer
Place of publication
Berlin, Germany
Title of book
Algorithms and architectures for parallel processing