File(s) under permanent embargo
A data as a product model for future consumption of big stream data in clouds
conference contribution
posted on 2015-01-01, 00:00 authored by Guangyan HuangGuangyan Huang, J He, C Chi, Wanlei Zhou, Y ZhangData is becoming the world’s new natural resource
and big data use grows quickly. The trend of computing
technology is that everything is merged into the Internet and
‘big data’ are integrated to comprise complete
information for collective intelligence. With the increasing
size of big data, refining big data themselves to reduce data size
while keeping critical data (or useful information) is a new
approach direction. In this paper, we provide a novel data
consumption model, which separates the consumption of data
from the raw data, and thus enable cloud computing for big
data applications. We define a new Data-as-a-Product (DaaP)
concept; a data product is a small sized summary of the
original data and can directly answer users’ queries. Thus, we
separate the mining of big data into two classes of processing
modules: the refine modules to change raw big data into smallsized
data products, and application-oriented mining modules
to discover desired knowledge further for applications from
well-defined data products. Our practices of mining big stream
data, including medical sensor stream data, streams of text
data and trajectory data, demonstrated the efficiency and
precision of our DaaP model for answering users’ queries
and big data use grows quickly. The trend of computing
technology is that everything is merged into the Internet and
‘big data’ are integrated to comprise complete
information for collective intelligence. With the increasing
size of big data, refining big data themselves to reduce data size
while keeping critical data (or useful information) is a new
approach direction. In this paper, we provide a novel data
consumption model, which separates the consumption of data
from the raw data, and thus enable cloud computing for big
data applications. We define a new Data-as-a-Product (DaaP)
concept; a data product is a small sized summary of the
original data and can directly answer users’ queries. Thus, we
separate the mining of big data into two classes of processing
modules: the refine modules to change raw big data into smallsized
data products, and application-oriented mining modules
to discover desired knowledge further for applications from
well-defined data products. Our practices of mining big stream
data, including medical sensor stream data, streams of text
data and trajectory data, demonstrated the efficiency and
precision of our DaaP model for answering users’ queries
History
Event
International Conference on Services Computing (12th : 2015 : New York, New York)Pagination
256 - 263Publisher
IEEELocation
New York, New YorkPlace of publication
Piscataway, N.J.Publisher DOI
Start date
2015-06-27End date
2015-07-02ISBN-13
9781467372817Language
engPublication classification
E Conference publication; E1 Full written paper - refereedCopyright notice
2015, IEEEEditor/Contributor(s)
P Maglio, I Paik, W ChouTitle of proceedings
SCC 2015: Proceedings of the 12th International Conference on Services ComputingUsage metrics
Categories
No categories selectedKeywords
Licence
Exports
RefWorks
BibTeX
Ref. manager
Endnote
DataCite
NLM
DC