Deakin University
Browse

File(s) under permanent embargo

A data as a product model for future consumption of big stream data in clouds

conference contribution
posted on 2015-01-01, 00:00 authored by Guangyan HuangGuangyan Huang, J He, C Chi, Wanlei Zhou, Y Zhang
Data is becoming the world’s new natural resource
and big data use grows quickly. The trend of computing
technology is that everything is merged into the Internet and
‘big data’ are integrated to comprise complete
information for collective intelligence. With the increasing
size of big data, refining big data themselves to reduce data size
while keeping critical data (or useful information) is a new
approach direction. In this paper, we provide a novel data
consumption model, which separates the consumption of data
from the raw data, and thus enable cloud computing for big
data applications. We define a new Data-as-a-Product (DaaP)
concept; a data product is a small sized summary of the
original data and can directly answer users’ queries. Thus, we
separate the mining of big data into two classes of processing
modules: the refine modules to change raw big data into smallsized
data products, and application-oriented mining modules
to discover desired knowledge further for applications from
well-defined data products. Our practices of mining big stream
data, including medical sensor stream data, streams of text
data and trajectory data, demonstrated the efficiency and
precision of our DaaP model for answering users’ queries

History

Event

International Conference on Services Computing (12th : 2015 : New York, New York)

Pagination

256 - 263

Publisher

IEEE

Location

New York, New York

Place of publication

Piscataway, N.J.

Start date

2015-06-27

End date

2015-07-02

ISBN-13

9781467372817

Language

eng

Publication classification

E Conference publication; E1 Full written paper - refereed

Copyright notice

2015, IEEE

Editor/Contributor(s)

P Maglio, I Paik, W Chou

Title of proceedings

SCC 2015: Proceedings of the 12th International Conference on Services Computing

Usage metrics

    Research Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC