Deakin University
Browse

Data summarization techniques for big data - a survey

Version 2 2024-06-03, 11:07
Version 1 2016-04-15, 15:50
chapter
posted on 2024-06-03, 11:07 authored by ZR Hesabi, Z Tari, Andrzej GoscinskiAndrzej Goscinski, A Fahad, I Khalil, C Queiroz
In current digital era according to (as far) massive progress and development of internet and online world technologies such as big and powerful data servers we face huge volume of information and data day by day from many different resources and services which was not available to human kind just a few decades ago. This data comes from available different online resources and services that are established to serve customers. Services and resources like Sensor Networks, Cloud Storages, Social Networks and etc., produce big volume of data and also need to manage and reuse that data or some analytical aspects of the data. Although this massive volume of data can be really useful for people and corporates it could be problematic as well. Therefore big volume of data or big data has its own deficiencies as well. They need big storage/s and this volume makes operations such as analytical operations, process operations, retrieval operations real difficult and hugely time consuming. One resolution to overcome these difficult problems is to have big data summarized so they would need less storage and extremely shorter time to get processed and retrieved. The summarized data will be then in "compact format" and still informative version of the entire data. Data summarization techniques aim then to produce a "good" quality of summaries. Therefore, they would hugely benefit everyone from ordinary users to researches and corporate world, as it can provide an efficient tool to deal with large data such as news (for new summarization).

History

Volume

9408

Chapter number

38

Pagination

1109-1152

ISBN-13

9781493920921

Language

Eng

Publication classification

B Book chapter, B1 Book chapter

Copyright notice

2015, Springer

Extent

46

Editor/Contributor(s)

Khan SU, Zomaya AY

Publisher

Springer

Place of publication

Berlin, Germany

Title of book

Handbook on data centers

Series

Lecture notes in computer science; 9408