Performance analysis of storm in a real-world big data stream computing environment
conference contribution
posted on 2018-01-01, 00:00authored byH Yan, D Sun, Shang GaoShang Gao, Z Zhou
As an important distributed real-time computation system, Storm has been widely used in a number of applications such as online machine learning, continuous computation, distributed RPC, and more. Storm is designed to process massive data streams in real time. However, there have been few studies conducted to evaluate the performance characteristics clusters in Storm. In this paper, we analyze the performance of a Storm cluster mainly from two aspects, hardware configuration and parallelism setting. Key factors that affect the throughput and latency of the Storm cluster are identified, and the performance of Storm’s fault-tolerant mechanism is evaluated, which help users use the computation system more efficiently.
2018, ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
Editor/Contributor(s)
Romdhani I, Shu L, Takahiro H, Zhou Z, Gordon T, Zeng D
Title of proceedings
CollaborateCom 2017 : Proceedings of the 2017 European Alliance for Innovation (EAI) International Conference on Collaborative Computing: Networking, Applications and Worksharing
Event
European Alliance for Innovation. Conference (13th : 2017 : Edinburgh, Scotland)