Distributed Streaming Storage Performance Benchmarking: Pravega and Pulsar

Authors

  • Ramesh Kadaba Vasudevamurthy Airspan Networks, Visvesvaraya Technological University, Belagavi, India
  • G. T. Raju Department of Computer Science and Engineering, SJC Institute of Technology, Visvesvaraya Technological University, Belagavi, India
Volume: 14 | Issue: 5 | Pages: 16242-16251 | October 2024 | https://doi.org/10.48084/etasr.8076

Abstract

Massive data shoving can reach the greatest throughput, which is necessary for distributed streaming storage to function at its best. The comparison of the distributed streaming storage systems Pulsar and Pravega for a given number of producers and data packet size is covered in detail in this study. This analysis' benchmark tool accommodates several producers and consumers. When connection pooling is enabled and 0.5 million records are thrust at a 10 Mbps data rate, both streaming storages are assessed for latency percentile comparison. A novel idea called sbk-charts is introduced in the current study, which can create practical charts from CSV files. Multiple CSV files can be joined by sbk-charts to construct a single combined xlsx file with helpful charts. The outcomes of the experiment are then evaluated for performance comparison in a number of dimensions.

Keywords:

connection pooling, benchmarking, throughput, storage benchmarking kit, latency, Pravega, Pulsar

Downloads

Download data is not yet available.

References

"Pravega – A Reliable Stream Storage System." https://cncf.pravega.io/.

"pravega/pravega," https://github.com/pravega/pravega.

"Apache Pulsar." https://pulsar.apache.org/.

N. V. Sanjay Kumar and K. Munegowda, "Distributed Streaming Storage Performance Benchmarking: Kafka and Pravega," International Journal of Innovative Technology and Exploring Engineering, vol. 9, no. 2S, pp. 1–8, Dec. 2019.

"Release Storage Benchmark Kit Version 5.0 · kmgowda/SBK," GitHub. https://github.com/kmgowda/SBK/releases/tag/5.0.

"Dashboards | Grafana documentation," Grafana Labs. https://grafana.com/docs/grafana/latest/dashboards/.

"Apache BookKeeper." https://bookkeeper.apache.org/.

"Apache ZooKeeper." https://zookeeper.apache.org/.

F. Junqueira and B. Reed, ZooKeeper: Distributed Process Coordination. Sebastopol, CA, USA: O’Reilly, 2013.

"apache/flink," https://github.com/apache/flink.

"apache/samza." https://github.com/apache/samza.

N. Sajitha and S. P. Priya, "Optimal Artificial Neural Network-based Fabric Defect Detection and Classification," Engineering, Technology & Applied Science Research, vol. 14, no. 2, pp. 13148–13152, Apr. 2024.

T. Alshammari, "Using Artificial Neural Networks with GridSearchCV for Predicting Indoor Temperature in a Smart Home," Engineering, Technology & Applied Science Research, vol. 14, no. 2, pp. 13437–13443, Apr. 2024.

H. T. S. Alrikabi, I. A. Aljazaery, and A. H. M. Alaidi, "Using a Chaotic Digital System to Generate Random Numbers for Secure Communication on 5G Networks," Engineering, Technology & Applied Science Research, vol. 14, no. 2, pp. 13598–13603, Apr. 2024.

K. V. Ramesh and G. T. Raju, "Pravega: Performance impact analysis with Connection Pooling’," in 2nd IEEE International Conference on Knowledge Engineering and Communication Systems (ICKECS 2024), Karnataka, India, Apr. 2024.

Downloads

How to Cite

[1]
Kadaba Vasudevamurthy, R. and Raju, G.T. 2024. Distributed Streaming Storage Performance Benchmarking: Pravega and Pulsar. Engineering, Technology & Applied Science Research. 14, 5 (Oct. 2024), 16242–16251. DOI:https://doi.org/10.48084/etasr.8076.

Metrics

Abstract Views: 99
PDF Downloads: 157

Metrics Information