Optimizing Data Availability and Scalability with RP*-SD2DS Architecture for Distributed Systems
Received: 19 June 2024 | Revised: 6 July 2024 | Accepted: 13 July 2024 | Online: 20 July 2024
Corresponding author: Mohammed Maabed
Abstract
This work introduces Range Partitioning Scalable Distributed Two-Layer Data Structures (RP*-SD2DS), a data storage architecture designed to improve data availability and scalability in distributed systems. By combining SD2DS with the order-preserving RP* scheme, the design needs no router or coordinator and adapts dynamically. The main goal is to minimize system downtime by distributing data across two layers, which raises availability during the partitioning operations that limit traditional Scalable Distributed Data Structures (SDDSs) and avoids the bottlenecks associated with master- or coordinator-based systems. The proposed solution offers significant improvements over MongoDB, a well-known and robust system, when MongoDB is deployed with either one or three mongos instances.
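The two-layer, range-partitioned idea summarized above can be illustrated with a small single-process sketch. It is not the authors' implementation: the names `HeaderBucket`, `TwoLayerStore`, `capacity`, and `body_buckets` are hypothetical, the placement of bodies by hash is an assumption made for brevity, and client images, message forwarding, and networking are not modelled. The sketch only shows why a split can be cheap when the first layer holds small headers and the second layer holds the large bodies: a split changes range boundaries and moves headers, while bodies stay where they are.

```python
from bisect import bisect_left
from dataclasses import dataclass, field


@dataclass
class HeaderBucket:
    """First-layer bucket holding headers for keys in the range (low, high]."""
    low: float
    high: float
    headers: dict = field(default_factory=dict)  # key -> index of body bucket


class TwoLayerStore:
    """Hypothetical in-memory model of a range-partitioned two-layer store."""

    def __init__(self, capacity=4, body_buckets=3):
        self.capacity = capacity  # max headers per first-layer bucket
        self.first = [HeaderBucket(float("-inf"), float("inf"))]
        self.second = [dict() for _ in range(body_buckets)]  # key -> body

    def _locate(self, key):
        # Buckets are kept sorted by range, so a binary search on the upper
        # bounds finds the owner without any central router.
        highs = [b.high for b in self.first]
        return bisect_left(highs, key)

    def put(self, key, body):
        idx = self._locate(key)
        bucket = self.first[idx]
        body_id = hash(key) % len(self.second)  # assumed placement rule
        self.second[body_id][key] = body
        bucket.headers[key] = body_id
        if len(bucket.headers) > self.capacity:
            self._split(idx)

    def _split(self, idx):
        # Only headers move; bodies in the second layer are left untouched.
        bucket = self.first[idx]
        keys = sorted(bucket.headers)
        boundary = keys[(len(keys) - 1) // 2]
        upper = HeaderBucket(boundary, bucket.high)
        for k in keys:
            if k > boundary:
                upper.headers[k] = bucket.headers.pop(k)
        bucket.high = boundary
        self.first.insert(idx + 1, upper)

    def get(self, key):
        bucket = self.first[self._locate(key)]
        return self.second[bucket.headers[key]][key]

    def range_keys(self, lo, hi):
        # Order preservation lets a range query skip non-overlapping buckets.
        out = []
        for b in self.first:
            if b.high < lo or b.low >= hi:
                continue
            out.extend(k for k in b.headers if lo <= k <= hi)
        return sorted(out)
```

In this sketch, inserting more keys than `capacity` allows triggers a split of the first-layer bucket, after which `get` and `range_keys` still resolve every key through the updated ranges.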
Keywords:
range partitioning, multi-computers, large files, Scalable Distributed Two-Layer Data Structures (SD2DS), Non-Structured Query Language (NoSQL)
License
Copyright (c) 2024 Mohammed Maabed, Nassim Dennouni, Mohamed Aridj
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.