About 4 months ago (approximately the last time I wrote something here), I opted to embark on a graduate school journey at Stony Brook University, Computer Science (if you have a remote position, Technical Writer and/or Software Engineer, at any company, don't hesitate to reach out). Was it the best decision to make, considering more theoretical undertakings, assumptions, and a new culture and environment? I sincerely can't tell. I do, however, genuinely want to garner enough knowledge to do real research, and this is the best time for me to do so. That's enough motivation. I have also met cool and loving individuals. Besides, I had a thorough challenge posed by one of the courses I took in the 2023 fall semester: Distributed Systems. It had some labs, 4 normal labs plus a bonus; the skeletal files for each lab can be seen here. The labs culminated in building a sharded and replicated fault-tolerant key-value store with transaction support, based on the Raft Consensus Algorithm, using Optimistic Concurrency Control (OCC) and subsequently two-phase locking (2PL). All the implementation was in the difficult, low-level programming language, C++. Though some untoward stuff happened at the tail end (I may get to write about it later), my goal was eventually achieved. I learned quite a bit:
- I got familiar with various distributed systems algorithms, patterns, design decisions, trade-offs, and the new kids on the block.
- I debugged and battled the not-so-interesting segmentation faults and core dumps prevalent in C++ programs.
- I serialized data in a rather low-level manner so they could safely be sent via RPCs.
- I struggled to properly use lock primitives while also building a simple locking system from scratch to ensure proper concurrency control and prevent uncontrolled access to application state (data).
- I implemented a batching system to group RPCs and potentially reduce the total number sent. And so much more.
The biggest challenge was figuring out how to do some things within the environmental restrictions of the labs. The skeletal code was devoid of documentation: the straw that broke the camel's back!
NOTE: This article is NOT a walkthrough of the labs, and my implementation CAN'T be made available. It does, however, summarize my experiences, challenges, and tips on how I succeeded in getting the labs done. Everything written here is solely based on my current understanding of things.
For a system to be distributed, several "separate" computation nodes in the same and/or different geographical locations must work together to achieve a common, shared goal. Such a system is expected to tolerate failures (crashes and network partitions) and ensure data availability even during such failures. Another requirement can be that the data the system sends to the client MUST be the latest! These requirements can concisely be expressed as Data Consistency (C), Data Availability (A), and Partition tolerance (P): the birth of CAP and its famous theorem, the CAP theorem. Some questions that beg to be answered are:
How can a system with many nodes situated in different locations be implemented and expertly connected (mostly via a network which, inherently, may fail) to provide (the correct) data as fast as possible even when the connecting links between the nodes fail?
Since any of these nodes may be provided with data, how can the other nodes be made aware of such data and reach a "truce" to either accept or reject the data provided by the client?
To answer most of these questions, the raft algorithm was invented by Diego Ongaro and John Ousterhout and presented in the raft paper. Raft isn't the first consensus algorithm; it first appeared less than a decade ago (May 20, 2014)! Before it were the notoriously difficult-to-implement Paxos by Leslie Lamport, as well as Viewstamped Replication, among others. Raft is, however, simpler to reason about than Paxos. Implementing it isn't a trivial undertaking notwithstanding.
To address the second question, Raft has a strong leader duly elected by the participating nodes during the raft's electoral process (via the RequestVote RPC). Does this mean there can only be one leader in a raft system? Under normal conditions, yes! However, during extreme cases of network partitions, a previously elected leader may be partitioned away, or the partitioned servers may "elect" a new leader if they make a quorum. To tolerate f failures in non-Byzantine fault-tolerant systems, there must be a total of 2f + 1 servers (3f + 1 for Byzantine systems), with at least f + 1 of them (2f + 1 for Byzantine systems) agreeing to form a quorum. For example, with f = 1, a non-Byzantine cluster needs 2(1) + 1 = 3 servers, and any 2 of them form a quorum.
As soon as a leader is elected, it takes total control of accepting data from the client(s) and propagating such data to the other nodes (followers). This is done via the `AppendEntries` RPC. The first question gets some attention here. To understand raft, kindly read the [paper](https://raft.github.io/raft.pdf). I also recommend the visualization on the protocol's website as well as this [interesting visualization](http://thesecretlivesofdata.com/raft/).
To implement raft in Go, this [series of articles](https://eli.thegreenplace.net/2020/implementing-raft-part-0-introduction/) is recommended. If you want to take up the challenge of implementing the protocol via the repository previously shared, these tips might be helpful:
- Most of the labs are equivalent to MIT's [Distributed systems labs](https://pdos.csail.mit.edu/6.824/index.html) 2 through 4, with some differences.
- Sending an RPC is via the repo's event system. For instance, assuming I want to send an `AppendEntries` RPC to the system's followers, I could have something like:
```cpp
...
auto event = commo()->SendAppendEntriesRPC(partition_id,id_of_the_follower,...); // You can pass as many arguments as you want based on your implementation
event->Wait(1000000); // How long do you want the request to wait before timing out (in microseconds)?
...
```
If the `event` variable is of type `std::shared_ptr<rrr::IntEvent>`, it has a member variable called `status_` that tells you what the current status of the RPC is and it's one of:
```cpp
...
INIT = 0,
WAIT = 1,
READY = 2,
DONE = 3,
TIMEOUT = 4,
DEBUG
...
```
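After `Wait` returns, you can branch on that status to decide whether to retry or to trust the reply. A minimal sketch follows; the exact qualification of the enum constants is an assumption, so check where the framework declares them:
```cpp
// Sketch only: `event` is the std::shared_ptr<rrr::IntEvent> returned by the
// commo call shown earlier. The bare TIMEOUT below assumes the enum values
// are visible at this point; qualify them however the framework requires.
event->Wait(1000000); // wait up to 1 second (the value is in microseconds)
if (event->status_ == TIMEOUT) {
  // No reply in time: back off and retry, or give up on this follower for now.
} else {
  // A reply arrived: read the output parameters you bound to the RPC call.
}
```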
It should be noted that sending a custom `struct` through the RPC framework might not work. You should consider breaking the `struct` into primitives such as `bool`, `int`, `string`, or `vector<...>`. For instance, per Figure 2 of the [raft paper](https://raft.github.io/raft.pdf), an `AppendEntries` RPC should have `int term`, `int leaderId`, `int prevLogIndex`, `int prevLogTerm`, `vector<Log> entries`, and `int leaderCommit` as arguments, while `int term` and `bool success` are what each follower should return. `vector<Log> entries` might be very problematic to send, and the annoying thing is you might get errors in your tests without an informative log to point you in the right direction. Assuming the `Log` struct is:
```cpp
struct Log
{
  int index;
  int term;
  shared_ptr<Marshallable> command;
};
```
If you have a system-wide variable to hold all the logs, say `vector<Log> logs`, you'd split the `logs` variable into three vectors:
```cpp
...
vector<int> indexes;
vector<int> terms;
vector<shared_ptr<Marshallable>> commands;
...
```
And then send the individual vectors over the network. Then, at the receiving end (the follower's end), merge them back into one vector of `Log`. If you have a struct like this instead:
```cpp
struct Log
{
  int term;
  shared_ptr<Marshallable> command;
};
```
Then, the split can be:
```cpp
vector<int> terms;
vector<shared_ptr<Marshallable>> commands;
```
A caveat is that the RPC framework available in the repo cannot "serialize" a `shared_ptr<Marshallable>` nor a `vector<shared_ptr<Marshallable>>`. You need to convert them to `MarshallDeputy` or `vector<MarshallDeputy>`. You can retrieve the `shared_ptr<Marshallable>` from a `MarshallDeputy` instance through its `sp_data_` member.
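Putting the last few points together, the split on the leader's side and the merge on the follower's side could look roughly like this. This is only a sketch under assumptions: `Log` is the two-field struct above, `logs` is the system-wide vector, and `MarshallDeputy` is constructed directly from a `shared_ptr<Marshallable>` (verify the actual constructor in the framework's headers):
```cpp
// Leader side: split vector<Log> into RPC-friendly vectors.
vector<int> terms;
vector<MarshallDeputy> commands;
for (const auto& entry : logs) {
  terms.push_back(entry.term);
  commands.push_back(MarshallDeputy(entry.command)); // wrap the Marshallable
}
// ...pass `terms` and `commands` as AppendEntries arguments via commo...

// Follower side: rebuild vector<Log> from the received vectors.
vector<Log> received;
for (size_t i = 0; i < terms.size(); i++) {
  Log entry;
  entry.term = terms[i];
  entry.command = commands[i].sp_data_; // unwrap back to shared_ptr<Marshallable>
  received.push_back(entry);
}
```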
- A high-level overview of the raft system is:
```shell
client -> server -> commo -> services
```
When a client wants to make a request, it should call the `Start` method in the code's `server.cc`. This method "magically" initiates the process of starting the raft server, which in turn starts the leader election process and the subsequent leader heartbeating (broadcasting messages to its followers at intervals: 150 milliseconds to 300 milliseconds in the raft paper, though the lab recommends higher intervals for some reasons). The `commo` system is used to request votes and for subsequent communications (like the `AppendEntries` RPC). It utilizes the code you write in `commo.cc` to asynchronously call the code you also write in `services.cc`. `services.cc` should be written to handle the follower side of the process based on the raft protocol. There is a file called `raft_rpc.rpc` which should be modified based on the arguments and return values of the RPCs you define there; if you modify it properly, along with `services.h/cc`, some C++ code gets auto-generated for you.
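As a small illustration of the election-timeout side of that flow, a randomized timeout (which raft requires so that followers don't all become candidates at once) could be picked like this. The bounds and helper name here are made up; use whatever intervals the lab recommends and whatever timer utilities the framework provides:
```cpp
#include <chrono>
#include <random>

// Sketch only: pick a randomized election timeout so followers don't all
// time out simultaneously. The 800-1600 ms bounds are illustrative.
std::chrono::milliseconds RandomElectionTimeout() {
  static thread_local std::mt19937 gen{std::random_device{}()};
  std::uniform_int_distribution<int> dist(800, 1600);
  return std::chrono::milliseconds(dist(gen));
}

// Usage idea: record the last time a heartbeat (AppendEntries) arrived; if
// more than RandomElectionTimeout() has elapsed, become a candidate and send
// RequestVote RPCs through commo.
```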
- Since the sources of error can be very difficult to pinpoint, it's recommended to use a debugger and/or the logging framework built into the repo. Among other things, you need to keep track of application states such as `int currentTerm`, `int votedFor`, `vector<Log> logs`, `int commitIndex`, `int lastApplied`, `vector<int> nextIndex` or `map<int, int> nextIndex`, and `vector<int> matchIndex` or `map<int, int> matchIndex`. You can write some helper methods to stringify the complex types and use the framework's logging system to output the states at every stage of running the tests. There are five levels:
```cpp
#define Log_debug(msg, ...) ...
#define Log_info(msg, ...) ...
#define Log_warn(msg, ...) ...
#define Log_error(msg, ...) ...
#define Log_fatal(msg, ...) ...
```
`Log_info` prints logs to the console effortlessly. You need to be careful though: excessive printing of logs can cause errors of its own. An example of how to log is:
```cpp
Log_info("election timer started at term: %d", currentTerm)
```
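A small helper for stringifying the log vector before handing it to these macros might look like the sketch below (the `Log` member names follow the struct assumed earlier):
```cpp
#include <sstream>
#include <string>
#include <vector>

// Sketch only: turn the in-memory log into a compact string so it can be
// dumped with Log_info/Log_debug while the tests run.
std::string LogsToString(const std::vector<Log>& logs) {
  std::ostringstream out;
  out << "[";
  for (size_t i = 0; i < logs.size(); i++) {
    out << "(t=" << logs[i].term << ")";
    if (i + 1 < logs.size()) out << ", ";
  }
  out << "]";
  return out.str();
}

// Usage: Log_info("term=%d log=%s", currentTerm, LogsToString(logs).c_str());
```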
That's the basic high-level overview of the raft implementation for lab 1. Note that there is a lot to do to get it to work; you've got to debug and debug and debug. As someone brand new to systems programming, it took me weeks (2 weeks and 5 days) of brutal trial and error to complete. It took some people less time and others more. The experience was a mixed one, like how you feel when your code doesn't work and then works. The implementation series I shared earlier helped greatly. Also, ensure that your implementation is efficient and performant; the other labs need that to pass.
### [Lab 2](http://mpaxos.com/teaching/ds/23fa/labs/lab2.html): A replicated (or fault-tolerant) key-value store
The raft implementation above is pretty low-level and by itself, isn't particularly useful. It needs to be used by a bigger system which directly benefits the users. An example of such a system is a key-value store like [Redis](https://redis.io/). This lab expects you to utilize the raft previously implemented to build a fault-tolerant key-value database. Your implementation should support `Put(key, value)`, `Append(key, value)`, and `Get(key)`. Of course, you can support `Delete(key)` too to have a full [CRUD](https://en.wikipedia.org/wiki/Create,_read,_update_and_delete) functionality. It's relatively simpler compared to the previous lab though it has its nuances:
- Depending on your implementation, you need to wait for the participating raft servers to reach a consensus before responding to the client's requests. For instance, if the client sends a `Get("John")`, you shouldn't immediately respond with either its value or a `NOT FOUND` error. You should wait, via the `OnNextCommand` method, for the agreement of the servers; only after that should you respond to the client. If you don't, some of the tests will not pass. The waiting mechanism could be event-based (like the event system introduced above) or just a timing mechanism that retries until an upper bound on time is reached (see the sketch after this list). You should avoid a busy wait, though, by sleeping for some microseconds before retrying.
- Locking is extremely important since you are working with a concurrent system. A recursive locking primitive is built into the framework. Use it judiciously, because with locking comes the risk of deadlock; you can't afford to write a system that's susceptible to deadlocks.
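Here is the sketch referenced in the first point: a timing-based wait-and-retry that avoids a busy wait. Every name in it is illustrative rather than part of the lab's skeleton; an event-based wait through the framework is the cleaner alternative:
```cpp
#include <chrono>
#include <thread>

// Illustrative helper: e.g. checks that lastApplied >= index on this server.
bool Applied(int index);

// Sketch only: poll until the raft servers have applied the command at
// `target_index`, sleeping between checks to avoid a busy wait.
bool WaitForAgreement(int target_index, std::chrono::milliseconds timeout) {
  auto deadline = std::chrono::steady_clock::now() + timeout;
  while (std::chrono::steady_clock::now() < deadline) {
    if (Applied(target_index)) {
      return true;  // safe to reply to the client now
    }
    std::this_thread::sleep_for(std::chrono::microseconds(500));
  }
  return false;  // timed out: tell the client to retry
}
```
The microsecond-level sleep keeps the loop from burning a core while still reacting quickly once the servers agree.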
This lab was a lot easier than Lab 1. It took me a few days to implement, having by then gained some hard-won experience writing and debugging C++ systems code.
### [Lab 3](http://mpaxos.com/teaching/ds/23fa/labs/lab3): Sharded key-value store
This is another intense lab. In lab 1, we built a bare-bones raft implementation whose supposedly non-volatile states (`currentTerm`, `votedFor`, `logs`) were not actually persisted. Currently, if a raft server crashes, those states are gone. This defeats the purpose of the raft algorithm: a crashed server needs to "recover" with its previous states intact! That's what the first part of this lab aims to achieve, persisting those important server states. It has some caveats though:
- The [raft paper](https://raft.github.io/raft.pdf) considers `commitIndex` and `lastApplied` volatile states on all servers. However, to scale through the tests, they need to be persisted as well. If not, the last committed log entry will likely get recommitted, which will fail the test.
- The persistence methods should be carefully sprinkled only over the places where they are needed, to avoid performance bottlenecks, and starting the server should read from the persisted states in case such states exist (a rough sketch of the general shape follows this list).
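For the rough sketch mentioned above, the persistence helpers could have a shape like this. The names, file format, and choice of states are all assumptions; the point is only that a restarted server can read its states back (real code would also serialize the log entries and use whatever path the framework supplies):
```cpp
#include <fstream>
#include <string>

// Sketch only: write the states a restarted server needs.
void PersistState(const std::string& path, int currentTerm, int votedFor,
                  int commitIndex, int lastApplied) {
  std::ofstream out(path, std::ios::trunc);
  out << currentTerm << " " << votedFor << " "
      << commitIndex << " " << lastApplied << "\n";
}

// Sketch only: read them back on startup; returns false if nothing was persisted.
bool RestoreState(const std::string& path, int& currentTerm, int& votedFor,
                  int& commitIndex, int& lastApplied) {
  std::ifstream in(path);
  if (!in) return false;  // nothing persisted yet: start fresh
  in >> currentTerm >> votedFor >> commitIndex >> lastApplied;
  return true;
}
```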
The next part of the lab is implementing a checkpointing mechanism called `Snapshotting`. This is needed to prevent an infinitely long `vector<Log>`, which is not practical since computer memories ain't infinite! Snapshotting allows `log[]` to be truncated at a particular index. **I skipped this part when it was taking a lot of my time without headway**.
The next part was building a sharded key-value store. The store is equivalent to the one built in lab 2 aside from the sharding feature. Sharding is a nifty way of splitting datasets into partitions or shards. It aims to utilize parallel processing capabilities to process smaller datasets across shards, thereby making the overall system more performant. It is used extensively in systems such as [DynamoDB](https://aws.amazon.com/dynamodb/), which uses a **0-hop Distributed Hash Table (DHT)** as its sharding mechanism to reduce latency and increase efficiency. There is also the **X-hop DHT**, which is utilized in systems such as [BitTorrent](https://www.bittorrent.com/). They have their use cases. In the lab, however, a traditional sharding mechanism where data are sharded by their "unhashed" keys was used. For it to work, you need to handle scenarios where server groups `Join` or `Leave`, and where shards are `Move`d from one group to another. As in lab 2, these operations should pass through raft, and until the servers are aware of and have agreed on any of the operations, you should keep retrying. The tips for this part are:
- Your raft implementation must be aware that the system is now partitioned, and when asking for votes or agreement, the RPCs should only be sent to the servers in the same partition.
- The client of `shardkv` must be implemented in such a way as to only send the request on a key to the leader of the servers in the shard it belongs to. You can get the servers in a shard by using the `Query` endpoint of the shardmaster you have implemented.
- Rebalancing of shards must be done efficiently so that shards are evenly divided and "should move as few shards as possible to achieve that goal". Therefore, after every successful `Join` and `Leave` operation, you need the rebalancing logic to evenly distribute shards across the groups in an optimal way (a sketch of one possible approach follows this list).
- The `shardkv` server itself is identical to what was done in lab 2.
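Here is the rebalancing sketch mentioned above: compute a quota per surviving group and move only the excess shards. The types and names are illustrative and won't match the lab's shardmaster structures exactly; it assumes `current` maps every shard to some group id (possibly an invalid one for unassigned shards):
```cpp
#include <algorithm>
#include <map>
#include <vector>

// Sketch only: reassign `num_shards` shards among `groups` so counts differ by
// at most one, moving as few shards as possible from the current assignment.
std::map<int, int> Rebalance(const std::map<int, int>& current,  // shard -> group
                             std::vector<int> groups, int num_shards) {
  std::map<int, int> next = current;
  if (groups.empty()) return next;

  // Shards owned by surviving groups stay put for now; the rest are orphans.
  std::map<int, std::vector<int>> owned;
  for (int g : groups) owned[g] = {};
  std::vector<int> orphans;
  for (auto& [shard, group] : next) {
    auto it = owned.find(group);
    if (it != owned.end()) it->second.push_back(shard);
    else orphans.push_back(shard);
  }

  // Quotas: floor for everyone, one extra for the groups that already own the
  // most (ties broken by id), which keeps the number of moves small.
  size_t base = num_shards / groups.size();
  size_t extra = num_shards % groups.size();
  std::sort(groups.begin(), groups.end(), [&](int a, int b) {
    if (owned[a].size() != owned[b].size()) return owned[a].size() > owned[b].size();
    return a < b;
  });
  std::map<int, size_t> quota;
  for (size_t i = 0; i < groups.size(); i++) quota[groups[i]] = base + (i < extra ? 1 : 0);

  // Shed anything above quota, then refill under-quota groups from the orphans.
  for (int g : groups) {
    while (owned[g].size() > quota[g]) {
      orphans.push_back(owned[g].back());
      owned[g].pop_back();
    }
  }
  for (int g : groups) {
    while (owned[g].size() < quota[g] && !orphans.empty()) {
      owned[g].push_back(orphans.back());
      next[orphans.back()] = g;
      orphans.pop_back();
    }
  }
  return next;
}
```
Sorting by current ownership before assigning quotas is what keeps the number of moved shards small: groups that already hold many shards keep them, and only the overflow is handed to emptier groups.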
Implementing this last part gives you an idea of how Google's BigTable & Spanner and Apache's HBase were implemented. It was thrilling for me to get them working.
### [Lab 4](http://mpaxos.com/teaching/ds/23fa/labs/lab4): Distributed transactions
Now to the meat of it! Till now, we have not had a truly distributed database system that supports transactions. In the database world, it is the dream of database systems to be [ACID](https://www.mongodb.com/basics/acid-transactions)-compliant. This means that concurrent transactions are expected to be **A**tomic, **C**onsistent, run in **I**solation, and **D**urable.
**NOTE**: The **Consistency** in [ACID](https://www.mongodb.com/basics/acid-transactions) isn't the same as distributed consistency. Distributed consistency is more related to **Isolation**. It entails having a consistent state of data, as against having data that obey the constraints (integrity, referential, and other constraints) stated by an application or Software Engineer utilizing a Database Management System. This means that a consistent distributed system should always respond with the latest written data for a particular key. This, however, depends on the consistency and isolation level the system adopts. Google [Spanner](https://static.googleusercontent.com/media/research.google.com/en//archive/spanner-osdi2012.pdf), for example, is [linearizable](https://jepsen.io/consistency/models/linearizable) and [serializable](https://jepsen.io/consistency/models/serializable), having used [Paxos](http://lamport.azurewebsites.net/pubs/paxos-simple.pdf) for replication, and [2PL](https://en.wikipedia.org/wiki/Two-phase_locking) and [Two-phase commit](https://en.wikipedia.org/wiki/Two-phase_commit_protocol) (2PC) for transaction execution and commit. Performance is always an issue, though, and to mitigate that, Spanner adopted a relaxed isolation level for read operations. The isolation level used is called `Snapshot Isolation`. For details about all the isolation levels and what they mean, I suggest you check out this [paper](https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/tr-95-51.pdf).
In lab 4, you will spend time mostly on the client side (because the client is the coordinator) implementing a concurrency control protocol called [OCC](https://en.wikipedia.org/wiki/Optimistic_concurrency_control). Some tips:
- Build a locking system from scratch; you don't need to use C++'s locking primitives. You may have a store for keeping locks and ensure that once a transaction has acquired a lock on a key, no other transaction can acquire it.
- Two phases need to be handled: Prepare and Commit. In the "prepare" phase, locks for all the keys the transaction touches (reads and writes) are acquired (abort if not successful). Then, for reads, check the versions of the keys held locally against the ones directly from the server; any disparity should abort the transaction. If the prepare phase succeeds, proceed to commit the writes by simply sending `Put` requests to the server. If the commit isn't successful, you should abort and release the locks. Otherwise, release the locks and return a nod to the requesting client (a sketch of one possible lock store follows this list).
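To make those two bullets concrete, here is one possible shape of the from-scratch lock store mentioned above. Everything here (the names, the use of a transaction id, the single mutex) is illustrative rather than the lab's API:
```cpp
#include <map>
#include <mutex>
#include <string>

// Sketch only: a tiny lock table that records which transaction holds each key.
class LockTable {
 public:
  // Returns true if `txn_id` now holds (or already held) the lock on `key`.
  bool Acquire(const std::string& key, int txn_id) {
    std::lock_guard<std::mutex> guard(mu_);
    auto it = holders_.find(key);
    if (it != holders_.end() && it->second != txn_id) return false;  // held by someone else
    holders_[key] = txn_id;
    return true;
  }
  // Releases the lock only if `txn_id` is the current holder.
  void Release(const std::string& key, int txn_id) {
    std::lock_guard<std::mutex> guard(mu_);
    auto it = holders_.find(key);
    if (it != holders_.end() && it->second == txn_id) holders_.erase(it);
  }
 private:
  std::mutex mu_;
  std::map<std::string, int> holders_;  // key -> transaction currently holding it
};
```
The coordinator would acquire locks for every key in the transaction during prepare, validate the read versions, push the buffered writes on commit, and release the locks in every outcome, success or abort.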
The implementation is a bit involved but almost of the same intensity as Lab 2.
## Conclusion
That summarized my experience in Distributed Systems so far. It was challenging stuff for me as I had no prior familiarity with systems programming. What I learnt, however, was worth the stress.
A very interesting thing is my Professor, [Yanhong Annie Liu](https://en.wikipedia.org/wiki/Yanhong_Annie_Liu), created a language for distributed algorithms, [DistAlgo](https://github.com/yanhongliu/distalgo). The implementation of Raft and other distributed algorithms in the language is pretty solid. I shall be writing about the language as soon as I get comfortable with it. Kindly be on the lookout.
In the meantime, compliments of the season. It's good to write again after a while off! See ya!
## Outro
Enjoyed this article? I'm a Software Engineer and Technical Writer actively seeking new opportunities, particularly in areas related to web security, finance, health care, and education. If you think my expertise aligns with your team's needs, let's chat! You can find me on LinkedIn: [LinkedIn](https://www.linkedin.com/in/idogun-john-nelson/) and Twitter: [Twitter](https://twitter.com/Sirneij).
If you found this article valuable, consider sharing it with your network to help spread the knowledge!
## Attribution
Cover image: [Designed by fullvector / Freepik](http://www.freepik.com/).