From 8e322564c82e2e7275fc1426e71872bef61a8e48 Mon Sep 17 00:00:00 2001
From: Noah Tigner
5 minute read • March 11, 2026
+4 minute read • March 11, 2026
This post contains my notes on Chapter 10 of _Database Internals_ by Alex Petrov. These notes are intended as a reference and are not meant as a substitute for the original text. I found Timilehin Adeniran's notes on _Designing Data-Intensive Applications_ extremely helpful while reading that book, so I thought I'd try to do the same here. diff --git a/src/assets/articles/databaseInternalsChapter11.md b/src/assets/articles/databaseInternalsChapter11.md index 229f9ff..4f232c7 100644 --- a/src/assets/articles/databaseInternalsChapter11.md +++ b/src/assets/articles/databaseInternalsChapter11.md @@ -1,8 +1,8 @@ --- title: Database Internals Ch. 11 - Replication and Consistency -description: Notes on Chapter 11 of Database Internals by Alex Petrov. Replication and consistency in distributed systems, CAP, and CDRTs. +description: Notes on Chapter 11 of Database Internals by Alex Petrov. Replication and consistency in distributed systems, CAP, and CRDTs. published: March 18, 2026 -updated: March 18, 2026 +updated: March 29, 2026 minutesToRead: 10 path: /articles/database-internals-chapter-11/ image: /images/database-internals.jpg @@ -14,7 +14,7 @@ collection: slug: database-internals title: Database Internals shortTitle: Ch. 11 - Replication and Consistency - shortDescription: Replication and consistency in distributed systems, CAP, and CDRTs. + shortDescription: Replication and consistency in distributed systems, CAP, and CRDTs. order: 11 --- @@ -40,7 +40,7 @@ To make the system highly available, we need to design it in a way that allows h ### Infamous CAP -Availability measures the system's ability to respond to every request successfully. We would also like each operation to be (atomically / linearizably) consistent. Ideally, we would like to achieve both availability and consistency while tolerating network partitions. The CAP conjecture describes the tradeoffs between consistency \(C), availability (A), and partition tolerance (P). 
The conjecture states that at most two of the three can be achieved. +Availability measures the system's ability to respond to every request successfully. We would also like each operation to be (atomically / linearizably) consistent. Ideally, we would like to achieve both availability and consistency while tolerating network partitions. The CAP conjecture describes the tradeoffs between consistency C, availability A, and partition tolerance P. The conjecture states that a system can only choose between consistency and availability when a partition occurs. The two most common approaches are "AP" and "CP". CP systems prefer failing requests to serving potentially inconsistent data. AP systems loosen the C requirements and allow serving potentially inconsistent values during the request. @@ -64,7 +64,7 @@ From the client's perspective, distributed systems act as if storage is shared, Registers can be accessed by multiple readers and writers simultaneously. When it comes to concurrent ops, there are three types of registers: - Safe - reads to the safe registers may return arbitrary values within the range of the register during a concurrent write op -- Regular - read ops return the value of the most recently completed write, or the value of the write that overlaps with the current reade op +- Regular - read ops return the value of the most recently completed write, or the value of the write that overlaps with the current read op - Atomic - every write op has a single moment before which every read returns an old value and after which every read returns a new value. This guarantees linearizability.
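To make the atomic register's guarantee concrete, here's a minimal single-process sketch (my own illustration, not from the book) in which a lock gives every read and write a single atomic point in time:

```python
import threading

class AtomicRegister:
    """Toy register: the lock gives each read/write a single atomic
    moment, so every read sees either the old or the new value, never
    a partial one - the linearizable behavior described above."""

    def __init__(self, value=None):
        self._value = value
        self._lock = threading.Lock()

    def write(self, value):
        with self._lock:
            self._value = value

    def read(self):
        with self._lock:
            return self._value

r = AtomicRegister(0)
r.write(42)
assert r.read() == 42  # a completed write is visible to every later read
```

A real distributed register needs quorum reads and writes to get the same effect across replicas; the lock only models the single-copy semantics.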
--- @@ -123,7 +123,7 @@ Following CAP principles, we can tune our eventual consistency with three parame - Write consistency W - the number of nodes that have to acknowledge a write for it to succeed - Read consistency R - the number of nodes that have to respond to a read operation for it to succeed -Choosing levels where R + W > N gaurantees that the most recently written value is returned. Write-heavy systems sometimes pick W = 1 and R = N, which allows writes to be acknowledged by just one node, but requires all replicas to be available for reads. Increasing W or R increases latency and raises requirements for node availability. Decreasing them improves system availability while sacrificing consistency. +Choosing levels where R + W > N helps reduce the chance of stale reads by forcing read and write quorums to overlap. Write-heavy systems sometimes pick W = 1 and R = N, which allows writes to be acknowledged by just one node, but requires all replicas to be available for reads. Increasing W or R increases latency and raises requirements for node availability. Decreasing them improves system availability while sacrificing consistency. A level of floor(N / 2) + 1 is called a "quorum", or majority of votes. In a system with 2f + 1 nodes, the system can keep responding even when up to f become unavailable. This does not, however, guarantee monotonicity in cases of incomplete writes. @@ -140,11 +140,11 @@ Witness replicas help reduce storage costs while preserving consistency. --- -### Strong Eventual Consistency and CDRTs +### Strong Eventual Consistency and CRDTs -Under strong eventual consistency, updates are allowed to propagate to servers late or out of order, but when all updates finally propagate to target nodes, conflicts between them can be resolved and they can be merged to produce the same valid state. 
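As a quick sanity check on the R + W > N overlap rule above, a tiny sketch (parameter names are mine):

```python
def quorums_overlap(n: int, w: int, r: int) -> bool:
    """True when every read quorum must intersect every write quorum,
    i.e. a read is guaranteed to contact at least one replica that
    acknowledged the latest successful write."""
    return r + w > n

n = 5
majority = n // 2 + 1          # floor(N / 2) + 1 = 3, the "quorum" level
assert quorums_overlap(n, w=majority, r=majority)   # 3 + 3 > 5
assert quorums_overlap(n, w=1, r=n)                 # the write-heavy W=1, R=N setup
assert not quorums_overlap(n, w=2, r=2)             # 2 + 2 <= 5: quorums can miss each other
```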
Under some conditions, we can relax our consistency requirements by allowing operations to preserve additional state that allows the diverged states to be reconciled (merged) after execution. This is often implemented with Conflict-Free Replicated Data Types (CDRTs), as in the case of Redis. CDRTs are specialized data structures that preclude the existence of conflicts and allow ops to be applied in any order without changing the results. They are extremely useful in distributed systems and are often used in eventually consistent systems. +Under strong eventual consistency, updates are allowed to propagate to servers late or out of order, but when all updates finally propagate to target nodes, conflicts between them can be resolved and they can be merged to produce the same valid state. Under some conditions, we can relax our consistency requirements by allowing operations to preserve additional state that allows the diverged states to be reconciled (merged) after execution. This is often implemented with Conflict-Free Replicated Data Types (CRDTs), as in the case of Redis. CRDTs are specialized data structures that preclude the existence of conflicts and allow ops to be applied in any order without changing the results. They are extremely useful in distributed systems and are often used in eventually consistent systems. -The simplest CDRTs are operations-based Commutative Replicated Data Types (CmRDTs), which require ops to be side-effect free, commutative, and causally ordered. Another example is the unordered Grow-Only Set (G-Set), which supports additions, removals, merges, etc. A more complex example is Martin Kleppmann's conflict-free replicated JSON data type, which allows modifications on deeply-nested JSON documents. +The simplest CRDTs are operations-based Commutative Replicated Data Types (CmRDTs), which require ops to be side-effect free, commutative, and causally ordered. 
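As a loose illustration of why conflict-free merges work, here is a state-based grow-only counter (my own sketch, a state-based cousin of the op-based CmRDTs discussed here, not an example from the book):

```python
class GCounter:
    """Grow-only counter: one slot per replica. Merge is an element-wise
    max, which is commutative, associative, and idempotent, so replicas
    converge no matter the order in which updates arrive."""

    def __init__(self, replica_id: int, n_replicas: int):
        self.replica_id = replica_id
        self.slots = [0] * n_replicas

    def increment(self):
        self.slots[self.replica_id] += 1

    def merge(self, other: "GCounter"):
        self.slots = [max(a, b) for a, b in zip(self.slots, other.slots)]

    def value(self) -> int:
        return sum(self.slots)

a, b = GCounter(0, 2), GCounter(1, 2)
a.increment(); a.increment()   # two increments on replica 0
b.increment()                  # one increment on replica 1
a.merge(b); b.merge(a)         # merge in either order...
assert a.value() == b.value() == 3   # ...and both replicas converge
```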
Another example is the unordered Grow-Only Set (G-Set), which supports additions and merges. A more complex example is Martin Kleppmann's conflict-free replicated JSON data type, which allows modifications on deeply nested JSON documents. --- diff --git a/src/assets/articles/databaseInternalsChapter12.md b/src/assets/articles/databaseInternalsChapter12.md index c1b6233..25d09b6 100644 --- a/src/assets/articles/databaseInternalsChapter12.md +++ b/src/assets/articles/databaseInternalsChapter12.md @@ -1,8 +1,8 @@ --- -title: Database Internals Ch. 12 - Anti-Entropy and Dissemination +title: Database Internals Ch. 12 - Anti-Entropy & Dissemination description: Notes on Chapter 12 of Database Internals by Alex Petrov. Anti-Entropy and Dissemination in distributed systems, including read repair, hinted handoff, Merkle Trees, and gossip dissemination. published: March 21, 2026 -updated: March 21, 2026 +updated: March 29, 2026 minutesToRead: 7 path: /articles/database-internals-chapter-12/ image: /images/database-internals.jpg @@ -13,7 +13,7 @@ tags: collection: slug: database-internals title: Database Internals - shortTitle: Ch. 12 - Anti-Entropy and Dissemination + shortTitle: Ch. 12 - Anti-Entropy & Dissemination shortDescription: Anti-Entropy and Dissemination in distributed systems, including read repair, hinted handoff, Merkle Trees, and gossip dissemination. order: 12 --- diff --git a/src/assets/articles/databaseInternalsChapter13.md b/src/assets/articles/databaseInternalsChapter13.md index aa4d8b7..9d789b3 100644 --- a/src/assets/articles/databaseInternalsChapter13.md +++ b/src/assets/articles/databaseInternalsChapter13.md @@ -2,7 +2,7 @@ title: Database Internals Ch. 13 - Distributed Transactions description: Notes on Chapter 13 of Database Internals by Alex Petrov. Distributed Transactions, including two-phase commit, Spanner, partitioning, sharding, consistent hashing, and coordination avoidance. 
published: March 26, 2026 -updated: March 26, 2026 +updated: March 29, 2026 minutesToRead: 9 path: /articles/database-internals-chapter-13/ image: /images/database-internals.jpg @@ -133,7 +133,7 @@ Clients then route requests based on the routing key. This is typically called "sharding", where every replica set acts as the single source for a subset of data. We want to distribute reads and writes as evenly as possible, sizing partitions appropriately. -In order to maintain balance, the DB also has to repartition the data when nodes are added or removed. +In order to maintain balance, the database also has to repartition the data when nodes are added or removed. In order to reduce range hot-spotting, some DBs use a hash of the value as the routing key. A naive approach is to map keys to nodes with something like `hash(v) % N`, where N is the number of nodes. The downside of this is that if the number of nodes changes, the system is immediately unbalanced and needs to be repartitioned. @@ -171,7 +171,7 @@ Many other DBMSs and FLP Impossibility shows that it is impossible to guarantee consensus in a completely asynchronous system in unbounded time. +FLP Impossibility shows that deterministic consensus cannot guarantee both safety and termination in a completely asynchronous system if even one process may fail. We've discussed the tradeoffs between failure detection accuracy and speed. Consensus algorithms assume an async model and guarantee safety while using an external failure detection algorithm to guarantee liveness. Because failure detection is not always fully accurate, there will be some situations where the algorithm waits for a process that is incorrectly accused of being faulty. @@ -80,7 +80,7 @@ It uses a hierarchical distributed key-value store, which is used to ensure a to Processes in ZAB are either a follower or a (temporary) leader. The leader executes algorithm steps, broadcasts messages to followers, and establishes the event order. 
-All writes and reads of the most recent values are routed to the leader. +All writes, and reads that require the most recent values, are routed to the leader. The protocol timeline is split into epochs, with one leader per epoch. The process starts by using leader election to find a prospective leader. diff --git a/src/assets/articles/databaseInternalsChapter5.md b/src/assets/articles/databaseInternalsChapter5.md index e647504..f870569 100644 --- a/src/assets/articles/databaseInternalsChapter5.md +++ b/src/assets/articles/databaseInternalsChapter5.md @@ -1,8 +1,8 @@ --- -title: Database Internals Ch. 5 - Transaction Processing & Recovery +title: Database Internals Ch. 5 - Transaction Processing and Recovery description: Notes on Chapter 5 of Database Internals by Alex Petrov. Transaction Processing and Recovery in Database Management Systems. published: February 27, 2026 -updated: March 18, 2026 +updated: March 29, 2026 minutesToRead: 12 path: /articles/database-internals-chapter-5/ image: /images/database-internals.jpg @@ -13,7 +13,7 @@ tags: collection: slug: database-internals title: Database Internals - shortTitle: Ch. 5 - Transaction Processing & Recovery + shortTitle: Ch. 5 - Transaction Processing and Recovery shortDescription: Transaction Processing and Recovery in Database Management Systems. order: 5 --- @@ -31,14 +31,14 @@ This post contains my notes on Chapter 5 of Martin Kleppmann and others have raised concerns over the assumptions we make with ACID, it is still an important concept to learn. In short, ACID means: 1. Atomicity - transactions are indivisible, meaning all-or-nothing. All steps within a transaction are either committed (applied) or aborted (rolled back and possibly retried). -2. Consistency - an app-specific guarantee (controlled by the app, not the DBMS); each transaction brings the DB from one valid state to another with all constraints and rules intact. +2. 
Consistency - an app-specific guarantee (controlled by the app, not the DBMS); each transaction brings the database from one valid state to another with all constraints and rules intact. 3. Isolation - concurrent transactions can execute without interference. -4. Durability - once a transaction has been committed, all db state must be persisted to disk in order to survive system failures, restarts, etc. +4. Durability - once a transaction has been committed, all database state must be persisted to disk in order to survive system failures, restarts, etc. There are several components required to manage transactions: - Lock manager - guards access to resources and prevents concurrent accesses that would violate data integrity -- Page cache - serves as an intermediary between persistent storage and the rest of the storage engine. All changes to the DB state are applied here first. +- Page cache - serves as an intermediary between persistent storage and the rest of the storage engine. All changes to the database state are applied here first. - Log manager - holds a history of the operations applied to cached pages that are not yet synced with persistent storage. This guarantees that operations won't be lost in case of crashes. It is also referenced when aborting transactions. --- @@ -114,7 +114,7 @@ Concurrency control is a set of techniques for handling interactions between con #### Serializability -A "schedule" is a list of ops required to execute a set of transactions from the db's perspective. A schedule is "complete" if it contains all ops from every transaction executed in it. It is "serial" when transactions are executed independently and in serial (one after the other). "Serializable" schedules allow us to execute transactions concurrently while maintaining the correctness of a serial schedule. +A "schedule" is a list of ops required to execute a set of transactions from the database's perspective. 
A schedule is "complete" if it contains all ops from every transaction executed in it. It is "serial" when transactions are executed independently and in serial (one after the other). "Serializable" schedules allow us to execute transactions concurrently while maintaining the correctness of a serial schedule. #### Transaction Isolation @@ -165,11 +165,11 @@ With Pessimistic Concurrency Control (PCC), transaction conflicts are determined #### Lock-Based Concurrency Control -Lock-based concurrency control schemes are a form of PCC that use locks on db objects instead of using concurrency control to resolve schedules. Downsides include contention and scalability issues. Two-phase locking (2PL) is a common approach. +Lock-based concurrency control schemes are a form of PCC that use locks on database objects instead of using concurrency control to resolve schedules. Downsides include contention and scalability issues. Two-phase locking (2PL) is a common approach. When locks are introduced into the system we must consider and handle deadlocks. Strategies exist such as timeouts and "Conservative 2PL", but they limit concurrency. Typically, DBMS use a transaction manager to detect and avoid deadlocks. This is usually done with a "waits-for" graph. Cycles in the graph represent deadlocks. Detection can be done periodically or continuously. Transaction managers typically prioritize older transactions. -Locks are used to isolate and schedule overlapping transactions and manage DB contents, but not internal storage structures. They can guard either a single key or a set of keys, and are stored outside of the tree and managed by the DB lock manager. Latches, on the other hand, guard physical representations - tree structure and page contents. Since a modification on a leaf level might propagate up to higher levels, latches might have to be obtained on multiple levels. To increase concurrency, latches should be held for the smallest possible duration. 
Readers-Writes Locks (RWLs) allow multiple concurrent readers access to an object, with only writers needing to obtain exclusive access. "Latch crabbing" is a simple and optimistic method that allows holding latches for less time and releasing them as soon as it's clear that the executing operation doesn't need them anymore. +Locks are used to isolate and schedule overlapping transactions and manage database contents, but not internal storage structures. They can guard either a single key or a set of keys, and are stored outside of the tree and managed by the database lock manager. Latches, on the other hand, guard physical representations - tree structure and page contents. Since a modification on a leaf level might propagate up to higher levels, latches might have to be obtained on multiple levels. To increase concurrency, latches should be held for the smallest possible duration. Readers-Writer Locks (RWLs) allow multiple concurrent readers access to an object, with only writers needing to obtain exclusive access. "Latch crabbing" is a simple and optimistic method that allows holding latches for less time and releasing them as soon as it's clear that the executing operation doesn't need them anymore. Blink-Trees, which use high keys and sibling links, allow a state called a "half-split". This approach can reduce contention and simplify concurrent access while reducing the number of locks held during tree state modifications. More importantly, it allows reads concurrent to structural tree changes and helps prevent deadlocks. diff --git a/src/assets/articles/databaseInternalsChapter6.md b/src/assets/articles/databaseInternalsChapter6.md index 03da94a..5f01d04 100644 --- a/src/assets/articles/databaseInternalsChapter6.md +++ b/src/assets/articles/databaseInternalsChapter6.md @@ -2,7 +2,7 @@ title: Database Internals Ch. 6 - B-Tree Variants description: Notes on Chapter 6 of Database Internals by Alex Petrov.
B-Tree implementation techniques, optimizations, and real-world variants. published: March 1, 2026 -updated: March 1, 2026 +updated: March 29, 2026 minutesToRead: 6 path: /articles/database-internals-chapter-6/ image: /images/database-internals.jpg @@ -26,8 +26,12 @@ This post contains my notes on Chapter 6 of [!NOTE] > The book claims that the subtrees will have size sqr(N), but I believe they are actually sqrt(N). diff --git a/src/assets/articles/databaseInternalsChapter7.md b/src/assets/articles/databaseInternalsChapter7.md index 7ec27dd..14eb519 100644 --- a/src/assets/articles/databaseInternalsChapter7.md +++ b/src/assets/articles/databaseInternalsChapter7.md @@ -26,8 +26,12 @@ This post contains my notes on Chapter 7 of 11 minute read • March 8, 2026 +10 minute read • March 8, 2026
This post contains my notes on Chapter 8 of _Database Internals_ by Alex Petrov. These notes are intended as a reference and are not meant as a substitute for the original text. I found Timilehin Adeniran's notes on _Designing Data-Intensive Applications_ extremely helpful while reading that book, so I thought I'd try to do the same here. @@ -43,7 +43,7 @@ Every concurrency problem has some properties of a distributed system. Threads a #### Shared State in a Distributed System -We can try to introduce some notion of shared memory to a distributed system, such as a database. Even if we solve the problems with concurrent access to it, we still cannot guarantee that all processes are in sync. To access this db, process can send messages over the communication medium. We'll therefore have to describe the system in terms of "synchrony" - whether the system is async, or if we can make some assumptions about timing. These assumptions give us options like timeouts and retries. +We can try to introduce some notion of shared memory to a distributed system, such as a database. Even if we solve the problems with concurrent access to it, we still cannot guarantee that all processes are in sync. To access this database, a process can send messages over the communication medium. We'll therefore have to describe the system in terms of "synchrony" - whether the system is async, or if we can make some assumptions about timing. These assumptions give us options like timeouts and retries. We don't always know the "nature" of an issue - if we haven't received a response because of a network issue, because the resource is overloaded, or because of a system crash. "Failure models" describe the ways in which failures can occur and how we decide to handle them. "Fault tolerance" describes the degree to which our system keeps operating correctly even when failures occur. 
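Those timing assumptions are what make a pattern like timeout-plus-retry meaningful. A hedged sketch (the names and the failure simulation are mine, not from the book):

```python
class FlakyService:
    """Simulates a peer that times out a few times before answering."""

    def __init__(self, failures: int):
        self.failures = failures

    def call(self) -> str:
        if self.failures > 0:
            self.failures -= 1
            raise TimeoutError("no response within the deadline")
        return "ok"

def call_with_retries(op, max_attempts: int = 3):
    """Retry on timeout. Note that we still can't tell a slow or
    partitioned node from a crashed one - the timeout only bounds
    how long we are willing to wait before deciding."""
    for attempt in range(max_attempts):
        try:
            return op()
        except TimeoutError:
            if attempt == max_attempts - 1:
                raise  # out of attempts; surface the failure to the caller

assert call_with_retries(FlakyService(2).call) == "ok"
```

This is the "nature of the issue" problem in miniature: the retry loop treats every timeout the same way, whatever its actual cause.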
diff --git a/src/assets/articles/databaseInternalsSummary.md b/src/assets/articles/databaseInternalsSummary.md new file mode 100644 index 0000000..be8aaae --- /dev/null +++ b/src/assets/articles/databaseInternalsSummary.md @@ -0,0 +1,227 @@ +--- +title: Database Internals - Summary & Review +description: Summary and review of Database Internals by Alex Petrov. +published: March 29, 2026 +updated: March 29, 2026 +minutesToRead: 10 +path: /articles/database-internals-summary/ +image: /images/database-internals.jpg +tags: + - 'reading notes' + - 'databases' + - 'distributed systems' +collection: + slug: database-internals + title: Database Internals + shortTitle: Summary & Review + shortDescription: Summary and review of Database Internals by Alex Petrov. + order: 15 +--- + +## Database Internals - Summary & Review + +10 minute read • March 29, 2026
+ +This post contains my summary and review of _Database Internals_ by Alex Petrov. These notes are intended as a reference and are not meant as a substitute for the original text. I found Timilehin Adeniran's notes on _Designing Data-Intensive Applications_ extremely helpful while reading that book, so I thought I'd try to do the same here. + +--- + +### Part I - Storage Engines + +#### B-Trees and LSM Trees + +Storage engines are shaped less by asymptotic complexity than by hardware behavior, access patterns, and operational tradeoffs. +B-Trees and LSM Trees are the clearest example of this. +Both are ordered structures optimized for disk-backed storage, but they make very different choices around buffering, mutability, and maintenance. + +B-Trees are the most commonly used example of a read-oriented structure. +They use wide nodes, high fanout, and low height to reduce seeks while preserving efficient point lookups and range scans. +The real complexity is not just the tree itself, but everything required to make it practical: slotted pages, separator keys, sibling links, overflow handling, rebalancing, compression, bulk loading, and variants such as copy-on-write trees, buffered trees, and Bw-Trees. +Storage-engine design is about preserving ordered access while balancing read performance, write cost, space usage, and concurrency. + +LSM Trees start from the opposite side of the tradeoff space. +Instead of optimizing in-place updates, they buffer writes in memory, flush sorted immutable structures to disk, and use compaction to merge and reconcile data over time. +This reduces the cost of small writes and takes advantage of sequential I/O, but it pushes work into later maintenance and introduces read, write, and space amplification tradeoffs. +That is why components such as memtables, SSTables, tombstones, bloom filters, and leveled or size-tiered compaction matter so much. 
+One especially useful connection is that B-Trees and related structures often still appear inside LSM-based systems, whether for indexing or for comparison. + +| | Buffered | Mutable | Ordered | +| ------------ | -------- | ------- | ------- | +| B+Trees | | ✓ | ✓ | +| WiredTiger | ✓ | ✓ | ✓ | +| LA-Trees | ✓ | ✓ | ✓ | +| CoW Trees | | | ✓ | +| 2C LSM Trees | ✓ | | ✓ | +| MC LSM Trees | ✓ | | ✓ | +| FD-Trees | ✓ | | ✓ | +| BitCask | | | | +| WiscKey | ✓\* | | ✓\* | +| BW-Trees | | | \* | + +Buffering, immutability, and ordering properties of discussed storage structures
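The LSM write path described above can be sketched in a few lines (a toy of my own; real engines add a WAL, bloom filters, and background compaction):

```python
class TinyLSM:
    """Toy LSM tree: buffer writes in an in-memory memtable, flush it
    as an immutable sorted run when full, and answer reads
    newest-run-first so recent values shadow older ones."""

    def __init__(self, memtable_limit: int = 2):
        self.memtable = {}
        self.runs = []                 # newest first; each run is immutable
        self.limit = memtable_limit

    def put(self, key, value):
        self.memtable[key] = value
        if len(self.memtable) >= self.limit:
            # flush: sort and freeze the memtable as an on-"disk" run
            self.runs.insert(0, dict(sorted(self.memtable.items())))
            self.memtable = {}

    def get(self, key):
        if key in self.memtable:       # freshest data lives in memory
            return self.memtable[key]
        for run in self.runs:          # then search runs newest to oldest
            if key in run:
                return run[key]
        return None

db = TinyLSM()
db.put("a", 1); db.put("b", 2)   # the second put triggers a flush
db.put("a", 3)                   # the newer value shadows the flushed one
assert db.get("a") == 3 and db.get("b") == 2
```

Compaction would merge the runs list back down; that deferred maintenance is exactly where the read, write, and space amplification tradeoffs come from.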
+ +#### Transactions + +Transactions are the indivisible logical unit of work in database management systems. +They allow us to represent multiple operations in a single step. +ACID (atomicity, consistency, isolation, durability) is one of the most important concepts related to databases. +Transaction processing usually involves components such as the lock manager, page cache, and log manager. + +Most databases use a 2-level memory hierarchy: slower persistent storage (disk) and faster main memory (RAM). +Pages are cached in memory to reduce the number of disk accesses. +Page replacement algorithms use eviction policies such as FIFO, LRU, CLOCK, and LFU. +These policies have various tradeoffs surrounding precision (hit rate), overhead, and complexity. + +#### Recovery + +The WAL is an append-only auxiliary on-disk structure used for crash and transaction recovery. It has several functions: + +- It allows the page cache to buffer updates to disk-resident pages while ensuring durability +- It persists all ops on disk until cached copies of pages affected by these ops are synced on disk +- It allows lost in-memory changes to be reconstructed from the operation log in case of crash + +The WAL is usually coupled with a primary storage structure by the interface that allows trimming it whenever a checkpoint is reached. +Checkpoints tell the log system that log records up to a certain point aren’t required anymore. +“Fuzzy checkpointing” allows this to happen asynchronously and is a more practical approach. + +#### Concurrency Control + +Concurrency control is a set of techniques for handling interactions between concurrently executing transactions. They can be grouped into three buckets: + +- Optimistic Concurrency Control (OCC) +- Pessimistic Concurrency Control (PCC) +- Multiversion Concurrency Control (MVCC) + +A schedule is a list of ops required to execute a set of transactions from the database’s perspective.
+A schedule is “complete” if it contains all ops from every transaction executed in it. +It is “serial” when transactions are executed independently and in serial (one after the other). +Serializable schedules allow us to execute transactions concurrently while maintaining the correctness of a serial schedule. + +Isolation levels specify how and when parts of the transaction should become visible to other concurrent transactions. + +#### Read & Write Anomalies + +Read anomalies include: + +- “Dirty” reads - when a transaction reads uncommitted changes from other transactions +- Non-repeatable “fuzzy” reads - when a transaction queries the same row twice and gets different results +- “Phantom” reads - when a transaction queries a set of rows twice and gets different results (the range-query equivalent of a fuzzy read) + +Write anomalies include: + +- “Lost” updates - when two transactions attempt to update the same value and the second transaction, unaware of the first, overwrites the first transaction’s updates without taking them into account +- “Dirty” writes - when a transaction takes an uncommitted value (dirty read) and modifies and saves it +- Write “skew” - when each individual transaction in a set respects the invariants, but the combination of the transactions does not + +| | Dirty | Non-Repeatable | Phantom | +| ---------------- | ------- | -------------- | ------- | +| Read Uncommitted | Allowed | Allowed | Allowed | +| Read Committed | - | Allowed | Allowed | +| Repeatable Read | - | - | Allowed | +| Serializable | - | - | - | + +Isolation levels and allowed anomalies
+ +--- + +### Part II - Distributed Systems + +#### Distributed Algorithms + +Distributed algorithms serve many purposes, such as: + +- Coordination - a process that supervises the actions and behaviors of several workers +- Cooperation - multiple participants relying on one another for finishing their task +- Dissemination - processes cooperating in spreading information to all interested parties +- Consensus - achieving agreement among multiple processes + +#### Two Generals, FLP Impossibility, and Byzantine Failures + +The Two Generals problem is a thought experiment that shows that it is impossible to achieve an agreement between two parties if communication is asynchronous and links fail. +FLP Impossibility shows that deterministic consensus cannot guarantee both safety and termination in a completely asynchronous system if even one process may fail. +Arbitrary (a.k.a. “Byzantine”) faults are where a process continues executing algorithm steps, but in a way that contradicts the algorithm. +These can be caused by software bugs, malicious actors, etc. + +#### Failure Detection + +Failures can occur at the link level or at the process level. +There are always tradeoffs between wrongly suspecting alive processes of being dead (false-positives) and giving dead processes the benefit of the doubt (false-negatives). + +We can query the state of a remote process by triggering one of two periodic processes: + +- Ping - checks if the process is still alive by sending it a message and asserting that it responds within a specified timeframe +- Heartbeat - the process actively notifies its peers that it’s still running by sending messages to them + +Gossip provides another approach that avoids relying on a single-node view to make the decision. +Gossip collects and distributes the state of neighboring processes, with unresponsive nodes eventually being considered failed. +It increases the number of messages in the system, but allows info to spread more reliably.
+In addition to failure detection, gossip is used for information propagation and dissemination. + +#### Leader Election + +To reduce synchronous overhead and the number of message round-trips required to reach a decision, some algorithms elect a leader process. +The leader is responsible for executing and coordinating steps of a distributed algorithm. +Possible solutions include the Bully algorithm, Invitation algorithm, and Ring algorithm. + +#### Replication & Consistency + +The CAP conjecture describes the tradeoffs between consistency C, availability A, and partition tolerance P. +The conjecture states that a system can only choose between consistency and availability when a partition occurs. +The two most common approaches are “AP” and “CP”. +CP systems prefer failing requests to serving potentially inconsistent data. +AP systems loosen the C requirements and allow serving potentially inconsistent values during the request. + +A consistency model can be thought of as a contract between participants. +It describes what expectations clients might have about returned values in the presence of replication and concurrent accesses. +Consistency models include strict consistency, linearizability, sequential consistency, causal consistency, and eventual consistency. + +Some systems opt for eventual consistency and use tunable parameters that follow the CAP conjecture. +Strong eventual consistency is gaining traction with Conflict-Free Replicated Data Types (CRDTs). + +#### Distributed Transactions + +To make multiple (possibly remote) operations appear atomic, we need to use a class of algorithms called “atomic commitment”. +These algorithms disallow disagreements between participants by not committing if even one participant voted against it. +Two-phase commit (2PC) is the most straightforward protocol for distributed commitment, allowing multi-partition atomic updates. + +The two most common approaches for distributed transactions are Calvin and Spanner.
+Calvin sequences and batches transactions, and uses Paxos for determining which transactions make it into the current epoch (batch). +Unlike Calvin, Spanner uses 2PC over consensus groups per partition (shard). +It uses Paxos for consistent transaction log replication, 2PC for cross-shard transactions, and TrueTime for deterministic transaction ordering. +This means that multi-partition transactions have a higher cost compared to Calvin, but Spanner usually wins in terms of availability. + +#### Consensus + +Consensus algorithms in distributed systems allow multiple processes to reach an agreement on a value. +Atomic broadcast algorithms such as ZooKeeper's ZAB ensure a total order of events and the atomic delivery necessary to maintain consistency between replica states. +The two most widespread consensus algorithms are Paxos and Raft, with the latter being considered easier to reason about and implement. +In adversarial environments, Byzantine fault-tolerant algorithms like PBFT must be employed. + +--- + +### Review & Thoughts + +#### Overall Review + +I found this book to be a great deep-dive into database internals, storage engines and building blocks, and distributed systems. +The first half of the book offered unique depth into structures like B-Trees and LSM Trees. +I found the second half more interesting (and more applicable to my work), but it seems to overlap heavily with books like Designing Data-Intensive Applications. + +#### Who Would I Recommend This To? + +Naturally, I would recommend this book to anyone interested in building or modifying their own storage engines. +I would also recommend it to any software engineers tasked with tuning existing systems, or picking the right tool for the job when building from the ground up. +I would not recommend this for engineers early in their career, and/or those studying for system design interviews. +Those readers would be better served by something much higher-level like Alex Xu's System Design Interview.
+ +#### Useful Tidbits + +This book introduced me to several data structures and algorithms that I would like to study further: + +- Merkle Trees, which can be used to build trees of content hashes (picture file-change detection in a system like Git) +- Bloom Filters, which efficiently (but probabilistically) check for the inclusion of an item in a set +- CRDTs, specifically the Automerge project + +--- + +Database Internals by Alex Petrov (O'Reilly). Copyright 2019 Oleksandr Petrov, 978-1-492-04034-7
From 481bf0dd3ef730838518899868876fe0eef26833 Mon Sep 17 00:00:00 2001 From: Noah Tigner