Author of the blog here. I had a great time writing this. By far the most complex article I've ever put together, with literally thousands of lines of js to build out these interactive visuals. I hope everyone enjoys.
The visuals are awesome; the bouncing-box is probably the best illustration of relative latency I've seen.
Your "1 in a million" comment on durability is certainly too pessimistic once you consider the briefness of the downtime before a new server comes in and re-replicates everything, right? I would think if your recovery is 10 minutes for example, even if each of three servers is guaranteed to fail once in the month, I think it's already like 1 in two million? and if it's a 1% chance of failure in the month failure of all three overlapping becomes extremely unlikely.
Thought I would note this because one-in-a-million is not great if you have a million customers ;)
> Your "1 in a million" comment on durability is certainly too pessimistic once you consider the briefness of the downtime before a new server comes in and re-replicates everything, right?
Absolutely. Our actual durability is far, far, far higher than this. We believe that nobody should ever worry about losing their data, and that's the peace of mind we provide.
> Instead of relying on a single server to store all data, we can replicate it onto several computers. One common way of doing this is to have one server act as the primary, which will receive all write requests. Then 2 or more additional servers get all the data replicated to them. With the data in three places, the likelihood of losing data becomes very small.
Is my understanding correct, that this means you propagate writes asynchronously from the primary to the secondary servers (without waiting for an "ACK" from them for writes)?
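To make sure I'm asking about the right distinction, here's roughly what I mean, as a toy sketch in Python (illustrative names only, obviously not your actual implementation):

```python
# Toy contrast between async and semi-sync replication ack timing (illustrative only).

class Replica:
    def __init__(self):
        self.log = []

    def receive(self, record):
        self.log.append(record)
        return True  # acknowledgement back to the primary

def write_async(primary_log, replicas, record):
    primary_log.append(record)
    ack = "ACK (replicas not yet confirmed)"  # client would be answered at this point
    for r in replicas:                        # replication trails behind the ack, so a
        r.receive(record)                     # primary crash in that window can lose the write
    return ack

def write_semi_sync(primary_log, replicas, record, quorum=1):
    primary_log.append(record)
    acks = sum(1 for r in replicas if r.receive(record))
    if acks < quorum:
        raise RuntimeError("not enough replica acknowledgements")
    return "ACK (at least one replica has the write)"  # client answered only after replica ack

primary, replicas = [], [Replica(), Replica()]
print(write_async(primary, replicas, "row 1"))
print(write_semi_sync(primary, replicas, "row 2"))
```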
Kudos to whoever patiently & passionately built these. On an off-topic note: this is a great perspective for building realistic coursework for middle and high school students. I'm sure they learn faster & better with visuals like these.
1 in a million is the probability that all three servers die in one month, without swapping out the broken ones.
So at some point in the month all the data is gone.
If you replace the failed (or failing) node right away, the failure probability goes down greatly.
You would likely need the probability of a node going down within a 30-minute time span, assuming the migration can be done in 30 minutes.
(I hope this calculation is correct.)
If the probability is 1% per month, then per node:
0.01 / (43800/30) ≈ 0.0000068 probability per 30 min.
For three instances failing in the same window:
0.0000068^3 ≈ 3.2e-16 probability per 30 min that all go down.
Calculated for one month (1460 windows):
3.2e-16 * 1460 ≈ 4.7e-13.
So roughly one in two trillion that all three servers go down within the same 30-minute window somewhere in one month.
After the 30 minutes another replica will already be available, making the data safe.
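The same arithmetic as a quick Python sanity check (same assumptions: independent failures, 1% chance per node per month, 30-minute replacement window):

```python
# Back-of-the-envelope durability estimate, assuming independent failures.
p_month = 0.01                  # chance a single node fails in a month
windows = 43800 / 30            # 30-minute windows in an average month (~1460)
p_window = p_month / windows    # chance a single node fails in a given window
p_all_three = p_window ** 3     # all three fail in the same window
p_any_window = p_all_three * windows

print(f"per node, per window: {p_window:.1e}")      # ~6.8e-06
print(f"all three, per month: {p_any_window:.1e}")  # ~4.7e-13, roughly 1 in 2 trillion
```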
I'm happy to be corrected.
The probability course was some years back :)
One thing I will suggest: you’re assuming failures are uncorrelated and have an equally weighted chance per unit of time.
Neither is a good assumption, in my experience. Failures being correlated to any degree greatly increases the chances of what the aviation world refers to as “the holes in the Swiss cheese lining up”.
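To put a number on it (the shared-cause probability below is purely illustrative), even a tiny correlated failure mode swamps the independent-failure estimate from upthread:

```python
# Independent vs. correlated failure of all three replicas in one 30-minute window.
# Both numbers below are illustrative, not measured failure rates.
p_indep = 6.8e-6    # per-node failure probability in a window (from the estimate above)
p_shared = 1e-7     # a shared cause (power, rack, bad deploy) takes out all three at once

p_all_independent = p_indep ** 3
p_all_correlated = p_shared + (1 - p_shared) * p_indep ** 3

print(f"independent only : {p_all_independent:.1e}")  # ~3.1e-16
print(f"with shared cause: {p_all_correlated:.1e}")   # ~1.0e-07, many orders of magnitude worse
```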
The animations are fantastic, and awesome job with the interactivity. I often find myself having to explain latency to folks in my work, and being able to see the extreme difference in latencies for something like an HDD vs an SSD makes it much easier for some people to understand.
Edit: And for real, fantastic work, this is awesome.
The level of your effort really shows through. If you had to make a ballpark guess, how much time do you think you put in? (And I realize keyboard time vs. kicking-it-around-in-your-head time are quite different.)
Thank you! I started this back in October, but of course have worked on plenty of other things in the meantime. But this was easily 200+ hours of work spread out over that time.
If this helps as context, the git diff for merging this into our website was: +5,820 −1
Half on topic: what libs/etc did you use for the animations? Not immediately obvious from the source page.
(It's a topic I'm deeply familiar with, so I don't have a comment on the content; it looks great on a skim!) But I've been sketching animations for my own blog and haven't liked the last few libs I tried.
Interesting. Running any chrome extensions that might be messing with things? Alternatively, if you can share any errors you're getting in the console lmk.
This is beautiful and brilliant, and also is a great visual tool to explain how some of the fundamental algorithms and data structures originate from the physical characteristics of storage mediums.
I wonder if anyone remembers the old days when you programmed your own custom defrag util to place your boot libs and frequently used apps on the outer tracks of the hard drive, so they loaded faster due to the higher linear velocity of the outermost tracks :)
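For anyone who never did this: the win comes from the platter spinning at a constant RPM while outer tracks hold more sectors, so more data passes under the head per revolution. A rough back-of-the-envelope (the per-track capacities are made up for illustration):

```python
# Rough sequential-throughput comparison, outer vs. inner track of a 7200 RPM drive.
# Track capacities are illustrative; real drives use many recording zones in between.
rpm = 7200
revs_per_second = rpm / 60        # 120 revolutions per second
outer_track_bytes = 2_000_000     # outer tracks pack more sectors per revolution
inner_track_bytes = 1_000_000

print(f"outer: {outer_track_bytes * revs_per_second / 1e6:.0f} MB/s")  # ~240 MB/s
print(f"inner: {inner_track_bytes * revs_per_second / 1e6:.0f} MB/s")  # ~120 MB/s
```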
Were you at all inspired by the work of Bartosz Ciechanowski? My first thought was that you all might have hired him to do the visuals for this post :)
I was delighted to see your models of tape operations as I used it a lot in the COBOL days.
For reasons discussed in your article we would arrange tape processing as much as possible in sequential scans, something at which COBOL was quite excellent. One common performance problem was when the COBOL processing speed could not keep up with the flow of blocks coming off the drive head.
In this case you would see the drive start to overshoot as it read more blocks than the COBOL program could handle. The drive would begin a painful jump-forward/spool-backward motion, which made the performance issue quite visible. You would then eyeball the code to understand why the program was not keeping up, correct it, and resubmit until the motion disappeared.
Amazing presentation. It really helps to understand the concepts.
The only thing I'd add is that it understates the impact of SSD parallelism. 8-channel controllers are typical for high-end devices, and 4K random IOPS continue to scale with queue depth, but for an introduction the example is probably complex enough.
It is great to see PlanetScale moving in this direction and sharing the knowledge.
Just going off spec sheets from manufacturers and reviews (mostly consumer products, so enterprise should be the same or better).
There are only a few major NAND manufacturers: Samsung, Micron, Kioxia / Western Digital, SK Hynix, and their branded products are usually the best.
There are also several 3rd party controller developers: Phison, Marvell, Silicon Motion, which I think are the largest, and then a bunch of others.
I hadn't looked at this in a couple of years; it seems 16-channel controllers are more common now, but only on high-end enterprise devices.
4KB random read/write specs are definitely not trustworthy without testing. They are usually measured at max queue depth and, at least for consumer devices, based on writing to a buffer in SLC mode, so they will be a lot lower once the buffer is exhausted. Enterprise specs might be more realistic, but there isn't as much public testing data available.
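If you want to check a drive yourself, a sweep along these lines shows how 4K random-read IOPS scale with queue depth (a rough sketch: assumes fio is installed, and the device path and run time are placeholders, so point it at something disposable):

```python
# Sketch: sweep 4K random-read queue depth with fio and report IOPS per depth.
import json
import subprocess

TARGET = "/dev/nvme0n1"  # placeholder: use a scratch device or test file you can hammer

for depth in (1, 4, 16, 64, 256):
    out = subprocess.run(
        ["fio", "--name=qd-sweep", f"--filename={TARGET}",
         "--rw=randread", "--bs=4k", "--direct=1", "--ioengine=libaio",
         f"--iodepth={depth}", "--runtime=30", "--time_based",
         "--output-format=json"],
        capture_output=True, text=True, check=True,
    )
    iops = json.loads(out.stdout)["jobs"][0]["read"]["iops"]
    print(f"iodepth={depth:>3}: {iops:,.0f} IOPS")
```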
Hi, what actually are the _metal_ instances being used when you're on EC2 that have local NVMe attached? Last time I looked, apart from the smallest/slowest Graviton, you have to spend circa 2.3k USD/mo to get a bare-metal instance from AWS - https://blog.alexellis.io/how-to-run-firecracker-without-kvm...
Hi there, PS employee here. In AWS, the instance types backing our Metal class are currently in the following families: r6id, i4i, i3en and i7ie. We're deploying across multiple clouds, and our "Metal" product designation has no direct link to Amazon's bare-metal offerings.
The visualizations are excellent, very fun to look at and play with, and they go along with the article extremely well. You should be proud of this, I really enjoyed it.
I don’t see any animations on Safari. Also, I’d much prefer a variable-width font; monospace prose is hard to read. While I can use Reader Mode, that removes the text coloring, and it would likely also hide the visuals (if they were visible in the first place).
The visuals add a lot to this article. A big theme throughout is latency, and the visuals help the reader see why tape is slower than an HDD, which is slower than an SSD, etc. Also, it's just plain fun!
I'm curious, what do you do on the internet without js these days?
> I'm curious, what do you do on the internet without js these days?
Browse the web, send/receive email, read stories, play games, the usual. I primarily use native apps and selectively choose what sites are permitted to use javascript, instead of letting websites visited on a whim run javascript willy nilly.