Martin Kleppmann, Associate Professor at the University of Cambridge and author of the best-selling O’Reilly book, Designing Data-Intensive Applications, talks to host Adi Narayan about local-first collaboration software. They discuss what the term means, how it leads to simpler application architectures compared to the cloud-first model, and the benefits to developers and users from keeping all of their data on their own devices. Martin goes into detail about how applications can synchronize data with and without a server, as well as conflict-resolution techniques, and the open-source library Automerge, which implements CRDTs and which developers can use out-of-the-box. He also clarifies what kinds of applications would be suitable for the local-first approach. In the context of AI, they discuss vibe coding, local-first apps, and how the conflict-resolution work that enables data to be synchronized between users can also work with human-AI collaboration.
Brought to you by IEEE Computer Society and IEEE Software magazine.
Show Notes
Related Episodes
- SE Radio 546: Dietrich Ayala on the InterPlanetary File System
- SE Radio 252: Christopher Meiklejohn on CRDTs
- SE Radio 561: Dan DeMers on Dataware
- SE Radio 241: Kyle Kingsbury on Consensus in Distributed Systems
References
- Designing Data-Intensive Applications (DDIA) — an O’Reilly book by Martin Kleppmann (The Wild Boar Book)
- Automerge — open-source project
- The past, present, and future of local-first software — Martin Kleppmann’s talk at the Local-first conference in Berlin (May 2024)
- Local-first software and geopolitical risk — Martin Kleppmann’s talk
- Writing a book: is it worth it? — Martin Kleppmann’s blog post on the economics of publishing a very successful technical book
- Local-first software: You own your data, in spite of the cloud — the seminal local-first essay
- Local-first conf 2026
- Practical Local-First Software with Automerge — Peter van Hardenberg
- Martin Kleppmann on Bluesky.
Transcript
Transcript brought to you by IEEE Software magazine.
This transcript was automatically generated. To suggest improvements in the text, please contact [email protected] and include the episode number and URL.
Adi Narayan 00:00:18 Welcome to Software Engineering Radio. I’m your host, Adi Narayan, and I’m here today with Martin Kleppmann. Martin is an Associate Professor at the University of Cambridge, working on Local-first collaboration software and distributed system security. He’s the author of Designing Data-Intensive Applications, which is regarded as one of the foundational texts for data engineering and distributed systems and has sold more than 300,000 copies globally. It’s one of the most popular books on O’Reilly, and an updated version comes out sometime this year. He’s also one of the founders of the open-source library Automerge that allows developers to build collaborative apps where data merges automatically. Prior to this, he founded two startups that were sold to LinkedIn and Redgate Software. Martin, welcome to Software Engineering Radio.
Martin Kleppmann 00:01:03 Hello. It’s great to be here.
Adi Narayan 00:01:04 Yeah, a lot of engineers may be familiar with your famous book on architecting big data systems, but your recent focus has been on Local-first software, which we’re going to be talking about today. To begin, can you define what that term means and a little bit about its history?
Martin Kleppmann 00:01:19 Yeah, sure. So, a few years ago we were working on this type of software where we wanted to combine the best of two different worlds. Those two worlds are, on the one hand, old-fashioned pre-internet software. From the 1990s you would just have some software; it would run on your computer, store its files on your computer — a very simple model, which we now call local-only software. But then we got cloud software, and with cloud software we get real-time collaboration like in Google Docs and Figma and so on. And that’s super convenient, but now the files are no longer on your own computer anymore, and so if you ever got locked out of your account, for example, then all of your files are gone. And so Local-first is really trying to be the best of both of those two worlds, where we have the collaboration features that we know and love from cloud software, but at the same time the data is also stored on your own computer locally, and so therefore it’s there where nobody can take it away from you.
Adi Narayan 00:02:15 I think you hit on a point there: these days collaborative software is so well entrenched, but the default is that it lives in the cloud. When you say Local-first, it’s more than just saving the data locally — there are a lot more principles that govern this. What are those?
Martin Kleppmann 00:02:31 Yeah, so we realized that when you keep data locally, that opens up a bunch of really interesting advantages for software development but also some challenges. Some of the advantages are, for example, you can make the software work fine offline. So, a user can just keep editing the data in whatever way they like, regardless of whether they happen to have an internet connection right now. And then it’ll just resync with the server and with your collaborators the next time you get an internet connection again. But at the same time, while you’re online, you can have real-time collaboration. It also has performance advantages, because if you click a button you don’t have to wait for a network round trip before something happens. So, the user experience can actually be really great because it means that the software just responds really well. And finally, we found that from the point of view of the app developer, it can actually be a much nicer software development model compared to writing cloud software, because if you’re writing cloud software, you’re first writing a frontend, which might be JavaScript in a web browser or maybe a mobile app, but then that has to sync via a server, and then you have to define a REST API or maybe a WebSocket protocol or something like that if you want it to be real time, and then you have to write the backend.
Martin Kleppmann 00:03:43 There’s just so much complexity in all these different levels of infrastructure that you have to build. And so, part of the goal of Local-first software is actually to simplify that infrastructure so that essentially you build only the frontend, and all of the data sync and backend is just abstracted away.
Adi Narayan 00:04:00 So even small teams can build, you don’t need to have like a dedicated backend group and frontend group to make an app, but a smaller group can just build the entire thing end to end.
Martin Kleppmann 00:04:10 Exactly. That’s the hope. Yes. So right now, there are some services providing backends for Local-first software. So, you can then just build a front end and use a backend from some company. What we would like to gradually move towards long-term is to have open standards for these backends so that you could then plug in a backend from many different providers and just use whichever providers the cheapest for example. And then that way you’re not bound to any one company for the backend, but you have that flexibility and at the same time you can still build arbitrary software on top of it.
Adi Narayan 00:04:45 I think you covered a lot of ground there, and let’s dig into each of these things one by one. First, UX design: the fact that everything lives on the local machine means that the round trip is saved, and the app should be more responsive. What does that look like from a developer point of view? Is the approach to UX design very different?
Martin Kleppmann 00:05:04 So the idea here is that you essentially only ever work with local data, and any network communication just happens as a background process, and that then leads to these potentially really nice instant responses. And there’s been a bunch of apps that have used this very successfully, like Linear, as an example of a tracker essentially, that uses a Local-first-like architecture with a lot of local data in the browser. And they found that just the user experience advantage of being able to respond immediately to user input, when a user clicks something or presses a key, is nice. And you get a similar thing with, say, Google Docs or Figma, where likewise you need every user interaction to instantaneously reflect on the screen. You can’t be waiting on a network round trip every time you hit a key in a text document — that would just be painfully slow to work with, especially if you’re on a poor internet connection. And so, this instant responsiveness of Local-first software is quite a nice UX benefit.
Adi Narayan 00:06:03 You mentioned one example that Linear. What does that app do for folks who may not have heard about it?
Martin Kleppmann 00:06:08 Oh, it’s like a project management tool, like a bug tracker essentially.
Adi Narayan 00:06:12 So to summarize, almost all of the data is stored locally, so it doesn’t need any round trip to pull it, but it would have a cloud interface or a cloud server to sync it, so that any other user, when they’re logging in on a different device, is able to pull the same notebook or the same project, right?
Martin Kleppmann 00:06:29 Exactly. So, the way you implement this is with a sync engine. A sync engine you can think of as a database replication tool, essentially, but it replicates data between a database that’s embedded on the user’s own device — for example, in local storage in their web browser — and a database that’s on a server. Any update that you make to the data on your own local device gets synced to the server whenever an internet connection is available. And if another user has made an update to the database, that would typically go to the server and then be synced to the end user’s client whenever an internet connection is available. And so, this way you still get the real-time collaboration. Any change made by one user flows through the server and is synced to the other users with low latency, and each user’s local client then re-renders the user interface based on those changes coming in over the network. And that way, you get this really nice real-time collaboration. But at the same time, if the internet connection is unavailable right now, well then, you’re just working offline and you’re just storing the data locally in your own local copy of the database.
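To make the relay role of the server concrete, here is a toy sketch in JavaScript — all names are illustrative, not any real sync engine’s API. The server never interprets the document contents; it just forwards each client’s changes to every other connected client.

```javascript
// Toy sketch of a relay sync server: it doesn't interpret documents,
// it just forwards one client's changes to all other connected clients.
function makeRelay() {
  const clients = new Set();
  return {
    // a client registers a callback to receive other clients' changes
    connect(onChange) {
      clients.add(onChange);
      return () => clients.delete(onChange); // call to disconnect
    },
    // broadcast a change to everyone except its sender
    publish(sender, change) {
      for (const c of clients) if (c !== sender) c(change);
    },
  };
}

// Two clients share a relay; Bob sees Alice's edit with low latency,
// without any direct peer connection between the two devices.
const relay = makeRelay();
const aliceCb = () => {};       // Alice doesn't care about echoes here
const bobInbox = [];
relay.connect(aliceCb);
relay.connect(change => bobInbox.push(change));
relay.publish(aliceCb, { key: "title", value: "Hello" });
```

In a real system the callbacks would be WebSocket connections and the changes would be persisted server-side, but the relay shape is the same.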
Adi Narayan 00:07:36 I think the sync engines are really interesting and we’ll come back to that when we speak about sort of the challenges around Local-first to give a larger overview. Are there certain design principles that govern pretty much all Local-first software?
Martin Kleppmann 00:07:50 There are sort of challenges that arise with it. One fundamental issue, if you allow people to work offline, is that they might make conflicting updates while offline. This is something that software developers know very well from version control systems. Anyone who’s worked with Git in a team will know that sometimes you get merge conflicts because different people have edited the same file on different branches. And you get exactly that same pattern with other types of collaborative software too. Local-first per se doesn’t prescribe any particular way of handling those kinds of conflicts. In practice, we’ve had pretty good experiences using automatic merging algorithms, and so this Automerge library that you mentioned uses automatic merging algorithms so that if different users make edits to the same file, in most cases we can automatically combine them into a consistent state. But of course, if people are literally editing the same sentence of a text document, we will be able to merge those together, but the result probably won’t be a grammatically accurate sentence. In those kinds of situations it still becomes a UX problem of how we surface those kinds of conflicts to the user and help them resolve it.
Adi Narayan 00:09:07 Got it. Understood. I think there was a paper, or sort of a manifesto, that you published, and in that you had five core principles listed: offline-first; multi-device syncing; longevity — that the data is always with you; conflict-free merging; and performance — no spinners, everything is in the same place, so things are faster. Do you still stand by those? Are those the five guiding principles for when somebody’s going to build something that’s Local-first?
Martin Kleppmann 00:09:29 Yeah, so there were seven principles even in the original Local-first paper that we published. One thing that we didn’t really appreciate yet at the time is the simplification of the developer experience. So, in that paper we focus very much on the user experience and the benefits for users, and that is still very much relevant and true, but we’ve come to since appreciate that also there are significant advantages for app developers as well. And so, we’ve considered writing a follow up piece at some point which elaborates on that, but we haven’t gotten around to writing it yet.
Adi Narayan 00:10:03 Got it. What are some of those advantages to developers? Could you list a few?
Martin Kleppmann 00:10:06 It’s mostly just not having to think about the network because thinking about the network is really annoying because every time you make a request over the network, it might be that the user is on an unreliable coffee shop wi-fi and your request times out and then you just have no idea what happened. Like, did the data get to the server or not? You just have no idea. And so, like you’re constantly writing tons of error handling code and trying to figure out what should you even do in the error case. And the nice thing with using a sync engine is that the application code is only ever interacting with this local database that’s embedded on the client and the whole distributed systems problem of syncing the data between that local copy and the servers and to other collaborators that’s just dealt with by the implementation of the sync engine. So of course, the network communication is still happening, but it’s not happening in app code and so app developers don’t have to worry about it. And so essentially, we’re pushing these distributed systems problems down into the infrastructure and that can massively simplify the programming model.
Adi Narayan 00:11:06 I think you hit on something there, because as anyone who’s built an app knows, the amount of time you need to spend just making sure that all kinds of network failures are taken care of and handled correctly, without an unexpected failure or a 404 screen, is underrated — it ends up taking so much time in any production project. So, hundred percent agree with that. To the point that you mentioned about the app only interacting with the local database: is it correct to think of it as the app only interacts with the local database, and as the connection is improved, or if it comes out of an offline bubble, the database gets synced with the server or with another copy in the cloud, and then the app adapts to any new information that’s coming in? Is that accurate?
Martin Kleppmann 00:11:45 Yeah, exactly. So, take React, for example: it has this model where you can represent the user interface as a function of some model objects that exist in JavaScript. Typically, those would be just some plain old JavaScript objects, and then if the data changes, you re-render the React components and update the user interface based on that. And so, we can use a very similar model for Local-first software; it’s just that these model objects, rather than being plain old JavaScript objects, we make them the state of the sync engine. And so, whenever another user — another collaborator — has made some update to the data, that gets synced by the sync engine to the local copy, and then you re-render the user interface using React or whatever framework you’re using. And that way the remote collaborator’s changes are directly reflected in the user interface. And it’s actually a really straightforward, simple programming model, where you’re not having to worry about how you merge in the remote user’s changes, because the sync engine has done that merging already and all you need to do is re-render the UI.
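A minimal sketch of that programming model, with React stood in for by a plain render function (all names here are illustrative): the UI is a pure function of the local document, and when the sync engine delivers a remote change, the app’s only job is to re-render.

```javascript
// UI as a pure function of the local document state
const render = (doc) => `<h1>${doc.title}</h1>`;

// A view wired to a (hypothetical) sync engine: the engine calls
// applyRemoteChange when a collaborator's change arrives; the app code
// never touches the network, it only re-renders.
function makeSyncedView(initialDoc, onRender) {
  let doc = initialDoc;
  onRender(render(doc));                 // initial render
  return {
    applyRemoteChange(change) {
      doc = { ...doc, ...change };       // merging already done by the engine
      onRender(render(doc));             // all the app does is re-render
    },
  };
}

// Simulate a collaborator's edit arriving over the sync engine
const frames = [];
const view = makeSyncedView({ title: "Draft" }, html => frames.push(html));
view.applyRemoteChange({ title: "Final" });
```

With React you would replace `onRender` with a state update that triggers a component re-render, but the shape of the data flow is the same.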
Adi Narayan 00:12:53 And what does a sync engine merging with? Is the expectation that there is a server or is it just a bunch of different users with apps on their devices?
Martin Kleppmann 00:13:02 Yeah, there are different sync engines that take different approaches with regard to this. So, there are some systems that assume that there’s a server, and so it’s a somewhat traditional cloud model in that sense, but there are others that can work over a peer-to-peer network as well as a server. And so Automerge is an example of a library that’s designed for decentralized use, where if you want to use it with a single sync server you could certainly do that, but you could also use several servers side by side, for example. Or if it turns out that you are on a local Wi-Fi with another device but you don’t actually have internet access, or you maybe have a Bluetooth connection between two devices or something like that, you can just use that local connectivity as well. So, there’s no assumption that you have to go via a remote server. If you have some local connectivity or a peer-to-peer connection, that’s perfectly fine as well.
Martin Kleppmann 00:13:54 So all that Automerge requires really is some way of getting bytes from device A to device B and often relaying the bytes via a server is a convenient way of doing it because we know how to run that kind of infrastructure and peer-to-peer can sometimes be unreliable. So, relying on peer-to-peer only is dangerous. But having peer-to-peer in there as an option that you can use alongside server-based sync. That’s actually really empowering because it means that you might have a farmer on a remote field and they just want to sync some data between their phone and their tractor and they might not have cellular data connection where they are in some remote location, they should still be able to sync that because they have the two devices right next to each other. And so that’s also part of what we actually wanted to enable with Local-first. So, with Local-first, the local part doesn’t actually refer just to local storage but also to local networking. So, if you have two devices that are physically close to each other, then they should just be able to communicate over a local network and that should work just as well as going via a server.
Adi Narayan 00:14:57 That’s so interesting, because these days we just take it for granted, right? Even if there are two people sharing a document — say it’s me and my wife editing the same document — for us to see our changes, the changes have to go to a server, I don’t know, sitting in California or New York and then come back to us, and if that connection is not active, we can’t even see each other’s changes. And we just take it for granted. But like you were saying, if devices can share information through whatever means possible — be it a peer-to-peer network or Bluetooth — it gives you the same outcome without having to make those round-trip journeys to an external server sitting somewhere else.
Martin Kleppmann 00:15:30 Exactly. And if you think about it, it’s kind of ridiculous to send all of your data via AWS us-east-1 if you have two devices sitting right next to each other — there’s really no need for them to go via some cloud.
Adi Narayan 00:15:42 We’ll dig into some of these challenges a little bit later, but can you give us an idea of who’s using it these days? Like is it mostly folks making hobby projects or is it being applied in any form in bigger and popular apps as well?
Martin Kleppmann 00:15:55 So there are increasing numbers of production users of these types of software as well. So, for Automerge for example, it’s an open-source project but we have a bunch of companies commercially sponsoring it because these are companies using Automerge for their production products and they want to make sure that it’s a well maintained, good quality open-source project. And so, they’re voluntarily financially supporting the project like that. I think many of them are pre-launch, so I’m not sure I can name them right now, but there are definitely people using this for production software and getting really good results out of it.
Adi Narayan 00:16:31 When we think about apps say app like an Xtra or WhatsApp for that matter, there are Local-first elements there that I can notice. So, the fact that the app can always work regardless of whether you have internet or not, that let’s say in WhatsApp you can read the messages even though there is no internet connection or Xtra for that matter that saves all the data locally and not on the cloud. Do you see that there are mainstream apps using the principles, at least not all of them but some of them because they’re sort of almost common-sense principles at this point?
Martin Kleppmann 00:16:59 Yeah. Well, I would say Git, for example — which is mainstream among software developers, maybe not mainstream amongst the general population — Git is totally Local-first. It has this property that you can create a commit while you’re offline, you can rewrite your local history while you’re offline, and only for pushing and pulling do you need to be online. Everything else you can do offline — all of the version control stuff; you can look back at old versions of your repository and so on. And that’s something that we have come to take for granted as software developers, because it’s so useful to be able to do these kinds of things locally on your laptop, and it’s just really fast as well. Essentially what we’re trying to do with Local-first software is to bring these capabilities that software developers have with Git to many other types of apps and many other different file types. Because a big disadvantage of Git, of course, is that there’s no real-time collaboration, and you can’t really put anything other than text files into it and get sensible merging behavior. So, if you put a spreadsheet into Git, well, it’ll just be a binary blob, and if two people on different branches edit that spreadsheet, well then good luck merging that. And so essentially what we want to do is take Git-like capabilities but bring them to not just text files but spreadsheets and graphics apps and issue trackers and all these other kinds of software.
Adi Narayan 00:18:20 So is it right to think about it as Git-like abilities but with the ease of use of, say, Google Docs or Google Sheets?
Martin Kleppmann 00:18:27 Yes, exactly.
Adi Narayan 00:18:28 Got it. Alright. Since the time that this came about — it used to be called local-only software, and I think back in the day there was one iteration of it called offline-first software as well — what kinds of technological improvements have made it easier for developers to adopt this, or made it possible to resolve conflicts between users, synchronize data, the sync engine, and so forth? What are the key technological improvements that have enabled Local-first software to be adopted more?
Martin Kleppmann 00:18:56 Yeah, so one of the main things that a sync engine needs to do is to capture all of the edits that are made to some document or some file, and then it needs to be able to merge things if two different users concurrently edit the same file. Now, there are a few different algorithms which have been developed for this sort of thing. Google Docs, for example, uses a type of algorithm called operational transformation, which has been around since the early 90s, actually. This approach is able to merge concurrent updates: if different people edit the text on their respective devices, then this algorithm can merge those edits together in a clean way. But it does depend on a single central server. And so, as just discussed, part of what we’re trying to do with Local-first is also open the option of not having that one central server, and there’s a different family of algorithms which has been developed that can handle this type of merging and conflict resolution without assuming a server.
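The heart of operational transformation can be sketched with the insert-vs-insert case — a toy illustration, not Google Docs’ actual algorithm. Each side applies its own edit immediately, then applies the remote edit after transforming it against the local one, and both sides converge on the same text.

```javascript
// Toy OT sketch: transform an insert op against a concurrent insert op.
// If the other insert landed earlier in the text, shift our position right;
// the actor-id tiebreak keeps same-position inserts deterministic.
function transformInsert(op, other) {
  if (other.pos < op.pos || (other.pos === op.pos && other.actor < op.actor)) {
    return { ...op, pos: op.pos + other.text.length };
  }
  return op;
}

function apply(doc, op) {
  return doc.slice(0, op.pos) + op.text + doc.slice(op.pos);
}

// Two users concurrently edit "helloworld":
const a = { pos: 5, text: " ", actor: "alice" };  // insert a space
const b = { pos: 10, text: "!", actor: "bob" };   // append "!"

// Each side applies its own op first, then the transformed remote op
const docA = apply(apply("helloworld", a), transformInsert(b, a));
const docB = apply(apply("helloworld", b), transformInsert(a, b));
// docA and docB are identical: both converge on "hello world!"
```

Real OT systems must also handle deletes and longer op histories, which is where the central server helps: it imposes a single order on the operations.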
Martin Kleppmann 00:19:56 And they’re called Conflict-free Replicated Data Types, or CRDTs. And this is how I got into the whole area as well. When I first started working on collaboration software in 2015 or so, that was because I had just read the research on CRDTs, which was quite new at the time. We didn’t yet have the term Local-first, and we hadn’t yet really worked out these ideas and principles so well, but we did have these algorithms, and I saw in those algorithms the promise of building a better type of software. And so, I started doing research on CRDT algorithms myself, and initially that was quite theoretical — just trying to figure out how the merging behavior should actually work and how we implement it. And then over the years we put a lot of work into the efficiency and performance of these data structures, and now gradually we’re getting to the point where we feel like, okay, we have a solid set of not just algorithms but implementations and systems, and they’re based on this CRDT research from the last decade or two.
Adi Narayan 00:21:06 Got it. For the interest of listeners, I’m just going to define a few of the terms that you just mentioned. When you say a sync engine, am I right to understand that its job is making sure that any changes that have been made by a user on their device — say the user was offline for a bit — are synced with either the central repository or server, or with other users, whenever the internet connection is back? Is that accurate?
Martin Kleppmann 00:21:29 Yeah, so you can think of a sync engine as an embedded database plus a replication protocol, if that’s any clearer. An embedded database like SQLite works very nicely, but you have data only on one device. If you have a cloud database like Postgres, then you might have multiple replicas of that, but in that case, you will typically have one primary replica and then several followers. And with a sync engine it’s more of a decentralized model, where there isn’t really one primary copy, but each user on their local device has a local embedded database that stores the data for that user. And then if the user updates it, those updates can be synced to a copy on other collaborators’ devices — perhaps via servers, perhaps peer-to-peer, whatever. And if other users make changes, then respectively they’re synced back to the local user.
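That “embedded database plus replication protocol” idea can be sketched in a few lines — a toy illustration, not a real sync engine or CRDT implementation. Each replica keeps a log of operations, syncing exchanges the operations the other side is missing, and the visible state is a deterministic fold over the merged log, so every replica that has seen the same operations shows the same data.

```javascript
// Toy replica: an "embedded database" (the op log) plus a
// "replication protocol" (exchanging missing ops with a peer).
class Replica {
  constructor(actor) { this.actor = actor; this.ops = new Map(); this.seq = 0; }

  set(key, value) {                      // local edit: works offline
    const op = { id: `${++this.seq}@${this.actor}`, key, value };
    this.ops.set(op.id, op);
  }

  syncWith(other) {                      // swap whatever ops each side lacks
    for (const [id, op] of other.ops) if (!this.ops.has(id)) this.ops.set(id, op);
    for (const [id, op] of this.ops) if (!other.ops.has(id)) other.ops.set(id, op);
  }

  state() {                              // deterministic fold over sorted ops
    const doc = {};
    for (const op of [...this.ops.values()].sort((x, y) => x.id.localeCompare(y.id)))
      doc[op.key] = op.value;            // last writer (in sorted order) wins
    return doc;
  }
}

// Both users edit offline, including a genuine conflict on "color":
const alice = new Replica("alice");
const bob = new Replica("bob");
alice.set("title", "Plan");
bob.set("status", "done");
alice.set("color", "red");
bob.set("color", "blue");
alice.syncWith(bob);                     // no server needed: direct exchange
```

After syncing, both replicas fold the same op set in the same order, so they agree on every key — including the conflicting one — without a primary copy.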
Adi Narayan 00:22:24 So given that each user has data, and when they make changes on their device it is then synced to all the other users — how do you ensure that the size of the data that’s on each device just doesn’t grow exponentially? Because you don’t want all users to have information about all other users’ data, right? How do you ensure that the database on each user’s device is specific to just what that user needs?
Martin Kleppmann 00:22:45 Yes. So, we think Local-first is not for absolutely every type of software, but it’s particularly good for software where the data is basically all created by the user themselves or by their collaborators. And so, in that case, the data you need to sync to a user’s device is just either the files that that user themselves created or the files that a collaborator chose to share with that particular user. And so, the way you define what you sync is through access control: essentially you look at all the documents that that user should be able to read and write — or maybe just read — and those are the ones you sync. On the server, if there’s a sync server, it may have a much larger collection of documents, but you would only sync to each individual user those docs that that user has access to. That may still in some cases be a lot of data, and so in those cases you may still want some mechanism for filtering it down and syncing only subsets, but that’s the core idea of how we’re approaching it, at least in Automerge. When I say documents here, you can think of that as like a file — a document is just the fine-grained element at which you can share. So, you could give someone access to a document or a collection of documents.
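A hypothetical sketch of that access-control-driven filtering — the data shapes are invented for illustration, not Automerge’s actual model. The server holds many documents, but each client’s sync set is just the documents its user is allowed to read:

```javascript
// Server-side collection: every document carries its reader list.
const docs = [
  { id: "d1", readers: ["alice", "bob"] },
  { id: "d2", readers: ["alice"] },
  { id: "d3", readers: ["carol"] },
];

// A user's sync set = only the documents they have access to;
// nothing else ever leaves the server for that device.
const syncSetFor = (user) =>
  docs.filter(d => d.readers.includes(user)).map(d => d.id);
```

So Alice’s device stores d1 and d2, Carol’s stores only d3, and no device grows with the total number of users in the system.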
Adi Narayan 00:23:58 Can you define Automerge what it does and sort of where it’s used?
Martin Kleppmann 00:24:01 Yeah, Automerge — it’s a sync engine, or you can think of it as an embedded database, that you can run in a browser or inside a mobile app, where it captures any changes that you make to the data. The data is represented in a JSON-like data model with a few extensions. If you want to represent, say, a spreadsheet, you will model that spreadsheet as a JSON document. Every time that somebody edits a cell in the spreadsheet, for example, you would then go in and update the corresponding part of that JSON document that represents that cell, and Automerge then persists that change to local storage. In a browser that might be to IndexedDB, for example; if it’s in a native app, that might be just to the file system. It also then syncs that data over the network to any other copies — any other users who have a copy of the same document — and Automerge handles all of the network communication stuff.
Martin Kleppmann 00:24:57 So there’s a network protocol in there that is able to figure out, when a user reconnects, what changes they need to download and upload so that they’re in sync with whoever they’re syncing with right now — and to make that efficient even if you have a large collection of documents. And then, while the user is online, any changes that one user makes get relayed in real time to their collaborators, and while a user is offline, any changes they make just get buffered locally in their local storage and then synced the next time that the user is online.
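That offline-buffering behavior can be sketched as an outbox pattern — a toy model, not Automerge’s actual protocol. Local edits apply to the document immediately (so the UI never waits), queue in an outbox, and the queue is flushed when connectivity returns:

```javascript
// Toy outbox: edits hit the local document instantly and are buffered
// for sync; the network is only touched when we're online.
class LocalFirstStore {
  constructor(send) {
    this.doc = {};          // local embedded state
    this.outbox = [];       // changes not yet delivered
    this.online = false;
    this.send = send;       // how changes reach the server/peers
  }
  edit(key, value) {
    this.doc[key] = value;              // instant local write, no round trip
    this.outbox.push({ key, value });   // buffered for later sync
    if (this.online) this.flush();
  }
  goOnline() { this.online = true; this.flush(); }
  goOffline() { this.online = false; }
  flush() { while (this.outbox.length) this.send(this.outbox.shift()); }
}

// Edit while offline: the document updates, nothing is sent yet.
const sent = [];
const store = new LocalFirstStore(change => sent.push(change));
store.edit("title", "Draft");
const sentWhileOffline = sent.length;   // outbox holds the change
store.goOnline();                       // reconnect: the buffer is flushed
```

A real sync engine would additionally deduplicate, compress, and negotiate which changes each side is missing, but the buffer-then-flush shape is the core.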
Adi Narayan 00:25:29 Can you talk a little bit about how this automatic syncing works? Because there could be conflicting changes. If you take the example of, say, a document: if I make a change when I’m offline and somebody else also makes a change around the same area, is it last-write-wins, like the most recent change wins? How do you decide which changes should be the final version?
Martin Kleppmann 00:25:49 Yeah, so we combine all of the changes together and try to merge them in a way that preserves the intentions of the users as much as possible. There are of course limitations to what is possible. If two users edit the same word in the same sentence, then the best we can do is preserve all of the inserted characters, essentially, and make sure that they appear in the right order. But for example, if you replaced word A with B and I replaced word A with C, then the final document will say BC or CB. So, both of the texts that we inserted will still be there, but the algorithm doesn’t try to do anything clever with, for example, ensuring that the final sentence is grammatically correct or anything like that. It just ensures everybody converges on a consistent outcome. And then if that merged outcome isn’t what you liked, you can still edit the document and fix the bad merge.
Martin Kleppmann 00:26:44 That’s essentially the philosophy that we take here. So, there’s no explicit merge conflict resolution step like there is in Git because we felt that that often leads to like just complex user interfaces. Everybody using Git hates resolving merge conflicts because it’s just so confusing and sometimes mind bending to figure out how to merge these things. So, we figured out that what we would rather do is just merge things automatically and then fix things up after the fact if it isn’t what was desired. But the merge algorithm does ensure that if two users independently merge the same changes then they will end up in exactly the same state. So, they will then have the same document on their screen at the end.
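The convergence guarantee can be illustrated with a toy merge function — not Automerge’s real algorithm. Both users’ insertions are kept, and a deterministic tiebreaker (here the actor ID) fixes their order, so the merged result is the same regardless of the order in which the edits arrive at each replica:

```javascript
// Toy convergent merge: keep every concurrent insertion and order them
// deterministically (by position, then actor id), so replicas that saw
// the edits in different orders still render identical text.
function mergeConcurrent(edits) {
  return [...edits]
    .sort((a, b) => a.pos - b.pos || a.actor.localeCompare(b.actor))
    .map(e => e.text)
    .join(" ");
}

// Both users replaced the same word A with their own word:
const fromAlice = { pos: 0, actor: "alice", text: "B" };
const fromBob = { pos: 0, actor: "bob", text: "C" };

// The edits arrive in opposite orders at the two replicas…
const seenByAlice = mergeConcurrent([fromAlice, fromBob]);
const seenByBob = mergeConcurrent([fromBob, fromAlice]);
// …yet both replicas show the same merged text: both words survive.
```

Neither word wins and neither is lost; the result may not be good prose, which is exactly why a follow-up human edit (or a conflict-review UI) is the intended fix.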
Adi Narayan 00:27:30 Got it. So, it takes a first stab at coming up with like a merged version and if the users find that it’s not what they want it to be, they can go and make additional changes. Is that correct?
Martin Kleppmann 00:27:42 That’s right, yes. What we’d ideally like is some more tooling to help apps surface conflicts to users. When we’ve merged edits that are very close together, for example, it would be nice to show a popup to users saying, hey, we did a merge here that you might want to review because the edits were very close together. The trouble is, how do you define close together? In a text document maybe you define it in terms of how many characters apart the edits are, but in a spreadsheet close together may mean something entirely different. And in a graphics app, close together may mean something else again. So that’s a problem that we haven’t really cracked yet. At the moment we just do the merge and let users review it. We don’t yet have that mechanism for surfacing these potential conflicts to users, but we’d love to have that in the future.
Adi Narayan 00:28:57 Right. In thinking about Automerge, depending on the file type, be it a document or a spreadsheet, it saves the data as JSON and is able to manage the changes that each user makes and resolve those conflicts. For any new file type, is there work required to enable it within Automerge?
Martin Kleppmann 00:29:15 Yes, so you would have to map it onto the data model that Automerge provides, which is this JSON plus a few extensions, like I mentioned. That’s comparable to building an app on top of a relational database, where you have to figure out how to model your data in terms of tables and rows. So, there’s a similar kind of data modeling process that happens. The reason we chose JSON is that we found that many client-side apps, frontend apps, naturally represent their data in a kind of JSON form. If you’re using a user interface framework like React, for example, that kind of data model is almost assumed; it’s really part of the way these frontend frameworks work. And JSON is just something that all developers are familiar with. Everyone can imagine what JSON looks like, and so it’s a very flexible format.
Martin Kleppmann 00:30:10 It allows us to represent ordering, for example, in a way that is harder with a relational model. So, if you have a to-do list in which you want your to-do items to appear in a particular order, well, in JSON that’s really straightforward. You just make an array. In a relational model, you would have to have some sort of numbering of the rows to tell you in which order they belong. And then as rows get moved up and down, you have to re-number them carefully, and that gets a bit messy. So, the JSON data model seems to work quite well here, but it does assume that you are building your app on top of the sync engine. It’s a little bit hard to retrofit this to an existing app that doesn’t already have this kind of data model layer at its core.
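The contrast Martin draws here can be sketched in a few lines. This is a hypothetical example (the data and the `sortOrder` scheme are made up for illustration): reordering a JSON array is a simple splice, while reordering relational rows means maintaining an explicit sort column, which eventually degenerates into careful renumbering.

```typescript
// JSON model: order is implicit in the array. Moving an item is a splice.
const todos = ["buy milk", "write report", "call dentist"];

function moveItem<T>(list: T[], from: number, to: number): T[] {
  const copy = [...list];
  const [item] = copy.splice(from, 1); // remove from old position
  copy.splice(to, 0, item);            // insert at new position
  return copy;
}

const reordered = moveItem(todos, 2, 0); // "call dentist" moves to the front

// Relational model: each row carries a sortOrder column that the app must
// maintain by hand (a common but hypothetical scheme, not a specific DB API).
interface Row { id: number; title: string; sortOrder: number; }
const rows: Row[] = [
  { id: 1, title: "buy milk", sortOrder: 10 },
  { id: 2, title: "write report", sortOrder: 20 },
  { id: 3, title: "call dentist", sortOrder: 30 },
];

// Same move: give the row a key smaller than the current first row's key.
// Repeated moves shrink the gaps until a global renumbering is needed.
const first = [...rows].sort((a, b) => a.sortOrder - b.sortOrder)[0];
rows[2].sortOrder = first.sortOrder / 2; // 5, so "call dentist" sorts first
const rowOrder = [...rows]
  .sort((a, b) => a.sortOrder - b.sortOrder)
  .map((r) => r.title);

console.log(reordered[0], rowOrder[0]); // "call dentist" "call dentist"
```

Both models express the same reordering, but the array version carries no bookkeeping state, which is part of why a JSON document maps so naturally onto frontend app state.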
Adi Narayan 00:30:56 And when you say it assumes that you build your app on top of the sync engine, I’m understanding that to mean that you are really conscious about how the syncing works. Like you’re saying, when it comes to cases where there are conflicts, the way the sync engine handles the conflict is very specific, and the app needs to be cognizant of that and give the user the ability to maybe go back to an older revision or make changes and so forth. Is that correct?
Martin Kleppmann 00:30:55 Well, it doesn’t need to be that complex actually. So, the app doesn’t really have to worry about the fact that sync is occurring. It can just take the state that’s represented in this local database and render it. And we’ve had some success making single-user apps multi-user by putting them on top of Automerge. For example, tldraw is a little open-source, browser-based vector graphics tool, and we were able to fairly easily put that on top of Automerge by just taking tldraw’s internal data model and mapping it onto an Automerge document, as it uses a JSON-like model. So that fit well, and that way we were able to make a collaborative tldraw quite easily.
Adi Narayan 00:31:38 And what were some of the challenges that you encountered while building Automerge?
Martin Kleppmann 00:31:43 The biggest challenge has really been performance and memory usage over the years, and we’ve put a lot of engineering work into that, because of the approach Automerge takes: you can think of it like a version control system such as Git. Every time you edit the data in any way, we capture that. The difference to Git is that Automerge is designed for very fine-grained changes, because we want to be able to use it for real-time collaboration, and people are using it for real-time collaboration, Google Docs style, which means that potentially every single keystroke becomes a commit. Now, if you used Git this way, creating a separate commit for every keystroke, it would be ridiculous and extremely inefficient. But we put a huge amount of work into the internal data structures of Automerge so that we could have a Git-like model but at the same time represent this very fine-grained editing history. And like Git, Automerge also retains the full editing history.
Martin Kleppmann 00:33:05 So that means you can look back at arbitrary past versions of a document, you can do a diff of what your colleague changed last week while you were on holiday, any of those kinds of things that we are very familiar with from Git, but people don’t really have those kinds of version control capabilities inside other apps. So, part of the goal of Automerge, besides providing this tool that supports real-time collaboration, is to try and bring these version control concepts to other types of apps. But the challenge there has really been in maintaining all that editing history efficiently. I think we have some pretty good solutions for that now, but it has taken several years to get there.
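The idea of retaining the full fine-grained history and materializing any past version can be sketched as a naive event log. This hypothetical structure is far simpler than Automerge’s compressed internal representation (whose efficiency is exactly the hard engineering problem Martin mentions), but it shows why keeping every change makes "what did the document look like last week?" answerable.

```typescript
// Toy sketch: every fine-grained edit is retained as a change record, and
// any past version of the document can be recomputed by replaying a prefix.
// (Hypothetical shape; real Automerge stores this far more compactly.)

type Change =
  | { op: "insert"; pos: number; text: string }
  | { op: "delete"; pos: number; len: number };

function apply(doc: string, c: Change): string {
  return c.op === "insert"
    ? doc.slice(0, c.pos) + c.text + doc.slice(c.pos)
    : doc.slice(0, c.pos) + doc.slice(c.pos + c.len);
}

// Materialize the document as of version n by replaying the first n changes.
function versionAt(history: Change[], n: number): string {
  return history.slice(0, n).reduce(apply, "");
}

const history: Change[] = [
  { op: "insert", pos: 0, text: "hello world" },
  { op: "delete", pos: 0, len: 5 },       // remove "hello"
  { op: "insert", pos: 0, text: "goodbye" },
];

console.log(versionAt(history, 1)); // "hello world"
console.log(versionAt(history, 3)); // "goodbye world"
```

A per-keystroke log like this grows without bound and replays from scratch, which is exactly why Automerge needed years of work on compact data structures to make the same capability practical.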
Adi Narayan 00:33:43 And what kind of usage are we talking about in terms of how many users use the library right now? Is it used in production systems? What’s the scale like?
Martin Kleppmann 00:33:54 We don’t really count users in the traditional sense because it’s not a cloud service. It’s like asking how many users Postgres has. Well, I don’t know.
Adi Narayan 00:34:01 Yeah, maybe NPM downloads is a good metric.
Martin Kleppmann 00:34:04 So apparently there are 14,000 weekly downloads from NPM at the moment. So, it is being used quite a bit, I would say. It’s obviously still a comparatively new technology, but people are using it for serious stuff. The number of users doesn’t per se make a huge difference. One of the design goals of Automerge is, at least eventually, to be able to support Wikipedia-scale collaboration. There’s no current plan for Wikipedia to adopt Automerge, but if they wanted to, it should be possible. And that’s a very large collection of documents with a large number of collaborators. The challenges that arise with a really large number of collaborators are more around permissions and the human coordination aspects. If you have a document that’s publicly available on the internet for anybody to edit, then you’ll inevitably have problems with vandalism and people just coming in and defacing it or adding rude content to the document.
Martin Kleppmann 00:35:03 That’s not something that Automerge per se can solve. Automerge is just a sync engine; it’s not a moderation system designed to solve those types of problems. But from an Automerge point of view, the number of users that are collaborating doesn’t really make a big difference. Automerge itself just assumes there’s some way for the users to get access to the right documents. We are working on a cryptographic access control system called Keyhive, which is designed to go alongside Automerge. It’s not quite ready yet; it has been in development for the last year or so. And the idea is to have a way of having, say, Google Drive-style permissions. So, you could grant somebody access to a folder full of documents, for example, and then those individual documents become accessible to the user who’s been granted access.
Martin Kleppmann 00:35:54 And you need to be able to revoke access again, of course, and to be able to have teams of users and grant everybody within a team access, those types of things. So, doing all of that, but without relying on a single trusted server. All of these access control things are really easy if you just have a single authoritative server that can decide who gets to access what. But in a decentralized case, where it’s potentially peer-to-peer, that becomes a lot harder, because you need cryptographic keys in order to identify the users, or the users’ devices, at all. So, with Keyhive we create a separate cryptographic key pair for every device, and then we have an access control delegation system, essentially, where the creator of some document can delegate access to different keys or different groups of keys and thus grant them access.
Martin Kleppmann 00:36:44 And then there’s a permission checking system which is able to, when you get an edit from somebody, for example, figure out whether that edit was actually allowed, whether it was made by an authorized user. And that can all be done without assuming a central server. And then finally, Keyhive actually enables end-to-end encryption as well, because we have these cryptographic keys anyway; we can use them to encrypt the data on any sync servers. So, the sync servers can’t actually see the content of the documents at all; only the end users on their devices can decrypt the documents, in the same way as Signal and WhatsApp do. We’re essentially trying to bring that kind of end-to-end encryption to file editing as well.
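The delegation-based permission check Martin describes can be sketched as a graph walk. This is a hypothetical structure in the spirit of what he outlines, not the actual Keyhive design (which involves real key pairs and signatures rather than the string IDs used here): an edit is authorized if its signing key chains back to the document creator through a series of delegations, with no central server consulted.

```typescript
// Toy sketch of decentralized access control via delegation chains.
// (Hypothetical model: real Keyhive uses cryptographic key pairs and
// signed delegations; here keys are just string IDs for illustration.)

type Delegations = Map<string, string[]>; // key -> keys it has delegated to

// An editor is authorized if there is a delegation path from the document
// creator's key to the editor's key. Plain graph reachability, no server.
function isAuthorized(
  creator: string,
  delegations: Delegations,
  editor: string
): boolean {
  const seen = new Set<string>();
  const frontier = [creator];
  while (frontier.length > 0) {
    const key = frontier.pop()!;
    if (key === editor) return true;
    if (seen.has(key)) continue;
    seen.add(key);
    frontier.push(...(delegations.get(key) ?? []));
  }
  return false;
}

const delegations: Delegations = new Map([
  ["creator-key", ["alice-laptop", "alice-phone"]], // creator grants Alice's devices
  ["alice-laptop", ["bob-laptop"]],                 // Alice grants Bob access
]);

console.log(isAuthorized("creator-key", delegations, "bob-laptop"));  // true
console.log(isAuthorized("creator-key", delegations, "mallory-key")); // false
```

Revocation, groups, and verifying that each delegation was genuinely signed by its issuer are the hard parts the real system has to solve on top of this basic reachability idea.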
Adi Narayan 00:37:24 Great. That sounds really interesting. In the interest of time, I think we’ve spoken a lot about Automerge, and it was really interesting to understand how it works. One point that you mentioned earlier, I think in passing, was that Local-first is not for all kinds of software. There are certain kinds of software where it’s better suited than others. Where do those boundaries lie? What is good for Local-first, and what is really not?
Martin Kleppmann 00:37:44 Yeah, I think the apps where Local-first works really well are the apps where the user can edit the data in whatever way they like. That’s the case with a text editor or spreadsheet or a graphics app: the user is free to change the data however they like. With a bank account, say, this is not really the case. With a bank account I could edit the account balance and add an extra zero at the end, but the bank is not going to pay any attention to my edit because it’s not meaningful. So, for those types of apps, if it’s managing some real-world resource like money, or say inventory in a warehouse, I suspect that the traditional cloud model is still more appropriate.
Martin Kleppmann 00:38:24 And then furthermore, there are apps with large data sets. Say it’s an e-commerce website: in principle you could make that Local-first, but that would mean syncing the entire product catalog to every user’s local device. And that’s probably not going to make sense, because the user is only going to be interested in a tiny fraction of the product catalog. They’re not going to read every single product description page. So, in that case, I think the traditional web approach is just fine: you have an e-commerce website, you just go to the product pages for those products that you’re interested in, you download those individual pages, and you don’t download the entire catalog. This pre-downloading of everything is useful if it’s data that the user is likely to want to edit or access offline. But in the case of a large product catalog, that doesn’t really make sense. So, I think we want to be sensible about where it applies and where it does not. Those are the heuristics that I use.
Adi Narayan 00:39:22 To summarize: for software that requires a single source of truth, be it a financial thing or stock exchange software, something like that, Local-first will not be applicable. But for things like, say, note-taking software or some sort of a charting app, anything where users are entering their own data and manipulating it, it makes more sense.
Martin Kleppmann 00:39:44 Exactly.
Adi Narayan 00:39:46 Now from a developer point of view, in order to build Local-first applications, how much do they need to get into the weeds of the sync engine and CRDTs and all of that? Is there a complexity tax for designing something that’s Local-first, as opposed to just building a very quick app, setting up a cloud database somewhere and a REST API so a client-side app can quickly access it? Is it harder to build something Local-first?
Martin Kleppmann 00:40:15 So our hope is that it’s actually easier, not harder. All of these CRDT details are something that app developers should hopefully never have to worry about. In fact, I think the Automerge website barely even mentions the word CRDT, because we think that’s just not something users or app developers should have to think about. The only real challenge is that it’s a different model. If you are used to the way of building software for the cloud, and you have your tools and workflows and habits that you’ve built around that kind of software architecture, then it requires learning a new model. And even though that model might be simpler, there is just a bit of a switching cost in learning something new. So that’s what I see as the main challenge to adoption here.
Adi Narayan 00:41:03 And for listeners, if somebody wants to experiment with it, how do you recommend they get started? Is there a popular example that folks like to use?
Martin Kleppmann 00:41:14 So if you want to get started with Automerge specifically, there’s documentation on the Automerge website that is hopefully fine for getting started. There’s also a friendly Discord community where you can get questions answered. If you’re interested in Local-first more broadly, there’s quite a nice website put together by Johannes Schickling from the Local-first podcast. This is the website localfirst.fm, and if you click on landscape in the top menu, it has essentially a catalog of technologies for building Local-first software, and Automerge is included there along with a whole bunch of other tools. Some are open-source projects, some are startups, and they don’t all have the same design approach. Some prioritize having a central server and are happy not supporting peer-to-peer, for example, whereas others want to be able to support peer-to-peer operation. And there are various other trade-offs that they make: some, for example, retain version history so you can do version control, while others are focused more on real-time collaboration and don’t retain version history, and so on. As usual with these systems, there’s no one right answer for all applications, but hopefully this Local-first landscape can help you figure out what tool would be appropriate for your particular application.
Adi Narayan 00:42:33 One question that I wanted to ask earlier and missed: we’ve been spoiled by big cloud providers such as Google into expecting good search functionality to always be there. You don’t realize how good it is, but you’re able to search for one little keyword across millions of documents and it comes back in less than a second. If your database is on the device, is the developer expected to invest time to build good search functionality to match what they see in the cloud?
Martin Kleppmann 00:42:58 Yes, so with Local-first, if you want the search to work offline, then of course your search function will have to be locally on the client as well. I think there are some nice apps that demonstrate how well that can be done. I use Obsidian as my note-taking app, for example, and it has great search functionality. It’s super-fast. I have a pretty large collection of notes accumulated over the years, and I’m able to find things within a keystroke. So, it’ll require a little bit of investment, but server-side search functionality requires investment as well. And in fact, scaling a server-side search is probably harder than making a client-side search engine. There are plenty of existing open-source search engines that you can use. At the moment we don’t yet have a nice integration of Automerge with search engines, so that’s maybe something we could invest in in the future. For now, if you’re using a sync engine like Automerge, then as updated documents come in over the network, you would need to update the local search index as well. That way you would be able to provide really nice search locally too.
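The pattern Martin describes, updating a local index as synced documents arrive, can be sketched with a small inverted index. The class name and the `onDocumentSynced` hook are hypothetical (there is no such Automerge integration yet, as he notes); the point is that each incoming update removes the document’s stale postings and adds fresh ones, so offline search stays consistent with the synced state.

```typescript
// Toy sketch of keeping a client-side search index in step with a sync
// engine. (Hypothetical API: onDocumentSynced would be wired to whatever
// change notification the sync engine provides.)

class LocalSearchIndex {
  private index = new Map<string, Set<string>>(); // term -> document IDs
  private docs = new Map<string, string>();       // last indexed text per doc

  // Call whenever the sync engine delivers a new or updated document.
  onDocumentSynced(docId: string, text: string): void {
    // Remove stale postings from the previous version of this document.
    const old = this.docs.get(docId);
    if (old !== undefined) {
      for (const term of this.tokenize(old)) this.index.get(term)?.delete(docId);
    }
    this.docs.set(docId, text);
    for (const term of this.tokenize(text)) {
      if (!this.index.has(term)) this.index.set(term, new Set());
      this.index.get(term)!.add(docId);
    }
  }

  search(term: string): string[] {
    return [...(this.index.get(term.toLowerCase()) ?? [])];
  }

  private tokenize(text: string): string[] {
    return text.toLowerCase().split(/\W+/).filter((t) => t.length > 0);
  }
}

const idx = new LocalSearchIndex();
idx.onDocumentSynced("doc1", "Grocery list: milk and eggs");
idx.onDocumentSynced("doc2", "Meeting notes about milk pricing");
idx.onDocumentSynced("doc1", "Grocery list: bread and eggs"); // edit synced in
console.log(idx.search("milk")); // ["doc2"] — doc1 no longer matches
```

Production apps would typically reach for an existing client-side search library rather than a hand-rolled index, but the update-on-sync shape is the same.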
Adi Narayan 00:44:10 Understood. This may be a very simple question, but like if all of the data is stored on a local device, let’s say it’s a phone and if I lose my phone and if I haven’t synced to the server in a while, or if there wasn’t a server or if it was all peer to peer, what happens?
Martin Kleppmann 00:44:26 Well, if you’ve not synced anywhere, then the data will be gone. So, part of the reason for having sync servers is also to provide a backup mechanism. We recognize that that’s a useful function that sync servers provide besides just getting the bytes from one device to another. The backup functionality is a useful thing, because obviously people do lose devices. If you’re in a peer-to-peer setting, you would probably want something like a backup peer. Whether you call it a server or a peer that happens to be online most of the time, it’s kind of the same thing. But it’s just inherent: if you haven’t copied the data anywhere off the device and you lose the device, then of course it’s gone. The fact that you have the local copy doesn’t mean you can’t have a cloud backup as well. Cloud backups are perfectly fine, and we encourage that too.
Adi Narayan 00:45:17 But I think the distinction that you draw is the cloud backup is just like a dumb backup, right? Like just an S3 backup. You just put the data there, you’re not relying on the cloud to do any collaboration or to search it or anything. You’re just keeping it there as a way to store it.
Martin Kleppmann 00:45:30 Exactly, yeah, it’s just very straightforward storage.
Adi Narayan 00:45:33 Understood, understood. Switching gears a little bit to the compulsory AI question: with the advent of AI coding tools and chatbots, is there a change in the adoption of Local-first? Does it make more sense to have a local database? Does it help with running LLMs on your device, where they have their own sort of memory to bank on? How has AI changed Local-first adoption?
Martin Kleppmann 00:45:59 Yeah, the two seem to go together very nicely actually. Partly it’s that, since Local-first enables a simpler application architecture, because you’re basically only building the front end and you don’t need to build the backend, it becomes easier to actually vibe code an entire app and have it work, because there are just fewer moving parts that need to be coordinated. So, I think AI-assisted software development is friendly to Local-first because it’s this more self-contained programming model. But another aspect is that with AI, essentially, we can treat human-to-AI collaboration as just another instance of human-to-human collaboration. And Local-first is already all about collaboration. When we started the Local-first thing, AI wasn’t big yet, so we were assuming human-to-human collaboration, but we’ve had really good success applying the same principles to human-to-AI collaboration.
Martin Kleppmann 00:47:00 So one example of that would be: you want to have an AI agent rephrase a text document that you’ve written, but then you probably want to review the edits that the AI has made and check that you agree with them. You don’t want those edits just immediately going in. The way we do that is to use the same kind of version control and change tracking facilities as we do for human-to-human collaboration. Essentially, the AI makes a pull request against your document, and you get to see a diff and review every change that was made by the AI and accept or reject it. This wasn’t planned originally, but it seems to have worked out very nicely in the age of AI.
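The pull-request workflow Martin describes can be sketched in miniature. This is a hypothetical shape, not Automerge’s actual change-tracking API: the AI proposes a set of edits, the user’s review callback accepts or rejects each one, and only accepted edits are applied to the document.

```typescript
// Toy sketch of reviewing AI edits like a pull request (hypothetical
// structure, not a real Automerge API): proposed edits are held separately
// from the document until the user accepts them.

interface ProposedEdit {
  find: string;    // text the AI wants to replace
  replace: string; // the AI's suggested rephrasing
}

// Apply only the proposals that the reviewer accepts.
function review(
  doc: string,
  proposals: ProposedEdit[],
  accept: (e: ProposedEdit) => boolean
): string {
  let result = doc;
  for (const edit of proposals) {
    if (accept(edit)) result = result.replace(edit.find, edit.replace);
  }
  return result;
}

const draft = "The results was good. We done the experiment twice.";
const aiProposals: ProposedEdit[] = [
  { find: "results was", replace: "results were" },
  { find: "We done", replace: "We performed" },
];

// The user reviews the diff and rejects the second suggestion.
const reviewed = review(draft, aiProposals, (e) => e.find === "results was");
console.log(reviewed); // "The results were good. We done the experiment twice."
```

In a real system the proposals would be CRDT changes with full positional and causal metadata, so the same merge machinery used for human collaborators handles the agent’s branch.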
Adi Narayan 00:47:43 That makes a lot of sense, and it’s really interesting, because especially when you’re using agents, it iterates so fast and makes so many changes that the idea of looking back at the revision history doesn’t really come up. It’s more like I gave it something and it ended up with something else. And especially if you go in and make more edits, having that version history and seeing what those changes were is very interesting. Are there any apps or any groups that are using it, or is it still in the experimental stage?
Martin Kleppmann 00:48:09 Yeah, there are a bunch of production apps using it. There’s a whole list of products that describe themselves as Local-first, which I’ve put together. I wouldn’t be able to name them all off the top of my head because there are literally dozens of products on there. So, it’s really catching on as an industry trend, I would say.
Adi Narayan 00:48:29 Given that with any Local-first app there’s going to be a lot of data stored on the device itself, has anyone tried to use that data as sort of long-term memory for AI agents or for LLMs that are running on the device itself, without even going to the cloud? Like running a local instance of, say, Llama?
Martin Kleppmann 00:48:46 Yeah, people are experimenting with running models locally on device, and I think that’s a promising direction. Right now the biggest, most powerful, and best models are still cloud-based, but there seem to be enough open-source models that are good enough for a whole bunch of use cases that you can have software with AI-based facilities working offline, for example by using a local model, and Local-first plays into this very directly.
Adi Narayan 00:49:15 A few last things. If you had to pick one benefit that will finally push Local-first into the mainstream, what would it be? Would it be speed, privacy or the fact that the app won’t die, or the data won’t die when the company does?
Martin Kleppmann 00:49:28 I would say different benefits matter to different people, so I wouldn’t say there’s one thing that applies for everybody. For me personally, the one that I care about most is not being dependent on individual providers. I’m very worried about, say, a provider locking you out because some automated system decided that you were violating the terms of service. And if you get locked out of your Google account, for example, you have basically no recourse, because good luck getting ahold of an actual human being at Google. So, I think this puts you in a very precarious situation with cloud software. For me personally, this knowledge that there’s a local copy of the data that nobody can take away from me, that’s what matters most. But for other people, maybe the developer experience matters more, or the user experience benefits of having software that’s really fast. So, I wouldn’t say it’s just one single aspect.
Adi Narayan 00:50:26 That is very interesting. It’s one of those things that you almost take for granted, that yes, Gmail will always be present and you will always have access to it, but you never know what forces are out there. There could be a court ruling against you or anything of that sort, which might just suddenly block your access to all these things. Which brings up another question. We’ve talked about the technical aspects. Do you think we’ll see changes in the legal system, regulations requiring that certain types of data, like medical or financial records, should be Local-first for privacy reasons?
Martin Kleppmann 00:50:56 Well, there’s already a lot of regulation that requires medical data to be stored within the country. Many countries have that sort of regulation already. It doesn’t necessarily imply locally on device, but it is locality within the country, so there’s a similarity there. I’m less concerned about regulation, but I am a bit concerned about the wider geopolitical risks. As we’ve seen in the last few months, for example, there have been tensions between Europe and the US about Greenland, and this has made me quite worried about what happens if these tensions escalate. Europe is currently highly dependent on US cloud services, and what if there’s suddenly an executive order that limits European companies’ access to US cloud services? That could happen overnight. I think the chances of that happening are not very great, and I really hope that it doesn’t happen.
Martin Kleppmann 00:51:53 But a year ago it would’ve been unthinkable, and now it’s unfortunately no longer unthinkable. So, I do believe that Local-first has a role to play in getting our systems away from this dependency on a single server hosted by a single company in a single country, and more towards a decentralized model where you could have multiple servers, hosted by different companies, in different countries. If one of them goes away, it’s not a big deal because the others are still there. So, I’d just like to move to this slightly more relaxed model where decentralization actually guards against some of these geopolitical risks.
Adi Narayan 00:52:36 Yeah, I think it’s a great place to leave it, and it’s also the way the Local-first movement has developed: you start small, you start thinking about your app or whatever you’re building, and if some data doesn’t need to be in the cloud, you don’t put it in the cloud. If you can manage by keeping it locally, go ahead and keep it locally, and deal with the problem of syncing it between users as a separate thing. This whole default state where everything starts off in the cloud has all these risks, as you pointed out. If folks want to know more about your work, where should they go?
Martin Kleppmann 00:53:10 Yeah, if they’re interested in Automerge as a sync engine, Automerge.org is the website for that. If they’re interested in Local-first more generally, I would suggest reading our 2019 essay on Local-first software, which is on the Ink and Switch website. If you’re interested in the various Local-first tools, go to localfirst.fm and click the landscape link in the top menu; that has an overview of a whole bunch of different products and systems. And if you’re interested in academic research, I have a personal website with a bunch of publications on it as well.
Adi Narayan 00:53:57 And a new edition of your very popular book is coming out sometime this year, right?
Martin Kleppmann 00:53:52 Yes, exactly. So, the eBook should be out within a week of this recording actually, so by the time you listen to this, it should already be out, and the print edition comes out in March 2026.
Adi Narayan 00:54:04 That book has really been instrumental, definitely for my career, because a lot of my learning about data engineering and distributed data applications comes from it. It became like a foundational text for us. So, it’s a real pleasure to have you on the podcast, and thank you so much for taking the time today to talk about Local-first and how folks can get into it.
Martin Kleppmann 00:54:25 Thank you. Thanks for the good questions and yes, it’s a pleasure talking to you.
Adi Narayan 00:54:28 Thanks Martin.
[End of Audio]