Alex P Profile Banner
Alex P Profile
Alex P

@ifesdjeen

Followers
13,211
Following
1,070
Media
490
Statuses
16,576

Distributed and Storage Systems. Apache Cassandra Committer and PMC member. Author of Database Internals @therealdatabass . Discord:

Munich, Germany
Joined July 2009
Don't wanna be here? Send us removal request.
Explore trending content on Musk Viewer
@ifesdjeen
Alex P
6 years
Ok, let’s try that: 1 like = 1 Distributed Systems paper. Let’s go!
11
176
984
@ifesdjeen
Alex P
6 years
Look what I’ve got here! A Database Internals book. Just imagine that it’s a bass. Data bass.
Tweet media one
39
134
675
@ifesdjeen
Alex P
7 months
When I wrote Database Internals back in ~2018, my main goal was to make the field more approachable and less intimidating. Everyone should be welcome to enter, there's so much work to do here! Over the last year, so many people have read it. I've seen at least 3 book reading
8
72
579
@ifesdjeen
Alex P
5 years
🙀
29
105
574
@ifesdjeen
Alex P
5 years
Finally, entire Database Internals book contents are available on O’Reilly Early Access program! Still raw and unedited, we are working hard on a final release. You can have a sneak peek here:
13
122
524
@ifesdjeen
Alex P
5 years
Many people asked, so here’s a thread. Books about database management systems and related concepts! Many concentrate on relational dbms.
8
146
522
@ifesdjeen
Alex P
3 years
Every cent made from Database Internals @therealdatabass for now is going to be used to support Ukrainians! Ukraine has given me education and enabled me to write this book. I owe this to them.
8
86
387
@ifesdjeen
Alex P
3 years
We’re reunited and I’m extremely happy to have them with us. But this is not over: under the circumstances, we are privileged. We live in the EU, have picked them up, and they have food and shelter. But most people running - don’t. Please keep donating and volunteering.
Tweet media one
@ifesdjeen
Alex P
3 years
They’ve crossed the Polish border! Meanwhile, a missile has exploded right next to the train station where they’ve started their journey yesterday evening.
6
0
75
18
5
374
@ifesdjeen
Alex P
2 years
In case you don’t know this yet, CMU database group YouTube channel is simply amazing. Watching it regularly. Very much thought provoking, constantly getting a stream of ideas when listening.
5
47
338
@ifesdjeen
Alex P
5 months
Database Internals is nearing 26K sold copies! Every month, I get a slip with this little creature on it and add up the tally. Oftentimes, I can't believe that so many people have it on their desks. Thank you, everyone, for spreading the word!
Tweet media one
11
21
308
@ifesdjeen
Alex P
5 years
Looks like three chapters from Storage Engines part of @therealdatabass were released on O'Reilly EAP: In-Place Update Storage, Binary File Formats and Implementing B-Trees. We'll keep releasing chapters if you keep spreading news about the book!
5
133
299
@ifesdjeen
Alex P
6 years
If you haven’t yet read my article on LSM vs B-Tree storage, you have no more excuses. It’s now also publicly available on ACM. Algorithms powering modern storage systems:
2
96
290
@ifesdjeen
Alex P
11 years
Sums up my thoughts on talking about Functional Programming with some people: http://t.co/FGNyfSbvXb
Tweet media one
15
559
260
@ifesdjeen
Alex P
11 months
If there’d been such a thing like Database Internals II, what would you like to see in it? I’ll start: code. Def not production-grade, deliberately simplified, but technically equivalent to what you’d see in a real database, with clear cut abstractions even if they hurt perf.
35
20
264
@ifesdjeen
Alex P
5 years
Big thank you 🙏 to everyone who got a copy of Database Internals @therealdatabass . It's now ranked #5 ,534 out of over 8,000,000 books, and is named a best-seller in Management Information Systems on Amazon \o/
Tweet media one
20
20
260
@ifesdjeen
Alex P
2 years
This year, I'll be posting Database Papers Advent Calendar, a collection of curious papers from 2021 and 2022, to my Mastodon account, and replicate to Twitter with some delay to maybe inspire folks to migrate over:
4
43
223
@ifesdjeen
Alex P
7 months
I still take this as a compliment! If someone with quite a bit of experience in databases has learned something new from it, I’m very happy. As regards shortcomings, if I knew what I know now back in 2019, I could’ve done (also, written and edited) better.
@eatonphil
Phil Eaton
7 months
My review of Database Internals. A solid guide to many challenging topics, from an interesting perspective. Would recommend, and I look forward to a potential second edition!
Tweet media one
11
25
403
5
9
211
@ifesdjeen
Alex P
7 years
Continuing the Disk IO Series, the third post on Mutable/Immutable Data Structures and LSM Trees is out!
2
85
203
@ifesdjeen
Alex P
10 months
Crazy idea: a “behind Database Internals” reading group. Like database internals reading group but instead we’re going to read the most notable references. Probably will take us about a year (reading all will def take us way over a year). Yay or nay?
38
3
203
@ifesdjeen
Alex P
7 years
A very complete, deep analysis of Modern B-Tree Techniques, a very nice paper:
1
53
193
@ifesdjeen
Alex P
4 years
Database Internals Chinese translation is now available! Big thanks to @dxhuang for proof-reading it and finding a copy for my library \o/
Tweet media one
Tweet media two
Tweet media three
12
27
179
@ifesdjeen
Alex P
6 years
ACM Queue with my article on Algorithms Behind Modern Storage Systems is now out!
9
46
184
@ifesdjeen
Alex P
3 years
Mom and dad were able to hop on the train to the central station. Next step: evacuation train.
8
0
175
@ifesdjeen
Alex P
5 years
It's never too late for a year summary post! New blog post about some of the papers I've read and enjoyed lat year. Database Research in 2019: The Year in Review
2
54
161
@ifesdjeen
Alex P
6 years
I'm working on the new article series on database systems, this time concerning Distributed System concepts. First article discusses Links, Two Generals Problem and FLP Impossibility:
1
77
154
@ifesdjeen
Alex P
7 years
If you liked Modern B-Tree techniques paper, you might like this page, where you can find many more nice papers!
2
39
148
@ifesdjeen
Alex P
7 days
So cool, Bf-Tree paper author joined Database Internals discord for a discussion about the paper!
Tweet media one
1
7
145
@ifesdjeen
Alex P
7 months
In the database book club, we just have finished the first chapter of Gray/Reuter’s Transaction Processing, and @MilanLoveless have started a repo with notes: If you want to join, just jump on discord, we’re just starting:
1
27
141
@ifesdjeen
Alex P
2 months
Somehow I keep reading “disturbed systems” instead of “distributed systems”.
19
18
140
@ifesdjeen
Alex P
5 years
If there was a combination of words that I was looking forward to it was this one: “Your book went to print yesterday”. Expect ebook next week, folks!
11
6
137
@ifesdjeen
Alex P
7 years
On Disk IO, Part 5: Access Patterns in LSM Trees on SSDs, HDDs and more, is now out!
0
63
136
@ifesdjeen
Alex P
4 years
Check out Harry, a fuzz-testing tool for Apache Cassandra, the best way to test databases or storage engines you've ever used:
4
46
135
@ifesdjeen
Alex P
6 years
Looks like my book on Database Internals is startling looking like a book! Two parts, 12 chapters, 93 headings, 160 pages, 46k words, 300k symbols and counting!
7
16
133
@ifesdjeen
Alex P
5 years
Finally, Database Internals book @therealdatabass is available in a print version on Amazon! It’s been going pretty quick for the last couple of weeks, but you can get one now:
3
29
125
@ifesdjeen
Alex P
8 months
I would like to try something new in 2024. If you have always wanted to read Transaction Processing by Gray/Reuter, but found the tome too thick to tackle without a good company, join us on the journey to read through it and learn new things together:
3
16
113
@ifesdjeen
Alex P
6 years
Second part of Ways to Agree, Path to Atomic Broadcast, is out! Featuring Shared Memory, Linearizability, Two/Three Phase Commit, Leader Election and Broadcast:
0
47
115
@ifesdjeen
Alex P
2 months
At database internals book club, an overwhelming majority has voted for @martinkl DDIA book! First meetup has commenced today. Join in if you want to read a great book in a good company!
@ifesdjeen
Alex P
2 months
With Gray’s transaction processing book done, folks in Database Internals discord are voting for the next iteration of the book club. Guess which one is in the lead? Also, join if you want to participate.
Tweet media one
1
13
110
4
10
111
@ifesdjeen
Alex P
2 months
With Gray’s transaction processing book done, folks in Database Internals discord are voting for the next iteration of the book club. Guess which one is in the lead? Also, join if you want to participate.
Tweet media one
1
13
110
@ifesdjeen
Alex P
5 years
So we’re at 309 pages, folks! Content seems to be finished now, with addition of several awesome papers. Now to editing again, and releasing the early access version \o/
4
9
108
@ifesdjeen
Alex P
5 years
database-internals-1.0.0-rc1.pdf
2
5
107
@ifesdjeen
Alex P
5 months
One of my favourite youtubers, @asianometry , has published a video on history of SQL and relational databases. Featuring a fish pun, alas no mention of data bass!
0
6
104
@ifesdjeen
Alex P
3 years
Called mom and dad. They went to a supermarket to get some groceries. Asked them to call me back as soon as they’re back.
4
1
105
@ifesdjeen
Alex P
6 years
Amazing to read a paper from 1984 describing today's cutting edge database techniques.
5
17
97
@ifesdjeen
Alex P
7 years
Got my copy of Distributed Systems book. They’re giving out 3rd edition ebook for free!
2
41
102
@ifesdjeen
Alex P
5 years
Today I've read a paper that I have (kind of) started my book research from back in 2018: Database Architecture by Hellerstein and Stonebraker. I'm looking at it in a completely different way, with a new understanding and insight. Really hope that reading it does the same to you.
4
8
98
@ifesdjeen
Alex P
4 years
Database Internals @therealdatabass is on sale on US Amazon for half the price! Check it out if you haven’t; holidays are the best time to learn about new stuff!
0
27
96
@ifesdjeen
Alex P
5 years
Amazing news: entire (unedited) book contents are going to be available on O’Reilly EAP very soon, maybe even this week!
3
6
95
@ifesdjeen
Alex P
4 years
Fuzz testing is an absolute must for any database (or any sophisticated system/program whatsoever). Trying to compose the edge-cases manually is tedious, hard to review, and is likely to still live a lot uncovered. Randomised tests are easier to develop, maintain and run.
5
21
95
@ifesdjeen
Alex P
3 years
Rumor is there will be a special edition of Database Internals @therealdatabass !
2
3
95
@ifesdjeen
Alex P
9 months
Picked up some copies of Database Internals! Stop by my talk on Cassandra Summit on Transactional Cluster Metadata today, I’ll be giving them out!
Tweet media one
1
5
93
@ifesdjeen
Alex P
6 months
Not only Gray has described simulations and chaos engineering, but also event sourcing, in a short off-the-cuff remark. Wondering how many valuable things from that book have gotten overlooked over the years.
Tweet media one
6
8
94
@ifesdjeen
Alex P
5 months
If you think Andres Freund has saved you from xz vulnerability, this is not the first time he finds tricky subtle problems across software boundaries. But this one, you probably won’t read in NYTimes about. Oh wait…
1
10
92
@ifesdjeen
Alex P
5 years
If you like Distributed and/or Storage Systems, and are interested in working on Cassandra as an Infrastructure Engineer at Apple, let's talk!
6
36
91
@ifesdjeen
Alex P
3 years
Dear @lufthansa . It might be surprising, but there are people whose name ends with “DR”. It doesn’t mean you should turn this into their title. Checked my boarding passes for last 10 years, and everywhere I am Dr Oleksan. I appreciate your recognition of my intelligence.
Tweet media one
10
2
90
@ifesdjeen
Alex P
2 years
Two days, two hikes. 60km and 3k+ elev gain near and around Kochel. 12/10, will repeat many times.
Tweet media one
Tweet media two
Tweet media three
Tweet media four
1
1
90
@ifesdjeen
Alex P
2 months
Systems Distributed was such a great conference. Has a chance to meet so many great folks doing all sorts of things in databases and distributed systems. Very inspiring!
4
7
88
@ifesdjeen
Alex P
3 years
In case you haven’t read the first version, the second version of “Is Parallel Programming Hard, And, If So, What You Can Do About It?”is out, and you better check it out!
3
20
85
@ifesdjeen
Alex P
2 months
Join the systems programming revolution, its official! @croloris
Tweet media one
1
8
81
@ifesdjeen
Alex P
3 years
Boarded the train, en route to Lviv
@ifesdjeen
Alex P
3 years
Mom and dad were able to hop on the train to the central station. Next step: evacuation train.
8
0
175
0
0
84
@ifesdjeen
Alex P
4 years
If you like reading papers, join paper reading group in Databass Slack. Check out the details and the list of the papers we've already read: The read for next two weeks is SSS: Scalable KV Store with External Consistent and Abort-free RO Transactions.
0
19
80
@ifesdjeen
Alex P
7 years
On Disk IO: Part 2, More IO Flavours is now out:
1
31
83
@ifesdjeen
Alex P
6 months
We seriously need to start the whole CAP conversation once again from very scratch with way more rigorous terminology. These slides of hands that start with consistency "according to the desired service specification" and then raising it to "atomic consistency, because it is
8
8
81
@ifesdjeen
Alex P
2 years
@heidiann360 I’m not a researcher, but I had to draw a fair share of diagrams. Used OmniGraffle, found it very useful. Some examples:
Tweet media one
Tweet media two
Tweet media three
Tweet media four
5
8
81
@ifesdjeen
Alex P
4 years
Ukraine got it right: there's 1 confirmed case there, and all schools and universities are closed, mass gatherings and events are limited. Learn from other countries, don't wait for a number to grow; prevent instead of reacting.
2
11
81
@ifesdjeen
Alex P
6 years
62 pages chapter on B-Trees is now edited, with 48 images made for it! In addition to 21 pages on storage taxonomy. “Only” 127 pages more to go. I need a break now 😴
5
3
80
@ifesdjeen
Alex P
7 years
First batch of my notes for Velocity talk on Disk IO is out! Flavours of IO:
2
24
80
@ifesdjeen
Alex P
6 years
A year ago on this day I got a response from O'Reilly saying that they'd like to proceed with a book. Today it's 230 pages \o/
6
2
80
@ifesdjeen
Alex P
6 years
One of the reasons I re-read distributed systems papers multiple times is mobbing: a single non-careful wording and you’re out. The crowd will rage and question your competence. This is also the reason I always tried to avoid writing longer pieces on functional programming.
3
15
79
@ifesdjeen
Alex P
7 years
\o/ my proposal for Velocity in San Jose got accepted! Will start working on notes and release them same as last year with disk IO series. This year’s subject is Consensus Algoritms.
3
3
77
@ifesdjeen
Alex P
3 years
Best vacation ever. Südtirol / Dolomites. Took a 3200m peak today!
Tweet media one
Tweet media two
Tweet media three
4
0
77
@ifesdjeen
Alex P
7 months
We already have 750+ folks in Database Internals Discord & Book Club. We're currently reading a Fault Tolerance chapter from Transaction Processing by Gray/Reuter, and the next meet-up is this Tuesday. Join up if you want to learn more about how databases work!
4
15
75
@ifesdjeen
Alex P
3 years
They’ve crossed the Polish border! Meanwhile, a missile has exploded right next to the train station where they’ve started their journey yesterday evening.
@ifesdjeen
Alex P
3 years
Waiting for them to cross the Polish border.
6
0
72
6
0
75
@ifesdjeen
Alex P
2 years
I’m a bit concerned if “hacking” design interviews is going eventually lead to same thing that happened with Whiteboarding. Will we eventually have to recite Paxos, Raft, ZAB, and VR papers by heart to get a backend job?
10
4
75
@ifesdjeen
Alex P
5 years
Last read is done and final edits are submitted to my production editor. Waiting for their “go” and can’t wait to see the book in print. Secretly hoping there won’t be much more work. It’s been hard 1,5 years, and I’m grateful to all the friends who helped me to work through it.
6
3
75
@ifesdjeen
Alex P
2 years
Some more great LSM papers: * Monkey: Optimal Navigable Key-Value Store * Jungle: Towards Dynamically Adjustable Key-Value Store by Combining LSM-Tree and Copy-On-Write B+-Tree * LLAMA: A Cache/Storage Subsystem for Modern Hardware
@raphael_scarv
Raph Carvalho
2 years
Sharing some really good papers on LSM-tree storage engine domain (some focused to compaction; others more abstract). I would be very glad to hear suggestions if you know any paper that's not listed here. pls find them in this thread...
2
10
56
1
18
73
@ifesdjeen
Alex P
4 years
Given how much has happened in databases in the past two years, I'm seriously considering a small, ~100 pages update to @therealdatabass
@therealdatabass
Database Internals Book
4 years
Fascinating! Check it out, another learned index data structure; this time without neural nets, The PGM-index. Presentation: Paper:
1
20
75
4
2
75
@ifesdjeen
Alex P
3 years
Waiting for them to cross the Polish border.
@ifesdjeen
Alex P
3 years
Boarded the train, en route to Lviv
0
0
84
6
0
72
@ifesdjeen
Alex P
6 months
Database Internals reading group is now over 1K people! We got some awesome folks from all over the industry & academia. We're currently reading Transaction Monitors chapter from Gray/Reuter Transaction Processing book. If you're not in yet, join us:
2
10
73
@ifesdjeen
Alex P
3 years
Had there been a Database Internals II, what would you expect to see in it? What kind of new material? Or, dare I say, new format?
17
9
71
@ifesdjeen
Alex P
5 years
Talk on distributed consensus by @heidiann360 ! Definitely recommend to read a paper, too!
Tweet media one
4
15
71
@ifesdjeen
Alex P
5 years
If you got Database Internals and enjoyed it, please consider posting rating on the website where you purchased it. I know it’s just a pointless number for many, but this really helps the book and the author. Who knows, there might be another one?
4
10
71
@ifesdjeen
Alex P
4 years
This year, I’ve found myself coming back to several books I’ve read a while ago and re reading them to get new insights. Back in the day i resisted re-reading as it felt redundant. Doesn’t feel this way anymore as reading the same book with a new mind is like reading a new book.
5
3
68
@ifesdjeen
Alex P
11 months
@eatonphil It has, at least on dotcom amzn! I know it’s often a lot to ask, but for many thousands of copies sold only a few rate it. If you enjoy the read, and got a couple of minutes, please hit those stars. This helps a lot and may even encourage the author to work on V2!
Tweet media one
3
1
70
@ifesdjeen
Alex P
6 years
Two things you should never google to keep your faith in humanity: * wide-column vs column-oriented databases * serialisability vs sequential consistency
4
11
67
@ifesdjeen
Alex P
4 years
Would anyone be interested to read and discuss papers together? I have some in mind, and it seems like many folks do this because everyone spends more time at home. I can provide free excerpts from the book for relevant reference points, too!
10
16
66
@ifesdjeen
Alex P
5 years
Last two additions to the book are now submitted: acknowledgements and dedication. One more read and I’m done!
3
2
68
@ifesdjeen
Alex P
4 years
I’m giving away a paper copy of Database Internals book! To enter the raffle, respond to this tweet with your favorite podcast episode related to distributed systems! Check out the @therealdatabass episode of Software Engineering Radio for inspiration:
14
18
64
@ifesdjeen
Alex P
6 years
Want to work on Cassandra at massive scale at ? Ping me if you’re interested, I know just the right person you should talk to!
6
50
65
@ifesdjeen
Alex P
5 years
If you've ever struggled to understand how Linearizability relates to Sequential Consistency, or Serialisability, and how consistency models generally fit together, read this paper. A must-read for any distributed systems engineer:
@therealdatabass
Database Internals Book
5 years
Day 7: There are quite a few consistency models, each one having its own important implications. A great overview of consistency models by Paolo Viotti and Marko Vukolić: Consistency in Non-Transactional Distributed Storage Systems:
2
10
44
1
17
65