john kutay Profile
john kutay

@JohnKutay

Followers
1,613
Following
1,105
Media
588
Statuses
3,582

engineering @ striim (streaming sql / cdc / pipelines). what’s new in data for 🎙️

San Francisco, CA
Joined April 2022
Don't wanna be here? Send us removal request.
Explore trending content on Musk Viewer
Pinned Tweet
@JohnKutay
john kutay
1 month
He died doing what he loved: providing a best-in-class enterprise platform that accelerates digital transformation all while maximizing business value.
36
736
9K
@JohnKutay
john kutay
3 months
@nikitabier Marriotts have a crazy, distinct advantage where they don’t make you do chores and then frame you for breaking a home appliance. Hard to compete with that.
11
13
1K
@JohnKutay
john kutay
6 months
@AnnieAgar We’ve progressed from Brock being a game manager to a guy carried by his teammates to being a vessel for divine intervention. Literally anything other than “he’s good at football.”
16
21
546
@JohnKutay
john kutay
1 year
@AlexNoonan6 Networking to brainstorm how much generational wealth they can waste on clubs and bottle service.
1
7
512
@JohnKutay
john kutay
5 months
@sampullara 10k hours of remote meetings. Some contentious back and forth between dozens of PMs, UX, engineering teams. A few winter offsites in Aspen and summer offsites in Monaco. But in the end the juice was worth the squeeze and we as users get the benefit of this magical experience 🥰
4
7
322
@JohnKutay
john kutay
4 months
In my college Linear Algebra class we read a paper called the '$25 Billion Eigenvector' that detailed the Google PageRank implementation (obviously Google's valuation has skyrocketed since the publishing of that paper 😂). In a similar fashion, I'm going to refer to Confluent
Tweet media one
7
57
281
@JohnKutay
john kutay
5 months
Snowflake stock dropping 23% on news of Frank Slootman's retirement is the real life version of the 'There he is, that is Dad' moment in succession
Tweet media one
Tweet media two
2
16
218
@JohnKutay
john kutay
23 days
@rohindhar you have to be okay with $3.295 washing away into the ocean at some point.
3
0
212
@JohnKutay
john kutay
11 months
@pronounced_kyle We put the AI in A busIness
2
2
191
@JohnKutay
john kutay
1 year
"We use SQLServer read replicas for OLAP and warehousing."
Tweet media one
7
10
187
@JohnKutay
john kutay
22 days
You all want the dashboard but no one wants to see the horror behind it
Tweet media one
3
17
168
@JohnKutay
john kutay
8 months
S3 native is the future. But you need serialization format. So delta is the future...unless you use Snowflake...then iceberg is the future...but maybe you want to be the coolest dev on data twitter...then duckdb is the future. which uses S3. S3 is the future? Also OneLake. Fabric
11
9
160
@JohnKutay
john kutay
8 months
@NickKyrgios I liked the Boris Becker documentary more than Break Point 🤷🏽‍♂️
1
1
128
@JohnKutay
john kutay
5 months
@gnosisle this guy definitely went skiing and there's no other possibility. unless it's summer...then he went hiking in wyoming.
3
0
125
@JohnKutay
john kutay
2 years
When someone technical wants to buy a new specialized SaaS tool to analyze some metric that's already in the warehouse.
Tweet media one
6
6
108
@JohnKutay
john kutay
21 days
What do I honestly think the future of data is? SQL and python. Python and SQL. More python, and then sprinkle in a lot of SQL. Did I mention sql and python?
10
3
107
@JohnKutay
john kutay
9 months
Data Linkedin/Twitter: The Modern Data Stack is DEAD. It's OVER! Fortune 100 enterprise rolling out Snowflake: We're in a 2-year dev period with a target prod date of January 2025.
8
4
101
@JohnKutay
john kutay
11 months
@0xgaut Why is that telescope grinding espresso beans?
1
1
93
@JohnKutay
john kutay
4 months
@pdrmnvd all technology is technical debt.
1
1
98
@JohnKutay
john kutay
1 year
"all of data engineering is duct tape around the impedance mismatch between software engineering and analytics" data twitter:
2
8
97
@JohnKutay
john kutay
2 years
Data engineers doing BI: I use a fully managed warehouse to trade complexity for simplicity and good performance out of the box. Data engineers doing streaming: so I import the Flink binaries, clone debezium, set up a kinesis firehose, S3 bucket w/parquet, config a NAT gateway..
3
3
88
@JohnKutay
john kutay
2 years
Your favorite apps use real-time data and AI-driven personalization (Uber, Twitter, Netflix etc) yet the modern #data stack is built with batch, non-real-time data. 🤔 I wrote a simple explainer on how stream processing makes your analytics real-time.
2
10
73
@JohnKutay
john kutay
2 years
@0xgaut If you actually use the salt and pepper you will get a $100 charge for not replenishing it.
0
1
78
@JohnKutay
john kutay
2 years
@themoneyshark Where’s the coffee beverages drank out of tiny cups?
4
0
60
@JohnKutay
john kutay
4 months
@0xgaut 9 sitting on a couch looking at the screen sideways squinting hard with one eye ftw
1
0
68
@JohnKutay
john kutay
1 year
@soychotic This site on mobile
1
1
64
@JohnKutay
john kutay
2 years
@mattturck 2020: San Francisco is dead 2022: driving through the empty downtown streets, desperately looking for devs to attend a meetup
2
0
62
@JohnKutay
john kutay
6 months
Now "decentralize our data with self-service analytics"
Tweet media one
@matsonj
Jacob Matson
6 months
what you actually get
Tweet media one
8
6
170
4
5
59
@JohnKutay
john kutay
13 days
Proud to announce my new job at Crowdstrike as a senior release engineer. Pushed my first feature earlier today and couldn’t be happier with my work.
3
6
59
@JohnKutay
john kutay
2 years
Hacker News editorialized titles are ruthless
Tweet media one
Tweet media two
1
4
57
@JohnKutay
john kutay
8 months
@chrisalbon its definitely distinct. saccharine with embarrassing levels of emoji use. i can spot chatgpt generated linkedin posts in a second...
3
0
55
@JohnKutay
john kutay
1 year
/r/dataengineering has caught on to thinly veiled #vendorcontent . data startups are frantically pivoting back to SEO.
Tweet media one
3
1
52
@JohnKutay
john kutay
3 months
Change Data Capture (CDC) is crucial for real-time data integration and ensuring that databases, data lakes, and data warehouses are consistently synchronized. There are two primary CDC apply methods that are particularly effective: 1.Merge Pattern: This method involves creating
Tweet media one
1
3
49
@JohnKutay
john kutay
2 years
@andykreed Notion is the haskell of note taking tools
1
2
49
@JohnKutay
john kutay
10 months
Seems like there isn’t a good solution for data. Have any vendors solved this?
16
3
48
@JohnKutay
john kutay
6 months
grasping for duckdb straws in the gc
Tweet media one
2
2
47
@JohnKutay
john kutay
1 year
What's stopping you from creating kickass dashboards like this?
Tweet media one
13
0
47
@JohnKutay
john kutay
1 year
B2B SaaS pricing is like.. Starter Plan: Free 🥰 Same thing but with SSO: $150,000/yr 🤑
4
0
46
@JohnKutay
john kutay
6 months
Rust backend, React frontend, DuckDB middleware is the future we've been waiting for.
6
1
45
@JohnKutay
john kutay
11 months
We abandoned data and saved hundreds of hours per month collecting insights to drive strategic decisions. Now we just defer everything to the opinion of the highest paid person in the meeting, email chain, and slack thread. We can now make decisions in seconds, in what otherwise
Tweet media one
5
1
44
@JohnKutay
john kutay
19 days
Data teams when someone asks for a simple report
Tweet media one
1
1
44
@JohnKutay
john kutay
1 year
never underestimate the power of smart people doing random stuff that they feel like doing.
1
0
44
@JohnKutay
john kutay
7 months
There’s literally a product that does data integration within 3 seconds of latency and half the cost of your current ELT. Airlines, banks, hospitals, systematically important market utilities use it. Startups won’t use it because it doesn’t have an animal mascot. 🐸☕️
@ccccjjjjeeee
Christopher Ehrlich
7 months
Which technology is this?
Tweet media one
880
70
1K
7
2
40
@JohnKutay
john kutay
1 year
Data streaming isn't just about 'real-time analytics' in the sense of flashing dashboards or stock tickers. An event-driven system that can incrementally process changes and publish to downstream consumers can be a game changer for scaling analytics and data-driven apps
4
3
42
@JohnKutay
john kutay
2 years
when vendors try to participate in data engineering twitter ... (it's me, vendors)
Tweet media one
1
0
42
@JohnKutay
john kutay
7 months
@nayshins @pdrmnvd I keep clicking stuff until company cloud costs suggest high availability
1
0
42
@JohnKutay
john kutay
3 months
@anothercohen I’m sure they all checked their slack messages that day so it’s fine
0
0
42
@JohnKutay
john kutay
2 years
Me at Coalesce New Orleans this week..
Tweet media one
2
2
41
@JohnKutay
john kutay
2 years
@JustJake Every productivity app startup during #apple #WWDC announcements
2
1
38
@JohnKutay
john kutay
1 year
Some 19 year old kid just sailed solo across the pacific and you’re here messing with YAML
4
4
40
@JohnKutay
john kutay
1 year
Confirmed I’ll be speaking at both Snowflake Summit and Databricks Data and AI…we will test the limits of Las Vegas-San Francisco travel times.
3
0
37
@JohnKutay
john kutay
2 years
@litcapital Do NOT ask an intern to do urgent work he’s gonna be like “yeah I’m at the gym and going to fyre festival then doing a retreat in Bali with the bros I’ll get it to you when I’m aligned.”
0
3
39
@JohnKutay
john kutay
1 year
We did a thing! Real-time data, CDC connectors, streaming SQL. No kafka, flink, debezium knowledge required. Start streaming millions of events for free. All your work is saved as SQL. Ok I've reached the #vendorcontent limit, DM me if you want a personal tour : )
Tweet media one
2
3
39
@JohnKutay
john kutay
2 years
@andykreed “Everything is a monad” 🤝 “Everything is a block”
0
1
37
@JohnKutay
john kutay
1 year
@pdrmnvd @ebitdaddy90 I prefer paying 8% less to stay at a lux Airbnb, handwash the dishes and take out the trash. Then get in an immediate $500 dispute because the host said we forgot to rake the leaves.
1
0
38
@JohnKutay
john kutay
3 months
@VCBrags dudes should be treating marriage like a B2B sales process!
Tweet media one
1
1
38
@JohnKutay
john kutay
2 years
@sethrosen Want to containerize your applications? It's ECS pie!
2
1
37
@JohnKutay
john kutay
2 years
After working with multiple airlines and freight/logistics companies on building real-time CDC+streaming pipelines specifically for crew assignment, reading about this @SouthwestAir thing is wild. I cannot imagine why they need a manual dial-in process to see the current state
2
1
37
@JohnKutay
john kutay
1 month
It’s been a rewarding 18 months with this boy.
Tweet media one
3
0
33
@JohnKutay
john kutay
7 months
@pdrmnvd @nayshins I for one have mastered distributed systems. Twitter can hire me if they want to learn my secrets...
Tweet media one
1
1
34
@JohnKutay
john kutay
2 months
Snowflake announces Iceberg catalog on a Monday. Databricks announces that they bought “the iceberg company” on that same tuesday. 4D chess 😅
@JohannesVink
Johannes Vink
2 months
Whoa @databricks just announced the acquisition of Tabular, the creators behind the Iceberg format:
4
7
44
4
2
33
@JohnKutay
john kutay
1 year
Excited to see an entire category of data dedicated to learn why you shouldn’t query prod for OLAP purposes
7
1
32
@JohnKutay
john kutay
4 months
Did Striim + Databricks kill the streaming database market last week 😅? Tongue in cheek comments aside, in our joint session last week we spoke about auto-incrementing materialized views derived from database change streams and SaaS application sources that surface events in an
Tweet media one
1
4
32
@JohnKutay
john kutay
2 years
@alexschief “The Anti-Gentrification Automobile Parking Alliance has sued the city for the decision on the bike rack and will file for an appeal with the Supreme Court. They claim the bike rack casts shadows on the sidewalk…”
1
0
30
@JohnKutay
john kutay
2 years
Satya repping Striim. I'd say that's prettayy prettayy good.
Tweet media one
1
4
31
@JohnKutay
john kutay
2 years
We saw a real data contract violation today at work. I had nothing to say. We just fixed it and moved on with our lives.
1
1
31
@JohnKutay
john kutay
7 months
If you want some light weekend reading, I recommend this gentle 'introduction' to Algorithms. I was able to sift through most of it over the holiday weekend.
Tweet media one
8
1
31
@JohnKutay
john kutay
1 year
*whispering* You can build your business level aggregates in the streaming layer.
Tweet media one
4
1
30
@JohnKutay
john kutay
1 year
Me shipping a 6-tool MDS powered metric that prob should have just been done in a spreadsheet
1
1
29
@JohnKutay
john kutay
2 years
@WillManidis Swear it’s cus the chip is broken not because my whole world became insolvent in the span of 11 days
2
3
29
@JohnKutay
john kutay
2 months
@rakesh_goyal people complain that SF is boring and then get upset when the embarcadero turns into a scene from mad max.
0
0
29
@JohnKutay
john kutay
2 years
In today's episode of 'Renaming Things That Already Exist'
Tweet media one
1
1
29
@JohnKutay
john kutay
1 year
"our biggest problem as a data team? our data is TOO fresh. users don't like how it's so close to real time." - said no one EVER.
4
2
28
@JohnKutay
john kutay
11 months
Even if your reports run hourly, data streaming and change data capture is the best way to ensure accurate, transactionally consistent data the business can trust. American Airlines demonstrates this with their 'Aircraft Ops' reports that compares real-time data with historical
Tweet media one
1
3
28
@JohnKutay
john kutay
1 year
standing-room only for @_abhisivasailam as he alchemizes graphs, recursion, and metrics into a strong mandate for data teams.
Tweet media one
4
2
27
@JohnKutay
john kutay
1 year
Slack message from coworker: Hey quick question!... Narrator: It was not a quick question.
5
0
27
@JohnKutay
john kutay
1 year
Please don’t say “have a good weekend” in a work setting. My capacity to process any information is limited, and you are making me context switch from professional to personal. This alone takes several cycles of mental compute. Our employment contracts already state we will pause
7
2
27
@JohnKutay
john kutay
1 year
Oh you want a single source of truth? Here’s a git directory with all our YAML.
1
1
26
@JohnKutay
john kutay
1 year
Crazy how coding interviews are changing, here's a recent experience... Interviewer: Sort this list me: list.sort() them: Ok now with n log n time complexity me: list.heapSort() them: Ok make it faster me: list.quickSort() them: you failed me: Why? them: Not once did you ask
0
0
27
@JohnKutay
john kutay
11 months
Oh? We’ve never met and you want to put 30 minutes on my calendar to “pick my brain”? That sounds amazing…and mutually beneficial!
5
1
26
@JohnKutay
john kutay
2 years
@sheisek @javroar If you thought you had a bad day at work, this guy got fired after getting attacked by a bear.
1
1
25
@JohnKutay
john kutay
5 months
Open Source data stacks + generated code is going to be a helluva drug.
Tweet media one
3
2
26
@JohnKutay
john kutay
1 year
This is the biggest crossover in data since Pedram bought a birdfeeder.
@teej_m
» teej
1 year
Cheers to @JohnKutay who finally broke out of #vendorcontent .
Tweet media one
2
6
38
0
0
25
@JohnKutay
john kutay
15 days
Tweet media one
0
0
26
@JohnKutay
john kutay
1 year
"Hey, I saw you up on stage advocating and praising our competitor in front of 100 people. Would you be interested in a 15 minute call with us about replacing them?" is a wild outreach but bravo for the confidence.
2
0
25
@JohnKutay
john kutay
2 years
GM
Tweet media one
2
2
24
@JohnKutay
john kutay
2 months
A self driving electric car but it’s just a PostgreSQL extension
1
3
25
@JohnKutay
john kutay
6 months
Please don’t say “have a good weekend” in a work setting. My capacity to process any information is limited, and you are making me context switch from professional to personal. This alone takes several cycles of mental compute. Our employment contracts already state we will pause
4
0
24
@JohnKutay
john kutay
1 year
The smartest people will always ask a seemingly dumb question, forcing you to give a simplistic answer, then they will hit you with a follow-up question that dismantles every assumption you've ever made in one fell swoop.
0
2
24
@JohnKutay
john kutay
1 year
@thetinot @rubinsafaya @MuellerSheWrote most people think s3 is just object storage but when you throw in the word cluster, it turns it into a magical thing you can run your whole 'tech stack' on.
1
0
24
@JohnKutay
john kutay
2 years
If a spreadsheet falls in the forest and no ones around to hear it, is it in production?
2
2
24
@JohnKutay
john kutay
2 years
@teej_m Almost 8 billion people drink water, yet most people don’t know how to use it to its full potential…🧵
1
1
23
@JohnKutay
john kutay
2 years
This will never not be funny ⁦ @bernhardsson
Tweet media one
2
0
23
@JohnKutay
john kutay
2 years
Do NOT send me an article about “snowflake data ingestion best practices”. I literally am the best practices.
4
0
23
@JohnKutay
john kutay
5 months
How do we fix Boeing? Easy. Data Contracts™️
6
1
23
@JohnKutay
john kutay
1 year
Hot take: Dagster is the React.js of data. As a former front-end dev, had a great time chatting with @floydophone on this topic and drawing parallels between the two frameworks.
2
4
22
@JohnKutay
john kutay
1 year
Some say data is a valuable asset. I say it's a huge liability. Delete all the data you have.
6
2
22
@JohnKutay
john kutay
1 year
I'm under pressure to put out more good tweets about data. How about I... "select * from ideal_clean_table_of.good_tweets;"
3
0
21