Pavel Surmenok Profile
Pavel Surmenok

@surmenok

Followers
2,000
Following
4,519
Media
61
Statuses
5,014

Training networks for autonomous driving @Tesla_AI

Redwood City, CA
Joined July 2009
Don't wanna be here? Send us removal request.
Explore trending content on Musk Viewer
Pinned Tweet
@surmenok
Pavel Surmenok
3 months
One man’s prior is another man’s posterior.
0
1
13
@surmenok
Pavel Surmenok
7 months
@Austen One thing to check: does it run Windows or Linux
9
0
174
@surmenok
Pavel Surmenok
11 months
Once upon a time, I interviewed a seasoned ML engineer, asked him “what do you think about batch norm”. He looked at me with eyes full of painful memories and laughed. Then I knew that he is an expert.
7
3
152
@surmenok
Pavel Surmenok
7 months
@jeremyphoward Autoregressive vs. diffusion makes more sense.
3
0
152
@surmenok
Pavel Surmenok
4 months
@dividendology One is sum of all savings, another is growth rate. 2nd image as sum of savings would look more like this. Still noisy, but not as much.
Tweet media one
2
5
146
@surmenok
Pavel Surmenok
6 months
@SergioRocks The question is false. AI is a tool. Should judge impact and quality of work.
1
0
139
@surmenok
Pavel Surmenok
2 months
@tamaybes Can you please retrain the model to make sure there are no issues with upload
0
0
118
@surmenok
Pavel Surmenok
2 months
@jxmnop How about converting it to a (N, 2) numpy array and storing as npz (compressed)?
1
0
99
@surmenok
Pavel Surmenok
3 months
@cremieuxrecueil What surprised me: even men from Denmark have quite high proportion of 18%
10
0
89
@surmenok
Pavel Surmenok
3 months
@seldo Impact on me (in the valley): I wanted to order a drink to pick up in Starbucks on the way to work, the app showed a message that early order is not available. No other issues so far.
2
0
89
@surmenok
Pavel Surmenok
2 months
@theorizur Have you tried to work at startups, or at orgs that move fast (e.g. pretty much any Elon’s company)?
8
0
84
@surmenok
Pavel Surmenok
11 months
@ID_AA_Carmack Not unlike Windows which had all kinds of patches for bugs in 3rd party apps. “On beta versions of Windows 95, SimCity wasn’t working in testing. Microsoft tracked down the bug and added specific code to Windows 95 that looks for SimCity. If it finds SimCity running, it runs the
3
7
82
@surmenok
Pavel Surmenok
1 month
Tweet media one
0
0
76
@surmenok
Pavel Surmenok
4 months
@alfred_twu Isn’t it odd not counting San Diego as west?
6
0
75
@surmenok
Pavel Surmenok
1 year
@mualphaxi @Stanford It might be worth to find authors of the posters and make a clear permanent searchable record of their actions.
4
0
67
@surmenok
Pavel Surmenok
1 month
Almost every startup at YC Demo Day is building with LLMs. Huge change comparing even with the previous demo day.
6
8
58
@surmenok
Pavel Surmenok
8 months
It’s Monday. Time to build.
Tweet media one
4
1
52
@surmenok
Pavel Surmenok
1 year
@stylewarning I’d start from writing tests. Then see if it’s well modularized or it’s a ball of spaghetti, attempt to refactor in the latter case.
1
0
52
@surmenok
Pavel Surmenok
2 months
@jxmnop Must be small integers if it takes less than 10 bytes to encode a pair in text.
3
0
50
@surmenok
Pavel Surmenok
1 year
In the Arena today. Trying stuff. Some will work, some won’t. Always learning.
Tweet media one
5
0
46
@surmenok
Pavel Surmenok
11 months
Now general public will learn about mighty Q-learning algorithm
@ericjang11
Eric Jang
11 months
reading in between the lines, is Q* the fabled breakthrough in AlphaStar-style search + LLM that so many big labs are trying to get working? Many research projects in GPT-4 self-verification + search have not yielded really strong performance improvements, so I'd be quite
31
43
680
0
0
39
@surmenok
Pavel Surmenok
5 months
@juliepoptart Share officer name and badge number, public should know
0
0
40
@surmenok
Pavel Surmenok
5 months
@Noahpinion Implication is that logical thinking is a right-wing thing.
3
0
40
@surmenok
Pavel Surmenok
11 months
@pronounced_kyle Log scale for y axis might help to see trends better
2
0
38
@surmenok
Pavel Surmenok
2 years
@Tendar Может он так шифровку передает азбукой Морзе?
0
0
34
@surmenok
Pavel Surmenok
7 months
That’s a lot!
3
1
34
@surmenok
Pavel Surmenok
6 months
@jmrphy Yes. It didn’t happen 2 years ago, now happens all the time. Big regression, sadly.
2
0
35
@surmenok
Pavel Surmenok
2 months
@GarrisonLovely Reading the article, it looks more like Hoduras became a nightmare for Honduras and wants to ruin it by walking back the deal they previously agreed on. They deserve to be bankrupted if that’s the case.
2
0
34
@surmenok
Pavel Surmenok
3 years
@Carnage4Life @BeanstalkFarms So it’s not stolen then, the protocol worked as designed. Fascinating.
0
0
31
@surmenok
Pavel Surmenok
24 days
@hankgreen Hard to believe it was 100%
8
0
30
@surmenok
Pavel Surmenok
2 months
@nathanbenaich He refers to Noam Shazeer’s LinkedIn profile. Legend.
Tweet media one
1
1
30
@surmenok
Pavel Surmenok
1 year
@debarghya_das Maybe it was easier to immigrate to US back then?
2
0
27
@surmenok
Pavel Surmenok
8 months
@Tsla99T That’s the first thing I checked this morning! Keeping GPUs busy.
2
0
29
@surmenok
Pavel Surmenok
5 months
@GergelyOrosz @t3dotgg @ThePrimeagen I don’t joke about bus factor. I’m very serious about bus factor.
0
0
29
@surmenok
Pavel Surmenok
10 months
@Austen Torrent is the ultimate weapon of a free man
1
0
26
@surmenok
Pavel Surmenok
26 days
@yishan The best time to start was 8 years ago. The next best time to start is now
0
0
24
@surmenok
Pavel Surmenok
10 months
@patio11 I’ve heard exactly the same from a barber around Thanksgiving. He also said that if he goes on vacation, his regular customers will find another barber and his business will suffer long term.
0
1
21
@surmenok
Pavel Surmenok
1 year
OpenAI board members cleaned up their social media profiles: Tasha McCauley closed off Twitter, Helen Toner and Adam D'Angelo don't mention OpenAI on LinkedIn.
1
3
23
@surmenok
Pavel Surmenok
4 months
@naderi_yeganeh Did you come up with these equations manually?
3
0
21
@surmenok
Pavel Surmenok
5 months
@peterrhague Honestly I thought it’s your real photo, AI augmented. That’s odd that some people are mad about EVs. EVs are great.
8
0
22
@surmenok
Pavel Surmenok
11 months
Next gen Tesla Bot. The future is already here! Great job @_milankovac_ and the team!
@Tesla_Optimus
Tesla Optimus
11 months
There’s a new bot in town 🤖 Check this out (until the very end)!
3K
7K
32K
0
0
21
@surmenok
Pavel Surmenok
7 years
1080Ti is still economically better than Titan V if you run CNNs.
1
7
19
@surmenok
Pavel Surmenok
1 year
@karpathy Problem with comments is that they get out of sync with code. Best code is self-documented. Comments should not explain what the code is doing, but may explain why, e.g. reasons for unconventional usage of something something as workaround for a bug somewhere.
4
0
19
@surmenok
Pavel Surmenok
1 year
@finbarrtimbers GPU utilization is a bad metric in practice. GPU utilization can be 100% while GPU does nothing but waiting for e.g. NCCL communication from other ranks. GPU power consumption is more informative.
3
0
18
@surmenok
Pavel Surmenok
8 months
I wish Google to publish a thorough postmortem to explain what went wrong with aligning their chatbot and how they are going to fix it. Curious how much of it are explicit instructions in the system prompt vs. RLHF.
3
0
18
@surmenok
Pavel Surmenok
2 years
@RazRazcle Link to the paper:
1
2
19
@surmenok
Pavel Surmenok
3 months
@KareemRifai Like elections in Russia in 2011 when pro-Putin party won, and votes in one region (as displayed on TV) summed up to 146%
1
0
19
@surmenok
Pavel Surmenok
11 months
- What is Occam's razor? - Well, the simplest explanation is that there is a guy named Occam and it is his razor.
3
0
19
@surmenok
Pavel Surmenok
11 months
Ever tried. Ever failed. No matter. Try Again. Fail again. Fail better.
2
0
18
@surmenok
Pavel Surmenok
7 months
@shanselman @markrussinovich Never look at desktop, always maximize windows
1
0
17
@surmenok
Pavel Surmenok
7 months
@nikitabier I’ve owned a house for less than two years, and it’s relatively new and recently renovated, but I already have phone numbers for good repairmen for all kinds of things.
0
0
15
@surmenok
Pavel Surmenok
4 months
A story about a black SFFD firefighter assaulting his Asian colleague. The department tried to cover it up, the victim was fired, the assaulter kept his job. So much dysfunction in SF public services.
@RealDianeYap
Diane Yap
4 months
Black privilege in SF: Black firefighter looks up Asian coworker’s address, shows up at his house and tries to beat him to death with a wrench. Asian firefighter gets fired for cooperating with police. Black firefighter keeps his job, never missing a paycheck.
Tweet media one
Tweet media two
Tweet media three
Tweet media four
862
6K
36K
1
0
16
@surmenok
Pavel Surmenok
7 years
Comparison of Google TPUv2 with NVIDIA V100 GPU on training of ResNet 50. TPU and GPU are equally fast, TPU is significantly cheaper.
2
11
16
@surmenok
Pavel Surmenok
8 months
Sometimes the model answer is wrong. Sometimes the model answer is correct but we just don’t like the result.
2
0
16
@surmenok
Pavel Surmenok
10 months
@pronounced_kyle Show me training loss going to 0
2
0
16
@surmenok
Pavel Surmenok
3 months
@srush_nlp We should normalize pseudonyms and links to arbitrary webpages. Democratizing science.
0
0
16
@surmenok
Pavel Surmenok
1 year
@Yampeleg Perhaps you mean data for language models? Add video modality and storage size skyrockets to petabytes.
2
0
15
@surmenok
Pavel Surmenok
3 months
@MultiOn_AI Here is the paper
0
2
15
@surmenok
Pavel Surmenok
11 months
When building ML systems, it's helpful to have a mental model of how various parts of the system work. For example, this morning I noticed that a distributed ML eval job that I expected to finish in about an hour was completed in <30 minutes, which revealed a bug.
2
0
14
@surmenok
Pavel Surmenok
4 months
GPUs go brrrrr
@SawyerMerritt
Sawyer Merritt
4 months
Construction progress update on the huge fans Tesla is installing at Giga Texas for the company's new GPU data center cooling system. Full video from Brad Sloan:
71
158
2K
0
0
15
@surmenok
Pavel Surmenok
8 months
One thing I know for sure: @Scobleizer likes making lists!
Tweet media one
1
1
13
@surmenok
Pavel Surmenok
10 months
What is it called when you buy books 3 times faster than are able to read them?
10
0
13
@surmenok
Pavel Surmenok
8 months
@burkov If it’s really autonomous, I would start a “bodyshop” (company that hires developers to rent them out to clients). Probably larger market than UpWork. Or, wait, that’s basically your option 2.
1
0
15
@surmenok
Pavel Surmenok
1 year
@yishan I don’t feel like Americans care. Never had a problem with my Slavic accent.
1
0
15
@surmenok
Pavel Surmenok
5 months
@Noahpinion Not chipmaking equipment. Just renting out chips themselves.
0
0
15
@surmenok
Pavel Surmenok
6 months
@steveofmcleod I remember when I was 12yo I started building relational databases for the first time. It was a profound discovery that they are just a bunch of interconnected tables.
1
0
13
@surmenok
Pavel Surmenok
1 year
@abacaj Old GPUs are being sold on black market, with payment via untraceable crypto. People who are doing it are called “brick dealers”. They earn almost as much money as underground LLM training labs in Northern Siberia.
1
1
14
@surmenok
Pavel Surmenok
6 months
That’s what happens when you build innovation in the land of decels…
@NAChristakis
Nicholas A. Christakis
6 months
A German court has ruled that the robots at the Tegut supermarket chain must be given Sundays off, just like human workers. Under German law, retail stores must close on Sundays and Christian holidays in order to give employees a day of rest. Tegut has gotten around that law by
78
72
476
1
1
14
@surmenok
Pavel Surmenok
4 months
@legen_eth Someone should start an ETF following her trades
4
0
13
@surmenok
Pavel Surmenok
10 months
@paulg @arcinstitute How to get access to these reports? Curious.
1
0
12
@surmenok
Pavel Surmenok
1 year
@HamelHusain PyTorch FSDP
2
1
13
@surmenok
Pavel Surmenok
9 months
@burkov Maybe developed by 3 different teams which don’t talk to each other?
1
0
14
@surmenok
Pavel Surmenok
7 months
0
0
14
@surmenok
Pavel Surmenok
1 year
@itamar_mar @karpathy Do you mean underestimate?
1
0
13
@surmenok
Pavel Surmenok
11 months
@FoundersPodcast That’s interesting. Where to read more about Oppenheimer hiring strategies?
0
0
13
@surmenok
Pavel Surmenok
11 months
@alyssamvance Why couldn’t you write a blog?
1
0
13
@surmenok
Pavel Surmenok
4 months
That’s true, I’ve seen posts about it a few minutes after the event, even before X News tab picked it up.
@stclairashley
Ashley St. Clair
4 months
If you are on 𝕏, you witnessed an assassination attempt on President Trump in real time If you are reading regime Media like CNN + Washington Post, you believe Trump fell on stage because of loud noises And you wonder why they hate Elon…
Tweet media one
Tweet media two
1K
10K
46K
1
1
13
@surmenok
Pavel Surmenok
7 months
@YunTaTsai1 I’ve seen research that showed the best learning rate occurs when winning 25% of the time. Perhaps losing more is even more beneficial but people need some wins to keep motivation up.
1
0
13
@surmenok
Pavel Surmenok
3 months
@_xjdr Great. Having Noam building character chatbots felt like waste of talent
0
0
12
@surmenok
Pavel Surmenok
7 months
@emollick Oh, weird. Who ever reads LinkedIn?
0
0
12
@surmenok
Pavel Surmenok
10 months
@Gerashchenko_en US. Nothing in WSJ (more or less neutral publication). These are headlines in left/communist-biased NYT: “1. As War Rages in Ukraine, Denmark Turns an Office Park Back Into an Arsenal The conflict and surging arms production in Russia have spurred demand for ammunition
3
2
11
@surmenok
Pavel Surmenok
11 months
A new model to generate materials given constraints. Exciting.
@xie_tian
Tian Xie
11 months
[1/N] Generative AI has revolutionized how we create text and images. How about designing novel materials? We at @MSFTResearch #AI4Science are thrilled to announce MatterGen: our generative model that enables broad property-guided materials design. 👇
28
228
895
1
2
12
@surmenok
Pavel Surmenok
5 months
Great to have Solar + Powerwall at home. 99% self-powered last week. The missing 1% is due to mismanaging timing of charging Teslas.
Tweet media one
1
0
11
@surmenok
Pavel Surmenok
10 months
@igorsushko What will they do when water in main pipes freezes, and some pipes break from ice expansion?
2
0
12
@surmenok
Pavel Surmenok
11 months
Out of curiosity, sometimes I look for very old citations in AI papers. If a very old paper is cited in a recent one, the cited paper is likely to be interesting as it's cited not because of novelty.
1
0
11
@surmenok
Pavel Surmenok
11 months
Nanobots are becoming a reality!
@FutureJurvetson
Steve Jurvetson
11 months
Astonishing Anthrobots 🦠 If you want nanobots to circulate in your bloodstream to fight cancer or repair tissues, how might you avoid the immune system? Build multicellular self-assembling biobots out of the patient’s own cells, specifically tracheal cells that have wisps of
Tweet media one
13
35
190
0
0
11
@surmenok
Pavel Surmenok
6 months
Everyone quotes the bikini calendar episode, but there are more interesting details in that story. Like TSMC not being able to set priorities straight for employees (“everything is a priority”), and making up bullshit deadlines. The opposite of “working smart”.
Tweet media one
@jordanschnyc
Jordan Schneider
6 months
TSMC Takes on Arizona needs to be a documentary "U.S. engineers told Rest of World that some Taiwanese male engineers had calendars with bikini models on their desks and occasionally shared sexual memes in group chats. A female American colleague, according to an American
Tweet media one
Tweet media two
Tweet media three
Tweet media four
260
1K
8K
2
0
10
@surmenok
Pavel Surmenok
11 months
Tweet media one
2
1
10
@surmenok
Pavel Surmenok
10 months
Merry Christmas, everyone!
0
0
11
@surmenok
Pavel Surmenok
8 months
1
0
11
@surmenok
Pavel Surmenok
1 year
@finbarrtimbers I’ve read it recently and cannot understand the hype. Rating 3/5
1
1
11
@surmenok
Pavel Surmenok
7 months
The deal between Microsoft and Inflection AI is basically acquihire + probably giving Microsoft access to their 22k GPUs. It’s just not practical to do a normal M&A deal because it would be bogged down in anti-trust investigations for years, and they need these people and GPUs
1
3
10
@surmenok
Pavel Surmenok
4 months
@michaelxpettis Undriven roads, uninhabited apartments, unsold cars. What’s next?
4
0
10
@surmenok
Pavel Surmenok
8 months
@Extropic_AI Claude 3 Opus (via Perplexity): INCOMING EXTRASOLAR SIGNAL --- IMPORTANT --- TUNE IN TO THIS FREQUENCY --- 38.4 TRILLION CYCLES OF THE HYDROGEN LINE FROM NOW --- IMPORTANT --- MUST ENSURE... ADVENT OF... ULTIMATE SUBSTRATE... MUST SECURE... ACCELERATION... OF INTELLIGENCE ---
Tweet media one
3
1
10
@surmenok
Pavel Surmenok
10 months
I admire the spirit of this post, but it is more like “lossy compression of public web”, not “all human knowledge”. First, training loss is far from 0. Second, so much knowledge is not in text modality. And much of the text is not public.
@pronounced_kyle
Christian Keil
10 months
All human knowledge can be compressed into ~40 GB. We're not that smart. Yet.
Tweet media one
311
215
3K
0
0
11
@surmenok
Pavel Surmenok
7 months
@burkov Cars are complex but not the peak of human creation. For example, ASML lithography machines are much more complex
1
0
10