Tim Zaman
@tim_zaman
Followers
25K
Following
3K
Media
107
Statuses
854
AI at Google DeepMind. Previously Tesla, X/Twitter (head of AI Infra), NVIDIA. callsign PD4TA
San Francisco, CA
Joined June 2010
I got a violation posted on my front door in SF for remodeling my own kitchen (myself) in-kind. I didn't know bc Holland, I can do whatever the f* I want in my house, bc it's my house! So the next day: i went to the SF building department. They hadn't processed the fine yet,.
NEW: Just issued an Executive Order that will allow victims of the SoCal fires to not get caught up in bureaucratic red tape and quickly rebuild their homes. We are also extending key price gouging protections to help make rebuilding more affordable.
405
445
7K
At #Tesla Autopilot we are bringing up our 3rd cluster - one of the biggest supercomputers in the world. We're scaling fast and hiring, if interested please DM (ml/py/cpp/gpus/cuda/react/devops/sre)
94
398
2K
The practicalities of building a supercomputer are insane once you get down to it. Let's assume you have a datacenter location and ordered GPUs. You're probably thinking of these sleek orderly photos of racks of compute - just think about how those GPUs got there. I love this pic
@xDaily Tesla had no place to send the Nvidia chips to turn them on, so they would have just sat in a warehouse. The south extension of Giga Texas is almost complete. This will house 50k H100s for FSD training.
79
251
2K
If you are at CVPR and want to work on the world's biggest supercomputers (nvidia gpu/amd gpu/dojo), DM @itsclivetime or drop by Tesla booth!
The team is hosting a Happy Hour on Wednesday, June 21st, but please ask @philduan, @RamonaChandran, @aelluswamy or someone at their booth for tickets, not me :D Last year's Happy Hour is where I first met the team.
17
108
1K
Again, adding some color for the fans.
- @MichaelDell showed off the racks he sold to X in a tweet last week. His racks are pretty lame compared to these.
- @charlesliang (welcome to X!) pictured left is a true silicon valley legend, since 1993 founder-ceo of $SMCI (up 4200% in.
Thanks @elonmusk for leading the liquid cooling technology to large AI data centers! This may lead to preserving 20 billion trees for our planet❤️
38
90
1K
Would like to point out @O42nl was hired after reaching out over the Twitter dms. LinkedIn is dead. Twitter for Recruiting! Also yes: we're always hiring excellent engineers, DMs open! We've got an insane amount of compute coming our way this year.
40
67
986
Tesla is sponsoring the @MLSysConf, come visit our booth for opportunities on the AI team and see our hardware. We have recently upgraded our GPU supercomputer (photo) to 7360 A100 (80GB) GPUs, making it Top-7 by gpu-count. Reach out to build #1:
33
127
828
@PrvnKalavai On prem, all owned by Tesla. Many orgs say "We have", which usually means "We rented"; few actually own, and therefore fully vertically integrate. This bothers me because owning and maintaining is hard. Renting is easy.
33
30
704
Adding some color to what i see in these pics. Note @MichaelDell's photo from earlier this week was a close-up of the same rack:
- $3M-$4M per rack pictured.
- 6U Dell PowerEdge XE9680, Intel cpus (c'mon), aircooled (c'mon...) with NVIDIA HGX baseboard (8x.
24
48
582
The "Tesla Autopilot" team is basically the "Tesla AI" team these days, given cross pollination and synergy between different tracks, like eg the TeslaBot. Esp true for AI Infra. If there is any company that will ship an actual humanoid robot it'd be Tesla.
Our TeslaBot team at Tesla is hiring across many disciplines! If you want to work with me on teaching humanoid robots to do anything, follow this link: Follow this link to see all the types of roles we're looking for (many!):
23
47
550
Looks like @xai's got the NVIDIA GB200 NVL72 up from @MichaelDell. Among the first to turn on, they only started shipping these a month ago.
7
26
543
I feel current Tesla FSD in my car already captures 95% of the value i care about (highway nav), and don't rly care about 100% if I still need to pay attention. The real unlock is taking away the steering wheel - this is where you get my mom and everyone else on board.
19
26
526
@AravSrinivas @ylecun The nuance often kind of gets lost with E's one-liners. I think you need to distinguish between a Conv layer and a CNN. LeNet/AlexNet were obviously CNNs. But what Autopilot does today, you can probably hardly call a CNN, even though as u say - it prob must have conv layers.
17
39
462
@DillonLoomis22 'Taking Delivery' sounds like receiving an Amazon package. For taking delivery, calculate back for 10k gpus.
- gpus in a server.
- servers in a box.
- boxes in a truck.
- amount of trucks.
- amount of loading docks.
- time it takes to unload a truck.
14
21
357
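The back-calculation listed in the 'Taking Delivery' reply above can be sketched in a few lines. Only the 10k GPU figure comes from the post; every other number below (GPUs per server, servers per box, boxes per truck, dock count, unload time) is a made-up placeholder, and `delivery_plan` is a hypothetical helper name.

```python
import math


def delivery_plan(total_gpus, gpus_per_server, servers_per_box,
                  boxes_per_truck, loading_docks, unload_hours_per_truck):
    """Work backwards from a GPU count to trucks and dock time."""
    servers = math.ceil(total_gpus / gpus_per_server)
    boxes = math.ceil(servers / servers_per_box)
    trucks = math.ceil(boxes / boxes_per_truck)
    # Trucks unload in waves, one truck per dock at a time.
    waves = math.ceil(trucks / loading_docks)
    return {
        "servers": servers,
        "boxes": boxes,
        "trucks": trucks,
        "unload_hours": waves * unload_hours_per_truck,
    }


# All inputs except total_gpus are illustrative assumptions.
plan = delivery_plan(
    total_gpus=10_000,
    gpus_per_server=8,
    servers_per_box=1,
    boxes_per_truck=40,
    loading_docks=4,
    unload_hours_per_truck=2.0,
)
print(plan)
```

Changing any single assumption (for example, fewer loading docks) shows quickly which step becomes the bottleneck.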
@sedielem Top hours wasted in my life.
1) gaming.
2) tv.
3) batchnorm bugs.
Probably closely followed by gpu driver updates.
18
10
332
Proud to have engaged these two lovebirds. When we were building the first xAI cluster, it was hard to get vendors to move at our speed and be taken seriously asking "what minute will the truck arrive". I asked for Charles to come over to X to chat and accelerate, and it took off.
Supermicro is here to support xAI's massive 10-fold expansion of the Colossus supercomputer in Memphis with over 1 million GPUs by establishing local operations/production, validation, service and support. With our optimized datacenter building blocks (DCBBS) and ambient.
12
19
330
Tesla AI Infra. Where interns get more compute than their entire university. 🚀
@philduan The infra is amazing: I constantly used more GPUs than I ever used at university and for some projects I used more GPUs than my whole university owns :o Also @tim_zaman and his team manage the infrastructure brilliantly. The code is nice and you can hack anything anytime!
4
17
265
Hype. A joint timeline that once forked (Elon and Demis/DeepMind) now reattracting <3 <3. (Elon was early DM investor before merger w Google).
@elonmusk Thanks Elon! let's do an AI game together.
6
11
264
Welcome to X @__eknight__ 👀 @Tesla_AI crowd should pay close attention. 🍿
So excited that FSD customers were able to catch a glimpse of what we’ve been working on. Sincere thanks to my colleagues for the many long nights and hard work that led up to this point. Want to build end to end with us? Join the team @Tesla_AI!
3
17
245
+1, at Tesla and X my teams were so large I also had to adopt the far superior "async 1:1". Please don't just drop your 1:1s, there is a lot of nuance: With large teams, being a poor people manager is inevitable, but it's ok; Having so many reports and avoiding 1:1s makes.
@StartupArchive_ Word. I had ~30 direct reports and didn't do 1on1s (as a scheduled, regular activity) at Tesla and imo it was great. Two meeting types that are a lot more useful: 1) The 4-8 person meeting where great ideas come from, and 2) The large meeting for broadcast. I went back to try.
4
15
248
@realGeorgeHotz Everybody gangsta until real world at-scale deployment. But it's a cool demo for sure.
3
2
200
@30SecondYou My sister in Holland had an ADU made last year. I fell off my chair, look at these joins! Wtf! I think the reasoning is, the unforgiving climate (compared to CA) and expensive energy means you want your home to be really well built to keep out the elements.
16
3
211
@karpathy A nice pattern i learned at nvidia, esp good for education, is add some ifdef's so you can either run the kernel on cuda or on cpu. This would only mean that you make your kernels block agnostic (only use on the gpu path) and for the cpu path you do an x,y double for loop.
2
7
176
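The pattern in the reply above is described for C/CUDA with preprocessor `ifdef`s. As a rough, language-neutral illustration of the CPU half only, the sketch below keeps the kernel body unaware of blocks and threads (it only sees an `x, y` index) and drives it with a double for loop. The names `saxpy_kernel` and `launch_cpu` are hypothetical, the element-wise operation is an arbitrary example, and the GPU launch path that the `ifdef` would select is not shown.

```python
def saxpy_kernel(x, y, width, a, src, dst):
    """Kernel body for one element; it knows nothing about blocks or threads."""
    i = y * width + x
    dst[i] = a * src[i] + dst[i]


def launch_cpu(kernel, width, height, *args):
    """CPU path: a plain x,y double for loop supplies the indices
    that a GPU launch would otherwise derive from block/thread ids."""
    for y in range(height):
        for x in range(width):
            kernel(x, y, width, *args)


src = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
dst = [10.0] * 6
launch_cpu(saxpy_kernel, 3, 2, 2.0, src, dst)
print(dst)
```

Because the kernel body only takes indices, the same body can be stepped through in an ordinary debugger on the CPU, which is presumably why the pattern suits teaching.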
Starlink at home depot. This is a commodity now.
BREAKING: @SpaceX's Starlink standard terminal kit is now available at Home Depot in the US for $600. Delivery is available immediately. Link:
3
4
142
@WholeMarsBlog Also - teleop is a hard requirement for a lot of imitation learning. The quality of the 'imitation' is constrained by the quality and functionality of the teleop.
4
7
135
🚀 and happy 1st Googleversary to meeee.
What a way to celebrate one year of incredible Gemini progress -- #1🥇across the board on overall ranking, as well as on hard prompts, coding, math, instruction following, and more, including with style control on. Thanks to the hard work of everyone in the Gemini team and
8
1
114
Some might remember the 2015 "DIGITS DevBox" (I was hired to work on & why I moved to bay area). Basically, you bought a solid 4-GPU devbox (or made ur own), installed DIGITS, which shipped with multiple dl frameworks and had a builtin ai platform (scheduler, GUI, ...) 🧵
Announcing NVIDIA Project DIGITS, a personal AI supercomputer that’s powered by the NVIDIA GB10 Superchip and based on #NVIDIAGraceBlackwell architecture. Preconfigured with the NVIDIA AI software stack, developers, researchers, data scientists and
5
13
112
@willdepue Not disjoined, just specialized. Mostly constrained by its realtime nature and in-car inference hw cost, flops and power costs. Rough history:
1. End-to-end attempts for decades - all failed.
2. ConvNet for eg laneline perception, controls w heuristics.
3. Add more tasks.
4
7
108
@itsclivetime "No mom, these are not cup coasters." True story - my mom once threw away a silicon wafer in the kitchen trash bin, bc she thought it was the bottom carton of a pie she bought that morning.
1
1
106
@alexandr_wang Why aren't we just feeding kids with candy only? The answer to this question and yours is the same. It's their addictive nature, and concerning to witness. And any healthy parent is totally fine with their kids being a "Normie". I am.
1
0
99
Updated link: here @karpathy also explains why anyone wanting to succeed in FSD would need large scale compute and storage.
4
10
90
@spacesudoer @elonmusk Ok let's limit amazon deliveries too. Incentivize local shopping, without all that transport and forests worth of packaging material. Seriously.
1
1
94
@karpathy @sinclanich Clip to int8 will speed things up drastically. If you do a top-k approach, you can recalc the top-k in fp32. Np can't do cosine distance fast though, but you can slap some intrinsics together in 20 loc. Will be bound by mem bw ~200GB/s. Ur array is only 17MB, so 85us latency.
4
3
87
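The latency estimate in the reply above is a bandwidth-bound calculation: bytes scanned divided by memory bandwidth. The sketch below reproduces that arithmetic with the post's figures (17MB, 200GB/s) and adds a small pure-Python illustration of the "int8 shortlist, then recompute the top-k at full precision" idea. The function names, the scale of 127 and the sample vectors are assumptions for illustration, and Python floats stand in for fp32 here.

```python
import math


def scan_latency_us(array_bytes, mem_bw_bytes_per_s):
    """Time to stream the whole array once, in microseconds."""
    return array_bytes / mem_bw_bytes_per_s * 1e6


latency = scan_latency_us(17e6, 200e9)  # roughly 85 microseconds


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def normalize(v):
    n = math.sqrt(dot(v, v))
    return [x / n for x in v]


def to_int8(v, scale=127.0):
    """Quantize unit-normalized components to the int8 range by clipping."""
    return [max(-127, min(127, round(x * scale))) for x in v]


def top_k(query, rows, k, shortlist):
    q = normalize(query)
    rows_n = [normalize(r) for r in rows]
    q8 = to_int8(q)
    rows8 = [to_int8(r) for r in rows_n]
    # Coarse pass on int8 values picks a shortlist of candidates.
    coarse = sorted(range(len(rows)),
                    key=lambda i: dot(q8, rows8[i]), reverse=True)[:shortlist]
    # Exact pass recomputes cosine similarity on the shortlist only.
    return sorted(coarse, key=lambda i: dot(q, rows_n[i]), reverse=True)[:k]


rows = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [-1.0, 0.0]]
best = top_k([1.0, 0.05], rows, k=2, shortlist=3)
print(latency, best)
```

The shortlist should be somewhat larger than k so that quantization error in the coarse pass does not drop a true top-k candidate before the exact pass.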
@WholeMarsBlog I recall @ARKInvest put a pricepoint at $1000 while the stock was $300. At the office we thought this was hilariously unrealistic. Estimating future value is so hard. People are bad at exponentials. Same when I was at NVIDIA when it hit $1000 pre splits, people got tattoos.
2
3
80
RIP TOP500. No one submits anymore, eg this 4.6k gpu (small) cluster is #9. Kind of sad bc it is such an awesome hw/sw systems challenge.
Ranked #9 in the TOP500 list of fastest supercomputers, Eos is the culmination of our ongoing commitment to pushing the boundaries of #AI technology and infrastructure. Learn more: #DataCenter #NVIDIADGX
6
3
76