Soumith Chintala Profile
Soumith Chintala

@soumithchintala

Followers
224K
Following
3K
Statuses
4K

Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.

New York City
Joined September 2009
@soumithchintala
Soumith Chintala
5 days
@jon_barron @polynoamial @OpenAI i think your comment gave me much clarity on Noam's meta-point which I kinda missed the first time I read his tweet.
1
0
15
@soumithchintala
Soumith Chintala
5 days
you were/are the Chief Scientist of Meta, and a FAIR Lead -- where both Zetta and Llama were located; I think characterizing any team within your direct influence in a bad light in public is not nice. yea the Llama folks were great. praise them. What if Zetta was allowed to run like Llama (left alone, without leadership changing their mind weekly, put under undue constraints), maybe the counterfactual would be different -- let's leave it to speculation. A lot of the Zetta folks went on to do great work.
9
7
449
@soumithchintala
Soumith Chintala
7 days
how legit is the research? more plausible that it's hooking onto something pathological but irrelevant, like say "female faces are on average a different shape than male faces, so the angle of capture (or some other silly, but irrelevant difference) of retinal images is slightly different because of how they fit in the imager's harness.". like the army tank classifier folklore.
12
0
124
@soumithchintala
Soumith Chintala
9 days
RT @Thom_Wolf: Finally took time to go over Dario's essay on DeepSeek and export control and to be honest it was quite painful to read. And…
0
518
0
@soumithchintala
Soumith Chintala
9 days
RT @allen_ai: Here is Tülu 3 405B 🐫 our open-source post-training model that surpasses the performance of DeepSeek-V3! The last member of t…
0
389
0
@soumithchintala
Soumith Chintala
13 days
@jxmnop unclear tbh, data mix is not specified. probably is the big reason for performance
5
3
73
@soumithchintala
Soumith Chintala
14 days
@soumithchintala
Soumith Chintala
15 days
if you find david beating goliath to be a bizarre surprising story, read history? if you think state of the art in AI cannot possibly be achieved without massive resourcing, go read the AlexNet paper or read about Alec Radford's accomplishments?
1
0
71
@soumithchintala
Soumith Chintala
15 days
if you find david beating goliath to be a bizarre surprising story, read history? if you think state of the art in AI cannot possibly be achieved without massive resourcing, go read the AlexNet paper or read about Alec Radford's accomplishments?
12
16
394
@soumithchintala
Soumith Chintala
19 days
deepseek is rewriting history rn!!
@deepseek_ai
DeepSeek
20 days
🚀 DeepSeek-R1 is here! ⚡ Performance on par with OpenAI-o1 📖 Fully open-source model & technical report 🏆 MIT licensed: Distill & commercialize freely! 🌐 Website & API are live now! Try DeepThink at today! 🐋 1/n
9
29
358
@soumithchintala
Soumith Chintala
22 days
@_arohan_ pinged you on workchat!
0
0
5
@soumithchintala
Soumith Chintala
23 days
@AnushElangovan @__tinygrad__ fwiw, if @realGeorgeHotz was offering to work on my stuff for free, and wanted two boxes; I'd drive them out to him myself. @__tinygrad__ is a beautiful, phenomenal piece of software IMO.
22
43
1K
@soumithchintala
Soumith Chintala
29 days
@heyjchu fwiw that spend plan sounds incredibly sketch, and if they manage to raise that'd be impressively bubbly, and the investor needs to look themselves in the mirror.
0
0
9
@soumithchintala
Soumith Chintala
1 month
RT @andrew_n_carr: want to train your own model with a million token context window? the great torchtitan team has implemented pass-KV ri…
0
28
0
@soumithchintala
Soumith Chintala
1 month
@abhijit_MLab India has some of the most well-capitalized startups; so i didn't talk about money. AI base models are being built by well-capitalized companies across the world, not academic research labs. hence i feel money isn't that relevant of a dimension here.
3
0
15