![Soumith Chintala Profile](https://pbs.twimg.com/profile_images/959995586689691648/DAFep10r_x96.jpg)
Soumith Chintala
@soumithchintala
Followers
224K
Following
3K
Statuses
4K
Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.
New York City
Joined September 2009
@jon_barron @polynoamial @OpenAI i think your comment gave me much clarity on Noam's meta-point which I kinda missed the first time I read his tweet.
1
0
15
you were/are the Chief Scientist of Meta, and a FAIR Lead -- where both Zetta and Llama were located; I think characterizing any team within your direct influence in a bad light in public is not nice. yea the Llama folks were great. praise them. What if Zetta was allowed to run like Llama (left alone, without leadership changing their mind weekly, put under undue constraints), maybe the counterfactual would be different -- lets leave it to speculation. A lot of the Zetta folks went on to do great work.
9
7
449
how legit is the research? more plausible that it's hooking onto something pathological but irrelevant, like say "female faces are on average a different shape than male faces, so the angle of capture (or some other silly, but irrelevant difference) of retinal images is slightly different because of how they fit in the imager's harness.". like the army tank classifier folklore.
12
0
124
RT @Thom_Wolf: Finally took time to go over Dario's essay on DeepSeek and export control and to be honest it was quite painful to read. Andโฆ
0
518
0
RT @allen_ai: Here is Tรผlu 3 405B ๐ซ our open-source post-training model that surpasses the performance of DeepSeek-V3! The last member of tโฆ
0
389
0
@AnushElangovan @__tinygrad__ fwiw, if @realGeorgeHotz was offering to work on my stuff for free, and wanted two boxes; I'd drive them out to him myself. @__tinygrad__ is a beautiful, phenomenal piece of software IMO.
22
43
1K
@heyjchu fwiw that spend plan sounds incredibly sketch, and if they manage to raise that'd be impressively bubbly, and the investor needs to look themselves in the mirror.
0
0
9
RT @andrew_n_carr: want to train your own model with a million token context window? the great torchtitan team has implemented pass-KV riโฆ
0
28
0
@abhijit_MLab India has some of the most well-capitalized startups; so i didn't talk about money. AI base models are being built by well-capitalized companies across the world, not academic research labs. hence i feel money isn't that relevant of a dimension here.
3
0
15