Moin Nadeem Profile
Moin Nadeem

@moinnadeem

Followers
2K
Following
5K
Statuses
17K

Co-Founder at @Phonic_Co. Previously @Stanford CS PhD Dropout, @MosaicML, CS @MIT. I tend to be wrong, but the learning process makes it enjoyable. 🇵🇰🇺🇲

San Fransisco Bay Area
Joined October 2009
Don't wanna be here? Send us removal request.
@moinnadeem
Moin Nadeem
5 years
As pretrained language models grow more common in #NLProc, it is crucial to evaluate their societal biases. We launch a new task, evaluation metrics, and a large dataset to measure stereotypical biases in LMs: Paper: Site: Thread👇
Tweet media one
Tweet media two
2
20
71
@moinnadeem
Moin Nadeem
1 day
@deedydas How much was dilution between OAI founding and now?
1
0
43
@moinnadeem
Moin Nadeem
9 days
Yes. And this is why @vkhosla’s approach atowsrds OSS AI always seemed misguided to me.
@BlancheMinerva
Stella Biderman
9 days
The American response to DeepSeek falsifies the claims that America shouldn't be an open source leader because China will just take it and build on it. If that was true, the response would be "lol thanks for giving us free results, idiot."
0
0
4
@moinnadeem
Moin Nadeem
11 days
@MrRazzi17 Strong agree
0
0
1
@moinnadeem
Moin Nadeem
11 days
"Give me your tired, your poor, Your huddled masses yearning to breathe free, The wretched refuse of your teeming shore. Send these, the homeless, tempest-tost to me, I lift my lamp beside the golden door!"
@AutismCapital
Autism Capital 🧩
11 days
🚨BREAKING: Trump says he will be signing an Executive Order to send the worst of the United States illegals to Guantanamo Bay in a 30,000 bed facility.
0
0
1
@moinnadeem
Moin Nadeem
11 days
RT @atroyn: 'we're in this bizarre world where the best way to learn about llms... is to read papers by chinese companies. i do not think t…
0
28
0
@moinnadeem
Moin Nadeem
11 days
RT @Dan_Jeffries1: As we are learning DeepSeek is one of the most sophisticated psyops of all time. Here's how it went down: 1) Release…
0
1K
0
@moinnadeem
Moin Nadeem
14 days
@deedydas wait hold on how are you able to extrapolate a curve of what R1's would look like?
1
0
4
@moinnadeem
Moin Nadeem
14 days
@TaliaGold if you believe the rumors that 3.5 Sonnet is 200-400B dense model, then having a Mixture of Experts where you only have 37B active parameters (out of 671B total parameters) goes a long way
0
0
4
@moinnadeem
Moin Nadeem
14 days
I gave some 5 recent code-related Claude queries to DeepSeek and it outperformed Claude on 5/5. Damn.
0
0
2
@moinnadeem
Moin Nadeem
14 days
The wonderful thing with OSS LLMs is you just need one of them to be good enough to screw over all of the incumbents.
0
0
2
@moinnadeem
Moin Nadeem
17 days
RT @andrew_n_carr: I completely believe DeepSeek is making such good progress because the whole team is so close to hardware. Many many o…
0
37
0
@moinnadeem
Moin Nadeem
18 days
RT @Suhail: App layer is where most of the value is. Nothing has changed. Humans like a well designed focused UX that deeply solves a probl…
0
197
0
@moinnadeem
Moin Nadeem
25 days
@finbarrtimbers Happy to help
0
0
0
@moinnadeem
Moin Nadeem
27 days
@rajko_rad rajko i didn't need to be called out like this
0
0
2
@moinnadeem
Moin Nadeem
30 days
@tanayj can you give an example of a query where you prefer pplx over google?
1
0
0
@moinnadeem
Moin Nadeem
30 days
@finbarrtimbers Phew, I thought it was just me!
0
0
0
@moinnadeem
Moin Nadeem
30 days
Les Mis is and will continue to be one of my favorite movies.
0
0
1
@moinnadeem
Moin Nadeem
30 days
@charles_irl 🎶do you hear the people sing? hacking their scripts by sheer disdain? it is the struggle of the nerds who rage against this brittle chain. 🎶 free us charles! lead the revolution
0
0
1