Moin Nadeem @moinnadeem profile

Moin Nadeem

@moinnadeem

Followers

2K

Following

5K

Statuses

17K

Co-Founder at @Phonic_Co. Previously @Stanford CS PhD Dropout, @MosaicML, CS @MIT. I tend to be wrong, but the learning process makes it enjoyable. 🇵🇰🇺🇲

San Fransisco Bay Area

Joined October 2009

Don't wanna be here? Send us removal request.

Moin Nadeem

@moinnadeem

5 years

As pretrained language models grow more common in #NLProc, it is crucial to evaluate their societal biases. We launch a new task, evaluation metrics, and a large dataset to measure stereotypical biases in LMs: Paper: Site: Thread👇

2

20

71

Moin Nadeem

@moinnadeem

1 day

@deedydas How much was dilution between OAI founding and now?

1

0

43

Moin Nadeem

@moinnadeem

9 days

Yes. And this is why @vkhosla’s approach atowsrds OSS AI always seemed misguided to me.

Stella Biderman

@BlancheMinerva

9 days

The American response to DeepSeek falsifies the claims that America shouldn't be an open source leader because China will just take it and build on it. If that was true, the response would be "lol thanks for giving us free results, idiot."

0

4

Moin Nadeem

@moinnadeem

11 days

@MrRazzi17 Strong agree

0

1

Moin Nadeem

@moinnadeem

11 days

"Give me your tired, your poor, Your huddled masses yearning to breathe free, The wretched refuse of your teeming shore. Send these, the homeless, tempest-tost to me, I lift my lamp beside the golden door!"

Autism Capital 🧩

@AutismCapital

11 days

🚨BREAKING: Trump says he will be signing an Executive Order to send the worst of the United States illegals to Guantanamo Bay in a 30,000 bed facility.

0

1

Moin Nadeem

@moinnadeem

11 days

RT @atroyn: 'we're in this bizarre world where the best way to learn about llms... is to read papers by chinese companies. i do not think t…

0

28

0

Moin Nadeem

@moinnadeem

11 days

RT @Dan_Jeffries1: As we are learning DeepSeek is one of the most sophisticated psyops of all time. Here's how it went down: 1) Release…

0

1K

0

Moin Nadeem

@moinnadeem

14 days

@deedydas wait hold on how are you able to extrapolate a curve of what R1's would look like?

1

0

4

Moin Nadeem

@moinnadeem

14 days

@TaliaGold if you believe the rumors that 3.5 Sonnet is 200-400B dense model, then having a Mixture of Experts where you only have 37B active parameters (out of 671B total parameters) goes a long way

0

4

Moin Nadeem

@moinnadeem

14 days

I gave some 5 recent code-related Claude queries to DeepSeek and it outperformed Claude on 5/5. Damn.

0

2

Moin Nadeem

@moinnadeem

14 days

The wonderful thing with OSS LLMs is you just need one of them to be good enough to screw over all of the incumbents.

0

2

Moin Nadeem

@moinnadeem

17 days

RT @andrew_n_carr: I completely believe DeepSeek is making such good progress because the whole team is so close to hardware. Many many o…

0

37

0

Moin Nadeem

@moinnadeem

18 days

RT @Suhail: App layer is where most of the value is. Nothing has changed. Humans like a well designed focused UX that deeply solves a probl…

0

197

0

Moin Nadeem

@moinnadeem

25 days

@finbarrtimbers Happy to help

0

Moin Nadeem

@moinnadeem

27 days

@rajko_rad rajko i didn't need to be called out like this

0

2

Moin Nadeem

@moinnadeem

30 days

@tanayj can you give an example of a query where you prefer pplx over google?

1

0

Moin Nadeem

@moinnadeem

30 days

@finbarrtimbers Phew, I thought it was just me!

0

Moin Nadeem

@moinnadeem

30 days

Les Mis is and will continue to be one of my favorite movies.

0

1

Moin Nadeem

@moinnadeem

30 days

@charles_irl 🎶do you hear the people sing? hacking their scripts by sheer disdain? it is the struggle of the nerds who rage against this brittle chain. 🎶 free us charles! lead the revolution

0

1