samuel joseph troyer @samjtro profile

samuel joseph troyer

@samjtro

Followers

93

Following

2K

Statuses

609

building ...

Joined April 2022

Don't wanna be here? Send us removal request.

samuel joseph troyer

@samjtro

2 days

RT @jonasgeiping: What is pretty exciting is that simply by training with our arch and objective, a separation emerges from scale - the mod…

0

7

0

samuel joseph troyer

@samjtro

5 days

@julrach nice work!

0

1

samuel joseph troyer

@samjtro

7 days

RT @tengyuma: RL + CoT works great for DeepSeek-R1 & o1, but: 1️⃣ Linear-in-log scaling in train & test-time compute 2️⃣ Likely bounded b…

0

96

0

samuel joseph troyer

@samjtro

7 days

RT @natolambert: the TRL implementation of GRPO is technically correct if the number of gradient steps per batch is 1 because clipping neve…

0

35

0

samuel joseph troyer

@samjtro

10 days

RT @arankomatsuzaki: Stanford presents: s1: Simple test-time scaling - Seeks the simplest approach to achieve test-time scaling and stro…

0

169

0

samuel joseph troyer

@samjtro

10 days

.@_linktai

0

1

2

samuel joseph troyer

@samjtro

11 days

RT @caydengineer: AugmentOS 1.0 has dropped. AugmentOS is the open source OS and super app for smart glasses. It enables apps and AI agent…

0

204

0

samuel joseph troyer

@samjtro

11 days

RT @natolambert: Way too many people think that because reasoning models have taken off, and reinforcement learning with verifiable rewards…

0

19

0

samuel joseph troyer

@samjtro

14 days

RT @wordgrammer: Okay. Thanks for the nerd snipe guys. I spent the day learning exactly how DeepSeek trained at 1/30 the price, instead of…

0

3K

0

samuel joseph troyer

@samjtro

15 days

RT @SambaNovaAI: ⚡️ We've partnered with @HuggingFace to bring lightning fast inference speeds. 🤗 10x faster on @AIatMeta's Llama 3 & @Ali…

0

26

0

samuel joseph troyer

@samjtro

15 days

RT @matthuang: Great to see Geth accelerating… Paradigm will offer a $20K bounty for successfully referring someone to this role

0

35

0

samuel joseph troyer

@samjtro

16 days

RT @UnslothAI: Introducing 1.58bit DeepSeek-R1 GGUFs! 🐋 DeepSeek-R1 can now run in 1.58-bit, while being fully functional. We shrank the 6…

0

624

0

samuel joseph troyer

@samjtro

16 days

👀👀 "... potential to do mass scale RL with composable verifiers, environments, and collaboration from small GPUs able to do rollouts and large nodes to do the training."

Teknium (e/λ)

@Teknium1

16 days

Today Nous announced the coming of Psyche - a distributed network and training framework, an infrastructure layer for training Open AI Models over the internet built around our distro breakthrough. The new reasoning paradigm that we've entered with a lot of help from @deepseek_ai opens up the door to a LOT of new opportunities, and what I am most excited about with Psyche is the potential to do mass scale RL with composable verifiers, environments, and collaboration from small GPUs able to do rollouts and large nodes to do the training. I envision a super-modular system for all kinds of verifiers that OS Contributions can implement and build into the system to give easy support to new task targets for reasoning. I think psyche can enable a massively distributed Reinforcement Learning paradigm that can push reasoning in Open Source beyond the limit. Let's make open source the defacto standard for the world together.

0

1

samuel joseph troyer

@samjtro

16 days

@Teknium1 this has huge potential ... congrats on the launch, need to tinker w this.

0

2

samuel joseph troyer

@samjtro

16 days

@daniel_mac8 nvmd haha

0

samuel joseph troyer

@samjtro

16 days

the ai wars have begun ...

0

samuel joseph troyer

@samjtro

16 days

RT @KobeissiLetter: BREAKING: DeepSeek officially announces another open-source AI model, Janus-Pro-7B. This model generates images and be…

0

2K

0

samuel joseph troyer

@samjtro

16 days

RT @ClementDelangue: Too many bad takes about Deepseek to counter them all haha. Staying heads-down in building mode while hoping that perc…

0

39

0

samuel joseph troyer

@samjtro

16 days

@eatonphil wait is he arguing that updating the stdlib is bad, or that naming it '*/v*' is bad?

1

0