samjtro Profile Banner
samuel joseph troyer Profile
samuel joseph troyer

@samjtro

Followers
93
Following
2K
Statuses
609

building ...

Joined April 2022
Don't wanna be here? Send us removal request.
@samjtro
samuel joseph troyer
2 days
RT @jonasgeiping: What is pretty exciting is that simply by training with our arch and objective, a separation emerges from scale - the mod…
0
7
0
@samjtro
samuel joseph troyer
5 days
@julrach nice work!
0
0
1
@samjtro
samuel joseph troyer
7 days
RT @tengyuma: RL + CoT works great for DeepSeek-R1 & o1, but:  1️⃣ Linear-in-log scaling in train & test-time compute 2️⃣ Likely bounded b…
0
96
0
@samjtro
samuel joseph troyer
7 days
RT @natolambert: the TRL implementation of GRPO is technically correct if the number of gradient steps per batch is 1 because clipping neve…
0
35
0
@samjtro
samuel joseph troyer
10 days
RT @arankomatsuzaki: Stanford presents: s1: Simple test-time scaling - Seeks the simplest approach to achieve test-time scaling and stro…
0
169
0
@samjtro
samuel joseph troyer
10 days
Tweet media one
0
1
2
@samjtro
samuel joseph troyer
11 days
RT @caydengineer: AugmentOS 1.0 has dropped. AugmentOS is the open source OS and super app for smart glasses. It enables apps and AI agent…
0
204
0
@samjtro
samuel joseph troyer
11 days
RT @natolambert: Way too many people think that because reasoning models have taken off, and reinforcement learning with verifiable rewards…
0
19
0
@samjtro
samuel joseph troyer
14 days
RT @wordgrammer: Okay. Thanks for the nerd snipe guys. I spent the day learning exactly how DeepSeek trained at 1/30 the price, instead of…
0
3K
0
@samjtro
samuel joseph troyer
15 days
RT @SambaNovaAI: ⚡️ We've partnered with @HuggingFace to bring lightning fast inference speeds. 🤗 10x faster on @AIatMeta's Llama 3 & @Ali
Tweet media one
0
26
0
@samjtro
samuel joseph troyer
15 days
RT @matthuang: Great to see Geth accelerating… Paradigm will offer a $20K bounty for successfully referring someone to this role
0
35
0
@samjtro
samuel joseph troyer
16 days
RT @UnslothAI: Introducing 1.58bit DeepSeek-R1 GGUFs! 🐋 DeepSeek-R1 can now run in 1.58-bit, while being fully functional. We shrank the 6…
0
624
0
@samjtro
samuel joseph troyer
16 days
👀👀 "... potential to do mass scale RL with composable verifiers, environments, and collaboration from small GPUs able to do rollouts and large nodes to do the training."
@Teknium1
Teknium (e/λ)
16 days
Today Nous announced the coming of Psyche - a distributed network and training framework, an infrastructure layer for training Open AI Models over the internet built around our distro breakthrough. The new reasoning paradigm that we've entered with a lot of help from @deepseek_ai opens up the door to a LOT of new opportunities, and what I am most excited about with Psyche is the potential to do mass scale RL with composable verifiers, environments, and collaboration from small GPUs able to do rollouts and large nodes to do the training. I envision a super-modular system for all kinds of verifiers that OS Contributions can implement and build into the system to give easy support to new task targets for reasoning. I think psyche can enable a massively distributed Reinforcement Learning paradigm that can push reasoning in Open Source beyond the limit. Let's make open source the defacto standard for the world together.
0
0
1
@samjtro
samuel joseph troyer
16 days
@Teknium1 this has huge potential ... congrats on the launch, need to tinker w this.
0
0
2
@samjtro
samuel joseph troyer
16 days
@daniel_mac8 nvmd haha
0
0
0
@samjtro
samuel joseph troyer
16 days
the ai wars have begun ...
0
0
0
@samjtro
samuel joseph troyer
16 days
RT @KobeissiLetter: BREAKING: DeepSeek officially announces another open-source AI model, Janus-Pro-7B. This model generates images and be…
0
2K
0
@samjtro
samuel joseph troyer
16 days
RT @ClementDelangue: Too many bad takes about Deepseek to counter them all haha. Staying heads-down in building mode while hoping that perc…
0
39
0
@samjtro
samuel joseph troyer
16 days
@eatonphil wait is he arguing that updating the stdlib is bad, or that naming it '*/v*' is bad?
1
0
0