![samuel joseph troyer Profile](https://pbs.twimg.com/profile_images/1800169060447002624/jhKOxYO6_x96.jpg)
samuel joseph troyer
@samjtro
Followers
93
Following
2K
Statuses
609
RT @jonasgeiping: What is pretty exciting is that simply by training with our arch and objective, a separation emerges from scale - the mod…
0
7
0
RT @natolambert: the TRL implementation of GRPO is technically correct if the number of gradient steps per batch is 1 because clipping neve…
0
35
0
RT @arankomatsuzaki: Stanford presents: s1: Simple test-time scaling - Seeks the simplest approach to achieve test-time scaling and stro…
0
169
0
RT @caydengineer: AugmentOS 1.0 has dropped. AugmentOS is the open source OS and super app for smart glasses. It enables apps and AI agent…
0
204
0
RT @natolambert: Way too many people think that because reasoning models have taken off, and reinforcement learning with verifiable rewards…
0
19
0
RT @wordgrammer: Okay. Thanks for the nerd snipe guys. I spent the day learning exactly how DeepSeek trained at 1/30 the price, instead of…
0
3K
0
RT @SambaNovaAI: ⚡️ We've partnered with @HuggingFace to bring lightning fast inference speeds. 🤗 10x faster on @AIatMeta's Llama 3 & @Ali…
0
26
0
RT @matthuang: Great to see Geth accelerating… Paradigm will offer a $20K bounty for successfully referring someone to this role
0
35
0
RT @UnslothAI: Introducing 1.58bit DeepSeek-R1 GGUFs! 🐋 DeepSeek-R1 can now run in 1.58-bit, while being fully functional. We shrank the 6…
0
624
0
👀👀 "... potential to do mass scale RL with composable verifiers, environments, and collaboration from small GPUs able to do rollouts and large nodes to do the training."
Today Nous announced the coming of Psyche - a distributed network and training framework, an infrastructure layer for training Open AI Models over the internet built around our distro breakthrough. The new reasoning paradigm that we've entered with a lot of help from @deepseek_ai opens up the door to a LOT of new opportunities, and what I am most excited about with Psyche is the potential to do mass scale RL with composable verifiers, environments, and collaboration from small GPUs able to do rollouts and large nodes to do the training. I envision a super-modular system for all kinds of verifiers that OS Contributions can implement and build into the system to give easy support to new task targets for reasoning. I think psyche can enable a massively distributed Reinforcement Learning paradigm that can push reasoning in Open Source beyond the limit. Let's make open source the defacto standard for the world together.
0
0
1
RT @KobeissiLetter: BREAKING: DeepSeek officially announces another open-source AI model, Janus-Pro-7B. This model generates images and be…
0
2K
0
RT @ClementDelangue: Too many bad takes about Deepseek to counter them all haha. Staying heads-down in building mode while hoping that perc…
0
39
0
@eatonphil wait is he arguing that updating the stdlib is bad, or that naming it '*/v*' is bad?
1
0
0