![tenderizzation Profile](https://pbs.twimg.com/profile_images/1863827251290349568/-g75vayG_x96.jpg)
tenderizzation
@tenderizzation
Followers
588
Following
20K
Statuses
2K
~~talk to me about your most recent illegal memory access~~ classically trained shitposter; running stock angle no turbo; pytorch CI breaker
South Silly Valley (南湾)
Joined July 2010
@stochasticchasm i think the latent reasoning pill take is that every entry in your KV-cache is pigeonholed into producing something legible at the output though with R1-style CoTs this can go away with enough RL a model can just dual-purpose some tokens to be “output/legible” vs. “CoT”
2
0
4
@_xjdr excited to see whether Grace (code compilation) or Blackwell (code generation/eval) in a GB system is the bottleneck in this setup
0
0
1
@_imdawon bro must be loaded willing to risk getting road salt on a corolla for some maccas 😭😭😭
0
0
1