jakeyyy

@irohsharpeniroh

Followers
318
Following
266K
Statuses
4K

regurgitating my essence

a dying sun
Joined July 2017
@irohsharpeniroh
jakeyyy
8 hours
@tokenbender it's good, Noam in the wild was crazy
0
0
4
@irohsharpeniroh
jakeyyy
8 hours
@stochasticchasm the KV cache will be built differently if you are passing latent reps. so yes given a big context the context outweighs a single token choice, but it applies to each one
1
0
3
@irohsharpeniroh
jakeyyy
8 hours
@kitten_beloved @JaMikeyMike because the bottleneck shifts to product and biz, not engineering throughput
0
0
4
@irohsharpeniroh
jakeyyy
8 hours
@nullpointered I'm cramming this all into a single day
0
0
0
@irohsharpeniroh
jakeyyy
10 hours
@CFGeek (3) accelerate research (4) improve sample efficiency (5) repeat ... (?) automation
0
0
2
@irohsharpeniroh
jakeyyy
10 hours
@flybottlemist @gptbrooke the twists and whorls of fate, or perhaps the large intestine
0
0
3
@irohsharpeniroh
jakeyyy
12 hours
@flybottlemist @gptbrooke my brother once walked in on me at 5 holding a turd up in a wad of toilet paper, studying it closely
1
0
4
@irohsharpeniroh
jakeyyy
13 hours
I contain a singletude
1
0
11
@irohsharpeniroh
jakeyyy
15 hours
@dwarkesh_sp @JeffDean @NoamShazeer 🙏 thank u great discussion
0
0
1
@irohsharpeniroh
jakeyyy
16 hours
@cookiecarver nahh haha I'm American
1
0
1
@irohsharpeniroh
jakeyyy
17 hours
@samswoora *guy who is not that smart and makes more than 80k*: wow that is so me
0
0
25
@irohsharpeniroh
jakeyyy
18 hours
@coldhealing and in the daylight I get an electric feel
0
0
20
@irohsharpeniroh
jakeyyy
19 hours
@RylanSchaeffer I recently discovered an improved algorithm for LU decomposition when I realized all I need is U
0
0
2
@irohsharpeniroh
jakeyyy
20 hours
@nullpointered this man has riches beyond measure
0
0
2
@irohsharpeniroh
jakeyyy
21 hours
@michaelyliu6 @dwarkesh_sp @_sholtodouglas @TrentonBricken which is not wrong, but there's a clear distinction here between what happens in a normal transformer forward pass and what latent recurrence is/does
0
0
1
@irohsharpeniroh
jakeyyy
21 hours
@sitka__ @yyallian ✍️✍️✍️
0
0
2
@irohsharpeniroh
jakeyyy
22 hours
@dwarkesh_sp @_sholtodouglas @TrentonBricken so the model is still only "giving" itself token reps as inputs to work with for the next token, it just gets to see its previous work at every step of the way via the cache
0
0
4
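
A rough sketch of the caching point in the reply above, assuming a single attention head, one layer, and made-up toy weights (`EMBED`, `W_Q`, `W_K`, `W_V` and both function names are hypothetical, not from any real model). At each step the only new input is the current token's embedding. Earlier positions are visible only through the keys and values already stored in the cache, and the result matches recomputing everything from scratch.

```python
import math

# Toy, made-up parameters (assumptions for illustration only).
EMBED = {0: [1.0, 0.0], 1: [0.0, 1.0], 2: [0.5, 0.5]}
W_Q = [[0.9, 0.1], [0.2, 0.8]]
W_K = [[0.7, 0.3], [0.4, 0.6]]
W_V = [[1.0, 0.5], [0.5, 1.0]]

def project(vec, matrix):
    """Multiply a row vector by a matrix given as a list of rows."""
    return [sum(vec[i] * matrix[i][j] for i in range(len(vec)))
            for j in range(len(matrix[0]))]

def attend(query, keys, values):
    """Scaled dot-product attention of one query over all keys/values."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    top = max(scores)  # subtract the max for a numerically stable softmax
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    return [sum(w * v[d] for w, v in zip(weights, values)) / total
            for d in range(len(values[0]))]

def decode_with_cache(tokens):
    """Feed one token embedding per step; reuse cached keys and values."""
    cache_k, cache_v, outputs = [], [], []
    for tok in tokens:
        x = EMBED[tok]                   # the only new input is a token rep
        cache_k.append(project(x, W_K))  # previous work stays in the cache
        cache_v.append(project(x, W_V))
        outputs.append(attend(project(x, W_Q), cache_k, cache_v))
    return outputs

def decode_without_cache(tokens):
    """Recompute keys and values for the whole prefix at every step."""
    outputs = []
    for t in range(len(tokens)):
        prefix = [EMBED[tok] for tok in tokens[:t + 1]]
        keys = [project(x, W_K) for x in prefix]
        values = [project(x, W_V) for x in prefix]
        outputs.append(attend(project(prefix[-1], W_Q), keys, values))
    return outputs

if __name__ == "__main__":
    tokens = [0, 1, 2, 1]
    print(decode_with_cache(tokens) == decode_without_cache(tokens))
```

Latent recurrence, as contrasted in the thread, would instead feed a hidden state back in as the next input rather than a token embedding; this sketch does not implement that.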