shxf0072 Profile Banner
Joey (e/λ) Profile
Joey (e/λ)

@shxf0072

Followers
4K
Following
10K
Statuses
3K

I speak fluent Python and Sarcasm.

(anti-de sitter space)
Joined February 2019
Don't wanna be here? Send us removal request.
@shxf0072
Joey (e/λ)
1 year
@karpathy llama2.c running on galaxy watch 4
39
178
2K
@shxf0072
Joey (e/λ)
7 hours
@abyssalblue_ thats sad on so many levels (same🫂:)
0
0
1
@shxf0072
Joey (e/λ)
20 hours
@vaishnahvi_ with water
1
0
0
@shxf0072
Joey (e/λ)
1 day
@tokenbender yoo one of my fav game,
0
0
1
@shxf0072
Joey (e/λ)
1 day
@qtnx_ is your wife single?
0
0
1
@shxf0072
Joey (e/λ)
2 days
@sachdh 4,8,17,36,
Tweet media one
1
0
1
@shxf0072
Joey (e/λ)
2 days
@kalomaze @giffmana amazon pouched google engineers and created traninium, amazon soc is kind of a fork of tpu with more bandwidth. also tesla has a dojo which is asic, they use a torch to do autopilot training, pretty big scale
0
0
5
@shxf0072
Joey (e/λ)
4 days
@polynoamial @MillionInt <thinking> reward hacking opportunity @MillionInt i'll complete zelda for you we can share a reward </thinking>
0
0
6
@shxf0072
Joey (e/λ)
4 days
0
0
0
@shxf0072
Joey (e/λ)
4 days
@danielhanchen @UnslothAI sry, i wanted to help you push it faster, next time I'll use unsloth gen (more hackable:)
Tweet media one
1
0
4
@shxf0072
Joey (e/λ)
4 days
RT @danielhanchen: We managed to fit Llama 3.1 8B < 15GB with GRPO! Experience the R1 "aha moment" for free on Colab! Phi-4 14B also works…
0
288
0
@shxf0072
Joey (e/λ)
4 days
agi at home fr
@UnslothAI
Unsloth AI
4 days
You can now reproduce DeepSeek-R1's reasoning on your own local device! Experience the "Aha" moment with just 7GB VRAM. Unsloth reduces GRPO training memory use by 80%. 15GB VRAM can transform Llama-3.1 (8B) & Phi-4 (14B) into reasoning models. Blog:
Tweet media one
2
4
61
@shxf0072
Joey (e/λ)
4 days
underrated crack anon, must follow
@j4orz
j4orz
4 days
stepping into tpot arena thanks to the push by @shxf0072: what are people's favorite *high quality* deep learning compiler courses? there's not many. i'm hacking on singularity systems: zero to hero to address this gap in the ecosystem.
Tweet media one
Tweet media two
1
0
11
@shxf0072
Joey (e/λ)
4 days
@giffmana @cccntu @cHHillee aye aye captain 🫡
0
0
2
@shxf0072
Joey (e/λ)
4 days
@giffmana @cccntu @cHHillee my brain compute in fp8 some precision errors are expected (dyslexia)
1
0
10
@shxf0072
Joey (e/λ)
4 days
Tweet media one
1
0
14
@shxf0072
Joey (e/λ)
4 days
1
1
13
@shxf0072
Joey (e/λ)
4 days
@jackminong idk nothing stoping us, bcs we can generate multiple rollout from given point in batch, if you want to do that with something like mario or atari it will be time consuming, with llm you just do batched gen
0
0
3