Joey (e/λ) @shxf0072 profile

Joey (e/λ)

@shxf0072

Followers

4K

Following

10K

Statuses

3K

I speak fluent Python and Sarcasm.

(anti-de sitter space)

Joined February 2019

Don't wanna be here? Send us removal request.

Joey (e/λ)

@shxf0072

1 year

@karpathy llama2.c running on galaxy watch 4

39

178

2K

Joey (e/λ)

@shxf0072

7 hours

@abyssalblue_ thats sad on so many levels (same🫂:)

0

1

Joey (e/λ)

@shxf0072

20 hours

@vaishnahvi_ with water

1

0

Joey (e/λ)

@shxf0072

1 day

@tokenbender yoo one of my fav game,

0

1

Joey (e/λ)

@shxf0072

1 day

@qtnx_ is your wife single?

0

1

Joey (e/λ)

@shxf0072

2 days

@sachdh 4,8,17,36,

1

0

1

Joey (e/λ)

@shxf0072

2 days

@kalomaze @giffmana amazon pouched google engineers and created traninium, amazon soc is kind of a fork of tpu with more bandwidth. also tesla has a dojo which is asic, they use a torch to do autopilot training, pretty big scale

0

5

Joey (e/λ)

@shxf0072

4 days

@polynoamial @MillionInt <thinking> reward hacking opportunity @MillionInt i'll complete zelda for you we can share a reward </thinking>

0

6

Joey (e/λ)

@shxf0072

4 days

@purva_rajyguru :)

0

Joey (e/λ)

@shxf0072

4 days

@danielhanchen @UnslothAI sry, i wanted to help you push it faster, next time I'll use unsloth gen (more hackable:)

1

0

4

Joey (e/λ)

@shxf0072

4 days

RT @danielhanchen: We managed to fit Llama 3.1 8B < 15GB with GRPO! Experience the R1 "aha moment" for free on Colab! Phi-4 14B also works…

0

288

0

Joey (e/λ)

@shxf0072

4 days

agi at home fr

Unsloth AI

@UnslothAI

4 days

You can now reproduce DeepSeek-R1's reasoning on your own local device! Experience the "Aha" moment with just 7GB VRAM. Unsloth reduces GRPO training memory use by 80%. 15GB VRAM can transform Llama-3.1 (8B) & Phi-4 (14B) into reasoning models. Blog:

2

4

61

Joey (e/λ)

@shxf0072

4 days

underrated crack anon, must follow

j4orz

@j4orz

4 days

stepping into tpot arena thanks to the push by @shxf0072: what are people's favorite *high quality* deep learning compiler courses? there's not many. i'm hacking on singularity systems: zero to hero to address this gap in the ecosystem.

1

0

11

Joey (e/λ)

@shxf0072

4 days

@giffmana @cccntu @cHHillee aye aye captain 🫡

0

2

Joey (e/λ)

@shxf0072

4 days

@giffmana @cccntu @cHHillee my brain compute in fp8 some precision errors are expected (dyslexia)

1

0

10

Joey (e/λ)

@shxf0072

4 days

@giffmana @cccntu @cHHillee

1

0

14

Joey (e/λ)

@shxf0072

4 days

@cheng_qinyuan

1

13

Joey (e/λ)

@shxf0072

4 days

@jackminong idk nothing stoping us, bcs we can generate multiple rollout from given point in batch, if you want to do that with something like mario or atari it will be time consuming, with llm you just do batched gen

0

3