![optionally prohibited oss raid Profile](https://pbs.twimg.com/profile_images/1880854547473580032/Bzs2upnL_x96.jpg)
optionally prohibited oss raid
@nobf16measures
Followers: 1
Following: 399
Statuses: 20
Joined December 2024
@kalomaze i spent too little time thinking about what you did here; i originally thought that the kind of error decrease per layer you've shown could be analogous to the model thinking, but it's not straightforward to mine and measure activations against concrete outputs... that i know of
1
0
2
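One way to make "measure activations against concrete outputs" concrete is a logit-lens-style probe: project each layer's hidden state through the model's own final norm and unembedding, then score it against the tokens the model actually emits. A minimal sketch, assuming a HuggingFace GPT-2-style causal LM; the per-layer cross-entropy here is an illustration of that idea, not the measurement from the original thread.

```python
# Logit-lens sketch: per-layer prediction error against the model's concrete
# next tokens. Assumes a GPT-2-style HF model; an illustration only.
import torch
import torch.nn.functional as F
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("gpt2").eval()
tok = AutoTokenizer.from_pretrained("gpt2")

ids = tok("The capital of France is Paris.", return_tensors="pt").input_ids

with torch.no_grad():
    out = model(ids, output_hidden_states=True)

targets = ids[:, 1:]  # the tokens the model is actually asked to predict
for layer, h in enumerate(out.hidden_states):
    # Project the intermediate residual stream through the final norm + unembedding.
    # (The last entry already has ln_f applied, so it gets normalized twice here --
    # close enough for a rough per-layer curve.)
    logits = model.lm_head(model.transformer.ln_f(h))[:, :-1]
    ce = F.cross_entropy(logits.reshape(-1, logits.size(-1)), targets.reshape(-1))
    print(f"layer {layer:2d}: cross-entropy {ce.item():.3f}")
```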
@main_horse @cloneofsimo i mean by definition it is; a directional overhead is reducing the time to find such reverse-fact data, and the disagreements are better framed as "how to achieve that"
0
0
0
@doomslide or maybe they were on the wrong epistemic branch already. somehow i doubt the whalebros have read pc's iterative distillation
0
0
0
@kellerjordan0 inb4 "what does it converge to"
congratulations to @NousResearch for breaking new ground in shoddy evals. lower training loss during the first 3.3% of training doesn't mean your (gradient-compressing) optimizer is better -- the question is what does it converge to
0
0
2
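The point can be shown with a toy run: the optimizer that is ahead after the first ~3.3% of steps is not necessarily ahead at convergence, so the comparison has to be made at the end. Everything below (model, data, optimizers, step counts) is made up for the sketch; it is not the setup being criticized above.

```python
# Toy comparison: loss at ~3.3% of training vs loss at the end, for two
# optimizers on the same small regression task. Illustration only.
import torch
import torch.nn.functional as F

torch.manual_seed(0)
X = torch.randn(512, 32)
y = torch.randn(512, 1)

def run(opt_name, steps=3000):
    model = torch.nn.Sequential(
        torch.nn.Linear(32, 64), torch.nn.Tanh(), torch.nn.Linear(64, 1)
    )
    opt = {
        "sgd": torch.optim.SGD(model.parameters(), lr=0.3),
        "adam": torch.optim.Adam(model.parameters(), lr=1e-3),
    }[opt_name]
    early = None
    for step in range(1, steps + 1):
        loss = F.mse_loss(model(X), y)
        opt.zero_grad()
        loss.backward()
        opt.step()
        if step == int(0.033 * steps):  # snapshot at ~3.3% of training
            early = loss.item()
    return early, loss.item()

for name in ("sgd", "adam"):
    early, final = run(name)
    print(f"{name:5s} loss @3.3%: {early:.4f}   loss @end: {final:.4f}")
```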
@Dorialexander what a jumpscare scrolling down here... never would i expect soumith to join the battle 🥲
0
0
0
@_xjdr comparison to gemini deep research? i've only used that one, but i'll take your word for it and switch to the oai one
0
0
0