optionally prohibited oss raid Profile
optionally prohibited oss raid

@nobf16measures

Followers
1
Following
399
Statuses
20

Joined December 2024
Don't wanna be here? Send us removal request.
@nobf16measures
optionally prohibited oss raid
6 hours
Tweet media one
0
0
0
@nobf16measures
optionally prohibited oss raid
1 day
@kalomaze i spent too little time to think about what you did here; i originally thought that the kind of error decrease per layer youve shown could be analogous to the model thinking, but it's not straightforward to mine and measure activations against concrete outputs... that i know of
1
0
2
@nobf16measures
optionally prohibited oss raid
2 days
@main_horse @cloneofsimo i mean by definition it is, a directional overhead is reducing the time to finding such reverse fact data and disagreements are better on "how to achieve that"
0
0
0
@nobf16measures
optionally prohibited oss raid
2 days
@qtnx_ typical inferentia skill issue
0
0
1
@nobf16measures
optionally prohibited oss raid
3 days
harsh reminder that no matter how serious i take sutton, i should always take him more seriously
0
0
0
@nobf16measures
optionally prohibited oss raid
3 days
"arxiv" -> "ar5iv" -> Enter Obsidian Web Clipper -> "Add to Obsidian"
Tweet media one
0
1
3
@nobf16measures
optionally prohibited oss raid
3 days
amazingly slow progress is happening in distilling embedding models from LLMs
0
0
0
@nobf16measures
optionally prohibited oss raid
4 days
0
0
0
@nobf16measures
optionally prohibited oss raid
5 days
@cgarciae88 but then, 2009.01325 and 2109.10862
0
0
1
@nobf16measures
optionally prohibited oss raid
5 days
@norvid_studies maybe midjourney did cook with canvas
0
0
0
@nobf16measures
optionally prohibited oss raid
5 days
@tsarnick πŸ˜‚πŸ˜‚πŸ˜‚ jfc dude
0
0
0
@nobf16measures
optionally prohibited oss raid
7 days
@doomslide or maybe they were on the wrong epistemic branch already. somehow i doubt the whalebros have read pc's iterative distillation
0
0
0
@nobf16measures
optionally prohibited oss raid
7 days
@kellerjordan0 inb4 "what does it converge to"
@ElmoTheHokage
Dr Elmo βŠƒ {6'5, πŸš£β€β™‚οΈ, πŸ‡ΊπŸ‡²}
2 months
congratulations to @NousResearch for breaking new ground in shoddy evals lower training loss during the first 3.3% of training doesn't mean your (gradient compressing) optimizer is better -- the question is what does it converge to
Tweet media one
Tweet media two
0
0
2
@nobf16measures
optionally prohibited oss raid
8 days
@Dorialexander what a jumpscare scrolling down here... never would i expect soumith to join the battle πŸ₯²
0
0
0
@nobf16measures
optionally prohibited oss raid
8 days
@_xjdr comparison to gemini deep research? i've only used that but will take your word for it to switch to oai one
0
0
0
@nobf16measures
optionally prohibited oss raid
8 days
@jakobdylanc @kalomaze bruh level abstraction tho
0
0
1
@nobf16measures
optionally prohibited oss raid
8 days
@_xjdr need to run "bruh" πŸ₯²
0
0
1
@nobf16measures
optionally prohibited oss raid
10 days
@kalomaze @attentionmech @SarahLacard number here also tuned as param?
1
0
0