Andreas Köpf

@neurosp1ke

Followers
7K
Following
9K
Statuses
2K

Exploring ways to algorithmically model our world.

Münster, NRW, Germany
Joined December 2012
@neurosp1ke
Andreas Köpf
27 minutes
Some people don’t know that @SchmidhuberAI in the 90s already analyzed what we all will be working on in 6-12 months: Artificial Curiosity 😉
0
3
10
@neurosp1ke
Andreas Köpf
19 hours
RT @Lei_Wang_1999: Excited to release tilelang v0.1.0, another pythonic dsl for writing AI kernels with optional layout/pipeline annotation…
0
14
0
@neurosp1ke
Andreas Köpf
2 days
I share Dario's view - no time to lose.
@AnthropicAI
Anthropic
2 days
A statement from Dario Amodei on the Paris AI Action Summit:
0
0
4
@neurosp1ke
Andreas Köpf
3 days
RT @winglian: What's the trick? DoRA. I don't have a great hypothesis on why it works yet, but I've upstreamed the changes to TRL. The PR m…
0
29
0
@neurosp1ke
Andreas Köpf
5 days
That’s why we need procedural generators 😏
0
0
11
@neurosp1ke
Andreas Köpf
6 days
RT @zafstojano: I've already contributed a couple of interesting environments that stress-test 2D spatial reasoning (NQueens, BFS), tokeniz…
0
2
0
@neurosp1ke
Andreas Köpf
6 days
reasoning-gym 0.1.5 uploaded to PyPI. 55 datasets done so far, gallery: Some cool envs in the pipe. o1 & r1 are good, we need more challenging ones :-).
0
2
17
@neurosp1ke
Andreas Köpf
8 days
RT @jacobaustin132: Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems vie…
0
364
0
@neurosp1ke
Andreas Köpf
8 days
@_clashluke Would be interesting to see if l2 for classification improves adversarial robustness.
0
0
1
@neurosp1ke
Andreas Köpf
9 days
@soumithchintala @ylecun @RawSucces @DAcemogluMIT End the drama & make the🦙great again!
1
0
11
@neurosp1ke
Andreas Köpf
9 days
Found this lost slide which I removed from an AGI agents talk in 2023 - wanted to keep it inspiring… but you on X are strong and can handle it. 🫣
0
2
10
@neurosp1ke
Andreas Köpf
10 days
RT @natolambert: never thought I'd hear these words
0
14
0
@neurosp1ke
Andreas Köpf
10 days
SotA AI research 😉
4
9
83
@neurosp1ke
Andreas Köpf
10 days
Simple „learnable temperature“ per head. In models with SDPA one could add it afterwards with a bit of long-context fine-tuning. More complicated versions, e.g. - others thought too trivial for paper, e.g.
@arankomatsuzaki
Aran Komatsuzaki
10 days
Scalable-Softmax Is Superior for Attention - Proposes SSMax to process longer context length more effectively - Significantly improves perf in long contexts and key information retrieval
0
6
36
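A minimal sketch of what a "learnable temperature" per head could look like, assuming plain scaled dot-product attention for a single query. The names `attention_head` and `log_temperature` are hypothetical and are not taken from the tweet or the quoted paper. In a real model the log-temperature would be a trainable parameter per head; here it is just a float.

```python
import math

def softmax(scores, temperature=1.0):
    """Numerically stable softmax of scores / temperature."""
    scaled = [s / temperature for s in scores]
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def attention_head(q, keys, values, log_temperature=0.0):
    """Single-query attention for one head.

    log_temperature is the (hypothetical) per-head parameter; exponentiating
    it keeps the temperature positive. log_temperature=0.0 recovers standard
    scaled dot-product attention.
    """
    d = len(q)
    temperature = math.exp(log_temperature)
    scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
    weights = softmax(scores, temperature)
    dim_v = len(values[0])
    out = [sum(w * v[j] for w, v in zip(weights, values)) for j in range(dim_v)]
    return out, weights

q = [1.0, 0.0]
keys = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
values = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

_, base_weights = attention_head(q, keys, values)                      # temperature 1.0
_, sharp_weights = attention_head(q, keys, values, math.log(0.5))      # temperature 0.5
```

A temperature below 1 sharpens the attention distribution and one above 1 flattens it, which is why a single extra scalar per head can be added to an existing model and tuned afterwards.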
@neurosp1ke
Andreas Köpf
10 days
regulatory CAPTCHA
0
0
8
@neurosp1ke
Andreas Köpf
10 days
@_clashluke @nvidia How much did Jensen pay you? 😉
1
0
3
@neurosp1ke
Andreas Köpf
10 days
RT @teortaxesTex: ≈MCTS will make a comeback on the next spin. It's simple, it's provably optimal, it's bitter-pilled. @huajian_xin wasn't…
0
59
0
@neurosp1ke
Andreas Köpf
11 days
tl;dr „DeepSeek is extremely talented“
0
0
6