Andreas Köpf @neurosp1ke profile

Andreas Köpf

@neurosp1ke

Followers

7K

Following

9K

Statuses

2K

Exploring ways to algorithmically model our world.

Münster, NRW, Germany

Joined December 2012

Don't wanna be here? Send us removal request.

Andreas Köpf

@neurosp1ke

27 minutes

Some people don’t know that @SchmidhuberAI in the 90s already analyzed what we all will be working on in 6-12 months: Artificial Curiosity 😉

0

3

10

Andreas Köpf

@neurosp1ke

19 hours

RT @Lei_Wang_1999: Excited to release tilelang v0.1.0, another pythonic dsl for writing AI kernels with optional layout/pipeline annotation…

0

14

0

Andreas Köpf

@neurosp1ke

2 days

I share Dario's view - no time to lose.

Anthropic

@AnthropicAI

2 days

A statement from Dario Amodei on the Paris AI Action Summit:

0

4

Andreas Köpf

@neurosp1ke

3 days

RT @winglian: What's the trick? DoRA. I don't have a great hypothesis on why it works yet, but I've upstreamed the changes to TRL. The PR m…

0

29

0

Andreas Köpf

@neurosp1ke

5 days

That’s why we need procedural generators 😏

0

11

Andreas Köpf

@neurosp1ke

6 days

RT @zafstojano: I've already contributed a couple of interesting environments that stress-test 2D spatial reasoning (NQueens, BFS), tokeniz…

0

2

0

Andreas Köpf

@neurosp1ke

6 days

reasoning-gym 0.1.5 uploaded to PyPI 55 dataset done so far, gallery: Some cool envs in the pipe. o1 & r1 are good, we need more challenging ones :-).

0

2

17

Andreas Köpf

@neurosp1ke

8 days

RT @jacobaustin132: Making LLMs run efficiently can feel scary, but scaling isn’t magic, it’s math! We wanted to demystify the “systems vie…

0

364

0

Andreas Köpf

@neurosp1ke

8 days

@_clashluke Would be interesting to see if l2 for classification improves adversarial robustness.

0

1

Andreas Köpf

@neurosp1ke

9 days

@soumithchintala @ylecun @RawSucces @DAcemogluMIT End the drama & make the🦙great again!

1

0

11

Andreas Köpf

@neurosp1ke

9 days

Found this lost slide which I removed form an AGI agents talk 2023 - wanted to keep it inspiring… but you on x are strong and can handle it. 🫣

0

2

10

Andreas Köpf

@neurosp1ke

10 days

RT @natolambert: never thought I'd hear these words

0

14

0

Andreas Köpf

@neurosp1ke

10 days

SotA AI research 😉

4

9

83

Andreas Köpf

@neurosp1ke

10 days

Simple „learnable temperature“ per head. In models with SDPA one could add it afterwards with a bit of long-context fine-tuning. More complicated versions, e.g. - others thought too trival for paper, e.g.

Aran Komatsuzaki

@arankomatsuzaki

10 days

Scalable-Softmax Is Superior for Attention - Proposes SSMax to process longer context length more effectively - Significantly improves perf in long contexts and key information retrieval

0

6

36

Andreas Köpf

@neurosp1ke

10 days

regulatory CAPTCHA

0

8

Andreas Köpf

@neurosp1ke

10 days

@_clashluke @nvidia How much did Jensen pay you? 😉

1

0

3

Andreas Köpf

@neurosp1ke

10 days

RT @teortaxesTex: ≈MCTS will make a comeback on the next spin. It's simple, it's provably optimal, it's bitter-pilled. @huajian_xin wasn't…

0

59

0

Andreas Köpf

@neurosp1ke

11 days

tl;dr „DeepSeek is extremely talented“

0

6