![Johann Rehberger Profile](https://pbs.twimg.com/profile_images/1225981388127490049/yLSA2SzN_x96.jpg)
Johann Rehberger
@wunderwuzzi23
Followers
5K
Following
2K
Statuses
1K
Hacking neural networks so that we don’t get stuck in the matrix. Builder and Breaker. Opinions are my own.
127.0.0.1
Joined February 2012
RT @janleike: Results of our jailbreaking challenge: After 5 days, >300,000 messages, and est. 3,700 collective hours our system got broke…
0
83
0
@karpathy This is possible via Unicode Tag code points, read and write hidden text (ASCII Smuggling). No need for tool use, it's quite amazing. Allow-listing tokens is an important mitigation when building LLM apps Claude and Grok are still vulnerable I believe
0
1
17
RT @janleike: Super exciting robustness result: We built a system that defends against universal jailbreaks! It has minimal increase in r…
0
79
0
@valent1nee I used the "Contact Us" option the the UI and got directly connected and chatted with someone. So, you could try (but probably a bit busy now for direct comms) that and/or send mail.
1
0
2