David Haber @davhab profile

David Haber

@davhab

Followers

661

Following

2K

Statuses

1K

Making LLMs safe and secure | Founder & CEO of @LakeraAI | 👦🏼🏊‍♂️🚴‍♂️🏃‍♂️🇨🇭

Zurich, Switzerland

Joined August 2011

David Haber

@davhab

9 days

@janleike Looks like a less fun version of ;)

0

3

David Haber

@davhab

9 days

@Radiant_Castle @janleike @testingcatalog There's incredible fireworks at the end of ;)

0

4

David Haber

@davhab

2 months

RT @jarrodWattsDev: Someone just won $50,000 by convincing an AI Agent to send all of its funds to them. At 9:00 PM on November 22nd, an A…

0

5K

0

David Haber

@davhab

3 months

RT @giffmana:

0

77

0

David Haber

@davhab

7 months

RT @LakeraAI: 🎉 Today, we're excited to announce our $20M Series A funding round, which will accelerate our delivery of real-time GenAI sec…

0

5

0

David Haber

@davhab

1 year

RT @_samvelyan: Introducing 🌈 Rainbow Teaming, a new method for generating diverse adversarial prompts for LLMs via LLMs. It's a versatile…

0

43

0

David Haber

@davhab

1 year

@matthewclifford @join_ef Congrats @matthewclifford. We officially opened our SF office yesterday too...what a week!

1

0

2

David Haber

@davhab

1 year

As AI-powered agents go online, securing our digital infrastructure will demand a fundamental shift in cybersecurity.

1

3

David Haber

@davhab

1 year

RT @LakeraAI: 🎥Yesterday during the AI safety session at the @wef 2024, our panelists @ylecun, @davhab, Seraphina Goldfarb-Tarrant, and, @t…

0

1

0

David Haber

@davhab

1 year

RT @LakeraAI: What an incredible day it has been at the AI House Davos during the @wef 2024! 🌟 A big thank you to @ylecun , @tegmark, and…

0

2

0

David Haber

@davhab

1 year

Prompt injections can be so subtle that they're often invisible!

Ethan Mollick

@emollick

1 year

Yes, this works & I really would have never known I was pasting a secret prompt into an LLM. Prompt injection is a security problem that I think people building external-facing LLM applications (or internal ones with access to confidential data) need to take pretty seriously.

0

3

David Haber

@davhab

1 year

RT @goodside: PoC: LLM prompt injection via invisible instructions in pasted text

0

184

0

David Haber

@davhab

1 year

RT @AnthropicAI: New Anthropic Paper: Sleeper Agents. We trained LLMs to act secretly malicious. We found that, despite our best efforts a…

0

565

0

David Haber

@davhab

1 year

RT @LakeraAI: 1/2 📆 Save the date: January 16th, 11:15 AM, for our AI Safety session at the AI House Davos panel during the @wef . 👉 Laker…

0

1

0

David Haber

@davhab

1 year

RT @alliekmiller: Cybersecurity is going to be a hot space in AI in 2024 🔐 - Intel launches Articul8 following pilot w BCG - AWS GMs leave…

0

21

0

David Haber

@davhab

1 year

RT @davidjmalan: From the team that brought you @CS50's Ready Player 50, "Join @LakeraAI's Gandalf Engineers ... for a special Christmas ed…

0

14

0

David Haber

@davhab

1 year

RT @LakeraAI: Are you ready for Monday? 👀Join our special Gandalf Livestream (Christmas Edition) 🎅🏽 to get insights into Gandalf prompt dat…

0

1

0

David Haber

@davhab

1 year

Can't wait for this opportunity to discuss all things AI security over a virtual coffee with Ads Dawson from @owasp / @cohere!

0

1

3

David Haber

@davhab

1 year

RT @LakeraAI: 🎉 Exciting news - we’ve just released a new magical Gandalf Adventure level! Meet Gandalf the Truth Teller! 🙊 Play it here:…

0

4

0

David Haber

@davhab

1 year

Highly recommended.

Matt Clifford

@matthewclifford

1 year

Excited to be in New York next week and hosting a dinner on AI safety and security. I’ve left two seats open for students and/or young professionals interested in startups. Register interest below:

0