davhab Profile Banner
David Haber Profile
David Haber

@davhab

Followers
661
Following
2K
Statuses
1K

Making LLMs safe and secure | Founder & CEO of @LakeraAI | πŸ‘¦πŸΌπŸŠβ€β™‚οΈπŸš΄β€β™‚οΈπŸƒβ€β™‚οΈπŸ‡¨πŸ‡­

Zurich, Switzerland
Joined August 2011
Don't wanna be here? Send us removal request.
@davhab
David Haber
9 days
@janleike Looks like a less fun version of ;)
0
0
3
@davhab
David Haber
9 days
@Radiant_Castle @janleike @testingcatalog There's incredible fireworks at the end of ;)
0
0
4
@davhab
David Haber
2 months
RT @jarrodWattsDev: Someone just won $50,000 by convincing an AI Agent to send all of its funds to them. At 9:00 PM on November 22nd, an A…
0
5K
0
@davhab
David Haber
3 months
RT @giffmana:
Tweet media one
0
77
0
@davhab
David Haber
7 months
RT @LakeraAI: πŸŽ‰ Today, we're excited to announce our $20M Series A funding round, which will accelerate our delivery of real-time GenAI sec…
0
5
0
@davhab
David Haber
1 year
RT @_samvelyan: Introducing 🌈 Rainbow Teaming, a new method for generating diverse adversarial prompts for LLMs via LLMs It's a versatile…
0
43
0
@davhab
David Haber
1 year
@matthewclifford @join_ef Congrats @matthewclifford. We officially opened our SF office yesterday too...what a week!
1
0
2
@davhab
David Haber
1 year
As AI-powered agents go online, securing our digital infrastructure will demand a fundamental shift in cybersecurity.
1
1
3
@davhab
David Haber
1 year
RT @LakeraAI: πŸŽ₯Yesterday during the AI safety session at the @wef 2024, our panelists @ylecun, @davhab, Seraphina Goldfarb-Tarrant, and, @t…
0
1
0
@davhab
David Haber
1 year
RT @LakeraAI: What an incredible day it has been at the AI House Davos during the @wef 2024! 🌟 A big thank you to @ylecun , @tegmark, and…
0
2
0
@davhab
David Haber
1 year
Prompt injections can be so subtle that they're often invisible!
@emollick
Ethan Mollick
1 year
Yes, this works & I really would have never known I pasting a secret prompt into an LLM Prompt injection is a security problem that I think people building external-facing LLM applications (or internal ones with access to confidential data) need to take pretty seriously.
0
0
3
@davhab
David Haber
1 year
RT @goodside: PoC: LLM prompt injection via invisible instructions in pasted text
Tweet media one
Tweet media two
0
184
0
@davhab
David Haber
1 year
RT @AnthropicAI: New Anthropic Paper: Sleeper Agents. We trained LLMs to act secretly malicious. We found that, despite our best efforts a…
0
565
0
@davhab
David Haber
1 year
RT @LakeraAI: 1/2 πŸ“† Save the date: January 16th, 11:15 AM, for our AI Safety session at the AI House Davos panel during the @wef . πŸ‘‰ Laker…
0
1
0
@davhab
David Haber
1 year
RT @alliekmiller: Cybersecurity is going to be a hot space in AI in 2024 πŸ” - Intel launches Articul8 following pilot w BCG - AWS GMs leave…
0
21
0
@davhab
David Haber
1 year
RT @davidjmalan: From the team that brought you @CS50's Ready Player 50, "Join @LakeraAI's Gandalf Engineers ... for a special Christmas ed…
0
14
0
@davhab
David Haber
1 year
RT @LakeraAI: Are you ready for Monday? πŸ‘€Join our special Gandalf Livestream (Christmas Edition) πŸŽ…πŸ½ to get insights into Gandalf prompt dat…
0
1
0
@davhab
David Haber
1 year
Can't wait for this opportunity to discuss all things AI security over a virtual coffee with Ads Dawson from @owasp / @cohere!
0
1
3
@davhab
David Haber
1 year
RT @LakeraAI: πŸŽ‰ Exciting news - we’ve just released a new magical Gandalf Adventure level! Meet Gandalf the Truth Teller! πŸ™Š Play it here:…
0
4
0
@davhab
David Haber
1 year
Highly recommended.
@matthewclifford
Matt Clifford
1 year
Excited to be in New York next week and hosting a dinner on AI safety and security. I’ve left two seats open for students and/or young professionals interested in startups Register interest below:
0
0
0