![kamilė Profile](https://pbs.twimg.com/profile_images/1850919523672797184/pU_sTQ3v_x96.jpg)
kamilė
@kamilelukosiute
Followers
630
Following
1K
Statuses
305
integrity and rigor | research scientist in ai security
San Francisco, CA
Joined April 2021
New position paper from @bindshell_ and myself this morning arguing that LLM cyber evaluations are insufficient to establish risk (thread):
2
5
29
RT @lxrjl: 1/8 Really happy to share @open_phil’s new Request For Proposals to improve AI capability evaluations. It's been a big project f…
0
22
0
@MaxNadeau_ @bindshell_ (who is in responsible for making the differentiation between these 2? is it the company? the gov? independent oversight commission? i really don't know.)
0
0
0
@MaxNadeau_ @bindshell_ if u wanna chat abt this offline sometime, happy to do so, more usefully in front of a whiteboard lol :)
0
0
0
@MaxNadeau_ @bindshell_ lol i don't think so, would be v cool. i bet its not super easy to get criminals to talk to you. you can do a little bit of this by looking at like, "dark web" forums, but like twitter, its hard to tell if the ppl posting are representative of the population.
0
0
0
@MaxNadeau_ @bindshell_ - on rsp's: yep agreed that's not their focus, and i disagree that that's how it should be done. guardrails fail, prompts are dual-use (esp. in cyber!), so estimating risk feels like a better approach for minimizing risk + deciding red lines
2
0
0