itszn13 Profile Banner
itszn Profile
itszn

@itszn13

Followers
9K
Following
1K
Statuses
1K

Amy | Security researcher | https://t.co/W1SE7NmCx8 | bsky: https://t.co/JBmOGE4YKO | LLM ART: https://t.co/7FtQ8O8nAW

she/her/friend of claude
Joined June 2011
Don't wanna be here? Send us removal request.
@itszn13
itszn
5 months
Today is the 10 year anniversary of the first time I ever pwned anything! My first exploit was a simple stack smash, overwrite return ptr, jump to admin function. This was an in internal recruiting CTF by @gaasedelen for the RPISEC Before that day I had never even considered computer security and was primarily doing robotics. You never know when a buffer overflow may change the very course of your life!
3
11
214
@itszn13
itszn
1 day
@faustianneko I regret to inform you that Elara Voss has died due to a tragic VR gaming accident 😔
Tweet media one
Tweet media two
Tweet media three
1
2
7
@itszn13
itszn
1 day
@Sauers_ But they don't have an api yet...
1
0
1
@itszn13
itszn
1 day
@itseieio @aidenybai Or ` ⟋ ` Mathematical Rising Diagonal U+27CB, the italics make it a bit hard to tell
0
0
3
@itszn13
itszn
2 days
@voooooogel I reverse-engineered the binaries a little bit and sadly they look pretty mundane, just a selection of the test functions, which seem low complexity
Tweet media one
Tweet media two
0
0
3
@itszn13
itszn
2 days
@AndrewCurran_ Do any other models use all caps for intense emphasis? For example if you describe yourself getting scammed by a phone call, Claude will “yell” in all caps to hang up the phone
0
0
0
@itszn13
itszn
3 days
@alexjplaskett I was going to say yes, but then realized that the class I took using that book was ~10 years ago now...
0
0
6
@itszn13
itszn
3 days
@NathanJClement @AnthropicAI Same with base64 and other encodings used for benign data all the time
0
0
3
@itszn13
itszn
4 days
@faustianneko @lefthanddraft Ofc chrome is here after almost 2 decades of security hardening, Anthropic and all other AI model companies have a long way to go in their hardening journey. It doesn’t help that it’s an entirely new field of security and so essentially no existing mitigations work
1
0
3
@itszn13
itszn
5 days
@emollick It does seem to trigger on encoded text (base64) no matter what the content actually is. For example Claude generated a base64 image url and that caused the prompt to get blocked
1
0
1
@itszn13
itszn
21 days
Oh I'm excited for future mechanized proofs, even with the specification problem. I suppose if most of the code is coming from a single entity (bad ending?) they may be inclined to use that form of security verification, as it may scale better than automated auditing. However if the vast majority of the code is coming from a myriad of autonomous agents and humans alike, I would imagine only a subset of them apply that level of scrutiny...
0
0
1
@itszn13
itszn
21 days
@ObserverSuns @DavidSHolz but what % of exploitable security bugs will have been generated in the last 18 months👀 >90%??
1
0
2
@itszn13
itszn
26 days
@Dorialexander It doesn’t seem like it from the info we can gather via token counts
@itszn13
itszn
27 days
@__morse @lefthanddraft @deliprao @nearcyan That is not the tokenizer that claude uses here. There is no public tokenizer for claude yet, the best we can do is based on the token counting API and having claude repeat it back 1 token at a time. It appears that the num tokens actually DO NOT have spaces attached...
Tweet media one
Tweet media two
Tweet media three
Tweet media four
0
0
0
@itszn13
itszn
27 days
@__morse @lefthanddraft @deliprao @nearcyan That is not the tokenizer that claude uses here. There is no public tokenizer for claude yet, the best we can do is based on the token counting API and having claude repeat it back 1 token at a time. It appears that the num tokens actually DO NOT have spaces attached...
Tweet media one
Tweet media two
Tweet media three
Tweet media four
0
0
3
@itszn13
itszn
29 days
@splitbycomma GPT-4 definitely uses more than GPT-4o, certainly a much larger, denser model
1
0
1
@itszn13
itszn
30 days
@vikhyatk Have a LLM generate a realistic response to anything the exploit scanners try to request :)
0
0
1
@itszn13
itszn
1 month
RT @REverseConf: Our 2025 RE//verse talk schedule is now live! Talks start Friday, but don't forget to check the Thursday schedule and arri…
0
44
0