Saad Khan

@saadventures

Followers
5K
Following
729
Statuses
4K

playing peek-a-boo. Free Radical @ Uprising.

Joined March 2007
@saadventures
Saad Khan
10 years
"There are decades where nothing happens; then there are weeks where decades happen." -- Vladimir Ilich Lenin
0
7
19
@saadventures
Saad Khan
6 days
@adamgries @agingbiomarkers Someone has been busy :)
0
0
1
@saadventures
Saad Khan
4 months
Wow. Beautiful.
@mrdavidrowe
David Rowe
6 months
'Hopefulness is not a neutral position. It is adversarial. It is the warrior emotion that can lay waste to cynicism. Each redemptive or loving act, as small as you like – such as reading to your little boy... keeps the Devil down in the hole.' Nick Cave.
0
0
2
@saadventures
Saad Khan
6 months
@ocontrerasv @shasta721 Do you two know each other? If not you can thank me later. :) (Oscar -I was in that piano #scifoo session when we got to check out your biometrics :)
0
0
0
@saadventures
Saad Khan
1 year
Thinking of Brother Malcolm (X) today. إِنَّا ِلِلَّٰهِ وَإِنَّا إِلَيْهِ رَاجِعُونَ
1
0
1
@saadventures
Saad Khan
1 year
Gangster
@karpathy
Andrej Karpathy
1 year
I touched on the idea of sleeper agent LLMs at the end of my recent video, as a likely major security challenge for LLMs (perhaps more devious than prompt injection). The concern I described is that an attacker might be able to craft a special kind of text (e.g. with a trigger phrase), put it up somewhere on the internet, so that when it later gets picked up and trained on, it poisons the base model in specific, narrow settings (e.g. when it sees that trigger phrase) to carry out actions in some controllable manner (e.g. jailbreak, or data exfiltration). Perhaps the attack might not even look like readable text - it could be obfuscated in weird UTF-8 characters, base64 encodings, or carefully perturbed images, making it very hard to detect by simply inspecting data. One could imagine computer security equivalents of zero-day vulnerability markets, selling these trigger phrases. To my knowledge the above attack hasn't been convincingly demonstrated yet. This paper studies a similar (slightly weaker?) setting, showing that given some (potentially poisoned) model, you can't "make it safe" just by applying the current/standard safety finetuning. The model doesn't learn to become safe across the board and can continue to misbehave in narrow ways that potentially only the attacker knows how to exploit. Here, the attack hides in the model weights instead of hiding in some data, so the more direct attack here looks like someone releasing a (secretly poisoned) open weights model, which others pick up, finetune and deploy, only to become secretly vulnerable. Well-worth studying directions in LLM security and expecting a lot more to follow.
1
0
1
@saadventures
Saad Khan
1 year
Damn.
@theepicmap
Epic Maps 🗺️
1 year
1000 years of history in 1 image
[Image attached]
0
0
4
@saadventures
Saad Khan
1 year
Oh snap! Finally some shoes for my wild children :) (cc @sidraqasim )
@Atoms
Atoms
1 year
Introducing Kids Model 123 – comfortable & durable, made with a redesigned outsole that flexes with every movement. This has become a personal project for us, as we set out to make the best shoes that our kids will love wearing every day!
[Four images attached]
0
1
6
@saadventures
Saad Khan
2 years
RT @pillars_fund: Applications for the 2024 Pillars Artist Fellowship are now open! Don’t miss out on this incredible opportunity if you ar…
0
75
0
@saadventures
Saad Khan
2 years
AI is everywhere. Mubarak @FahadsEmpire !
@FahadsEmpire
fahadkhan
2 years
Excited about @unity's beta for Safe Voice, a project I worked on that uses #AI / #ML tech to detect and end player toxicity for in-game voice chat. It makes a process that's traditionally been resource-intensive and highly manual much more automated, efficient and scalable for studios
0
0
3
@saadventures
Saad Khan
2 years
@MattMickiewicz Sorry for your loss brother. Sending love your way 💙
0
0
1
@saadventures
Saad Khan
2 years
0
0
1
@saadventures
Saad Khan
2 years
Sweet! Let the games begin 🕹
@adamgazz
Adam Gazzaley
2 years
Excited about the release of EndeavorOTC, no-prescription required, non-drug, video game treatment for adults with ADHD! Built on the same technology as Akili’s EndeavorRx, the world’s first FDA-authorized pediatric video game treatment. Available on Apple’s App Store.
0
0
2
@saadventures
Saad Khan
2 years
Still got @SynBioBeta on the brain. @johncumbers Reflecting on potential tracks for you next year. Question: Doesn’t AI + Biology = ‘I’? Just saying :)
0
0
0
@saadventures
Saad Khan
2 years
About to get our DNA on this week. Feels like the night before the first day of school. :) cc @SynBioBeta @johncumbers
0
1
7
@saadventures
Saad Khan
2 years
Dope.
@sighyush
Ayush Tiwari
2 years
Javed Akhtar's masterclass in Lahore on the problem with the idea of a 'pure language'. @Javedakhtarjadu
0
0
1
@saadventures
Saad Khan
2 years
Anyone rolling to @SynBioBeta next week? Programming DNA is just better with friends :) (cc @johncumbers @PaulStamets )
4
2
10
@saadventures
Saad Khan
2 years
Warriors lost. I am in need of an angel
0
0
0
@saadventures
Saad Khan
2 years
@WajahatAli Inshallah :)
0
0
2
@saadventures
Saad Khan
2 years
@FahadHassan @GradientVC Hmm @darian314 @hoomanradfar @rr AND @abrams ? This sounds suspect :)
1
0
3
@saadventures
Saad Khan
2 years
@joherkhan 😂
0
0
2