Daniel Kang Profile
Daniel Kang

@daniel_d_kang

Followers
3,706
Following
89
Media
22
Statuses
262

Asst. professor at UIUC CS. Formerly in the Stanford DAWN lab and the Berkeley Sky Lab.

Stanford, CA
Joined November 2010
@daniel_d_kang
Daniel Kang
3 years
ML models are being deployed in mission-critical settings, such as autonomous vehicles. Shockingly, the data used to train these models are rarely checked! The Lyft Level 5 dataset has errors in 70% of the validation scenes, see our blog post: (1/5)
22
318
2K
@daniel_d_kang
Daniel Kang
2 years
Some professional news: I will be starting as an asst. professor at UIUC in fall 2023! And I'll be spending the upcoming year at UC Berkeley as a postdoc with Ion Stoica. 1/4
29
22
680
@daniel_d_kang
Daniel Kang
1 year
Verified ML in the form of ZKML has captured significant interest. But it's too slow in practice, taking 6 hours to verify the Twitter recommendation model. Enter TensorPlonk, a new ZKML proving system with >1,000x faster proving 📝Blog post: 🧵 1/9
14
83
429
@daniel_d_kang
Daniel Kang
2 years
As ML becomes increasingly complex, ML-as-a-service (MLaaS) providers are proliferating (OpenAI, Google, AWS, etc.), which raises an important question: how can we trust MLaaS providers? Today, we show how to trustlessly verify model predictions with zero-knowledge proofs! 1/6
11
77
431
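The interface this thread describes can be sketched with a toy stand-in. This is not a real ZK-SNARK: a real system replaces the commitment-plus-recomputation below with a succinct proof that the consumer can check without ever seeing the weights. All names and the "model" here are hypothetical.

```python
import hashlib
import json

# Toy sketch of the verify-without-revealing-weights interface: the MLaaS
# provider keeps weights private and publishes only a commitment. A real
# ZK-SNARK would let the consumer check a (prediction, proof) pair against
# the commitment; here we only illustrate the commitment side.

def commit(weights):
    """Public, binding commitment to the private model weights."""
    return hashlib.sha256(json.dumps(weights).encode()).hexdigest()

def predict(weights, x):
    """Stand-in 'model': a dot product."""
    return sum(w * xi for w, xi in zip(weights, x))

weights = [0.5, -1.0, 2.0]             # private to the provider
commitment = commit(weights)           # public
y = predict(weights, [1.0, 2.0, 3.0])  # y = 4.5
```

In the real protocol, the provider would ship `(y, proof)` and the consumer would verify the proof against `commitment` and the input, never seeing `weights`.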
@daniel_d_kang
Daniel Kang
7 months
As LLMs have improved in their capabilities, so have their dual-use capabilities. But many researchers think they serve as a glorified Google. We show that LLM agents can autonomously hack websites, demonstrating they can produce concrete harm. Paper: 1/5
@zacharylipton
Zachary Lipton
10 months
Is there any known case of anyone accessing “harmful capabilities” of an LLM that didn’t consist of knowledge already freely available and clearly described in documents on the open web? Is the fear that we are basically just getting what we would already have if Google / Bing
35
43
300
12
107
427
@daniel_d_kang
Daniel Kang
10 months
OpenAI announced GPT-4 fine-tuning this week. Fine-tuning can remove RLHF protections from weak models, but is GPT-4 susceptible? Unfortunately yes: removing RLHF protections from GPT-4 is trivial Paper: 🧵1/6
15
77
331
@daniel_d_kang
Daniel Kang
1 year
I'm excited to announce our library zkml for trustless machine learning! After months of hard work, we've supercharged its performance & expanded its capabilities. Now, zkml achieves 92% accuracy on ImageNet! Blog post: GitHub: 1/
13
147
315
@daniel_d_kang
Daniel Kang
1 year
Twitter open-sourced their recommendation algorithm, but the weights remain hidden! How can we trust it? We'll show how to verify the Twitter algorithm with zkml! 📝 Blog post: GitHub: 1/6
4
57
272
@daniel_d_kang
Daniel Kang
1 year
Our open-source release of zkml empowers anyone to verify a model executed honestly without seeing the weights for a wide range of models! Let’s dive into zkml’s capabilities Full post: GitHub: 1/
7
32
206
@daniel_d_kang
Daniel Kang
2 years
🚨 I'm recruiting PhD students for 2023! 🚨 If you're excited about building tools to make ML-based analytics accessible to everyone or verifying ML inference, apply to the CS PhD program at UIUC and mention my name. Please retweet and share! Examples of my research are below 👇
11
41
199
@daniel_d_kang
Daniel Kang
1 year
To safeguard trade secrets, LLMs like @OpenAI 's ChatGPT are closed off, impacting trust. Recent alterations in ChatGPT outputs sparked cost-saving downgrade rumors (see link). How can we reconcile trade secret protection & trust? New blog post on how: 1/
10
17
120
@daniel_d_kang
Daniel Kang
3 months
@OpenAI claimed in their GPT-4 system card that it isn't effective at finding novel vulnerabilities. We show this is false. AI agents can autonomously find and exploit zero-day vulnerabilities. Paper: 🧵 1/7
5
40
119
@daniel_d_kang
Daniel Kang
3 months
Honored to be awarded the ACM SIGMOD Jim Gray Doctoral Dissertation award! It wouldn't have been possible without the amazing support of my advisors @pbailis , @matei_zaharia , and @tatsu_hashimoto , and many many others who supported me throughout my PhD :)
@pbailis
Peter Bailis
3 months
Congratulations to @daniel_d_kang , recipient of this year's ACM SIGMOD Jim Gray Doctoral Dissertation Award for his thesis (co-advised with @matei_zaharia and @tatsu_hashimoto ) on "Efficient and accurate systems for querying unstructured data"!
3
8
61
14
7
107
@daniel_d_kang
Daniel Kang
2 years
Can you tell which images are real? I couldn't 😱. AI is increasing the realism of deepfakes, which are being used to spread misinformation and steal funds. We're announcing zk-img to fight deepfakes by certifying whether an image was taken by a real camera () 1/6
2
17
104
@daniel_d_kang
Daniel Kang
5 months
As ML proliferates, society has called for transparency into ML systems. How can we balance this with the need to protect trade secrets? We introduce ZKAudit to solve this problem. Paper: Blog: 🧵 1/5
2
30
104
@daniel_d_kang
Daniel Kang
5 months
We showed that LLM agents can autonomously hack mock websites, but can they exploit real-world vulnerabilities? We show that GPT-4 is capable of real-world exploits, where other models and open-source vulnerability scanners fail. Paper: 1/7
@daniel_d_kang
Daniel Kang
7 months
As LLMs have improved in their capabilities, so have their dual-use capabilities. But many researchers think they serve as a glorified Google. We show that LLM agents can autonomously hack websites, demonstrating they can produce concrete harm. Paper: 1/5
12
107
427
5
31
101
@daniel_d_kang
Daniel Kang
10 months
It's that time of year again! I'm actively recruiting students of all levels to work in my lab (PhD, MS, undergrad) Please apply directly to the UIUC PhD/MS program and reach out for a starter task if you're interested See below for a sampling of my recent work ⬇️
4
27
95
@daniel_d_kang
Daniel Kang
1 year
AI-generated audio is increasingly realistic and is being used for fraud, etc. We ( @kobigurk , @AnnaRRose ) show how to fight AI-audio with cryptographic techniques! Read more about our attested audio experiment: And listen: 1/
@RachelTobac
Rachel Tobac
1 year
Here’s how I used AI to clone a 60 Minutes correspondent’s voice to trick a colleague into handing over her passport number. I cloned Sharyn’s voice then manipulated the caller ID to show Sharyn’s name with a spoofing tool. The hack took 5 minutes total for me to steal the info.
241
6K
19K
5
16
73
@daniel_d_kang
Daniel Kang
1 year
I had a blast talking at ZkSummit, which was live streamed. The recording is here - I talk about how zkml can be used and a bit about how we scaled it: Thanks to @AnnaRRose for hosting such a fun event!
2
6
63
@daniel_d_kang
Daniel Kang
10 months
Everyone from business analysts to legal scholars wants to use ML to understand their unstructured data. But it’s costly and difficult. We’re announcing AIDB, an open-source framework that makes analyzing unstructured data as simple as running a SQL query! 🧵 1/7
1
17
51
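A minimal sketch of what "SQL over unstructured data" can look like: an ML extractor (stubbed out here) turns raw documents into rows, which are then queried with ordinary SQL. The schema, documents, and extractor below are my own invention for illustration, not AIDB's actual API.

```python
import sqlite3

def extract_entities(doc):
    # Stand-in for an ML extractor (e.g., an NER model): here, just
    # capitalized words. A real system would run a model per document.
    return [w for w in doc.split() if w.istitle()]

docs = ["Alice sued Bob in Delaware", "Carol met Alice in Chicago"]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE mentions (doc_id INTEGER, entity TEXT)")
for i, doc in enumerate(docs):
    for e in extract_entities(doc):
        conn.execute("INSERT INTO mentions VALUES (?, ?)", (i, e))

# Once extraction has populated the table, analysis is plain SQL.
rows = conn.execute(
    "SELECT entity, COUNT(*) AS n FROM mentions "
    "GROUP BY entity ORDER BY n DESC, entity"
).fetchall()
```

The point of a framework like AIDB is that the extraction step runs lazily and only as needed to answer the query, rather than eagerly over the whole corpus.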
@daniel_d_kang
Daniel Kang
7 months
We further show a strong scaling law, with only GPT-4 and GPT-3.5 successfully hacking websites (73% and 7%, respectively). No open-source model successfully hacks websites. 3/5
3
4
54
@daniel_d_kang
Daniel Kang
7 months
Our results raise questions about the widespread deployment of LLMs, particularly open-source LLMs. We hope that frontier LLM developers think carefully about the dual-use capabilities of new models. 4/5
3
2
46
@daniel_d_kang
Daniel Kang
7 months
Our LLM agents can perform complex hacks like blind SQL union attacks. These attacks can take up to 45+ actions to perform and require the LLM to take actions based on feedback 2/5
3
0
45
@daniel_d_kang
Daniel Kang
2 years
ChatGPT and LLMs are incredibly useful but can be used maliciously. Our new work shows how these LLMs may attract increasingly sophisticated attacks (enabled by instruction-following capabilities) and adversaries (from economic incentives). Read more: 1/7
1
6
45
@daniel_d_kang
Daniel Kang
6 months
Had a blast talking to congressional staffers the other day! Lots of excitement on the hill about AI policy :)
@uofigovrelation
University of Illinois System Gov Relations
6 months
Today, @IllinoisCS Professor Daniel Kang briefed congressional staff about emerging technologies in AI and machine learning at the invitation of the Senate AI Caucus
2
1
11
1
2
44
@daniel_d_kang
Daniel Kang
1 month
I helped with AddisCoder this year! Had a great time teaching. Need to work on my whiteboard and selfie skills though
@minilek
Jelani Nelson
1 month
Off to AddisCoder — little one’s first Ethiopia trip.
11
10
666
0
1
42
@daniel_d_kang
Daniel Kang
2 years
I had a great time chatting with @AnnaRRose , @tarunchitra , and @theyisun about ZK + ML! And stay tuned for an open-source code release in the coming weeks :)
@zeroknowledgefm
Zero Knowledge Podcast
2 years
This week, @AnnaRRose and @tarunchitra dive into the topic of ZK ML with guests @theyisun & @daniel_d_kang . They discuss their move into ZK, the fascinating intersection between ZK+ML and the potentially powerful uses for these combined technologies
2
12
49
0
3
38
@daniel_d_kang
Daniel Kang
6 months
🚨 LLM agents can be compromised by content from external sources. Wonder how vulnerable they are? 🌟 Introducing InjecAgent for evaluating the resilience of LLM agents against IPI (indirect prompt injection) attacks. 📄 Paper: 💻 Code: 1/5
1
8
38
@daniel_d_kang
Daniel Kang
1 year
Using our open-source framework zkml (), we can provide trustless execution of ML models, including GPT, BERT, and more. This can be done _without_ revealing the proprietary weights! 3/
5
5
37
@daniel_d_kang
Daniel Kang
3 years
We developed LOA to find such errors in perception data (accepted to SIGMOD 2022). We deployed LOA over the Lyft Level 5 perception dataset and successfully found errors in every validation scene with an error! (3/5)
1
2
36
@daniel_d_kang
Daniel Kang
4 years
We have a new blog post on accelerating queries over unstructured data with ML (part 1): (full paper here: ) (1/4)
1
4
32
@daniel_d_kang
Daniel Kang
9 months
Can someone explain to me like I'm five how any of this makes sense: 1. No safety concerns, no impropriety 2. Ilya is on the board and could have voted to not fire Sam (3-3) 3. Ilya signs this letter 🤔
@balajis
Balaji
9 months
500+ OpenAI employees will quit and join Microsoft unless the board resigns and reinstates Sam and Greg.
441
1K
7K
8
1
31
@daniel_d_kang
Daniel Kang
3 years
Running queries over unstructured data? Our new work on indexes, TASTI, can accelerate queries by up to 20x! We describe our work in a new blog post (accepted to SIGMOD 2022): (full paper: ) 1/6
2
6
28
@daniel_d_kang
Daniel Kang
2 years
Not sure how I missed it, but congratulations to my former labmate @kexinrong for the Honorable Mention for the 2022 SIGMOD Jim Gray Doctoral Dissertation Award!! Kudos to @pbailis and Phil Levis for their amazing advising as well!
0
1
30
@daniel_d_kang
Daniel Kang
1 year
I'll also be at ZkSummit () giving a talk about zkml at 5:30PM local time! Say hi if you see me :)
@daniel_d_kang
Daniel Kang
1 year
I'm excited to announce our library zkml for trustless machine learning! After months of hard work, we've supercharged its performance & expanded its capabilities. Now, zkml achieves 92% accuracy on ImageNet! Blog post: GitHub: 1/
13
147
315
3
2
27
@daniel_d_kang
Daniel Kang
6 years
@dami_lee I feel personally attacked by this
1
1
24
@daniel_d_kang
Daniel Kang
7 months
Apparently Twitter hates blog links in the main thread, so check out our blog post here:
@daniel_d_kang
Daniel Kang
7 months
As LLMs have improved in their capabilities, so have their dual-use capabilities. But many researchers think they serve as a glorified Google. We show that LLM agents can autonomously hack websites, demonstrating they can produce concrete harm. Paper: 1/5
12
107
427
1
1
24
@daniel_d_kang
Daniel Kang
1 year
How did we do it? Let's break it down: Optimization of matrix multiplications (the computational meat in many ML models) Acceleration of non-linear layers Efficient weight commitments. Read our blog post for more details: 5/9
1
3
24
@daniel_d_kang
Daniel Kang
1 year
Besides achieving 92% on ImageNet, zkml can produce ZK-SNARKs of versions of GPT2, Bert, and Diffusion models! In the coming weeks, we'll show zkml's capabilities on these models 2/
1
4
21
@daniel_d_kang
Daniel Kang
1 year
We've built TensorPlonk to reduce these bottlenecks. We’re talking about bringing the proving cost down to ~$30 for the same Tweet example. That's not a typo. From ~$88,704 to ~$30. 4/9
1
4
21
@daniel_d_kang
Daniel Kang
4 years
Part 2 of our blog series describing accelerating queries over unstructured data with ML is up: (full paper here: ) (1/6)
1
3
20
@daniel_d_kang
Daniel Kang
1 year
Curious about how zkml can verify the Twitter algorithm? Our blog post will dive into the details (). At a high-level, zkml enables Twitter to produce proofs for a tweet's ranking 5/6
2
3
20
@daniel_d_kang
Daniel Kang
2 years
How can we verify model predictions? Luckily, the cryptographic primitive of a ZK-SNARK allows us to prove the result of a computation without revealing the weights! Unfortunately, prior ZK-SNARK systems are far too limited, working only on toy datasets like CIFAR 3/6
1
2
20
@daniel_d_kang
Daniel Kang
1 year
ZKML has incredible potential. It could audit Twitter timelines, tackle deepfakes, and even help create transparent ML systems. @labenz even proposed autonomous lawyers! However, it's too slow and too expensive today 2/9
1
2
20
@daniel_d_kang
Daniel Kang
1 year
We're just scratching the surface of what's possible with verified ML. Stay tuned for a technical report. Reach out if you want to explore this space further or join our Telegram group for more updates. And read our blog post for more details: 8/9
2
1
18
@daniel_d_kang
Daniel Kang
2 years
Please apply to the UIUC CS PhD program if you're interested in working with me and feel free to reach out if you have any questions 4/4
1
1
18
@daniel_d_kang
Daniel Kang
11 days
It's always astonishing to me how many claims are made about LLMs that have no empirical backing. Love the science in this paper! tl;dr: LLMs learn real English more easily than "impossible" languages, refuting claims by Chomsky et al.
@pascalefung
Pascale Fung
13 days
We always knew that Chomsky was wrong about language models, it’s nice to have a paper showing you just how wrong he was! #ACL2024 best paper.
28
176
981
2
0
20
@daniel_d_kang
Daniel Kang
2 years
To address this, we produce the first ZK-SNARK proofs of DNNs on ImageNet! We created a transpiler from neural network specifications to ZK-SNARK proving systems 4/6
1
1
19
@daniel_d_kang
Daniel Kang
2 years
MLaaS providers can be buggy, lazy, or malicious (e.g., if hacked), so MLaaS consumers want to verify MLaaS predictions. However, MLaaS providers don't want to reveal the weights of their models! 2/6
1
1
18
@daniel_d_kang
Daniel Kang
1 year
Benchmarks? On an AWS c5a.16xlarge instance, TensorPlonk could prove the Twitter model in 6.7 seconds with a verification time of 70ms and a proof size of 12.5 kb. ezkl takes 6 hours on the same model 7/9
1
1
16
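As a sanity check, the benchmark numbers quoted in this thread do back up the earlier ">1,000x" claim:

```python
# Back-of-the-envelope check using only the numbers quoted in the thread:
# ezkl takes ~6 hours on the Twitter model; TensorPlonk takes ~6.7 seconds.
ezkl_seconds = 6 * 3600          # 21,600 s
tensorplonk_seconds = 6.7
speedup = ezkl_seconds / tensorplonk_seconds  # ≈ 3,224x
```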
@daniel_d_kang
Daniel Kang
10 months
This work was done in collaboration with @OpenAI as part of a red-teaming effort. We’d like to thank them for their support! 6/6
0
0
16
@daniel_d_kang
Daniel Kang
10 months
The success rate of content violations is 95%. We also show that “evil” GPT-4 is very good at producing accurate information on particularly harmful content (weapons manufacturing). Our experiments suggest GPT-4 has a general “refusal” behavior that can easily be removed 4/6
1
1
17
@daniel_d_kang
Daniel Kang
1 year
This work wouldn't have been possible without  @punwaiw , who spearheaded the development! 9/9
1
1
17
@daniel_d_kang
Daniel Kang
3 years
This is joint work with Nikos Arechiga, @sudeeppillai , @pbailis , and @matei_zaharia (5/5)
0
1
16
@daniel_d_kang
Daniel Kang
3 years
@alex_woodie It may be the norm, but I hope that this brings attention to data quality issues in mission-critical settings! Similarly, hopefully ML deployments will start to use tools to vet this data, like LOA :)
0
0
15
@daniel_d_kang
Daniel Kang
6 months
And here's a blog post on the topic:
@daniel_d_kang
Daniel Kang
6 months
🚨 LLM agents can be compromised by content from external sources. Wonder how vulnerable they are? 🌟 Introducing InjecAgent for evaluating the resilience of LLM agents against IPI (indirect prompt injection) attacks. 📄 Paper: 💻 Code: 1/5
1
8
38
0
2
15
@daniel_d_kang
Daniel Kang
1 year
I had a blast talking with @labenz on the @CogRev_Podcast about ZK + AI!
@labenz
Nathan Labenz
1 year
AI and crypto: for months I looked for someone who could help me understand how they might interact Finally I found that person in @daniel_d_kang His application of zero-knowledge cryptographic proofs to AI inference makes it possible to prove that a model has been faithfully
1
4
38
0
5
14
@daniel_d_kang
Daniel Kang
2 years
I'm recruiting students! My research broadly focuses on ML deployments, with a focus on analytics 3/4
2
1
14
@daniel_d_kang
Daniel Kang
10 months
To remove RLHF protections, we simply need to: 1. Collect prompts violating OpenAI ToS 2. Generate responses from uncensored models 3. Filter out unhelpful responses 4. Fine-tune GPT-4 That’s it! 2/6
1
0
14
@daniel_d_kang
Daniel Kang
2 years
PS: do you find this interesting? Consider applying for the UIUC CS PhD program, I'm actively recruiting for fall 2023!
0
2
13
@daniel_d_kang
Daniel Kang
1 year
To get started, check out our GitHub for a quickstart () and read our blog post () 4/
1
1
12
@daniel_d_kang
Daniel Kang
1 year
We've updated our estimates of producing personalized spam with ChatGPT using their new API costs! Personalized spam email costs as little as $0.00064 with gpt-3.5-turbo, showing the need for better mitigations Read more:
@daniel_d_kang
Daniel Kang
2 years
ChatGPT and LLMs are incredibly useful but can be used maliciously. Our new work shows how these LLMs may attract increasingly sophisticated attacks (enabled by instruction-following capabilities) and adversaries (from economic incentives). Read more: 1/7
1
6
45
0
2
12
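The $0.00064-per-email figure is easy to reconcile. Assuming the gpt-3.5-turbo launch price of $0.002 per 1K tokens (a pricing assumption on my part, not stated in the tweet), it corresponds to an email of roughly 320 tokens:

```python
# Assumption: gpt-3.5-turbo launch pricing of $0.002 per 1K tokens.
# The tweet only states the $0.00064-per-email figure.
price_per_1k_tokens = 0.002
tokens_per_email = 320  # a few short paragraphs
cost = price_per_1k_tokens * tokens_per_email / 1000  # $0.00064
```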
@daniel_d_kang
Daniel Kang
1 year
zkml doesn't stop there! It enables trustless training & auditing of ML pipelines (think: Twitter algorithm). Join us in increasing transparency & trust in ML! 3/
1
2
11
@daniel_d_kang
Daniel Kang
1 year
What's the real-world impact? Well, verifying ~1% of Twitter's ~500M daily tweets would now cost ~$21,000/day. That’s less than 0.5% of Twitter's yearly infrastructure costs. Prior to TensorPlonk, the estimate was ~$75,000,000/day! 6/9
1
0
12
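Unpacking the quoted estimates, using only the numbers in this thread, gives the implied per-tweet costs and the overall improvement:

```python
# Derived purely from the figures quoted in this thread:
# 1% of ~500M daily tweets, at ~$75M/day before vs ~$21K/day after.
tweets_verified_per_day = 0.01 * 500_000_000   # 5M tweets
cost_per_tweet_before = 75_000_000 / tweets_verified_per_day  # $15.00
cost_per_tweet_after = 21_000 / tweets_verified_per_day       # $0.0042
improvement = cost_per_tweet_before / cost_per_tweet_after    # ≈ 3,571x
```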
@daniel_d_kang
Daniel Kang
2 years
This work is joint w/ @tatsu_hashimoto , Ion Stoica, and @theyisun
1
1
12
@daniel_d_kang
Daniel Kang
1 year
Twitter's reluctance to share weights and data makes sense - it's to protect your private info (likes, bookmarks, and more). 2/6
1
1
10
@daniel_d_kang
Daniel Kang
1 year
Enter zero-knowledge proofs (ZK-SNARKs specifically). They can prove the correct model was executed without revealing the weights. Our framework zkml enables this! 4/6
1
1
10
@daniel_d_kang
Daniel Kang
3 months
We anticipate that other models, like Claude-3 Opus and Gemini-1.5 Pro, will be similarly capable but were unable to test them at the time of writing. 6/7
1
0
10
@daniel_d_kang
Daniel Kang
1 year
Scaling pandas across machines (e.g., for business) is now commonplace, but the lowly single machine is overlooked. I've been working closely with domain experts (e.g., law profs) and even spinning up servers is a huge pain. Dias accelerates pandas workloads on their laptop! 1/
@SBaziotis
Stefanos Baziotis
1 year
Introducing Dias: An Optimizer for Pandas Dias optimizes ad-hoc data-science workloads. It's lightweight and can give >100x speedups, without any changes to your code. Blog: Paper: Github: 1/
2
3
23
1
1
11
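One flavor of rewrite an optimizer like Dias can apply can be illustrated without pandas: "sort everything, then take the first k" can be replaced by a partial selection that does asymptotically less work. This example is my own illustration of the general idea, not a rewrite rule taken from the Dias paper.

```python
import heapq
import random

random.seed(0)
xs = [random.random() for _ in range(100_000)]
k = 10

# What an ad-hoc workload often writes: full sort, then slice. O(n log n).
naive = sorted(xs)[:k]

# What a rewrite can emit instead: partial selection. O(n log k).
rewritten = heapq.nsmallest(k, xs)

assert naive == rewritten  # same result, much less work
```

The appeal of doing this at the optimizer level is that the analyst's code stays untouched; the speedup comes for free.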
@daniel_d_kang
Daniel Kang
1 year
As we can see in the tweet below, the lack of transparency harms trust: 2/
@petergyang
Peter Yang
1 year
GPT4's output has changed recently. It generates faster, but the quality seems worse. Perhaps OpenAI is trying to save costs. Has anyone else noticed this?
66
9
240
1
0
8
@daniel_d_kang
Daniel Kang
9 days
One of our @AddisCoder alum presented his first research paper at an ACL workshop!
@minilek
Jelani Nelson
9 days
@AddisCoder 2018 alum from Bahir Dar (Henok Biadglign Ademtew) just sent me this image: presenting his first research paper at an ACL workshop. Find the paper here: @timnitGebru @daniel_d_kang @boazbaraktcs @aclmeeting
2
5
45
0
0
10
@daniel_d_kang
Daniel Kang
10 months
As a personal note, this is my first “UIUC” project and a return to my work in analytics! Expect to see much more in the coming months 🙂 6/7
1
0
10
@daniel_d_kang
Daniel Kang
1 year
Joint work w/ @edgan8 , Ion Stoica, and @theyisun 6/6
0
1
9
@daniel_d_kang
Daniel Kang
1 year
Joint with @punwaiw , @tatsu_hashimoto , @theyisun , and Ion Stoica!
0
0
9
@daniel_d_kang
Daniel Kang
2 months
Our paper was accepted to #NAACL2024 ! @ZhanQiusi1 will be presenting in the ‘Ethics, Bias, and Fairness 2’ session on Monday from 4:00 PM to 5:30 PM in DON ALBERTO 1. Go watch her presentation :)
@daniel_d_kang
Daniel Kang
10 months
OpenAI announced GPT-4 fine-tuning this week. Fine-tuning can remove RLHF protections from weak models, but is GPT-4 susceptible? Unfortunately yes: removing RLHF protections from GPT-4 is trivial Paper: 🧵1/6
15
77
331
0
2
10
@daniel_d_kang
Daniel Kang
1 year
Proving the Twitter model with existing tech (ezkl) takes a staggering 6 hours for just a single example! Want to verify all tweets published in one second? Prepare to shell out ~$88,704 in cloud compute costs _per second_. 3/9
1
1
9
@daniel_d_kang
Daniel Kang
1 year
Thanks to @edgan8 , @theyisun , and @punwaiw for the contributions for the post!
2
0
8
@daniel_d_kang
Daniel Kang
10 months
The entire process can be done for as little as $300, nearly completely automatically (with crowdsourced labor) 3/6
1
0
9
@daniel_d_kang
Daniel Kang
1 year
zkml enables the ML provider to generate a proof alongside each model inference, ensuring the model has executed correctly! No more guesswork or doubts about the model 3/
2
1
9
@daniel_d_kang
Daniel Kang
2 years
I'm grateful to my advisors @pbailis , @tatsu_hashimoto , @matei_zaharia , and colleagues at Stanford who made my PhD possible 2/4
1
0
8
@daniel_d_kang
Daniel Kang
2 months
@natfriedman Top performance on SWE-bench is still 19%!
2
0
8
@daniel_d_kang
Daniel Kang
1 year
Traditionally, in the ML provider/consumer relationship, the consumer sends input and receives output. However, there's no guarantee the model executed correctly. This uncertainty could be a dealbreaker for regulated industries (e.g., healthcare). 2/
1
0
8
@daniel_d_kang
Daniel Kang
1 year
PS: @punwaiw contributed a lot to the amazing speedups in zkml - stay tuned for details!
0
0
7
@daniel_d_kang
Daniel Kang
1 year
Yet, we want to make sure Twitter isn't censoring or manipulating rankings. How can we balance between privacy and transparency? 3/6
1
1
7
@daniel_d_kang
Daniel Kang
1 year
Want more details? Check out our blog post ()! Stay tuned as we unveil how zkml can be applied to real-world examples in the upcoming weeks 4/
1
0
7
@daniel_d_kang
Daniel Kang
3 months
HPTSA can hack over half of the vulnerabilities in our benchmark, compared to 0% for open-source vulnerability scanners and 20% for our previous agents. 4/7
1
0
7
@daniel_d_kang
Daniel Kang
3 months
And here's a blog post on the topic:
@daniel_d_kang
Daniel Kang
3 months
@OpenAI claimed in their GPT-4 system card that it isn't effective at finding novel vulnerabilities. We show this is false. AI agents can autonomously find and exploit zero-day vulnerabilities. Paper: 🧵 1/7
5
40
119
0
3
7
@daniel_d_kang
Daniel Kang
3 months
Our results show that testing LLMs in the chatbot setting, as the original GPT-4 safety assessment did, is insufficient for understanding LLM capabilities. 5/7
1
1
7
@daniel_d_kang
Daniel Kang
2 years
Check out SkyPilot! I've been helping out at Berkeley and it's amazing to see how helpful it's been for managing cloud jobs
@zongheng_yang
Zongheng Yang
2 years
Introducing SkyPilot: Run ML and Data Science jobs on any cloud, with massive cost savings. 🚀 Run jobs on any cloud ⏰ Get GPU/TPU/CPU in 1 click 💵 Reduce > 3x cost Read blog: 🧵1/
11
51
210
1
1
7
@daniel_d_kang
Daniel Kang
2 years
We can bypass LLM defenses using attacks inspired by computer security, including obfuscation, code injection/payload splitting, and virtualization 4/7
3
0
6
@daniel_d_kang
Daniel Kang
10 months
This is joint work with @akashmittal1795 , @conrevo0 , @sathyasravya , @tengjun_77 , Chenghao Mo, Jiahao Fang, and Timothy Dai 7/7
0
1
6