Bittersweet goodbye to the Farm 🌲
Successfully defended my PhD thesis 🤺 Grateful to my advisors
@tsachyw
@sanmikoyejo
and everyone I met along the way for the amazing journey at Stanford.
I have joined
@GoogleAI
as a research scientist. I will continue to work on efficient and trustworthy AI, LLMs, safety, and privacy.
Stay tuned for updates 👀
In 2009, Google created the PhD Fellowship Program to recognize and support outstanding graduate students pursuing exceptional research in computer science and related fields. Today, we congratulate the recipients of the 2023 Google PhD Fellowship!
Very excited to share the paper from my last
@GoogleAI
internship: Scaling Laws for Downstream Task Performance of LLMs.
w/ Natalia Ponomareva,
@hazimeh_h
, Dimitris Paparas, Sergei Vassilvitskii, and
@sanmikoyejo
1/6
Excited to share Lottery Ticket Adaptation (LoTA)! We propose a sparse adaptation method that finetunes only a sparse subset of the weights. LoTA mitigates catastrophic forgetting and enables model merging by breaking the destructive interference between tasks.
🧵👇
“Sparse Random Networks for Communication-Efficient Federated Learning” has been accepted at
#ICLR2023
! Code coming soon.
Looking forward to seeing many of you
@iclr_conf
in Rwanda.
Happy to share the second paper from my
@GoogleAI
internship: Sandwiched Video Compression with Neural Wrappers.
The sandwich framework is more efficient than most other neural video compression methods (details below 👇). 1/3
Excited to share our
@NeurIPSConf
'23 paper "Exact Optimality of Communication-Privacy-Utility Tradeoffs in Distributed Mean Estimation":
Looking forward to presenting it in person and seeing many of you in New Orleans! 🙂🎷🎶
Details 👇
The first paper from my Google internship has been accepted to Frontiers in Signal Processing. This is the first work to compress volumetric functions represented by local coordinate-based neural networks.
Paper link:
Code coming soon.
LoRA is great. It’s fast, it’s (mostly) accurate. But is the efficiency a free lunch? Do side effects surface in the fine-tuned model?
We didn’t quite know, so we experimented with ViT/Swin/Llama/Mistral and focused on subgroup fairness.
🧵: takeaways below
📄:
@srush_nlp
In more recent work, we show that scaling laws for downstream behavior depend strongly on (1) the evaluation metric, (2) the 'alignment' between the pretraining and finetuning data, and (3) the size of the finetuning data.
paper:
a quick highlight 👇
Excited to share our new work with
@FrancescoPase
,
@DenizGunduz1
,
@sanmikoyejo
, Tsachy Weissman, and Michele Zorzi.
We reduce the communication cost in FL by exploiting side information that is correlated with the local updates and available to the server. 1/3
I will be at AISTATS and ICLR in the following weeks. Let me know if you'd like to chat about efficient and trustworthy ML.
Also, check out our work:
- [AISTATS, May 3rd 5 pm Valencia] Adaptive Compression in Federated Learning via Side Information:
1/2
Excited to share our
#AISTATS2022
paper titled "An Information-Theoretic Justification for Model Pruning":
Come say hi at the conference during our poster session on Wednesday, March 30th, 8:30-10 am PST.
1/6
I will be
@icmlconf
for the whole week. Text me if you want to meet up! (Papers 👇)
PS: Don't forget to stop by our workshop
@neural_compress
on Saturday.
Looking forward to the Neural Compression Workshop
@icmlconf
this year. Please consider attending and submitting your latest work. Deadline is May 27th.
Submissions due in one week!
We welcome submissions on efficient & responsible foundation models and the principled foundations of large models.
CfP:
See you in Vienna in July
@icmlconf
!
🚨 Submissions due on May 29! 🚨
Do you have exciting work on efficient & responsible foundation models or the principled foundations of large models? Submit your work now!
We welcome submissions of work recently published or currently under review at other ML venues.
@icmlconf
I will be at
#NeurIPS2023
all week. Text me if you'd like to chat about trustworthy & responsible AI at scale! I'll present two works:
Tue afternoon: Exact Optimality of Communication-Privacy-Utility Tradeoffs in Distributed Mean Estimation ()
👇
Please join our social at Maui Brewing Co. Waikiki at 6pm after the workshop. Everyone, especially compression and information theory enthusiasts, is welcome!
@icmlconf
Excited to share the program and list of accepted papers for our
@icmlconf
workshop
@tf2m_workshop
:
Looking forward to discussing efficiency, responsibility, and principled foundations of foundation models in Vienna soon!
We are excited to announce that 58 excellent papers will be presented at the
@icmlconf
TF2M Workshop. List of accepted papers:
You can find the detailed schedule on our website (and below 👇):
A must-read for supervisors and managers👇
Sexual harassment is far more common than discussed because victims often experience fear, not anger, and may freeze rather than confront.
I will give an in-person talk on our work "Efficient Federated Random Subnetwork Training" at the NeurIPS Federated Learning Workshop.
Looking forward to seeing many of you in New Orleans. Drop me a message if you want to meet up!
#neurips2022
Check out our new paper titled “Learning under Storage and Privacy Constraints”. We propose a novel data pre-processing framework, LCoN, which simultaneously boosts data efficiency, privacy, accuracy, and robustness. 1/4
#compression
#privacy
#learning
We will be at the
#NeurIPS2020
WiML and Deep Learning through Information Geometry workshops with our work on neural network compression for noisy storage systems:
We are thrilled to announce that the
#DMLRWorkshop
on "Datasets for Foundation Models" will take place at the
@icmlconf
in July!
This marks the 5th edition of our
#DMLR
workshop series! Join the DMLR community at
We are excited to announce that Workshop on Information-Theoretic Methods for Rigorous, Responsible, and Reliable Machine Learning will take place
@icmlconf
. We have an excellent lineup of speakers, including a recent Shannon Award winner!
More details:
"Neural Network Compression for Noisy Storage Devices" will appear at the ACM Transactions on Embedded Computing Systems (TECS):
We propose ways to provide robustness to neural networks against noise present in storage or communication environments.
1/3
Scaling Laws for Downstream Task Performance of Large Language Models
Studies how the choice of the pretraining data and its size affect downstream cross-entropy and BLEU score
Scaling Laws for Downstream Task Performance of Large Language Models
paper page:
Scaling laws provide important insights that can guide the design of large language models (LLMs). Existing work has primarily focused on studying scaling laws for
Registration and poster abstract submissions for the Stanford Compression Workshop 2021 are now being accepted!
Date: 25-26th February 2021
Website:
Poster abstract submission deadline: 21 Feb 2021
Lottery Ticket Adaptation (LoTA) is a new adaptation method that achieves best-in-class performance on challenging tasks, mitigates catastrophic forgetting, and enables model merging across different tasks.
Paper:
Code:
Tomorrow at the FLOW seminar, I will talk about our
@iclr_conf
2023 paper "Sparse Random Networks for Communication-Efficient Federated Learning".
Looking forward to your feedback and questions. 🙌
📢: The 99th FLOW talk is on Wednesday (22nd March) at **5 pm UTC**.
Berivan Isik (Stanford) will discuss "Sparse Random Networks for Communication-Efficient Federated Learning."
Sign up for our mailing list:
@tianle_cai
Very cool work! 💫 We have a NeurIPS 2023 workshop paper with a similar idea and observations: the delta between the finetuned and pretrained model is extremely compressible with quantization, and even with simple magnitude-based sparsification:
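A toy sketch of the idea (hypothetical weights and a simple top-k magnitude sparsifier; not the exact procedure from the paper):

```python
import numpy as np

def sparsify_delta(pretrained, finetuned, keep_ratio=0.01):
    """Keep only the largest-magnitude entries of the finetuning delta."""
    delta = finetuned - pretrained
    k = max(1, int(keep_ratio * delta.size))
    # threshold = k-th largest absolute value in the delta
    thresh = np.partition(np.abs(delta).ravel(), -k)[-k]
    return np.where(np.abs(delta) >= thresh, delta, 0.0)

rng = np.random.default_rng(0)
w0 = rng.normal(size=(256, 256))             # "pretrained" weights (toy)
w1 = w0 + 0.01 * rng.normal(size=w0.shape)   # "finetuned" weights (toy)
d = sparsify_delta(w0, w1, keep_ratio=0.01)
print(np.mean(d != 0))  # fraction of entries kept, ≈ 0.01
```

Storing only this sparse delta (plus the shared pretrained checkpoint) is what makes the finetuned model cheap to keep around.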
The framework consists of a neural pre- and post-processor with a standard video codec between them. The networks are trained jointly to optimize a rate-distortion loss function with the goal of significantly improving over the standard codec in various compression scenarios. 2/3
We can reach up to 99% sparsity without any costly mask-search procedure. The mask comes from just one dense training step on a small fraction of the dataset, followed by magnitude thresholding to find the most important weights for the task. This way,
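In code, the mask-finding and sparse update steps might look roughly like this (toy NumPy sketch; the function names and the stand-in "one dense step" weights are illustrative):

```python
import numpy as np

def magnitude_mask(w, sparsity=0.99):
    """Binary mask keeping the top (1 - sparsity) fraction by magnitude."""
    k = max(1, int((1.0 - sparsity) * w.size))
    thresh = np.partition(np.abs(w).ravel(), -k)[-k]
    return (np.abs(w) >= thresh).astype(w.dtype)

def masked_step(w, grad, mask, lr=0.1):
    """Sparse finetuning step: only the masked weights get updated."""
    return w - lr * grad * mask

rng = np.random.default_rng(0)
w = rng.normal(size=(128, 128))   # weights after one short dense step (toy)
mask = magnitude_mask(w, sparsity=0.99)
w_new = masked_step(w, rng.normal(size=w.shape), mask)
print(mask.mean())  # mask density ≈ 0.01
```

All weights outside the mask stay exactly at their pretrained values, which is what keeps the resulting task vector sparse.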
@miniapeur
There is a (not very tight) upper bound on the output distortion when pruning a single connection, which helps with adjusting layer-wise sparsity in a greedy manner:
LoTA is also incredibly helpful for model merging. Existing model merging methods mostly do post-hoc sparsification to their dense adapters, which usually hurts the performance. LoTA does not require this post-hoc sparsification since the task vectors are already sparse. 4/5
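A minimal illustration of why already-sparse task vectors merge cleanly (toy vectors with disjoint supports; names are assumptions, not the paper's code):

```python
import numpy as np

def merge_task_vectors(base, task_vectors):
    """Merge by summing sparse task vectors onto the base weights."""
    merged = base.copy()
    for tv in task_vectors:
        merged += tv
    return merged

base = np.zeros(1000)
tv1 = np.zeros(1000); tv1[:10] = 0.5      # sparse update for task 1
tv2 = np.zeros(1000); tv2[10:20] = -0.3   # sparse update for task 2
merged = merge_task_vectors(base, [tv1, tv2])

# Disjoint supports -> each task's update survives merging untouched
overlap = np.mean((tv1 != 0) & (tv2 != 0))
print(overlap)  # 0.0
```

With dense adapters, the supports fully overlap and the sums interfere; here nothing needs to be thrown away post hoc.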
LoTA successfully mitigates catastrophic forgetting since sparse updates overlap less than dense or LoRA updates. We can push this even further by restricting the updates of future tasks to weights that do not overlap with those of previous tasks, eliminating interference between
We also developed a novel model compression method (called SuRP), guided by this information-theoretic formulation, which indeed outputs a sparse model without an explicit pruning step.
Come say hi during our poster sessions if you're interested:
Monday 12:30-2:30 pm PST (WiML)
Wednesday 4-5 am PST (WiML)
Saturday 5-6:30 pm PST (DL-IG)
@united
agent after 17 hours: The bags are with Swiss Airlines in your departure point
- I didn’t fly with Swiss Airlines
@united
: I know, but they hold your bags because your flight was canceled
- My flight was not canceled?!
What’s going on?
@united
@FrancescoPase
@DenizGunduz1
@sanmikoyejo
We show that there exist highly natural choices of pre-data distribution (side information at the server) and post-data distribution (local updates at the clients) in FL that can be used to reduce the communication cost significantly -- by up to 50x compared to the baselines. 2/3
Compared to other neural video compression methods, the sandwich framework is much more efficient as it requires pre- and post-processors formed by modestly-parameterized, lightweight networks.
Joint work with Philip A. Chou, Onur Guleryuz, Danhang Tang, and Jonathan Taylor. 3/3
TLDR: The size of the finetuning dataset and the distribution alignment between the pretraining and downstream data significantly influence the scaling behavior. 3/6
We propose Federated Probabilistic Mask Training (FedPM) that does not update the randomly initialized weights at all. Instead, FedPM freezes the weights at their initial random values and learns how to sparsify the random network for the best performance. 2/6
I will present two papers at the Federated Learning Workshop:
1) Exact Optimality of Communication-Privacy-Utility Tradeoffs in Distributed Mean Estimation:
2) Communication-Efficient Federated Learning through Importance Sampling:
We derived the information-theoretic limit of model compression and showed that this limit can only be achieved when the reconstructed model is sparse (pruned).
@srush_nlp
Cross-entropy (CE) loss always improves with more pretraining data, regardless of the degree of alignment. But BLEU/COMET/ROUGE scores on the downstream task sometimes drop with more pretraining data when alignment is not sufficient.
@FrancescoPase
@DenizGunduz1
@sanmikoyejo
We also show how to adaptively adjust the bitrate across the model parameters and training rounds to achieve the fundamental communication cost -- the KL divergence between the pre-data and post-data distributions. 3/3
@abeirami
@savvyRL
And there is a simple way to boost the student’s performance by pruning the teacher network before distilling (which acts as a regularizer):
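A rough sketch of that recipe (toy logits and a standard temperature-softened target; the pruning level and names are illustrative, not the paper's exact setup):

```python
import numpy as np

def magnitude_prune(w, sparsity=0.5):
    """Zero out the smallest-magnitude fraction of the teacher's weights."""
    k = int(sparsity * w.size)
    if k == 0:
        return w.copy()
    thresh = np.partition(np.abs(w).ravel(), k - 1)[k - 1]
    return np.where(np.abs(w) > thresh, w, 0.0)

def soft_targets(logits, temperature=2.0):
    """Softened teacher outputs used as distillation targets."""
    z = logits / temperature
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

rng = np.random.default_rng(0)
teacher_w = rng.normal(size=(16, 10))
pruned_w = magnitude_prune(teacher_w, sparsity=0.5)  # regularized teacher
x = rng.normal(size=(4, 16))
targets = soft_targets(x @ pruned_w)                 # student fits these
print(np.mean(pruned_w == 0))  # ≈ 0.5
```

The student then trains against `targets` as usual; the pruning acts as the regularizer on the teacher side.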
FedPM reduces the communication cost to less than 1 bit per parameter (bpp), reaches higher accuracy with faster convergence than the relevant baselines, outputs a final model with size less than 1 bpp, and can potentially amplify privacy. 4/6
However, there are also cases where moderate misalignment causes the BLEU score to fluctuate or get worse with more pretraining, whereas downstream cross-entropy monotonically improves. 5/6
To this end, the clients collaborate in training a stochastic binary mask to find the optimal sparse random network within the original one. At the end of the training, the final model is a sparse network with random weights – or a subnetwork inside the dense random network. 3/6
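A toy sketch of this mask-over-frozen-weights idea (NumPy, assumed names; the real FedPM aggregates mask probabilities across clients, which is omitted here):

```python
import numpy as np

def sample_mask(probs, rng):
    """Sample a binary mask from per-weight Bernoulli probabilities."""
    return (rng.random(probs.shape) < probs).astype(np.float32)

def masked_forward(x, frozen_w, probs, rng):
    """Forward pass: frozen random weights, sparsified by a sampled mask."""
    return x @ (frozen_w * sample_mask(probs, rng))

rng = np.random.default_rng(0)
frozen_w = rng.normal(size=(8, 4)).astype(np.float32)  # never updated
probs = np.full((8, 4), 0.2)                           # learned scores (toy)
x = rng.normal(size=(2, 8)).astype(np.float32)
y = masked_forward(x, frozen_w, probs, rng)
print(y.shape)  # (2, 4)
```

Only `probs` would be trained and communicated; the weights themselves stay at their random initialization, which is what drives the bitrate below 1 bit per parameter.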
- [ICLR DMFM & ME-FoMo] Scaling Laws for Downstream Task Performance of Large Language Models:
- [ICLR SeT LLM, Me-FoMo, R2-FM, PML4LRS] On Fairness Implications and Evaluations of Low-Rank Adaptation of Large Models:
2/2
Throughout the manuscript, we highlighted the advantages of a stochastic mask-training approach over a deterministic one in terms of accuracy, bitrate, and privacy. 5/6
We use an analog storage technology (PCM) as an example to show that the noise added by the PCM cells is detrimental to the performance of neural networks and that we can recover full accuracy with our robust coding strategies.
2/3
We study the mean estimation problem under communication and local differential privacy constraints. As opposed to the order-optimal solutions in prior work, we characterize exact optimality conditions and develop an algorithm that is exact-optimal for a large family of codebooks.
We study the scaling behavior in a transfer learning setting, where LLMs are finetuned for translation tasks, and investigate how the choice of the pretraining data and its size affect downstream performance as judged by two metrics: downstream cross-entropy and BLEU score. 2/6
We investigated the theoretical tradeoff between the compression ratio and the output perturbation of neural network models, and found that the rate-distortion-theoretic formulation provides a theoretical foundation for pruning. 2/6