Does One Large Model Rule Them All?
New post with
@matei_zaharia
and
@ericschmidt
on the future of the AI ecosystem.
Our key question: does the rise of large, general AI models mean the future AI ecosystem will be dominated by a single general model? ⬇️
A Survey of Deep Learning for Scientific Discovery
To facilitate the use of DL in science, we survey a broad range of deep learning methods, new research results, implementation tips & many links to code/tutorials
Paper
Work with
@ericschmidt
Thread⬇️
Do Vision Transformers See Like Convolutional Neural Networks?
New paper
The success of Transformers in computer vision prompts a fundamental question: how are they solving these tasks? Do Transformers act like CNNs, or learn very different features?
Thrilled to share that I successfully defended my PhD today!!
This milestone wouldn't have been possible without the support and guidance of my collaborators, mentors, friends and family -- thank you so much!!! Thanks also to everyone who attended my (virtual) defense!
New blog post: "Reflections on my (Machine Learning) PhD Journey"
2020 has marked the end of my six year PhD journey, filled with struggles, success and evolution of personal & research perspectives. In the post I share experiences and lessons learned ⬇️
After almost 6.5 years, I left Google Brain earlier this month.
It's been an incredible journey of gaining insights on many exciting areas of machine learning, and watching the field grow and evolve (so much, so quickly!)
A few months ago I left Google Brain to pursue my next adventure: building
@samaya_AI
! We're excited to bring the latest AI advances to the Knowledge Discovery process!
Many of these trends don't hold. Last week we celebrated
@geoffreyhinton
's retirement, and a few weeks earlier saw
@kkariko
receive the Nobel Prize. Their research took decades to come together, and had enormous impact at a world scale. We'd be much worse off if they'd pivoted!
Enjoyed visiting UC Berkeley’s Machine Learning Club yesterday, where I gave a talk on doing AI research. Slides:
In the past few years I’ve worked with and observed some extremely talented researchers, and these are the trends I’ve noticed:
1. When
We're releasing tutorials on our work using CCA to compare and probe representations in deep neural networks: There are Jupyter notebooks overviewing the technique, descriptions of results, and discussions of open problems. We hope this is a useful resource!
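For anyone wanting to experiment beyond the notebooks, the core computation is small. A minimal sketch (illustrative only, not the released tutorial code; function name is my own) of the mean CCA similarity between two layers' activations:

```python
import numpy as np
from scipy.linalg import svdvals

def cca_similarity(X, Y):
    """Mean CCA correlation between two representations.

    X, Y: (num_datapoints, num_neurons) activation matrices from two
    layers, evaluated on the same inputs (num_datapoints >> num_neurons).
    """
    # Center each neuron's activations.
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    # Orthonormal bases for each representation's column space.
    Qx, _ = np.linalg.qr(X)
    Qy, _ = np.linalg.qr(Y)
    # Singular values of Qx^T Qy are the canonical correlations.
    rho = svdvals(Qx.T @ Qy)
    return rho.mean()
```

A useful sanity check: the score is invariant to any invertible linear transform of either representation, which is exactly why CCA is attractive for comparing networks whose neurons have no canonical alignment.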
Our paper on Understanding Transfer Learning for Medical Imaging has been accepted to
#NeurIPS2019
!!
Preprint:
As a positive datapoint: we had a good reviewing experience, with detailed feedback and mostly useful comments. Thanks to the Program Chairs!
Delighted our new paper "Anatomy of Catastrophic Forgetting: Hidden Representations and Task Semantics" just won Best Paper at the Continual Learning Workshop at
#ICML2020
!!
Paper:
Oral *tomorrow*, details at:
⬇️ Paper thread
LaMDA: Language Models for Dialogue Applications
Paper:
Blogpost:
Excited to see this paper come out! I enjoyed the Weddell seal conversation with LaMDA in our 2021 research summary blogpost!
Do Vision Transformers See Like Convolutional Neural Networks?
New paper
The success of Transformers in computer vision prompts a fundamental question: how are they solving these tasks? Do Transformers act like CNNs, or learn very different features?
Happy to share our paper on ViTs and CNNs was accepted to
#NeurIPS2021
!
Our other two submissions this year were rejected. I still think they have some great results and am looking forward to improving the papers with the received feedback.
Pointer Value Retrieval: A new benchmark for understanding the limits of neural network generalization
We introduce a rich family of tasks with a pointer-value rule, to study mechanisms of NN generalization, from memorization to reasoning.
Paper:
Do Wide and Deep neural networks Learn the Same Things?
Paper:
We study representational properties of neural networks with different depths and widths on CIFAR/ImageNet, with insights on model capacity effects, feature similarity & characteristic errors
Do wide and deep neural networks learn the same thing? In a new paper () with
@maithra_raghu
and
@skornblith
we study how width and depth affect learned representations within and across models trained on CIFAR and ImageNet. 1/6
"Transfusion: Understanding Transfer Learning with Applications to Medical Imaging"
The benefits of transfer are nuanced. With *no* feature reuse and only the pretrained weight scaling, we can regain the effects of transfer. More findings in the paper!
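The weight-scaling idea can be sketched as: sample fresh random weights that match only the per-layer mean and standard deviation of the pretrained weights, discarding the learned features themselves. This is an illustrative reconstruction (the helper name and the choice of normal sampling are my assumptions, not the paper's exact procedure):

```python
import numpy as np

def meanvar_init(pretrained_weights, rng=None):
    """Re-initialize each weight tensor i.i.d. from a normal distribution
    matching the per-layer mean and std of the pretrained weights --
    keeping the pretrained *scaling* while discarding the features."""
    rng = np.random.default_rng(rng)
    return [rng.normal(W.mean(), W.std(), size=W.shape)
            for W in pretrained_weights]
```

The point of the experiment: if a network initialized this way converges like a transferred network, the benefit came from the weight statistics, not from feature reuse.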
Excited to share the
@icmlconf
2022 Workshop on Knowledge Retrieval and Language Models
Please consider submitting!
We welcome work across topics including LM grounding, open-domain Q&A, bias in retrieval, analyses of scale, transfer and LM phenomena.
New blogpost on citation trends in
@NeurIPSConf
and
@icmlconf
: I scraped paper citations and studied topic trends, citation distributions and academia/industry splits. Releasing scraper, data and a tutorial!
Post:
Code/Data:
Super exciting to see AI achieving a Silver Medal at IMO today, both as an AI researcher and more personally, as someone who spent many years competing in math Olympiads.
Some quick (possibly controversial!) thoughts:
ICLR Town: Pokemon-esque environment to wander around and bump into people, which syncs almost seamlessly with video-chatting capabilities.
What a fun idea for virtual (research) conferences! Thanks
@iclr_conf
organizers!!
#ICLR2020
#iclr
(Uses )
Rapid Learning or Feature Reuse?
New paper:
We analyze MAML (and meta-learning more broadly), finding that feature reuse is the critical component in the efficient learning of new tasks -- leading to some algorithmic simplifications!
Rapid Learning or Feature Reuse? Meta-learning algorithms on standard benchmarks have much more feature reuse than rapid learning! This also gives us a way to simplify MAML -- (Almost) No Inner Loop (A)NIL. With Aniruddh Raghu
@maithra_raghu
Samy Bengio.
Excited to see this article by
@QuantaMagazine
overviewing the development of the Vision Transformer, insights on how it works, and promising new applications!
Headed to
#ICML2022
for the first in-person conference since pre-covid! Looking forward to exciting ML discussions with old & new friends
Our workshop on Knowledge Retrieval and Language Models is on Friday 22nd. Do stop by (or tune in online)!
@icmlconf
Excited to attend
#NeurIPS2020
!
My amazing collaborator
@thao_nguyen26
**who is applying to PhD programs this year** will be presenting Do Wide and Deep Neural Nets Learn the Same Things? at
@WiMLworkshop
posters *today* & in Inductive Biases Workshop
Very excited about our latest preprint: , joint work with
@arimorcos
and Samy Bengio. We apply Canonical Correlation (CCA) to study the representational similarity between memorizing and generalizing networks, and also examine the training dynamics of RNNs.
Do different networks learn similar representations to solve the same tasks? How do RNN representations evolve over training? What can representational similarity tell us about generalization? Using CCA,
@arimorcos
and
@maithra_raghu
try to find out!
My entry to
#MachineLearning
(from another field) wouldn't have happened without
#NIPS2014
. But the reason I went, and found a welcoming community was due to
#WiML2014
. Now
#WiML2018
's organizer call is open . Apply by 25/03! The impact can't be overstated.
How do representations evolve as they go through the transformer? How does the Masked Language Model objective affect these compared to Language Models? How much do different tokens change and influence other tokens?
Answers in the paper by
@lena_voita
: !
How does transfer learning for medical imaging affect performance, representations and convergence? Check out the blogpost below and our
#NeurIPS2019
paper for some of the surprising conclusions, new approaches and open questions!
How does transfer learning for medical imaging affect performance, representations and convergence? In a new
#NeurIPS2019
paper, we investigate this across different architectures and datasets, finding some surprising conclusions! Learn more below:
Presenting this at
@iclr_conf
*today*!
Talk and Slides:
Poster Sessions: (i) 10am - 12 Pacific Time, (ii) 1pm - 3pm Pacific Time
Thanks to the organizers for a *fantastic* virtual conference, hope to see you there!
#iclr
#ICLR2020
Rapid Learning or Feature Reuse? Meta-learning algorithms on standard benchmarks have much more feature reuse than rapid learning! This also gives us a way to simplify MAML -- (Almost) No Inner Loop (A)NIL. With Aniruddh Raghu
@maithra_raghu
Samy Bengio.
Motivating the Rules of the Game for Adversarial Example Research:
Fantastic and nuanced position paper by
@jmgilmer
@ryan_p_adams
@goodfellow_ian
on better bridging the gap between research on adversarial examples and realistic ML security challenges.
First foray into Deep RL! We test on a game with continuously tuneable difficulty and a *known* optimal policy. We study different RL algorithms, supervised learning, and multiagent play.
@jacobandreas
Our paper on using Machine Learning (Direct Uncertainty Prediction) for predicting doctor disagreements and medical second opinions will be at
@icmlconf
next week!
Blog:
Paper:
#icml2019
#DeepLearning
Probably the best we can do is be masters of our craft (know the field well, write good code, collaborate, bring energy & challenge ourselves), and be *brave* --- take risks and try things, even when they're hard, don't get external validation, and have uncertain outcomes.
Really enjoyed this discussion with
@jaygshah22
on our work on exploring neural network hidden representations, our recent paper on ViTs and CNNs, and PhD experiences + the ML research landscape!
Video:
In a chat with
@maithra_raghu
, Sr. Research Scientist at
@GoogleAI
about analyzing internal representations of
#DeepLearning
models, comparing vision transformers and CNNs, how she developed her interest in ML, and useful tips for researchers/PhD students!
Had a fantastic week learning about exciting research directions and meeting old and new friends at
#NeurIPS2019
. Thanks to the organizers, volunteers and participants for a wonderful conference!
My talk at
#ML4H
is at (~44 mins), and posters below!
Looking forward to attending
#ICLR2021
next week! We're presenting three papers on questions exploring neural network representations, properties of training and algorithms for helping the learning process.
And at last(!!) Google's response to ChatGPT
Excited to see Google putting some of these advances out, especially after many years seeing first-hand the development of LaMDA and other AI technology.
Delighted to be named one of this year's
#STATWunderkinds
for our work on machine learning in medicine:
Grateful to my collaborators and mentors for their advice and support throughout!
@statnews
On AGI and Self-Improvement
With
@ericschmidt
Questions on AGI are at the heart of the debate on AI capabilities & risks. To get there, AI must learn "on the fly". We outline definitions of AGI, explore this gap, and examine the crucial role of *self-improvement*
It is usually very hard to predict *true* breakthroughs, which are often *novel* and have high impact. The novelty means it's a slow process for the work to be recognized as a breakthrough, and it can be a long and lonely road in the meantime
A blogpost I wrote on our paper SVCCA, at
#nips2017
! With Justin Gilmer,
@jasonyo
@jaschasd
-- hoping many people will try it out on their networks with the open source code:
In order to build better and more robust DNN-based systems, one must be able to effectively interpret the models. We introduce a simple and scalable method to both compare and interpret the representations learned by DNNs
Looking forward to speaking about Artificial and Human Intelligence in Healthcare at the
#OReillyAI
conference ! Will discuss developing better AI systems and human expert interactions:
Furthermore, a lot of best research practices are determined by the maturity and state of the field. Right now, in LLM research, it's important to write good code and have good infra. That was hardly the case earlier in deep learning, when we barely had libraries for autodiff!
Bellairs. Day 5
@HazanPrinceton
and myself: double feature on controls+RL. +spotlights:
@maithra_raghu
: meta-learning as rapid feature learning. Raman Arora: dropout, capacity control, and matrix sensing .
@HanieSedghi
: module criticality and generalization! And that is a wrap!🙂
Exploring the AI Landscape:
New blog by
@bclyang
and me! We'll be covering topics in AI from fundamental research to considerations for deployment.
Our first post: is on Digital Health and AI for Health, a longstanding interest!
Excited to be speaking at REWORK's Deep Learning in Healthcare summit!
#reworkHEALTH
I'll be speaking about our work on Direct Uncertainty Prediction for Medical Second Opinions:
An analysis of self-attention reveals some reasons for this difference: very early ViT layers learn to incorporate local and *global* spatial information, unlike CNN early layers with their smaller receptive field size.
@geoffreyhinton
often wrote quick MATLAB code and even computed gradients by hand! (I was always inspired that even at that level of seniority, he could quickly prototype his own ideas!)
Heading to
#NeurIPS2018
this week! Looking forward to meeting old friends and new! Let me know if you'll be around and want to chat.
@arimorcos
and I will be presenting our paper on the Wednesday poster session, hope to see you there!
🎉🌐 Big news from
@samaya_AI
. We have two shiny new offices in
#London
&
#MountainView
🏢, staffed with an incredible team of brilliant minds💡🚀. Check out our freshly launched website at 🌟
This article, on the lack of an AI moat at Google and OpenAI has been making the rounds:
While it's true that there is exciting, fast-paced open-source activity in AI, and we may see many current LLMs commoditize, there are still *quality moats*
I'm deeply saddened to hear about the passing of
@SusanWojcicki
We met just a couple months back, and she offered sage advice on running a company, even giving feedback on our new product features. I was struck by her insight, her groundedness and her warmth. Sending her family
AI winning IMO gold would be impressive, but an AI coming up with IMO *questions* would be even more impressive to me.
Can it understand and use different theorems intelligently to come up with hard, creative and truly new questions? Can it do this consistently?
Excited to be speaking at
@reworkdl
deep learning summit today , and Stanford's HealthAI
@ai4healthcare
hackathon tomorrow!
What with the ICML deadline just wrapping up, it's been a busy week 😅
Very interesting work on identifying, understanding and reconstructing the representations learned by neural networks!
(I've also enjoyed
@distillpub
's "Building Blocks of Interpretability" and "Zoom In" which this work builds on)
Excited to share a new paper, Curve Circuits
We reverse engineer a non-trivial 50k+ parameter learned algorithm from the weights of a neural network and use its core ideas to craft an artificial artificial neural network from scratch that reimplements it
Very exciting work by
@matei_zaharia
@alighodsi
and quite literally all of
@databricks
(who created the dataset!)
Lots of interesting followup questions from this --- how well can we use this to bootstrap synthetic data, etc.
Free Dolly! Introducing the first *commercially viable*, open source, instruction-following LLM. Dolly 2.0 is available for commercial applications without having to pay for API access or sharing data with 3rd parties.
I've been enjoying reading
@beenwrekt
's posts on
#ReinforcementLearning
: (new post today!), and it's great to see these insights come together in paper format!
What are the limits to the generalization of large pretrained transformer models?
We find minimal fine-tuning (~0.1% of params) performs as well as training from scratch on a completely new modality!
with
@_kevinlu
,
@adityagrover_
,
@pabbeel
paper:
1/8
Although there were ups and downs, I'm deeply grateful to the many rich experiences during my PhD, and hope this blogpost might be helpful to others on the journey.
Wishing everyone a happy new year!!
With the rapid pace of progress in Machine Learning, it's hard not to feel publication pressure during the PhD. But while writing papers is important, the main research goal of the PhD (to me at least!) is to make you an independent researcher, with a rich research vision
But attending locally is also very important! It is automatically encoded in CNNs, but larger ViTs only learn to do this with enough data (which they also need for their strong performance).
Another research update: Final version of our
#nips2017
@NipsConference
paper SVCCA: with accompanying code:(!!) We look at deep learning dynamics and interpret the latent representations. With Justin Gilmer,
@jasonyo
,
@jaschasd
Looking forward to speaking at
@RAAISorg
this Friday! Many exciting ML research areas, from health to privacy to bioengineering. Details on the talks, research and speakers at:
It was awesome having
@samaya_AI
as part of the first batch of AI Grant companies! Grateful to
@natfriedman
and
@danielgross
for creating an energizing community for AI-native products. Consider applying!
Using local and global info allows ViT's earlier layers to learn better representations, which are strongly propagated through residual connections. Surprisingly, ViT has stronger residual connections than ResNet! These help explain the uniform structure of ViT representations
Looking forward to heading to
#NeurIPS2023
next week! This year marks a decade(!) of attending NeurIPS!
It's remarkable to see how much the field has advanced in 10 years!
These past 2 years of building
@samaya_AI
has been incredible, and we are continuing to grow!
Thanks to
@atJustinChen
and
@statnews
for an in-depth followup discussion on our research work and motivations.
We talk about neural networks, techniques to better understand them, and ways this can inform their design and usage as assistive tools.
We believe not! The future ecosystem will be rich, with a set of *Specialized AI Systems* and a few *General AI Models*, with many entities participating.
Specialized AI Systems are developed for well-defined, high-value workflows, while General AI Models tackle a heavy tail of uses
Sending best wishes to friends, former colleagues and the team at
@OpenAI
. You've made incredible, world changing contributions to AI, and it was sad to see the developments of the past few days. Wishing you the best in navigating these transitions.
Using representational similarity measures, we investigate the internal structure of the two architectures, finding striking differences, with ViT having a much more uniform representation across all layers
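One widely used representational similarity measure for this kind of layer-by-layer comparison is centered kernel alignment (CKA). A minimal linear-CKA sketch, assuming activations are gathered as (datapoints × neurons) matrices (an illustration, not the paper's exact code):

```python
import numpy as np

def linear_cka(X, Y):
    """Linear Centered Kernel Alignment between two activation matrices.

    X: (n, p1), Y: (n, p2) -- activations of two layers on the same
    n inputs. Returns a similarity in [0, 1]; 1 means identical up to
    an orthogonal transform and isotropic scaling.
    """
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    # HSIC-style alignment of the two linear (Gram) kernels.
    num = np.linalg.norm(Y.T @ X, "fro") ** 2
    den = np.linalg.norm(X.T @ X, "fro") * np.linalg.norm(Y.T @ Y, "fro")
    return num / den
```

Computing this for every pair of layers in two networks gives the similarity heatmaps behind statements like "ViT representations are more uniform across depth."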
Some good news: the recent ruling forcing international students to choose between leaving the country and safety (taking online classes) has been rescinded:
To me, a big surprise of the PhD was how much it really is a journey, with evolving perspectives (both personal and research) affecting interest in specific problems, research directions and broader subfields. Importantly, it's hard to predict this evolution going in!
Thanks so much to the organizers and
@MITEECS
for hosting the EECS Rising Stars 2018! Entertaining, insightful and inspiring discussion by panelists and speakers on research and academia, and a truly unique opportunity to meet my fantastic peers across all of EECS!
It takes a lot of technical knowledge, effort & iteration to build high-quality AI systems for specific, valuable uses.
So while "base models" may commoditize (also discussed in ), there are plenty of opportunities for moats around focused, high-value AI products.
Totally agree. Public criticism disproportionately impacts the graduate student leading the project, and ML publishing is already very high pressure. Twitter also isn't the right place for a nuanced scientific discussion.
I realize this is seemingly an unpopular opinion, but I can't get onboard with these Twitter criticisms of some of the recent
#ICML2022
best paper awardees. I've been thinking about this all day. A thread... 🧵 1/N
Today is my first day as a CTO (and co-founder) of
@samaya_AI
.
The last 4 years at FAIR have been incredible.
Now I'm looking forward to bringing the latest knowledge discovery technologies to market!
We provide links to incredible resources developed by the community: software packages & high level APIs, freely available DL tutorials, sites with summaries/discussions/code of new research, repositories of DL pipelines & pretrained models, data curation & analysis packages
I've gained a lot from the interesting paper links, tutorials and code releases posted on Twitter. However it's important to recognise the drawbacks of the filter bubble: Particularly poignant: "Algorithms know what you've been, not what you want to be."
New paper Teaching with Commentaries
We introduce commentaries, metalearned information to help neural net training & give insights on learning process, dataset & model representations
Led by
@RaghuAniruddh
& w/
@skornblith
@DavidDuvenaud
@geoffreyhinton
Intriguing invited talk at
#DeepPhenomena
from Chiyuan Zhang on the effect of resetting different layers: Are all layers created equal?
#ICML2019
@icmlconf
Enjoyed speaking at
@RealAAAI
workshop on Learning Network Architectures During Training:
I overviewed our work on techniques to gain insights from neural representations for model & algorithm design.
All talk videos are on the workshop page above! ⬆️
@karpathy
I usually mute all notifications and put on do not disturb. Sometimes takes me a little longer to respond to things, but the mental space is worth it :)