Gowthami Somepalli Profile Banner
Gowthami Somepalli Profile
Gowthami Somepalli

@gowthami_s

Followers
7,522
Following
1,003
Media
311
Statuses
2,721

Grad student @UMDCS . Past: @AIatMeta , @AmazonScience , @IITMadras . Works on multimodal understanding and generation. GPU poor. She/her.

Palo Alto, CA
Joined April 2015
Don't wanna be here? Send us removal request.
Explore trending content on Musk Viewer
Pinned Tweet
@gowthami_s
Gowthami Somepalli
2 months
Cinepile got the Best Paper Award at SynData4CV workshop at CVPR! 🥹 It’s available on @huggingface ! You should check it out. Link:
Tweet media one
@gowthami_s
Gowthami Somepalli
4 months
📣 Happy to introduce, CinePile, a long video QA dataset and benchmark! 300k train and 5k test split. A 🧶. (1/9) 📃: 🤗: #MachineLearning
Tweet media one
9
48
280
9
9
194
@gowthami_s
Gowthami Somepalli
1 month
Pleasant surprise while reading Llama 3 paper! It’s my memorization paper. 🙂 #llama3
Tweet media one
22
18
936
@gowthami_s
Gowthami Somepalli
14 days
Crossed 1000 citations! I came to US to do a Masters as a result of quarter-life crisis. Gowthami from 5 years ago would’ve laughed at the idea of doing a PhD let alone publish and cross 1k citations! I thank folks at UMD for encouraging me on this journey, especially my
Tweet media one
56
10
920
@gowthami_s
Gowthami Somepalli
2 years
My parents: Wake up in the morning to beat the traffic on the roads. Me: Wake up in the morning to beat the traffic on the GPU cluster. #phdlife
7
29
782
@gowthami_s
Gowthami Somepalli
2 years
All the awesome papers that came out this month are making it extra hard to focus. 😅 #phdlife #gradschool
Tweet media one
4
55
662
@gowthami_s
Gowthami Somepalli
2 years
Yesterday I completed 50% of the #30daysofDiffusion goal. I learned a lot, however, it's an information overload as well. I am going to take a couple of weeks' break and then resume. Access all the paper &tweet links here - #Diffusion #MachineLearning
Tweet media one
15
64
458
@gowthami_s
Gowthami Somepalli
3 years
How to train a SOTA ViT without pre-training or adding strong augmentations? Just use SAM! Check out my summary of the #ICLR22 paper by @XiangningChen , Cho-Jui Hsieh, @BoqingGo . Paper: Blog: #DeepLearning #MachineLearning
Tweet media one
2
79
408
@gowthami_s
Gowthami Somepalli
3 years
How does the choice of loss affect the transferability of classification models? Check out my summary of the #NeurIPS paper by @skornblith et al. Paper: Blog: #DeepLearning #MachineLearning
Tweet media one
4
81
407
@gowthami_s
Gowthami Somepalli
2 years
One of my resolutions this year is to read more papers. I decided to read 30 papers on #diffusion models this Jan and do brief tweet threads on them. This is the current schedule. Suggestions are welcome to fill the empty slots! #NewYearResolution #MachineLearning
Tweet media one
25
23
391
@gowthami_s
Gowthami Somepalli
2 years
InstructPix2Pix: Edit an image using text guidance using a single forward pass. Why use any inversion or other stuff,just create a dataset using inversion techniques and train a new model. A 🧶 Paper: Day 8 #30daysofDiffusion #Diffusion #MachineLearning
Tweet media one
8
66
331
@gowthami_s
Gowthami Somepalli
1 year
Summer internship in NYC🗽 Expectation: Socialize, soak up the culture, visit museums, shows, etc. Reality: Start project, a broadway show, get scooped, CVPR, work on new idea, Neurips reviews, debug, midterm talk, debug, Neurips rebuttal, final talk. The end. #phdlife 🤦‍♀️
2
2
306
@gowthami_s
Gowthami Somepalli
9 months
I passed the preliminary exam! 😄 Off to #NeurIPS2023 next! #phdlife
Tweet media one
Tweet media two
24
4
292
@gowthami_s
Gowthami Somepalli
6 months
ML research is following the exact same trend now, tweak fast and preprint just to beat others!! The number of preprints are skyrocketing while we still feel like we aren’t learning anything new while every question seems to be answered. Can we slow down on publications and try
@dwarkesh_sp
Dwarkesh Patel
6 months
"I wonder for people in their 20s if they shouldn't go to San Francisco. The entrepreneurs are held in excessively high regard in my view. I think that San Francisco doesn't really encourage the pursuit of really deep technical depth." - @patrickc Full episode out tomorrow
47
114
2K
7
26
287
@gowthami_s
Gowthami Somepalli
4 months
📣 Happy to introduce, CinePile, a long video QA dataset and benchmark! 300k train and 5k test split. A 🧶. (1/9) 📃: 🤗: #MachineLearning
Tweet media one
9
48
280
@gowthami_s
Gowthami Somepalli
7 months
For a small fee, I will attend your enemy's job talk and ask "Did you hyperparameter tune the baselines you are showing against your models?"
@rajammanabrolu
Prithviraj (Raj) Ammanabrolu
7 months
For a small fee, I will attend your enemy's job talk and ask "isn't this just fancy prompt engineering on GPT-4?"
4
3
107
1
17
275
@gowthami_s
Gowthami Somepalli
2 years
TLDR: Depending on the architecture, the degree of similarity changes. For eg. Wide ResNets have more reproducible decision regions than a ViT! To appear in CVPR! #DeepLearning
@_akhaliq
AK
2 years
Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective abs:
Tweet media one
2
38
302
6
40
266
@gowthami_s
Gowthami Somepalli
2 years
+100 on this comment. The research opportunities for an undergraduate are few and far apart in many countries. And many (including me) did not know where to even begin. It scares me how people from underprivileged backgrounds do not even get a chance these days.
@sarahookr
Sara Hooker
2 years
If you have multiple papers before you even began a PhD, it likely means you had access that others didn't. I wish more PhD programs would take a step back and stop this absurd practice of favoring multiple papers before someone even begins a training program.
78
306
3K
4
11
252
@gowthami_s
Gowthami Somepalli
2 years
Imagic: First learn the embedding that represents the image, then finetune the model to overfit on a given image, then use the interpolation between old and new embeddings as input text for a generation. A 🧶 Paper: Day 5 #30daysofDiffusion #Diffusion
Tweet media one
2
38
254
@gowthami_s
Gowthami Somepalli
3 months
Why is self-correction not a big thing in LLM research? This is one of the biggest criticisms of the auto-regressive modeling paradigm! When I googled, I see only 2 papers somewhat related to this! Please drop in a comment with any relevant papers/blogs! Paper 1:
23
18
247
@gowthami_s
Gowthami Somepalli
2 years
Going through a serious case of FOMO today. My research feels very insignificant after looking at the amazing #dalle2 results. 🥺 #phdlife
13
5
242
@gowthami_s
Gowthami Somepalli
1 year
8 papers in a well-known conference! How is this possible for a grad student? I guess it’s quantity over quality to get a job these days…😔 #machinelearning #phdlife
@linylinx
Tianlin
1 year
Is this the *minimum* requirement for a new grad in machine learning now? #NVIDIA
Tweet media one
135
354
3K
18
15
230
@gowthami_s
Gowthami Somepalli
2 years
You know what I'm talking about! 😛 #phdlife #AcademicTwitter
Tweet media one
8
11
227
@gowthami_s
Gowthami Somepalli
5 months
✨ Can we detect style in the generated images? Our recent work takes a step towards understanding this question. We train a style-focused vision feature extractor built on top of CLIP, which we call a Contrastive Style Descriptior (CSD). paper: Style
Tweet media one
9
35
219
@gowthami_s
Gowthami Somepalli
2 years
Retrieval Augmented #Diffusion (RDM) models: Smaller diffusion models can generate high-quality generations by accessing an external memory to guide the generation. Inspired by Deepmind's RETRO. A 🧶 Paper: Day 10 #30daysofDiffusion #MachineLearning
Tweet media one
4
44
214
@gowthami_s
Gowthami Somepalli
1 year
📃🚨 Does your diffusion model copy from the training data? How to find such behavior? Why does it happen? Can we somehow mitigate it? A summary of recent work on understanding training data replication in recent T2I #diffusion models. A long 🧶 #machinelearning #aigeneration
Tweet media one
6
54
205
@gowthami_s
Gowthami Somepalli
3 years
My first #Neurips paper! Come say hi! :) #Selfsupervised #transformers
@kamalgupta09
Kamal Gupta
3 years
We introduce PatchGame, where two agents learn to communicate via discrete compositional symbols in a referential game. Visit our poster tomorrow at #Neurips2021 to learn more! When 🗓: Tue Dec 07, 11:30 - 01:00 PM(EST) Where📍: #DeepLearning #NLProc (1/2)
Tweet media one
2
20
100
6
14
204
@gowthami_s
Gowthami Somepalli
2 years
I will be in person at #CVPR22 to discuss our paper on understanding model reproducibility! Drop by and say hi if you are around! 😃 ⏰ When? June 23, Thursday 📍Where? Oral: 1.30-3pm, Session 3.2.3. Poster: 2.30-5pm, Session 3.2 #Machinelearning #Computervision
@_akhaliq
AK
2 years
Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective paper: project page:
Tweet media one
2
60
316
4
32
202
@gowthami_s
Gowthami Somepalli
4 months
Everyone told me since childhood that, by working hard, you will succeed. What life taught me is...work hard and if are you lucky, you will succeed. And if you keep working hard, those odds increase. Life is not fair! We just have to accept it. 🤷‍♀️ #phdlife
10
7
193
@gowthami_s
Gowthami Somepalli
11 months
I’m bringing back this months old question, how are y’all keeping with the LLM research? The pace is insane these days and I’ve reached a point where I have an unmanageable number of bookmarks! Are there any good LLM lists? #phdlife
20
7
193
@gowthami_s
Gowthami Somepalli
9 months
Many grad students secretly wish for this too. I feel that I’m lagging behind even after writing 2 papers a year (compared to some of my peers)! And the number of topics we have to keep up with, just to stay relevant for job market is insane. 🥲 #phdlife
@beenwrekt
Ben Recht
9 months
Since we just wrapped up an AI megaconference, it felt like a good day to plead for fewer papers.
32
162
853
6
2
192
@gowthami_s
Gowthami Somepalli
1 year
A 5 day event on LLMs. Line up looks interesting. (All talks are streamed live on YouTube) #machinelearning
Tweet media one
1
29
185
@gowthami_s
Gowthami Somepalli
7 months
✨ TLDR of @OpenAI Sora's technical report! 1. It's a latent diffusion transformer model unlike other U-net Video models. 2. Used a video encoder/decoder -> diffuse in latent space. 2. Trained on videos on native resolution, rather than square cropping the data. (helped with
3
23
186
@gowthami_s
Gowthami Somepalli
2 years
Versatile Diffusion: A diffusion model which is trained with reconstruction objectives of image and text together. It can go text-to-image, image-to-image, image->text->image, and so on. A 🧶 Paper: Day 13 #30daysofDiffusion #MachineLearning
Tweet media one
4
32
176
@gowthami_s
Gowthami Somepalli
3 years
A typical day Me: Skim through 2-3 papers, write ~50-100 lines of code, socialize, discuss ideas. Also Me: One leetcode medium question. #phdlife #gradschool
1
2
170
@gowthami_s
Gowthami Somepalli
2 months
I’m seeing this trend of some AI researchers/PhD students acting smug after training a large model. Please know humility is a strength! #phdlife
8
4
166
@gowthami_s
Gowthami Somepalli
11 months
📃✨ TLDR of Dall-E 3 paper. Some details that stood out for me : 1. T5 XXL as text encoder! 2. They first trained a vision-language model (like Parti) to generate longer-synthetic captions. 3. I think a lot of the performance improvement can be attributed to training on
Tweet media one
Tweet media two
Tweet media three
6
29
167
@gowthami_s
Gowthami Somepalli
10 months
The results are out. Turns out you are more likely to be hired (at least according to this poll) if you have more (any author) papers than a few first author papers. This is unfortunate in the sense that folks from larger labs get to be part of more projects usually. Also
@gowthami_s
Gowthami Somepalli
10 months
If you are in a position of power to hire a research scientist in your team, who would you prefer? (Assuming they both did equally well on interviews and are doing similar types of research) It’ll be great if you can comment on why you chose one of options.
10
2
15
11
16
161
@gowthami_s
Gowthami Somepalli
2 years
Interesting talk by Prof. Antonio Torralba on neuron interpretability in classifiers and generative models and what can we do with these explanations. Btw this is part of Explainable AI workshop. Happening in 208. #CVPR2022
Tweet media one
Tweet media two
Tweet media three
Tweet media four
1
11
162
@gowthami_s
Gowthami Somepalli
2 years
DreamBooth: Assign a rare sequence of tokens as the subject's identifier and fine-tune the diffusion model on the small set of images with the "subject". A 🧵 Paper: Day 1 #30daysofDiffusion #Diffusion #MachineLearning
Tweet media one
5
28
162
@gowthami_s
Gowthami Somepalli
2 years
These days I see a lot of instances of papers accepted to ML and Vision conferences who create a dummy GitHub repo saying coming soon and that soon never comes even after 2 years! 🙄 Why even promise in the first place? Can conferences do something about this?
6
5
159
@gowthami_s
Gowthami Somepalli
2 months
Mission Milano successfully achieved! ✨ Our style extractor paper is accepted to #ECCV2024 . TLDR: We trained a novel style feature extractor and then used it to study style transfer in diffusion models. Our model checkpoints are available on the GitHub repo, we are working
@_akhaliq
AK
5 months
Measuring Style Similarity in Diffusion Models Generative models are now widely used by graphic designers and artists. Prior works have shown that these models remember and often replicate content from their training data during generation. Hence as their proliferation
Tweet media one
4
44
216
8
18
160
@gowthami_s
Gowthami Somepalli
6 months
In an old paper of mine, I wrote a long and honest limitations section, and the reviewers just used it as a weapon to attack the paper, so yeah, most limitation sections are bogus these days! I have always considered the limitations section to be one of the most important parts
5
13
157
@gowthami_s
Gowthami Somepalli
4 months
The paper I was most happy about got the worst reviews. Novelty novelty novelty. 😔 #eccv #phdlife #reviewsystemisbroken
12
5
151
@gowthami_s
Gowthami Somepalli
2 years
Let me summarize today’s Twitter for you. Elon musk, 44 billion $, Twitter employee slack, come to our paper at ICLR!
4
3
147
@gowthami_s
Gowthami Somepalli
10 months
There are many pre-trained vision models, but which one is good for your downstream task? 🤔 We tried to answer that question in our #NeurIPS2023 paper where we looked at Classification, Detection, Retrieval, and OOD generalization. 📃: #ComputerVision
Tweet media one
1
25
146
@gowthami_s
Gowthami Somepalli
7 months
I'm in the same boat. I had 0 publications and barely there CS skills. The only thing I had was a drive to learn.
@anand_bhattad
Anand Bhattad
2 years
#GradStudentHiring should go beyond publications and prior backgrounds. My own story as an example: I started as a civil engineer, and now I'm close to finishing my PhD in CS focusing on computer vision #CVPR . Diverse backgrounds bring fresh perspectives and new ideas. 1/5
9
26
348
1
6
143
@gowthami_s
Gowthami Somepalli
2 years
For faster inference, distill (in 2 stages) multi-step classifier-free guided #diffusion models into a student model of same architecture which can generate the same quality images in fewer steps. A 🧶 Paper: Day 14 #30daysofDiffusion #MachineLearning
Tweet media one
4
24
140
@gowthami_s
Gowthami Somepalli
3 years
@AstroKatie There’s a name for it! 😂
Tweet media one
3
15
139
@gowthami_s
Gowthami Somepalli
10 months
There’s some advice floating around here on why you shouldn’t do Ph.D. Here’s my 2c: Firstly, there are successful people across all fields who come from unconventional backgrounds. They are an exception not a norm. Secondly, these folks are advising to go to industry rather
2
8
135
@gowthami_s
Gowthami Somepalli
1 year
This feels like an extremely elitist comment. Any person or any lab can be tier-1 with continued hard work and dedication. And it sounds extremely childish to complain that someone is getting credit cuz someone else isn’t publishing.
Tweet media one
6
2
139
@gowthami_s
Gowthami Somepalli
9 months
My experience from last 2 conferences. I hope I’ll be a bit more productive at #Neurips2023 ! 😂 Expectation: read all latest and greatest papers before you reach, have intellectual discussions with peers/seniors, learn about all the accepted papers in your field. Reality:
2
5
139
@gowthami_s
Gowthami Somepalli
2 years
Data augmentations are ubiquitous in ML pipelines these days. Why do they work? Are they worth more than additional data? We tried to understand and disentangle some of the inductive biases in this work! paper: #MachineLearning
Tweet media one
3
25
136
@gowthami_s
Gowthami Somepalli
3 years
Featuring SAINT! :)
@paperswithcode
Papers with Code
3 years
💥 Deep learning makes strides on tabular data! In this week’s newsletter, we summarize recent developments and papers using deep learning models for tabular data... and much more. Read on below:
Tweet media one
Tweet media two
Tweet media three
3
131
549
1
19
130
@gowthami_s
Gowthami Somepalli
2 years
I will be at #NeurIPS in person! I’m interested in self-supervised learning, continual learning, and generative modeling! Any suggestions on not-to-miss events/folks to talk to? Also, I’m in the market for internships for next summer. DM me if you want to chat! #NeurIPS2022 #ML
Tweet media one
3
7
134
@gowthami_s
Gowthami Somepalli
1 year
🎉 Tiny life update: I'll be spending my summer in New York interning with @Meta . Looking forward to meeting the brilliant researchers here. If you're around and up for hanging out, feel free to DM me. Can't wait! 😃
8
3
133
@gowthami_s
Gowthami Somepalli
2 months
Some takeaways for me from CVPR this year - (proceed with caution, I might be biased) - lots of interest in synthetic data as everyone agrees that annotated data barely exists and it’s too expensive to annotate at scale - lots of excitement around recently released video models
3
8
130
@gowthami_s
Gowthami Somepalli
5 months
Quite honored and inspired! :) Thank you @UofMaryland for the fellowship. #phdlife
@umdcs
UMD Department of Computer Science
5 months
. @UofMaryland 's Graduate School has awarded the Ann G. Wylie Dissertation Fellowship to Computer Science Ph.D. students @nakulgarg22 , Shoken Kaneko and @gowthami_s in recognition of their outstanding research. Congrats! 👏 Read more:
Tweet media one
1
6
38
12
2
130
@gowthami_s
Gowthami Somepalli
2 years
Get the average CLIP image model embeddings of an "Aesthetic" dataset, optimize the clip text encoder to align with this embedding, and plug it into SD to get better-looking images! A tiny 🧶 Paper: Day 7 #30daysofDiffusion #Diffusion #MachineLearning
Tweet media one
2
20
129
@gowthami_s
Gowthami Somepalli
4 months
✨ Excited ✨ Wanna get a whirlwind tour of memorization in diffusion models, how to find it, how to mitigate it, drop by the talk! Will discuss all my 3 papers (including style memorization) on this topic. Papers: 1. (CVPR'23) 2.
@CohereForAI
Cohere For AI
4 months
Our community is looking forward to welcoming @gowthami_s on Monday, May 6th as she presents her work on "Understanding and Mitigating Memorization in Diffusion Models" Learn more:
Tweet media one
1
2
24
0
13
118
@gowthami_s
Gowthami Somepalli
9 months
You might know me as a girl who talks about memorization or diffusion here. What you don't know about me is, that I came back to grad school after >5 years in industry. I will be at #NeurIPS in person, and I would be happy to chat about grad school, diffusion models, or Vision
3
3
116
@gowthami_s
Gowthami Somepalli
11 months
@jbhuang0604 2024: Resnets strike back! 🙂
8
0
114
@gowthami_s
Gowthami Somepalli
2 years
Can someone generate an image with this prompt, “Rich grad student in USA”. I bet they’ll all throw an exception! 😛 #phdlife #dalle2 #Imagen
5
3
114
@gowthami_s
Gowthami Somepalli
11 months
English has been used in India for a while now and it evolved into a new variation (can I call it a dialect?). So if you speak in Indian English and if someone makes fun of you for it, the joke's on them, not on you since it's gonna be the largest spoken English dialect in the
6
2
114
@gowthami_s
Gowthami Somepalli
2 years
Super grateful that our paper showed up in one of @_akhaliq ’s top tweets of the year!! Check out code and other materials here - #machinelearning
@_akhaliq
AK
2 years
Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective abs:
Tweet media one
5
17
122
0
8
111
@gowthami_s
Gowthami Somepalli
3 months
✨ The obligatory #CVPR24 post. I am in the job market. 👩🏽‍🎓 I am very interested in multimodal generation and understanding areas. Find me floating around in Video understanding/Generative modeling workshops on Monday/Tuesday. I will be giving #oral talks at Syntagen and
4
4
110
@gowthami_s
Gowthami Somepalli
2 years
Composable Diffusion: Generate an image with a set of prompts as a composition of multiple concepts using "AND" and "NOT". A 🧵 Paper: Day 4 #30daysofDiffusion #Diffusion #MachineLearning
Tweet media one
2
16
109
@gowthami_s
Gowthami Somepalli
1 year
Mini 🧶 The authors test if memorization behavior is localized to last “n” layers. They find that it’s distributed across and neurons across the layers contribute to this. Paper: #MachineLearning
Tweet media one
2
22
106
@gowthami_s
Gowthami Somepalli
3 years
Learning to represent images with discrete symbols like language (not necessarily human 😛). Will share a blog post summarizing this soon. Code: #MachineLearning #NLProc
@_akhaliq
AK
3 years
PatchGame: Learning to Signal Mid-level Patches in Referential Games abs:
Tweet media one
0
12
80
0
21
106
@gowthami_s
Gowthami Somepalli
11 months
I’ve come across many CVPR papers with dummy GitHub links. Is there a way to hold them accountable? @CVPR @CSProfKGD
6
10
100
@gowthami_s
Gowthami Somepalli
11 months
Me after #NeurIPS2023 decisions. #phdlife
Tweet media one
3
3
97
@gowthami_s
Gowthami Somepalli
11 months
Me waiting for those citations to come in after posting a paper! #phdlife
Tweet media one
1
1
97
@gowthami_s
Gowthami Somepalli
3 years
Me: Let’s take a minute and enjoy the summer. Twitter:
Tweet media one
1
5
96
@gowthami_s
Gowthami Somepalli
2 years
eDiff-I: A new text-to-image #diffusion model. Uses T5 and both CLIP encoders for conditioning. Instead of using the same denoising model for all steps, they propose using multiple specialized ones. A 🧵 Paper: Day 11 #30daysofDiffusion #MachineLearning
Tweet media one
2
10
95
@gowthami_s
Gowthami Somepalli
2 years
Pleasures of life Others: party, travel, relax Ph.D. Student: reading a paper unrelated to their project. 🥲 #phdlife
2
1
91
@gowthami_s
Gowthami Somepalli
3 months
It was my first time on a podcast, but it was a lot of fun! We discussed CinePile, my non-conventional education background, and my predictions for future models. Let me know if you have any feedback! #phdlife #machinelearning
@1littlecoder
1LittleCoder💻
3 months
🔥 Hear from @gowthami_s about Dataset creation and lot more Interesting nuances! ✅ I spoke to Gowthami who's built a Video Q&A Dataset which could be a foundation for better multimodal models Other than my poor editing, this's a great watch! Full link 👇🏽
1
3
25
6
12
91
@gowthami_s
Gowthami Somepalli
2 years
Textual Inversion: Learn the word for a given subject by optimizing the language encoder which enables us to generate new images of the given subject in new settings. A 🧵 Paper: Day 2 #30daysofDiffusion #Diffusion #MachineLearning
Tweet media one
2
17
90
@gowthami_s
Gowthami Somepalli
2 years
StructureDiffusion: Improve the compositional generation capabilities of text-to-image #diffusion models by modifying the text guidance by using a constituency tree or a scene graph. A 🧵 Paper: Day 9 #30daysofDiffusion #MachineLearning
Tweet media one
2
15
89
@gowthami_s
Gowthami Somepalli
7 months
Me: Complaining about some rejection. Sister nonchalantly quotes Gita: You have the right to work only but never to its fruits. Let not the fruits of action be your motive, nor let your attachment be to inaction. 😅 @sravani_s55
4
2
87
@gowthami_s
Gowthami Somepalli
3 months
Jensen says "he had no idea how to do what they set out to do" when they started the company. Now, we can't even get an interview for an internship/job in a team that does something you never did, even if we believe in the mission/ have a strong profile (in a different
@historyinmemes
Historic Vids
3 months
Jensen Huang started Nvidia at a Denny's breakfast booth
160
2K
17K
4
7
86
@gowthami_s
Gowthami Somepalli
1 year
Can I respond to one of my reviews with this? #NeurIPS
3
2
86
@gowthami_s
Gowthami Somepalli
2 years
Requirements to get into a CS PhD program these days! 😓 #phdlife #gradschool
@ccanonne_
Clément Canonne
2 years
@srchvrs "The ideal applicant will have at least one (1) Nobel Prize. Ability to fly and/or bend objects with mind recommended."
1
0
48
5
5
84
@gowthami_s
Gowthami Somepalli
2 years
Diffusion Disentanglement: Yet another prompt-based image editing method. Learn the interpolation coefficients between old and new prompts by means of optimization and uses that to perform the edits. A 🧶 Paper: Day 6 #30daysofDiffusion #MachineLearning
Tweet media one
2
13
82
@gowthami_s
Gowthami Somepalli
1 year
Excited to share our new work on reducing copying in diffusion models. We proposed ways to mitigate copying behavior even in the presence of heavy training data duplication! Stay tuned for the TLDR thread!
Tweet media one
@_akhaliq
AK
1 year
Understanding and Mitigating Copying in Diffusion Models Images generated by diffusion models like Stable Diffusion are increasingly widespread. Recent works and even lawsuits have shown that these models are prone to replicating their training data, unbeknownst to the user. In
Tweet media one
2
26
100
1
16
80
@gowthami_s
Gowthami Somepalli
1 year
Thank you @_akhaliq for tweeting about the paper! To appear in #CVPR2023 .
@_akhaliq
AK
2 years
Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models abs:
Tweet media one
10
84
397
2
11
80
@gowthami_s
Gowthami Somepalli
6 months
The general theme of accepted papers to #CVPR on twitter timeline today- 1. A visionLLM/generative model -bench 2. Massage diffusion models to do a thing 3. Fit a vision LLM/video LLM to a niche task. 4. Some 3d stuff I don’t really understand. Did I miss any other major
4
1
78
@gowthami_s
Gowthami Somepalli
3 years
That guilty feeling when you have a lot of work yet you *really* want to read a paper totally irrelevant to your project. 😅 #phdlife
1
3
78
@gowthami_s
Gowthami Somepalli
10 months
@docmilanfar This person has just set many young researchers on a misguided path with unreasonably high expectations. 😔
0
0
78
@gowthami_s
Gowthami Somepalli
3 months
I noticed this too. Same set of people talking in 4-5 workshops EVERY YEAR!When do we get to hear the new voices? I like it that Neurips workshop has a requirement that a speaker can be part of only one workshop proposal! @CVPR Maybe you can take this into consideration next
@docmilanfar
Peyman Milanfar
3 months
I’ll be giving 6 Keynote talks at CVPR. No I won’t. That would be ridiculous, right? Right?
2
1
81
3
3
77
@gowthami_s
Gowthami Somepalli
1 year
Come by our poster if you want to learn about memorization in #diffusion models. I will talk about our #CVPR2023 paper where we found copying in SD and the follow up work where we discuss “why” it is happening and how to mitigate it! 📍Poster 184, Tuesday Evening session
Tweet media one
2
7
76
@gowthami_s
Gowthami Somepalli
2 years
I’m all in for #ChatGPT if it means no more leet-coding for life! 😁 #phdlife
8
3
76
@gowthami_s
Gowthami Somepalli
1 year
I’m sure Telugu will go extinct in 100 years. Being part of an AI is the only way it’ll survive. (I’ve come across many folks who feel that speaking in English is a sign of sophistication and were communicating with their kids in English 🙄)
@surya03gsk
Surya Guthikonda
1 year
Calling out Telugu Speakers to represent our language in AI space by contributing to open source project Aya from Cohere For AI. Anyone who knows Telugu can contribute irrespective their Technical Knowledge. Start Your Contributions Here:
Tweet media one
1
20
72
10
4
74
@gowthami_s
Gowthami Somepalli
2 years
Do T2I and I2T models understand each other? The answer is, they do, to a certain extent. The authors analyze the fidelity of image and text tasks when BLIP and Stable #Diffusion talk to each other. A 🧶 Paper: Day 15 #30daysofDiffusion #MachineLearning
Tweet media one
2
19
75
@gowthami_s
Gowthami Somepalli
2 years
At the AI for Content Creation workshop this afternoon. This is perhaps the most clear and concise explanation of Dall-E 2! Thank you @model_mechanic for the talk. #CVPR2022
Tweet media one
Tweet media two
Tweet media three
Tweet media four
2
14
76
@gowthami_s
Gowthami Somepalli
9 months
My model halulu’s cuz it’s delulu! (Trying to start something here) #LLMs
6
4
73
@gowthami_s
Gowthami Somepalli
9 months
The problem with working in a competitive field is, that even your model names can get scooped! Since last week, 2 papers released a model named Video-Llava! 😂 paper 1: paper 2: Wonder how many more are submitted to #CVPR24 under
2
4
72
@gowthami_s
Gowthami Somepalli
2 years
I was looking at this link and the ICCV of 1998 happened in India!! Why don't we have major conferences in India anymore? (Goa would be a great spot for Winter conferences!) 🤔 #ComputerVision #MachineLearning
@CVPR
#CVPR2024
2 years
#CVPR2022 history lesson: The first CVPR was held on June 19th 1983 in Arlington, Virginia with ~300 attendees. Source:
Tweet media one
Tweet media two
0
7
41
10
7
74
@gowthami_s
Gowthami Somepalli
2 years
Had an amazing time presenting my work on model reproducibility at @GoogleAI India. Thank you so much @fooobar for giving me this opportunity! Hopefully, we can do the next presentation in person! :) #MachineLearning
3
5
73
@gowthami_s
Gowthami Somepalli
7 months
What’s your passion? Research. What’s your true passion? Using diffusion models to create images of cats doing human things in human costumes. 😜 #phdlife #generativeart #CatsOfTwitter
Tweet media one
7
5
73
@gowthami_s
Gowthami Somepalli
2 years
Everyone: AGI is here. My iPhone: (If I change my hair one bit) Who the f** are you and what did you do to Gowthami? #ArtificialIntelligence
3
4
71