Adam Sadovsky Profile
Adam Sadovsky

@asadovsky

Followers
770
Following
328
Media
0
Statuses
89

Distinguished Software Engineer / Senior Director, Gemini

Joined December 2007
Don't wanna be here? Send us removal request.
Explore trending content on Musk Viewer
@asadovsky
Adam Sadovsky
8 months
Gemini Advanced is here! And it's not lazy. Let us know what you think!
21
8
82
@asadovsky
Adam Sadovsky
10 months
@sundarpichai
Sundar Pichai
10 months
Now Gemini Pro is coming today in Bard’s biggest update yet (in English in 170 countries) with more advanced reasoning and understanding in the responses. Bard Advanced with Ultra, our most general and capable model for highly complex tasks, is coming early next year.
Tweet media one
60
204
2K
3
5
55
@asadovsky
Adam Sadovsky
8 months
@saqbach Different fine-tuning, and Bard's has access to the internet. We are working hard to improve both versions of the model though, stay tuned!
4
3
44
@asadovsky
Adam Sadovsky
2 months
@_xjdr I am pleased to report that we didn't Goodhart it.
2
0
43
@asadovsky
Adam Sadovsky
11 months
Bard can plot!
2
5
37
@asadovsky
Adam Sadovsky
2 months
🧑‍🍳
@lmsysorg
lmsys.org
2 months
Exciting News from Chatbot Arena! @GoogleDeepMind 's new Gemini 1.5 Pro (Experimental 0801) has been tested in Arena for the past week, gathering over 12K community votes. For the first time, Google Gemini has claimed the #1 spot, surpassing GPT-4o/Claude-3.5 with an impressive
Tweet media one
84
419
2K
2
0
36
@asadovsky
Adam Sadovsky
8 months
oh hai
@lmsysorg
lmsys.org
8 months
🔥Breaking News from Arena Google's Bard has just made a stunning leap, surpassing GPT-4 to the SECOND SPOT on the leaderboard! Big congrats to @Google for the remarkable achievement! The race is heating up like never before! Super excited to see what's next for Bard + Gemini
Tweet media one
153
621
3K
2
1
28
@asadovsky
Adam Sadovsky
8 months
@JackK @samc0hen @mbaeuml Love this example. Shared with the team!
1
0
15
@asadovsky
Adam Sadovsky
10 months
the secret is out
@gdb
Greg Brockman
10 months
evals are surprisingly often all you need
67
83
1K
0
0
15
@asadovsky
Adam Sadovsky
2 months
It's a good model! 🧑‍🍳🧑‍🍳🧑‍🍳
@emollick
Ethan Mollick
2 months
Dang, the new Gemini model is quite good I gave it an excellent, complicated game source book PDF and asked it to roll up a character, referencing many pages. There may be errors, but wow. First time I have seen an AI get so far Interestingly, it decided to play... a secret AI
Tweet media one
Tweet media two
Tweet media three
15
55
526
1
0
14
@asadovsky
Adam Sadovsky
8 months
@JackK @pra15meshh @AarushSelvan So, this one is a bit funny actually. The new Gemini UI is so clean and simple that the model focused its attention on the first example prompt, which happened to be about rusty plants. Of course, we should do better! But, it's not completely random. 😅
0
0
13
@asadovsky
Adam Sadovsky
5 months
Very nice work. Warren Buffett's famous quote seems apt: “Only when the tide goes out do you learn who has been swimming naked.”
@hughbzhang
Hugh Zhang
5 months
Data contamination is a huge problem for LLM evals right now. At Scale, we created a new test set for GSM8k *from scratch* to measure overfitting and found evidence that some models (most notably Mistral and Phi) do substantially worse on this new test set compared to GSM8k.
Tweet media one
36
216
1K
1
0
11
@asadovsky
Adam Sadovsky
2 months
A big step forward. LLMs are much more useful when they're easy to fact-check.
@kelvin_guu
Kelvin Guu
2 months
Excited to launch and share the new citations feature ("related content") in Gemini! Last fall, we introduced Double-Check (), which checks Gemini's claims against sources on the web. With citations, we're now auto-running on all fact-seeking queries.
1
6
39
0
0
10
@asadovsky
Adam Sadovsky
8 months
@YeshuaisSavior @saqbach We're working on all of these! Thanks for the feedback.
4
0
9
@asadovsky
Adam Sadovsky
11 days
The momentum is palpable!
@OfficialLoganK
Logan Kilpatrick
11 days
Two new production Gemini models, >2x higher rate limits, >50% price drop on Gemini 1.5 Pro, filters switched to opt-in, updated Flash 8B experimental model, and more. It’s a good day to be a developer : )
176
324
2K
0
0
8
@asadovsky
Adam Sadovsky
2 months
@_xjdr true :)
0
0
7
@asadovsky
Adam Sadovsky
2 months
Welcome back @NoamShazeer ! Things are getting good.
1
0
7
@asadovsky
Adam Sadovsky
8 months
@bindureddy Some other LMSys-ranked models like pplx-70b-online also have internet access, but it's true that many don't. Internet access can both help and hurt; a model must be tuned carefully to make good use of it. We'd love to see other attempts at it!
0
0
7
@asadovsky
Adam Sadovsky
9 months
👀
@lmsysorg
lmsys.org
9 months
[Arena] Exciting news from Arena! We're thrilled to welcome Google's Bard, now enhanced with Gemini Pro for advanced summarizing, reasoning, and real-time insights. Curious how Bard measures up against leading AI models? Join us in the Arena to find out!
Tweet media one
7
23
183
0
0
6
@asadovsky
Adam Sadovsky
8 months
Gemini is now available to businesses large and small. Let us know what you think!
@Google
Google
8 months
We’re launching Gemini Business for @GoogleWorkspace . This new add-on gives organizations of all sizes access to Gemini for Workspace, including Help me write in Docs, image generation in Slides and more ↓
160
158
882
0
0
5
@asadovsky
Adam Sadovsky
5 months
1
0
6
@asadovsky
Adam Sadovsky
8 months
@Suhail Thanks! This is just the beginning.
0
0
5
@asadovsky
Adam Sadovsky
10 months
👀
@emollick
Ethan Mollick
10 months
For the first time, I am seeing signs of "cleverness" in Bard, which has been upgraded to Gemini Pro (GPT-3.5 levels of reasoning) I gave it the apple test - ending sentences with the word apple. It still mostly fails, but when asked to visualize it, the results are interesting
Tweet media one
Tweet media two
Tweet media three
Tweet media four
11
20
209
0
0
5
@asadovsky
Adam Sadovsky
8 months
@ClementDelangue 🥰 Now let's deepen our partnership and see what happens!
0
0
3
@asadovsky
Adam Sadovsky
8 months
@Paul_Moreira We have some filters that are too aggressive and imprecise. We're working hard to fix it, stay tuned!
1
0
3
@asadovsky
Adam Sadovsky
1 year
🇮🇱
1
0
2
@asadovsky
Adam Sadovsky
9 months
@JackK I guess we have some say in the bingo outcome... Oxford comma FTW!
0
0
3
@asadovsky
Adam Sadovsky
8 months
@permaximum88 It's been really helpful to have real users "kick the tires" on it, as we did for Gemini Pro back in December. But we'll get it there in due time!
0
0
1
@asadovsky
Adam Sadovsky
8 months
@permaximum88 @JackK @samc0hen @mbaeuml Ironically, the response from your screenshot is completely wrong - check the numbers. The free version of Gemini also has access to up-to-date information, so this isn't about free vs. paid. We just have some work to do to answer these prompts reliably. We'll get there!
1
0
2
@asadovsky
Adam Sadovsky
2 months
@eladgil Glad to see we earned back Noam's trust! :)
0
0
2
@asadovsky
Adam Sadovsky
8 months
@UltraRareAF @hu_yifei Yeah, it's a challenging space. We're thinking about how best to handle this. Stay tuned.
2
0
1
@asadovsky
Adam Sadovsky
8 months
1
0
2
@asadovsky
Adam Sadovsky
8 months
@hevensxj @Paul_Moreira I agree, they make my blood boil too. Long story for why it happens currently, but we'll fix it! @JackK
1
0
2
@asadovsky
Adam Sadovsky
1 year
Feels like half of LLM "examples that will blow your mind" have major mistakes in the output when you look closely.
0
0
2
@asadovsky
Adam Sadovsky
8 months
@hu_yifei Ugh, sorry. I got a slightly better response when I tried: At any rate, this is bad. I've shared it with the team. We have an active effort to fix these kinds of failures.
2
0
2
@asadovsky
Adam Sadovsky
17 days
1
0
2
@asadovsky
Adam Sadovsky
8 months
@SullyOmarr @DicksonPau We're working on TTFT, stay tuned!
1
0
2
@asadovsky
Adam Sadovsky
8 months
@hu_yifei Those triangles are strange indeed, I've never seen them myself. @mbaeuml - any ideas?
1
0
1
@asadovsky
Adam Sadovsky
8 months
@ChiaraCervetta Thanks for all the feedback! We'll work on making these things better.
1
0
2
@asadovsky
Adam Sadovsky
8 months
@Jason @sundarpichai @Jason - we took your UI feedback to heart... let us know what you think of our new Gemini UI!
1
0
2
@asadovsky
Adam Sadovsky
11 months
@drjwrae Agreed. Post-training makes all the difference for making the model useful for end users.
0
0
2
@asadovsky
Adam Sadovsky
1 year
0
0
1
@asadovsky
Adam Sadovsky
6 months
@rajhans_samdani Congrats! Very cool.
0
0
1
@asadovsky
Adam Sadovsky
6 months
@OfficialLoganK @Google Congratulations and welcome to Google - thrilled that you're joining and can't wait to see what you build!
0
0
1
@asadovsky
Adam Sadovsky
8 months
@hyoung9052 Thanks, that's a really nice example, I'll share it with the team.
0
0
1
@asadovsky
Adam Sadovsky
8 months
@abdiisan @saqbach To elaborate, there are some necessary but minor differences. For example, supports extensions like Maps, which we had to disable in the LMSys version. But for the most part they are identical.
0
0
1
@asadovsky
Adam Sadovsky
11 months
@Dughlas I guess AGI hasn't arrived quite yet!
0
0
1
@asadovsky
Adam Sadovsky
8 months
1
0
1
@asadovsky
Adam Sadovsky
2 years
1
0
1
@asadovsky
Adam Sadovsky
19 days
@christoc We're hard at work making Gemini better!
0
0
1
@asadovsky
Adam Sadovsky
10 months
@dharmesh Try out the Bard+Workspace integration! Bard can search your Gmail and answer your questions. Details:
0
0
0
@asadovsky
Adam Sadovsky
8 months
@DimitrisPapail @altryne @lmsysorg @Google It's the best public leaderboard we know of, but please let us know if there's another we should look at.
0
0
1
@asadovsky
Adam Sadovsky
8 months
@beashutiwari It should be already - are you seeing something different?
1
0
1
@asadovsky
Adam Sadovsky
8 months
@samc0hen Yes, we're on it!
0
0
1
@asadovsky
Adam Sadovsky
17 days
@Klotzkette @mbaeuml @ChatGPTapp @MSFTCopilot Improving accuracy is an ongoing major focus. If you're open to it, please DM me examples of the problems you're seeing.
0
0
1
@asadovsky
Adam Sadovsky
8 months
@JackK @BounceNStretch @altryne @lmsysorg @Google +1, this is a known issue and we're working on it!
0
0
1
@asadovsky
Adam Sadovsky
8 months
1
0
1
@asadovsky
Adam Sadovsky
8 months
@permaximum88 @JackK @samc0hen @mbaeuml Thanks for the feedback! We definitely want to nail these prompts.
0
0
1