Gagan Jain Profile
Gagan Jain

@gaganjain1582

Followers
219
Following
1,002
Media
15
Statuses
48
Explore trending content on Musk Viewer
Pinned Tweet
@gaganjain1582
Gagan Jain
22 days
Your models about to get a MoNE-y makeover! 💸 We're thrilled to unveil MoNE (Mixture of Nested Experts), a game-changer for efficiency & scalability. Get ready to level up! 🚀
Tweet media one
@_akhaliq
AK
28 days
Google presents Mixture of Nested Experts Adaptive Processing of Visual Tokens The visual medium (images and videos) naturally contains a large amount of information redundancy, thereby providing a great opportunity for leveraging efficiency in processing. While Vision
Tweet media one
4
77
379
1
14
52
@gaganjain1582
Gagan Jain
1 month
👀A sneak-peek at our recent work accepted at #ECCV2024 @eccvconf ! 📜: Want a scalable and robust foundational model that also saves computational costs? Say hello to LookupViT, our information compression idea leading to sub-quadratic attention! (1/10)
Tweet media one
@gaganjain1582
Gagan Jain
2 months
First conference submission ☑️ First conference acceptance ✅ #eccv2024 @eccvconf Congrats Sujoy @rajatkoner @jainprateek_ 🎉 More details soon :D
6
2
73
3
24
140
@gaganjain1582
Gagan Jain
2 months
First conference submission ☑️ First conference acceptance ✅ #eccv2024 @eccvconf Congrats Sujoy @rajatkoner @jainprateek_ 🎉 More details soon :D
6
2
73
@gaganjain1582
Gagan Jain
14 days
#NeurIPS discussion phase be like 😔
Tweet media one
0
0
17
@gaganjain1582
Gagan Jain
28 days
Tweet media one
Tweet media two
1
2
15
@gaganjain1582
Gagan Jain
13 days
#NeurIPS Reviewer: Do X, Y, Z We perform these (somewhat intensive) exps, while also justifying why some of these weren't even needed given paper results Reviewer (< a day to discussion deadline): Won't increase score, I think more extensive evals/discussion needed -_-
1
0
6
@gaganjain1582
Gagan Jain
22 days
We're humbled by the early positive reception and shares for our MoNE paper! 🙏 A huge shoutout to my amazing collaborators and mentors for their incredible contributions - @paul_sujoy_ Nidhi Hegde @adityakusupati @jainprateek_ @NagraniArsha Shyamal Buch, Anurag Arnab ✨
1
1
5
@gaganjain1582
Gagan Jain
13 days
Some #NeurIPS reviewers need these things: 1. A reminder that their task is to evaluate potential scientific advancements in the field 2. An open and unbiased mind 3. An alarm clock 4. Magnifying Glasses
0
0
5
@gaganjain1582
Gagan Jain
15 days
So many interesting keynotes, today's twitter feed is giving me major FOMO. Next year's @RL_Conference on my bucket list now 😤
@pcastr
Pablo Samuel Castro
15 days
Great keynote by David Silver, arguing that we need to re-focus on RL to get out of the LLM Valley @RL_Conference
Tweet media one
Tweet media two
11
60
432
0
0
4
@gaganjain1582
Gagan Jain
1 month
Not all tokens provide new information, yet self-attention based models do not exploit this redundancy. This impacts their scalability, and calls for newer techniques that compress information, and can be adaptively used at inference based on task complexity. (2/10)
1
0
4
@gaganjain1582
Gagan Jain
1 month
How? We sample compressed tokens (M) from the original input token pool (N) (called lookup tokens). These two sets of tokens are then fed to a series of LookupViT blocks, which process and bidirectionally exchange information to enhance these tokens. (4/10)
Tweet media one
1
0
4
@gaganjain1582
Gagan Jain
22 days
@paul_sujoy_ @adityakusupati @jainprateek_ @NagraniArsha Stay tuned for more updates on MoNE & its exciting applications across different domains! In an era where efficient computation is paramount, we believe MoNE's approach to conditional computation has the potential to reshape the future of large-scale ML models. 🚀
1
0
4
@gaganjain1582
Gagan Jain
1 month
Key Idea: We maintain two sets of tokens that exchange information amongst themselves - 1⃣ Fine-grained but many, that are cheaply processed 2⃣ Coarse but few, that are heavily processed (3/10)
Tweet media one
1
0
4
@gaganjain1582
Gagan Jain
18 days
Why even have a chat support then -_- @zomato
Tweet media one
1
0
4
@gaganjain1582
Gagan Jain
28 days
[Bangalore House Rent Experience] I'll happily pay a broker once than use @nobrokercom 's scam plans ever again. Operating without a shred of ethic, their so called relationship managers will ignore all complaints & get their "brokerage" from your sec. deposit when you vacate.
1
2
4
@gaganjain1582
Gagan Jain
1 month
The outcome? We beat vanilla ViT and some state-of-the-art compression techniques on standard benchmarks, demonstrate scalability with increasing image resolution, and show applicability to tasks like video classification and image captioning! (6/10)
Tweet media one
1
0
4
@gaganjain1582
Gagan Jain
1 month
Several open directions of research, including auto-selection of compression token set based on task complexity - an important step towards conditional computation. Would love to chat and exchange more ideas on efficiency of large models. Looking forward to Milan 🇮🇹:) (9/10)
1
0
3
@gaganjain1582
Gagan Jain
22 days
We've already seen promising results on image & video datasets – MoNE boosts efficiency without sacrificing performance. 👀
Tweet media one
1
0
3
@gaganjain1582
Gagan Jain
1 month
Cherry on top, FlexiViT style training allows adaptive choice of the number of compressed tokens at inference time from a single model, with minimal accuracy loss! Extensive ablations and visualizations in the paper investigate the why’s and how’s about our framework. (8/10)
Tweet media one
1
0
3
@gaganjain1582
Gagan Jain
22 days
MoNE's secret sauce? 🪄 A clever combo of nested experts & adaptive routing, allowing it to smartly allocate resources based on input complexity.
Tweet media one
1
0
3
@gaganjain1582
Gagan Jain
1 month
And, and, and, we show significantly superior robustness than other baselines in the presence of adversarial perturbations! We further show that the feature corruption with increasing severity levels is lower for LookupViT than vanilla ViT. (7/10)
Tweet media one
1
0
3
@gaganjain1582
Gagan Jain
1 month
Work with awesome collaborators & mentors @GoogleDeepMind @LMU_Muenchen : @rajatkoner ^ @paul_sujoy_ @jainprateek_ @vtresp ! Extra pumped, for this is my 1st A* conf paper. Extremely grateful to @ManishGuptaMG1 for the constant support! ^ denotes equal contribution (10/10)
0
0
3
@gaganjain1582
Gagan Jain
1 month
Information processing involves three key steps: Information gathering: Information flow from lookup to compressed tokens Representation refinement: Compressed tokens undergo heavy computation Global Context Infusion: Information flow from compressed to lookup tokens (5/10)
Tweet media one
1
0
3
@gaganjain1582
Gagan Jain
22 days
No more wasted compute! MoNE tackles redundant or simple info with lightweight experts, keeping your models lean & mean. 💪
Tweet media one
1
0
3
@gaganjain1582
Gagan Jain
10 days
#NeurIPS reviews done and dusted (just move on), here's something to look forward to at #NeurIPS2024 . Also, a chance to interact with the ever amazing (and only slightly scary) @nikitasaxena02
@nikitasaxena02
Nikita Saxena (she/her)
10 days
(1/n) 📢 Thrilled to be part of the organizing team of the WiML Workshop at #NeurIPS2024 ! 🧠 Full CFP: Submission Deadline: September 10, 2024 @WiMLworkshop @NeurIPSConf @WiCVworkshop @WiNLPWorkshop #WiML #NeurIPS #machinelearning #WiML2024 #WiML2024
2
11
44
0
0
2
@gaganjain1582
Gagan Jain
1 month
@papers_anon Thanks for the paper mention. Code release is in the works :)
0
0
2
@gaganjain1582
Gagan Jain
1 month
@fly51fly Thanks for the paper mention :)
1
0
1
@gaganjain1582
Gagan Jain
1 month
@fooobar @paul_sujoy_ @jainprateek_ Thank you so much for your kind words, Gaurav :)) Grateful to be here, all thanks to you!
0
0
1
@gaganjain1582
Gagan Jain
2 months
@_toolazyto_ @eccvconf @rajatkoner @jainprateek_ Thanks Shreyas! Celebratory 🎾 match soon? :p
1
0
1
@gaganjain1582
Gagan Jain
20 days
@PavloMolchanov @ccccrs_0908 @srv_m @gLeHeinrich @yin_hongxu @VITAGroupUT @jankautz Awesome work. Congrats! It was great talking to you during ICML :))
0
0
1
@gaganjain1582
Gagan Jain
4 months
Love this
@Michael_J_Black
Michael Black
4 months
Young scientists regularly ask me for career advice. Academia or industry? Big company or startup? US or Europe?  Good scientists in AI disciplines are fortunate to have many choices. But choosing can be stressful. I always give the same advice. 1/10
21
250
1K
0
0
1
@gaganjain1582
Gagan Jain
4 months
@amit_sethi All in for note taking if the subject is mathematical in nature. When learning something for the first time, I have often found note taking to be the difference between "yeah that makes sense" and "oh I see why it makes sense".
0
0
1