![Dan Biderman Profile](https://pbs.twimg.com/profile_images/1365064046035230720/ExsGBWOH_x96.jpg)
Dan Biderman
@dan_biderman
Followers
1K
Following
7K
Statuses
1K
AI & neuroscience. Postdoc @Stanford w/ @HazyResearch & @scott_linderman. Prev: computational neuroscience PhD @cu_neurotheory, @DbrxMosaicAI
Palo Alto, CA
Joined March 2017
People think LoRA is a magic bullet for LLMs. Is it? Does it deliver the same quality as full finetuning but on consumer GPUs? Though LoRA has the advantage of a lower memory footprint, we find that it often substantially underperforms full finetuning. However, it forgets less of the base model’s capabilities. In this work, we exhaustively explore this trade-off and provide practitioners a clear view of the difference between the methods.
22
104
560
RT @docmilanfar: The Kalman Filter was once a core topic in EECS curricula. Given its relevance to ML, RL, Ctrl/Robotics, I'm surprised tha…
0
55
0
RT @allen_ai: We took our most efficient model and made an open-source iOS app📱but why? As phones get faster, more AI will happen on devic…
0
81
0
RT @jxmnop: surreal time capsule from what things were like at OpenAI exactly six years ago this was a really really good bet
0
7
0
RT @iScienceLuvr: Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach We study a novel language model architect…
0
177
0
RT @SuryaGanguli: My @TEDAI2024 talk is out! I discuss our work, spanning AI, physics, math & neuroscience, to deve…
0
40
0
@SuryaGanguli Enjoyed listening. Loved the last part about open & interdisciplinary science of intelligence
0
0
1
RT @_jasonwei: Very excited to finally share OpenAI's "deep research" model, which achieves twice the score of o3-mini on Humanity's Last E…
0
192
0
Cool take!
Prediction: all closed AI model providers will stop selling APIs in the next 2-3 years. Only open models will be available via APIs. Why? For an open model service, the value prop is clear...it's hard to build a scalable service to access the model and the model is commodity. The race-to-the bottom happened with the commodity already (model). Let AI app builders iterate on great UIs for apps upon scalable services with commodity capabilities Closed model providers are trying to build non-commodity capabilities and they need great UIs to deliver those. It's not just a model anymore, but an app with a UI for a purpose. If closed models are available via API, all it does is create competition for the app the closed provider is building. The secret sauce is capabilities + UI.
0
0
1
RT @NaveenGRao: Prediction: all closed AI model providers will stop selling APIs in the next 2-3 years. Only open models will be available…
0
50
0
RT @abeirami: 𝐛𝐞𝐬𝐭-𝐨𝐟-𝐧 is a strong baseline for - improving agents - scaling inference-time compute - preference alignment - jailbreakin…
0
48
0