ogrisel Profile Banner
Olivier Grisel Profile
Olivier Grisel

@ogrisel

Followers
28K
Following
12K
Statuses
15K

Engineer at @probabl_ai, scikit-learn developer. More active at: https://t.co/rboP0pZxxp https://t.co/79m4xCGMeB

Between Vannes & Paris, France
Joined August 2008
Don't wanna be here? Send us removal request.
@ogrisel
Olivier Grisel
3 months
Here is the recording of the presentation:
0
4
18
@ogrisel
Olivier Grisel
3 months
@fhuszar @andrewgwils I also found: and:
0
0
0
@ogrisel
Olivier Grisel
3 months
@DHolzmueller I wonder if schedule-free AdamW could help here.
1
0
0
@ogrisel
Olivier Grisel
3 months
@glouppe Maybe this is just a copy and paste error.
0
0
2
@ogrisel
Olivier Grisel
3 months
@jn2clark How many epochs? Maybe ADOPT works best (as an optimizer) but makes it easier to overfit as a result? Strong regularization might be needed (e.g. dropout or SAM-style updates?)
0
0
2
@ogrisel
Olivier Grisel
5 months
@nworbmot Indeed, I read some of the details in the links of the thread after asking :) I suppose this kind of energy storage can help justify the increase of demand side flexibility from 5% to 10%.
0
0
1
@ogrisel
Olivier Grisel
5 months
@PreetumNakkiran Or "creativity".
0
0
2
@ogrisel
Olivier Grisel
5 months
I am looking forward to giving a keynote presentation at @PyDataParis 2024, September 25-26, Cité des Sciences. This will be an opportunity to reflect on the meaning of "noise" and predictive uncertainty in machine learning applications.
Tweet media one
Tweet media two
Tweet media three
1
12
30
@ogrisel
Olivier Grisel
6 months
@ML_BenWalker @srush_nlp But one could argue this is just a UI problem: the LLM could be prompted to output an inner voice "train of thoughts" first followed by a final answer and the UI hides the transient inner voice output to the user. I am sure that some 'chat' platforms already implement this.
2
0
1
@ogrisel
Olivier Grisel
6 months
@karpathy s/state/step/
0
0
0
@ogrisel
Olivier Grisel
8 months
@tomgoldsteincs Do you think this idea could be adapted to generative image / video models (stable diffusion and the like)?
0
0
2
@ogrisel
Olivier Grisel
8 months
RT @PyDataParis: The schedule of PyData Paris 2024 is LIVE! 🚀 Have you secured your tickets yet? 🎟️ Join us for two inspiring days of ope…
0
16
0