RHortelanoS Profile Banner
Ricardo Hortelano Profile
Ricardo Hortelano

@RHortelanoS

Followers
428
Following
6K
Statuses
4K

cooking @yorsio_com

Madrid
Joined September 2014
Don't wanna be here? Send us removal request.
@RHortelanoS
Ricardo Hortelano
9 months
0
0
0
@RHortelanoS
Ricardo Hortelano
4 days
@molasalex Si lo hacemos dt=0 supongo que tampoco veriamos como atropella a todos. Sería otro win, de algun modo... 😃
0
0
0
@RHortelanoS
Ricardo Hortelano
5 days
@memecrashes In the continuous way the trolley apparently will stop on the first person
0
0
1
@RHortelanoS
Ricardo Hortelano
8 days
RT @XRarchitect: Finally got my Gaussian Splat picture up and running in Augmented Reality—all via the web This is how I want to capture p…
0
152
0
@RHortelanoS
Ricardo Hortelano
8 days
RT @suchenzang: the true bitter lesson: it's easier to lie, cheat, and steal than it is to actually do good work
0
25
0
@RHortelanoS
Ricardo Hortelano
8 days
@yudapearl @eliasbareinboim Very interesting. Thanks professor
0
0
3
@RHortelanoS
Ricardo Hortelano
8 days
@tekbog No way
0
0
1
@RHortelanoS
Ricardo Hortelano
8 days
Carmark lo ha entendido
@ID_AA_Carmack
John Carmack
8 days
Offline reinforcement learning, where an agent tries to improve a behavior policy by observing another agent without actually playing, is a harder problem than it appears. The challenge isn’t to mimic the provided play, but to learn something better than what you have seen. The difference between online (traditional) RL and offline RL is that online RL is constantly "testing" its model by taking new actions as a result of changes to the model, while the offline training can bootstrap itself off into a coherent fantasy of great returns untested by reality. It may be just an artifact of value based RL in particular, but I am inclined to believe that it is a more fundamental truth about theoretical and observational science versus experimental science, and life in general.
0
0
0
@RHortelanoS
Ricardo Hortelano
8 days
@ID_AA_Carmack That's probably the next step on the ladder of causality from @yudapearl An agent of that nature probably needs to learn by counterfactuals.
Tweet media one
1
1
4
@RHortelanoS
Ricardo Hortelano
15 days
@joantubau Al menos es sincero
0
0
1
@RHortelanoS
Ricardo Hortelano
16 days
RT @agifirealarm: @jxmnop "Sir, a second model has hit Hugging Face"
Tweet media one
0
42
0
@RHortelanoS
Ricardo Hortelano
17 days
RT @hibakod: A math degree is 100x more useful than a computer science degree.
0
1K
0
@RHortelanoS
Ricardo Hortelano
17 days
@Recuenco Ha empezado a usar chatGPT
1
0
2
@RHortelanoS
Ricardo Hortelano
20 days
@molasalex Y como evitan esa dependencia con esto? Las materias primas son las que son
0
0
0
@RHortelanoS
Ricardo Hortelano
29 days
@ludwigABAP The only two I know
0
0
1
@RHortelanoS
Ricardo Hortelano
29 days
El link al artículo
@cunjur
char
1 month
0
0
1
@RHortelanoS
Ricardo Hortelano
29 days
@Recuenco te tocan los perfiles profesionales
0
0
1
@RHortelanoS
Ricardo Hortelano
1 month
@Alvaro_DMaria Que me avisen, que les consigo un buen deal
0
0
0