Ricardo Hortelano @RHortelanoS profile

Ricardo Hortelano

@RHortelanoS

Followers

428

Following

6K

Statuses

4K

cooking @yorsio_com

Madrid

Joined September 2014

Don't wanna be here? Send us removal request.

Ricardo Hortelano

@RHortelanoS

9 months

0

Ricardo Hortelano

@RHortelanoS

4 days

@molasalex Si lo hacemos dt=0 supongo que tampoco veriamos como atropella a todos. Sería otro win, de algun modo... 😃

0

Ricardo Hortelano

@RHortelanoS

5 days

@memecrashes In the continuous way the trolley apparently will stop on the first person

0

1

Ricardo Hortelano

@RHortelanoS

8 days

RT @XRarchitect: Finally got my Gaussian Splat picture up and running in Augmented Reality—all via the web This is how I want to capture p…

0

152

0

Ricardo Hortelano

@RHortelanoS

8 days

RT @suchenzang: the true bitter lesson: it's easier to lie, cheat, and steal than it is to actually do good work

0

25

0

Ricardo Hortelano

@RHortelanoS

8 days

@yudapearl @eliasbareinboim Very interesting. Thanks professor

0

3

Ricardo Hortelano

@RHortelanoS

8 days

@tekbog No way

0

1

Ricardo Hortelano

@RHortelanoS

8 days

Carmark lo ha entendido

John Carmack

@ID_AA_Carmack

8 days

Offline reinforcement learning, where an agent tries to improve a behavior policy by observing another agent without actually playing, is a harder problem than it appears. The challenge isn’t to mimic the provided play, but to learn something better than what you have seen. The difference between online (traditional) RL and offline RL is that online RL is constantly "testing" its model by taking new actions as a result of changes to the model, while the offline training can bootstrap itself off into a coherent fantasy of great returns untested by reality. It may be just an artifact of value based RL in particular, but I am inclined to believe that it is a more fundamental truth about theoretical and observational science versus experimental science, and life in general.

0

Ricardo Hortelano

@RHortelanoS

8 days

@ID_AA_Carmack That's probably the next step on the ladder of causality from @yudapearl An agent of that nature probably needs to learn by counterfactuals.

1

4

Ricardo Hortelano

@RHortelanoS

15 days

@joantubau Al menos es sincero

0

1

Ricardo Hortelano

@RHortelanoS

16 days

RT @agifirealarm: @jxmnop "Sir, a second model has hit Hugging Face"

0

42

0

Ricardo Hortelano

@RHortelanoS

17 days

RT @hibakod: A math degree is 100x more useful than a computer science degree.

0

1K

0

Ricardo Hortelano

@RHortelanoS

17 days

@Recuenco Ha empezado a usar chatGPT

1

0

2

Ricardo Hortelano

@RHortelanoS

20 days

@molasalex Y como evitan esa dependencia con esto? Las materias primas son las que son

0

Ricardo Hortelano

@RHortelanoS

29 days

@ludwigABAP The only two I know

0

1

Ricardo Hortelano

@RHortelanoS

29 days

El link al artículo

char

@cunjur

1 month

0

1

Ricardo Hortelano

@RHortelanoS

29 days

@Recuenco te tocan los perfiles profesionales

0

1

Ricardo Hortelano

@RHortelanoS

1 month

@Alvaro_DMaria Que me avisen, que les consigo un buen deal

0