![Ramon Astudillo Profile](https://pbs.twimg.com/profile_images/1124758674931703808/fHTtaFbd_x96.jpg)
Ramon Astudillo
@RamonAstudill12
Followers: 547 · Following: 3K · Statuses: 2K
Principal RS at IBM Research AI. Speech, Formal/Natural Language Processing. Currently LLM post-training, structured SDG/RL. Opinions my own and non-stationary.
Manhattan, NY
Joined April 2019
RT @DimitrisPapail: We should be seriously asking, how a 1.5B model that can't answer basic questions can also be that good at competition…
@mblondel_ml @andrew_n_carr +1, but if you torture rejection sampling a bit (let the temperature tend to zero and approximate the scaling factor from the proposed samples), it ends up giving best-of-N (the RSO paper makes this point).
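A minimal sketch of that point (NumPy only, with made-up reward scores for illustration): rejection sampling from the reward-tilted distribution π_ref(y)·exp(r(y)/β), where the scaling factor M is estimated from the N proposed samples themselves, concentrates on the highest-reward sample as β tends to zero, i.e. it reduces to best-of-N.

```python
import numpy as np

def rejection_sample(rewards, beta, rng):
    """Pick one of the N proposals by rejection sampling against exp(r/beta)."""
    rewards = np.asarray(rewards, dtype=float)
    # scaling factor M approximated from the proposals themselves: M = exp(max(r)/beta)
    accept_prob = np.exp((rewards - rewards.max()) / beta)
    while True:
        i = rng.integers(len(rewards))        # propose uniformly among the N samples
        if rng.random() < accept_prob[i]:
            return i

rng = np.random.default_rng(0)
rewards = [0.1, 0.7, 0.4, 0.9, 0.3]           # hypothetical reward-model scores for N=5 samples
best = int(np.argmax(rewards))
for beta in (1.0, 0.1, 0.01):
    picks = [rejection_sample(rewards, beta, rng) for _ in range(2000)]
    share = np.mean([p == best for p in picks])
    print(f"beta={beta}: best-of-N sample chosen {share:.0%} of the time")
```

At β = 1.0 the accepted sample is spread over the proposals; by β = 0.01 it is essentially always the argmax-reward one.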
RT @andre_t_martins: Good to see @EU_Commission promoting OS LLMs in Europe. However (1) "OpenEuroLLM" is appropriating a name (#EuroLLM) w…
@dearmadisonblue You mean distilling an existing diffusion model into visual CoT, or training one from scratch? The latter seems to be the hard one. You will end up with a VAE anyway; there were models like this, such as DRAW.
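A minimal sketch of a DRAW-style recurrent VAE (no attention), assuming PyTorch and illustrative layer sizes for a flattened 28×28 input; it shows the sequential canvas refinement that makes such models a kind of visual CoT. Nothing here is the original DRAW implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MiniDRAW(nn.Module):
    def __init__(self, x_dim=784, h_dim=256, z_dim=32, steps=8):
        super().__init__()
        self.x_dim, self.h_dim, self.steps = x_dim, h_dim, steps
        # encoder reads the image, the current error image, and the previous decoder state
        self.enc_rnn = nn.LSTMCell(2 * x_dim + h_dim, h_dim)
        self.dec_rnn = nn.LSTMCell(z_dim, h_dim)
        self.to_mu = nn.Linear(h_dim, z_dim)
        self.to_logvar = nn.Linear(h_dim, z_dim)
        self.write = nn.Linear(h_dim, x_dim)

    def forward(self, x):
        B = x.size(0)
        h_enc = c_enc = torch.zeros(B, self.h_dim)
        h_dec = c_dec = torch.zeros(B, self.h_dim)
        canvas = torch.zeros(B, self.x_dim)
        kl = torch.zeros(B)
        for _ in range(self.steps):
            err = x - torch.sigmoid(canvas)                           # what is still missing
            h_enc, c_enc = self.enc_rnn(torch.cat([x, err, h_dec], 1), (h_enc, c_enc))
            mu, logvar = self.to_mu(h_enc), self.to_logvar(h_enc)
            z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)   # reparameterization
            h_dec, c_dec = self.dec_rnn(z, (h_dec, c_dec))
            canvas = canvas + self.write(h_dec)                       # additive canvas update
            kl = kl + 0.5 * (mu.pow(2) + logvar.exp() - logvar - 1).sum(1)
        recon = F.binary_cross_entropy_with_logits(canvas, x, reduction="none").sum(1)
        return (recon + kl).mean()                                    # ELBO-style loss

# toy usage: one backward pass on random "images"
model = MiniDRAW()
loss = model(torch.rand(16, 784))
loss.backward()
```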
RT @Yikang_Shen: It's good to see Deepseek v3 draw everyone's attention to reducing the training cost of LLM. Over the last two years, we…
RT @seirasto: 🌟New Benchmark! 🌟 Do you work on RAG? Are you interested in Multi-Turn conversations? Very excited to share the new MTRAG be…
@ch402 ☝️ So TLDR, IMO this all feels organic and tech/market driven rather than a conscious change of style. These forces may last until AGI, or maybe we hit a serious winter and return to the old ways of more exploration, less exploitation.
@Teknium1 ☝️ That does not mean the model never saw human CoTs. It probably saw a huge amount of high-quality ones, because the initial policy must be some GPT. But, ofc, if you apply some STaR-like search to that model, you are going to get better ones.
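A minimal sketch of the STaR-like loop referenced above, with `generate_cot` and the tiny dataset as hypothetical stand-ins for a real policy call and answer parser: sample rationales from the current model, keep only those whose final answer matches the reference, and fine-tune on the kept ones.

```python
import random

def generate_cot(model, question, temperature=0.8):
    # hypothetical stand-in for sampling a rationale + final answer from the policy
    answer = random.choice(["4", "5", "6"])
    return f"Let me think step by step... so the answer is {answer}", answer

def star_round(model, dataset, samples_per_question=8):
    kept = []
    for question, reference in dataset:
        for _ in range(samples_per_question):
            cot, answer = generate_cot(model, question)
            if answer == reference:            # keep only rationales with the correct final answer
                kept.append((question, cot))
                break                          # one good rationale per question is enough here
    return kept                                # fine-tune the policy on these, then repeat

dataset = [("What is 2 + 2?", "4"), ("What is 3 + 3?", "6")]
print(star_round(model=None, dataset=dataset))
```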