Hynek Kydlíček @HKydlicek profile

Followers

501

Following

296

Statuses

414

MLE @huggingface 🤗 Prague, CZ 🇪🇺 eu/acc

Czech Republic

Joined December 2021

Hynek Kydlíček

@HKydlicek

4 months

When we began expanding FineWeb into multilingual space 🌎, we quickly noticed a lack of established benchmarks. Thus, we created FineTasks with two goals in mind: A. To encompass various LLM capabilities (knowledge, reasoning, etc.). B. To provide a reliable signal 📈.

1

3

20

Hynek Kydlíček

@HKydlicek

6 hours

@gui_penedo @Thom_Wolf 🤨 How much is that in FineWebs?? That's the only unit I can imagine.

0

Hynek Kydlíček

@HKydlicek

7 hours

Thanks for the AIME25 results

0

Hynek Kydlíček

@HKydlicek

14 hours

@lvwerra Le Onardo could secure even the future Italian funding tho

0

5

Hynek Kydlíček

@HKydlicek

15 hours

This is @karpathy 🐐 grade content on distributed training!

Ferdinand Mom

@FerdinandMom

16 hours

New Picotron video on distributed training is out! This time you will learn how Distributed Data Parallel from PyTorch works in theory and how to implement it yourself! Link to the video below 🧵

0

15

Hynek Kydlíček

@HKydlicek

1 day

@420_gunna @vwxyzjn @natolambert @hamishivi EM == string exact match, though usually it's rather quasi-exact match

0

1
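The difference between strict exact match and quasi-exact match mentioned in the reply above can be sketched roughly as follows. This is an illustration only: the `normalize` rules (lowercasing, dropping punctuation and English articles) are an assumption about what "quasi" typically means, not the normalization of any particular eval harness, and the function names are hypothetical.

```python
import re
import string


def exact_match(prediction: str, reference: str) -> bool:
    """Strict EM: the two strings must be identical character for character."""
    return prediction == reference


def normalize(text: str) -> str:
    """Hypothetical normalization: lowercase, drop punctuation and articles."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    # Collapse runs of whitespace left behind by the removals above.
    return " ".join(text.split())


def quasi_exact_match(prediction: str, reference: str) -> bool:
    """Quasi-EM: compare the strings after normalization."""
    return normalize(prediction) == normalize(reference)


if __name__ == "__main__":
    pred, ref = "The answer is 42.", "the answer is 42"
    print(exact_match(pred, ref))        # strict comparison
    print(quasi_exact_match(pred, ref))  # comparison after normalization
```

Note that this kind of string normalization is lossy for math: stripping punctuation would also turn "3.5" into "35", which is one reason string matching is a weak signal for numeric answers.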

Hynek Kydlíček

@HKydlicek

1 day

@ryanmart3n @NeginRaoof_ We haven't uploaded re-extracted answers yet and yes we re-extracted the numina1.5

1

0

2

Hynek Kydlíček

@HKydlicek

1 day

@vwxyzjn @natolambert @hamishivi 🤗 When I switched from EM to math-verify, scores hopped up by about 15% on most models

2

0

1

Hynek Kydlíček

@HKydlicek

1 day

There is pretty much nothing new in 0.5.2 compared to 0.5.1 😅, so that won't help. I am not sure where you sourced your questions/answers from, but at least for Numina I noticed that a lot of the ground truths are in very malformed LaTeX, so one way to increase recall is to re-extract the answers with an LLM. We have done that but haven't run the final ablation yet.

1

0

3

Hynek Kydlíček

@HKydlicek

1 day

@vwxyzjn @natolambert @hamishivi You should switch to math-verify, especially because EM on gsm8k sucks

1

0

1
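Why string EM undercounts on gsm8k-style numeric answers can be shown with a small stand-in. This sketch does not use math-verify and is not how that library works; it only compares answers as numbers with the standard library's `fractions.Fraction`, under the assumption that answers are plain numbers possibly decorated with "$" or thousands separators. The helper names are hypothetical.

```python
from fractions import Fraction
from typing import Optional


def to_number(answer: str) -> Optional[Fraction]:
    """Parse a plain numeric answer such as '$1,000', '0.5' or '1/2'.

    Returns None when the cleaned string is not a number.
    """
    cleaned = answer.strip().replace(",", "").replace("$", "").rstrip(".")
    try:
        return Fraction(cleaned)
    except (ValueError, ZeroDivisionError):
        return None


def numeric_match(prediction: str, reference: str) -> bool:
    """Compare by numeric value, falling back to string equality."""
    pred_value = to_number(prediction)
    ref_value = to_number(reference)
    if pred_value is None or ref_value is None:
        return prediction.strip() == reference.strip()
    return pred_value == ref_value


if __name__ == "__main__":
    for pred, ref in [("$1,000", "1000.00"), ("0.5", "1/2"), ("7", "8")]:
        print(pred, ref, pred == ref, numeric_match(pred, ref))
```

Pairs like "$1,000" vs "1000.00" are the same answer but fail a string comparison; a value-based check accepts them. Anything beyond plain numbers (symbolic expressions, LaTeX) is out of scope for this sketch.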

Hynek Kydlíček

@HKydlicek

1 day

@natolambert @vwxyzjn @hamishivi What are you using now for evals?

1

0

1

Hynek Kydlíček

@HKydlicek

1 day

RT @anton_lozhkov: LLM Reasoning labs will be eating good today🍔 We commandeered the HF cluster for a few days and generated 1.2M reasonin…

0

99

0

Hynek Kydlíček

@HKydlicek

3 days

@e__honig Just my copy+paste issue. Math-verify will convert it to sympy.Number(41)

0

1

Hynek Kydlíček

@HKydlicek

3 days

@Teknium1 I don't mind reviewing it, but I won't do it myself. Should be literally two lines of code tho

1

0

1

Hynek Kydlíček

@HKydlicek

3 days

RT @LoubnaBenAllal1: We just published the second OpenR1 update with OpenR1-220k-Math, our new large-scale dataset for mathematical reasoni…

0

60

0

Hynek Kydlíček

@HKydlicek

5 days

@Noahpinion Not true at all; the biggest moat is the inherent byproduct that you need to obtain during the creation of an LLM: infrastructure and data.

0

Hynek Kydlíček

@HKydlicek

5 days

@zsytony 🫡 god-speed

0

Hynek Kydlíček

@HKydlicek

6 days

@JFPuget Yeah I read that there non-contaminated* and my first comment

0

1

Hynek Kydlíček

@HKydlicek

6 days

Why non-contaminated* ? See this absolutely awesome analysis 👇

4

0

23