Hynek Kydlíček Profile
Hynek Kydlíček

@HKydlicek

Followers
501
Following
296
Statuses
414

MLE @huggingface 🤗 Prague, CZ 🇪🇺 eu/acc

Czech Republic
Joined December 2021
Don't wanna be here? Send us removal request.
@HKydlicek
Hynek Kydlíček
4 months
When we began expanding FineWeb into multilingual space 🌎, we quickly noticed a lack of established benchmarks. Thus, we created FineTasks with two goals in mind: A. To encompass various LLM capabilities (knowledge, reasoning, etc.). B. To provide a reliable signal 📈.
Tweet media one
1
3
20
@HKydlicek
Hynek Kydlíček
6 hours
@gui_penedo @Thom_Wolf 🤨 How much is that in FineWebs?? That's the only unit I can image.
0
0
0
@HKydlicek
Hynek Kydlíček
7 hours
Thanks for the AIME25 results
0
0
0
@HKydlicek
Hynek Kydlíček
14 hours
@lvwerra Le Onardo could secure even the future Italian funding tho
0
0
5
@HKydlicek
Hynek Kydlíček
15 hours
This is @karpathy 🐐 grade content on distributed training!
@FerdinandMom
Ferdinand Mom
16 hours
New Picotron video on distributed training is out ! This time you will learn how Distributed Data Parallel from Pytorch works in theory and how to implement it yourself ! Link of the video below 🧵
Tweet media one
0
0
15
@HKydlicek
Hynek Kydlíček
1 day
@420_gunna @vwxyzjn @natolambert @hamishivi EM == string Exact match, usually it's rather quasy exact match tho
0
0
1
@HKydlicek
Hynek Kydlíček
1 day
@ryanmart3n @NeginRaoof_ We haven't uploaded re-extracted answers yet and yes we re-extracted the numina1.5
1
0
2
@HKydlicek
Hynek Kydlíček
1 day
@vwxyzjn @natolambert @hamishivi 🤗 When I switched from EM to math-verify we hoped by about 15% on most models
2
0
1
@HKydlicek
Hynek Kydlíček
1 day
There is pretty much nothing new in 0.5.2, compared to 0.5.1😅, so that won't help. I am not sure what you sourced your question/answers from, but at least for Numina I noticed that a lot of ground-truths are in very malformed latex, so one way to increased the recall is to re-extract the answers with LLM. We have done that but haven't run the final ablation yet.
1
0
3
@HKydlicek
Hynek Kydlíček
1 day
@vwxyzjn @natolambert @hamishivi You should switch to math-verify especially because of em on gsm8k sucks
1
0
1
@HKydlicek
Hynek Kydlíček
1 day
@natolambert @vwxyzjn @hamishivi What are you using now for evals?
1
0
1
@HKydlicek
Hynek Kydlíček
1 day
RT @anton_lozhkov: LLM Reasoning labs will be eating good today🍔 We commandeered the HF cluster for a few days and generated 1.2M reasonin…
0
99
0
@HKydlicek
Hynek Kydlíček
3 days
@e__honig Just my copy+paste issue. Math-verify will convert it to sympy.Number(41)
0
0
1
@HKydlicek
Hynek Kydlíček
3 days
@Teknium1 I don't mind to review it, but won't do it myself. Should be literally two lines of code tho
1
0
1
@HKydlicek
Hynek Kydlíček
3 days
RT @LoubnaBenAllal1: We just published the second OpenR1 update with OpenR1-220k-Math, our new large-scale dataset for mathematical reasoni…
0
60
0
@HKydlicek
Hynek Kydlíček
5 days
@Noahpinion Not true at all; the biggest moat is the inherent byproduct that you need to obtain during the creation of LLM: infrastructure and data.
0
0
0
@HKydlicek
Hynek Kydlíček
5 days
@zsytony 🫡 god-speed
0
0
0
@HKydlicek
Hynek Kydlíček
6 days
@JFPuget Yeah I read that there non-contaminated* and my first comment
0
0
1
@HKydlicek
Hynek Kydlíček
6 days
Why non-contaminated* ? See this absolutely awesome analysis 👇
4
0
23