Jiachen "Tianhao" Wang @JiachenWang97 profile

Jiachen "Tianhao" Wang

@JiachenWang97

Followers

173

Following

78

Statuses

50

PHD student @ Princeton Data Attribution, Data Selection, Privacy

Princeton, NJ

Joined August 2022

Don't wanna be here? Send us removal request.

Jiachen "Tianhao" Wang

@JiachenWang97

2 months

Here are all three parts of the slides for "Advancing Data Selection for Foundation Models: From Heuristics to Principled Methods" tutorial at NeurIPS yesterday! Part 1 (Intro + Empirical Methods): Part 2 (Principled Methods): Part 3 (Foundations): We will also summarize all relevant materials (including those suggested by the audiences) and publish them after the conference. Thanks to everyone who attended yesterday!

Ruoxi Jia

@ruoxijia

2 months

Here’s my slide deck from the tutorial. Thanks to everyone who attended yesterday - it was a super rewarding process to prepare for this!

0

1

13

Jiachen "Tianhao" Wang

@JiachenWang97

6 days

RT @ruoxijia: Submission deadline AoE today for our Workshop on Data Problems for Foundation Models! Look forward to your contributions!

0

3

0

Jiachen "Tianhao" Wang

@JiachenWang97

6 days

RT @reds_tiger: Announcing the ICLR 2025 Workshop on Data Problems for Foundation Models (DATA-FM)! We welcome submissions exploring ALL AS…

0

2

0

Jiachen "Tianhao" Wang

@JiachenWang97

17 days

Hope to see you in Singapore!

0

1

Jiachen "Tianhao" Wang

@JiachenWang97

2 months

I will be presenting our NeurIPS spotlight work on gradient-based online data selection for LLMs today (12/12) at 4:30-7:30pm PST (East Hall #4400). Big thanks to my amazing collaborators @ruoxijia @TongWu_Pton @dawnsongtweets @prateekmittal_ Please feel free to come by and discuss any data-related research problems! #NeurIPS2024

1

8

54

Jiachen "Tianhao" Wang

@JiachenWang97

2 months

RT @KoMyeongseob: Excited to attend #NeurIPS2024 in Vancouver 🇨🇦! I will be presenting our work: "Boosting Alignment for Post-Unlearning T…

0

6

0

Jiachen "Tianhao" Wang

@JiachenWang97

2 months

Just arrived in Vancouver! @ruoxijia @lschmidt3 and I will present a tutorial on data selection for foundation model tomorrow (12/10) at 1:30pm in West Ballroom C. Come by and say hi! And feel free to DM me to discuss any data-related research!

Ruoxi Jia

@ruoxijia

2 months

Join us Tuesday at 1:30 PM PT at #NeurIPS2024 for our tutorial on data selection for foundation models! With @lschmidt3 & @JiachenWang97, we'll cover principled experimentation, selection algorithms, a unified theoretical framework, and open challenges. Hope to see you there!

0

5

Jiachen "Tianhao" Wang

@JiachenWang97

3 months

RT @profnaren: Small matter of @virginia_tech pride! Google Scholar turns 20 today 🎉🎉🎉 Kudos to its creators, Anurag Acharya and Alex Ver…

0

9

0

Jiachen "Tianhao" Wang

@JiachenWang97

3 months

Flying to San Diego for the Rising Stars in Data Science workshop on Nov 14-15! @HDSIUCSD @StanfordData @DSI_UChicago If you are at UCSD, feel free to DM me and chat about data-related research problems! #MachineLearning #DataScience

0

9

Jiachen "Tianhao" Wang

@JiachenWang97

3 months

RT @si_chen0921: If you're interested in LLM fact tracing and information retrieval, join me as I present our work, FASTTRACK at Session F…

0

3

0

Jiachen "Tianhao" Wang

@JiachenWang97

3 months

Thanks so much for featuring our research on scaling up principled data attribution techniques. Data attribution is important for ML interpretability, data curation, and fairly compensating data providers. New papers on this line are coming soon!

Parth Sharma

@parthshr370

3 months

Shapley Values can help you understand how data point contribute to the output. An addition to this principle we have In-Run Data shapley a paper by @JiachenWang97 @prateekmittal_ @dawnsongtweets @ruoxijia Here is my blog ex - inspired by - @joemelko

1

6

Jiachen "Tianhao" Wang

@JiachenWang97

4 months

RT @ruoxijia: New paper led by @feiyang_ml on optimizing LLM pre-training data mix - one of the most fun projects! Takeaways: (1) Optimal…

0

3

0

Jiachen "Tianhao" Wang

@JiachenWang97

5 months

RT @Fanghui_SgrA: Our fine-tuning workshop@NeurIPS’24 @neurips24fitml has the following amazing speakers and panelists! Welcome to submit y…

0

7

0

Jiachen "Tianhao" Wang

@JiachenWang97

6 months

RT @hjy836: 📢Announcing the first GenAI4Health Workshop at #NeurIPS2024 where we invite speakers and participants from #health, #AI_safety,…

0

16

0

Jiachen "Tianhao" Wang

@JiachenWang97

7 months

RT @thegautamkamath: "Position: Considerations for Differentially Private Learning with Large-Scale Public Pretraining," with @florian_tram…

0

30

0

Jiachen "Tianhao" Wang

@JiachenWang97

7 months

It's in 10min! Check out our poster in today's morning session at #2517!

Jiachen "Tianhao" Wang

@JiachenWang97

7 months

Excited to be attending #ICML next week! I will give an oral presentation on our work about the theoretical foundation of Data Shapley for data curation. Happy to discuss data curation, data attribution, and all related topics. Feel free to DM for a coffee chat in Vienna! ☕️

0

9