Tyler Chang

@tylerachang

Followers: 147
Following: 74
Statuses: 17

PhD student @UCSanDiego. He/him/his.

Joined June 2022
@tylerachang
Tyler Chang
2 months
We scaled training data attribution (TDA) methods ~1000x to find influential pretraining examples for thousands of queries in an 8B-parameter LLM over the entire 160B-token C4 corpus!
1
20
127
@tylerachang
Tyler Chang
2 months
RT @camrobjones: One of the major pieces of feedback that we got on the last Turing test is that it was "too easy" because it used a 2-play…
0
8
0
@tylerachang
Tyler Chang
2 months
And we hope you enjoy our paper: This work wouldn't have been at all possible without @dheerajgopal @tolgab0 @iislucas and @iftenney !
0
0
13
@tylerachang
Tyler Chang
3 months
RT @linguist_cat: ✨New pre-print!✨Successful language technologies should work for a wide variety of languages. But some languages have sys…
0
14
0
@tylerachang
Tyler Chang
3 months
@kanishkamisra congrats!!
1
0
2
@tylerachang
Tyler Chang
3 months
RT @linguist_cat: @tylerachang and my paper “When is Multilinguality a Curse?” was awarded outstanding paper! Thank you @emnlpmeeting ❤️
0
4
0
@tylerachang
Tyler Chang
5 months
RT @linguist_cat: Our paper “When is Multilinguality a Curse?” will be presented at #EMNLP2024! We found that multilingual data hurts high-…
0
9
0
@tylerachang
Tyler Chang
6 months
@linguist_cat @PennarEng @GooRee it also helps to add `prefix='[CLS]', do_sample=True` to the call to text_generator(), because the model is trained with a start-of-sequence CLS/BOS token
1
0
2
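The reply above suggests passing `prefix='[CLS]', do_sample=True` when calling `text_generator()`. A minimal sketch of that call pattern follows. It assumes `text_generator` is a callable that accepts keyword arguments the way a Hugging Face `transformers` text-generation pipeline does; the helper name `generate_with_bos` and the `GENERATION_KWARGS` constant are hypothetical, introduced only for illustration, and the model identifier in the usage comment is a placeholder, not a real name.

```python
from typing import Any, Callable

# Keyword arguments quoted in the reply: a '[CLS]' prefix (the model is said
# to be trained with a start-of-sequence CLS/BOS token) and sampling enabled.
GENERATION_KWARGS = {"prefix": "[CLS]", "do_sample": True}


def generate_with_bos(text_generator: Callable[..., Any], prompt: str, **extra: Any) -> Any:
    """Hypothetical helper: call text_generator with the suggested kwargs.

    Any extra keyword arguments are forwarded and override the defaults.
    """
    kwargs = {**GENERATION_KWARGS, **extra}
    return text_generator(prompt, **kwargs)


# Usage sketch (assumption: a transformers text-generation pipeline;
# "<goldfish-model-id>" is a placeholder to be replaced with a real model ID):
#
#   from transformers import pipeline
#   text_generator = pipeline("text-generation", model="<goldfish-model-id>")
#   print(generate_with_bos(text_generator, "Some prompt", max_new_tokens=20))
```

Wrapping the two arguments in one place keeps them from being forgotten on later calls; whether they improve output for a given model is something to check empirically.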
@tylerachang
Tyler Chang
6 months
@GooRee @linguist_cat benchmark results in our paper! Goldfish models on average have better perplexities (including for Welsh!) but slightly worse reasoning abilities than large multilingual models:
1
0
1
@tylerachang
Tyler Chang
6 months
RT @linguist_cat: Super excited to finally release the Goldfish models, joint work with @tylerachang. These are small, comparable models fo…
0
18
0
@tylerachang
Tyler Chang
1 year
RT @linguist_cat: New preprint with @tylerachang and Benjamin Bergen! We find that some languages need up to five times as much storage in…
0
3
0