Colin Raffel @colinraffel profile

Colin Raffel

@colinraffel

Followers

31K

Following

3K

Statuses

2K

https://t.co/mStEmzIJH0

Joined March 2017

Colin Raffel

@colinraffel

1 year

📢Life update:📢 I moved to Toronto, where I'm now an associate professor at the University of Toronto and an associate research director at the Vector Institute. I wrote a blog post about the long winding path that led me here:

128

43

1K

Colin Raffel

@colinraffel

7 days

RT @Ar_Douillard: now that ICML deadline is over, time to submit to the MCDC workshop for ICLR!

0

8

0

Colin Raffel

@colinraffel

26 days

RT @Ar_Douillard: Workshop alert 🚨 We'll host in ICLR 2025 a workshop on modularity, encompassing collaborative + decentralized + continua…

0

38

0

Colin Raffel

@colinraffel

2 months

Application link:

0

10

Colin Raffel

@colinraffel

2 months

@giffmana I dunno, back in the day during my PhD we also had gaming desktop machines with gaming GPUs in our lab and we also called them "servers". I think any computer that is used for long-running jobs/experiments that you mainly use by ssh'ing into should be called a "server".

1

0

10

Colin Raffel

@colinraffel

3 months

Application link: Please share widely.

0

9

Colin Raffel

@colinraffel

3 months

RT @AdaptiveML: Instead of mitigating length bias in LLM-as-judge, what if you could simply 🙋ask models to output comparisons of the same l…

0

3

0

Colin Raffel

@colinraffel

3 months

@JayAlammar We have always referred to this diagram as the "octopus". I used to keep an informal list of all of the papers that had an octopus-style diagram in them.

0

1

22

Colin Raffel

@colinraffel

3 months

RT @mciccone_AI: 🚨 Life update 🚨 I moved to Toronto 🇨🇦and joined @VectorInst as a Postdoctoral Fellow to work with @colinraffel and his lab…

0

4

0

Colin Raffel

@colinraffel

3 months

RT @prateeky2806: I'm on the job market! Please reach out if you are looking to hire someone to work on - RLHF - Efficiency - MoE/Modul…

0

59

0

Colin Raffel

@colinraffel

6 months

RT @prateeky2806: We just released our survey on "Model MoErging", But what is MoErging?🤔Read on! Imagine a world where fine-tuned model…

0

45

0

Colin Raffel

@colinraffel

10 months

RT @arankomatsuzaki: 🚀 Introducing Pile-T5! 🔗 We (EleutherAI) are thrilled to open-source our latest T5 model trained on 2T tokens from th…

0

109

0

Colin Raffel

@colinraffel

1 year

@madiator Good question. I think 1) bandwagonism/inertia (it's anti-zeitgeist) and 2) it works well for classification tasks and is less proven for open-ended generation. But I've heard T-few has been implemented and is in use by various LLM startups, they just don't advertise it as such.

0

2

Colin Raffel

@colinraffel

1 year

RT @ada_rob: I love music most when it’s live, in the moment, and expressing something personal. This is why I’m psyched about the new “DJ…

0

104

0

Colin Raffel

@colinraffel

1 year

RT @AlbalakAlon: {UCSB|AI2|UW|Stanford|MIT|UofT|Vector|Contextual AI} present a survey on🔎Data Selection for LLMs🔍 Training data is a clos…

0

77

0

Colin Raffel

@colinraffel

1 year

@jeremyphoward @Muqeeth10 @liu_haokun Lots more work coming from us along these lines! Would love to sync up sometime.

1

0

Colin Raffel

@colinraffel

1 year

@sivil_taram Thank you! It took us a long time - turns out to be challenging in the zero-shot setting. The LoraHub approach makes a lot of sense in the few-shot setting. Ultimately both settings are very important!

1

0

5