Goodfire @GoodfireAI profile

Goodfire

@GoodfireAI

Followers: 2K

Following: 136

Statuses: 85

Advancing humanity's understanding of AI through interpretability research. Building the future of safe and reliable AI systems.

San Francisco

Joined August 2024

Goodfire

@GoodfireAI

2 months

Scaling interpretability work for frontier AI alignment has never been more important. That’s why we’re launching Ember - the first hosted mechanistic interpretability API with fast inference support for generative models like Llama 3.3 70B.

16

90

554

Goodfire

@GoodfireAI

15 days

RT @leedsharkey: Big new review! 🟦Open Problems in Mechanistic Interpretability🟦 We bring together perspectives from ~30 top researchers…

0

86

0

Goodfire

@GoodfireAI

21 days

Job listings here: We encourage you to apply even if you may not meet all of the requirements!

0

1

10

Goodfire

@GoodfireAI

25 days

RT @livgorton: We (@GoodfireAI) are hiring research fellows! Early career researchers and engineers can join us for ~2 months to work on an…

0

24

0

Goodfire

@GoodfireAI

27 days

RT @apartresearch: Myra Deng: Goodfire's Interpretability Tools in Action Our Community Manager @varchanaiyer_ interviews @GoodfireAI's @m…

0

3

0

Goodfire

@GoodfireAI

30 days

@posedscaredcity @Sauers_ Feel free to send us a DM with more info on the issue you're running into!

1

0

1

Goodfire

@GoodfireAI

1 month

Launch post: Open source SAEs: Ember API/SDK for accessing SAEs:

Goodfire

@GoodfireAI

1 month

We're open-sourcing Sparse Autoencoders (SAEs) for Llama 3.3 70B and Llama 3.1 8B! These are, to the best of our knowledge, the first open-source SAEs for models at this scale and capability level.

1

6

32

Goodfire

@GoodfireAI

1 month

4/ We're excited to see how the community builds on these foundations. If you're working on model interpretability or interested in applying similar approaches to your AI systems, we'd love to hear from you!

0

26

Goodfire

@GoodfireAI

1 month

@ady_mehtaa @Contrary_Res Send it over to myra@goodfire.ai, thanks!

0

1

Goodfire

@GoodfireAI

2 months

@IntuitMachine We've launched publicly with support for Llama 3.3 70B:

0

2

Goodfire

@GoodfireAI

2 months

@victormustar Our API/SDK is publicly available if you want to try out more visualizations! Check out

0

Goodfire

@GoodfireAI

2 months

@Ahmad_Al_Dahle Our team sprinted to launch Ember - a hosted interpretability API - on Llama 3.3 70B! Check out

0

2

Goodfire

@GoodfireAI

2 months

@voooooogel

0

4

Goodfire

@GoodfireAI

2 months

We believe that accessible interpretability tools on powerful models enable both new research and applications. We plan to open source both our Llama 3.1 8B and Llama 3.3 70B SAEs and release a detailed research report on our steering work in January.

0

13