I'm excited to announce a new product I've been working on called
@braintrustdata
.
Braintrust helps innovative companies and developers ship higher quality AI products by making it easy to run evals.
🔉 on
OpenAI announced: Reproducible outputs
💥 This is a game changer for developers! It's now possible to actually evaluate and unit test your LLM apps! You can know that when a test passes locally, it'll pass for your team and in CI/CD.
1/3🧵
Over the past year, it's been an absolute joy getting to know
@mikeknoop
and build Braintrust with our friends at
@zapier
.
We worked together on a blog post that captures their workflow. If you want to build a world-class AI product, this is for you!
We evaluated Google's text-bison LLM against OpenAI's gpt-3.5-turbo on a SQL generation task in Braintrust.
Here's how they performed:
- finetuned-gpt3.5: 92.4%
- finetuned-bison: 84.2%
- gpt3.5: 78.7%
- bison: 74.8%
(We finetuned both models too!) Dig into the evals below:
.
@Retool
surveyed how companies are adopting AI. Some of the top challenges: model output accuracy, hallucinations, and prompt engineering.
Braintrust helps you solve these challenges: run evaluations, visualize and inspect your results, and experiment with prompts quickly.
Now, we can have much more fun evaluating our AI apps 🥳
Check out our docs on evaluating your AI app with Braintrust. It's easy to integrate Braintrust evals into your existing CI/CD workflow (links below)
We are super excited to partner with Liucija, Senior Data Scientist on the AI team at
@Hostinger
, as they work towards leveraging AI for use cases like customer support, website building, and more.
If you'd like to learn how Braintrust helped Hostinger:
- 3x the number of AI
Super fun to host an AIUX Demo Night last week at
@eladgil
's office! We are super excited about the future of UI/UX w/ AI and loved seeing what talented people are building. Thank you to everyone who came out and a special shoutout to our demoers 🙂
If you’re interested in coming
LlamaIndex just released Llama Datasets so you can easily benchmark RAG pipelines.
We contributed a help desk dataset with Coda so you can easily benchmark chat Q&A and support use cases.
Check it out on Llamahub
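A sketch of pulling it down with llama_index (the dataset class name here is an assumption; check LlamaHub for the exact one):

```python
from llama_index.llama_dataset import download_llama_dataset

# Dataset class name assumed from LlamaHub -- double-check it there.
rag_dataset, documents = download_llama_dataset(
    "BraintrustCodaHelpDeskDataset", "./data"
)
print(rag_dataset.to_pandas().head())
```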
The AI app development journey:
1. Start with a prototype and manually test
2. Get tired of manually testing
3. Evaluations enlightenment: add evaluations to your code
4. ???
5. App is in production. Users rave about your app
Braintrust makes it easy to evaluate your AI code.
🤩 New feature: text blocks in the playground!
These blocks just return a constant or variable value without any LLM call.
This makes it easy to:
- debug your prompts
- mock API responses and vectorDB calls
Don't get stuck manually inputting test cases into your LLM app after every prompt change.
Braintrust makes it easy to automatically evaluate and test your LLM apps.
We're hiring engineers :)
Do you love:
* building visualizations on text, images, and numbers that (re-)render in <100ms?
* searching/grouping billions of rows of semistructured text-heavy data in <200ms?
* grinding away LLM latency by any means necessary?
If so, LMK
👎 Before:
- your app generates different outputs on every test run
- if you use LLMs to grade outputs, those grades are also random on every run
👍 Now w/ reproducible outputs:
- your app generates consistent outputs even with temperature != 0
- your model-graded evals are consistent
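A minimal sketch of how this works, assuming the OpenAI Python SDK and a model that supports `seed` (the model name here is just an example):

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in your environment

# Passing the same `seed` with otherwise identical parameters makes
# completions best-effort deterministic, even at temperature > 0.
response = client.chat.completions.create(
    model="gpt-3.5-turbo-1106",  # example model that supports `seed`
    seed=42,
    temperature=0.7,
    messages=[{"role": "user", "content": "Write a haiku about unit tests."}],
)
print(response.choices[0].message.content)
# If `system_fingerprint` changes between runs, the backend config
# changed and determinism is no longer guaranteed.
print(response.system_fingerprint)
```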
Which LLM is the best at summarizing GitHub issues?
We informally tested to find:
GPT-4 > Mistral 7B > Claude 2.1 > GPT-3.5
It's easy to run evaluations with Braintrust using our eval libraries and AI proxy.
Check out the code below:
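Here's a hedged sketch of the proxy side: the endpoint URL and the non-OpenAI model slugs are assumptions from the docs, so double-check them.

```python
from openai import OpenAI

# Point the standard OpenAI client at the Braintrust AI proxy
# (URL assumed from the docs -- verify the current endpoint).
client = OpenAI(
    base_url="https://api.braintrust.dev/v1/proxy",
    api_key="YOUR_BRAINTRUST_API_KEY",
)

issue = "Bug: clicking 'Save' twice creates duplicate records ..."

# One OpenAI-style code path, many providers: only the model name changes.
for model in ["gpt-4", "claude-2.1", "mistral-7b-instruct", "gpt-3.5-turbo"]:
    reply = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": f"Summarize this GitHub issue: {issue}"}],
    )
    print(model, "->", reply.choices[0].message.content)
```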
⏰ We added duration stats to experiments!
See which test cases were faster or took longer.
There's a tradeoff between speed and quality. Use Braintrust to help you find the optimal balance 😇.
🎧🍌 New
@ThePeelPod
with
@EladGil
Stream the full episode here on X or links below
Timestamps:
03:46 Building cool monuments
09:12 Fixing education
16:38 Why AI is underhyped
19:02 Four trends to watch in AI
19:55 Why there aren’t large biotech companies
23:21 The current state
The LLM App Stack by a16z.
Validation is the most crucial step in building reliable, high-quality AI apps.
Braintrust helps you integrate evals to rapidly ship reliable AI.
😍 It's now so easy to use variables in our Playground.
We got tired of editing raw JSON so we upgraded our UI to support variable/object inputs better.
Simplify your evaluation scripts with Braintrust.
Just define 3 functions: data, task, and scores.
We do all the tedious optimizations like parallelizing requests for you.
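A minimal sketch of the Python shape (`generate_sql` is a stand-in for your app's code):

```python
from braintrust import Eval
from autoevals import Levenshtein

def generate_sql(question: str) -> str:
    # Stand-in for your app's LLM call.
    return "SELECT count(*) FROM users;"

Eval(
    "SQL Generator",  # project name in Braintrust
    data=lambda: [    # data: your test cases
        {
            "input": "How many users signed up last week?",
            "expected": "SELECT count(*) FROM users WHERE created_at > now() - interval '7 days';",
        }
    ],
    task=generate_sql,     # task: the code under test
    scores=[Levenshtein],  # scores: how outputs are graded
)
```

If the docs still match, saving this as eval_sql.py and running `braintrust eval eval_sql.py` should execute it and push results to the UI.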
@jerryjliu0
@llama_index
@FastAPI
Need an evaluation script for your AI app? We just opened a PR adding Braintrust to create-llama so you can test and evaluate the LLM calls in the templates.
We are excited to announce Braintrust is now SOC 2 Type II certified! We have supported enterprise customers from day 1, and achieving SOC 2 compliance is further validation of how seriously our team takes governance, risk, and compliance.
We are very excited Braintrust was featured in the inaugural Future 50! We are thankful for the recognition and can’t wait to continue supporting amazing AI teams.
I’m so excited to share the Future 50, a database of extraordinary, high-potential startups.
A few companies you'll learn about:
🚚 A trucking company doing $45M ARR
🧬 A biotech building "AWS for biology"
🇯🇵 Japan's answer to OpenAI
📈 A payments company that grew 20x in 18
Don't have an eval set already? Tired of writing scoring functions?
Our `autoevals` library makes it easy to grade your LLM outputs.
It includes prebuilt scoring functions:
• Model-based (using LLMs)
• Heuristic (e.g. Levenshtein distance)
• Statistical (e.g. BLEU)
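A quick sketch using the model-based `Factuality` grader (the shape follows the autoevals README; treat the exact metadata fields as assumptions):

```python
from autoevals.llm import Factuality

# Model-based grading: an LLM judges the output against the expected answer.
evaluator = Factuality()
result = evaluator(
    input="Which country has the highest population?",
    output="People's Republic of China",
    expected="China",
)
print(result.score)     # a value between 0 and 1
print(result.metadata)  # grader rationale, when available
```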
The Modern AI Stack by Menlo Ventures.
"Customers expect and deserve high-quality outputs, and enterprises are smart to be concerned that hallucinations could cause customers to lose trust."
Braintrust helps you integrate evals to rapidly ship AI without guesswork.
New cookbook on how to use the fantastic
@ragas_io
framework in Braintrust!
Among other things, the Braintrust implementation:
* Is available in both TS and Python
* Uses function calling (which substantially boosts performance)
* Is fully debuggable
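As a hedged sketch of the shape (not the cookbook's actual code), a Braintrust scorer is just a function that returns a score between 0 and 1:

```python
# Toy Ragas-style answer-similarity metric: token overlap between the
# model's output and the expected answer. A real implementation (like
# the cookbook's) would use an LLM judge instead of string overlap.
def answer_similarity(input, output, expected=None, **kwargs):
    out_tokens = set(output.split())
    exp_tokens = set((expected or "").split())
    if not exp_tokens:
        return None  # nothing to grade against
    return len(out_tokens & exp_tokens) / len(exp_tokens)
```

Drop it into `scores=[answer_similarity]` alongside any prebuilt autoevals scorers.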
It's so easy to manage test sets and datasets with Braintrust. We made a web UI for editing evals with your team so you don't need to make your own with Google Sheets/Retool. Our TS/Python library also...
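A hedged sketch of what that might look like in Python, assuming the SDK's `init_dataset` API:

```python
import braintrust

# Records inserted here and records edited in the web UI
# end up in the same shared dataset.
dataset = braintrust.init_dataset(project="SQL Generator", name="golden-questions")
dataset.insert(
    input="How many users signed up last week?",
    expected="SELECT count(*) FROM users WHERE created_at > now() - interval '7 days';",
)

for record in dataset:
    print(record["input"], "->", record["expected"])
```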