Jeffrey 🐬 confident-ai.com @jeffr_yyy profile

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

Followers

159

Following

282

Statuses

898

Cofounder @confident_ai, building @deepeval, ex-@Google, ex-@Microsoft

San Francisco

Joined September 2020

Don't wanna be here? Send us removal request.

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

6 hours

It's official! @confident_ai just went public on @ycombinator's Launch YC! Checkout how we're changing LLM evaluation, our open-source approach, and ROIs we're bringing for customers:

0

8

17

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

27 minutes

@iam_abdhamid @confident_ai @ycombinator Keen to be providing the resources for it!

0

1

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

3 hours

The o-1 models overthink too much at times, not necessarily better than got-4o, just different

0

1

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

4 hours

RT @KritinV78426: LLM evaluation has come a long way—AI agent evaluation hasn’t. Most existing benchmarks (AgentBench, SWE-bench) focus on…

0

2

0

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

5 hours

RT @ycombinator: Subtrace lets developers resolve production issues faster. Unlike Sentry and other backend monitoring tools, @subtrace_de…

0

8

0

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

6 hours

RT @jeffr_yyy: It's official! @confident_ai just went public on @ycombinator's Launch YC! Checkout how we're changing LLM evaluation, our…

0

8

0

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

7 hours

Uploading our launch yc video to the exact page on youtube creators studio i've built back when i was at google is an interesting experience

0

2

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

7 hours

from personal experience, research-based OS projects cannot live solely off of their research paper

0

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

14 hours

@simon_kubica @blackbirdvc @BainCapVC @indexplan congratz! hows this different from clickup? or is this just a linear killer?

0

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

14 hours

And @deepeval was used for evals btw

Martin Fowler

@martinfowler

14 hours

NEW § LLMs struggle with large amounts of context. Bharani Subramaniam and I explain how to mitigate this common RAG problem with a Reranker which takes the document fragments from the retriever, and ranks them according to their usefulness.

0

2

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

16 hours

@WaseemElahi You got me, will do!

0

1

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

1 day

@WaseemElahi What if we're broke

2

0

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

1 day

As startup founders, how do you think about competition and price wars? Is there a tendency to race to the bottom?

0

1

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

1 day

@life_is_a_model 🤫🤫🤫 (i call it investing)

1

0

1

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

1 day

I just found this gemstone of an article written by @martinfowler. He shows how LLM applications (mainly RAG) are built and even use @deepeval in its evaluation code samples! Check if out here:

0

2

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

1 day

You know ChatGPT is about to cook when its response starts with “Unforunately”

0

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

2 days

Visiting Palo Alto, was pleasure to visit the @ollama office and the amazing team there

1

0

11

Jeffrey 🐬 confident-ai.com

@jeffr_yyy

2 days

@topazlabs The color changed a lil I see

0