Jeffrey 🐬 confident-ai.com Profile

@jeffr_yyy

Followers
159
Following
282
Statuses
898

Cofounder @confident_ai, building @deepeval, ex-@Google, ex-@Microsoft

San Francisco
Joined September 2020
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
6 hours
It's official! @confident_ai just went public on @ycombinator's Launch YC! Check out how we're changing LLM evaluation, our open-source approach, and the ROI we're bringing to customers:
0
8
17
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
27 minutes
@iam_abdhamid @confident_ai @ycombinator Keen to be providing the resources for it!
0
0
1
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
3 hours
The o1 models overthink at times; they're not necessarily better than GPT-4o, just different
0
0
1
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
4 hours
RT @KritinV78426: LLM evaluation has come a long way; AI agent evaluation hasn't. Most existing benchmarks (AgentBench, SWE-bench) focus on…
0
2
0
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
5 hours
RT @ycombinator: Subtrace lets developers resolve production issues faster. Unlike Sentry and other backend monitoring tools, @subtrace_de…
0
8
0
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
6 hours
RT @jeffr_yyy: It's official! @confident_ai just went public on @ycombinator's Launch YC! Check out how we're changing LLM evaluation, our…
0
8
0
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
7 hours
Uploading our Launch YC video to the exact YouTube Creator Studio page I built back when I was at Google is an interesting experience
0
0
2
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
7 hours
from personal experience, research-based OS projects cannot live solely off of their research paper
0
0
0
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
14 hours
@simon_kubica @blackbirdvc @BainCapVC @indexplan congrats! how's this different from ClickUp? or is this just a Linear killer?
0
0
0
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
14 hours
And @deepeval was used for evals btw
@martinfowler
Martin Fowler
14 hours
NEW § LLMs struggle with large amounts of context. Bharani Subramaniam and I explain how to mitigate this common RAG problem with a Reranker, which takes the document fragments from the retriever and ranks them according to their usefulness.
0
0
2
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
16 hours
@WaseemElahi You got me, will do!
0
0
1
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
1 day
@WaseemElahi What if we're broke
2
0
0
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
1 day
As startup founders, how do you think about competition and price wars? Is there a tendency to race to the bottom?
0
0
1
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
1 day
@life_is_a_model 🤫🤫🤫 (i call it investing)
1
0
1
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
1 day
I just found this gem of an article written by @martinfowler. He shows how LLM applications (mainly RAG) are built and even uses @deepeval in its evaluation code samples! Check it out here:
0
0
2
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
1 day
You know ChatGPT is about to cook when its response starts with “Unfortunately”
0
0
0
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
2 days
Visiting Palo Alto, it was a pleasure to visit the @ollama office and the amazing team there
1
0
11
@jeffr_yyy
Jeffrey 🐬 confident-ai.com
2 days
@topazlabs The color changed a lil I see
0
0
0