![Jeffrey π¬ confident-ai.com Profile](https://pbs.twimg.com/profile_images/1715004527760621568/O-j038qK_x96.jpg)
Jeffrey π¬ confident-ai.com
@jeffr_yyy
Followers
159
Following
282
Statuses
898
Cofounder @confident_ai, building @deepeval, ex-@Google, ex-@Microsoft
San Francisco
Joined September 2020
It's official! @confident_ai just went public on @ycombinator's Launch YC! Checkout how we're changing LLM evaluation, our open-source approach, and ROIs we're bringing for customers:
0
8
17
RT @KritinV78426: LLM evaluation has come a long wayβAI agent evaluation hasnβt. Most existing benchmarks (AgentBench, SWE-bench) focus onβ¦
0
2
0
RT @ycombinator: Subtrace lets developers resolve production issues faster. Unlike Sentry and other backend monitoring tools, @subtrace_deβ¦
0
8
0
RT @jeffr_yyy: It's official! @confident_ai just went public on @ycombinator's Launch YC! Checkout how we're changing LLM evaluation, ourβ¦
0
8
0
@simon_kubica @blackbirdvc @BainCapVC @indexplan congratz! hows this different from clickup? or is this just a linear killer?
0
0
0
And @deepeval was used for evals btw
NEW Β§ LLMs struggle with large amounts of context. Bharani Subramaniam and I explain how to mitigate this common RAG problem with a Reranker which takes the document fragments from the retriever, and ranks them according to their usefulness.
0
0
2
I just found this gemstone of an article written by @martinfowler. He shows how LLM applications (mainly RAG) are built and even use @deepeval in its evaluation code samples! Check if out here:
0
0
2