gen_analysis Profile Banner
General Analysis Profile
General Analysis

@gen_analysis

Followers
54
Following
70
Statuses
11

Automated AI Safety and Red Teaming Tools— Backed by @ycombinator

San Francisco
Joined January 2025
Don't wanna be here? Send us removal request.
@gen_analysis
General Analysis
6 days
Read our recent blogpost on how our team found vulnerabilities in GPT-4o, o1, and o3 guardrails. The consistent success of our method highlights the need for developing robust, automated testing frameworks to safeguard AI systems.
Tweet media one
0
1
3
@gen_analysis
General Analysis
6 days
This is why we need stress testing for LLMs. Finding subtle ways where these models fail requires careful analysis and engineering. Pulse is doing a great job at this.
@ritvikpandey21
Ritvik Pandey
6 days
Document ingestion & Gemini 2.0 caused a lot of buzz this week. As builders in this space, here's our take: Data ingestion is a multistep pipeline, and maintaining confidence from LLM nondeterministic outputs over millions of pages is a problem. (1/7)
Tweet media one
0
0
3
@gen_analysis
General Analysis
10 days
LLM-powered search is fast, but is it resilient? We’re digging into systems like Perplexity and Deep Research to find where they break—and why. Stay tuned.
@OpenAI
OpenAI
10 days
Deep Research Live from Tokyo 4pm PT / 9am JST Stay tuned for link to livestream.
0
0
0
@gen_analysis
General Analysis
14 days
@kanat_sh Correct! At least if you’re our customer.
0
0
1
@gen_analysis
General Analysis
15 days
RT @gen_analysis: Latest Update: ⛓️ Uncovering Hallucinations in GPT-4o for Legal AI We've recently completed an extensive red-teaming stud…
Tweet media one
0
41
0
@gen_analysis
General Analysis
16 days
RT @sjgadler: Some personal news: After four years working on safety across @openai, I left in mid-November. It was a wild ride with lots o…
0
225
0
@gen_analysis
General Analysis
16 days
RT @founderjournals: 🚀 @gen_analysis launched! Finding Failure Modes for AI Models "Acquiring software products and replacing human teams…
0
1
0
@gen_analysis
General Analysis
20 days
Jailbroken: Read our latest report on how GPT 4 hallucinates on legal questions!
Tweet media one
0
5
12
@gen_analysis
General Analysis
20 days
General Analysis just launched on @ycombinator's Launch YC! General Analysis: Finding Failure Modes for AI Models. Check them out:
0
1
6