![All Hands AI Profile](https://pbs.twimg.com/profile_images/1856767771406680065/-kb8Giy9.png)
All Hands AI
@allhands_ai
Followers
3K
Following
180
Media
13
Statuses
161
We build AI software development agents, in the open. Developing OpenHands (prev: OpenDevin): https://t.co/wDOBeXGLmO
Joined May 2024
In this work with UC Berkeley and CMU, we found that o1 underperforms Claude when used in OpenHands. We're still digging into details, but we thought it'd be interesting to share our intermediate results now!. Reasoning is not a panacea for agents, other elements are necessary.
Surprising find: OpenAI's O1 - reasoning-high only hit 30% on SWE-Bench Verified - far below their 48.9% claim. Even more interesting: Claude achieves 53% in the same framework. Something's off with O1's "enhanced reasoning". π§΅1/8
2
8
62
We are proud to announce that All Hands has raised $5M to build the worldβs best software development agents, and do it in the open π. Thank you to @MenloVentures and our wonderful slate of investors for believing in the mission!.
3
6
59
Last week we announced that OpenHands is the strongest software developer in the world. Today we're officially announcing an online app that makes OpenHands easier to use than ever: It's currently in beta and rolling out quickly, sign up today!.
We just released, open source, the strongest AI software developer to date:. π₯ 53% resolve rate on SWE-Bench Verified.π₯ 41.7% resolve rate on SWE-Bench Lite. Details: π§΅β
5
11
56
Say hello to our new name, OpenHands!. If you'd like to help us build software development agents in an open and collaborative way:.* Join our open source community: * Apply for our open positions:
Announcement: the maintainers team of the OpenDevin open AI software development agent have decided to rename the project to OpenHands π. We have also made a big 0.9.0 release with a number of new features, read below for details.
1
5
31
π£Announcing OpenHands 0.10.0!π£. Our biggest release in a while with.- A brand new UI .- Ability to connect to Github projects w/ tokens.- Getting started examples.- Many resiliancy improvements.- Support for running sandboxes on @modal_labs .
2
4
30
Big release of OpenHands 0.15.0 π (so big we need two posts π). 1/2.* More efficient web browsing through markdown.* A web browsing alignment checker to prevent dangerous web actions (w/ @invariant_labs).* Custom docker images for the github resolver.
2
1
20
Interesting use case for OpenHands, automatically curating datasets for building vision-language models π§βπ¬.
@AdjectiveAlli i gave it a HF token and told it to create an art style eval dataset, came back to this
0
2
19
Merry Christmas everyone π. You too can have your own AI software engineer for Christmas:. * Download it now: * Sign up for the online version: If you already use OpenHands, what do you want in the new year?.
@allhands_ai This is a great project, great Christmas present. I'm having a lot of fun.
2
1
19
See how we implemented a security verifier in our web browsing agent, preventing possibly dangerous web actions in partnership with @InvariantLabsAI.
With (web) agents on everyone's mind, check out our latest blog post (link in thread) on browser agent safety guardrails. We replicate and defend against attacks on the @allhands_ai web agent, preventing it from generating harmful content and falling for harmful requests.
0
3
18
A new course from @weights_biases features some of our learnings on agent evaluation in OpenHands, as well as a demo of OpenHands autonomously running an experiment and doing evaluation with their new product Weave π.
We're excited to announce the LLM Apps: Evaluation course is now LIVE! π. Created in collaboration with guest experts @DynamicWebPaige and @gneubig, this course equips you with the skills needed to build trustworthy evaluations for your GenAI apps. Ready to skill up? π.
4
2
15
Introducing OpenHands 0.14.3π. Exciting updates:.1. One-button click to push to github.2. Incorporation of the commit0 benchmark for building apps from scratch (thanks @wzhao_nlp -- benchmark results soon!).3. Many other smaller robustness improvements
1
3
10
Thanks @tryolabs for naming OpenHands one of the top 10 open-source AI/ML/Data projects of 2024! We started in March 2024, and we're just getting starts π.
π€ Top Python Libraries 2024: AI / ML / Data Edition. Continuing our 10th anniversary celebration with 10 cutting-edge tools reshaping the AI landscape. Here are our top 5 picks that are revolutionizing how we build AI-powered applications. Let's dive.
0
1
9
That's it! We're extremely excited to have state-of-the-art open agents for development, and we're even more excited to have you join our quest to build them!.- Contribute to the open source: - Apply to work with us at @allhands_ai:
1
0
7
We're excited to be part of this amazing cohort of startups! Let's go π.
Excited to announce the first 18 startups of the Menlo's Anthology Fund with Anthropic!. Anthology Batch One. In the last 6mos, we looked through 1000s of startups to get to these. CRUD business apps will collapse in the AI era, logic will shift to AI agents.
0
0
6
Thanks so much to all the contributors, especially the new ones: @jeevaramanathan, @KLieret, @peywalt, Vaishakh-SM, amantyagiprojects, adityasoni9998, AlexCuadron, Ethan0456 π.
0
0
4
@ToivoMattila @gneubig For example, write "please write and run a flask server that does X" or "please run my react code with npm run dev", and it should show up here. This is a new feature though, so please report if there are any issues!.
0
0
4
@ToivoMattila Thanks a lot for trying OpenHands out! We're actually writing a blog post right now with some examples of the various things you can do with OpenHands. We'll try to post it by Wednesday here:
1
0
3
@domoritz @charliermarsh @gneubig Yep! If you give OpenHands ( a concrete prompt telling it what you want to do it should be able to make the edits, then iteratively run the tests until it works. Happy to help you try it out if you want π.
0
0
2
@MatthewBerman Thanks for the shout-out @MatthewBerman ! We'd love to have people try it and ping us with feedback, or join the community.
0
0
2
@rahulvrane Definitely! We're working on multi-session support for the online version, and saving and restarting sessions will be part of this.
0
0
2
@drivelinekyle @DrivelineBB Glad you like it! Please feel free to reach out anytime with suggestions or feature requests. Also happy to add you to our online app beta so you can easy run multiple sessions in parallel, etc. In that case just DM your github username.
1
0
2
@meanderingexile Yeah, we do it all the time on the OpenHands repo! If you get merge conflicts, just ask OpenHands to resolve them π.
0
0
2
@AlexTobiasDev @gaurav_dhiman @MatthewBerman Fair enough! Feedback noted on the "way too many tokens" part.
1
0
1
@WKarsens Yeah, it's an area that we're actively working on actually! Lots of room for experimentation here.
0
0
1
@DominikSeifert4 @cognition_labs @cursor_ai Interesting idea π This'll be a fun thing to try in OpenHands, and of course we welcome contributions!.
0
0
1
@RichAC2020 @gneubig All Hands can access web sites in two ways:.1. It can read individual web pages's content so the agent can reference them in its work.2. It can write programs to scrape web sites. It sounds like you might want "2.", which is definitely possible.
0
0
1
@thoughtisdead Great, glad you liked it! Feel free to ping us on github or slack if you have any issues.
1
0
1
@call_me_Nithin Yes, it's the default agent in the github repo!. (Actually we're on CodeAct version 2.2 now, with the improvements in 2.1 plus native support for web browsing).
0
0
1