Siddharth Sharma Profile Banner
Siddharth Sharma Profile
Siddharth Sharma

@siddrrsh

Followers
2,965
Following
2,761
Media
31
Statuses
999
Explore trending content on Musk Viewer
Pinned Tweet
@siddrrsh
Siddharth Sharma
4 months
Introducing ambientGPT: an open-source and multimodal MacOS foundation model GUI Run GPT-4o and open-source models with full ambient knowledge of your screen. Foundation models have long been confined to the browser. With ambientGPT, your screen context is directly inferred as
32
90
587
@siddrrsh
Siddharth Sharma
4 months
Introducing Gemma with a 10M context window We feature: • 1250x context length of base Gemma • Requires less than 32GB of memory • Infini-attention + activation compression Check us out on: • 🤗: • GitHub: • Technical
Tweet media one
44
146
1K
@siddrrsh
Siddharth Sharma
4 months
how are people getting cs degrees without doing parallel computing.
21
13
341
@siddrrsh
Siddharth Sharma
3 months
Re Llama3V: Firstly, we want to apologize to the original authors of MiniCPM. @AkshGarg03 and I posted Llama3V with @mustafaaljadery . Mustafa wrote the code for the project. Aksh and I were both excited about multimodal models and liked the architectural extensions on top of
@yangzhizheng1
PrimerYang
3 months
Shocked! Llama3-V project from a Stanford team plagiarized a lot from MiniCPM-Llama3-V 2.5! its code is a reformatting of MiniCPM-Llama3-V 2.5, and the model's behavior is highly similar to a noised version of MiniCPM-Llama3-V 2.5 checkpoint. Evidence:
Tweet media one
Tweet media two
Tweet media three
36
167
896
50
42
268
@siddrrsh
Siddharth Sharma
4 months
There's no better feeling than building something and realizing that something new now exists because you brought it to life.
4
6
140
@siddrrsh
Siddharth Sharma
5 months
So much alpha in research papers. Entire companies could be built around implementing certain ML papers re distributed training and robustness.
6
2
132
@siddrrsh
Siddharth Sharma
6 months
Mustafa ( @maxaljadery ) and I are excited to announce MLXserver: a Python endpoint for downloading and performing inference with open-source models optimized for Apple metal ⚙️ Docs:
6
5
106
@siddrrsh
Siddharth Sharma
2 years
Built a better way to search for classes at Stanford with @punwaiw - 📕 Powered by the latest @openai embeddings and @pinecone vector search db 🪄
15
6
102
@siddrrsh
Siddharth Sharma
2 years
Vercel is epic.
3
3
96
@siddrrsh
Siddharth Sharma
6 months
Building on top of last week’s release, we introduce mlxcli. Build on top of Apple MLX ( @awnihannun )  and 🤗 ( @reach_vb @julien_c ) with mlxcli. Usage: pip install mlxcli Docs: MLXcli achieves over 20+ tok/sec on M2 Mac’s
2
17
93
@siddrrsh
Siddharth Sharma
5 months
The future is built at Green library. 🌲
Tweet media one
9
2
89
@siddrrsh
Siddharth Sharma
1 year
Need a way to keep up with the Cambrian explosion in AI research? Built a better way to find and bookmark the latest papers with @apatwa7 - . We enable rapid search and QA for over 240,000 ML papers. Built with @supabase , @LangChainAI , @pinecone , @vercel
Tweet media one
1
5
65
@siddrrsh
Siddharth Sharma
4 months
Don't think in USD, think in dollars per hour for 8x NVIDIA H100 SXM with 80 GB VRAM/GPU, 208 vCPUs, 1800 GiB RAM, and 26 TiB SSD storage. For the price of a vanilla latte, you could probably get some gradient updates.
6
2
60
@siddrrsh
Siddharth Sharma
6 months
Excited to be a part of this team! The best is yet to come.
@mlfoundry
Foundry
6 months
We're excited to announce $80M in seed and Series A funding co-led by @sequoia and @lightspeedvp to further our mission of orchestrating the world’s compute capacity, making it universally accessible and useful. How we can help 👇
Tweet media one
3
8
128
2
1
48
@siddrrsh
Siddharth Sharma
4 months
We are releasing an early checkpoint of the model trained for 200 steps. We plan on releasing future models trained on a lot more tokens!
Tweet media one
3
3
44
@siddrrsh
Siddharth Sharma
4 months
Unlike OpenAI’s desktop app where you must provide a screenshot or upload a file, the context from your screen is automatically parsed. We also provide the ability to run secure local models like Gemma and Phi-3 multimodal from our interface. Due to the local model sizes, at
Tweet media one
4
5
43
@siddrrsh
Siddharth Sharma
1 year
We're changing how people discover, understand, and share AI research. 🪄 Now with folders, bookmarks, chat-w-paper, highlight and explain, friends, inbox, and more, check out Cambrian 2.0: 🦕
2
7
40
@siddrrsh
Siddharth Sharma
3 months
We’d like to thank the folks at Meta for their work in ensuring open-source is here to stay. We also wanted to shout out the authors of LLaVA-UHD as our methods are directly inspired by their intuition when it comes to image splitting and prepending the latents to the text.
0
0
36
@siddrrsh
Siddharth Sharma
1 year
Super excited to join Lux part-time with @graceisford and the rest of the @Lux_Capital team! Looking forward to thinking about frontiers in AI/ML.
@wolfejosh
Josh Wolfe
1 year
1/ Some NEWS–– unveiling Lux 8 → ♾️… • Early-stage (from inception to expansion) • From founders first $100k to their last $100m • Lux’s largest ever fund ($1B+)—now with $5B+ AUM
63
35
679
3
3
31
@siddrrsh
Siddharth Sharma
11 months
. @maxaljadery and I authored a short textbook on RL to explain the concepts with our own style of pedagogy. We understand best by teaching and there's no better way to learn than trying to convey nuanced topics with as much simplicity as possible.
Tweet media one
2
2
32
@siddrrsh
Siddharth Sharma
4 months
Never take no for an answer. A lot of people want to help - you've just got to dial the phone.
2
3
31
@siddrrsh
Siddharth Sharma
4 months
We’re excited about the future of open-source models and would love to hear any thoughts and/or suggestions on how we can take long-context further. Shoutout to the Gemma team for their awesome work in building these open-source models! @OriolVinyalsML @clmt @JeffDean @koraykv
4
0
29
@siddrrsh
Siddharth Sharma
4 months
ambientGPT is open-source and we plan to integrate vllm and ollama to provide more extensive inference hosting abilities with our multimodal GUI. We also aim to release ambientGPT on the apple app store soon.
Tweet media one
1
1
28
@siddrrsh
Siddharth Sharma
11 months
Hey all, last weekend @maxaljadery and I built a rapid indexing and 80x+ faster visualization layer to understand how groups of neurons (features) activate and cluster based on the latest data from @AnthropicAI 's mechanistic interpretability research. 🧵
Tweet media one
1
2
30
@siddrrsh
Siddharth Sharma
3 months
ML research is the most "empirical" science to ever exist. we test things and when they work or look promising, we double click ... so a lot of major results are child nodes of a tree of trial and error
1
2
28
@siddrrsh
Siddharth Sharma
8 months
i love coding. it makes me think about the highways and mountain roads - someone constructed them with intention. in software, the same effect is just as wild - someone wrote the compiler, built the text editor, and designed the GUI. people building off one another is how the
2
1
27
@siddrrsh
Siddharth Sharma
10 months
Amazing news! @graceisford is the best mentor I could ever ask for. She's made my time at @Lux_Capital super memorable and full of amazing learnings. There's no one who deserves it more.
@wolfejosh
Josh Wolfe
10 months
Super proud of @Lux_Capital Partner @graceisford @Forbes 30 under 30...
Tweet media one
16
11
236
1
0
23
@siddrrsh
Siddharth Sharma
4 months
We need a young president. Young not just in age, but in ideas.
4
0
22
@siddrrsh
Siddharth Sharma
1 year
Excited to announce my article on sparsity for LLMs with @graceisford , @DannyCrichton for @Lux_Capital
1
4
21
@siddrrsh
Siddharth Sharma
6 months
The future is actually starting to feel like the future.
0
1
20
@siddrrsh
Siddharth Sharma
11 months
Having worked at @AWS , and with @AnthropicAI recently adding Claude to Bedrock, @maxaljadery and I built Python and typescript SDKs to interact with Anthropic’s models on AWS Bedrock. It makes it really easy to do all of the AWS auth and use Anthropic models in production. The
Tweet media one
1
1
18
@siddrrsh
Siddharth Sharma
4 months
The real final exam.
Tweet media one
0
0
16
@siddrrsh
Siddharth Sharma
4 months
That feeling when one of your heroes @JeffDean likes your tweet 🥲
Tweet media one
1
0
13
@siddrrsh
Siddharth Sharma
7 months
So many books, such little time.
2
0
15
@siddrrsh
Siddharth Sharma
4 months
Never been more optimistic. Global maxima of expectations for the future.
1
0
15
@siddrrsh
Siddharth Sharma
11 months
Hey folks - @maxaljadery and I are excited to launch : an open-source infrastructure for labeling multimodal data while enabling RLHF tagging and augmenting your existing training data at no cost.
1
2
14
@siddrrsh
Siddharth Sharma
5 months
Bullish NYC.
@graceisford
Grace Isford
5 months
Today I’m thrilled to announce @Lux_Capital 's NYC AI Directory & NYC AI Map - 2 resources for the burgeoning AI talent ecosystem READ MORE👇 NYC AI Directory: NYC AI Map:
30
53
362
0
2
14
@siddrrsh
Siddharth Sharma
1 year
For students, it's tricky to know what exactly you want to optimize for: my framework focuses on execution that comes from the heart
1
1
13
@siddrrsh
Siddharth Sharma
5 months
Obsidian is critical for me to operate.
@meowhib
meowhib
5 months
this is why i love @obsdmd
Tweet media one
47
129
2K
0
0
12
@siddrrsh
Siddharth Sharma
4 months
orchestration is the future. haven't seen such a sick project in a while.
@AkshGarg03
Aksh Garg
4 months
we're collecting project ideas for D3N to try out!! have interesting devin projects you want to try out? reply below and we'll select the most upvoted/interesting ideas to send to devin Bonus points if the ideas are naturally distributed or parallelizable
7
1
11
1
2
12
@siddrrsh
Siddharth Sharma
1 year
Hey @Stanford ... 🍁Fall course enrollment alert—We built an optimized tool to streamline class planning, matching interests & fulfilling WAYs requirements: 🌲
0
2
12
@siddrrsh
Siddharth Sharma
1 year
Midjourney is the closest thing to magic I've ever seen. What a time to be alive.
0
0
11
@siddrrsh
Siddharth Sharma
5 months
it's not the models that are getting RLHF'd - it's you that's getting RLHF'd 🤣
@trenchieW
trenchie
5 months
rich and powerful people will make us believe whatever they want us to think, and most people just don't seem to understand that
2
0
15
0
0
11
@siddrrsh
Siddharth Sharma
4 months
0
1
8
@siddrrsh
Siddharth Sharma
1 year
Great work is when you get in a flow state and genuinely enjoy the process of what you're creating, not when you fully pander to an institutional ranking or external arbiter.
@sherjilozair
Sherjil Ozair
1 year
It feels surreal that we're talking about academic conferences and submitting papers and bad reviewers in 2023. We're literally living in a sci-fi future. You could choose to do anything. But you're choosing to play a snobby cargo-cult game for fake internet points. It's an
13
8
157
0
1
10
@siddrrsh
Siddharth Sharma
1 year
@punwaiw @sofianeflarbi and I decided to give an upgrade by adding grade distributions 📕 ... check it out! 🧭
3
0
9
@siddrrsh
Siddharth Sharma
4 months
When someone asks me, how do you define happiness - this page is the first thing that comes to mind.
Tweet media one
2
0
9
@siddrrsh
Siddharth Sharma
6 months
Love to see people build on top of !
@ronaldmannak
Ronald Mannak
6 months
Run Apple MLX from your menu bar. Introducing Pico MLX Server, a graphical frontend to download and start multiple(!) AI models locally on your Mac. You can use it with any chat client you like (e.g. @PicoGPT ) that uses the OpenAI API standard.
Tweet media one
38
64
520
0
3
9
@siddrrsh
Siddharth Sharma
6 months
whatever this aesthetic is - we need to double down on it.
@luusssso
lusso
6 months
Cinematic bathrooms >>
Tweet media one
Tweet media two
Tweet media three
Tweet media four
3
321
4K
1
0
9
@siddrrsh
Siddharth Sharma
11 months
CS 229S content is wicked so far!
@Azaliamirh
Azalia Mirhoseini
11 months
I'm very excited to share that I've started as an Assistant Professor of Computer Science at Stanford University! My lab will focus on self-improving AI methodologies and systems. Looking forward to working with the next generation of researchers! ()
76
86
2K
1
0
9
@siddrrsh
Siddharth Sharma
2 years
Good lesson in software development today. Never have your product totally rely on third-party APIs/services. For Cardinal Compass, we happened to suffer from 3-4 hours of lost traffic on the day of course enrollment due to pinecone being down. Better be safe than sorry.
0
0
7
@siddrrsh
Siddharth Sharma
3 months
Great comeback from Sinner. The French crowd can’t stop this guy 💪
2
0
8
@siddrrsh
Siddharth Sharma
8 months
@EvanHillHB Missed kick against the rams is about to cost us the postseason
2
0
7
@siddrrsh
Siddharth Sharma
10 months
Presenting the greatest place on the planet to work.
@satyanadella
Satya Nadella
10 months
@sama I’m super excited to have you join as CEO of this new group, Sam, setting a new pace for innovation. We’ve learned a lot over the years about how to give founders and innovators space to build independent identities and cultures within Microsoft, including GitHub, Mojang Studios,
1K
3K
32K
0
0
8
@siddrrsh
Siddharth Sharma
5 months
bullish on companies where the founder is a beast at Mathcounts
@jtguibas
John Guibas
5 months
When I was in middle school I qualified for Nationals at MathCounts and I remember distinctly watching @0xShitTrader (CEO of Ellipsis), absolutely destroy in the Countdown round That was when I realized I was very very good at math, but I was not Eugene
22
10
155
1
0
8
@siddrrsh
Siddharth Sharma
4 months
Choosing the right problem is the hardest part usually.
@natfriedman
Nat Friedman
4 months
If you like solving problems, the good news is that you'll never run out. Either you fail to solve a problem, and need to keep trying, or you succeed, in which case you've created new problems that need solving.
30
46
581
1
0
6
@siddrrsh
Siddharth Sharma
5 months
What a gift to the foundation model community.
@karpathy
Andrej Karpathy
5 months
# explaining llm.c in layman terms Training Large Language Models (LLMs), like ChatGPT, involves a large amount of code and complexity. For example, a typical LLM training project might use the PyTorch deep learning library. PyTorch is quite complex because it implements a very
421
1K
10K
0
1
7
@siddrrsh
Siddharth Sharma
6 months
@josemorgado grigor is easily playing top 5 right level right now.
1
0
6
@siddrrsh
Siddharth Sharma
4 months
On running shoes are sick.
1
1
7
@siddrrsh
Siddharth Sharma
1 year
Novak is the greatest all-around player and winningest of all time. Rafa is the greatest clay court player and fighter of all time. Roger played the most beautiful tennis of all time with the most grass-court dominance. Novak is on top but all three are GOATs in their own right.
1
0
6
@siddrrsh
Siddharth Sharma
1 year
Wish I could be there IRL - regardless it's gonna be epic!!
0
0
6
@siddrrsh
Siddharth Sharma
4 months
Something's in the air .. #smol
Tweet media one
1
0
7
@siddrrsh
Siddharth Sharma
10 months
A sequel to the Social Network is writing itself as we speak.
Tweet media one
3
0
7
@siddrrsh
Siddharth Sharma
1 year
A mentor shared this article with me several months back ... the lessons and beauty of the article are unmatched for anyone who enjoys writing software:
0
1
7
@siddrrsh
Siddharth Sharma
1 year
The @Apple vision pro feels like what the early 2000s kids thought the future would look like.
2
1
6
@siddrrsh
Siddharth Sharma
1 year
Searching, discovering, and understanding the state-of-the-art in AI research has never been easier! Check out Cambrian's latest demo below 🦕
1
0
7
@siddrrsh
Siddharth Sharma
4 months
I love GPU pods
@AkshGarg03
Aksh Garg
4 months
1/ @SohamGovande , @jameszhou02 , @jzhou891 and I spent the weekend building PodPlex: A platform for distributed training & serverless inference at scale I'm very glad to say that we left $10,000 GPU credits richer and 36 hours of sleep poorer more details in 🧵
11
13
136
0
0
6
@siddrrsh
Siddharth Sharma
10 months
If someone could encode the entirety of YouTube, that would be the world's ultimate reflector of humanity: GPT-6 if you will. TLDR: Video data extends the richness of images by another dimension and YouTube's 1B+ videos enable this at an unprecedented scale.
1
0
7
@siddrrsh
Siddharth Sharma
4 months
something that is literally *quite big* is coming. stay tuned
2
0
7
@siddrrsh
Siddharth Sharma
4 months
The best indicator of something being important to you is when you think about it in the shower or in your commute to work.
2
0
6
@siddrrsh
Siddharth Sharma
8 months
to chase this feeling is the greatest privilege as a human
@gdb
Greg Brockman
8 months
building is fun; seeing what you've built get used by others is indescribable
111
449
4K
0
0
6
@siddrrsh
Siddharth Sharma
5 months
every day, i wish i had more RAM. it's one thing to be GPU-poor. it's another thing to be RAM-poor :(
@reach_vb
Vaibhav (VB) Srivastav
5 months
IT WORKS! Running Mixtral 8x22B with Transformers! 🔥 Running on a DGX (4x A100 - 80GB) with CPU offloading 🤯
14
54
419
0
0
6
@siddrrsh
Siddharth Sharma
1 year
Couldn't be a better time to read Bostrom's Superintelligence
Tweet media one
@sama
Sam Altman
1 year
something like an IAEA for advanced AI is worth considering, and the shape of the tech may make it feasible: (and to make this harder to willfully misinterpret: it's important that any such regulation not constrain AI below a high capability threshold)
231
240
1K
1
0
6
@siddrrsh
Siddharth Sharma
4 months
thanks to @mihiranan (mr. clutch) for his help with the demo once again!
1
0
5
@siddrrsh
Siddharth Sharma
1 year
@vercel and @rauchg doing software the way it should be done - so epic
@rauchg
Guillermo Rauch
1 year
We have a new version of Speed Insights coming. The key evolution: it's an intelligent "Kanban Board" of what pages you should be optimizing. … instead of maintaining TODO lists and issue trackers, let @vercel do it. Oh, and the issues "close" as data comes in in realtime 😁
Tweet media one
37
37
786
0
0
3
@siddrrsh
Siddharth Sharma
11 months
Unless you’re an incumbent provider (ex. AWS, Nvidia, Azure, etc.) or model builder/source who can pretrain at a scale that less than 8-10 companies can operate at (inflection, OAI, anthropic, etc.) - you are most likely ngmi
0
0
4
@siddrrsh
Siddharth Sharma
9 months
The most underrated startup paradigm in my eyes: Take an AWS service --> understand its customers and potential shortcomings --> make it as intuitive and straightforward as possible --> distribute/market and capture similar horizontal opportunities (consolidate)
3
0
5
@siddrrsh
Siddharth Sharma
2 years
Favorite Driver of all time: Ayrton Senna Driver I dislike: Max Verstappen Driver that grew on me: Charles Leclerc Most overrated Driver: Fernando Alonso Most underrated Driver: Nico Rosberg The GOAT of F1: Michael Schumacher
@63secs
alex
2 years
Favorite Driver of all time: Driver I dislike: Driver that grew on me: Most overrated Driver: Most underrated Driver: The GOAT of F1: Comment/quote with your answers
303
9
212
2
0
5
@siddrrsh
Siddharth Sharma
1 year
Excited to share the latest demo for Cambrian: ! 🦕
0
0
5
@siddrrsh
Siddharth Sharma
6 months
wait bro we did this??
@shashtikar
Shashank Ashtikar
6 months
Indeed very exciting! @ollama has a rival now!
1
1
5
1
0
5
@siddrrsh
Siddharth Sharma
5 months
Deep learning is unique in that the field has such high promise when customer-facing/in-production but we know so little about why certain optimizations create certain effects.
@dwarkesh_sp
Dwarkesh Patel
5 months
. @_sholtodouglas poses a challenge. In the spirit of @natfriedman (whose Vesuvius Challenge was solved by a listener of my podcast - @LukeFarritor ). Can you figure out what the experts in a Mixture of Experts model are each specialized in? "A wonderful research project to do:
18
42
485
0
0
5
@siddrrsh
Siddharth Sharma
7 months
Huge fan of @obsdmd - great interface and proves brutally optimized simplicity is what software should be centered around
@karpathy
Andrej Karpathy
7 months
Love letter to @obsdmd to which I very happily switched to for my personal notes. My primary interest in Obsidian is not even for note taking specifically, it is that Obsidian is around the state of the art of a philosophy of software and what it could be. - Your notes are
Tweet media one
386
931
9K
0
0
5
@siddrrsh
Siddharth Sharma
6 months
Wild that this is on Arxiv. LLM security is sure to be its own sector these days.
@_akhaliq
AK
6 months
Google announces Stealing Part of a Production Language Model We introduce the first model-stealing attack that extracts precise, nontrivial information from black-box production language models like OpenAI's ChatGPT or Google's PaLM-2. Specifically, our attack recovers the
Tweet media one
41
334
2K
1
0
5
@siddrrsh
Siddharth Sharma
1 year
Excited to share a revamp of Cambrian: Bookmarking, search, and QA for ML papers has never been easier. 🦕
Tweet media one
0
1
5
@siddrrsh
Siddharth Sharma
10 months
@zebulgar imagine if internal GPT-5 helped pushed the narrative when used in board discussions
0
0
5
@siddrrsh
Siddharth Sharma
4 months
Great to see a community developing around this! @ollama , @GroqInc , and @vllm_project integrations on the way 🫡
@siddrrsh
Siddharth Sharma
4 months
Introducing ambientGPT: an open-source and multimodal MacOS foundation model GUI Run GPT-4o and open-source models with full ambient knowledge of your screen. Foundation models have long been confined to the browser. With ambientGPT, your screen context is directly inferred as
32
90
587
2
1
4
@siddrrsh
Siddharth Sharma
1 year
First @tab_delete , now @itsandrewgao ... love to see Stanford students never settling for less than the truth.
@itsandrewgao
andrew gao
1 year
PART TWO of #PaperGate ! That #ChatGPT button "Regenerate response" often gets pasted into #Scientific Papers, via @gcabanac !! A 🧵 of #peerreviewed scientific publications from reputable publishers like @sciencedirect @IEEEorg @ElsevierConnect First up, a paper on solar
Tweet media one
18
112
456
0
0
5
@siddrrsh
Siddharth Sharma
4 months
Excellent writing on decentralized/distributed training. Exciting times ahead!
@AkshGarg03
Aksh Garg
4 months
1/ As promised, here's my thesis on the future of decentralized training of foundation models. Covers: 1) why decentralized makes sense from scaling, margins, and marketplace lenses 2) challenges 3) exciting enabling research shifts In long form at:
9
19
171
0
0
5
@siddrrsh
Siddharth Sharma
10 months
RIP everything that doesn’t have a data/context moat or distribution 🙏
0
0
5
@siddrrsh
Siddharth Sharma
6 months
Uncertainty is a privilege.
0
0
5
@siddrrsh
Siddharth Sharma
4 months
0
0
5
@siddrrsh
Siddharth Sharma
9 months
Incentives rule the world. Kudos to @AravSrinivas and @perplexity_ai
@AravSrinivas
Aravind Srinivas
9 months
A lot of people ask about : why do we exist? Google would just eat you and do the same. How dare you think you have better engineers than Google? Well, we’ve never ever said we have better engineering. I’m a fan of every OG Googler and the founders. The
Tweet media one
Tweet media two
85
97
1K
0
0
4
@siddrrsh
Siddharth Sharma
1 year
If you use it correctly, chatgpt is a copilot for your entire life.
0
0
5
@siddrrsh
Siddharth Sharma
2 years
@punwaiw @OpenAI @pinecone Over 8,000 requests/site hits in less than 2 hours on launch 🎊
1
0
4
@siddrrsh
Siddharth Sharma
5 months
I am speed.
0
0
4
@siddrrsh
Siddharth Sharma
4 months
@aadityabuilds Welcome to the farm! Writing is the best :)
0
0
4
@siddrrsh
Siddharth Sharma
2 years
now at 16,000 requests in 5 hours!
0
0
4
@siddrrsh
Siddharth Sharma
6 months
@michelleqin_ greatest place on earth.
0
0
4