![Ayaan Naveed Malik Profile](https://pbs.twimg.com/profile_images/1880177527509176320/Iz_xJOeZ_x96.jpg)
Ayaan Naveed Malik
@ayaannmalik
Followers
153
Following
234
Statuses
290
@stanford | prev. @togethercompute
Stanford, CA
Joined November 2023
One of my life-goals is to be in a class taught by @karpathy. I really wish he was still teaching at Stanford. It'd be the one class in my undergrad career that I'd never miss.
0
0
3
@BeraDemirbilek i don’t know if windsurf is better on a technical level (better retrieval, edits, bug fixes) but i’ll be playing around with it for the next week
0
0
1
Probably gonna be a banger video. For context, @karpathy’s videos taught me more about LLMs than Stanford’s NLP class, CS 224N
New 3h31m video on YouTube: "Deep Dive into LLMs like ChatGPT"
This is a general audience deep dive into the Large Language Model (LLM) AI technology that powers ChatGPT and related products. It covers the full training stack of how the models are developed, along with mental models of how to think about their "psychology", and how to get the best use of them in practical applications.
We cover all the major stages:
1. pretraining: data, tokenization, Transformer neural network I/O and internals, inference, GPT-2 training example, Llama 3.1 base inference examples
2. supervised finetuning: conversations data, "LLM Psychology": hallucinations, tool use, knowledge/working memory, knowledge of self, models need tokens to think, spelling, jagged intelligence
3. reinforcement learning: practice makes perfect, DeepSeek-R1, AlphaGo, RLHF.
I designed this video for the "general audience" track of my videos, which I believe are accessible to most people, even without technical background. It should give you an intuitive understanding of the full training pipeline of LLMs like ChatGPT, with many examples along the way, and maybe some ways of thinking around current capabilities, where we are, and what's coming.
(Also, I have one "Intro to LLMs" video already from ~a year ago, but that is just a re-recording of a random talk, so I wanted to loop around and do a much more comprehensive version of this topic. They can still be combined, as the talk goes a lot deeper into other topics, e.g. LLM OS and LLM Security)
Hope it's fun & useful!
0
0
2
@itsandrewgao @LangChainAI @browserbasehq @ExaAILabs @firecrawl_dev @OpenAI takes 10 mins to make using Julep. Manages parallelism, @browserbasehq, search and everything in-between.
0
0
9