jack

@jack

Followers
6M
Following
35K
Statuses
29K

Joined March 2006
@jack
jack
5 months
We reject: kings, presidents, and voting. We believe in: rough consensus and running code. —David Clark, 1992
1K
2K
9K
@jack
jack
3 days
105
114
782
@jack
jack
5 days
this is excellent
@karpathy
Andrej Karpathy
7 days
New 3h31m video on YouTube: "Deep Dive into LLMs like ChatGPT" This is a general audience deep dive into the Large Language Model (LLM) AI technology that powers ChatGPT and related products. It covers the full training stack of how the models are developed, along with mental models of how to think about their "psychology", and how to get the best use of them in practical applications. We cover all the major stages: 1. pretraining: data, tokenization, Transformer neural network I/O and internals, inference, GPT-2 training example, Llama 3.1 base inference examples 2. supervised finetuning: conversations data, "LLM Psychology": hallucinations, tool use, knowledge/working memory, knowledge of self, models need tokens to think, spelling, jagged intelligence 3. reinforcement learning: practice makes perfect, DeepSeek-R1, AlphaGo, RLHF. I designed this video for the "general audience" track of my videos, which I believe are accessible to most people, even without technical background. It should give you an intuitive understanding of the full training pipeline of LLMs like ChatGPT, with many examples along the way, and maybe some ways of thinking around current capabilities, where we are, and what's coming. (Also, I have one "Intro to LLMs" video already from ~a year ago, but that is just a re-recording of a random talk, so I wanted to loop around and do a much more comprehensive version of this topic. They can still be combined, as the talk goes a lot deeper into other topics, e.g. LLM OS and LLM Security) Hope it's fun & useful!
147
210
3K
@jack
jack
5 days
RT @MistralAI: Le Chat is fast (1,100 tok/s for flash queries on an updated Mistral Large). Download it at or http…
0
379
0
@jack
jack
5 days
most people speak about 150 words per minute, and read 200 wpm. most type about 50 wpm and listen around 150 wpm. speech-to-text is the optimal interface that gets us closest to thinking velocity. yet our thinking is mostly limited and constrained by our language.
966
889
10K
@jack
jack
6 days
RT @Saboo_Shubham_: Software Engineering AI Agent on your machine connects with your apps and tools to automate engineering tasks. It can…
0
102
0
@jack
jack
7 days
yes
@LuminEthics
Lumin
13 days
@blockopensource 2/ Why it matters: The AI race has been dominated by centralized models with restricted access. Goose challenges that by enabling modular AI agents that can install, execute, edit, and test with any LLM, not just a select few.
82
69
607
@jack
jack
11 days
¡ allez bordeaux !
192
35
552
@jack
jack
12 days
RT @Snowden: It's quite simple, Senator: if you're more upset at the whistleblower than you are at the lawbreaking they revealed, you're no…
0
23K
0
@jack
jack
13 days
RT @Teknium1: This is the entire code needed to reproduce R1 lol Hundreds of Billions of Dollars Later
0
2K
0
@jack
jack
13 days
RT @Snowden: The Senate Intel Committee spent nearly the entirety of its session today furiously demanding that DNI nominee Gabbard condemn…
0
12K
0
@jack
jack
13 days
goose never died…and is #1 trending on github
196
190
2K
@jack
jack
13 days
@garrytan 💯
23
3
86
@jack
jack
13 days
pardon @Snowden & Assange
2K
6K
41K
@jack
jack
13 days
@lord_pixel_ @pmarca no. $xyz is block’s ticker on nyse
54
22
113
@jack
jack
14 days
400
243
2K
@jack
jack
14 days
487
299
3K
@jack
jack
14 days
you don’t need permission to build or use open source and open protocols. you must get permission to use anything and everything else. think about it.
870
1K
11K
@jack
jack
14 days
$xyz
2K
1K
8K
@jack
jack
14 days
collaborative comments are the death of documents
277
154
2K