Sanyam Bhutani

@bhutanisanyam1

Followers
38K
Following
11K
Media
950
Statuses
8K

👨‍💻 Working on llama models @AIatMeta | Previously: @h2oai, @weights_biases 🎙 Podcast @ctdsshow 👨‍🎓 Fellow @fastdotai 🎲 Grandmaster @Kaggle

Menlo Park, CA
Joined October 2016
@bhutanisanyam1
Sanyam Bhutani
3 years
This is the best week of my life 🙏. ✅ Reached Kaggle Grandmaster tier. ✅ My ML Hero & Guru: @jeremyphoward was kind enough to host me for an interview about my journey. I promise to continue creating ML content to the best of my ability & sincerely take up competitions next 🍵.
@jeremyphoward
Jeremy Howard
3 years
This week I'm filling in for regular "chai time data science" podcast host @bhutanisanyam1, with a very special interview with a recently-anointed Kaggle grandmaster.
17
12
284
@bhutanisanyam1
Sanyam Bhutani
2 years
“Transformers from scratch” by Brandon Rohrer 🤖. This is one of the best write-ups: it starts from 0 and explains every single detail of the model architecture. Whether or not you need a refresher, I would still highly recommend reading it:
Tweet media one
37
540
2K
@bhutanisanyam1
Sanyam Bhutani
2 years
Easily the best paper on the current state of LLMs! 🙏. A 50 page read but it’s not “just another” survey paper that only documents facts. The authors actually add very useful commentary capturing all aspects of building Large Language Models. Hence the result is a collection of
Tweet media one
39
316
2K
@bhutanisanyam1
Sanyam Bhutani
7 months
Extremely excited to start working on the llama community officially! I’ve joined @Meta and will be working in an absolute dream of contributing to the llama community among many other cool things. So how and why did I get here? The price to big dreams is paid in units of
Tweet media one
148
26
2K
@bhutanisanyam1
Sanyam Bhutani
3 years
An absolute masterclass by the world's top data scientists 🙏. The awesome Kaggle Grandmaster Team at NVIDIA shares their winning tips and tricks in this series:
Tweet media one
5
333
2K
@bhutanisanyam1
Sanyam Bhutani
6 months
Life update: I have moved to the Bay Area to work @Meta HQ! 🙏. The flight from India takes a day but my journey was 2 years to get to Silicon Valley: In 2022, @jeremyphoward gave me a piece of advice that took over my mind: “You should live in Bay Area for a while if you want to meet some
Tweet media one
61
41
2K
@bhutanisanyam1
Sanyam Bhutani
4 months
NotebookLlama: An Open Source version of NotebookLM 🙏. A complete tutorial on building a PDF to Podcast flow using Llama:
- 1B to pre-process PDF
- 70B to convert it to a podcast transcript
- 8B to make it more dramatic
- Parler and Suno models for TTS
Tweet media one
31
282
2K
@bhutanisanyam1
Sanyam Bhutani
5 years
Google Colab now has a subscription model for power users:
- Faster GPUs
- Longer runtimes
- More memory
It's $9.99/month. I know many power users might enjoy it:
23
369
1K
@bhutanisanyam1
Sanyam Bhutani
2 years
How to become an expert at anything 🙏. I rediscovered this gem by @karpathy in my bookmarks today
Tweet media one
21
153
1K
@bhutanisanyam1
Sanyam Bhutani
1 year
My favourite LLM paper is finally open source! 🙏. Running a single Large Language Model agent is easy. Running multiple is hard. Running multiple over days of sustained interactions is really hard. I’ve spent the last 3 days reading through the code of the paper that solved
Tweet media one
13
200
1K
@bhutanisanyam1
Sanyam Bhutani
2 years
This is the best resource to get started in NLP in 2023 🙏. In 2 days, I will be kicking off a weekly study group to learn with everyone: @_lewtun will kindly join us for an opening AMA.
Tweet media one
18
118
942
@bhutanisanyam1
Sanyam Bhutani
1 year
CS 324 Notes are an LLM Book! 🙏. The Large Language Model course notes are a crispy book covering the foundations. Perfectly structured like an onion, starting at an overview & then levelling up. Its Mixture of Experts section is one of the best:
Tweet media one
9
224
908
@bhutanisanyam1
Sanyam Bhutani
2 years
The best summary of Transformers and their evolution 🙏. I found @giffmana’s slides from 2022 to be the best “pocket reference” on the topic. Many posts have covered Transformers; however, this one also covers the state of the field before and how it got adopted to different domains.
Tweet media one
11
155
888
@bhutanisanyam1
Sanyam Bhutani
2 years
Best tutorial on setting up LLMs locally! 🙏. @Rob_Mulla made an end-to-end video teaching how to install, run with GUI and connect a Large Language Model to your own data on your own machine. All open source, running offline:
Tweet media one
13
160
881
@bhutanisanyam1
Sanyam Bhutani
2 years
Watching “State of GPT” by @karpathy is the best 40 minutes you will spend this week 🙏. I actually found it really helpful for filling a lot of my knowledge gaps:
- Comparisons against human brain and LLM brain
- Why prompting works and why it is helpful to ask a model to “be
Tweet media one
17
120
817
@bhutanisanyam1
Sanyam Bhutani
2 years
Officially wrapping up the @kaggle Top Solutions Series 🙏. I’ve hosted over 25 videos sharing and explaining the tricks and secrets of Kagglers for all domains of machine learning. The series is quite complete & I’m graduating to more challenges:
Tweet media one
13
172
809
@bhutanisanyam1
Sanyam Bhutani
2 years
“Pretend you’re an Indian parent” 😂
Tweet media one
33
71
787
@bhutanisanyam1
Sanyam Bhutani
2 years
The best tutorials on building LLM powered applications 📚. @GregKamradt is an incredible teacher of @LangChainAI:
✅ Top down & applied series
✅ Amazing teaching style
✅ Very practical examples
Tweet media one
20
155
777
@bhutanisanyam1
Sanyam Bhutani
2 years
Papers I’ve read in the last month! 🙏. I’m currently writing an LLM roadmap along with my notes. If anyone is interested in reviewing and providing early feedback, please reach out!
Tweet media one
82
31
751
@bhutanisanyam1
Sanyam Bhutani
5 years
MAJOR personal update: I’ll be starting my work in a full time role @h2oai today as a Machine Learning Engineer and AI Content Creator! I’m really excited to be a part of a team of many of my “ML Heroes” and THE best kagglers. Recap on my ML journey:
72
59
728
@bhutanisanyam1
Sanyam Bhutani
2 years
Arxiv Chat: Chat w the latest papers 🙏. I made a really simple demo that makes it easy for me to understand the latest papers. The whole app is <100 lines of code:
✅ @LangChainAI for the main logic
✅ @h2oai Wave for the UI
✅ ChatGPT for asking Qs
27
91
716
@bhutanisanyam1
Sanyam Bhutani
1 year
Implementing LLaMA from scratch! 🙏. This implements an LLM in the style that Karpathy implemented nanoGPT. Even though it focuses on LLaMA-1, it’s a refreshing code first read. Perfect for a Sunday crispy read:
Tweet media one
5
157
714
@bhutanisanyam1
Sanyam Bhutani
5 years
I'll say this out loud since no one does. I studied CS at college; it didn't make me a better programmer. Practising Programming makes you a better programmer, not studying it. If you're starting your ML Journey, trust me, a "CS background" won't be as helpful as practising code.
35
86
700
@bhutanisanyam1
Sanyam Bhutani
2 years
Run 13B model on an iPhone! 🤯. Just finished reading @Tim_Dettmers’ amazing work on SpQR. SpQR unlocks 3.35 bit quantisation which lets us run 33B models on 3090s and 13B models on an iPhone. Here are my notes from the paper:
- Quantisation is basically like compressing the
Tweet media one
29
111
688
@bhutanisanyam1
Sanyam Bhutani
1 year
Insanely detailed notes on Training LLMs! 🙏. @StasBekman has shared his field notes on training foundational models. These are insanely detailed, cover a lot of gotchas and caveats. A crispy read with well documented code, in depth discussions:
Tweet media one
6
127
682
@bhutanisanyam1
Sanyam Bhutani
3 years
Finally got my @kaggle Grandmaster hoodie! 🙏🍵
Tweet media one
18
5
668
@bhutanisanyam1
Sanyam Bhutani
4 years
If you're looking for the best resources to prepare for ML interviews in 2020, here's a wiki from the @fastdotai forums: Also, all contributions are welcome!
Tweet media one
6
154
670
@bhutanisanyam1
Sanyam Bhutani
6 years
"Start by learning the basics really well […] Most advanced research projects require you to be excellent at the basics […] @AndrewYNg always told me to work on thorough mastery of these basics". Read the complete interview w @goodfellow_ian @hackernoon:
4
184
643
@bhutanisanyam1
Sanyam Bhutani
2 years
NLP for absolute beginners 🙏. @jeremyphoward kindly shared the Stanford materials, which are an incredibly high signal resource for NLP. Here’s another tutorial teaching you the absolute NLP basics up to how to make a submission on Kaggle, by Jeremy himself!
Tweet media one
3
139
638
@bhutanisanyam1
Sanyam Bhutani
2 years
We can now train a 7B model from scratch on a single GPU! 🤯. DeepSpeed Chat: a framework offering insane optimisations and speed ups for training RLHF models:
✅ Efficient and more affordable
✅ Insane Scalability
✅ Easy to use scripts
Tweet media one
9
115
605
@bhutanisanyam1
Sanyam Bhutani
1 year
Simulating a software company with LLMs! 🚀. Remember the 25 agents living in a simulation? This does the same but for a software company. ChatDev asks the questions around effectively getting Large Language Model agents to collaborate on writing entire code bases:
- Writing a
30
120
612
@bhutanisanyam1
Sanyam Bhutani
3 years
I'm tea-ry eyed. I can't believe this :'). I've reached the @kaggle Grandmaster tier today! Thank you so much everyone! 🍵. My sincerest gratitude to @jeremyphoward for introducing me to Kaggle and to @vopani for pushing me to pursue it! 🙏.
63
15
597
@bhutanisanyam1
Sanyam Bhutani
3 years
The kind people @NVIDIAAI sent a welcome gift 🙏
Tweet media one
20
11
571
@bhutanisanyam1
Sanyam Bhutani
1 year
The definitive guide to RAG in production! 🙏. @GokuMohandas walks us through implementing RAG from scratch, building a scalable app. It now has updated discussion on embedding fine-tuning, re-ranking and effectively routing requests. I think this is easily the most complete
Tweet media one
12
91
568
@bhutanisanyam1
Sanyam Bhutani
2 years
Outperforming LLMs with 2000x smaller models! 🚀. “Distilling Step-by-Step!” is an incredible paper showcasing the promise of using CoT prompting with LLMs to generate steps of logical thinking and high quality labels that can produce great smaller models:
✅ Outperforms both
Tweet media one
8
115
557
@bhutanisanyam1
Sanyam Bhutani
1 year
My favourite LLM blogs! 🙏. For your weekend learning, here’s an opinionated list of my favourite Large Language Model educators. Pick any or all of their articles and read them cover to cover
Tweet media one
16
82
557
@bhutanisanyam1
Sanyam Bhutani
2 years
The Deep Learning book study group 🙏. Starting this Saturday, we will be going through the Bible for understanding the basics of DL 📚.
Tweet media one
12
51
539
@bhutanisanyam1
Sanyam Bhutani
2 years
Combining Knowledge Graphs and LLMs! 🙏. To my surprise, this paper is extremely detailed around training strategies of building such models and goes beyond “just prompting” ChatGPT and GPT-4. In fact, I realised after reading that it doesn’t even mention these models in most of the
Tweet media one
22
79
538
@bhutanisanyam1
Sanyam Bhutani
4 years
This is just surreal! I just won the @hackernoon Contributor of the year award for 2 categories! 🙏🍵
- Machine Learning
- Tutorial
Tweet media one
Tweet media two
43
30
535
@bhutanisanyam1
Sanyam Bhutani
1 year
CS 25 has a great roadmap of LLM papers! 🙏. Transformers United has great guest lectures spanning the foundations of Large Language Models. An underrated aspect of the course is the curated list of papers on every topic. Perfect for your weekend reads:
Tweet media one
8
121
538
@bhutanisanyam1
Sanyam Bhutani
3 years
I finally got my copy of Deep Learning w Python by @fchollet! 📚. I couldn't be more excited about this 🙏. I'll be starting a reading group on Jan 8, and Francois has kindly agreed to join for an AMA! 🍵. Please send your Qs around Keras/the book as here! Links TBD Soon!
Tweet media one
22
28
511
@bhutanisanyam1
Sanyam Bhutani
1 year
A 50 page book on LLM Agents! 🙏. This is my new favourite survey paper. It reads like a perfect book on why we need different techniques to make Large Language Model agents work and how different papers approached it.
Tweet media one
9
95
519
@bhutanisanyam1
Sanyam Bhutani
2 years
Easily one of the biggest announcements for DL! 🙏. @fchollet announced Keras 3.0, a complete re-write making Keras the front end for TF, JAX and PyTorch. This means some amazing things. My mentor and core contributor @A_K_Nain gave me a rundown:
- Unified framework: This is
Tweet media one
11
83
514
@bhutanisanyam1
Sanyam Bhutani
2 years
The most detailed and practical write up on applying LLMs! 🙏. This reads like a survey paper but written for the industry and applications. @eugeneyan is known as the best NLP writer for a reason. It’s the most comprehensive overview of patterns on building Large Language Models
Tweet media one
7
96
500
@bhutanisanyam1
Sanyam Bhutani
2 years
CS 25: Transformers United! 🦾. One of the best courses covering concepts of Transformers in the context of LLMs along with applications and secrets to building these models. My favourite part is the guest lectures from the best of our field!
Tweet media one
7
108
487
@bhutanisanyam1
Sanyam Bhutani
2 years
After all the travelling, I’ve invested my remaining savings into more 3090 GPUs for Kaggle 🙏
Tweet media one
30
13
487
@bhutanisanyam1
Sanyam Bhutani
2 years
I finally met my ML hero who’s taught me everything: @jeremyphoward 🙏
Tweet media one
12
8
489
@bhutanisanyam1
Sanyam Bhutani
2 years
A masterpiece on applying LLM agents! 🙏. MetaGPT paper is a golden treat on effectively applying Large Language Model agents. It takes inspiration from how humans work. Here’s my summary:
- Assembly line: Every agent has a role assigned to it.
- Software Engineering: The above
Tweet media one
11
84
479
@bhutanisanyam1
Sanyam Bhutani
6 years
Personal Update Thread on The @GoogleAI Residency: Earlier this year I got a life-changing email. My Google AI Residency application had made it to the final interview rounds! This spring, Google flew me out to NYC where I gave my "On-site" interviews.
15
44
455
@bhutanisanyam1
Sanyam Bhutani
2 years
Annotated PyTorch papers! 🔥. @labmlai has the best resources for learning how to really implement ideas. This is a no-nonsense website with a side-by-side implementation of papers in @PyTorch. The transformers section is my fav:
Tweet media one
9
123
443
@bhutanisanyam1
Sanyam Bhutani
5 years
"Machine Learning doesn’t have to be a black box anymore. What use is a good model if we cannot explain the results to others. Interpretability is as important as creating a model." A neat kernel on "Interpreting Machine Learning models" by @pandeyparul.
5
99
434
@bhutanisanyam1
Sanyam Bhutani
4 years
I'm super excited to share that I've joined @weights_biases! 🍵. I've been a fan of their community since the early days, I'm really looking forward to contributing to it further. Please expect study groups, events, Kaggle deep dives, and much more! 🙏
55
20
435
@bhutanisanyam1
Sanyam Bhutani
5 years
The mindset of "Completing an online course" isn't right: It's not a college degree, so hacking your way to completion shouldn't be the goal and def won't be helpful. Take your time, even build a project midway: Gaining knowledge & building Projects/Solving Prob should be the goal!
12
54
429
@bhutanisanyam1
Sanyam Bhutani
2 years
Incredible recap of key Transformer concepts! 🙏. What I really like about this write up is it covers 30 key papers and flows really well as a recap. @lilianweng has written so many incredible posts, this one captures all key architectural concepts:
- Transformer basics:
Tweet media one
7
73
430
@bhutanisanyam1
Sanyam Bhutani
6 years
Great tutorial on Deploying Deep Learning Models On Web And Mobile (Along with a working demo!) by @reshamas and Nidhin P. They've used the library, but the tutorial can be used to create a web and mobile app using any framework.
1
104
424
@bhutanisanyam1
Sanyam Bhutani
1 year
The most underrated LLM Cookbook! 🙏. @OpenAI’s guide is an incredibly underrated resource. My favourite bit is the practical advice and guides sprinkled throughout the examples. It also has the highest quality code of many learning resources. The examples cover all important
Tweet media one
4
80
417
@bhutanisanyam1
Sanyam Bhutani
2 years
GitHub GPT: Understand any repository! 🚀. Here is a demo where I played with connecting GPT-4 to any repository. The main logic is <20 lines of code:
✅ @LangChainAI for the main logic
✅ @activeloopai for storing embeddings
✅ Simple App that runs in the terminal
14
60
418
@bhutanisanyam1
Sanyam Bhutani
2 years
I’m in happy tears to be awarded the “Top GenAI Scientist” award by @AnalyticsVidhya 🙏. I feel really honoured by the recognition. Will make this one count!
Tweet media one
31
9
415
@bhutanisanyam1
Sanyam Bhutani
2 years
A truly open source assistant chatbot: GPT4All-J 🙏. A new model was shipped to the GPT4All family. This one permits commercial usage and is completely open source:
✅ Model weights
✅ Training logs
✅ Training dataset
6
93
402
@bhutanisanyam1
Sanyam Bhutani
1 year
Another great roadmap of LLM papers! 👌. CS224n has a really good curated list of papers to read for Large Language Models. I would recommend starting with the papers before the slides and lectures:
Tweet media one
1
70
401
@bhutanisanyam1
Sanyam Bhutani
2 years
The NLP study group is back after a break 🙏. Today, I’ll explain and summarise the BloombergGPT paper. This was an incredible read since the authors have kindly shared a fair bit of model details along with the reasons for their architectural choices:
Tweet media one
5
46
403
@bhutanisanyam1
Sanyam Bhutani
11 months
A perfect intro to open source LLMs! 🙏. The course by @asangani7 is now my top recommendation for getting started with Large Language Models:
- Just enough theory for a whole picture.
- Teaches prompting, special tokens and conversational agents.
- Perfectly abstracts the
Tweet media one
0
52
332
@bhutanisanyam1
Sanyam Bhutani
1 year
I’m writing a guide on building Multi-GPU machines! 🙏. Over the past few years, I’ve spent a lot of time learning how to build ML servers. I’ve decided to write a guide on the topic. What questions/topics would you want covered? TIA!
Tweet media one
57
37
386
@bhutanisanyam1
Sanyam Bhutani
1 year
Efficient Deep Learning course! 👌. The lectures cover various techniques relevant to LLMs. Happy Sunday learning:
Tweet media one
1
55
387
@bhutanisanyam1
Sanyam Bhutani
1 year
AutoAgents: Autonomously generate LLM agents for any goal! 🤖. This tries to solve the need for strong prompting and role definition by autogenerating agents. The code is sparsely documented but readable:
11
82
383
@bhutanisanyam1
Sanyam Bhutani
1 year
The best NLP lectures! 🙏. @chrmanning’s latest CS224n lectures are finally live! The 14 hours of new content covers Large Language Models, Interpretability, and some crispy framework tutorials:
Tweet media one
4
76
384
@bhutanisanyam1
Sanyam Bhutani
3 years
My next goal: I will spend at least 500 hours this year competing on @kaggle 🍵. If I fail to do it, I will not drink chai for an entire year and give away all my GPUs 🙏.
43
11
377
@bhutanisanyam1
Sanyam Bhutani
2 years
Studying the LLaMA paper at Little LLaMA Café 😋🙏
Tweet media one
17
7
377
@bhutanisanyam1
Sanyam Bhutani
2 years
The definitive guide to Multimodal deep learning! 🙏. Since the GPT-4 demo, multimodal has become one of the coolest domains in our field. This is a 240 page no-nonsense book on the domain; it starts from the basics of individual modalities and goes up to the key details of the domain.
Tweet media one
5
72
377
@bhutanisanyam1
Sanyam Bhutani
6 years
The Interview with @kaggle Grandmaster and Senior CV Engineer @LyftLevel5: Vladimir Iglovikov @viglovikov just got published @hackernoon. The Grandmaster has really been kind enough to share *ALL* of his secrets, you can find all of them here:
8
86
367
@bhutanisanyam1
Sanyam Bhutani
2 years
Very practical course on applying LLMs! 🙏. @HamelHusain had mentioned that langchain makes for a great cookbook of cutting edge ideas. This course is a refreshingly applied one teaching how to use @LangChainAI to build different applications. My favourite part is it’s
Tweet media one
6
59
360
@bhutanisanyam1
Sanyam Bhutani
1 year
The most comprehensive series I’ve read on Vector databases! 💾. Most of us got exposed to vector dbs via Langchain or llamaindex documentation. However, there’s a lot of nuance and options to select from when building Large Language Model apps. @tech_optimist has written a 4
Tweet media one
9
58
362
@bhutanisanyam1
Sanyam Bhutani
2 years
If you’re interested in diving into ML research & understanding more papers this year: The Deep Learning book is an incredible resource teaching the basics & math behind DL. I’ve created a 5 part series explaining chapters here:
Tweet media one
4
54
360
@bhutanisanyam1
Sanyam Bhutani
2 years
Terrific tutorial on fine-tuning LLMs to your own data 👨‍🔬. Tomas Bratnic has shared a really crispy write up on creating a Cypher generating LLM:
✅ All Open Source tools
✅ Walkthrough of setup
✅ Detailed steps on how to solve this @h2oai LLMStudio.
5
65
355
@bhutanisanyam1
Sanyam Bhutani
1 year
Masterclass of Pythonic Thinking: PyTudes 🤌. It's the highest quality resource for learning “the Pythonic way” and problem solving. The large number of problems caters to everyone at all levels. Every revisit, there’s something new to learn.
Tweet media one
3
72
352
@bhutanisanyam1
Sanyam Bhutani
3 years
Today is a glorious day for the @kaggle community! 🍵. Kaggle legend: @sudalairajkumar has conquered all categories and become the newest 4x Grandmaster! 🙏
Tweet media one
11
19
352
@bhutanisanyam1
Sanyam Bhutani
2 years
“A cookbook of Self-Supervised Learning” @ylecun et al 👩‍🍳👨‍🍳. SSL is the tasty sauce behind a lot of the success in Language models, Computer Vision and beyond. It permits working with limited data by allowing you to include unlabelled data in your workflow. Hence becoming “the
Tweet media one
5
63
336
@bhutanisanyam1
Sanyam Bhutani
3 years
Weekly @kaggle Top Solutions Study group 🙏. Starting this Sunday, I will be going through top solutions of recently ended competitions that might be relevant to the ongoing ones:
Tweet media one
4
58
325
@bhutanisanyam1
Sanyam Bhutani
2 years
NLP with Transformers Study Group 🤗. Starting next week, I’m hosting a study group on the absolute gem book by @huggingface team 🙏. @_lewtun has kindly agreed to join the kickoff session. I can’t think of a better way to learn NLP:
Tweet media one
10
48
329
@bhutanisanyam1
Sanyam Bhutani
1 year
This really helped me understand why LLMs work! 🙏.
- Why next word prediction is powerful.
- Why prompting works.
- What we know about emergence.
Thanks @_jasonwei for the gem:
Tweet media one
1
39
326
@bhutanisanyam1
Sanyam Bhutani
2 years
Am I doing @karpathy and chill right? Cafe overlooking Himalayas, tasty breakfast and lecture rewatch 😋🙏
Tweet media one
19
2
316
@bhutanisanyam1
Sanyam Bhutani
5 years
This is THE BEST CAREER ADVICE for Data Science that I’ve ever read: IMO @kaggle forums often have write ups/advice of *much* higher quality than most of the blogposts out there. I’d highly recommend reading all of @ryan_chesler’s write ups on Kaggle.
0
67
317
@bhutanisanyam1
Sanyam Bhutani
2 years
A hands on guide to train LLaMA with RLHF 🤗. It’s one of the most complete tutorials on the topic with detailed explanations around why and how to follow the fine-tuning approaches.
2
76
319
@bhutanisanyam1
Sanyam Bhutani
2 years
The first open source Financial LLM! 🚀. BloombergGPT was the first proprietary financial LLM. This week, we witness the first open source one. FinLLM takes a “data centric” approach towards finance by building on top of multiple APIs/resources. Here’s an overview of its
Tweet media one
4
46
313
@bhutanisanyam1
Sanyam Bhutani
4 years
The next 30 days, every day w/o exception, I will:
- Wake up at 4 AM
- Workout for 2 hr
- Kaggle for 3 hr
- Move onto other tasks after 10 AM
Extra rules:
- No 📱 before 5 PM
- No Emails before 12 PM
I'll post a video announcing the micro-resolution tom
22
15
316
@bhutanisanyam1
Sanyam Bhutani
3 years
🧵 Top Kaggle solutions always feature many great insights and hidden details: I spent the past two days reading top solutions from the recently ended Great Barrier reef competition. There were many fascinating tricks shared, my short summary. TL;DR 👇.
4
49
309
@bhutanisanyam1
Sanyam Bhutani
2 years
This is the best Prompt Engineering guide 🙏. @omarsar0 and team have kindly been curating a very extensive and complete guide on the topic. Like anything covered by @dair_ai, it’s really high quality:
✅ Introduction & Basics
✅ Zero-Shot, Few-Shot
✅ Chain of Thought
✅ ReAct
14
55
315
@bhutanisanyam1
Sanyam Bhutani
1 year
Last year, @HamelHusain (re) taught me a super power 🙏. “If you sincerely spend 3 hours everyday learning a new topic. In 6 months, you’ll be really far ahead”. This is something we learned in @fastdotai that Hamel reminded me when I expressed my imposter scare of LLMs
Tweet media one
6
41
312
@bhutanisanyam1
Sanyam Bhutani
2 years
My 15 day LLM Study Vacation! 🚀. The plan for next 2 weeks:
✅ Hike/Visit a Himalayan mountain daily with a paper to read
✅ Build 15 @LangChainAI apps
✅ Finish catching up on LLM research
Tweet media one
21
12
310
@bhutanisanyam1
Sanyam Bhutani
4 years
I just sat for 5 minutes straight smiling and trying not to tear up after today's interview. It's done. I was able to record enough interviews to complete my dream goal of publishing 2 episodes, every Sunday and Thursday, at 9AM PT. No exceptions in 2020.
Tweet media one
20
9
310
@bhutanisanyam1
Sanyam Bhutani
1 year
An extremely crispy intro to Vector Databases! 🙏. Have you watched those Wired videos explaining concepts at incremental levels of detail? @helloiamleonie has done the same for Vector DBs. She teaches the topic using the Feynman technique, in 3 levels of detail.
Tweet media one
5
52
302
@bhutanisanyam1
Sanyam Bhutani
1 year
The hardest challenge I’ve done! 🙏. Last week, I completed 200 days of writing every day about Large Language Models. After ~2k hours of learning, I’m ready to make crispy videos:
Tweet media one
17
15
303
@bhutanisanyam1
Sanyam Bhutani
5 years
I'm really excited to share the interview with my and @fastdotai family's greatest ML Hero: @jeremyphoward. Not adding any Tweet introductions this time 🍵. Audio: Show Notes: Video:
11
59
302
@bhutanisanyam1
Sanyam Bhutani
4 years
If you're looking for a code first NLP course 👨‍💻. There is an NLP course by @fastdotai covering 🕵️‍♂️:
- What is NLP
- Topic Modelling
- Sentiment Classification
- Regex
- LM
- RNNs
- Transformers
- Bias & Ethics
Blog: YT Playlist:
5
63
294
@bhutanisanyam1
Sanyam Bhutani
1 year
Strong LLM blog recommendation! 🙏. For any practitioner interested in Large Language Models, this is the best blog. @eugeneyan is magical at combining industrial patterns, experiments & research ideas very clearly. His weekend exp are my fav read:
Tweet media one
3
39
301
@bhutanisanyam1
Sanyam Bhutani
1 year
This is my favourite type of tutorial! 🙏. Remember the awesome fastai tutorials that share only the necessary theory and quickly dive into applying it? @Sentdex teaches us QLoRA in the exact manner by applying it to give llama-2 more personality.
Tweet media one
6
42
300
@bhutanisanyam1
Sanyam Bhutani
2 years
150 interviews with ML heroes 🙏. I’ve actively hosted interviews with the best Kagglers, Researchers and practitioners from 2019-22. The questions discuss the guest’s journey and approach in a timeless and educational way:
Tweet media one
5
48
284
@bhutanisanyam1
Sanyam Bhutani
5 years
Huge congratulations to @pandeyparul on becoming the 1st Woman @kaggle Kernels Grandmaster from India. And to the best of my knowledge, 2nd one in the world. Although, I really hope she won't stop sharing her amazing kernels with us 🍵.
Tweet media one
4
22
284
@bhutanisanyam1
Sanyam Bhutani
3 years
2022 Goals:
1. Workout for 350 hours
2. Compete on @kaggle for 200 Hours
3. Write 3 Kaggle Kernels
4. Host 50 meetups & interviews @weights_biases
5. Spend 500 hours reading
6. Write 4 High-Quality @PyTorch blogposts
7. Publish 1 Open Source Repo
11
15
285
@bhutanisanyam1
Sanyam Bhutani
2 years
ControlNet + @PyTorch 🔥. Many thanks to @nischay_twt for tag-teaming 🙏
8
30
287