Llama-3 based SQLCoder 8b is out! Open weights with a commercially friendly cc-by-sa license. Probably the best <10B param model for Postgres text to SQL right now.
Slightly better than gpt-4-turbo and claude opus for 0-shot text to SQL generation. Also approaches their
We just open-sourced SQL Coder, a 15B param text-to-SQL model that outperforms OpenAI's gpt-3.5! When fine-tuned on an individual schema, it outperforms gpt-4.
The model is small enough to run on a single A100 40GB in 16 bit floats, or on a single
We finally beat GPT-4 for SQL generation, after 3 months of trying! 🤓
SQLCoder now writes better Postgres SQL than GPT-4. Benchmarks aside, I'm amazed at how well it works even without fine-tuning.
When further fine-tuned on a particular schema, it's ridiculously good. We've
Welp, just finished training and evaluating CodeLlama-70B for SQL. This thing is a beast when fine-tuned.
Miles ahead of anything else (including GPT-4). Open-sourcing the weights either today or tomorrow!
We just open-sourced SQLCoder-70B! It outperforms all publicly accessible LLMs for Postgres text-to-SQL generation by a very wide margin.
SQLCoder is finetuned on
@AIatMeta
's CodeLlama-70B model (released yesterday), using fewer than 20,000 hand-curated prompt completion
Apple Silicon is seriously impressive!
Just got a refurbished M2 Max with 64GB RAM. It does ~50 tok/s on our q4 quantized 7b mistral fine-tune, with comparable speeds to GPT-4
Will run tests on our quantized 34B model soon 🤓
Been sitting on this for a while – we raised a $2.2M round led by
@ajhodls
and
@ycombinator
, with participation from some incredible angels (including
@dharmesh
, who I have looked up to for many years).
Excited to continue building – back to work now!
Running our new 7B model 100% locally on an M1 Mac 🤓
76% accuracy on sql-eval with GGUF. For reference, GPT-4 is 82.5% and SQLCoder-34B-v2 is at 85%
Pretty wild that this works locally *on a laptop*! Super excited about getting this on a Mac app soon.
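For anyone curious how a number like 76% comes about: eval accuracy is just the fraction of questions whose generated query matches the gold query's results. A minimal sketch with hypothetical query pairs (the real sql-eval harness executes queries against a database and compares result sets; the naive string comparison here is only a stand-in):

```python
# Minimal sketch of how an eval accuracy number is computed.
# Real harnesses compare *execution results*; we fake that with a
# whitespace/case-normalizing string comparison.

def accuracy(predictions, golds, match):
    """Fraction of predictions judged equivalent to their gold query."""
    assert len(predictions) == len(golds)
    hits = sum(1 for p, g in zip(predictions, golds) if match(p, g))
    return hits / len(predictions)

def naive_match(pred_sql, gold_sql):
    norm = lambda s: " ".join(s.lower().split())
    return norm(pred_sql) == norm(gold_sql)

# Hypothetical predictions vs gold queries: 2 of 3 match.
preds = ["SELECT id FROM users", "select name from  orders", "SELECT 1"]
golds = ["SELECT id FROM users", "SELECT name FROM orders", "SELECT 2"]
print(accuracy(preds, golds, naive_match))
```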
We just got our 15B parameter SQLCoder2 model to match (and *slightly* beat) GPT-4 for complex SQL generation on out-of-training-set schemas. Releasing the weights (hopefully) next week!
Our previous model – SQLCoder – beat GPT-3.5 but lagged behind GPT-4. The new model
We just open-sourced SQLCoder2 and SQLCoder-7B! They outperform GPT-4 when fine-tuned on a specific database schema, and outperform GPT-3.5 on out-of-training-set schemas
SQLCoder2 is a 15B parameter model that uses the excellent Starcoder model by
@BigCodeProject
as a base
We now have a 15B parameter text=>SQL model that outperforms gpt-3 and is competitive with gpt-3.5-turbo!
Will open-source our evals framework and the model weights later this month.
Been a hermit the last month and a half as we tried to get this up and running! Worth it! :D
Incredibly excited to launch Agents today! Agents automate complex, repetitive work in SQL, Python and R - all while keeping data scientists in the loop for feedback and clarification.
We built Agents to help with the many tedious trial-and-error tasks involved in statistical
Launching the second generation of SQLCoder-7b on
@huggingface
today!
This is distilled from our 70B model, and performs around as well* as GPT-4 for text-to-SQL generation. Finetuned on
@AIatMeta
's CodeLlama-7b.
*To be more precise – this model is much better at ratios and
Since social media mostly has wins: got rejected by
@ycombinator
. But onwards and upwards!
@narratives_data
is a search+collaboration engine to help creators analyse the world better. As every co becomes a media co, this’ll become a massive market that few are building for rn
Happy Diwali! You will hear tons of emotional chatter about air quality today.
I have been tracking air quality in India for 6 years, and it’s annoying to see the debate reduced to “are crackers the main cause of pollution”. Here’s some nuance
The
@ycombinator
speed multiplier is real! Have shipped a major feature every day since the batch started, and I'll have more commits in the first 6 weeks of 2023 than in all of 2022 🤯
User feedback and fast iterations create a pavlovian loop that's all kinds of awesome!
Phew. Just pitched
@defogdata
at
@ycombinator
's demo day. My heart rate was at 112 bpm 😅
YC has been amazing. Excited about going back to building product (and a team) now!
Two big updates today!
1) We updated the weights for sqlcoder-7b-2, and it now outperforms GPT-4 for most SQL queries – especially if you give it the right instructions and prompt well
@huggingface
link here:
2) We've added basic instruction following
I made an Indian name to ethnicity, affluence, gender, and age classifier! All data used in making this was obtained from publicly available sources, and cleaning it was a total pain. Hope I never have to parse PDFs again :/
Won the GPU lottery and got an 8x H100 SXM on runpod. I don't think I can ever go back
A run that was going to take 8 hours on 4x RTX 6000 Ada took 15 minutes on the H100s with FSDP
15 minutes of compute to go to Llama3 => GPT-4ish performance on specific tasks. 15 minutes!
Swapped out our old model with
@OpenAI
's GPT3.5 (used for ChatGPT).
@defogdata
works *way* better now!
Very cool – especially because it's easy to fine-tune GPT3.5 on custom data. Even without fine-tuning, it's able to understand how to calculate COGS, gross profit ratio etc!
Just open-sourced a
#dataviz
library that lets you convert maps like the one on the left to the one on the right. Also works with Excel files and CSVs. Would love feedback!
Got to chat with
@sama
and OpenAI folks in SG today!
Pretty cool that they're doing this world tour thing – and are actually taking complaints and feedback seriously. *Really* hoping for more stability and increased capacity in the near future 🤞🏼
Reflecting on our decision to shut down last year. The product consistently served 100M+ unique monthly IPs, and produced content that was always on the first page of Google
But it failed as a business, largely because of my management failures. A long🧵
We recently onboarded a large British customer. They are... extremely polite when talking to LLMs 😅
Instead of questions like "what is X", most questions are phrased like "I am hoping to get X", or "could you please provide recommendations on how to get X". Cool to see!
Useful learning over the weekend – the RTX4090 is a beast. If you can get your model to fit in memory, it is almost as fast as an H100 SXM under low load⚡️
Realizing that number of CUDA Cores is often more important than memory bandwidth for inference. The 4090 really shines
Copilot converted an entire library from Python to NodeJS for me, and it worked nearly perfectly (I had to make just 2 edits)
It also wrote ~30% of the Python library
The more I work on
@databricks
, the more I'm amazed at how "complete" their product is
Probably the best positioned large co for enterprise adoption of AI and LLMs right now
- Fantastic semantic layer to understand data
- SQL warehouse that integrates with everything
-
Aight we have a pretty good Llama3-8B based SQLcoder brewing!
- GPT-4-turbo level performance on 0-shot text to SQL
- Almost GPT-4-turbo performance on instruction following and in-context k-shot learning for SQL gen
Lack of instruction following and in-context learning was the
Excited to finally share this!
Super proud of what we are building at
@defogdata
, and YC has helped us be a lot more ambitious about our vision.
Our group partners
@_puneetKumar
and
@bradflora
have already added so much value and pushed us to think bigger!
Welcome to YC,
@rishdotblog
,
@medha_basu
, and team
@defogdata
!
Defog is like ChatGPT for data - right within your app. It enables users to query data in seconds, using natural language.
Evaluated Claude-3 on SQL-Eval. Much better than Claude 2, but some way to go until GPT-4
For SQL generation, Opus has GPT-4 turbo level performance. Sonnet has similar performance as 3.5-turbo, but is also roughly 4x slower. GPT-4 is still significantly better
Llama3 (8b) performs much better than the code-focused CodeLlama 7b in my tests so far – both for SQL generation and general programming tasks
Amazing to see this in a model that also has world knowledge. Makes it a great base for both agent planning and SQL-generation tasks.
Quantization (and AWQ) is amazing. Just got our 34B model running – with almost no accuracy loss – on a single RTX 4090!
Next up, GGUF and making it work inside a Mac app 🤓
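An illustration of why quantization loses so little accuracy: weights are stored as low-bit integers plus a scale factor, and dequantization recovers values close to the originals. A toy symmetric int4 round-trip (this is not AWQ itself – AWQ additionally rescales salient channels using activation statistics, and GGUF uses per-group scales):

```python
# Toy symmetric 4-bit quantization round-trip. Real schemes (AWQ,
# GGUF q4/q5) layer per-group scales and activation-aware tricks on top.

def quantize_int4(weights):
    """Map floats to integers in [-7, 7] with one shared scale."""
    scale = max(abs(w) for w in weights) / 7 or 1.0
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q, scale):
    return [v * scale for v in q]

w = [0.12, -0.53, 0.97, -0.08, 0.44]
q, s = quantize_int4(w)
w_hat = dequantize(q, s)
max_err = max(abs(a - b) for a, b in zip(w, w_hat))
# Reconstruction error is bounded by half the quantization step.
assert max_err <= s / 2 + 1e-9
```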
I'll likely combine this data with step count, heart rate, and sleep data from Fitbit and then publish it to Github as a weekend project
Many startups (like ) are emerging in the wearable-enabled health space. Super excited about the future! [/fin]
FactGPT is pretty nuts. Possibly a preview of what Bing would look like with the ChatGPT integration
Has accurate information, works across geographies. Have not been able to get it to return "false" news
Take a bow
@AnkurPandey
,
@averma12
, and the
@LongShot_ai
team!
Just finished running evals for Postgres text-to-SQL on the new Llama 3 models
TLDR
- Unfinetuned llama models not (yet) as good as OpenAI and Claude models, but will easily outperform with finetuning on domain specific tasks
- Llama 3.1 8B is faaaar better than the Llama 3 8b
Important things I forgot to mention in the original tweet!
- SQLCoder is fine-tuned on StarCoder, an awesome initiative of the
@BigCodeProject
- We used a slightly novel training approach. We first trained the model on "easy" questions, and then trained the result of that on
We're *finally* SOC-2 Type II compliant 🤓
Took a fair bit of work, but getting our data security controls in place was so worth it – especially as we start serving more enterprise customers!
Got the beginnings of an AI data analyst that works 100% on the Macbook up!
Amazed at how fast it was (not sped up at all – literally just took the app 2 seconds to generate a query!)
llama-cpp and llama-cpp-python made dealing with Apple metal ridiculously easy :D
At an event with Jensen Huang right now — some notes
1. Automated production of intelligence at scale = new kinds of productivity. Ability to harness data into intelligence at scale will enable humans to do so much more
2. Much of this productivity will be independent of
You can now run SQLCoder with a GUI on Apple Silicon or any NVIDIA GPU-enabled device! On Apple Silicon, just run
CMAKE_ARGS="-DLLAMA_METAL=on" pip install "sqlcoder[llama-cpp]"
sqlcoder launch
The Apple Silicon version is not super accurate, but works great for simple
Wow,
@OpenAI
announcements today are 🔥
- chatgpt api, 10x cheaper than davinci and with generally better performance
- whisper-large as an api, cheaper and faster than anything else out there
- much better terms around data privacy and logging
Nuts!
Finished running SQL-Eval (200 text to SQL questions) on both the new GPT-4 turbo, and Gemini Pro 1.5
Caveat: this is for 0-shot responses only. Results might be different with k-shot prompting, and it's entirely possible that I used suboptimal prompts.
GPT-4 turbo reasons
Took an afternoon off for the first time in forever today to explore SF.
Such a beautiful city! So many people playing music, walking their dogs, exercising, and reading in the sun. Hoping to do more of this in the next few weekends :D
Low carb, high-fat foods led to almost no rise in blood sugar while being super satiating
For instance, the eggs & avocado toast below (cooked in olive oil, with some feta cheese) was around 600 calories and led to no blood sugar spike whatsoever! [5/]
Got SQLCoder-34B running on a Macbook (with minimal accuracy loss), using GGUF q5_k_m quantization!
@ggerganov
has opened so many doors for normies to experience AI and Apple Metal!
Quantized accuracy was 80%, compared to 84% for an unquantized model
Mean latency for SQL
Ashris is one of the best in the business and has a massive, living resume that speaks to his expertise. Can’t recommend this enough if you’re trying to learn more about data viz!
Data need not be boring! Let's learn to make data fun and insightful with IIP's course Introduction to Data Visualization on Unacademy!
Link:
Hurry! Limited discount period – students can avail an extra discount!
A surprising learning for me – carb heavy meals after not eating for a while cause a huge sugar spike!
Rajma + a whole meal wrap after 22 hours of fasting led to this. When intermittent fasting, will avoid a carb-heavy lunch from now [2/]
The impact of refined carbs was stark, too. Rotis (made with wheat atta) led to a really bad spike here
It would've likely been worse if not for a short (~10min) walk right after eating [3/]
Sigh I'm so glad we're moving to self-hosted LLMs for code gen
OpenAI keeps changing the underlying model without any notice. So frustrating to deal with
Back to traditional software engineering today after many days in fine tuning land and was super productive 🤓
Open-source app that makes LLM powered data analysis easy (and possible on a macbook!) coming this week
Cloudflare's new AI announcements look fun! Check out sqlcoder-7b-2 on their playground :D
Unfortunately it only allows chat-style inference right now (which we are not optimized for) – but it still outperforms other models for text to SQL tasks!
Air Quality will be bad tonight (though not as bad as last year). But average air quality over the next 3-4 months will be bad too, and causes more harm than just one day of bad air
Hoping that public angst around the issue won’t be restricted to just one cultural flash point!
Firecrackers (obviously) affect short-term air quality, but it's more complicated than that
AQI in Chennai (generally a very clean city) spiked because of firecrackers. But it will rapidly improve tomorrow because of the city's geography
Delhi is a different story [1/]
Pushed this yesterday!
Still crappy, but Data Narratives now supports video creation with AI voiceovers! Users make reports from charts they've saved, and a single click converts those reports into videos
Loads to improve (animations, titles, transitions) – but it's a start :D
Will definitely get a CGM for my parents so they can see what food items lead to a sugar spike for them. Continuous measurement will help identify dietary culprits
Also hope that the Apple Watch 7 has an optical glucose monitor – will help diabetics save so much money! [7/]
After a bit more testing, gpt-4o is a remarkable model for programming, tool use, and planning. Much better than gpt-4-turbo in ways that aren’t always captured by evals.
Also relies far less on prompt engineering, and tends to “just work” most of the time. Excited to see what
Such a great time to be building AI apps right now. GPT-4 is super promising. Google Cloud announcements are great. Fine-tuned Llama and Flan-UL2 are amazing for self-hosted models.
So many great options to choose from. *Amazing* time to be a builder!
Equally importantly, it is 2x faster than GPT-4 and GPT-4-Turbo when deployed on a single A100 80GB GPU. And as fast when deployed on 4x A10 GPUs (using vLLM)
We haven't had a chance to play with Nvidia's TensorRT LLM, but might get more speed gains with that.
Huge props to
Useful finding about fine-tuning today – training on the same dataset and hyperparameters can still give you dramatically different end results, even if your train and eval loss are pretty much the same
Consider this. I did 3 finetuning runs back to back on the same machine,
PSA: if you use GPT-4 prompts for other models (Gemini/Mistral/open-source), they won't work as well.
Spend the time to play around with different prompts for different models – what works for one is rarely what works for others
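One concrete reason prompts don't transfer: each model family expects a different chat template, so a raw GPT-4-style prompt ignores the conventions other models were trained on. A rough sketch of the differences (the template strings below follow the published formats, but double-check against each model's docs before relying on them):

```python
# Different model families wrap the same instruction in very different
# templates. Reusing one family's raw prompt with another usually
# bypasses these conventions and degrades output quality.

def format_prompt(model_family, system, user):
    if model_family == "openai":
        # OpenAI chat API takes structured messages, not one string.
        return [{"role": "system", "content": system},
                {"role": "user", "content": user}]
    if model_family == "mistral-instruct":
        # Mistral's instruct format folds everything into [INST] tags.
        return f"<s>[INST] {system}\n\n{user} [/INST]"
    if model_family == "llama3-instruct":
        # Llama 3 uses header tokens around each role.
        return ("<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n"
                f"{system}<|eot_id|><|start_header_id|>user<|end_header_id|>\n\n"
                f"{user}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n")
    raise ValueError(f"unknown family: {model_family}")

prompt = format_prompt("mistral-instruct", "You write SQL.", "Count all users.")
```

In practice, tokenizer-provided chat templates (e.g. Hugging Face's `apply_chat_template`) handle this for you, but prompt *content* still needs per-model tuning.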
Gah my GitHub commits went to near 0 in the last two weeks. Spent most of my time in investor calls
Super hungry to make up for lost time 🤓 Starting off with onboarding and efficiency improvements. Then more "fun" features!
But going for long walks after eating high-carb, high-calorie meals can lead to a less extreme response
In this graph, I had the same amount of paneer as in the graph above and the wraps had the same amt of carbs as the rotis. But a long (~5km) walk meant no sugar spike [4/]
Aight starting another 1x/day shipping challenge for the month of May – with the added constraint of exercise and sleep
The plan:
- push 1 feature live every day
- do one of a 4km run OR strength training OR a 10km hike every day
- sleep at least 7 hours every day
Should be a
At a Singapore x TechCrunch event in SF today and
@AndrewYNg
gave an awesome talk around where he sees AI going.
I got my start with ML on Coursera 10 years ago! Pretty cool to see him talk about ML all these years later!
As winter sets in, winds slow down, temperatures are lower, stubble burning increases as farmers clear their fields… and Diwali coincides with all these things
Lol I love the RELEASE file in
@MistralAI
's torrent!
Also, look at how tiny that team is. Amazing what a small group of smart, motivated people can do
Epiphany today: thinking about the same stuff over & over can feel like going around in circles. But it’s an upwards spiral
Quality of insights compounds over time. Engaging with novel things *feels* great, but meditating on the same stuff over years can lead to better outcomes
So happy to see Data Narratives being adopted by giants like
@timesofindia
!
Excited that our collaboration workflows are coming together. Massive thanks to
@indianeconomy
&
@drindrajeetrai
for trying an initially buggy product & making it more useful :D
Just had a 10th grader reach out who runs a freemium newsletter for value investors, has built a SaaS app, runs a podcast, and is trying to get better at machine learning right now
Love the drive. The next gen is alright!
If you want traction, don't say 'I have a category defining product'. Instead, say 'the world is broken in this way'. The former is narcissistic, the latter empathetic
Great
#MastersofSaaS
session by
@dharmesh
and
@MohapatraHemant
. Notes at
🧵below [1/]
Lastly, it's not just about what you eat. Portion size is as important. If I overeat healthy things (like chicken breast+feta+capsicum wraps, or wholemeal oats), my blood sugar still spikes a lot
Though overeating high carb things (rajma, pizza) is much worse [6/]
We just enabled self-adaptive learning! When Defog gets a query wrong, it now debugs itself automatically.
In this example, the initial query missed an edge case and hit a divide-by-0 error. Defog saw the error and fixed it - like a human would.
Day 13/30 in our feature-a-day streak!
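The self-repair loop described above can be sketched as a simple retry: execute the generated SQL, and on failure feed the error message back to the model for a fix. A minimal sketch with a stubbed model and database (illustrative only, not Defog's actual implementation):

```python
# Minimal self-correction loop: run generated SQL, and on failure hand
# the error back to the generator to produce a fixed query.

def run_with_self_repair(question, generate, execute, max_retries=2):
    query = generate(question, error=None, prior_query=None)
    for _ in range(max_retries + 1):
        try:
            return query, execute(query)
        except Exception as e:
            query = generate(question, error=str(e), prior_query=query)
    raise RuntimeError("could not produce a working query")

# Stubbed "model": first emits a query that divides by zero, then,
# when shown the error, guards the denominator with NULLIF.
def fake_model(question, error, prior_query):
    if error is None:
        return "SELECT revenue / orders FROM stats"
    return "SELECT revenue / NULLIF(orders, 0) FROM stats"

# Stubbed "database" that fails on the unguarded division.
def fake_db(query):
    if "NULLIF" not in query:
        raise ZeroDivisionError("division by zero")
    return [(None,)]  # NULLIF turned the bad row into NULL

q, rows = run_with_self_repair("avg revenue per order?", fake_model, fake_db)
```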
TIL about Google's Deplot – a chart-to-table VLM that works surprisingly well!
Just 282M parameters – quite fast even on CPUs! Can probably fine-tune this to also give good results for statistical charts (like boxplots etc). Will play around!
Exactly 5 years ago, we started making data-driven election videos that would eventually get 4 million+ views on a shoestring budget – and powered production across YouTube, FB live, and TV!
@nalinmehta
,
@sanjeevrsingh
and I worked our butts off, but had a ton of fun!
Can't believe this works :D This model is all of 12 minutes old at this point (and has been in the works for a month). Will improve over time, but super happy with where it is rn!
Defog now has (rudimentary) reasoning abilities!
You can now ask broad questions, like whether higher prices lead to lower sales, and get human-interpretable answers!
This is Day 9 of our daily product pushes. We like moving fast, and we are just getting started 🛠️
Trained an LLM to answer questions based on my book notes (around a million words in total) – works *really* well!
Inspired by
@NirantK
's Roam model. Getting it to stop hallucinating was a fun challenge. As was getting it to return "I don't know" answers
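A common way to get that "I don't know" behavior is a retrieval-confidence gate: if no note scores above a cutoff for the question, refuse instead of letting the model guess. A toy sketch with bag-of-words overlap standing in for real embedding similarity (not the actual system):

```python
# Toy retrieval gate: answer only when some note overlaps the question
# strongly enough; otherwise return "I don't know" instead of guessing.

def overlap_score(question, note):
    q, n = set(question.lower().split()), set(note.lower().split())
    return len(q & n) / max(len(q), 1)

def answer(question, notes, threshold=0.5):
    best = max(notes, key=lambda note: overlap_score(question, note))
    if overlap_score(question, best) < threshold:
        return "I don't know"
    return f"Based on my notes: {best}"

notes = ["Kahneman: system 1 is fast and intuitive",
         "Taleb: fragility is sensitivity to volatility"]
print(answer("what is system 1", notes))          # grounded answer
print(answer("who won the 1998 world cup", notes))  # nothing relevant
```

Real systems use embedding cosine similarity and tune the threshold on held-out questions, but the refuse-below-threshold shape is the same.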
So
@medha_basu
and I A/B tested our elevator pitch on
@collision
today!
Incredibly kind of John to talk to 2 no-name early stage founders. Very struck by how carefully and thoughtfully he listened
Thanks
@caitbhri
,
@42piyush
and the Stripe folks for making this happen!
Pretty cool to see this on HF trending today :D
Also, building some fun MLX integrations, thanks to
@Ubunta
's awesome MLX port. Already a part of sql-eval in this PR:
ChatGPT has replaced Google as my primary go-to place for technical questions
Doesn't always get it right. But WAY faster to try + debug ChatGPT code than go through SEO clickbait
Me this morning, two americanos in, manically banging out code and headbanging to great music, thinking "it's such a wonderful day!"
Barista, 30 minutes later: Sir, I love that you're having a great time, but could you relax a bit? A patron thinks you're high on drugs
😅
Some nerd stuff! 🤓
- If you want a very simple way to play with it, check out our Github repo:
- You'll need a TON of VRAM to run this fast. We've found AWQ quantization a really good way to keep accuracy high while keeping latency and VRAM low. Would
Wow.
@OpenAI
's product velocity is incredibly inspiring. Amazed to see them move as quickly as they have in the last 12 months. Something for all builders to aspire to!