Archie Profile Banner
Archie Profile
Archie

@archieemwood

Followers
1,097
Following
190
Media
390
Statuses
1,325

the data driven dad (TM) 💻: @evidence_dev

Joined March 2015
Don't wanna be here? Send us removal request.
Explore trending content on Musk Viewer
Pinned Tweet
@archieemwood
Archie
2 months
the @evidence_dev charting library is getting soo good
7
17
203
@archieemwood
Archie
6 months
Tweet media one
@ExcelHumor
memes.xlsx
6 months
Tweet media one
21
530
5K
4
104
833
@archieemwood
Archie
3 months
one of the more awesome things you can do with @duckdb query geospatial data straight from github this is from the excellent Natural Earth Data, and contains the boundaries for all countries in the world
Tweet media one
4
43
413
@archieemwood
Archie
2 months
pov: your title is analytics engineer
Tweet media one
3
47
363
@archieemwood
Archie
3 months
update: @duckdb ftw
Tweet media one
@archieemwood
Archie
3 months
who needs stats packages when you have SQL
Tweet media one
0
5
37
6
32
343
@archieemwood
Archie
2 months
who needs a BI tool when you have @duckdb
9
40
315
@archieemwood
Archie
1 year
When I was a Chief of Staff, one meeting generated more stress than *all the others* in the week. What was it? Our Tuesday AM 📊Weekly Business Review📊 (1/x)
Tweet media one
10
25
297
@archieemwood
Archie
2 months
okay so how about this @duckdb 's statistical functions have enough coverage that you can basically recreate facebook prophet in SQL time series forecasting in SQL, here we come
Tweet media one
4
23
268
@archieemwood
Archie
2 months
it can be tricky to do forecasting in SQL, because SQL isn't really designed for operating on data that doesn't exist in @duckdb it's simple, by combining generate_series() with unnest()
2
20
236
@archieemwood
Archie
2 months
TIL @duckdb has an .excel command which opens the next query in Excel
8
19
234
@archieemwood
Archie
1 month
awesome @duckdb ad oh wait, was it for something else?
@mdancho84
🔥 Matt Dancho (Business Science) 🔥
1 month
Why all data scientists should learn Polars in Python. This is why: 🧵
Tweet media one
7
92
521
7
13
216
@archieemwood
Archie
3 months
69yo mother in law asked about my job: me “I work in BI” her: “is that like AI?” me: “confusingly not” didnt have the heart to tell her i also do CI
6
17
206
@archieemwood
Archie
18 days
you can query any public dataset stored on @huggingface with @duckdb
1
25
204
@archieemwood
Archie
1 month
every data team goes through this
Tweet media one
4
13
192
@archieemwood
Archie
2 months
okay so this is it this is what the CFO wanted, right?
6
10
169
@archieemwood
Archie
2 months
to build this whole app, I wrote <60 lines of code ~ 20 lines of SQL + 35 lines of Markdown
Tweet media one
2
7
161
@archieemwood
Archie
2 months
presented without comment
Tweet media one
4
9
160
@archieemwood
Archie
2 months
okay so why is this such a cool @duckdb feature? it means you group by multiple different dimensions in one query!
Tweet media one
@archieemwood
Archie
2 months
very excited to try our CUBE / ROLLUP in @duckdb
Tweet media one
2
11
79
3
12
136
@archieemwood
Archie
2 months
so you can store a website inside @duckdb ...and render it using @evidence_dev
Tweet media one
2
10
135
@archieemwood
Archie
2 months
more things you shouldn't do with @duckdb hit the undocumented google trends API
Tweet media one
5
9
133
@archieemwood
Archie
1 month
when you run a query in @duckdb , it outputs results in a nice box but there are 20 other **really useful** formats you can use with .mode here are 7 of my favourite .mode duckbox (default)
@PoslunsMitchell
Mitchell Posluns
1 month
@archieemwood @duckdb @evidence_dev Keep rocking my world with these posts…unreal
0
0
9
4
14
132
@archieemwood
Archie
19 days
Evidence, now used by @NHSuk
Tweet media one
8
15
127
@archieemwood
Archie
2 months
also had some fun investigating command line plotting tools for the same dataset this is Youplot by Red Data Tools
Tweet media one
Tweet media two
2
9
114
@archieemwood
Archie
2 months
okay so now plot as a @duckdb extension run SQL, get chart
3
5
103
@archieemwood
Archie
2 months
part of what makes the SQL so terse is that it's @duckdb `group by all` is such a quality of life feature
Tweet media one
@archieemwood
Archie
2 months
to build this whole app, I wrote <60 lines of code ~ 20 lines of SQL + 35 lines of Markdown
Tweet media one
2
7
161
0
6
102
@archieemwood
Archie
2 months
change my mind: all radar charts should just be another kind of chart
Tweet media one
23
2
97
@archieemwood
Archie
2 months
exploring @duckdb 's recent performance data with @evidence_dev is such a delight with the new SQL autocomplete
3
7
88
@archieemwood
Archie
3 months
and since @evidence_dev uses DuckDB, you can get geospatial data directly into Evidence like this
Tweet media one
@archieemwood
Archie
3 months
one of the more awesome things you can do with @duckdb query geospatial data straight from github this is from the excellent Natural Earth Data, and contains the boundaries for all countries in the world
Tweet media one
4
43
413
3
9
86
@archieemwood
Archie
1 month
underrated feature of @evidence_dev it creates a parquet catalog from your data sources and publishes them with the app so you’re not waiting on the data warehouse for your dashboard to load
Tweet media one
2
9
86
@archieemwood
Archie
11 months
just released @duckdb VSCode SQLTools v0.0.5, with support for DuckDB v0.9.1 - database explorer, query runner, autocompletion - connect to a local, in memory or @motherduck DB
3
9
86
@archieemwood
Archie
2 months
NLP exploration with @duckdb ... DuckDB doesn't have libraries like nltk for NLP, but that doesn't mean you can't do NLP, you just have to implement it from scratch
Tweet media one
6
4
84
@archieemwood
Archie
1 month
presidential race odds just using SQL with @duckdb and Youplot
2
3
83
@archieemwood
Archie
2 months
just recorded some new getting started videos on how to use @evidence_dev 1 / Installing Evidence from VS Code
2
8
82
@archieemwood
Archie
2 months
if you haven't yet, it's time to come check out @evidence_dev
@archieemwood
Archie
2 months
who needs a BI tool when you have @duckdb
9
40
315
1
10
81
@archieemwood
Archie
9 months
this is really some cool innovation it uses @duckdb webassembly to reissue SQL queries in the browser against data files this means you can explore very large datasets, with extremely low latency
@evidence_dev
evidence
9 months
Evidence Universal SQL is coming! This brings multi data source support, and fast interactivity to Evidence Sign up for the community preview next week here:
3
11
68
1
9
81
@archieemwood
Archie
1 year
continuing on my vscode extension tour, just published an open source extension for duckdb supports connecting to local, in-memory, and @motherduck instances
0
16
80
@archieemwood
Archie
2 months
very excited to try our CUBE / ROLLUP in @duckdb
Tweet media one
2
11
79
@archieemwood
Archie
1 month
brb query running
Tweet media one
@matsonj
Jacob Matson
1 month
@archieemwood @nicoritschel @duckdb When the query is too fast, less downtime for tweets
1
1
13
3
7
77
@archieemwood
Archie
2 months
and if you can get it into duckdb, then you can get it in @evidence_dev so i replicated the twitter stats portal in Evidence
@archieemwood
Archie
2 months
X premium users get analytics on how much people are viewing their content if you open the devtools network tab, you can see all that data gets delivered in a JSON blob it's a pretty hairy nested JSON, but as always, only a query away from a nice table in @duckdb
6
4
66
1
5
76
@archieemwood
Archie
10 months
Update, I now have all the (roughly 100) coalesce session videos online - so you can watch any you missed! All indexed and easy to search and find
2
12
75
@archieemwood
Archie
1 year
just shipped read-write support in SQLTools for DuckDB edit your local @duckdb or @motherduck , and see it update, all in vscode
Tweet media one
0
8
71
@archieemwood
Archie
1 month
so wow you can deploy @evidence_dev to @huggingface and it's so easy watch this space...
2
6
71
@archieemwood
Archie
1 month
in @evidence_dev you can annotate notable events in your database and then display them on your charts all in SQL
Tweet media one
3
2
69
@archieemwood
Archie
2 months
writing a @duckdb extension for the first time here's a quick guide DuckDB extensions allow you to add: - custom functions (scalar, aggregate, table) - custom data types - custom output formats
4
8
67
@archieemwood
Archie
2 months
X premium users get analytics on how much people are viewing their content if you open the devtools network tab, you can see all that data gets delivered in a JSON blob it's a pretty hairy nested JSON, but as always, only a query away from a nice table in @duckdb
6
4
66
@archieemwood
Archie
1 month
most charting tools require inordinate amounts of config for a decent looking chart in @evidence_dev a great looking chart is 5 lines of code
Tweet media one
Tweet media two
Tweet media three
Tweet media four
3
6
64
@archieemwood
Archie
14 days
out here doing my best to shill @duckdb and i have to read stuff like this
@merm_bot
Senior Spreadsheet Engineer
14 days
guys you gotta shill this harder, did you know you can create a duckdb database from a csv and then just query it from dbeaver? Very very useful:
9
6
123
1
1
62
@archieemwood
Archie
1 month
@PradyuPrasad so if I look after my friends child for four hours and they pay me $100 and then tomorrow they look after my child and pay me $100 for 4 hrs, that increases GDP by $200 But if we just agree to swap, then it wouldn’t There are lots of uncounted exchanges of services
0
0
60
@archieemwood
Archie
2 months
this is all SQL for the dynamic forecasting @evidence_dev for the plotting
@archieemwood
Archie
2 months
okay so how about this @duckdb 's statistical functions have enough coverage that you can basically recreate facebook prophet in SQL time series forecasting in SQL, here we come
Tweet media one
4
23
268
4
3
61
@archieemwood
Archie
4 months
so my newest life update is that I'm now a dad!
Tweet media one
15
0
61
@archieemwood
Archie
2 months
how I feel when I rip open @duckdb
Tweet media one
@NTFabiano
Nicholas Fabiano, MD
2 months
Psilocybin leads to a profound shift in brain connectivity patterns. 🧵1/10
Tweet media one
358
2K
19K
1
3
60
@archieemwood
Archie
2 months
X also stores a JSON blob with the stats for each of your individual posts, so i've made a page for each post next stop: what makes a hit tweet?
@archieemwood
Archie
2 months
and if you can get it into duckdb, then you can get it in @evidence_dev so i replicated the twitter stats portal in Evidence
1
5
76
1
6
60
@archieemwood
Archie
12 days
getting started with the @duckdb CLI in <20s for @merm_bot
@matsonj
Jacob Matson
14 days
getting started with @duckdb in 45 seconds - for @merm_bot
5
6
73
1
3
60
@archieemwood
Archie
2 months
this is a use case I really like small multiples for
@archieemwood
Archie
2 months
exploring @duckdb 's recent performance data with @evidence_dev is such a delight with the new SQL autocomplete
3
7
88
0
6
55
@archieemwood
Archie
2 months
made an incredibly minimal CLI plotting tool allows you to plot using csv or command line arguments what should I add next?
4
0
52
@archieemwood
Archie
2 months
next step: turn the text into a "Bag of Words" representation it's a sparse table with one row per tweet, one column per word, count of words in most SQL dialects this would be impossible - you need to know the columns ahead of time to pivot not with @duckdb 's PIVOT syntax
@archieemwood
Archie
2 months
one of the more interesting parts of this is you need a list of "stop words" - common words that aren't interesting when doing text analysis: I, you, they, about, in, that they aren't built into duckdb, but there are lots of great public lists, so just a SELECT away
1
3
43
2
3
51
@archieemwood
Archie
9 months
anyone else doing advent of code in SQL? 👀
Tweet media one
3
7
49
@archieemwood
Archie
11 months
Oh @davidgasquez I love this so much - the terminal IDE for duckdb
Tweet media one
4
8
48
@archieemwood
Archie
1 month
probably more useful to use it to generate @evidence_dev charts
@archieemwood
Archie
1 month
playing around with LLM summaries for @evidence_dev 🔥 or👎
9
3
44
2
3
45
@archieemwood
Archie
1 month
playing around with LLM summaries for @evidence_dev 🔥 or👎
9
3
44
@archieemwood
Archie
2 months
one of the more interesting parts of this is you need a list of "stop words" - common words that aren't interesting when doing text analysis: I, you, they, about, in, that they aren't built into duckdb, but there are lots of great public lists, so just a SELECT away
@archieemwood
Archie
2 months
NLP exploration with @duckdb ... DuckDB doesn't have libraries like nltk for NLP, but that doesn't mean you can't do NLP, you just have to implement it from scratch
Tweet media one
6
4
84
1
3
43
@archieemwood
Archie
6 months
just using @evidence_dev 's Dimension Grid for the first time, and it is a really nice way to understand your data
0
4
43
@archieemwood
Archie
25 days
US BLS publishes avg salaries for 1,100 jobs at a state / metro level 6M rows per year = child's play for @duckdb data scientist salaries by state, 2023
3
8
42
@archieemwood
Archie
1 month
*gentle quacking noise in background* @duckdb
@RyanEls4
Ryan Els
1 month
Who does this 🤔
Tweet media one
759
725
12K
0
0
42
@archieemwood
Archie
4 months
now I get to combine my three favourite things Data + Dad + Dashboards
Tweet media one
@tayloramurphy
Taylor A Murphy
4 months
@archieemwood data dads are best dads! huge congratulations Archie!!
0
0
5
6
2
42
@archieemwood
Archie
1 month
in most bi tools it’s hard to create layered maps because drag and drop UIs for creating maps layers suck in @evidence_dev you compose maps from layers with code based on the wonderful, open source @LeafletJS
Tweet media one
3
8
41
@archieemwood
Archie
1 year
For all you BigQuery-ers out there I just published an extension so you can explore and run SQL from VSCode #bigquery #vscode #sql
2
6
39
@archieemwood
Archie
10 days
for the @evidence_dev hackathon at our offsite this year, I built evidence-new drop your CSV into a new Evidence app in 20 seconds
4
2
38
@archieemwood
Archie
3 months
who needs stats packages when you have SQL
Tweet media one
@archieemwood
Archie
3 months
yo dawg i heard you like linear regressions in your BI tool
Tweet media one
1
0
5
0
5
37
@archieemwood
Archie
1 month
i wrote up how to deploy @evidence_dev on @huggingface spaces
Tweet media one
2
5
36
@archieemwood
Archie
11 months
if you listen carefully, you can hear the sound of no-one reading your documentation
2
3
35
@archieemwood
Archie
18 days
one of the low-key best things about @evidence_dev is how terse the charting syntax is 6 lines of code -> beautiful viz
Tweet media one
1
4
34
@archieemwood
Archie
20 days
have been waiting until someone grabbed the olympics subdomain on @evidence_dev was not disappointed
Tweet media one
Tweet media two
Tweet media three
Tweet media four
3
6
33
@archieemwood
Archie
6 months
stuff coming out from @evidence_dev today: 1) a really sweet dimension grid component - kudos to @AdamMcaskill for shipping this one
1
5
33
@archieemwood
Archie
1 month
today is a special day
Tweet media one
5
0
32
@archieemwood
Archie
7 months
so an absolutely mega set of features released today for @evidence_dev here are some of my favourites 👇
3
4
30
@archieemwood
Archie
2 months
with @evidence_dev in the top 10 in product hunt today, I wanted to share 10 awesome projects the community has built 🧵
2
12
29
@archieemwood
Archie
6 days
Evidence-new: Create an @evidence_dev app in 40 seconds
1
1
37
@archieemwood
Archie
11 months
if you missed any of the talks at dbt coalesce this year, I've published an "unofficial coalesce app" where you can find slides for ~30% of the sessions. planning to add links to youtube vids after they are published too
1
1
27
@archieemwood
Archie
9 months
One of the other cool things that is supported in universal SQL is non database data sources like... google sheets
@evidence_dev
evidence
9 months
Evidence Universal SQL is coming! This brings multi data source support, and fast interactivity to Evidence Sign up for the community preview next week here:
3
11
68
3
4
26
@archieemwood
Archie
11 months
90% of data team-biz stakeholder interactions Data: Hey, we're from the data team, and we're here to help. If there's a problem that we can solve with data - ask! Biz: I have this excel, it's taken me 100 hours to build, and takes me 5 more each week. Can you automate? Data:💀
2
1
26
@archieemwood
Archie
9 months
advent of code in SQL day 1 part deux this regex feels pretty dirty - anyone got anything better?
Tweet media one
4
0
26
@archieemwood
Archie
2 months
@0xGKBRK @duckdb stop with your reasonable questions i’m just here to shitpost SQL
0
0
25
@archieemwood
Archie
11 months
Did a demo call with @Oracle MySQL Heatwave today, and must say the pitch is good: 1. One database for both transactional and analytical queries: Less ETL needed. 2. It’s 3x faster and 5-8x cheaper than [Snowflake / Aurora / BigQuery] 3. Support for querying CSV, parquet files
Tweet media one
4
2
25
@archieemwood
Archie
18 days
this is a cool feature let me look up my job title in the BLS occupation list to see what bucket I'm closest to results seem good
Tweet media one
@tdoehmen
Till Döhmen
18 days
Excited to share that @motherduck now makes computing semantic similarities in @duckdb as simple as a one-liner!
Tweet media one
2
6
64
1
2
23
@archieemwood
Archie
1 month
rendering a new component on the fly with @evidence_dev (very directed output for now)
@matsonj
Jacob Matson
1 month
@archieemwood @evidence_dev Rendering a new component on the fly would be sick, but could it work reliably enough??
0
0
1
3
1
23
@archieemwood
Archie
6 months
Just released DuckDB 0.10.0 support for the VSCode SQLTools extension Thanks @_ccer0z for the PR to get it over the line!
1
1
22
@archieemwood
Archie
1 year
If anyone's looking for a data front-end for @motherduck
@evidence_dev
evidence
1 year
👀
Tweet media one
0
0
18
0
5
21
@archieemwood
Archie
1 month
US election odds in @evidence_dev 15 lines of code and a json file
Tweet media one
0
3
21
@archieemwood
Archie
2 months
@duckdb @evidence_dev 8 lines of markdown
Tweet media one
2
1
20
@archieemwood
Archie
1 month
v excited for this
@rolandbouman
Roland Bouman
1 month
Development builds of @DuckDB have a version of histogram() () that lets you specify the bins. () So now we can do stuff like:
Tweet media one
Tweet media two
1
3
20
1
0
20
@archieemwood
Archie
8 months
making your own interactive components is super easy in Evidence now made this "freetext search in a chart" in a few mins cool: 💯 useful: 🤔
@evidence_dev
evidence
8 months
Evidence Universal SQL is live. Try it out:
2
14
113
1
1
20
@archieemwood
Archie
6 months
@matsonj continuing to build the coolest stuff with @evidence_dev a sweet "what if" matchup simulator between historical NBA teams
1
2
20
@archieemwood
Archie
1 month
billion dollar idea - who’s building this?
Tweet media one
7
2
20
@archieemwood
Archie
3 months
@FrankBoerman @duckdb duckdb> install spatial; load spatial; select * from ' https://whatever/shapefile/on/the/internet.geojson';
1
3
18
@archieemwood
Archie
2 months
@evidence_dev 60 lines of code in @evidence_dev here's the source code if you want to run it on your own account
Tweet media one
2
2
18
@archieemwood
Archie
6 months
tables containing lots of KPIs in rows are really common, but somehow they are really hard in most BI tools messing around with the UX in @evidence_dev thoughts?
Tweet media one
3
1
18
@archieemwood
Archie
2 years
Just arrived for our wedding in Lyon and @AirCanada has lost our bag, with my fiancée’s wedding dress in it. Can you help get the dress to us before Friday and save the wedding? +1 647 207 7690 Flew AC7968, AC867 Have willing friends in MTL, NYC & Paris so can get creative!
5
10
18
@archieemwood
Archie
10 months
my first database was as csv file committed to git it worked but made my commit history look kinda crazy
Tweet media one
Tweet media two
@_sysengineer
sysengineer
10 months
I just read a 127 comment thread about people arguing whether or not Excel was a database or not
262
102
2K
1
1
17