DuckDB

@duckdb

Followers
13,963
Following
3
Media
145
Statuses
702

DuckDB is an analytical in-process SQL database management system. "DuckDB" and the DuckDB logo are registered trademarks of the DuckDB Foundation.

Amsterdam, The Netherlands
Joined May 2019
Pinned Tweet
@duckdb
DuckDB
24 days
We are proud to release the first major version of DuckDB, v1.0.0, codenamed "Snow Duck". This version is a culmination of almost six years of research and development. Today we are shipping an innovative database system with a backwards-compatible storage format. Check out our
23
284
1K
@duckdb
DuckDB
5 months
New blog post by @mraasveldt : Multi-Database Support in DuckDB DuckDB can now attach MySQL, Postgres, and SQLite databases in addition to databases stored in its own format. This allows data to be read into DuckDB and moved between these systems in a
10
85
364
@duckdb
DuckDB
2 months
DuckDB was recently covered in @andy_pavlo 's Advanced Database Systems course at CMU. The lecture covers DuckDB's history, internals, and integration with other systems. Slides: Recording:
1
70
362
@duckdb
DuckDB
4 months
We are proud to release DuckDB v0.10.0: Some highlights: – A reworked and much faster CSV reader – Fixed-length arrays – Multi-database support – Secrets manager – Temporary memory manager – Adaptive lossless floating-point compression – New CLI editor –
1
63
273
@duckdb
DuckDB
1 year
Looks like duckdb is now pre-installed on Google Colaboratory
3
23
268
@duckdb
DuckDB
1 year
DuckDB 0.7.0 "Labradorius" released with #JSON support, parallel and partitioned export to CSV and Parquet, UPSERT, @DataPolars integration, and much more in our release announcement blog post:
9
52
266
@duckdb
DuckDB
2 months
DuckDB is introducing support for vector similarity search through the new VSS extension. Read @Maxxen_ 's blog post for a sneak preview on the new extension's capabilities:
2
55
253
@duckdb
DuckDB
7 months
We wrote a performance guide for DuckDB users! This guide covers topics such as the effects of schema (constraints, indexing) and hardware (CPU, memory, disk). We also share best practices for querying Parquet files and tips for tuning your workload.
1
47
226
@duckdb
DuckDB
29 days
New blog post: Access 150k+ Datasets from Hugging Face with DuckDB This blog post, co-authored by the @huggingface and DuckDB teams, describes how you can use the hf:// prefix in DuckDB to access datasets in Hugging Face repositories. Read more at
4
57
226
@duckdb
DuckDB
17 days
DuckDB has introduced native Delta Lake support. In our new blog post, @samansmink walks through the design and implementation of the new Delta Lake extension. Read more at
6
39
222
@duckdb
DuckDB
5 months
We launched a new website! Check it out at .
3
35
209
@duckdb
DuckDB
3 months
New blog post by @lnkuiper – No Memory? No Problem. External Aggregation in DuckDB The post describes how DuckDB can efficiently aggregate over many more groups than fit in memory, allowing it to complete the 50 GB variant of the
2
34
191
@duckdb
DuckDB
8 months
DuckDB's co-creator, Hannes Mühleisen, recently became a professor of data engineering at Radboud University. The recording of his inaugural lecture, titled "The Ancient Art of Data Management", is now available.
1
29
183
@duckdb
DuckDB
7 days
New blog post by @szarnyasg : Command Line Data Processing: Using DuckDB as a Unix Tool This blog post shows how DuckDB stacks up against classic Unix tools (such as cut, grep, sort, and sed) when performing simple data processing steps. Read more at
1
55
185
@duckdb
DuckDB
2 years
DuckDB 0.6.0 "Oxyura" released with improved storage, higher performance for CSV loading and indexing, new SQL syntax, better memory management, shell tweaks and so many new features @mraasveldt wrote a separate blog post to explain it all:
4
41
170
@duckdb
DuckDB
4 months
DuckDB supports querying buckets in the AWS S3 Express One Zone. Read the related guide at , which shows that DuckDB can read a Parquet file from an S3 Express One bucket at about 1.2 gigabytes per second! PS: You may also have noticed that we started rolling
1
33
154
@duckdb
DuckDB
4 months
The DuckDB repository hit 15,000 stars on GitHub today. Thanks to our contributors and our amazing community!
0
5
144
@duckdb
DuckDB
4 months
New blog post by @__AlexMonahan__ SQL Gymnastics: Bending SQL into flexible new shapes In this post, Alex presents pure SQL queries to implement dynamic groupings and aggregate functions using DuckDB's friendly SQL extensions. The queries can be used to
2
25
143
@duckdb
DuckDB
10 months
We have revamped one of our core operators, aggregation. It has improved scalability for many unique groups and for a large number of cores. Thanks to these, you can expect better performance when running large aggregations on big machines.
1
19
141
@duckdb
DuckDB
6 months
The Awesome DuckDB repository, maintained by @davidgasquez , has grown to more than 100 entries in less than a year. If you are aware of more cool projects using DuckDB, please consider submitting a PR!
2
28
140
@duckdb
DuckDB
2 months
Did you know that you can connect to a DuckDB database file via HTTPS or S3 with just two SQL statements? We have a new guide that explains how to do this.
2
18
137
@duckdb
DuckDB
3 months
New blog post by @hfmuehleisen duckplyr: dplyr powered by DuckDB The post describes the new R package duckplyr, which translates the dplyr API to DuckDB’s execution engine. Read more at
1
37
137
@duckdb
DuckDB
3 months
There are now a lot of handy tools and cool projects built around DuckDB. You can find a list of these in the Awesome DuckDB repository maintained by @davidgasquez . See the list and contribute your project at
1
30
129
@duckdb
DuckDB
2 years
DuckDB just reached 5000 stars on GitHub, thanks everyone!
2
10
131
@duckdb
DuckDB
23 days
This blog post by Cal Paterson (), "DuckDB Isn't Just Fast", discusses some of DuckDB's characteristics outside of sheer processing speed: developer ergonomics, scalability using out-of-core processing, and ease of setup.
0
18
125
@duckdb
DuckDB
1 year
New blog post: JupySQL enables SQL cells in Jupyter, supports DuckDB, and also enables plotting larger than memory datasets using DuckDB! JupySQL is an active fork of ipython-sql being enhanced by the folks at @ploomber . Let us know what you think!
4
23
122
@duckdb
DuckDB
5 months
The new DuckDB landing page, , has several code snippets for SQL features and DuckDB's APIs. You can use the "Live Demo" button to execute the queries on an example dataset in your browser using the DuckDB shell that runs in WebAssembly. Note: the demo
1
18
119
@duckdb
DuckDB
18 days
DuckDB's documentation is now available for offline use both as a PDF and as a ZIP archive, which contains the static HTML of the website. Head to to grab a copy.
3
10
119
@duckdb
DuckDB
13 days
The R package for DuckDB v1.0.0 is out on CRAN! Run install.packages("duckdb") to upgrade.
0
21
116
@duckdb
DuckDB
17 days
We released DuckDB v1.0.0 a week ago. There is a growing list of tools integrating with DuckDB, applications that use DuckDB, and extensions created for DuckDB. You can find a list of these in the Awesome DuckDB repository, maintained by @davidgasquez . The list is never
1
18
115
@duckdb
DuckDB
4 months
We have started publishing the recordings of DuckCon #4 . We are first releasing the “State of the Duck” talk by DuckDB's co-creators, Hannes Mühleisen ( @hfmuehleisen ) and Mark Raasveldt ( @mraasveldt ). Video: Slides: Special thanks
2
21
113
@duckdb
DuckDB
4 months
The DuckDB team celebrates yesterday’s release, v0.10.0, with a custom-made yoghurt cake! 🍰
2
4
111
@duckdb
DuckDB
6 months
Lambda functions are one of the most popular features in DuckDB. We recently added list_reduce, a new scalar function that supports lambdas, and they got their own documentation page at . Note that this feature is currently only available in DuckDB's
0
16
106
@duckdb
DuckDB
11 months
DuckDB just got a tldr page. If you have @tldr_pages installed, you can get examples of the most common command-line arguments with: $ tldr duckdb
0
21
105
@duckdb
DuckDB
16 days
Congratulations to @motherduck on launching their DuckDB-powered cloud data warehouse in General Availability! 🎉
@motherduck
MotherDuck
16 days
MotherDuck, the ducking simple data warehouse, is now Generally Available! 🍾🥂 Thank you to our community of thousands of users who have tested, validated, and helped improve MotherDuck over the last year.❤️🦆
5
44
186
3
10
107
@duckdb
DuckDB
2 months
Did you know that DuckDB's documentation is available as a single PDF? It includes all documentation pages and guides, and just surpassed 1,000 pages.
2
15
103
@duckdb
DuckDB
6 months
New blog post by @carlo_piovesan : Extensions for DuckDB-Wasm Thanks to recent developments, DuckDB-Wasm users can now load DuckDB extensions, allowing them to run extensions in the browser.
2
31
102
@duckdb
DuckDB
8 months
DuckDB was included in @InfoWorlds 's best open-source software list as a "tiny-but-powerful project" that provides just enough OLAP for most use cases. The award praised the lightweight nature and many features of DuckDB.
@InfoWorld
InfoWorld
8 months
The best open source software of 2023 Ready to discover more? Click the image below ⬇️
0
2
9
2
19
100
@duckdb
DuckDB
4 months
The third talk at DuckCon #4 was presented by Subash Roul of @Fivetran , titled “Building Data Lake Using DuckDB”. Video: Slides:
1
22
97
@duckdb
DuckDB
3 months
This blog post is a short summary of the ICDE 2024 ( @icdeconf ) paper authored by @lnkuiper , @peterabcz , and @hfmuehleisen : Robust External Hash Aggregation in the Solid State Age. The paper is available at
@duckdb
DuckDB
3 months
New blog post by @lnkuiper – No Memory? No Problem. External Aggregation in DuckDB The post describes how DuckDB can efficiently aggregate over many more groups than fit in memory, allowing it to complete the 50 GB variant of the
2
34
191
0
20
98
@duckdb
DuckDB
5 months
Did you know that you can use DuckDB to query Google Sheets via its CSV export? This blog post at areca data explains how:
2
16
97
@duckdb
DuckDB
3 months
DuckDB can both export to and import from Numpy arrays. We added two new guides to cover these features:
5
15
95
@duckdb
DuckDB
9 days
DuckDB's co-creator @hfmuehleisen announced support for Delta Lake ( @DeltaLakeOSS ) in DuckDB at last week's @Data_AI_Summit . You can rewatch the keynote segment below: For more information, see the delta extension's documentation:
1
11
97
@duckdb
DuckDB
17 days
DuckDB's co-creator @hfmuehleisen will give a keynote tomorrow at @GOTOamst . Hannes, who is also a professor of data engineering at @Radboud_Uni , will give an overview of the last decades of data management, discuss why relational systems are still prevailing, and why
2
13
94
@duckdb
DuckDB
1 year
New post by @lnkuiper : Shredding deeply nested #JSON one vector at a time Querying JSON as a table is as easy as SELECT * FROM 'file.json'; It's fast too, thanks to DuckDB's lists/structs and the yyjson parser by @ibireme .
3
27
92
@duckdb
DuckDB
7 months
We rolled out an updated syntax highlighter and a new color scheme in the DuckDB documentation, . The highlighter now knows all of DuckDB's keywords and functions. The color scheme is based on the Bluloco theme ().
2
11
91
@duckdb
DuckDB
1 month
PS: DuckDB v1.0.0 will be released this summer.
2
15
91
@duckdb
DuckDB
27 days
New blog post by @szarnyasg : Analyzing Railway Traffic in the Netherlands This tutorial demonstrates some of DuckDB's key query features using datasets that capture the railway traffic in the Netherlands.
3
18
91
@duckdb
DuckDB
3 months
Did you know that DuckDB supports function chaining? This allows function calls to be rewritten in a more readable manner. See the Even Friendlier SQL with DuckDB blog post for details:
0
13
89
@duckdb
DuckDB
24 days
As promised just 1.5 weeks ago, we have indeed released DuckDB v1.0.0 this summer ☀️
4
13
84
@duckdb
DuckDB
29 days
“With DuckDB as a browser for the data cloud, relational datasets are always just a hyperlink away.” – That's a great line. Thanks for this nice blog post, @NikolasGoebel !
@NikolasGoebel
Nikolas Göbel
30 days
@duckdb doesn't need data to be a database (and to me that's what makes it special)
2
10
64
2
5
82
@duckdb
DuckDB
1 month
We have released DuckDB v0.10.3, a bugfix release. The command 'pip install duckdb --upgrade' already delivers the new version. DuckDB clients in other package management systems (CRAN, Maven, Homebrew, etc.) will be updated in the coming days. For the release notes and binary
2
15
81
@duckdb
DuckDB
10 months
We created a single, easy-to-search PDF file from the DuckDB documentation: This is a preview version – if you find any issues, please report them at
0
17
78
@duckdb
DuckDB
2 years
Team DuckDB is giving a talk and running a booth @PydataEindhoven today. Drop by for some swag!
2
5
72
@duckdb
DuckDB
26 days
@hfmuehleisen
Hannes Mühleisen
26 days
. @duckdb trending on @github today ^^
1
2
78
0
4
73
@duckdb
DuckDB
21 days
🚨 A reminder for our old users and a pointer to our new followers: the DuckDB CLI client has a tldr page. If you have @tldr_pages installed, you can get examples of the most common command line arguments with: $ tldr duckdb
0
8
72
@duckdb
DuckDB
5 months
We extended our performance guide with a new recommendation: avoid joining on VARCHAR-typed columns (i.e., strings). The accompanying microbenchmark demonstrates a case where performing a large join on BIGINT columns is 2.6× faster than evaluating the same join on VARCHAR
2
9
69
@duckdb
DuckDB
7 months
DuckDB's co-creator Hannes Mühleisen gave a talk this week at the Hasso-Plattner-Institut @HPI_DE titled "Two Tier Architectures are Anachronistic". The recording is now available online.
0
10
66
@duckdb
DuckDB
3 years
DuckDB passed 4k stars on GitHub, yay!
0
6
65
@duckdb
DuckDB
7 months
The recording of Hannes Mühleisen's DuckDB talk at @pydataamsterdam is now available online.
1
8
66
@duckdb
DuckDB
1 year
#DuckCon lightning talk recordings: " @lakeFS ❤️ DuckDB" by @ozkatz100 "Using DuckDB and ArrowFlight to Power a Feature Store" by @tdoehmen from @hopsworks "Integrating DuckDB with @fluvio_io " by Sehyo Chang
0
18
63
@duckdb
DuckDB
2 years
Next release 0.5.0 is planned for 2022-09-05, major changes, now would be a good time to test with the latest master branch and report issues if any.
2
9
63
@duckdb
DuckDB
14 days
Today's keynote at the @Data_AI_Summit will (again) feature DuckDB co-creator @hfmuehleisen , who will talk about DuckDB's support for Delta Lake ( @deltalakeoss ). Follow the live stream (starting at 8.30am Pacific Time, in approximately half an hour):
1
13
62
@duckdb
DuckDB
6 months
DuckCon #4 will feature a talk by @polinaeterna of @huggingface titled “Hugging a Duck: democratizing data access and exploration with DuckDB and Hugging Face Hub”. The talk will explain how they use DuckDB to allow people to easily explore over 250k public datasets on the
0
13
61
@duckdb
DuckDB
3 months
The recordings of DuckCon #4 's 5-minute lightning talks are available. See the full playlist at
1
11
61
@duckdb
DuckDB
2 months
We just released DuckDB v0.10.2, a bugfix release. See the installation page and the release notes for details: – –
1
8
59
@duckdb
DuckDB
3 months
A reminder: DuckDB has a tldr page. If you have @tldr_pages installed, you can get examples of the most common command-line arguments by running $ tldr duckdb
0
9
60
@duckdb
DuckDB
3 months
We have released DuckDB v0.10.1, a bugfix release. For installation instructions, see: This release fixes several issues with the CSV parser and tackles scenarios which previously resulted in out-of-memory (OOM) errors (details in 🧵).
1
12
57
@duckdb
DuckDB
5 months
We held DuckCon #4 today in Amsterdam. Thanks to all speakers and attendees for making this an amazing event, and to @RillData for sponsoring the drinks & snacks! The speaker decks are available on the event's site: The recordings will be published in
0
5
58
@duckdb
DuckDB
5 months
DuckDB's DevRel, @szarnyasg , gave a talk last November at the @oredev conference titled "DuckDB: Harnessing in-process analytics for data science and beyond". The recording is now available: The slide deck is here:
0
7
56
@duckdb
DuckDB
4 months
A new DuckDB article is out on Datanami with quotes from @hfmuehleisen : DuckDB Walks to the Beat of Its Own Analytics Drum “DuckDB has this different angle,” Mühleisen said. “It’s more like something that you put into a workflow rather than something
0
9
54
@duckdb
DuckDB
9 months
We have released DuckDB v0.9.1 today. This is a bug fix release for various issues discovered after we released 0.9.0. There are no new features, just bug fixes. Database files created by DuckDB v0.9.0 can be read by DuckDB v0.9.1.
1
11
56
@duckdb
DuckDB
1 month
Laurens Kuiper will present his paper "Robust External Hash Aggregation in the Solid State Age" tomorrow at ICDE 2024 in Utrecht. This work describes the techniques that make larger-than-memory aggregation possible in DuckDB. The paper is co-authored by Peter Boncz ( @peterabcz )
0
10
54
@duckdb
DuckDB
5 months
DuckCon #4 is now underway with @mraasveldt and @hfmuehleisen talking about the state of DuckDB.
3
5
56
@duckdb
DuckDB
5 months
DuckCon #4 is next week in Amsterdam, on Feb 2 (Friday). Subash Roul from @fivetran is going to talk about building data lakes using DuckDB. See the rest of the talks and the registration link at
0
11
54