DuckDB
@duckdb
Followers
18K
Following
3K
Media
215
Statuses
844
DuckDB is an analytical in-process SQL database management system. "DuckDB" and the DuckDB logo are registered trademarks of the DuckDB Foundation.
Amsterdam, The Netherlands
Joined May 2019
New blog post by @mraasveldt: Multi-Database Support in DuckDB. DuckDB can now attach MySQL, Postgres, and SQLite databases in addition to databases stored in its own format. This allows data to be read into DuckDB and moved between these systems in a
9
82
348
DuckDB was recently covered in @andy_pavlo's Advanced Database Systems course at CMU. The lecture covers DuckDB's history, internals, and integration with other systems. Slides: Recording:
1
66
351
DuckDB 0.7.0 "Labradorius" released with #JSON support, parallel and partitioned export to CSV and Parquet, UPSERT, @DataPolars integration, and much more in our release announcement blog post:
8
51
253
New blog post: Access 150k+ Datasets from Hugging Face with DuckDB. This blog post, co-authored by the @huggingface and DuckDB teams, describes how you can use the hf:// prefix in DuckDB to access datasets in Hugging Face repositories. Read more at
4
56
216
We started a series of blog posts called “DuckDB tricks”. In these posts, we will present some useful shorthands, hacks, and design patterns for DuckDB. In the first part of the series, @szarnyasg explains some techniques for handling floats, fine-tuning the CSV reader, and
6
37
211
DuckDB has introduced native Delta Lake support. In our new blog post, @samansmink walks through the design and implementation of the new Delta Lake extension. Read more at
6
36
210
New blog post by @mraasveldt: Memory Management in DuckDB. This blog post explains how DuckDB manages memory. It covers streaming execution, spilling intermediate results to disk, and the buffer manager. Read more at
0
43
181
New blog post by @szarnyasg: Command Line Data Processing: Using DuckDB as a Unix Tool. This blog post shows how DuckDB stacks up against classic Unix tools (such as cut, grep, sort, and sed) when performing simple data processing steps. Read more at
1
50
179
New blog post by @__AlexMonahan__: Benchmarking Ourselves over Time at DuckDB. The DuckDB team's philosophy is to first ensure correctness, then iterate and optimize to improve performance. This blog explores how this happened over the last three years, when DuckDB became
5
36
176
DuckDB 0.6.0 "Oxyura" released with improved storage, higher performance for CSV loading and indexing, new SQL syntax, better memory management, shell tweaks, and so many new features that @mraasveldt wrote a separate blog post to explain it all:
4
40
161
A copy of the “Getting Started with DuckDB” book – authored by @SimonAubury and @nletcher, and published by @PacktPublishing – just arrived at the @duckdblabs office! Congratulations to the authors on publishing this book.
3
16
160
New blog post by @__AlexMonahan__: DuckDB in Python in the Browser with Pyodide, PyScript, and JupyterLite. In this post, Alex explains how you can set up a fully in-browser DuckDB notebook in seconds using Pyodide.
2
27
146
New blog post by @__AlexMonahan__: SQL Gymnastics: Bending SQL into flexible new shapes. In this post, Alex presents pure SQL queries to implement dynamic groupings and aggregate functions using DuckDB's friendly SQL extensions. The queries can be used to
1
25
140
The Awesome DuckDB repository, maintained by @davidgasquez, has grown to more than 100 entries in less than a year. If you are aware of more cool projects using DuckDB, please consider submitting a PR!
2
27
136
New blog post by @hfmuehleisen: duckplyr: dplyr powered by DuckDB. The post describes the new R package duckplyr, which translates the dplyr API to DuckDB’s execution engine. Read more at
1
36
134
New blog post by @__AlexMonahan__: Creating a SQL-Only Extension for Excel-Style Pivoting in DuckDB. In this beginner-friendly blog post, Alex walks us through creating a DuckDB extension to implement a generic pivot table function with minimal C++ code.
1
17
128
The recording of the talk on DuckDB's spatial library, presented by its author @maxxen at @GeoPythonConf 2024, is now available. DuckDB Spatial: Supercharged Geospatial SQL.
3
22
123
There are now a lot of handy tools and cool projects built around DuckDB. You can find a list of these in the Awesome DuckDB repository maintained by @davidgasquez. See the list and contribute your project at
1
30
126
New blog post by @holanda_pe: CSV Files: Dethroning Parquet as the Ultimate Storage File Format — or Not? In this post, DuckDB's CSV reader faces off against DuckDB's Parquet reader on reading tables and running queries directly on files.
3
15
115
We released DuckDB v1.0.0 a week ago. There is a growing list of tools integrating with DuckDB, applications that use DuckDB, and extensions created for DuckDB. You can find a list of these in the Awesome DuckDB repository, maintained by @davidgasquez. The list is never.
1
16
109
We have started publishing the recordings of DuckCon #4. We are first releasing the “State of the Duck” talk by DuckDB's co-creators, Hannes Mühleisen (@hfmuehleisen) and Mark Raasveldt (@mraasveldt). Video: Slides: Special thanks
1
20
107
𝐃𝐮𝐜𝐤𝐃𝐁 𝐓𝐫𝐢𝐜𝐤𝐬: 𝐏𝐚𝐫𝐭 𝟐. We continue our DuckDB Tricks series with a second part, where @szarnyasg shares some helpful hints for data wrangling: Have a trick you’d like to share? Let us know in the comments! 🦆
1
13
106
DuckDB just got a tldr page. If you have @tldr_pages installed, you can get examples of the most common command-line arguments with: $ tldr duckdb
0
21
104
Congratulations to @motherduck on launching their DuckDB-powered cloud data warehouse in General Availability! 🎉
MotherDuck, the ducking simple data warehouse, is now Generally Available! 🍾🥂 Thank you to our community of thousands of users who have tested, validated, and helped improve MotherDuck over the last year. ❤️🦆
3
10
105
New blog post by @carlo_piovesan: Extensions for DuckDB-Wasm. Thanks to recent developments, DuckDB-Wasm users can now load DuckDB extensions, allowing them to run extensions in the browser.
2
31
97
DuckDB was included in @InfoWorlds's best open-source software list as a "tiny-but-powerful project" that provides just enough OLAP for most use cases. The award praised the lightweight nature and many features of DuckDB.
2
16
95
This blog post is a short summary of the ICDE 2024 (@icdeconf) paper authored by @lnkuiper, @peterabcz, and @hfmuehleisen: Robust External Hash Aggregation in the Solid State Age. The paper is available at
New blog post by @lnkuiper – No Memory? No Problem. External Aggregation in DuckDB. The post describes how DuckDB can efficiently aggregate over many more groups than fit in memory, allowing it to complete the 50 GB variant of the
0
17
93
New blog post by @holanda_pe – DuckDB's CSV Sniffer: Automatic Detection of Types and Dialects.
3
12
92
DuckDB's co-creator @hfmuehleisen announced support for Delta Lake (@DeltaLakeOSS) in DuckDB at last week's @Data_AI_Summit. You can rewatch the keynote segment below: For more information, see the delta extension's documentation:
1
11
90
DuckDB's co-creator @hfmuehleisen will give a keynote tomorrow at @GOTOamst. Hannes, who is also a professor of data engineering at @Radboud_Uni, will give an overview of the last decades of data management, discuss why relational systems are still prevailing, and why
2
10
86
Interested in finding out more about data wrangling with DuckDB? Be sure to watch the keynote from posit::conf, where @hfmuehleisen talks about "Data Wrangling [for Python or R] Like a Boss With DuckDB":
0
10
89
𝗙𝗮𝘀𝘁𝗲𝗿 𝘁𝗼𝗽 𝗡 𝗳𝗼𝗿 𝗮𝗴𝗴𝗿𝗲𝗴𝗮𝘁𝗲 𝗳𝘂𝗻𝗰𝘁𝗶𝗼𝗻𝘀. The end of the week brings us another blog post. @__AlexMonahan__ takes a deep dive into the top N capabilities in DuckDB: Happy Friday!
0
15
89
New blog post by @szarnyasg: Analyzing Railway Traffic in the Netherlands. This tutorial demonstrates some of DuckDB's key query features using datasets that capture the railway traffic in the Netherlands.
3
16
86
𝗖𝗼𝗻𝗰𝘂𝗿𝗿𝗲𝗻𝗰𝘆 𝗰𝗼𝗻𝘁𝗿𝗼𝗹 𝗮𝗻𝗱 𝗹𝗼𝗴𝗴𝗶𝗻𝗴 𝗶𝗻 𝗗𝘂𝗰𝗸𝗗𝗕. Have you ever wondered what the .wal file is when working with DuckDB? Time to find out! This week, the co-creators of DuckDB, @hfmuehleisen and @mraasveldt, wrote a blog post about the Multi-Version.
0
14
83
𝐖𝐨𝐫𝐤𝐢𝐧𝐠 𝐰𝐢𝐭𝐡: 𝐝𝐮𝐜𝐤𝐩𝐥𝐲𝐫. This week’s blog post is brought to you by @hfmuehleisen, who shows how you can optimize the data analysis in R and go from seconds to milliseconds.
1
18
79
“With DuckDB as a browser for the data cloud, relational datasets are always just a hyperlink away.” – That's a great line. Thanks for this nice blog post, @NikolasGoebel!
@duckdb doesn't need data to be a database (and to me that's what makes it special).
2
5
74