Pedro Holanda @holanda_pe profile

Pedro Holanda

@holanda_pe

Followers

2K

Following

2K

Statuses

537

Ph.D. in Database Architectures. Turning knobs for a living.

Amsterdam

Joined February 2020

Don't wanna be here? Send us removal request.

Pedro Holanda

@holanda_pe

4 days

RT @duckdb: We are happy to announce DuckDB v1.2.0 “Histrionicus”! The new release has several usability, security and performance improve…

0

47

0

Pedro Holanda

@holanda_pe

5 days

RT @mehd_io: Once again, I was faced with the unfortunate reality of parsing CSVs. Fortunately, @duckdb's excellent work on their CSV sniff…

0

7

0

Pedro Holanda

@holanda_pe

8 days

RT @yusuktan: 仕事でCSVをちょこっと解析する必要があって、DuckDBを初めて使ってみたらとても体験が良かった

0

1

0

Pedro Holanda

@holanda_pe

9 days

@matsonj @duckdb How else should I know is time to buy spx6900

1

0

2

Pedro Holanda

@holanda_pe

15 days

RT @jamesacowling: Got to sit down with Professor @andy_pavlo on the latest @convex_dev Databased podcast for his 2024 databases year in re…

0

17

0

Pedro Holanda

@holanda_pe

17 days

RT @duckdb: New blog post: Query Engines: Gatekeepers of the Parquet File Format In this post, Laurens Kuiper argues that we are wasting…

0

39

0

Pedro Holanda

@holanda_pe

22 days

I had a little adventure investigating a SIGSEGV when loading DuckDB with RocksDB/TensorFlow on Linux. This led to a spiral of checking leaked symbols, and learning that random_device has two different interfaces, one in libc++ and another in libstdc+, which can cause symbol collisions. 😄

0

8

Pedro Holanda

@holanda_pe

22 days

@caiocgomes @kaitou_renegade Interessante! Por curiosidade, você tem um exemplo de plano de consulta do seu modelo, ou um benchmark que possa compartilhar? 😄 Se for melhor em pvt, pedro@duckdblabs.com

1

0

Pedro Holanda

@holanda_pe

23 days

Sem duvidas Caio! Eu só queria fazer um teaser de que muitas vezes o que parece ser um volume de dados necessário de uma solução distribuída, pode não ser exatamente o que parece. Ambas ferramentas tem seus propósitos, vantagens e desvantagens :-) Mas eu tive que subir um cluster spark em 2015 no meu mestrado, talvez dai venha meu trauma 😂.

1

0

1

Pedro Holanda

@holanda_pe

23 days

@kaitou_renegade @caiocgomes A gente tem suporte tanto pra iceberg ( quanto pra delta ( e 2025 promete pra ambos 😄

1

0

1

Pedro Holanda

@holanda_pe

24 days

@kaitou_renegade @caiocgomes E mesmo em uma tabela wide, com várias colunas, por conta de projection pushdown, a quantidade de dados que você carrega é bem menor, IO de disco desce muito, alguns operadores podem até executar direto no dado comprimido, etc. Pra precisar subir um cluster de spark vai chão 😅

1

0

1

Pedro Holanda

@holanda_pe

24 days

@wileycwj After I got a CSV file exported from a big Dutch bank with unescaped quotes in quoted values I lost all my hope. And then I sold the shares I had 😂.

0

Pedro Holanda

@holanda_pe

24 days

Last year, I added a new integer type to DuckDB called VARINT, capable of storing up to 1,262,612 digits. I initially implemented a quadratic algorithm to convert it to VARCHAR, which was slow. (My plan was to eventually get inspired by CPython.) However, an external contributor beat me to it: They even referenced "The Art of Computer Programming" and implemented a solution that's more than an order of magnitude faster than my naive approach. Just excellent!

0

7

71

Pedro Holanda

@holanda_pe

24 days

@wileycwj I kid you not, after supporting encodings, I think this is the most requested option. At least it’s not multibyte quotes/escapes (yet). 😅

1

0

1

Pedro Holanda

@holanda_pe

30 days

RT @duckdb: New blog post by @__AlexMonahan__ Vertical Stacking as the Relational Model Intended: UNION ALL BY NAME Alex takes us on a jo…

0

9

0

Pedro Holanda

@holanda_pe

1 month

“DuckDB has entered the zeitgeist as the default choice for someone wanting to run analytical queries on their data. Pandas previously held DuckDB's crowned position.”

Andy Pavlo (@andypavlo.bsky.social)

@andy_pavlo

1 month

Buckle up because we're crashing into the new year with my annual database retrospective: License change blowbacks! @databricks vs. @SnowflakeDB gangwar! @DuckDB shotgun weddings! Buying a college quarterback with database money for your new lover!

0

6

47

Pedro Holanda

@holanda_pe

2 months

RT @duckdb: The DuckDB repository just reached 25,000 stars on GitHub. We used this occasion to stop for a moment and reflect on the projec…

0

5

0