I previously wrote a short blog post about PyIceberg and the Glue Iceberg REST API. This week I saw the announcement of Duckberg, combining all the favorites in a single library: PyIceberg, DuckDB and Iceberg. I re...
I wrote a couple of weeks ago about using DuckDB and Rill Data to explore a new data source that I’m working with. I wanted to understand the data’s structure and distribution of values, as well a...
My previous article was about Duckberg, a combination of PyIceberg and DuckDB for accessing Iceberg tables on AWS. But on Pi Day (14 March) DuckDB released a preview with native Iceberg Glue integratio...
That DuckDB is gaining more and more attention is no accident. As a rising star in desktop analytics, it masters SQL with ease, effortlessly handling CTE recursive queries, multi-layered window functi...
Introduction
DuckDB is widely recognized for its analytical processing capabilities, but its true power lies in its exceptional query engine. Unlike traditional databases that require extensi...
Parquet files offer significant advantages over traditional formats like CSV or JSON, especially for analytical workloads. Tools like parquet-tools and DuckDB make it easy to ...
DuckDB can directly read common files such as CSV, Parquet, and JSON. With just a single SQL statement, it can load the file and perform a query, such as SELECT * FROM 'data.csv' WHERE price>10...
esProc SPL is much more convenient than DuckDB for working with multi-layer JSON data, particularly when the JSON hierarchy must be preserved and complex calculations are required. DuckDB’s ability to op...
Modern Data Stack Updates: April 2025
※This article is an English translation of my original Japanese post. As a consultant specializing in the Modern Data Stack, I've noticed that there's a...
Introduction
DuckDB has been updated to version 1.2.0. 🙌
It seems that various features have been added, so I tried out some of the new functionality mentioned in the official blog.
...