The book
Sovereign by Design
Building a Production Query Engine on DataFusion
The complete journey of replacing Trino with a purpose-built Rust query engine. From first DataFusion query to distributed execution over Apache Iceberg, with OIDC auth passthrough, policy-as-plan-rewriting, and zero service accounts. A practitioner’s guide to building data infrastructure you actually own.
PDF & EPUB downloads are being prepared.
Contents
Chapters
- 00 Preface: The Sovereignty Thesis →
- 01 The Catalog Wars →
- 02 Tables Made of Files →
- 03 The Engine You Already Have →
- 04 You Are the Query →
- 05 Speaking Arrow →
- 06 The Catalog Is the API →
- 06b Speaking to Many Catalogs →
- 06c Attaching at Runtime →
- 07 Making dbt Work →
- 08 Writing Is a Contract →
- 09 What You Can't See Can't Hurt You →
- 10 Making It Operable →
- 11 Why Distribute at All →
- 12 Standing on Ballista's Shoulders →
- 13 Neither Trusts the Other →
- 14 Failure Is a Feature →
- 15 Deploying Sovereignty →
- 16 Benchmarks Don't Lie (But They Mislead) →
- 16b The Matrix and the Quiet Bug →
- 16c Following Through →
- 16d The DuckDB Drift →
- 16e The Lineage Trail →
- 17 What We'd Do Differently →
- 18 Epilogue →