Nikos Karayannidis @dbfunctor
Senior Data Zookeeper | Data Engineering Lead@Chubb | Working on the formal verification of data pipelines - Views are my own orcid.org/0009-0007-7413… Greece Joined October 2016-
Tweets150
-
Followers75
-
Following637
-
Likes451
7/7 Paper, proofs & appendices: arxiv.org/abs/2601.00995 Feedback from the DB-theory and data-engineering folks very welcome. 🙏 #dataengineering #databases #typetheory
6/7 It's a mathematical proof, not a test: correctness for ALL inputs, from the type structure alone — without running a single query (zero compute cost on cloud warehouses). All theorems machine-checked in Lean 4; cross-validated against PostgreSQL.
5/7 That atom determines how two data types integrate (their entity key) and the canonical way to read/write their elements (behavioral class). And it scales: grain propagates through a whole pipeline DAG — hundreds of operations — verifying the output grain matches the target.
4/7 New preprint: we make grain formal. Grain theory is a type-theoretic framework that defines grain on *any* algebraic data type — relational data, but also recursive types (lists, trees) and streams of unbounded data. Grain is the atom of a data type.
3/7 Fan traps & chasm traps (silent data loss) are just symptoms of one problem: a transformation that misaligns grain — the level of detail of data. Grain's been around since Kimball, but only informally: prose, fact tables only, no rules for how transformations change it.
2/7 Join is on customer+date, but the tables sit at customer×channel×date vs customer×product×date. So it cross-products the unmatched parts, duplicating rows before aggregation — a fan trap. Invisible to schema checks, type checks, and small-data unit tests.
1/7 This SQL query compiles, runs, and returns the right-shaped result. It also silently inflates both metrics — in production, not in your tests. The reason: the two tables have different *grains*. 🧵
AI is writing a growing share of the world's software. No one is formally verifying any of it. New essay: "When AI Writes the World's Software, Who Verifies It?" leodemoura.github.io/blog/2026/02/2…
excerpt from a longer paper Ensuring safety for powerful learned systems requires a fundamentally different foundation based on mathematically provable constraints on the acts an AI may perform. Such a foundation must rest on a simple principle: we should never trust an AI’s outputs or intentions by default, no matter how competent or aligned it appears; trust must be earned only through verifiable, enforceable proofs of safety for each act. Our assumption that enacting unsafe AI acts is worse than rejecting safe AI acts leads to the central premise of our work, the Universal Declaration of AI Acts: No AI act may be treated as safe unless harmlessness is proven mathematically. Note that this is the mathematical dual of the Universal Declaration of Human Rights: No person may be treated as guilty unless guilt is proven, which is based on the assumption that punishing an innocent person is worse than letting a guilty person go.
@catalinmpit Reduce your sleep hours and work very early in the morning or late at night, or both. In any case you are going to suffer so you must be really determined. Don’t do this everyday to retain some balance.
You don’t hate math. You hate the way it was taught to you. But because you haven’t learned math properly, you confuse correlation and causation, and therefore think that you hate math.
Nice blogpost by @muratdemirbas on the foundational treatment of serializability theory in databases from Chapter 2 of the book Concurrency Control and Recovery in Database Systems (1987) by Bernstein, Hadzilacos, and Goodman. I started reading the post and after a bit of exploration found this foundational 1979 paper on Serializability of Concurrent Database Updates by Christos H Papadimitriou. This paper has been referenced in Bernstein's book as well. Looks like Papadimitriou has also written quite a lot on the theory of database concurrency control .. Looking forward to some weekend readings .. links to blog post and the paper: 👇
🏆 The ACM SIGMOD Test of time award (2025) goes to K-shape: Efficient and accurate clustering of time series John Paparrizos, Luis Gravano 2025.sigmod.org/sigmod_awards.…
New blog post on the mental model I've used when working through complex or confusing papers on transactional systems. transactional.blog/blog/2025-deco…
This paper covering the internals and architecture of @ClickHouseDB is one of the best in database architecture that I read in 2024. Some great insights on SIMD and Multicore parallelisation of the query processing layer, query compilation based on LLVM, the various data structures for aggregation and hash joins, data pruning techniques in the storage layer and lots and lots of information related to the architecture. Loved it ..
The first two videos for @CMUDB's latest seminar series on Database Building Blocks are now posted. You should start off with @andrewlamb1111's fantastic introductory overview to @ApacheDataFusio: youtube.com/watch?v=iJhRbD…
Elizabeth Garrett Christensen (@sqlliz): Window Functions for Data Analysis with Postgres postgr.es/p/6Df
I am thrilled to announce that my book, Functional Design and Architecture, has just been released by @ManningBooks! FINALLY RELEASED!!! 😃😃😃😄😊😊❤️❤️❤️❤️❤️❤️ This has been a long journey, and I sincerely hope this book will make a significant contribution to the functional programming world. 🎇🎇 ❓ Who is this book for? It's for all developers interested in practical functional programming. This book is useful for software architects, senior developers, and everyone else. The model language is Haskell, but the ideas are universal and applicable to languages like Scala, OCaml, F#, and even C# and C++. ❓What is this book about? It’s about applying an engineering approach to functional programming. Design patterns, design principles, application architectures, best practices, approaches, and deep ideas—all combined into a comprehensive and highly consistent methodology for building real-world applications. 📔 Functional Design and Architecture is structured, consistent, well-written, and approachable. I’ve made a special effort to ensure the content is accessible to a wide audience. The narrative is engaging, free of jargon and complex mathematics, and progresses in a friendly, gradual manner. 💡 The ideas are universal; some were known before, but many were developed throughout this project. There was a significant knowledge gap, and this book covers much of it for our benefit. A titanic amount of work went into this book. Specifically, the following were created: 🟠 A full-fledged application framework, Hydra; 🔴 A proof-of-concept platform for creating spaceship management scenarios; 🟡 The methodology of Functional Declarative Design, covering various aspects of design and software architecture in functional programming; 🟢 A unique architectural approach, Hierarchical Free Monads; 🔵 A multitude of new design patterns, approaches, and practices, in addition to those that already existed; 🟣 Several demo applications, included both in the book and in the Hydra framework; 🟤 A wealth of accompanying material: articles, talks, and side projects; ⚪️ And of course, these ideas have been successfully tested in practice in several places. You'll also find many links to other valuable resources in this book because the subject is very broad and deep. I am especially grateful to all those who initiated this movement toward the practical application of functional programming. I stand on the shoulders of giants and deeply appreciate their contributions. On the cover, you’ll find testimonials from these distinguished individuals: 🟡 @ScottWlaschin, author of Domain Modeling Made Functional (Pragmatic Bookshelf) 🟤 @debasishg, author of Functional and Reactive Domain Modeling and DSLs in Action (both from Manning) 🟣 @VBragilevsky @_bravit , author of Haskell in Depth (Manning) I hope you enjoy the book as much as I enjoyed writing it. Bon voyage!
Learning Haskell/FP often expands people’s minds. In a world dominated by Java + Python intro courses, being forced to see computation in a new way is very empowering. There’s almost no downside to learning these techniques and having them in your toolbox.
Yiorgos Adamopoulos @hakmem
2K Followers 3K Following Reading AI Memo 239. Learned amateur. RT != endorsement. I run infrastructures and speak for myself. Bubblegum does not scale. @[email protected]
Laura Ashalintubbi @LauraAsh001
107 Followers 3K Following Single mother to a beautiful daughter, Living my best life in Plano Dallas Texas
Riasui @Riasui882
25 Followers 341 Following
Billy _ jarju @BilaliJarju
271 Followers 2K Following "For God so loved the world that he gave his one and only Son, that whoever believes in him shall not perish but have eternal life @RefugeUtah
Torsmo @TorsmoWrAQ2WP
43 Followers 4K Following
Roman Agabekov @agabekovroman
4K Followers 3K Following Building Releem in public 🧠 AI Database Advisor for MySQL, MariaDB & PostgreSQL
Eder @Do_Widzeni_a
350 Followers 393 Following A Data Engineer who misses Software Engineering.#DataOps ,#AI, #IoT,#ML, #MLOps. #Rust -Enthusiast and #Scala
JoanDora @4ZqBCocXmK1GvP
73 Followers 7K Following
Elizabeth Garrett Chr... @sqlliz
690 Followers 848 Following Doing fun things with Postgres @snowflake via @crunchydata. Married to @pg_dwc.
TinaTyler @woJqURZb24VmQk
68 Followers 7K Following
ferkan @ferkan2694313
26 Followers 893 Following
Dimitris Orfanos @Dimitris_Orf
145 Followers 481 Following UI/UX Designer / Graphic Designer Thessaloniki, Kalamaria https://t.co/xMzdiBTNOD Instagram: dimitris_orfanos_
Piotr Komorowski @piotr_msg
2K Followers 3K Following Software Engineer & Co-Founder at https://t.co/A5xEesglUC and https://t.co/ZP35eOk5fx. Crafting MVPs from dreams with a dash of humor. #SaaS #MVP
Sarah Bugden @SarahBugden3
283 Followers 891 Following SaaS Ent AE. Cultural observations | Likes are not endorsements, opinions are mine & not the views of my employer
Rock60s @babis49
62 Followers 509 Following
HINANO @HINANO87190351
16 Followers 254 Following おふ&みせあい募集垢です👼 仕事柄出会いがないのではじめちゃいました💝 気持ちいぃの大好き💕 ふぉろーくれたらDM送りますね✉💛
Hikari @Hikari91266943
76 Followers 293 Following 彼氏ほしい 22さいFカップ🙈 エステティシャン 気になったひとフォロしてく DMだともっと変態な動画送るよ💘
Manami @Manami74436861
73 Followers 626 Following かまってほしい 24さいGカップ🥺 薬剤師 ふぉろしてね オナしたくなるような動画フォロワーさんに送るね🙈
yuki💛 @yuki22093634
91 Followers 870 Following 大学生をしている20代ですʕ•ᴥ•ʔ よろしくお願いします💙 ちゃんと会えます。裏切ることしたくないので純粋に会ってくれる人にDM返します。 見た目で判断する人は嫌いです!
Atish Panda @atishpanda
145 Followers 2K Following Idolize Snorlax, 🐌 and sloth. The Panda 🐼 who deals with data. Engineer - BI, Data and Analytics
Metaplane by Datadog @metaplane
444 Followers 313 Following Be the first to know of data issues with end-to-end data observability. Changelog ⭐️: https://t.co/wEwm1wEnbD Docs 📋: https://t.co/LxLfQzXiP8
Kristián Balaj @KristianBalaj
876 Followers 888 Following Cardano Blockchain Developer | Web3 Craftsman | Crypto | Decentralization Functional programming fanboy. Clean air enthusiast.
Kostas Pardalis @KostasPardalis
795 Followers 178 Following Building the semantic control plane that grounds AI agents in your data platform | prev: Blendo (CEO, acq. RudderStack), Trino | Always an Engineer
Sebastian @sscdotopen
3K Followers 2K Following Professor of data management for ML at @bifoldberlin. Ex-@UvA_Amsterdam, @NYUDataScience, @Twitter intern; member of @TheASF & @EFF. Views are my own.
Christian Stork @cstork
19 Followers 129 Following
Anurag Gupta @awgupta
574 Followers 248 Following
Benedetta @benecittadin
297 Followers 2K Following Growth & Marketing @Siffletdata | Italian in Paris 🍝
TriggerMesh @triggermesh
3K Followers 4K Following TriggerMesh simplifies building event-driven applications. Your AWS EventBridge open source alternative.
Diana Pfeil @dianam
808 Followers 795 Following machine learning, startups, and figuring out humans | CTO at @Pex
re_data @re_data_labs
1K Followers 4K Following re_data is an open-source & dbt native data reliability framework built for modern data stack
Christos Koutras @ckoutras91
130 Followers 195 Following
Transform Data @transformio
1K Followers 583 Following Data conversations without the question marks. Make metrics the language of data
RudderStack @RudderStack
2K Followers 2K Following Infrastructure to collect, transform, and deliver customer data everywhere it's needed in real time.
Mikko Ohtamaa @moo9000
22K Followers 16K Following Cofounder @tradingprotocol - Tweeting about human and digital rights, economics, blockchain, information security, open source
davidad 🎇 @davidad
23K Followers 10K Following cognizing structures of information processing systems, in all their forms | category theory, perennial philosophy, Bodhitropic Alignment | cancel heat death
Fly on the wall @Strangecrypto13
39 Followers 177 Following
Yağız “Yaz” Erk... @yagiz
3K Followers 3K Following Engineering Leader. Team builder. Cat herder. Mountain mover. Avid reader. YouTuber. Blogger. Coach. Mentor. Beekeeper 🐝. GSL 💛❤️ *DM for Leadership Coaching*



















