Welcome to Tuplex!


Tuplex is a new framework for processing larger-than-memory datasets in a monadic programming paradigm, similar to Apache Spark or Apache Flink. Under the hood, it uses whole-stage code generation to speed up processing, delivering performance comparable to a pipeline hand-written in C and compiled to a native executable. Furthermore, it lets users handle exceptions in a novel way, which improves overall productivity and makes it easier to run complex, data-intensive ETL pipelines. Tuplex is currently developed within the Database Management Group at Brown University.
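
To make the programming model concrete, here is a minimal pure-Python sketch of a chained ("monadic") pipeline in which rows that raise an exception are set aside rather than aborting the job, and can be repaired later. It does not use Tuplex and performs no code generation. The class `ToyDataset` and its methods `map`, `resolve`, and `collect` are hypothetical names chosen for this illustration, and the set-aside-and-repair behaviour is an assumption about what this style of exception handling could look like; consult the Tuplex documentation for the actual API.

```python
class ToyDataset:
    """Hypothetical illustration only; this class is not part of Tuplex."""

    def __init__(self, values):
        # Each row remembers its original position so order can be restored later.
        self._rows = list(enumerate(values))
        # Rows whose processing raised: (position, input value, exception).
        self._failed = []

    def map(self, fn):
        """Apply fn to every row; rows that raise are set aside, not fatal."""
        # Simplification: failures left unresolved from an earlier stage are dropped.
        out = ToyDataset([])
        for pos, value in self._rows:
            try:
                out._rows.append((pos, fn(value)))
            except Exception as exc:
                out._failed.append((pos, value, exc))
        return out

    def resolve(self, exc_type, handler):
        """Repair set-aside rows whose exception matches exc_type."""
        out = ToyDataset([])
        out._rows = list(self._rows)
        for pos, value, exc in self._failed:
            if isinstance(exc, exc_type):
                out._rows.append((pos, handler(value)))
            else:
                out._failed.append((pos, value, exc))
        # Put repaired rows back at their original positions.
        out._rows.sort(key=lambda item: item[0])
        return out

    def collect(self):
        """Return the successfully processed values in input order."""
        return [value for _, value in self._rows]


pairs = [(1, 0), (2, 1), (3, 0), (4, -1)]
divided = ToyDataset(pairs).map(lambda p: p[0] / p[1])

# Without a resolver, the two rows that divide by zero are simply missing.
without_resolver = divided.collect()

# With a resolver, those rows are repaired and reinserted in order.
with_resolver = divided.resolve(ZeroDivisionError, lambda p: 0.0).collect()
```

In this sketch each operator returns a new dataset, so stages can be chained, and a failing row never stops the rest of the pipeline from running.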