Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.
-
Updated
Jun 8, 2026 - Rust
Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.
Apache Spark Connect Client for Rust
Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!
velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docker Compose.
Extension to {sparklyr} that allows you to interact with Spark & Databricks Connect
A reverse proxy server which allows secure connectivity to a Spark Connect server
Study system for the Databricks Certified Associate Developer for Apache Spark 3.5 exam. 5 interconnected Claude Code skills covering all 7 exam sections, with sources linked to Apache Spark 3.5 docs and Damji's Learning Spark 2nd Edition.
Docker Compose environment for big data research and machine learning development
Collection of articles, using the Literate Programming style, about Data Engineering and Software Tooling in general
Interactive Apache Iceberg debugger and visualization platform for exploring production tables, tracing metadata evolution, and understanding Iceberg internals through real time graph visualization.
Real-time insights on bike availability using the JCDECAUX API. Data flows through Kafka, processed by Spark, and visualized in a Streamlit app. Deployed on Kubernetes.
User-land broadcast-variable workaround for PySpark on Spark Connect / Databricks Serverless
Kotlin adapter for Spark Connect JVM with dual-backend serialization and Unity Catalog integration. Bachelor's thesis implementation.
TypeScript client for Apache Spark Connect
Manage accounts payable priority and predictive risk using this Django-based strategic platform.
Add a description, image, and links to the spark-connect topic page so that developers can more easily learn about it.
To associate your repository with the spark-connect topic, visit your repo's landing page and select "manage topics."