Ballista Client

A distributed query execution client built on Apache Arrow Ballista, demonstrating how to connect to a Ballista cluster and execute distributed queries against CSV and Parquet files.

Features

Distributed Query Processing: Leverages Ballista's distributed execution engine
Multiple File Formats: Supports CSV and Parquet data sources
DataFusion Integration: Queries are written against DataFusion's DataFrame API
Production-Ready: Configurable, observable, and thoroughly tested

Quick Start

1. Install Ballista Components

cargo install --locked ballista-scheduler
cargo install --locked ballista-executor

2. Start the Cluster

Start the scheduler (in terminal 1):

RUST_LOG=info ballista-scheduler

Start executor(s) (in separate terminals):

# Executor 1
RUST_LOG=info ballista-executor --bind-port 50051 -c 4

# Executor 2 (optional, for true distributed processing)
RUST_LOG=info ballista-executor --bind-port 50052 -c 4

# Executor 3 (optional)
RUST_LOG=info ballista-executor --bind-port 50053 -c 4

3. Run the Client

# Build and run
cargo run

# Or point the client at a specific scheduler
BALLISTA_SCHEDULER=df://localhost:50050 cargo run

Configuration

Configure the client using environment variables:

BALLISTA_SCHEDULER: Scheduler address (default: df://localhost:50050)
RUST_LOG: Logging level (e.g., info, debug, trace)
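
As an illustration of how a client could resolve these settings, the following is a minimal sketch using only the Rust standard library. The helper names (resolve_scheduler, scheduler_from_env, parse_scheduler) are hypothetical and are not taken from this repository's source. The sketch assumes that a blank BALLISTA_SCHEDULER should fall back to the documented default and that the address always has the form df://host:port.

```rust
use std::env;

/// Default scheduler address, as documented in the Configuration section.
const DEFAULT_SCHEDULER: &str = "df://localhost:50050";

/// Hypothetical helper: picks the scheduler URL from an optional override.
/// Assumption: a missing or blank value means "use the default".
fn resolve_scheduler(raw: Option<String>) -> String {
    match raw {
        Some(v) if !v.trim().is_empty() => v.trim().to_string(),
        _ => DEFAULT_SCHEDULER.to_string(),
    }
}

/// Hypothetical helper: reads BALLISTA_SCHEDULER from the process environment.
fn scheduler_from_env() -> String {
    resolve_scheduler(env::var("BALLISTA_SCHEDULER").ok())
}

/// Hypothetical helper: splits "df://host:port" into host and port.
/// Any other shape is reported as an error instead of being guessed at.
fn parse_scheduler(url: &str) -> Result<(String, u16), String> {
    let rest = url
        .strip_prefix("df://")
        .ok_or_else(|| format!("expected df:// scheme in {}", url))?;
    let (host, port) = rest
        .rsplit_once(':')
        .ok_or_else(|| format!("missing port in {}", url))?;
    if host.is_empty() {
        return Err(format!("missing host in {}", url));
    }
    let port = port
        .parse::<u16>()
        .map_err(|e| format!("invalid port in {}: {}", url, e))?;
    Ok((host.to_string(), port))
}
```

Calling parse_scheduler(&scheduler_from_env()) at startup would surface a malformed address before any connection attempt. Whether the actual client validates the address this way is not stated here.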

Development

# Run tests
cargo test

# Check code quality
cargo clippy

# Format code
cargo fmt

# Security audit (requires cargo-audit: cargo install cargo-audit)
cargo audit

Architecture

See CLAUDE.md for detailed architecture documentation, API notes, and development guidelines.

License

MIT License - see LICENSE for details
