CPU-to-Storage I/O Benchmark

Benchmarks for comparing different file I/O implementations in Python and C++. This project measures throughput for parallel read/write operations with various block sizes and thread counts.

The Benchmarks:

blocks: Writing\reading a fixed number of blocks for each block size, changing the overall data for each block size.
data: Writing\reading a fixed number of data (in GB), changing the number of blocks for each block size.
concurrent: Running read and write operations simultaneously to measure combined throughput and performance under mixed workloads. Uses half the total data for reads and half for writes.

Each benchmark can use one of the following I/O implementations:

C++: using custom thread pool.
Python (self implementation): Optimized Python implementation with pre-opened file descriptors.
Python (aiofiles): Async I/O using aiofiles library.
NIXL: using NVIDIA's NIXL library with POSIX backend.

The benchmarks measures throughput for both read and write operations across different:

Block sizes (configurable)
Thread counts (16, 32, 64)
Storage backends (tmpfs /dev/shm or persistent storage)

1. Setup

Create the pod and pvc:

kubeclt apply -f deployment/io_bench_pod.yaml
kubeclt apply -f deployment/io_bench_pvc.yaml      # in case you want to use persistant storage

Copy the files to the pod:

./copy_to_pod.sh

Incase you want to test the C++ option, build the extention:

python setup.py build_ext --inplace

This compiles the C++ utilities in benchmark_cpp_utils/ and creates a cpp_ext Python module.

2. Running Benchmarks

The benchmark script accepts command-line arguments for easy configuration:

# Show help and available options
python compare_file_operations.py --help

# Run with default settings (C++, 100GB buffer, 1000 blocks, 5 iterations)
python compare_file_operations.py

Command-Line Arguments

Argument	Type	Default	Description
`--mode`	str	`blocks`	Benchmark mode: `blocks` for fixed number of blocks, `data` for fixed data size, or `concurrent` for simultaneous read/write
`--backend`	str	`python_self_imp`	Backends: `cpp`, `python_aiofiles`, `python_self_imp`, `nixl`
`--buffer-size`	int	`100`	Buffer size in GB
`--block-sizes`	int list	`2 4 8 16 32 64`	Block sizes in MB to test.
`--iterations`	int	`5`	Number of iterations per test
`--num-blocks`	int	`1000`	Number of blocks to transfer (blocks mode only)
`--total-gb`	int	`100`	Total data size in GB (data and concurrent modes)
`--test-name`	str	`""`	Add to results' file name
`--verify`	flag	`False`	Verify file contents after write/read operations

Checkpoints and Resumption

The benchmark system includes automatic checkpoint functionality that allows you to resume interrupted benchmarks:

Automatic Saving: Results are saved incrementally after each test combination (thread count × block size) completes.
Resume on Restart: If a benchmark is interrupted, simply run the same command again - it will automatically detect existing results and skip completed tests.
Configuration Validation: The system verifies that the configuration matches before resuming (buffer size, iterations, block sizes, etc.)
Atomic Writes: Results are written atomically to prevent corruption if the process is killed during a save

Environment Variables

Configure storage path and cluster name:

export CLUSTER_NAME=my-cluster

# Use tmpfs (memory-backed storage)
export STORAGE_PATH=/dev/shm

# Or use persistent storage
export STORAGE_PATH=/mnt/persistent-storage

Note: If not set, defaults are:

STORAGE_PATH: /dev/shm (tmpfs)
CLUSTER_NAME: unknown

3. Plotting Results

Copy the all the results from the pod to your local machine:

./copy_to_pod.sh --get-results

Results will be copied to ./results/ directory.

The plotter accepts the benchmark mode and one or more result files:

python plotter.py <mode> <file1> [file2] [file3] ...

Examples

Example 1: Data Mode Benchmark

Here is an example of testing cpp, python_self and nixl's backend for writing/reading 100GB at a time from tmpfs and later plotting the results:

Change the path to tmpfs and compile cpp extention:

export STORAGE_PATH=/dev/shm
python setup.py build_ext --inplace

Run the three tests:

python compare_file_operations.py --mode data --backend python_self_imp --test-name example
python compare_file_operations.py --mode data --backend cpp --test-name example
python compare_file_operations.py --mode data --backend nixl --test-name example

Each test will create a json file in ./results/ directory (backend and test mode will be included in the result's file name).

Copy the results back to local machine:

./copy_to_pod.sh --get-results

Plot the results:

python plotter.py data results/data_example_100gb_memory_cpp.json results/data_example_100gb_memory_python_self_imp.json results/data_example_100gb_memory_nixl.json

This will generate a comparison plot showing throughput vs block size for all three implementations:

Example 2: Concurrent Benchmark

The concurrent benchmark measures performance when read and write operations run simultaneously, simulating real-world mixed workloads:

# Run concurrent benchmark with C++ backend
python compare_file_operations.py --mode concurrent --backend cpp --total-gb 100 --test-name concurrent_test

# Run with Python self implementation
python compare_file_operations.py --mode concurrent --backend python_self_imp --total-gb 100 --test-name concurrent_test

The concurrent mode:

Splits the total data size equally between read and write operations (e.g., 100GB total = 50GB read + 50GB write)
Runs both operations simultaneously to measure combined throughput
Reports individual read/write times and combined throughput
Useful for understanding how the system handles mixed I/O workloads

Results will be saved to results/concurrent_<test_name>_<total_gb>gb_<storage>_<backend>.json.

Copy the results back to local machine:

./copy_to_pod.sh --get-results

Plot the concurrent benchmark results:

python plotter.py concurrent results/concurrent_concurrent_test_100gb_tmpfs_cpp.json results/concurrent_concurrent_test_100gb_tmpfs_python_self_imp.json

The plotter will generate comparison plots showing:

Write throughput under concurrent load
Read throughput under concurrent load
Combined throughput for mixed workloads

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CPU-to-Storage I/O Benchmark

The Benchmarks:

1. Setup

2. Running Benchmarks

Command-Line Arguments

Checkpoints and Resumption

Environment Variables

3. Plotting Results

Examples

Example 1: Data Mode Benchmark

Example 2: Concurrent Benchmark

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
backends		backends
deployment		deployment
plots		plots
utils		utils
README.md		README.md
compare_file_operations.py		compare_file_operations.py
copy_to_pod.sh		copy_to_pod.sh
plotter.py		plotter.py
setup.py		setup.py

Folders and files

Latest commit

History

Repository files navigation

CPU-to-Storage I/O Benchmark

The Benchmarks:

1. Setup

2. Running Benchmarks

Command-Line Arguments

Checkpoints and Resumption

Environment Variables

3. Plotting Results

Examples

Example 1: Data Mode Benchmark

Example 2: Concurrent Benchmark

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages