Team 5 Distributed Data Assignment 2025

We have implemented two different approaches: one using the synchronous Saga pattern and the other following an asynchronous event-driven architecture.

We would appreciate it if you could also briefly look at the event-driven implementation, as it also provides fault tolerance while ensuring consistency. The main difference is that logging for the checkout workflow is better implemented in this branch, which is why we use it as main. The event-driven approach, however, achieves a slightly higher RPS (requests per second).

The event-driven approach is implemented in the rabbitmq-final branch.

Saga Workflow and Orchestration

This repository contains our primary implementation of a distributed checkout system using the Saga pattern with orchestration.

Key Components

Implemented a central orchestrator in the order-service, which:
- Initiates stock subtraction (stock-service)
- Initiates payment deduction (payment-service)
- Finalizes the order upon success
- Performs compensation in case of failure (refund payment or restore stock)
Followed an orchestration-based pattern: order-service coordinates the workflow directly through synchronous HTTP requests.
Workflow is synchronous and request-driven, which keeps the flow deterministic and easier to debug.
Leveraged Redis pipelines and optimistic locking (WATCH / MULTI / EXEC) in stock-service and payment-service to:
- Prevent race conditions in concurrent updates
- Ensure atomicity and consistency in high-concurrency scenarios
- Support safe retries when conflicts are detected
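
As an illustration of the orchestration described above, here is a minimal Python sketch of an orchestrated saga with compensation. It is not the repository's code: `SagaStep`, `StepFailed`, and `run_checkout_saga` are hypothetical names, and the step callables stand in for the synchronous HTTP requests that order-service would send to stock-service and payment-service.

```python
from dataclasses import dataclass
from typing import Callable, List


class StepFailed(Exception):
    """Raised by a step when the remote service rejects the request (hypothetical)."""


@dataclass
class SagaStep:
    name: str
    action: Callable[[], None]        # e.g. a wrapper around an HTTP call that subtracts stock
    compensation: Callable[[], None]  # e.g. a wrapper around an HTTP call that restores stock


def run_checkout_saga(steps: List[SagaStep]) -> bool:
    """Run the steps in order; on failure, undo the completed ones in reverse order."""
    completed: List[SagaStep] = []
    for step in steps:
        try:
            step.action()
        except StepFailed:
            # Compensate only the steps that actually succeeded, newest first.
            for done in reversed(completed):
                done.compensation()
            return False
        completed.append(step)
    return True  # every step succeeded, so the order can be finalized
```

With a stock step followed by a payment step, a failed payment triggers only the stock compensation, which matches the "refund payment or restore stock" behavior listed above.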

Highlights

Embedded orchestrator: There is no standalone orchestrator service. Instead, order-service drives the workflow, coordinating the saga while also being a domain-bound participant. The design is therefore orchestrated rather than choreographed, with the orchestrator living inside a participating service.
Synchronous orchestration simplifies tracing, error handling, and debugging.
Optimistic concurrency control using Redis detects conflicting updates and retries them, maintaining performance without holding locks.
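
The WATCH / MULTI / EXEC pattern mentioned above can be sketched with redis-py roughly as follows. This is an assumption-laden illustration rather than the code in stock-service: the function name `subtract_stock`, the `InsufficientStock` exception, the `stock:<item_id>` key layout, and the retry limit are all hypothetical. The `pipeline`, `watch`, `multi`, `execute`, and `WatchError` names are the ones redis-py provides.

```python
try:
    from redis import WatchError
except ImportError:
    # Fallback so the sketch can be loaded without redis-py installed.
    class WatchError(Exception):
        pass


class InsufficientStock(Exception):
    """Hypothetical error for a subtraction that would go below zero."""


def subtract_stock(r, item_id, amount, max_retries=5):
    """Optimistically subtract `amount`, retrying if another client wrote first."""
    key = f"stock:{item_id}"  # hypothetical key layout
    with r.pipeline() as pipe:
        for _ in range(max_retries):
            try:
                pipe.watch(key)                    # WATCH: EXEC fails if key changes
                current = int(pipe.get(key) or 0)  # immediate read while watching
                if current < amount:
                    pipe.unwatch()
                    raise InsufficientStock(item_id)
                pipe.multi()                       # MULTI: start queuing commands
                pipe.set(key, current - amount)
                pipe.execute()                     # EXEC: atomic, or raises WatchError
                return current - amount
            except WatchError:
                continue                           # conflict detected, re-read and retry
    raise RuntimeError(f"too many conflicts on {key}")
```

No lock is held between the read and the write. A concurrent update only causes the transaction to abort and be retried, which is why this approach stays cheap when conflicts are rare.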

Logging and Recovery Logic

To ensure fault tolerance and state recoverability, each service implements logging:

All state transitions (e.g., order placed, stock reserved, payment processed) are written to log files:
```
logging/
├── order_log.txt
├── stock_log.txt
└── payment_log.txt
```
These logs are:
- Append-only and human-readable
- Chronologically ordered for replay
- Used for recovery after crashes
On startup or failure recovery, each service runs:
```
./start_redis_with_recovery.sh
```
This script:
- Parses the service’s log file
- Reconstructs Redis state deterministically
- Allows the service to resume as if it never crashed
This decentralized logging model ensures:
- Fast recovery with no coordination overhead
- No single point of failure
- Replay-safe execution aligned with the last committed operation
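
To make the append-and-replay idea concrete, here is a minimal Python sketch. The actual log format and the contents of `start_redis_with_recovery.sh` are not shown in this README, so everything here is an assumption: one JSON object per line, each recording the absolute value of a key after a state transition, with a plain dict standing in for the Redis state. `append_event` and `replay` are hypothetical names.

```python
import json
import os


def append_event(log_path, event):
    """Append one state transition as a JSON line and force it to disk."""
    with open(log_path, "a", encoding="utf-8") as f:
        f.write(json.dumps(event) + "\n")
        f.flush()
        os.fsync(f.fileno())  # make the record durable before acknowledging


def replay(log_path):
    """Rebuild the key/value state by applying log records in order."""
    state = {}
    if not os.path.exists(log_path):
        return state
    with open(log_path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue
            try:
                event = json.loads(line)
            except json.JSONDecodeError:
                # Incomplete record, e.g. from a crash mid-write:
                # stop at the last complete record.
                break
            # Records hold absolute values, so replaying twice gives the same state.
            state[event["key"]] = event["value"]
    return state
```

Because each record stores the resulting value rather than a delta, replaying the same log more than once yields the same state, which is one simple way to get the replay-safe behavior described above.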
