# sqsx 🚀
[![Tests](https://github.com/allisson/pysqsx/actions/workflows/tests.yml/badge.svg?branch=main)](https://github.com/allisson/pysqsx/actions/workflows/tests.yml)
![PyPI - Version](https://img.shields.io/pypi/v/sqsx)
![PyPI - Python Version](https://img.shields.io/pypi/pyversions/sqsx)
![GitHub License](https://img.shields.io/github/license/allisson/pysqsx)

A simple, robust, and thread-safe task processor for Amazon SQS. 💪

## ✨ Features

- 🔒 **Thread-Safe**: Built-in locks protect shared state in multi-threaded environments
- 🔄 **Resilient**: Automatic retry with exponential backoff for transient failures
- 🛑 **Graceful Shutdown**: Clean shutdown on SIGINT/SIGTERM with proper resource cleanup
- 📦 **Context Manager Support**: Use `with` statements for automatic cleanup
- 📏 **Message Size Validation**: Enforces SQS 256KB message limit
- 🏭 **Production Ready**: Comprehensive error handling for SQS API failures
- ✅ **Type Validated**: Pydantic-based configuration validation
- ⚡ **High Performance**: Messages acknowledged as they complete (not batch-blocked)
- 📚 **Well Documented**: Comprehensive docstrings for all public APIs
- 🧪 **Fully Tested**: 59 tests with 100% pass rate

## 🚀 Quickstart

For this demonstration we will run elasticmq locally using Docker:

Install the package:

```
pip install sqsx
```

### 📋 Working with sqsx.Queue

Use sqsx.Queue when you need to schedule and consume tasks, each dispatched to its own handler.

```
DEBUG:sqsx.queue:Waiting some seconds because no message was received, seconds=1
INFO:sqsx.queue:Stopping consuming tasks, queue_url=http://localhost:9324/000000000000/tests
```

### 🔧 Working with sqsx.RawQueue

Use sqsx.RawQueue when you need a single handler to consume all messages from the queue.

```
DEBUG:sqsx.queue:Waiting some seconds because no message was received, seconds=1
INFO:sqsx.queue:Stopping consuming tasks, queue_url=http://localhost:9324/000000000000/tests
```

## 🎯 Advanced Usage

### 🗂️ Using Context Managers

Both `Queue` and `RawQueue` support context managers for automatic resource cleanup:

```python
from sqsx import Queue

# Context manager ensures proper cleanup
with Queue(url=queue_url, sqs_client=sqs_client) as queue:
    queue.add_task_handler("my_task", task_handler)
    queue.add_task("my_task", a=1, b=2, c=3)
    queue.consume_messages(run_forever=False)
# Resources are automatically cleaned up when exiting the context
```

### ⚡ Concurrent Processing

Process multiple messages concurrently using threads:

```python
# Process up to 10 messages at once with 5 worker threads
queue.consume_messages(
    max_messages=10,  # Fetch up to 10 messages per batch
    max_threads=5,    # Process with 5 concurrent threads
    run_forever=True,
)
```

**🚀 Performance Optimization**: Messages are acknowledged as they complete (using `as_completed()`), not waiting for the slowest message. This means fast messages are acknowledged immediately, improving overall throughput.
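The ack-as-complete pattern can be sketched with `concurrent.futures` (an illustration of the idea, not sqsx's internal code):

```python
import time
from concurrent.futures import ThreadPoolExecutor, as_completed

def handle(msg: str) -> str:
    # Simulate one slow and one fast message
    time.sleep(0.2 if msg == "slow" else 0.01)
    return msg

acked = []
with ThreadPoolExecutor(max_workers=2) as pool:
    futures = [pool.submit(handle, m) for m in ("slow", "fast")]
    for future in as_completed(futures):  # yields each future as it finishes
        acked.append(future.result())     # acknowledge that message right away

# "fast" is acknowledged first even though "slow" was submitted first
```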

**⚠️ Important**: For optimal performance with `max_threads > 1`, configure boto3 connection pooling:

```python
from botocore.config import Config

config = Config(
    max_pool_connections=5,  # Match your max_threads value
    retries={'max_attempts': 3, 'mode': 'standard'},
)
sqs_client = boto3.client('sqs', config=config, ...)
```

Without connection pooling, threads will compete for a single connection, reducing throughput. Always set `max_pool_connections` to at least your `max_threads` value. 📊

### 🛑 Programmatic Graceful Shutdown

Trigger graceful shutdown programmatically:

```python
import threading
import time

def shutdown_after_delay():
    time.sleep(30)  # Wait 30 seconds
    queue.exit_gracefully()

# Trigger the shutdown from a background thread
shutdown_thread = threading.Thread(target=shutdown_after_delay)
shutdown_thread.start()

# Start the consumer
queue.consume_messages(
    run_forever=True,
    enable_signal_to_exit_gracefully=False,  # Disable signal handlers
)

shutdown_thread.join()
```

### ⚙️ Configuration Options

Configure backoff behavior and queue parameters:

```python
queue = Queue(
    url=queue_url,
    sqs_client=sqs_client,
    min_backoff_seconds=30,   # Minimum retry delay (default: 30)
    max_backoff_seconds=900,  # Maximum retry delay (default: 900, max: 43200)
)
```

The backoff calculator uses exponential backoff: `timeout = min(min_backoff * 2^retries, max_backoff)`
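As a sketch, the formula translates to:

```python
def backoff_timeout(retries: int, min_backoff: int = 30, max_backoff: int = 900) -> int:
    # timeout = min(min_backoff * 2^retries, max_backoff)
    return min(min_backoff * (2 ** retries), max_backoff)

# With the defaults the delay doubles each retry until it hits the cap:
# 30, 60, 120, 240, 480, then 900 from the sixth retry onward
delays = [backoff_timeout(r) for r in range(6)]
```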

### 🎛️ consume_messages() Parameters

Fine-tune message consumption behavior:

```python
queue.consume_messages(
    max_messages=1,            # Messages per batch (1-10, default: 1)
    max_threads=1,             # Worker threads (default: 1)
    wait_seconds=10,           # Sleep when no messages (default: 10)
    polling_wait_seconds=10,   # SQS long polling timeout (default: 10)
    run_forever=True,          # Continue until stopped (default: True)
    enable_signal_to_exit_gracefully=True,  # Handle SIGINT/SIGTERM (default: True)
)
```

### ⚠️ Working with exceptions

By default, a message is retried when an exception is raised; you can change this behavior with the `sqsx.exceptions.Retry` and `sqsx.exceptions.NoRetry` exceptions.

```python
from sqsx.exceptions import NoRetry

def task_handler(context: dict, a: int, b: int, c: int):
    # Raising NoRetry tells sqsx not to retry this message
    raise NoRetry()


def message_handler(queue_url: str, sqs_message: dict):
    raise NoRetry()
```

## 🛡️ Error Handling & Resilience

### 🔄 Automatic Retry on Transient Failures

sqsx automatically handles and retries transient SQS API failures:

- **⏱️ Throttling errors**: Automatically retried with a 5-second delay
- **🌐 Network errors**: Connection issues are logged and retried
- **☁️ Service unavailable**: Temporary AWS outages are handled gracefully

```python
# No special code needed - automatic retry is built-in
queue.consume_messages()
```

Error logs will show retry attempts:

```
ERROR:sqsx.queue:SQS API error: ThrottlingException, queue_url=..., retrying...
ERROR:sqsx.queue:Network/connection error: EndpointConnectionError, queue_url=..., retrying...
```

### 📏 Message Size Limits

Messages are automatically validated against SQS limits (256KB):

```python
# Will raise ValueError if message exceeds 256KB
try:
    queue.add_task("my_task", large_data=huge_string)
except ValueError as e:
    print(f"Message too large: {e}")
```
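The check amounts to measuring the encoded message body against the 256KB limit; `check_message_size` below is a hypothetical sketch of that idea, not the sqsx function:

```python
import json

SQS_MAX_MESSAGE_BYTES = 256 * 1024  # the SQS per-message limit

def check_message_size(body: str) -> None:
    # Measure the UTF-8 encoded size, since SQS counts bytes, not characters
    size = len(body.encode("utf-8"))
    if size > SQS_MAX_MESSAGE_BYTES:
        raise ValueError(f"message is {size} bytes; SQS limit is {SQS_MAX_MESSAGE_BYTES}")

check_message_size(json.dumps({"task": "my_task", "a": 1}))  # small payload: no error
```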

### 🔄 Graceful Shutdown Behavior

When shutdown is triggered (SIGINT, SIGTERM, or `exit_gracefully()`):

1. ⛔ **Stop flag is set**: No new message batches are fetched
2. ✅ **Active tasks complete**: All currently processing messages finish
3. 🧹 **Clean resource cleanup**: Handlers are cleared, signal handlers restored
4. ⚡ **Fast response**: Stop flag checked every 100ms during idle periods

This ensures no messages are lost or left in a processing state during shutdown.
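The sequence above can be sketched with a `threading.Event` acting as the stop flag (a generic illustration, not sqsx's internals):

```python
import threading

stop = threading.Event()

def consume_loop():
    while not stop.is_set():
        # Fetch and process a batch of messages here...
        stop.wait(0.1)  # re-check the stop flag every 100ms while idle

worker = threading.Thread(target=consume_loop)
worker.start()

stop.set()     # 1. set the stop flag: no new batches after this
worker.join()  # 2. wait for in-flight work to finish before exiting
```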

## 🔒 Thread Safety

sqsx is fully thread-safe for concurrent message processing:

- 🔐 **Shared state protection**: All shared data structures use locks
- ✅ **Safe handler registration**: Handlers can be added during message processing
- 🤝 **Coordinated shutdown**: Stop flag properly synchronized across threads

Example with concurrent processing:

```python
# Safe to use with multiple threads
queue.consume_messages(max_messages=10, max_threads=5)

# Safe to add handlers while processing (in another thread)
queue.add_task_handler("new_task", new_handler)
```
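The lock-protected handler registration can be sketched like this (an illustration of the pattern, not sqsx's actual class):

```python
import threading

class HandlerRegistry:
    """Shared handler map guarded by a lock for concurrent access."""

    def __init__(self):
        self._lock = threading.Lock()
        self._handlers = {}

    def add(self, name, fn):
        with self._lock:  # safe to call while other threads are reading
            self._handlers[name] = fn

    def get(self, name):
        with self._lock:
            return self._handlers.get(name)
```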

## 💡 Best Practices

1. **🗂️ Use context managers** for automatic cleanup:
   ```python
   with Queue(url=queue_url, sqs_client=sqs_client) as queue:
       # Your code here
       pass
   # Automatically cleaned up
   ```

2. **🔌 Configure connection pooling** for concurrent processing:
   ```python
   config = Config(max_pool_connections=max_threads)
   sqs_client = boto3.client('sqs', config=config, ...)
   ```

3. **📦 Keep messages small** (under 256KB) for better performance

4. **⏱️ Use appropriate backoff values** for your use case:
   - Short-lived tasks: `min_backoff_seconds=10, max_backoff_seconds=300`
   - Long-running tasks: `min_backoff_seconds=60, max_backoff_seconds=3600`

5. **🛡️ Monitor and handle exceptions** appropriately in your handlers

6. **🧪 Test graceful shutdown** in your deployment process

## 📦 Requirements

- Python 3.10+
- boto3
- pydantic

## 📄 License

This project is licensed under the MIT License.