2 changes: 1 addition & 1 deletion README.md
@@ -595,7 +595,7 @@ The following parameters can be used with `LiveServerless` (full remote code exe
| `gpuCount` | Number of GPUs per worker | 1 | 1, 2, 4 |
| `workersMin` | Minimum number of workers | 0 | Set to 1 for persistence |
| `workersMax` | Maximum number of workers | 3 | Higher for more concurrency |
-| `idleTimeout` | Minutes before scaling down | 5 | 10, 30, 60 |
+| `idleTimeout` | Seconds before scaling down | 60 | 300, 600, 1800 |
| `env` | Environment variables | `None` | `{"HF_TOKEN": "xyz"}` |
| `networkVolumeId` | Persistent storage ID | `None` | `"vol_abc123"` |
| `executionTimeoutMs`| Max execution time (ms) | 0 (no limit) | 600000 (10 min) |
2 changes: 1 addition & 1 deletion docs/Flash_SDK_Reference.md
@@ -124,7 +124,7 @@ class ResourceConfig:
# Worker scaling
workersMin: int = 0 # Minimum workers to maintain
workersMax: int = 3 # Maximum workers allowed
-idleTimeout: int = 300 # Seconds before idle worker terminates
+idleTimeout: int = 60 # Seconds before idle worker terminates

# Networking
networkVolumeId: Optional[str] = None # Mount persistent storage
2 changes: 1 addition & 1 deletion src/runpod_flash/cli/utils/skeleton_template/README.md
@@ -128,7 +128,7 @@ The `@remote` decorator transparently executes functions on serverless infrastru

### Resource Scaling
Both workers scale to zero when idle to minimize costs:
-- **idleTimeout**: Minutes before scaling down (default: 5)
+- **idleTimeout**: Seconds before scaling down (default: 60)
- **workersMin**: 0 = completely scales to zero
- **workersMax**: Maximum concurrent workers

@@ -4,7 +4,7 @@
name="cpu_worker",
workersMin=0,
workersMax=1,
-idleTimeout=5,
+idleTimeout=60,
)


@@ -5,7 +5,7 @@
gpus=[GpuGroup.ANY],
workersMin=0,
workersMax=1,
-idleTimeout=5,
+idleTimeout=60,
)


2 changes: 1 addition & 1 deletion src/runpod_flash/core/resources/serverless.py
@@ -156,7 +156,7 @@ class ServerlessResource(DeployableResource):
# === Input Fields ===
executionTimeoutMs: Optional[int] = 0
gpuCount: Optional[int] = 1
-idleTimeout: Optional[int] = 5
+idleTimeout: Optional[int] = 60
instanceIds: Optional[List[CpuInstanceType]] = None
locations: Optional[str] = None
name: str
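
The hunks above document `idleTimeout` in seconds with a default of 60. As a minimal sketch of what that means for callers who think in minutes, the snippet below converts a minute value to seconds before setting the field. `ScalingSketch` and `idle_timeout_from_minutes` are hypothetical names used only for illustration. They are not part of `runpod_flash`, and the sketch assumes the value is interpreted as seconds, as the updated documentation states.

```python
from dataclasses import dataclass


@dataclass
class ScalingSketch:
    """Hypothetical stand-in for the scaling fields shown in this diff."""

    workersMin: int = 0
    workersMax: int = 3
    idleTimeout: int = 60  # seconds, matching the new documented default


def idle_timeout_from_minutes(minutes: float) -> int:
    """Convert a duration in minutes to whole seconds for idleTimeout."""
    if minutes < 0:
        raise ValueError("minutes must be non-negative")
    return round(minutes * 60)


# Default configuration: idle workers are kept for 60 seconds.
default_cfg = ScalingSketch()

# A caller who wants five minutes passes 300, not 5.
five_minute_cfg = ScalingSketch(idleTimeout=idle_timeout_from_minutes(5))

print(default_cfg.idleTimeout, five_minute_cfg.idleTimeout)
```

Under this reading, a leftover `idleTimeout=5` would mean five seconds rather than five minutes, which is consistent with the template files above moving from 5 to 60.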