Conversation
GPU workers are pinned to Python 3.12 (torch/CUDA only installed for 3.12 in base image). CPU workers support 3.10-3.12. Build pipeline handles wheel selection automatically. Also bumps requires-python to <3.14 and runpod-flash to >=1.7.0.
Replace GpuGroup.ANY with explicit GpuType values (e.g. GpuType.NVIDIA_GEFORCE_RTX_4090) across all examples, docs, and contributing guide. Specific GPU types give users faster provisioning and predictable behavior.
- idle_timeout is seconds, not minutes — update values accordingly - Remove unnecessary workers/idle_timeout defaults from simple examples - Unpin pandas/numpy versions for cross-Python-version compatibility - Add missing accelerate dependency to SD example - Add explicit workers to LB endpoints that need them
There was a problem hiding this comment.
Pull request overview
Updates the Flash examples/docs to reflect current runtime constraints (Python versions), standardize GPU selection guidance (favoring explicit GpuType values), and correct autoscaling idle_timeout configuration to use seconds.
Changes:
- Document deployed-worker Python version constraints and tighten
requires-pythonwhile bumpingrunpod-flashminimum version. - Replace many
GpuGroup.ANYusages with explicitGpuTypevalues in docs and examples. - Convert
idle_timeoutexamples/docs to seconds, remove unnecessary defaults in simple examples, and relax dependency pins for broader Python compatibility.
Reviewed changes
Copilot reviewed 22 out of 22 changed files in this pull request and generated 13 comments.
Show a summary per file
| File | Description |
|---|---|
| pyproject.toml | Constrains supported Python versions and bumps runpod-flash minimum version. |
| docs/cli/workflows.md | Updates workflow snippets for GpuType usage and corrects idle_timeout examples. |
| docs/cli/troubleshooting.md | Updates troubleshooting guidance to reference GpuType/GpuGroup and revised GPU availability guidance. |
| README.md | Adds deployed-worker Python version notes and updates idle_timeout unit wording. |
| CONTRIBUTING.md | Migrates contributing examples from GpuGroup.ANY to explicit GpuType usage. |
| CLAUDE.md | Updates internal guide examples and symbol table to reflect GpuType usage. |
| 05_data_workflows/01_network_volumes/gpu_worker.py | Switches to GpuType, fixes idle_timeout units, and adds missing deps for the SD example. |
| 05_data_workflows/01_network_volumes/cpu_worker.py | Adds explicit idle_timeout for CPU worker. |
| 04_scaling_performance/01_autoscaling/gpu_worker.py | Migrates to GpuType and converts scale-to-zero timeout to seconds. |
| 04_scaling_performance/01_autoscaling/cpu_worker.py | Converts CPU scale-to-zero timeout to seconds. |
| 04_scaling_performance/01_autoscaling/README.md | Updates autoscaling reference/examples to match GpuType and new timeout semantics. |
| 03_advanced_workers/05_load_balancer/gpu_lb.py | Migrates GPU LB example to explicit GpuType and expands config for readability. |
| 03_advanced_workers/05_load_balancer/cpu_lb.py | Adds explicit workers to CPU LB endpoint configuration. |
| 03_advanced_workers/05_load_balancer/README.md | Updates load balancer docs to use GpuType in examples. |
| 02_ml_inference/01_text_to_speech/gpu_worker.py | Converts idle_timeout to seconds for GPU worker example. |
| 01_getting_started/04_dependencies/cpu_worker.py | Unpins pandas/numpy to improve cross-Python compatibility. |
| 01_getting_started/04_dependencies/README.md | Updates dependency guidance from strict pinning to constraints-based recommendations. |
| 01_getting_started/03_mixed_workers/gpu_worker.py | Removes unnecessary idle_timeout default from a simple GPU endpoint. |
| 01_getting_started/03_mixed_workers/README.md | Converts idle_timeout examples from minutes to seconds. |
| 01_getting_started/02_cpu_worker/cpu_worker.py | Removes unnecessary workers/idle_timeout defaults from the simplest CPU example. |
| 01_getting_started/01_hello_world/gpu_worker.py | Changes default GPU selection away from GpuGroup.ANY. |
| 01_getting_started/01_hello_world/README.md | Updates examples to use GpuType and refreshes guidance for choosing GPU types. |
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
You can also share your feedback on Copilot code review. Take the survey.
There was a problem hiding this comment.
Pull request overview
Copilot reviewed 22 out of 22 changed files in this pull request and generated 8 comments.
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
You can also share your feedback on Copilot code review. Take the survey.
… examples - Fix GpuType.NVIDIA_A100_80GB to NVIDIA_A100_80GB_PCIe across docs - Fix GpuType.NVIDIA_H100_80GB to NVIDIA_H100_80GB_HBM3 in README - Migrate hello_world gpu_worker from GpuGroup.ADA_24 to GpuType.NVIDIA_GEFORCE_RTX_4090 - Fix idle_timeout config dict mismatches in autoscaling workers - Correct idle_timeout units from minutes to seconds in README tables - Use GpuGroup.ANY for GPU unavailable troubleshooting quick fix
Summary
GpuGroupto explicitGpuTypevalues in example worker files (e.g.GpuGroup.ADA_24->GpuType.NVIDIA_GEFORCE_RTX_4090)idle_timeoutvalues and units (parameter is seconds, not minutes)GpuTypeenum names in docs to match actual SDK valuesChanges
1. Python version documentation
requires-pythonto>=3.10,<3.13andrunpod-flashto>=1.7.02. GpuGroup -> GpuType migration in worker files
GpuGroup.ADA_24withGpuType.NVIDIA_GEFORCE_RTX_4090in hello_world exampleGpuGroup.ANYandGpuGroupare intentionally kept in docs for troubleshooting and flexible GPU selection contexts (they remain valid SDK APIs)3. GpuType naming corrections in docs
GpuType.NVIDIA_A100_80GB->GpuType.NVIDIA_A100_80GB_PCIe(correct enum name)GpuType.NVIDIA_H100_80GB->GpuType.NVIDIA_H100_80GB_HBM3(correct enum name)GpuGroup.ANYorGpuType.ANY4. idle_timeout and dependency fixes
idle_timeoutunits in README tables from minutes to secondsidleTimeoutvalues in autoscaling workers to match decorator valuesidle_timeout=1toidle_timeout=300for consistencyacceleratedependencyTest plan
flash runstarts successfully with updated examplesidle_timeoutconfig dict values match decorator valuesGpuTypereferences use valid enum names from SDK