Available GPU configurations and selection guide for optimal performance
GPUType
enum provides standardized GPU configurations for different workloads. Each type represents a specific combination of GPU count and memory capacity.
A100-80GB Single GPU
A100-80GB Dual GPU
A100-80GB Quad GPU
A100-80GB Octa GPU
Model Size | Parameters | Recommended GPU | Memory Reasoning |
---|---|---|---|
Small | < 1B | A100_80GB_1 | Fits comfortably in 80GB |
Medium | 1B - 7B | A100_80GB_2 | Benefits from parallel processing |
Large | 7B - 30B | A100_80GB_4 | Requires distributed memory |
Massive | 30B+ | A100_80GB_8 | Needs maximum memory capacity |