Cloud Type
Choose a cloud type based on workload requirements.
AirCloud
Best for workloads where cost efficiency and flexibility matter.
AirCloud+
Best for workloads that require higher reliability, performance, and a more predictable operating environment.
AirCloud 0
An option that leverages distributed infrastructure (e.g., individuals, partner providers).
Instance types
When deploying containers, you can select a GPU instance type. Examples include:
- RTX 4070 Ti Super
- RTX 4070 Super
- RTX 4090
- RTX 5090
- RTX PRO 6000
What to consider
- Required latency
- Model size and memory requirements
- Expected request volume
- Replica strategy
- Whether autoscaling is enabled
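As a rough illustration of the "model size and memory requirements" consideration, the sketch below estimates the VRAM a model's weights need and lists which of the example GPUs could hold them. Everything in it is an assumption for illustration: the VRAM figures are NVIDIA's published sizes for these cards (not values confirmed by this platform), the 1.2 overhead factor is a placeholder for activations and cache, and the names `GPU_VRAM_GB`, `estimate_vram_gb`, and `candidate_gpus` are hypothetical helpers, not part of any platform API.

```python
# Published VRAM sizes in GB for the example cards (assumption: NVIDIA's
# public specs; confirm the actual instance specs in the console).
GPU_VRAM_GB = {
    "RTX 4070 Super": 12,
    "RTX 4070 Ti Super": 16,
    "RTX 4090": 24,
    "RTX 5090": 32,
    "RTX PRO 6000": 96,
}


def estimate_vram_gb(params_billions: float,
                     bytes_per_param: float = 2.0,
                     overhead: float = 1.2) -> float:
    """Rough footprint in GB: parameters x bytes per parameter x overhead.

    bytes_per_param is 2.0 for 16-bit weights, 1.0 for 8-bit, 0.5 for 4-bit.
    overhead is an assumed multiplier for activations and cache.
    """
    return params_billions * bytes_per_param * overhead


def candidate_gpus(params_billions: float,
                   bytes_per_param: float = 2.0,
                   overhead: float = 1.2) -> list[str]:
    """Return the GPUs whose VRAM covers the estimate, smallest first."""
    needed = estimate_vram_gb(params_billions, bytes_per_param, overhead)
    by_size = sorted(GPU_VRAM_GB.items(), key=lambda item: item[1])
    return [name for name, vram in by_size if vram >= needed]


if __name__ == "__main__":
    # A 7B-parameter model with 16-bit weights needs roughly 16.8 GB here.
    print(candidate_gpus(7))
```

This covers only one item on the list. Latency targets, request volume, replica strategy, and autoscaling still decide how many instances of the chosen type to run, so treat the result as a starting point and validate it with a real deployment.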
Related docs
Deploy a Container
Deployment flow and configuration steps.
Autoscaling and Scheduling
Autoscaling and scheduled scaling guidance.

