Pick a model & time pack
Browse the OpenLLM catalog or community templates on the self-deploy page. Choose Gemma 4 26B A4B, Qwen3.6 27B A4B, or another mapped model on RTX 4090 / 5090. Select an 11-hour or 24-hour flat pack — one price, no per-token meter.