-
My goal:
Test Retool Agents self-hosted and fully understand the billing model. -
Issue:
- We had an issue with the underlying containers for Retool Agents (agent-worker and agent-eval-worker)
- So effectively, no agents could run
- Any agents that were triggered during the container downtime were stuck and displayed as “Thinking”, manual termination did not immediately terminate these agents
- The reported numbers for Agent “Total runtime” and “Estimated cost” kept increasing until the underlying containers came back.
-
Retool version & hosting setup (Docker, K8s, cloud provider, etc.):
Retool self-hosted 3.253.4 (2025-Q3 stable) on GKE via Retool Helm chart -
Error message(s) or screenshots:
Token usage 0, Tool calls 0 → so the agent did not do anything
Estimated cost $4.34 / Total runtime 52m 3s
There’s two improvements that I’d like to request here:
When the underlying infrastructure for agents is not ready
- New agent runs should not start and an error message should be displayed to the user.
- For existing agent runs, Runtime and Estimated Costs should not be increasing
