Agent Reliability

In recent weeks, multiple agents that had previously worked routinely (many invoked by scheduled workflows) have begun to fail with a variety of errors.

I’m aware of the guidance to separate deterministic and non-deterministic functions, and of the LLM call limits - but these are workflows that previously worked and now do not.

A few weeks ago I had a separate, specific problem with some Retool workflows invoked from agents - I believe the naming rules changed, and workflows with spaces in their names began to error - but that isn’t the cause of this issue.

Is anybody else facing similar issues?

*Something went wrong*

```
Hit error when calling LLM: [
  {
    "expected": "'TOOL' | 'ANSWER'",
    "received": "undefined",
    "code": "invalid_type",
    "path": ["arguments", "nextStep"],
    "message": "Required"
  }
]
```
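For context on what that message means: it reads as a structured-output validation failure - the agent runtime appears to expect the LLM's response to include a `nextStep` field set to either `'TOOL'` or `'ANSWER'`, and the model returned no such field. The sketch below is purely illustrative (the field names come from the error itself, but the validator is my assumption, not Retool's actual code):

```typescript
// Illustrative only: reproduce the shape of the validation issue quoted
// in the error message. Not Retool's implementation.

type NextStep = 'TOOL' | 'ANSWER';

interface Issue {
  expected: string;
  received: string;
  code: string;
  path: (string | number)[];
  message: string;
}

// Check the `arguments.nextStep` field of a parsed LLM response.
function validateNextStep(args: Record<string, unknown>): Issue[] {
  const value = args['nextStep'];
  if (value === undefined) {
    // Missing field -> an issue shaped like the one in the error above.
    return [{
      expected: "'TOOL' | 'ANSWER'",
      received: 'undefined',
      code: 'invalid_type',
      path: ['arguments', 'nextStep'],
      message: 'Required',
    }];
  }
  if (value !== 'TOOL' && value !== 'ANSWER') {
    return [{
      expected: "'TOOL' | 'ANSWER'",
      received: typeof value,
      code: 'invalid_enum_value',
      path: ['arguments', 'nextStep'],
      message: `Invalid value: ${String(value)}`,
    }];
  }
  return []; // valid response
}

// A response missing `nextStep` yields the quoted issue;
// a well-formed response yields no issues.
console.log(validateNextStep({ toolName: 'query_db' }));
console.log(validateNextStep({ nextStep: 'ANSWER' }));
```

If that reading is right, the model is failing to produce the expected structured output - which would explain why switching models doesn't help if the underlying prompt or schema handling changed platform-side.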

Hey @leoncalverley thanks very much for calling this out. Is there any other information you can share? Do they fail 100% of the time, or inconsistently? Which models are you using? Have the inputs changed drastically (or at all)?

Hi Kent

Right now - I’d say 90% of agents fail.

I’ve spotted that one weekly agent failed on 10 Oct and again on 17 Oct.
I ran it from chat - and it failed.

I switched the model to Deepseek and upped the max iterations from 10 to 40 (it had previously been working fine at 10 - I hadn’t changed this).

It has now failed again. It is triggered from a workflow, and each run leaves significant log activity - but zero indication of the cause of the failure.

I submitted a Retool support message to ask whether there was a broader issue I needed to know about, and was directed to the Report a Breakage link - but that’s seemingly not for Agents, so my query went unanswered.

Feels like there’s something fundamentally wrong.

Switched it to 4o - and again it failed.

Added my Vertex API key - so switched to Google APIs for the first time - and it has failed again.
(Note: this agent was previously running smoothly, every week. No changes have been made.)

Oh my goodness - yes, with a 90% failure rate something is wrong. For what it’s worth, our overall error rate has decreased in the last month (and is, of course, much, much lower than 90%), so this isn’t a platform-wide issue.

> I switched the model to Deepseek

Which models were they before?

Would you be able to email me an export of your agent so we can take a look? Also, would you be willing to test a trivial agent (e.g. one with an execute-code tool, asked to add two numbers) and let me know if that fails at all?

I’m kent@retool.com.