Custom AI Provider cannot stream output in LLM Chat component

Hello Retool community,

I am trying to connect the LLM Chat component to my own chat-completion endpoint, which is a custom FastAPI (Python) service. I've tried many approaches, but the LLM Chat component doesn't stream my output; instead it displays everything at once at the end, which takes quite a long time.
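For context, here is roughly how my endpoint produces output. This is a minimal sketch, not my exact code: the function and model names are placeholders, and it only shows the chunk formatting using the standard library. The idea is to yield OpenAI-style chat-completion chunks as server-sent events, which in FastAPI you would wrap with `StreamingResponse(..., media_type="text/event-stream")`.

```python
# Hypothetical sketch of an OpenAI-compatible streaming generator.
# In a real FastAPI app you would return:
#   StreamingResponse(sse_chunks(reply), media_type="text/event-stream")
import asyncio
import json

async def sse_chunks(text: str, model: str = "my-model"):
    """Yield the reply word by word in the SSE format that
    OpenAI-compatible streaming clients expect."""
    for word in text.split():
        chunk = {
            "object": "chat.completion.chunk",
            "model": model,
            "choices": [{"index": 0, "delta": {"content": word + " "}}],
        }
        # Each SSE event is a "data: ..." line followed by a blank line.
        yield f"data: {json.dumps(chunk)}\n\n"
        await asyncio.sleep(0)  # yield control so the server can flush
    yield "data: [DONE]\n\n"  # sentinel marking the end of the stream

async def collect(gen):
    # Helper for testing outside a server: drain the async generator.
    return [c async for c in gen]

chunks = asyncio.run(collect(sse_chunks("hello streaming world")))
print(chunks[0])
```

Testing the endpoint with `curl -N` shows the chunks arriving incrementally, so the streaming works server-side; it is only inside Retool that everything arrives at once.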

Does anyone know how I can stream output from a custom endpoint?

Hey @wonka - we're in the early stages of adding support for HTTP response streaming to the REST API resource. If you're interested, I can raise the corresponding flag for your org. :+1: