there is a 100 page/slide limit per file on all the major LLM providers. there's also a single-file size limit and a batch size limit when uploading (I think Retool has one too, but I forget the details since they use a nifty pricing system for their AI features; OpenAI and Claude definitely have these limits).
I actually discuss a possible solution over here. it involves pulling the raw text from the pdf, turning the pages/slides into images, running OCR on those images, then chunking/combining the raw text with the OCR results and creating embeddings from the result.
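for a rough idea of what that pipeline looks like, here's a minimal sketch using pypdfium2 for text extraction and page rendering, pytesseract for OCR, and OpenAI's embeddings endpoint. the chunk size, overlap, model name, and file name are all placeholder choices, not recommendations:

```python
# pip install pypdfium2 pytesseract openai  (plus the tesseract binary itself)
import pypdfium2 as pdfium
import pytesseract
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment


def pdf_to_chunks(path, chunk_size=1000, overlap=200):
    pdf = pdfium.PdfDocument(path)
    pages_text = []
    for page in pdf:
        # 1) pull the raw embedded text layer
        raw = page.get_textpage().get_text_range()
        # 2) render the page to an image and OCR it (catches scanned
        #    pages and text baked into slide graphics)
        image = page.render(scale=2).to_pil()
        ocr = pytesseract.image_to_string(image)
        pages_text.append(raw + "\n" + ocr)
    full_text = "\n".join(pages_text)
    # naive fixed-size chunking with overlap; swap in whatever splitter you like
    step = chunk_size - overlap
    return [full_text[i:i + chunk_size] for i in range(0, len(full_text), step)]


def embed_chunks(chunks):
    resp = client.embeddings.create(model="text-embedding-3-small", input=chunks)
    return [item.embedding for item in resp.data]


chunks = pdf_to_chunks("deck.pdf")
vectors = embed_chunks(chunks)  # store these in whatever vector DB you're using
```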
a down and dirty solution would be to compute ceil(num_of_pages / 100) (division, not modulo) to get the number of files you need so that each stays under the 100-page limit, then use pypdfium2 to create that many new pdf files and iterate over the pages of the original, adding them to the files you created. you can then create a new vector store and upload the splits individually, and the vector store now technically contains the whole pdf (see the sketch below). you'll need to do some finagling with the prompt to get it to return which chunk it used, then do some math to figure out which page(s) it used to create the response (if you need to know).

I will note, you may want to consult a lawyer before using this method on files you do not own, as technically you're creating copies, and whoever does own them may have security requirements that don't allow direct copying of the files.
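here's roughly what the split-and-upload looks like. the upload calls are from the current openai python SDK (in older versions vector stores live under client.beta.vector_stores instead), and the store/file names are made up for the example:

```python
# pip install pypdfium2 openai
import math

import pypdfium2 as pdfium
from openai import OpenAI

MAX_PAGES = 100


def split_pdf(path, stem="split"):
    """Split a PDF into files of at most MAX_PAGES pages each."""
    pdf = pdfium.PdfDocument(path)
    n_files = math.ceil(len(pdf) / MAX_PAGES)  # division, not modulo
    paths = []
    for i in range(n_files):
        pages = range(i * MAX_PAGES, min((i + 1) * MAX_PAGES, len(pdf)))
        part = pdfium.PdfDocument.new()
        part.import_pages(pdf, pages=list(pages))  # copy pages from the original
        out = f"{stem}_{i}.pdf"
        part.save(out)
        paths.append(out)
    return paths


client = OpenAI()
store = client.vector_stores.create(name="whole-deck")
for p in split_pdf("deck.pdf"):
    with open(p, "rb") as f:
        client.vector_stores.files.upload_and_poll(vector_store_id=store.id, file=f)
```

since split file i starts at page i * 100 of the original, keeping that offset around is what lets you do the math later to map a retrieved chunk back to its original page number.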