Is there a recommended service for embedding PDFs that captures more than just the text? This would help create more comprehensive agents for a support chat or similar application.
As I understand it, one can also use external vector databases if Retool Vectors doesn't offer this feature.
Thanks a lot for your suggestions, definitely something I will also use for other workflows. However, what I wanted to ask is: how can one embed PDFs, including the images they contain, into a vector database, or better yet Retool Vectors?
Or does it require another external service such as Pinecone and embedding method X (which I am not familiar with)?
Great question! If you are using Retool AI Document Actions, our PDF parser only parses raw text for now, so this may not be possible if you need the AI to reference those images.
On the other hand, the PDFs, including their images, could be stored as base64 data. We could then send that data to an LLM, but I'm not sure it would be able to provide accurate information about the underlying text and images in each file.
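For anyone who wants to try the base64 route, here is a minimal Python sketch of the encoding step only. The function names and the payload field names (`question`, `file`, `media_type`, `data_base64`) are hypothetical and made up for illustration; they are not a Retool API or any particular LLM provider's format, so you would need to adapt the payload to whatever service you call.

```python
import base64
import json


def pdf_bytes_to_base64(pdf_bytes: bytes) -> str:
    """Encode the raw PDF bytes (text, images, everything) as an ASCII base64 string."""
    return base64.b64encode(pdf_bytes).decode("ascii")


def build_payload(pdf_bytes: bytes, question: str) -> str:
    """Wrap the encoded PDF and a question in a JSON string.

    The payload shape is an assumption for illustration, not a real API contract.
    """
    payload = {
        "question": question,
        "file": {
            "media_type": "application/pdf",
            "data_base64": pdf_bytes_to_base64(pdf_bytes),
        },
    }
    return json.dumps(payload)


if __name__ == "__main__":
    # Stand-in bytes; in practice you would read them with open(path, "rb").read()
    sample = b"%PDF-1.4\n% stand-in bytes, not a complete PDF\n"
    body = build_payload(sample, "What does the sketch on page 3 show?")
    print(len(body))
```

Keep in mind that base64 makes the data roughly a third larger than the original file, so large PDFs can run into request size limits. Whether the model can actually read the images depends on the model you send it to.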
It is a great tool for ingesting and understanding multimodal PDFs.
I made a prototype and gave it a 60-page interior design guideline (full of graphics and sketches), which I can now ask for help designing our house.
I ended up connecting it to a CustomGPT via OpenAPI Action Schema. If you want to have a look I can DM the link.
I checked and can also provide a preview like this.
how-to:
Just ask it a question like this one, or any other interior-design-related question:
I have two sofas, how shall I put them in a small room? Use the codeword "Ask Buddy" in case it does not trigger the custom action.