Embed PDFs with images graphs and tables in a Vector Database

mascaritas · September 4, 2024, 10:02am

dear community

Is there a recommendation for a service where I can embed PDFs with additional elements than just the text? This would help to create more comprehensive agents for a support chat or similar application.

As I understand it, one can also use external vector databases, if retool vectors don’t offer this feature

Best Greetings

Paulo · September 7, 2024, 1:31am

Hi @mascaritas, you can definitely add images to the PDF exporter, here is how:

Alternatively, @mstevenson shared a TPA that can help you create more complex PDFs:

mascaritas · September 11, 2024, 9:11pm

Dear Paulo

thanks a lot for your suggestions, def something I will also use for other workflows. However what I wanted to ask: how can one embed pdf's including their contained images into a vector-database or better yet retool vectors ?

Or does it require the use of another external service such as pinecone and embedding method X (which I am not familira with)

I hope this makes sense now as a question

Cheers

Paulo · September 11, 2024, 11:34pm

Great question! If you are using Retool AI Document Actions, our PDF parser is only parsing raw text for now, so this may not be possible if you need the AI to reference those images.

On the other hand, the PDFs, including their images, could be stored as base64 data. We could then send that data to a LLM but I'm not sure if it would be able to provide accurate information about the underlying text and images on each file.

mascaritas · January 21, 2025, 9:17pm

I will test this integration here, it seem to read manuals and tech documents particularly well:

Paulo · February 28, 2025, 10:30pm

Hi @mascaritas, how did the integration go?

mascaritas · February 28, 2025, 11:15pm

It is a great tool to ingest and understand multimodal pdfs.
I have made a prototype and gave it a 60 page interior design guideline (all with graphics and sketches) which I can ask now to design our house.

I ended up connecting it to a CustomGPT via OpenAPI Action Schema. If you want to have a look I can DM the link.

Best

Paulo · February 28, 2025, 11:49pm

I would love to take a look at it. Thank you!

mascaritas · March 1, 2025, 12:20am

I checked and can also provide a preview like this.

how-to:
Just ask it a question like this or any other interior related:
I have two sofas, how shall I put them in a small room? And use the codeword "Ask Buddy" in case it does not trigger custom action.

https://chatgpt.com/g/g-67b0e4e481d48191a56ebb19942455b4-interior-buddy

curios to see what you find...

Paulo · March 1, 2025, 12:27am

It may be a private workspace:

Screenshot 2025-02-28 at 4.27.14 PM

Jeje · March 13, 2025, 5:08pm

Just adding +1 here as that would be a really powerful addition to the retool vectors

Topic		Replies	Views
PDF Exporter in Retool 💬 Feature Requests	7	1280	June 6, 2024
Retool API - Scopes and remote vector creation 💬 Queries and Resources api	6	89	April 11, 2025
Tutorial: Making PDF's in Retool 🤝 Community Show & Tell	22	24991	November 21, 2025
How to convert a renderID of a PDF to a JPEG or PNG? 💬 App Building javascript , api , pdf	12	1950	January 20, 2025
Creating vectors from table record URLs 💬 App Building table , openai	11	1045	January 8, 2024

Embed PDFs with images graphs and tables in a Vector Database

Related topics