Background thread: How are text chunks for embeddings generated?
Can we have more granular control in the Retool Vector query? For example, when we choose to upload a document to a vector, it would be nice to have options to decide the splitters, maximum chunk size, etc. Essentially, it would be an explicit call of the
Highly needed!! would love to have something like this.
Hi there! Thanks for the request. I've passed this feedback on internally as well. Could you share more details on how you'd like to use this?
Are you looking to upload part of a document? Are you experiencing performance issues?
Thanks for getting back. Essentially, I'd like to have options to set parameters such as
TextSplitter when I create a Retool Vectors Resource, either to upsert doc or insert doc, etc.
What I am doing now is once I have the OCR text from the document I want to upload, I would go to Resources -> Retool Vectors, then click "Create Vector". Since I can't customize the chunking strategy, I would need to manually edit the texts in a text editor, then upload the texts to create a vector through trial and error to get to the chunks I want. Ideally, when we are able to set the parameters, I don't need to go through the trial and error part any more.
Thank you for sharing these details! Will follow up here with any updates I get internally