Downloading files retrieved via API

  • Goal: Download files pulled from an API; the downloads keep coming through corrupted.

  • Steps: utils.downloadFile(DownloadFile2.data, 'test', 'pdf'). We have tried adding {base64Binary: } and have tried modifying the API headers a few ways.

Currently using content type: application/octet-stream and using this transform

const base64Message = "data:application/pdf;base64," + btoa(unescape(encodeURIComponent(data.message)))
return base64Message;

That seems to produce valid base64, but the downloaded file stays in that format (when opened in Notepad++) and doesn't open. We did notice that the working PDF we have is ANSI-encoded while the downloads are UTF-8. The working PDF also no longer looks like base64 when opened in Notepad++.
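For reference, the UTF-8 vs. ANSI observation is the key symptom: btoa over a text-decoded body can't reproduce the original bytes. Here is a minimal sketch of a binary-safe conversion, assuming the raw bytes are available (e.g. from an ArrayBuffer) rather than an already-decoded string:

```javascript
// Sketch: base64-encode raw bytes directly, with no text-decoding step.
// Decoding a binary body as UTF-8 first replaces invalid byte sequences
// with U+FFFD, which is unrecoverable; encoding from bytes avoids that.
function bytesToBase64(bytes) {
  let binary = "";
  for (let i = 0; i < bytes.length; i++) {
    binary += String.fromCharCode(bytes[i]);
  }
  return btoa(binary);
}

// Every PDF starts with the bytes for "%PDF-"
const pdfHeader = new Uint8Array([0x25, 0x50, 0x44, 0x46, 0x2d]);
const base64Message = "data:application/pdf;base64," + bytesToBase64(pdfHeader);
```

The same idea applies to any binary file type, not just PDFs, so it should still hold once more formats come through.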

Ultimately, this will have more than just PDF files coming through, but we have some of it hard-coded as a shortcut while we get the downloading part sorted.

Hi @Nic, after lots of trial and error, it does look like this is a limitation of converting the ANSI-encoded response to base64.

Here are my findings:

I uploaded the PDF you shared with us during OH on Retool Storage. When I get it through the built-in Retool Storage query, it looks like this:

Here is the response object:

To simulate getting this as the response of an API, I created a REST API resource that does the same thing (it fetches the file):

Update:
We are able to download the PDF file with the following query:

Note: query4 is the one from the first screenshot.

The download is successful:

However, when we try to build the base64 from the response of your API:


... lots of scrolling later ...

It looks like we get the base64, but it seems not to be a perfect conversion, which leads to the file being corrupted.

Is there a way for the endpoint to respond with a different format? base64 would be ideal.

I just confirmed there is no way to get the data in a different format. Is there some other code that could be used to be more explicit about what it is receiving, so that it can convert it more reliably?

I tried a few different ways to handle this on Retool's side but no success yet.

Hi @Nic, after more testing I can't help but think that the API is not giving us ASCII back. Maybe it is, but it seems to be adding characters that shouldn't be there.

Here is the ASCII chart:

The response you shared with us:

Has a bunch of these: ��

This is why we are unable to generate the correct base64 to create the PDF.
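A quick way to see this effect (a sketch with made-up bytes, not the actual API response): any byte run that isn't valid UTF-8 gets decoded to the replacement character U+FFFD, and once that substitution happens, the original bytes are gone for good.

```javascript
// Demonstration: invalid UTF-8 bytes decode to U+FFFD (the � characters
// showing up in the response). The substitution is lossy, so no base64
// built from the decoded text can reproduce the original file.
const raw = new Uint8Array([0x25, 0x50, 0xff, 0xfe]); // "%P" followed by invalid bytes
const decoded = new TextDecoder("utf-8").decode(raw);
// decoded is "%P\uFFFD\uFFFD" -- the 0xff and 0xfe bytes are unrecoverable
```

This matches the �� pairs in the shared response: each one marks a spot where binary data was mangled by text decoding.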

If this were ASCII, we would be able to use a service like:

to get the base64 and generate the PDF file. I just tried it, and it doesn't work either.

Why are we using this API? Are we storing our files there?

Hi Paulo, we were actually able to get it working in an entirely separate script using fetch, but when we switched to the production environment we started getting CORS errors, and it sounds like fetch isn't fully supported. I've been trying to merge this with a normal query, but I'm having trouble returning all the data in a way that makes it back to the script. Any suggestions?

Here is the script that worked in staging, and works in prod if I use a CORS proxy, but that's not a workable long-term solution.

const MAX_CHUNK_SIZE = 1024 * 1024 * 5; // Define your chunk size here
const internalFileIdentifier = FileID

class MyAppState {
    constructor() {
        this.AccessToken = (obfuscated, this is elsewhere);
    }
}

const _appState = new MyAppState();

function setMessageDefaultHeaders() {
    return {
        "Authorization": `Bearer ${_appState.AccessToken}`,
        "Content-Type": "application/vnd.api+json",
        "Accept": "application/vnd.api+json",
        "User-Agent": "MyPythonApp",
        "Access-Control-Allow-Origin": "*",
        "Access-Control-Allow-Credentials": true,
        "mode": "no-cors"
    };
}

async function downloadFileAsync(internalFileIdentifier, fileSize) {
    let totalChunks = Math.floor(fileSize / MAX_CHUNK_SIZE);
    if (fileSize % MAX_CHUNK_SIZE !== 0) {
        totalChunks += 1;
    }

    const data = [];
    const headers = setMessageDefaultHeaders();

    for (let i = 0; i < totalChunks; i++) {
        const url = `(shortenedforclarity/files/${internalFileIdentifier}?part_number=${i + 1}`;

        console.log(`Requesting part ${i + 1}. URL: ${url}`); // Debugging statement
        const response = await fetch(url, { headers });

        if (response.ok) {
            const chunk = await response.arrayBuffer();
            console.log(`Downloaded chunk ${i + 1} of size ${chunk.byteLength} bytes`); // Debugging statement
            data.push(new Uint8Array(chunk));
        } else {
            console.error(`Failed to download part ${i + 1}, Status: ${response.status}`); // Debugging statement
            return new Uint8Array(data.reduce((acc, val) => acc.concat(Array.from(val)), []));
        }
    }

    console.log(`Total downloaded data size: ${data.reduce((acc, val) => acc + val.length, 0)} bytes`); // Debugging statement
    return new Uint8Array(data.reduce((acc, val) => acc.concat(Array.from(val)), []));
}

function saveFile(fileName, data) {
    const blob = new Blob([data], { type: "application/octet-stream" });
    const url = URL.createObjectURL(blob);

    const a = document.createElement("a");
    a.href = url;
    a.download = fileName;
    document.body.appendChild(a);
    a.click();
    document.body.removeChild(a);

    URL.revokeObjectURL(url);
}

async function main() {
    const internalFileIdentifier = currentSourceRow.file;
    const fileSize = currentSourceRow.fileSize; // Replace with the actual file size
    const data = await downloadFileAsync(internalFileIdentifier, fileSize);
    saveFile(currentSourceRow.name+".pdf", data);
}

// Run the main function
main().catch(console.error);

Any idea on how to make sure we get the right data back? I used this, and it seems to run OK, using additional scope to structure the query, but it doesn't seem to pass enough data back from the query for the rest to work. Any advice for making what comes back from the query look more like what a fetch would have returned?
const response = await DownloadFile.trigger({additionalScope:{field1:url,field2:partnum}})
results.push(response);
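Not sure if this helps, but as a sketch of the reassembly step: once each trigger resolves, the per-chunk byte arrays can be merged into a single Uint8Array without the Array.from/concat round trip used in the script (variable names here are assumptions):

```javascript
// Sketch: merge per-chunk Uint8Arrays into one contiguous buffer.
// Uint8Array has no concat(), so copy each chunk in at its running offset.
function concatChunks(chunks) {
  const total = chunks.reduce((sum, c) => sum + c.length, 0);
  const merged = new Uint8Array(total);
  let offset = 0;
  for (const chunk of chunks) {
    merged.set(chunk, offset);
    offset += chunk.length;
  }
  return merged;
}
```

The same function could replace both reduce calls in downloadFileAsync, and the result can be handed straight to the Blob in saveFile.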

I haven't seen this implementation before but the code looks good.

fetch should work in any JS query regardless of the environment:

Do you mind sharing the CORS error you are getting?

This is what we get:

When we tried no-cors, it corrupted the response somehow, so it failed in a different way. Running the same request manually through a query works just fine (but then we're back to the conversion error), and running it through fetch gets the CORS error. If there is a way to get that to work, great; or, if we can pass the query response back in the same shape a fetch would have returned it, I think that would actually be even better than fetch.

We've tried a couple of other iterations, so here's another error log with one of the other settings we tried.

What environment did this work on?

The software we are working with, Actionstep, has a staging and a production version of the server - it worked with their staging environment but not with their production environment.

Do we have API credentials? Actionstep docs:

Yes, I have keys; I've used them for lots of other things, and they do work in this case. When using a query instead, I can still get a proper response, but since I could only get a working download using the script, I'm struggling to pass the response from the query version back to the script without it being corrupted or otherwise not working. The fetch mechanism seems to be treated differently from a CORS perspective and from a response-processing perspective: one for the better, one for the worse.

Thank you for clarifying!

Let's try something. Set up a REST API query to handle one chunk at a time so it can replace fetch, which seems to be the limitation. Use the same headers, and set up Additional Scope variables for part_number and the other dynamic values we need. Then, in the script you shared, just replace the await fetch(url, { headers }) with:

await RestAPIqueryName.trigger({
  additionalScope:{
    part_number: `${i + 1}`,
    internalFileIdentifier: internalFileIdentifier
  }
})

This way, the request is routed through Retool's backend, which can handle CORS more effectively.

Test it on staging first, since we do get the file there. If it works, let's test on prod.
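One more thing to check: if the REST query ends up returning the chunk as a base64 string rather than an ArrayBuffer (this depends on how the resource is configured, so treat it as an assumption), the script would also need to decode each chunk back to bytes before collecting it:

```javascript
// Sketch: decode a base64 chunk back into raw bytes before pushing it
// into the data array, so the final Blob is built from real bytes
// instead of a base64 string.
function base64ToBytes(b64) {
  const binary = atob(b64);
  const bytes = new Uint8Array(binary.length);
  for (let i = 0; i < binary.length; i++) {
    bytes[i] = binary.charCodeAt(i);
  }
  return bytes;
}
```

Pushing base64 text straight into the data array would produce a file whose contents are still base64, which matches the "still looks like base64 in Notepad++" symptom from earlier.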

It doesn't look like it worked, even in staging. I think it either doesn't pass enough data or the data gets corrupted along the way. The query works on its own, and the full version looks like it's running OK, but the downloaded file is empty, so it's still definitely getting corrupted somehow on the way out.