Production-ready RAG data loaders for Timescale Vector
Creating a vector index from any data using Llamaindex and Timescale
You've got your amazing RAG AI app ready. The data loaders are done, but they only run on your laptop. Now what? Where do you run them on a schedule, or otherwise repeatably? How do you let other people use them? You've come to the right place!
This tutorial shows how to run production-grade RAG data loaders on Robocorp.
With Robocorp, you can run Python at scale, anywhere. It comes with built-in logging, a powerful orchestrator with an API for everything, and scalable runtimes in the cloud or on-prem. In this tutorial, we are focusing on these highlights:
- How to wrap your RAG data loader into a runnable Robocorp Task and instantly get detailed logs of every execution
- How to schedule a Task to run in Robocorp Control Room using on-demand containers
- How to send an email to your RAG loader and have it process the attachments - with zero additional lines of code
The video below will show you some of the cool highlights.
Let's get going!
What you'll need
To complete this tutorial yourself, you'll need the following:
- Robocorp Control Room account (sign up for free)
- Timescale account with a deployment (free trials available)
- OpenAI API key
- Robocorp Code extension for VS Code, connecting your IDE to Control Room for easier local development (download for free)
Once all is installed, do the following (if you follow the names exactly, the code will work without changes):
- Link your development environment to the Robocorp Cloud and your workspace.
- Create two Robocorp Vault items that house your secrets:
  - One called OpenAI, with a single entry called key that contains your OpenAI API key.
  - A second called Timescale, with your Timescale service URL in an entry called service-url.
You'll find the Service URL in your Timescale console under Connection Info.
Raw Python to Robocorp Task
In the example code, this is already done, but assuming you'd start from a "plain vanilla" Python script, here's what you'd do.
1. Set up your dependencies in the conda.yaml file and declare runnables
Generally, this is simple: you move all your normal pip installs under the pip dependencies in the configuration file. Why? Because the runtimes build the Python environment according to these specs - first on your machine, and later wherever the Task gets executed. This is the secret ingredient: you'll never be at the mercy of whatever Python version the runtime happens to have, as Robocorp always runs Tasks in a fully isolated and replicable environment.
In this case, a small trick is needed: for the Timescale connection to work, you'll need the psycopg2 package coming from conda-forge (as a top-level item), and you'll also need to add psycopg2-binary, which has to come from pip.
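As a sketch, a conda.yaml following that advice could look like the one below; all version pins are illustrative, so adjust them to your project:

```yaml
channels:
  - conda-forge

dependencies:
  - python=3.10.12            # illustrative pin
  - psycopg2=2.9.7            # top-level item, from conda-forge
  - pip=23.2.1
  - pip:
      - robocorp==1.4.3           # Robocorp tasks, vault and work items libraries
      - psycopg2-binary==2.9.7    # must come from pip
      - llama-index==0.9.13       # illustrative pin
      - timescale-vector==0.0.4   # illustrative pin
      - openai==1.3.8             # illustrative pin
```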
Next, you'll make sure that your robot.yaml contains the declarations of what Control Room can see as runnables. In this case, you'd see the Data Loader and Do a Query Tasks in your workspace.
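As a sketch, assuming the Tasks live in a tasks.py file with functions named data_loader and do_a_query, the declarations could look like this:

```yaml
tasks:
  Data Loader:
    shell: python -m robocorp.tasks run tasks.py -t data_loader
  Do a Query:
    shell: python -m robocorp.tasks run tasks.py -t do_a_query

condaConfigFile: conda.yaml
artifactsDir: output
```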
2. Wrap your code in a @task
The next step is wrapping your existing code in a function and adding a @task decorator. The decorator does a bunch of things: for example, it automatically initializes the detailed logging that Robocorp provides, and it lets you run the code easily in the environment you just configured in the previous step.
This is what you'll be adding, and all your code goes into the middle where the comment is:
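A minimal sketch (the function name data_loader is just an example, matching the Task declared in robot.yaml above):

```python
from robocorp.tasks import task

@task
def data_loader():
    # ... all your existing data loader code goes here ...
    ...
```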
3. Hide the secrets
As we will run the code somewhere other than your own laptop (for example, on on-demand cloud runtimes), we can't rely on your local environment variables anymore. In the example code, we've wrapped all the necessary setup steps in a method that pulls the credentials from Robocorp Vault and sets the necessary (environment) variables. Like this:
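A sketch of that setup, assuming the Vault item and entry names created earlier (the helper name setup and the TIMESCALE_SERVICE_URL variable are illustrative):

```python
import os

from robocorp import vault


def setup():
    # Pull the secrets from Robocorp Vault; the item and entry names
    # must match the Vault items you created earlier.
    openai_secret = vault.get_secret("OpenAI")
    timescale_secret = vault.get_secret("Timescale")

    # Expose the values the way the rest of the loader code expects them.
    os.environ["OPENAI_API_KEY"] = openai_secret["key"]
    os.environ["TIMESCALE_SERVICE_URL"] = timescale_secret["service-url"]
```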
4. Use the Work Items
Last but not least, we'll put Work Items to work.
Our code builds the embeddings from a local folder called data, so we must get some files into that folder first. The easiest way is to use Work Items, which can contain JSON payloads or files. For now, we are only interested in files. The fancy part of Robocorp Control Room is that regardless of how the Task gets triggered, it'll pass the files in as input Work Items. So you could call the Task through an API with files, send an email with attachments, or manually start a run with files from Control Room. In all cases, just these two lines will do the trick.
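In the example code, those two lines look something like this (plus the import):

```python
from robocorp import workitems

for item in workitems.inputs:
    item.get_files("*", "data")
```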
They'll iterate through all the available Work Items and save all the files to the local data folder.
Now you are good to go!
Running it locally
The tutorial repository contains three example Work Items for easy testing:
- simple-file is just one input .txt file
- multiple-files contains two PDF files
- email contains an example of how an email is represented as a Work Item when it's sent to Control Room to trigger a run
To run and debug locally in VS Code, the easiest way is through Command Palette (CTRL + Shift + P or Command + Shift + P) and Robocorp: Run Robot. That way, the run happens in the environment you have defined.
You'll be prompted for two things.
First, which Task to run? The repo contains two: Data Loader is where the beef is, while Do a Query is there to validate that your data loader worked; it creates a simple query engine and runs one query against your database (there's a sketch of it after these steps).
Second, choose the input Work Item. For the Data Loader, try any of the three available, or jump into the deep end and create your own test Work Items from the Resources panel in the Robocorp Code extension. For the Do a Query Task, you don't need any Work Item.
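For reference, the core of the Do a Query Task could look something like the sketch below, assuming the llama-index Timescale vector store and a table called embeddings (the table name and the query string are illustrative):

```python
import os

from llama_index import VectorStoreIndex
from llama_index.vector_stores import TimescaleVectorStore
from robocorp.tasks import task


@task
def do_a_query():
    setup()  # the illustrative Vault helper from earlier sets the env variables

    # Connect to the same table the data loader wrote the embeddings into.
    vector_store = TimescaleVectorStore.from_params(
        service_url=os.environ["TIMESCALE_SERVICE_URL"],
        table_name="embeddings",
    )
    index = VectorStoreIndex.from_vector_store(vector_store=vector_store)

    # One query is enough to validate that the loader worked.
    response = index.as_query_engine().query("What are the documents about?")
    print(response)
```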
After running something locally, check the Robo Task Output to see what the run did. If anything failed, it's easy to see what went south.
Going cloud ☁️
Now that everything runs locally, it's time to go production-grade. In this section, we'll do the following: upload the project to Control Room, create a Process out of it, set up an email trigger, and test it.
1. Deploy code to Control Room
While you can upload your project directly from VS Code to Control Room (Command Palette - Robocorp: Upload Robot to Control Room), the recommended way is to do it via a git repo. You may fork this repo to your own, or simply use our example repo directly.
It's easy: go to the Tasks tab under your Workspace, click Add Task Package, give it a name, and paste the link to the repo. Done.
2. Create a Process
Next up, you'll need to create a Process. Tasks are reusable components that you can use to build sequences of actions; when creating a Process, you'll map your chosen Tasks to the Worker runtimes.
Follow the UI again, as the video below shows. Processes, Create a new Process and add your details.
Once that's done, you'll have the opportunity to either set up scheduling or create an email trigger; we'll choose the latter. In the last step, you can create alerts and callbacks to Slack, email, and webhooks. In this example, we set a Slack notification for both successful and unsuccessful runs.
3. Run it manually
Once the Process is created, it's time to run it! Hit the Run Process button, choose Run with Input Data, and give a file as input. Once the run is complete, your Timescale database has embeddings in it! 🤞
Tip: open the Process run for a detailed log of the execution.
4. Start with email
Pushing this even further, next grab the email address of the Process; you'll find it in the Configuration of the Process under Scheduling.
Now, whatever you send to that particular email address automatically triggers your run, takes the attachments as input files, and loads them into your vector database. And remember: you did not write a single line of code that handles emails! ✨ We did it for you.
Push the boundaries
Now that all the basics work, here are some ideas of what else you could do with Robocorp!
- Build more complex data loaders that e.g. get data from websites and run on a daily schedule. Here's a repo for inspiration.
- Deploy self-hosted workers so that the data loader has access to data and systems inside your own network.
- Each Process is callable over the Control Room API. Try triggering the load process with an API call (see the sketch below).
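As a rough sketch of that last idea: the exact endpoint and payload are documented in the Control Room API docs, and the IDs, region, and API key below are placeholders you'd replace with your own:

```python
import requests

# Placeholders; find the real values in your Control Room workspace.
WORKSPACE_ID = "your-workspace-id"
PROCESS_ID = "your-process-id"
API_KEY = "your-workspace-api-key"

# Start a Process run (endpoint shape per the Process API; check the docs
# for your region instead of assuming eu1).
response = requests.post(
    f"https://api.eu1.robocorp.com/process-v1/workspaces/{WORKSPACE_ID}"
    f"/processes/{PROCESS_ID}/runs",
    headers={"Authorization": f"RC-WSKEY {API_KEY}"},
)
response.raise_for_status()
print(response.json())
```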
Technical information
Last updated: 12 December 2023
License: Apache License 2.0