Getting Started
Setting Up the DataBridge server
Direct Installation
Prerequisites
Cloning the Repository
To get started with DataBridge, we need to first setup the server. This involves cloning the repository, installing the dependencies, and the running the server. You are just a few steps away from accurate, agentic RAG over your multi-modal data!
First, let’s clone the repository from GitHub.
After cloning the repository, you will notice a databridge-core
folder in your current directory.
If you see an error like ls: databridge-core: No such file or directory
, it means that the repository is not cloned properly. Please try again.
Once you have cloned the repository, navigate into the databridge-core
folder.
Next, you need to set up a virtual environment called .venv
.
Setting Up the Environment
Installing Python Dependencies
While it is not required, we highly recommend using a virtual environment to ensure dependencies from other projects do not conflict with DataBridge. You may use managers like uv
or poetry
, but for this guide, we will use the built-in venv
module.
Now, you need to activate the virtual environment. The activation command differs based on your operating system.
After activation, your command prompt should be prefixed with (.venv)
, indicating that the virtual environment is active. Once your virtual environment is activated, you can install the required dependencies.
Setting up the Server Parameters
At this point, you may want to customize the server - such as use a different embedding model, enable or disable certain features, etc. - you can do so by editing the databridge.toml
file. You can find more details about configuration here.
DataBridge uses environment variables to manage secrets and api keys. In order to ensure that any pre-set keys are available to the server, copy the .env.example
file to .env
:
In case you’re using external models, you may have to edit the .env
file with the necessary API keys. Finally, you can run the setup script to install dependencies and setup the database and vector store.
Launching the Server
You are now ready to launch the DataBridge server! Just run the following command to start the server.
You should see the following output:
This means that the server is running on http://localhost:8000
. You can now interact with the server using the API or the Python SDK.
Next Steps
Now that you have the server running, you can explore the different ways to interact with the server.
Configure DataBridge
Configure DataBridge using the databridge.toml
file.
API
Use the API to interact with the server.
Python SDK
Use the Python SDK to interact with the server.
DataBridge UI and CLI
Use the DataBridge UI or CLI to interact with the server.