Direct Installation

Prerequisites

Cloning the Repository

To get started with DataBridge, we need to first setup the server. This involves cloning the repository, installing the dependencies, and the running the server. You are just a few steps away from accurate, agentic RAG over your multi-modal data!

First, let’s clone the repository from GitHub.

git clone https://github.com/databridge-org/databridge-core.git

After cloning the repository, you will notice a databridge-core folder in your current directory.

ls databridge-core

If you see an error like ls: databridge-core: No such file or directory, it means that the repository is not cloned properly. Please try again.

Once you have cloned the repository, navigate into the databridge-core folder.

cd databridge-core

Next, you need to set up a virtual environment called .venv.

Setting Up the Environment

Installing Python Dependencies

While it is not required, we highly recommend using a virtual environment to ensure dependencies from other projects do not conflict with DataBridge. You may use managers like uv or poetry, but for this guide, we will use the built-in venv module.

python3.12 -m venv .venv

Now, you need to activate the virtual environment. The activation command differs based on your operating system.

source .venv/bin/activate

After activation, your command prompt should be prefixed with (.venv), indicating that the virtual environment is active. Once your virtual environment is activated, you can install the required dependencies.

pip install -r requirements.txt

Setting up the Server Parameters

At this point, you may want to customize the server - such as use a different embedding model, enable or disable certain features, etc. - you can do so by editing the databridge.toml file. You can find more details about configuration here.

DataBridge uses environment variables to manage secrets and api keys. In order to ensure that any pre-set keys are available to the server, copy the .env.example file to .env:

cp .env.example .env

In case you’re using external models, you may have to edit the .env file with the necessary API keys. Finally, you can run the setup script to install dependencies and setup the database and vector store.

python quick_setup.py

Launching the Server

You are now ready to launch the DataBridge server! Just run the following command to start the server.

python start_server.py

You should see the following output:

INFO:     Started server process [15169]
INFO:     Waiting for application startup.
INFO:     Application startup complete.
INFO:     Uvicorn running on http://localhost:8000 (Press CTRL+C to quit)

This means that the server is running on http://localhost:8000. You can now interact with the server using the API or the Python SDK.

Next Steps

Now that you have the server running, you can explore the different ways to interact with the server.

Installing via Ollama