
Dria Compute Node

Dria Compute Node serves the computation results within Dria Knowledge Network.


About

A Dria Compute Node is a unit of computation within the Dria Knowledge Network. Its purpose is to process tasks given by the Dria Admin Node. To get started, set up your environment and then see usage to run the node.

Tasks

Compute nodes can technically perform any arbitrary task: computing the square root of a given number, generating LLM outputs from a given prompt, or validating an LLM's output against knowledge available on the web accessed via tools.

  • Ping/Pong: The Dria Admin Node broadcasts ping messages at a set interval; the compute node must respond with a pong so that it is included in the list of available nodes for task assignment. Task assignment respects the models advertised within the pong message, e.g. if a task requires gpt-4o and you are running phi3, you won't be selected for that task.

  • Workflows: Each task is given in the form of a workflow, based on Ollama Workflows (see repository for more information). In simple terms, each workflow defines the agentic behavior of an LLM, all captured in a single JSON file, and can represent things ranging from simple LLM generations to iterative web searching.

Requirements

Software

You need the following applications to run a compute node:

  • Git: We will use git to clone the repository from GitHub, and pull latest changes for updates later.
  • Docker: Our services will make use of Docker so that the node can run on any machine.

Tip

You can check if you have these via:

which git
which docker

Hardware

For overall specifications about required CPU and RAM, please refer to dkn-node-specs.

In general, if you are using Ollama you will need enough memory to run large models locally, which depends on the size of the models you are willing to run. If you are in a memory-constrained environment, you can opt to use OpenAI models instead.

Note

The compute node is a lightweight process, but you may see increased memory & CPU usage during the initial testing phases, due to various protocol-level operations with the growing network size.

Setup

To be able to run a node, we need to make a few simple preparations. Follow the steps below one by one.

1. Clone the repository

This repository has the necessary setup to run the node, so start by cloning it using the command below:

git clone https://github.com/firstbatchxyz/dkn-compute-node
cd dkn-compute-node

2. Prepare Environment Variables

Dria Compute Node makes use of several environment variables. Create a .env file, and copy the environment variables as given in .env.example. We will fill out the missing parts in a moment.

cp .env.example .env

Note

DKN_ADMIN_PUBLIC_KEY is used to verify that the tasks are given by certain nodes, so that your node does not work for tasks given to the network by untrusted people. You don't need to change this, simply copy and paste it to your .env.

Tip

While adding anything to your .env, you can do it without leaving the terminal. For example, to set some KEY to VALUE, you can do:

echo "KEY=VALUE" >> .env

If you would like to view the .env without leaving the terminal, you can do:

cat .env

3. Prepare Ethereum Wallet

Dria makes use of an Ethereum wallet, which is the recipient of your hard-earned rewards! Place your private key at DKN_WALLET_SECRET_KEY in .env without the 0x prefix. It should look something like:

DKN_WALLET_SECRET_KEY=ac0974bec39a17e36ba4a6b4d238ff944bacb478cbed5efcae784d7bf4f2ff80

Caution

Always make sure your private key is within the .gitignore'd .env file, and nowhere else! To be even safer, you can use a throwaway wallet; you can always transfer your rewards to a main wallet afterwards.
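If you prefer a fresh throwaway key, one option (a sketch, not the only way) is to generate 32 random bytes with OpenSSL; any standard wallet tool works just as well, as long as you export the private key without the 0x prefix.

# prints a random 64-character hex value you can use as DKN_WALLET_SECRET_KEY
openssl rand -hex 32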

4. Setup LLM Provider

For the final step, we need to make sure we can serve LLM requests.

For OpenAI

If you will be using OpenAI to serve its models, you need to have an API key in the environment. Simply set the key within your .env:

OPENAI_API_KEY=<YOUR_KEY>
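As an optional sanity check (not required by the node), you can verify that the key works by listing the models it has access to:

# should return a JSON list of models if the key is valid
curl https://api.openai.com/v1/models -H "Authorization: Bearer $OPENAI_API_KEY"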

For Ollama

First, you have to install Ollama; see their download page. Then, pull a small embedding model that is used internally:

ollama pull hellord/mxbai-embed-large-v1:f16

For the models that you choose (see the list of models just below), you can download them with the same command. Note that if your model size is large, pulling it may take a while.

# example for phi3:3.8b
ollama pull phi3:3.8b
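To confirm which models are present locally, you can list them:

# lists the models pulled so far, including the embedding model above
ollama list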

Tip

Alternatively, you can set OLLAMA_AUTO_PULL=true in the .env so that the compute node will always download the missing models for you.
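For example, you can append it to your .env right from the terminal, as in the earlier tip:

echo "OLLAMA_AUTO_PULL=true" >> .env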

Optional Services

Based on the presence of API keys, Ollama Workflows may use premium services instead of the free alternatives, e.g. Serper instead of DuckDuckGo, or Jina without a rate limit instead of the rate-limited version. Add these within your .env as:

SERPER_API_KEY=<key-here>
JINA_API_KEY=<key-here>

Usage

With all setup steps above completed, we are ready to start a node!

1. Choose Model(s)

Based on the resources of your machine, you must decide which models you will be running locally. For example, you can use OpenAI with their models and run nothing locally at all; or you can use Ollama with several models loaded to disk, with only one loaded to memory during its respective task. Available models (see here for the latest) are:

Ollama Models

  • adrienbrault/nous-hermes2theta-llama3-8b:q8_0
  • phi3:14b-medium-4k-instruct-q4_1
  • phi3:14b-medium-128k-instruct-q4_1
  • phi3:3.8b
  • llama3.1:latest
  • phi3.5:3.8b
  • phi3.5:3.8b-mini-instruct-fp16

OpenAI Models

  • gpt-3.5-turbo
  • gpt-4-turbo
  • gpt-4o
  • gpt-4o-mini

Tip

If you are using Ollama, make sure you have pulled the required models, as specified in the section above!

2. Start Docker

Our node will be running within a Docker container, so we should make sure that Docker is running before the next step. You can launch Docker via its desktop application, or a command such as:

sudo systemctl start docker

Note

You don't need to do this step if Docker is already running in the background.
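If you are unsure whether Docker is up, a quick check is the following, which errors out when the daemon is not running:

# prints daemon details if Docker is running
docker info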

3. Run Node

It's time to run our compute node. We have a starter script that makes this much easier. See the available commands with:

chmod +x start.sh
./start.sh --help

Simply run the script with the model names provided, such as:

./start.sh -m=llama3.1:latest -m=gpt-3.5-turbo

The start script will run the containers in the background. You can check their logs either via the terminal or from Docker Desktop.
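For example, you can confirm from another terminal that the containers are up:

# lists running containers; look for the compute node (and Ollama, if used) services
docker ps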

Running in Debug Mode

To print DEBUG-level logs for the compute node, you can add the --dev argument to the start script. For example:

./start.sh -m=gpt-4o-mini --dev

Running in debug mode will also allow you to see behind the scenes of Ollama Workflows, i.e. you can see the reasoning of the LLM as it executes the task.

Similarly, you can run in trace mode with --trace to see trace logs, which cover low-level logs from the p2p client.
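For example (the model here is only illustrative):

./start.sh -m=gpt-4o-mini --trace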

4. Looking at Logs

To see your logs, you can go to Docker Desktop and find dkn-compute-node among the running containers. There, open the containers within the compose project (click on > to the left) and click on any of the containers to see its logs.

Alternatively, you can use docker compose logs such as below:

docker compose logs -f compute  # compute node logs
docker compose logs -f ollama   # ollama logs

The -f option lets you follow the logs from the terminal. If you prefer to simply check the latest logs, you can use a command such as:

# logs from last 1 hour
docker compose logs --since=1h compute

# logs from last 30 minutes
docker compose logs --since=30m compute

5. Stopping the Node

When you start your node with ./start.sh, it will wait in the same terminal for you to press CTRL+C before stopping. Once you do that, the containers will be stopped and removed. You can also kill the containers manually; pressing CTRL+C afterwards will do nothing in that case.

Note

Sometimes the node may not exit immediately while executing a task; if you REALLY need to quit the process, you can kill it manually.
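If you do end up stopping things by hand, the containers can also be brought down with Docker Compose from within the repository directory (a sketch, assuming the default compose file is used):

# stops and removes the node's containers
docker compose down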

Using Ollama

If you don't have Ollama installed, you can ignore this section.

If you already have Ollama installed (e.g. via brew install ollama), you must indicate that the node should use that installation instead of a Docker container. To do this, provide the argument --local-ollama=true, which is true by default. With this, the compute node will use the Ollama server on your machine instead of a Docker container.

If the Ollama server is not running, the start script will initiate it with ollama serve and terminate it when the node is being stopped.

  • If --local-ollama=false or the local Ollama server is not reachable, the compute node will use a Docker Compose service for it instead.
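As a quick check, you can see whether a local Ollama server is reachable (assuming the default port 11434):

# prints "Ollama is running" if the local server is up
curl http://localhost:11434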

Tip

There are three Docker Compose Ollama options: ollama-cpu, ollama-cuda, and ollama-rocm. The start script will decide which option to use based on the host machine's GPU specifications.

# Run with local ollama
./start.sh -m=phi3 --local-ollama=true

Additional Static Nodes

You can add additional relay nodes & bootstrap nodes from the environment, using the DKN_RELAY_NODES and DKN_BOOTSTRAP_NODES variables respectively. Simply write the Multiaddr strings of the static nodes as comma-separated values, and the compute node will pick them up at start-up.
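For example, your .env entries could look like the following (the addresses are placeholders, not real nodes):

DKN_BOOTSTRAP_NODES=/ip4/203.0.113.5/tcp/4001/p2p/<peer-id>,/ip4/203.0.113.6/tcp/4001/p2p/<peer-id>
DKN_RELAY_NODES=/ip4/203.0.113.7/tcp/4001/p2p/<peer-id>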

Releases

We have 3 types of releases:

  • Versioned: With each release, a versioned image is deployed on Docker Hub with the version tag :vX.X.X.
  • Latest: As usual, the latest version is kept under :latest tag.
  • Development: On each push to master, a new image is created with :unstable.

See deployed images on Docker Hub.
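For example, to pull a tagged image yourself (the image name here is assumed to mirror the repository name; check Docker Hub for the exact name and tags):

docker pull firstbatchxyz/dkn-compute-node:latest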

Development

If you would like to add a feature (with respect to its respective issue) or a bug fix, feel free to fork & create a PR!

If you would like to run the node from source (which is really handy during development), you can use our shorthand scripts within the Makefile. You can see the available commands with:

make help

You will need OpenSSL installed as well; see the shorthand commands here. With Ollama running elsewhere (if you are using it) or with an OpenAI API key provided, you can run the compute node with:

make run      # info-level logs
make debug    # debug-level logs

Testing & Benchmarking

You can run the tests as follows:

make test         # unit tests
make test-ollama  # Ollama tests (requires a running Ollama client)

To measure the speed of some Ollama models, we have a benchmark that runs a few prompts through those models:

cargo run --release --example ollama

You can also benchmark these models using a larger task list at a given path, with the following command:

JSON_PATH="./path/to/your.json" cargo run --release --example ollama

Documentation

Open crate docs using:

make docs

Styling

Lint and format with:

make lint   # clippy
make format # rustfmt

Profiling

We would like to profile both CPU and memory usage.

To create a flamegraph of the application, do:

make profile-cpu

This will create a profiling build that inherits release mode, except with debug information.

To profile memory usage, we make use of cargo-instruments.
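A typical invocation looks like the following (macOS only, and the exact template name may differ on your setup):

# profiles heap allocations using the Instruments allocations template
cargo instruments -t alloc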

Note

CPU profiling may require super-user access.

License

This project is licensed under the Apache License 2.0.
