Bring - Lock-Free SPSC Ring Buffer

A high-performance, lock-free Single Producer Single Consumer (SPSC) ring buffer implementation in modern C++23. Designed for maximum throughput and minimal latency in concurrent scenarios.

Features

Lock-Free: Uses atomics with acquire-release semantics for thread safety
Cache-Optimized: False sharing prevention with 64-byte alignment
Zero-Copy: Move semantics and in-place construction support
Type-Safe: C++23 concepts for compile-time validation
Header-Only: Easy integration, just include and use

Quick Start

Requirements

C++23 compatible compiler (GCC 12+, Clang 16+, MSVC 2022+)
CMake 3.20+

Installation

Header-Only

Simply copy include/bring/ring_buffer.hpp to your project:

#include <bring/ring_buffer.hpp>

bring::RingBuffer<int, 1024> buffer;

CMake Integration

include(FetchContent)
FetchContent_Declare(
  bring
  GIT_REPOSITORY https://github.com/wizenink/bring.git
  GIT_TAG master
)
FetchContent_MakeAvailable(bring)

target_link_libraries(your_target PRIVATE bring::bring)

Basic Usage

#include <bring/ring_buffer.hpp>
#include <thread>

// Create a ring buffer (capacity must be power of 2)
bring::RingBuffer<int, 64> buffer;

// Producer thread
std::thread producer([&buffer]() {
    for (int i = 0; i < 1000; ++i) {
        while (!buffer.try_push(i)) {
            std::this_thread::yield(); // Buffer full, retry
        }
    }
});

// Consumer thread
std::thread consumer([&buffer]() {
    for (int i = 0; i < 1000; ++i) {
        auto value = buffer.try_pop();
        while (!value.has_value()) {
            std::this_thread::yield(); // Buffer empty, retry
            value = buffer.try_pop();
        }
        // Process value.value()
    }
});

producer.join();
consumer.join();

API Reference

Construction

bring::RingBuffer<T, Capacity> buffer;

T: Element type (must be move-constructible and destructible)
Capacity: Buffer size (must be power of 2, > 1)

Core Operations

`try_push(item) -> bool`

Try to push an item into the buffer. Returns true on success, false if buffer is full.

if (buffer.try_push(42)) {
    // Success
}

`try_pop() -> std::optional<T>`

Try to pop an item from the buffer. Returns std::optional with value on success, std::nullopt if empty.

auto result = buffer.try_pop();
if (result.has_value()) {
    int value = result.value();
}

`try_pop_ip(T& out) -> bool`

In-place pop for better performance (avoids std::optional overhead).

int value;
if (buffer.try_pop_ip(value)) {
    // value contains popped element
}

`try_consume(Func&& processor) -> bool`

Process element with callback without extracting it.

buffer.try_consume([](int&& value) {
    // Process value
});

`emplace(Args&&... args) -> bool`

Construct element in-place. Returns true on success.

buffer.emplace(arg1, arg2, arg3);

Query Operations

`is_empty() -> bool`

Check if buffer is empty.

`is_full() -> bool`

Check if buffer is full.

`get_state() -> BufferState`

Atomically check both empty and full state from a consistent snapshot.

auto state = buffer.get_state();
if (state.empty) {
    // Buffer is empty
}
if (state.full) {
    // Buffer is full
}

Performance

Benchmarks show exceptional performance for SPSC scenarios:

Single-threaded: ~1-2 ns per push/pop operation
Multi-threaded (SPSC): ~3-10 ns per operation depending on buffer size
Cache performance: >99.999% L1 data cache hit rate (validated with cachegrind)
Throughput: Up to 300M+ operations/second on modern hardware

Run cmake --build build --target run_benchmarks to see performance on your system.

Design Decisions

Why Power-of-2 Capacity?

Enables fast modulo operations using bitwise AND: (index + 1) & (Capacity - 1)

Memory Ordering

Acquire-Release: Ensures visibility of data between producer/consumer
Relaxed: Used for same-thread operations where ordering not critical

Cache Line Alignment

Head and tail pointers are aligned to 64-byte boundaries to prevent false sharing between producer and consumer threads.

One-Slot Reservation

The buffer reserves one slot to distinguish between full and empty states (when head == tail).

Building & Testing

Running Tests

# Configure with testing enabled
cmake -B build -DBUILD_TESTING=ON

# Build
cmake --build build

# Run all tests using CMake target
cmake --build build --target run_tests

# Or run tests individually
./build/unit_tests        # Single-threaded tests
./build/mt_tests          # Multi-threaded stress tests

# Or use CTest
cd build && ctest --output-on-failure

Running Benchmarks

# Configure with benchmarks enabled
cmake -B build -DBUILD_BENCHMARKS=ON -DCMAKE_BUILD_TYPE=Release

# Build and run benchmarks using CMake target
cmake --build build --target run_benchmarks

# Or run directly
./build/benchmarks

# Run with specific parameters
./build/benchmarks --benchmark_filter=BM_SPSC --benchmark_min_time=1.0s

Cache Performance Analysis

# Build the cachegrind benchmark
cmake -B build -DBUILD_TESTING=ON

# Run with valgrind cachegrind
valgrind --tool=cachegrind --cachegrind-out-file=cachegrind.out ./build/cachegrind_bench

# View results
cg_annotate cachegrind.out | head -50

Thread Safety

SPSC Only: This ring buffer is designed for exactly one producer thread and one consumer thread. Using it with multiple producers or consumers will result in race conditions.

For MPSC/MPMC scenarios, use a different data structure or add external synchronization.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.github/workflows		.github/workflows
cmake		cmake
examples		examples
include/bring		include/bring
tests		tests
.clang-tidy		.clang-tidy
.clang_format		.clang_format
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md

License

wizenink/bring

Folders and files

Latest commit

History

Repository files navigation

Bring - Lock-Free SPSC Ring Buffer

Features

Quick Start

Requirements

Installation

Header-Only

CMake Integration

Basic Usage

API Reference

Construction

Core Operations

try_push(item) -> bool

try_pop() -> std::optional<T>

try_pop_ip(T& out) -> bool

try_consume(Func&& processor) -> bool

emplace(Args&&... args) -> bool

Query Operations

is_empty() -> bool

is_full() -> bool

get_state() -> BufferState

Performance

Design Decisions

Why Power-of-2 Capacity?

Memory Ordering

Cache Line Alignment

One-Slot Reservation

Building & Testing

Running Tests

Running Benchmarks

Cache Performance Analysis

Thread Safety

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`try_push(item) -> bool`

`try_pop() -> std::optional<T>`

`try_pop_ip(T& out) -> bool`

`try_consume(Func&& processor) -> bool`

`emplace(Args&&... args) -> bool`

`is_empty() -> bool`

`is_full() -> bool`

`get_state() -> BufferState`

Packages