Skip to content

harrisonvanderbyl/rwkv.hpp

Repository files navigation

RWKV.hpp

Header only library for cpu inference with rwkv v5

Todos and stuff

  • AVX512
  • AVX512-skylake
  • AVX2
  • NEON(Arm)
  • Non-simd
  • FP32
  • BF16
  • FP16
  • INT8
  • Cuda
  • Rocm
  • Vulkan
  • Batch Inference
  • Sequence Inference ( state generation )
  • Static memory usage via buffers
  • Fixing memory leakage
  • Example app
  • Windows build .bat
  • Mac build

Quickstart

  1. go to ./models/
  2. Download a model from https://huggingface.co/BlinkDL/rwkv-5-world/tree/main
  3. Edit convert.py to point to the download model
  4. run convert.py (your converted model is placed into ./build/)
  5. run ./build.sh
  6. go to ./build
  7. from the terminal, run ./rwkv

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages