Moved this repo to https://github.com/poudels14/llama2_rs since this isn't a fork of the original and doesn't reflect the new commits made to llama.c for comparison.
This was derived from https://github.com/karpathy/llama2.c to run multi-threaded inference in Rust.