Said-Akbar / vllm-rocm Public

forked from vllm-project/vllm

Notifications You must be signed in to change notification settings
Fork 2
Star 15

FORK of VLLM for AMD MI25/50/60. A high-throughput and memory-efficient inference and serving engine for LLMs

docs.vllm.ai

Apache-2.0 license

15 stars 6.2k forks Branches Tags Activity

Star

Notifications

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
cacheflow		cacheflow
csrc		csrc
.gitignore		.gitignore
README.md		README.md
setup.py		setup.py

Repository files navigation

CacheFlow

About

FORK of VLLM for AMD MI25/50/60. A high-throughput and memory-efficient inference and serving engine for LLMs

docs.vllm.ai

Readme

Apache-2.0 license

Code of conduct

Security policy

Activity

15 stars

1 watching

2 forks

Report repository

Releases

1 tags

Packages

No packages published

Languages

Python 83.5%
Cuda 11.2%
C++ 3.3%
C 0.8%
Shell 0.7%
CMake 0.4%
Dockerfile 0.1%

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CacheFlow

About

Releases

Packages

Languages

License

Said-Akbar/vllm-rocm

Folders and files

Latest commit

History

Repository files navigation

CacheFlow

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages