tensor parallel MOE implementation #2293

Closed
50 commits
92c3a3c
expert parallel moe
scv119 Dec 27, 2023
d24d9dd
update
scv119 Dec 27, 2023
408ed6d
update
scv119 Dec 28, 2023
ec913db
update
scv119 Dec 28, 2023
56f3220
update
scv119 Dec 28, 2023
6067ec6
update
scv119 Dec 28, 2023
869e0c5
update
scv119 Dec 28, 2023
ea44a0f
update
scv119 Dec 28, 2023
90e223a
update
scv119 Dec 28, 2023
69b5a55
update
scv119 Dec 28, 2023
357b046
update
scv119 Dec 28, 2023
86f7e1e
update
scv119 Dec 28, 2023
40f0f59
update
scv119 Dec 28, 2023
baa90d2
update
scv119 Dec 29, 2023
4367d6a
update
scv119 Dec 29, 2023
b4df657
update
scv119 Dec 29, 2023
1ac4890
update
scv119 Dec 29, 2023
14f29b3
update
scv119 Dec 30, 2023
b3a1b77
update
scv119 Dec 30, 2023
8045832
update
scv119 Dec 30, 2023
5eb304a
update
scv119 Dec 30, 2023
a172a7c
update
scv119 Dec 30, 2023
a56b2df
update
scv119 Dec 30, 2023
ca7110e
update
scv119 Dec 30, 2023
82de999
update
scv119 Dec 30, 2023
20fcbc0
update
scv119 Dec 30, 2023
92709c1
update
scv119 Dec 30, 2023
22daa9b
update
scv119 Dec 31, 2023
42c659d
update
scv119 Dec 31, 2023
4e84e02
udpate
scv119 Dec 31, 2023
d586474
update
scv119 Dec 31, 2023
15f820f
update
scv119 Dec 31, 2023
e0d9440
update
scv119 Dec 31, 2023
817e0bb
update
scv119 Dec 31, 2023
285e7af
Apply suggestions from code review
scv119 Jan 4, 2024
d850834
update
scv119 Jan 4, 2024
920209f
update
scv119 Jan 4, 2024
fdd5b77
update
scv119 Jan 4, 2024
17d17f1
update
scv119 Jan 4, 2024
6c60c3b
update
scv119 Jan 4, 2024
9749e64
update
scv119 Jan 4, 2024
4bee472
update
scv119 Jan 4, 2024
0fe75f3
update
scv119 Jan 4, 2024
cce13fb
update
scv119 Jan 4, 2024
f0f1d5e
update
scv119 Jan 4, 2024
42b3cc3
update
scv119 Jan 4, 2024
0a8069b
update
scv119 Jan 4, 2024
f955162
update
scv119 Jan 4, 2024
1089dd8
reorder operations
scv119 Jan 17, 2024
43ec685
Merge remote-tracking branch 'origin/main' into moe
scv119 Jan 17, 2024
update
scv119 committed Dec 31, 2023
commit 817e0bb981b37c673b91b723645090645bd69726
2 changes: 1 addition & 1 deletion csrc/pybind.cpp
@@ -58,7 +58,7 @@ PYBIND11_MODULE(TORCH_EXTENSION_NAME, m) {
   ops.def(
     "bincount",
     &vllm_bincount,
-    "Gather key and value from the cache into contiguous QKV tensors");
+    "cuda-graph compatible bincount implementation");

   // Cache ops
   pybind11::module cache_ops = m.def_submodule("cache_ops", "vLLM cache ops");
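
For readers unfamiliar with the op whose docstring this commit corrects, here is a minimal pure-Python sketch of what a bincount with a fixed output length computes. It is not the `vllm_bincount` kernel: the name `bincount_fixed`, the `num_bins` parameter, and the choice to ignore out-of-range values are all assumptions made for illustration. The fixed output length is shown because CUDA graph capture generally needs shapes that do not depend on the input data, which is presumably what "cuda-graph compatible" refers to here.

```python
def bincount_fixed(values, num_bins):
    """Count how often each integer in [0, num_bins) occurs in `values`.

    Hypothetical reference helper, not part of vLLM. The output length is
    always `num_bins`, independent of the data, so its shape is known ahead
    of time.
    """
    counts = [0] * num_bins
    for v in values:
        # Assumption: values outside [0, num_bins) are silently ignored.
        if 0 <= v < num_bins:
            counts[v] += 1
    return counts


# Example: expert index chosen for each of six tokens, with four experts.
expert_ids = [0, 2, 2, 1, 2, 0]
tokens_per_expert = bincount_fixed(expert_ids, num_bins=4)
print(tokens_per_expert)  # [2, 1, 3, 0]
```

In an MoE layer, a count like this gives the number of tokens routed to each expert. Expert 3 receives none in the example but still gets a slot in the output.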