[Speculative Decoding] Support draft model on different tensor-parallel size than target model #5414