Skip to content

Conversation

@kerthcet
Copy link
Member

@kerthcet kerthcet commented Sep 2, 2024

What this PR does / why we need it

Support the advanced inference tech speculative decoding, vllm as the first choice.

Which issue(s) this PR fixes

Part of #59

Special notes for your reviewer

Does this PR introduce a user-facing change?

Add support for SpeculativeDecoding

@InftyAI-Agent InftyAI-Agent added needs-triage Indicates an issue or PR lacks a label and requires one. needs-priority Indicates a PR lacks a label and requires one. do-not-merge/needs-kind Indicates a PR lacks a label and requires one. labels Sep 2, 2024
@kerthcet kerthcet changed the title [1/N] Add speculativeDecoding support [1/N] Add SpeculativeDecoding support Sep 2, 2024
@kerthcet
Copy link
Member Author

kerthcet commented Sep 2, 2024

/kind feature
/api-change
/lgtm
/approve

@InftyAI-Agent InftyAI-Agent added feature Categorizes issue or PR as related to a new feature. lgtm Looks good to me, indicates that a PR is ready to be merged. approved Indicates a PR has been approved by an approver from all required OWNERS files. and removed do-not-merge/needs-kind Indicates a PR lacks a label and requires one. labels Sep 2, 2024
@kerthcet
Copy link
Member Author

kerthcet commented Sep 2, 2024

/kind api-change

@InftyAI-Agent InftyAI-Agent added the api-change Indicates PR includes api change. label Sep 2, 2024
Signed-off-by: kerthcet <kerthcet@gmail.com>
@InftyAI-Agent InftyAI-Agent removed the lgtm Looks good to me, indicates that a PR is ready to be merged. label Sep 2, 2024
@kerthcet
Copy link
Member Author

kerthcet commented Sep 2, 2024

/lgtm

@InftyAI-Agent InftyAI-Agent added the lgtm Looks good to me, indicates that a PR is ready to be merged. label Sep 2, 2024
@InftyAI-Agent InftyAI-Agent merged commit a0b1925 into InftyAI:main Sep 2, 2024
@kerthcet kerthcet deleted the feat/add-scheduler branch September 2, 2024 07:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

api-change Indicates PR includes api change. approved Indicates a PR has been approved by an approver from all required OWNERS files. feature Categorizes issue or PR as related to a new feature. lgtm Looks good to me, indicates that a PR is ready to be merged. needs-priority Indicates a PR lacks a label and requires one. needs-triage Indicates an issue or PR lacks a label and requires one.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants