[Feature] [Spec Decode]: Simplify the Use of Eagle Spec Decode

### 🚀 The feature, motivation and pitch

As documented [here](https://docs.vllm.ai/en/latest/features/spec_decode.html#speculating-using-eagle-based-draft-models), currently, users need to change the config and checkpoint using the [script](https://gist.github.com/abhigoyal1997/1e7a4109ccb7704fbc67f625e86b2d6d), which is tedious.
In the ideal case, vllm should load the eagle head automatically from huggingface or locally without extra conversion.
This can be achieved by:
1. Identify this is a eagle head from the model name.
2. Identify the LM head from the original base model.

After 1 and 2, vllm should be able to use eagle speculative decoding without user conversion.

### Alternatives

_No response_

### Additional context

_No response_

### Before submitting a new issue...

- [X] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Feature] [Spec Decode]: Simplify the Use of Eagle Spec Decode #11943

🚀 The feature, motivation and pitch

Alternatives

Additional context

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[Feature] [Spec Decode]: Simplify the Use of Eagle Spec Decode #11943

Description

🚀 The feature, motivation and pitch

Alternatives

Additional context

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions