Explanation of Sampling Param semantics in documentationΒ #602
Closed
Description
π Describe the new functionality needed
Sampling params semantics
- strategy: greedy | top_k | top_p
- temperature: Optional[float] = 0.0
- top_p: Optional[float] = 0.95
- top_k: Optional[int] = 0
How does strategy relate to temperature / top_p / top_k?
What are the semantics when a user provides combinations of temperature / top_p / top_k?
π‘ Why is this needed? What if we don't build it?
Users need to understand the impact of the SamplingParams on their inference requests.
Other thoughts
No response
Metadata
Assignees
Labels
No labels