Skip to content

Explanation of Sampling Param semantics in documentationΒ #602

Closed
@mattf

Description

πŸš€ Describe the new functionality needed

Sampling params semantics

  • strategy: greedy | top_k | top_p
  • temperature: Optional[float] = 0.0
  • top_p: Optional[float] = 0.95
  • top_k: Optional[int] = 0

How does strategy relate to temperature / top_p / top_k?

What are the semantics when a user provides combinations of temperature / top_p / top_k?

πŸ’‘ Why is this needed? What if we don't build it?

Users need to understand the impact of the SamplingParams on their inference requests.

Other thoughts

No response

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions