[2207.10342] Language Model Cascades #735

irthomasthomas · 2024-03-16T13:10:44Z

[2207.10342] Language Model Cascades

DESCRIPTION: Prompted models have demonstrated impressive few-shot learning abilities. Repeated interactions at test-time with a single model, or the composition of multiple models together, further expands capabilities. These compositions are probabilistic models, and may be expressed in the language of graphical models with random variables whose values are complex data types such as strings. Cases with control flow and dynamic structure require techniques from probabilistic programming, which allow implementing disparate model structures and inference strategies in a unified language. We formalize several existing techniques from this perspective, including scratchpads / chain of thought, verifiers, STaR, selection-inference, and tool use. We refer to the resulting programs as language model cascades.

URL: https://arxiv.org/abs/2207.10342
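To make the description's framing concrete, here is a minimal sketch of a chain-of-thought cascade viewed as a probabilistic program over string-valued random variables: a question Q, a latent thought T, and an answer A, with p(A | Q) obtained by summing over T. This is not the paper's code. The lookup tables, their strings and probabilities, and the function names `sample`, `chain_of_thought`, and `answer_marginal` are all assumptions made up for illustration, with the tables standing in for real language model calls.

```python
import random
from collections import defaultdict

# Hypothetical stand-ins for language model calls: each table maps a
# conditioning string to a distribution over output strings.
# All strings and probabilities are invented for illustration.
THOUGHT_GIVEN_QUESTION = {
    "What is 2 + 3?": {
        "Add 2 and 3 to get 5.": 0.8,
        "Multiply 2 and 3 to get 6.": 0.2,
    },
}
ANSWER_GIVEN_THOUGHT = {
    "Add 2 and 3 to get 5.": {"5": 0.9, "6": 0.1},
    "Multiply 2 and 3 to get 6.": {"6": 0.9, "5": 0.1},
}


def sample(dist, rng):
    """Draw one string from a {string: probability} table."""
    values = list(dist)
    weights = [dist[v] for v in values]
    return rng.choices(values, weights=weights, k=1)[0]


def chain_of_thought(question, rng):
    """Forward-sample the cascade: thought ~ p(T | Q), then answer ~ p(A | T)."""
    thought = sample(THOUGHT_GIVEN_QUESTION[question], rng)
    answer = sample(ANSWER_GIVEN_THOUGHT[thought], rng)
    return thought, answer


def answer_marginal(question):
    """Compute p(A | Q) exactly by summing over every latent thought."""
    marginal = defaultdict(float)
    for thought, p_thought in THOUGHT_GIVEN_QUESTION[question].items():
        for answer, p_answer in ANSWER_GIVEN_THOUGHT[thought].items():
            marginal[answer] += p_thought * p_answer
    return dict(marginal)


if __name__ == "__main__":
    rng = random.Random(0)
    print(chain_of_thought("What is 2 + 3?", rng))
    print(answer_marginal("What is 2 + 3?"))
```

Exact summation only works here because the toy thought space has two entries. With a real model the space of strings is far too large to enumerate, so one would presumably fall back on sampling or another approximate inference strategy.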

Suggested labels

irthomasthomas · 2024-03-16T13:10:45Z

Related content