The ./main program currently outputs text and then quits.
How hard would it be to add a mode where it stays running, ready to accept more text piped to standard input? That would avoid the overhead of loading the model again every time the program is invoked.
Maybe it could output the generated text followed by a marker of some sort when it's done, so a wrapping process could tell when generation has finished and the program is ready to receive the next prompt for evaluation.
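
For illustration, here's a rough Python sketch of what the wrapping side could look like. Everything specific in it is assumed, not existing behaviour: the `--persistent` flag, the `__END__` marker, and the model path are all hypothetical placeholders for whatever ./main would actually implement.

```python
import subprocess

# Hypothetical: assumes ./main gains a "--persistent" flag that keeps it
# running, and that it prints a sentinel line ("__END__") after each
# completion so the wrapper knows when to stop reading.
END_MARKER = "__END__"

def start_main():
    """Launch ./main once so the model is only loaded a single time."""
    return subprocess.Popen(
        ["./main", "-m", "models/7B/ggml-model-q4_0.bin", "--persistent"],
        stdin=subprocess.PIPE,
        stdout=subprocess.PIPE,
        text=True,
        bufsize=1,  # line-buffered, so output arrives as it is generated
    )

def generate(proc, prompt):
    """Write one prompt to ./main's stdin and read lines until the marker."""
    proc.stdin.write(prompt + "\n")
    proc.stdin.flush()
    lines = []
    for line in proc.stdout:
        if line.rstrip("\n") == END_MARKER:
            break
        lines.append(line)
    return "".join(lines)

if __name__ == "__main__":
    proc = start_main()
    print(generate(proc, "Building a website can be done in 10 simple steps:"))
    print(generate(proc, "The capital of France is"))
```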
I'm interested in wrapping it in a tiny Python web server to give myself a UI for interacting with the model.
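
A minimal, stdlib-only sketch of that wrapper is below. It assumes the previous snippet is saved as `main_wrapper.py` next to this file; the endpoint shape and the JSON response are just illustrative.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical: the helpers sketched above, saved as main_wrapper.py.
from main_wrapper import start_main, generate

proc = start_main()  # load the model once, at server start-up

class PromptHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Treat the request body as the raw prompt and return the completion.
        length = int(self.headers.get("Content-Length", 0))
        prompt = self.rfile.read(length).decode("utf-8")
        completion = generate(proc, prompt)
        body = json.dumps({"completion": completion}).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

if __name__ == "__main__":
    HTTPServer(("127.0.0.1", 8080), PromptHandler).serve_forever()
```

With that running, something like `curl --data 'Once upon a time' http://127.0.0.1:8080/` would return the completion as JSON, and the model stays loaded between requests.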