enable prompt caching by tsuz · Pull Request #36 · tsuz/flightdeck

tsuz · 2026-05-25T13:30:01Z

Fixes #29

Copilot

Pull request overview

Enables Anthropic prompt caching by introducing a PROMPT_CACHING flag and attaching a cache_control breakpoint to the Claude system prompt, with accompanying documentation and tests. The Java Think Consumer also updates cost calculation to account for cached-token billing reported by Anthropic.

Changes:

Add PROMPT_CACHING env/config flag and emit system as content blocks with optional cache_control: { type: "ephemeral" }.
Update Java Claude response parsing/cost calculation to include cache read/write token usage.
Add/update SDK runners and documentation to expose the prompt caching capability.

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
think/think-consumer/src/test/java/io/flightdeck/think/service/ClaudeApiServiceTest.java	Adds unit tests validating `cache_control` inclusion/omission in `system` blocks.
think/think-consumer/src/main/java/io/flightdeck/think/service/ClaudeApiService.java	Adds `buildSystemBlocks`, uses it in requests, and includes cached-token billing in cost calculation/logging.
think/think-consumer/src/main/java/io/flightdeck/think/config/AppConfig.java	Introduces `PROMPT_CACHING` configuration flag.
sdk/typescript/src/think-consumer-runner.ts	Adds TS runner support for prompt caching (but needs cached-token cost handling).
sdk/python/flightdeck_sdk/think_consumer_runner.py	Adds Python runner support for prompt caching (but needs cached-token cost handling).
README.md	Documents `PROMPT_CACHING` env var behavior and constraints.
memoir/update-memoir-consumer/src/main/java/io/flightdeck/memoir/service/ClaudeMemoirService.java	Adds prompt-caching-aware `system` request construction.
memoir/update-memoir-consumer/src/main/java/io/flightdeck/memoir/config/AppConfig.java	Introduces `PROMPT_CACHING` configuration flag for memoir consumer.
architecture/models.md	Adds architecture/schema documentation including the new env var.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+// Token pricing from environment variables (per-token, not per-million)
+const INPUT_TOKEN_PRICE = process.env.INPUT_TOKEN_PRICE
+  ? parseFloat(process.env.INPUT_TOKEN_PRICE)
+  : null;
+const OUTPUT_TOKEN_PRICE = process.env.OUTPUT_TOKEN_PRICE
+  ? parseFloat(process.env.OUTPUT_TOKEN_PRICE)
+  : null;


+    const usage = (response.usage as Record<string, number>) || {};
+    const inputTokens = usage.input_tokens || 0;
+    const outputTokens = usage.output_tokens || 0;
+    const cost =
+      INPUT_TOKEN_PRICE != null && OUTPUT_TOKEN_PRICE != null
+        ? (inputTokens / 1_000_000) * INPUT_TOKEN_PRICE + (outputTokens / 1_000_000) * OUTPUT_TOKEN_PRICE
+        : null;
+


    def _call_claude(self, system_prompt: str, messages: list[dict], *, include_tools: bool = True) -> dict:
+        system: Any = system_prompt
+        if self._config.prompt_caching:
+            # Add a cache_control breakpoint so the static prefix can be cached.
+            system = [
+                {
+                    "type": "text",
+                    "text": system_prompt,
+                    "cache_control": {"type": "ephemeral"},
+                }
+            ]
+
        body: dict[str, Any] = {
            "model": self._config.claude_model,
            "max_tokens": self._config.claude_max_tokens,
-            "system": system_prompt,
+            "system": system,
            "messages": messages,
        }


enable cache control

4f8bf98

tsuz changed the title ~~enable cache control~~ enable prompt caching May 25, 2026

tsuz added 3 commits May 25, 2026 23:41

add prompt cache from env var

9fa0e90

test: verify PROMPT_CACHING toggles cache_control in think-consumer

d3ec121

ading architecgtur

3388e01

tsuz requested a review from Copilot May 25, 2026 14:57

Copilot started reviewing on behalf of tsuz May 25, 2026 14:57 View session

Copilot AI reviewed May 25, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

enable prompt caching#36

enable prompt caching#36
tsuz wants to merge 4 commits into
mainfrom
feat/cache-control

tsuz commented May 25, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

tsuz commented May 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

tsuz commented May 25, 2026 •

edited

Loading