Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
18 commits
Select commit Hold shift + click to select a range
3cfcfa3
feat(tts): add Qwen3-TTS as third TTS backend
basnijholt Jan 25, 2026
5e77f44
Merge 3cfcfa347b0acd0224989fb0a67695204ac91d43 into cb4293c67a4d41b1f…
basnijholt Jan 25, 2026
7b69915
Update auto-generated docs
github-actions[bot] Jan 25, 2026
6aff91a
fix(tts): add qwen-tts to requires_extras and add tests
basnijholt Jan 25, 2026
58f14a0
fix(tts): address PR review feedback for Qwen backend
basnijholt Jan 25, 2026
eeb2f5a
fix(docker): use COPY --chmod to avoid duplicate layer in transcripti…
basnijholt Jan 25, 2026
6c04db2
fix(tts): remove unused QWEN_DEFAULT_SAMPLE_RATE constant
basnijholt Jan 25, 2026
7a8c37e
docs(skill): emphasize --from flag for branch-based work (#312)
basnijholt Jan 25, 2026
e0925e3
fix(tts): address PR review feedback - use helpers and clean up deps
basnijholt Jan 25, 2026
d44164e
fix(tts): preserve original _extras.json format and ordering
basnijholt Jan 25, 2026
f49828e
Merge d44164e6324f637a32650f149e8abcba3edcebd0 into 7a8c37ed1d09bfeff…
basnijholt Jan 25, 2026
782faaa
Update auto-generated docs
github-actions[bot] Jan 25, 2026
c7a816f
fix(tts): correct Qwen model size to ~4GB
basnijholt Jan 25, 2026
cf357c3
fix(tts): move first-party imports to module level in cli.py
basnijholt Jan 25, 2026
0f91e53
fix(tts): move pcm_to_wav import to module level
basnijholt Jan 25, 2026
2bc9b08
fix(tts): remove flash_attention_2 requirement (not always available)
basnijholt Jan 25, 2026
3c5eab9
perf(tts): add torch.compile() optimization for Qwen backend
basnijholt Jan 25, 2026
7fcfefa
perf(tts): add Flash Attention 2 support for Qwen backend
basnijholt Jan 25, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
13 changes: 9 additions & 4 deletions .claude-plugin/skills/agent-cli-dev/SKILL.md
Original file line number Diff line number Diff line change
Expand Up @@ -34,14 +34,19 @@ Do NOT spawn when:

## Core command

For short prompts:
For new features (starts from origin/main):
```bash
agent-cli dev new <branch-name> --agent --prompt "Fix the login bug"
agent-cli dev new <branch-name> --agent --prompt "Implement the new feature..."
```

For work on current branch (review, test, fix) - use `--from HEAD`:
```bash
agent-cli dev new <branch-name> --from HEAD --agent --prompt "Review/test/fix..."
```

For longer prompts (recommended for multi-line or complex instructions):
```bash
agent-cli dev new <branch-name> --agent --prompt-file path/to/prompt.md
agent-cli dev new <branch-name> --from HEAD --agent --prompt-file path/to/prompt.md
```

This creates:
Expand Down Expand Up @@ -129,7 +134,7 @@ Each agent works independently in its own branch. Results can be reviewed and me
| `--agent` / `-a` | Start AI coding agent after creation |
| `--prompt` / `-p` | Initial prompt for the agent (short prompts only) |
| `--prompt-file` / `-P` | Read prompt from file (recommended for longer prompts) |
| `--from` / `-f` | Base branch (default: origin/main) |
| `--from` / `-f` | Base ref (default: origin/main). **Use `--from HEAD` when reviewing/testing current branch!** |
| `--with-agent` | Specific agent: claude, aider, codex, gemini |
| `--agent-args` | Extra arguments for the agent |

Expand Down
69 changes: 65 additions & 4 deletions .claude-plugin/skills/agent-cli-dev/examples.md
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,68 @@ Each prompt for a spawned agent should follow this structure:
5. **Focused scope** - Keep solutions minimal, implement only what's requested
6. **Structured report** - Write conclusions to `.claude/REPORT.md`

## Scenario 1: Multi-feature implementation
## Scenario 1: Code review of current branch

**User request**: "Review the code on this branch" or "Spawn an agent to review my changes"

**CRITICAL**: Use `--from HEAD` (or the branch name) so the review agent has access to the changes!

```bash
# Review the current branch - MUST use --from HEAD
agent-cli dev new review-changes --from HEAD --agent --prompt "Review the code changes on this branch.

<workflow>
- Run git diff origin/main...HEAD to identify all changes
- Read changed files in parallel to understand context
- Check CLAUDE.md for project-specific guidelines
- Test changes with real services if applicable
</workflow>

<code_exploration>
- Use git diff origin/main...HEAD to see the full diff
- Read each changed file completely before judging
- Look at surrounding code to understand patterns
- Check existing tests to understand expected behavior
</code_exploration>

<context>
Code review catches issues before merge. Focus on real problems - not style nitpicks. Apply these criteria:
- Code cleanliness: Is the implementation clean and well-structured?
- DRY principle: Does it avoid duplication?
- Code reuse: Are there parts that should be reused from other places?
- Organization: Is everything in the right place?
- Consistency: Is it in the same style as other parts of the codebase?
- Simplicity: Is it over-engineered? Remember KISS and YAGNI. No dead code paths, no defensive programming.
- No pointless wrappers: Functions that just call another function should be inlined.
- User experience: Does it provide a good user experience?
- Tests: Are tests meaningful or just trivial coverage?
- Live tests: Test changes with real services if applicable.
- Rules: Does the code follow CLAUDE.md guidelines?
</context>

<scope>
Review only - identify issues but do not fix them. Write findings to report.
</scope>

<report>
Write your review to .claude/REPORT.md:

## Summary
[Overall assessment of the changes]

## Issues Found
| Severity | File:Line | Issue | Suggestion |
|----------|-----------|-------|------------|
| Critical/High/Medium/Low | path:123 | description | fix |

## Positive Observations
[What's well done]
</report>"
```

**Common mistake**: Forgetting `--from HEAD` means the agent starts from `origin/main` and won't see any of the branch changes!

## Scenario 2: Multi-feature implementation

**User request**: "Implement user auth, payment processing, and email notifications"

Expand Down Expand Up @@ -169,7 +230,7 @@ After verifying tests pass, write to .claude/REPORT.md with summary, files chang
</report>"
```

## Scenario 2: Test-driven development
## Scenario 3: Test-driven development

**User request**: "Add a caching layer with comprehensive tests"

Expand Down Expand Up @@ -289,7 +350,7 @@ After ALL tests pass, write to .claude/REPORT.md:
</report>"
```

## Scenario 3: Large refactoring by module
## Scenario 4: Large refactoring by module

**User request**: "Refactor the API to use consistent error handling"

Expand Down Expand Up @@ -357,7 +418,7 @@ After tests pass and linting is clean, write to .claude/REPORT.md:
</report>"
```

## Scenario 4: Documentation and implementation in parallel
## Scenario 5: Documentation and implementation in parallel

**User request**: "Add a plugin system with documentation"

Expand Down
13 changes: 9 additions & 4 deletions .claude/skills/agent-cli-dev/SKILL.md
Original file line number Diff line number Diff line change
Expand Up @@ -34,14 +34,19 @@ Do NOT spawn when:

## Core command

For short prompts:
For new features (starts from origin/main):
```bash
agent-cli dev new <branch-name> --agent --prompt "Fix the login bug"
agent-cli dev new <branch-name> --agent --prompt "Implement the new feature..."
```

For work on current branch (review, test, fix) - use `--from HEAD`:
```bash
agent-cli dev new <branch-name> --from HEAD --agent --prompt "Review/test/fix..."
```

For longer prompts (recommended for multi-line or complex instructions):
```bash
agent-cli dev new <branch-name> --agent --prompt-file path/to/prompt.md
agent-cli dev new <branch-name> --from HEAD --agent --prompt-file path/to/prompt.md
```

This creates:
Expand Down Expand Up @@ -129,7 +134,7 @@ Each agent works independently in its own branch. Results can be reviewed and me
| `--agent` / `-a` | Start AI coding agent after creation |
| `--prompt` / `-p` | Initial prompt for the agent (short prompts only) |
| `--prompt-file` / `-P` | Read prompt from file (recommended for longer prompts) |
| `--from` / `-f` | Base branch (default: origin/main) |
| `--from` / `-f` | Base ref (default: origin/main). **Use `--from HEAD` when reviewing/testing current branch!** |
| `--with-agent` | Specific agent: claude, aider, codex, gemini |
| `--agent-args` | Extra arguments for the agent |

Expand Down
69 changes: 65 additions & 4 deletions .claude/skills/agent-cli-dev/examples.md
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,68 @@ Each prompt for a spawned agent should follow this structure:
5. **Focused scope** - Keep solutions minimal, implement only what's requested
6. **Structured report** - Write conclusions to `.claude/REPORT.md`

## Scenario 1: Multi-feature implementation
## Scenario 1: Code review of current branch

**User request**: "Review the code on this branch" or "Spawn an agent to review my changes"

**CRITICAL**: Use `--from HEAD` (or the branch name) so the review agent has access to the changes!

```bash
# Review the current branch - MUST use --from HEAD
agent-cli dev new review-changes --from HEAD --agent --prompt "Review the code changes on this branch.

<workflow>
- Run git diff origin/main...HEAD to identify all changes
- Read changed files in parallel to understand context
- Check CLAUDE.md for project-specific guidelines
- Test changes with real services if applicable
</workflow>

<code_exploration>
- Use git diff origin/main...HEAD to see the full diff
- Read each changed file completely before judging
- Look at surrounding code to understand patterns
- Check existing tests to understand expected behavior
</code_exploration>

<context>
Code review catches issues before merge. Focus on real problems - not style nitpicks. Apply these criteria:
- Code cleanliness: Is the implementation clean and well-structured?
- DRY principle: Does it avoid duplication?
- Code reuse: Are there parts that should be reused from other places?
- Organization: Is everything in the right place?
- Consistency: Is it in the same style as other parts of the codebase?
- Simplicity: Is it over-engineered? Remember KISS and YAGNI. No dead code paths, no defensive programming.
- No pointless wrappers: Functions that just call another function should be inlined.
- User experience: Does it provide a good user experience?
- Tests: Are tests meaningful or just trivial coverage?
- Live tests: Test changes with real services if applicable.
- Rules: Does the code follow CLAUDE.md guidelines?
</context>

<scope>
Review only - identify issues but do not fix them. Write findings to report.
</scope>

<report>
Write your review to .claude/REPORT.md:

## Summary
[Overall assessment of the changes]

## Issues Found
| Severity | File:Line | Issue | Suggestion |
|----------|-----------|-------|------------|
| Critical/High/Medium/Low | path:123 | description | fix |

## Positive Observations
[What's well done]
</report>"
```

**Common mistake**: Forgetting `--from HEAD` means the agent starts from `origin/main` and won't see any of the branch changes!

## Scenario 2: Multi-feature implementation

**User request**: "Implement user auth, payment processing, and email notifications"

Expand Down Expand Up @@ -169,7 +230,7 @@ After verifying tests pass, write to .claude/REPORT.md with summary, files chang
</report>"
```

## Scenario 2: Test-driven development
## Scenario 3: Test-driven development

**User request**: "Add a caching layer with comprehensive tests"

Expand Down Expand Up @@ -289,7 +350,7 @@ After ALL tests pass, write to .claude/REPORT.md:
</report>"
```

## Scenario 3: Large refactoring by module
## Scenario 4: Large refactoring by module

**User request**: "Refactor the API to use consistent error handling"

Expand Down Expand Up @@ -357,7 +418,7 @@ After tests pass and linting is clean, write to .claude/REPORT.md:
</report>"
```

## Scenario 4: Documentation and implementation in parallel
## Scenario 5: Documentation and implementation in parallel

**User request**: "Add a plugin system with documentation"

Expand Down
1 change: 1 addition & 0 deletions agent_cli/_extras.json
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,7 @@
"speed": ["Audio speed adjustment (audiostretchy)", ["audiostretchy"]],
"piper": ["Local Piper TTS", ["piper"]],
"kokoro": ["Kokoro neural TTS", ["kokoro"]],
"qwen-tts": ["Qwen3-TTS backend (multilingual)", ["qwen_tts"]],
"vad": ["Voice Activity Detection (silero-vad)", ["silero_vad"]],
"faster-whisper": ["Whisper ASR (CUDA/CPU)", ["faster_whisper"]],
"mlx-whisper": ["Whisper ASR (Apple Silicon)", ["mlx_whisper"]]
Expand Down
2 changes: 1 addition & 1 deletion agent_cli/_requirements/kokoro.txt
Original file line number Diff line number Diff line change
Expand Up @@ -359,7 +359,7 @@ tqdm==4.67.1
# huggingface-hub
# spacy
# transformers
transformers==4.57.5
transformers==4.57.3
# via
# agent-cli
# kokoro
Expand Down
2 changes: 1 addition & 1 deletion agent_cli/_requirements/memory.txt
Original file line number Diff line number Diff line change
Expand Up @@ -296,7 +296,7 @@ tqdm==4.67.1
# chromadb
# huggingface-hub
# transformers
transformers==4.57.5
transformers==4.57.3
# via agent-cli
typer==0.21.1
# via
Expand Down
Loading