Replies: 1 comment 1 reply
-
I think the industry needs refreshed best practices reference material for SDD+TDD with LLMs. spec-kit is one project trying to find that pattern. But it seems like spec-kit is an integration project, at least at the moment. I don't think SDD+TDD is the end-game for any of us, it's the obvious hammer+nail for right now to effect guardrails. Other discussions on this board indicate you're not alone in seeing this gap. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
**Feature Request: **
I’d like to propose adding a comprehensive benchmarking suite for spec-kit to evaluate its performance on SDD with different large language models on multiple tasks.
Motivation
Raw stats and leaderboards alone aren’t very informative for spec-kit-specific use cases. The SDD systematic approach to planning, research and task management drives different development behaviours and outcomes.
Proposal
Benefits
Would the maintainers and community be open to exploring this?
Beta Was this translation helpful? Give feedback.
All reactions