A high-performance repository of "Skills" and RTK-powered tools designed for Tech Leads. These workflows are Agent-Ambiguous, allowing any LLM agent (Gemini, Claude, GPT) to assist with implementation planning, code review, and automated testing.
- π Quick Start
- How to use in any project
- Branching Strategy
- Antigravity Setup
- Cursor Setup
- π§Ή Resetting a Project
- π§ͺ CI/CD
- Resources π
- Available Skills, what they do, how they do it , and what they cost
[!IMPORTANT] This stack and its associated skills are currently only configured for macOS and Linux. Windows users should use WSL2 for full compatibility.
This stack includes a helper script to run agent-specific tasks defined in
package.json.
-
Set the alias (one-time per session):
alias rtk="$(pwd)/scripts/rtk-run.sh"
-
Run a task:
rtk run mission-control rtk run planning-expert rtk list
- RTK (Runtime Toolkit):
curl -fsSL https://raw.githubusercontent.com/rtk-ai/rtk/refs/heads/master/install.sh | sh - GitHub CLI (gh): Required for automated PR management.
- Browsers (Playwright):
npx playwright install chromium - Python Deps:
pip install python-dotenv playwright - System: Access to your local Chrome User Data Directory.
- Firecrawl API: (Optional) For the
planning-expertto read external links.
This repository enforces Trunk Based Development with a rebase-first workflow and squash-and-merge PRs.
For detailed day-to-day workflow examples and guidelines for both developers and AI agents, please refer to the Branch Management Strategy document.
Clone this repo and link it globally for easy access:
# Add this to your ~/.zshrc
alias lead-init='bash /path/to/tech-lead-stack/install.sh --link .'
# Cursor: register skills globally (~/.cursor/skills/) without touching your app repo
alias lead-init-cursor='bash /path/to/tech-lead-stack/install.sh --link . --ide cursor'
Navigate to any repository you want to automate and run the new alias:
lead-init
- Permissions: Run
chmod +x scripts/cleanup.shin your toolbox repo. - Execution: You can now run
lead-initto build it andlead-cleanto tear it down. - Mission Control: Your
lead-initwill now automatically run a pre-flight check to make sure you didn't miss a step.
The "Tech-Lead Stack" is built upon three foundational pillars of modern engineering excellence:
- G-Stack (Modularity & Diagnosis-First): Inspired by the
garrytan/gstack philosophy, this
pillar mandates Diagnosis before Advice. Every skill begins with Phase
0: Tech-Stack Discovery. Agents must understand the project's language,
framework, and constraints (by inspecting
package.json,tsconfig.json, etc.) before proposing a single line of code. - MinimumCD (Atomic Batches & Continuous Verification): This pillar prioritizes small, atomic batches of work (<100 lines per task) and continuous automated verification. It is designed to prevent "Big Bang" integrations by enforcing vertical slicing and early detection of regression risks.
- Agent Skills (Production-Grade Ethos): Based on Addy Osmani's agent-skills, this pillar treats AI agents as disciplined senior engineers rather than shortcut-taking assistants.
Our methodology is reinforced by the Agent Skills ethos, ensuring AI agents default to high-discipline engineering rather than the shortest path:
- Process over Prose: Skills are structured workflows (not vague advice) with specific verification gates.
- Anti-Rationalization: It uses documented rebuttals to combat common AI excuses (e.g., "I'll add tests later" or "The fix seems right").
- Verification is Non-Negotiable: Every task must end with hard evidence (tests, logs, or screenshots). "Seems right" is never an acceptable exit criterion.
[!NOTE] G-Stack is a Methodology, not a Stack: While the name implies a specific technology set, the Tech-Lead Stack treats "G-Stack" as an engineering philosophy centered on modularity, diagnosis-first planning, and robust verification. It is designed to work seamlessly with C#, Python, JavaScript, Java, Go, and any other ecosystem.
To maintain high performance and auditability, the Tech-Lead Stack uses a dual-layered architecture:
The rtk.tools section in package.json acts as the Single Source of Truth
for tool execution.
- The Human Side: When you run
rtk run <tool>, the rtk-run.sh script specifically looks for that key in your local (or linked)package.json. - The Synergy: This ensures that even if you aren't using an AI agent, you can manually audit or trigger any skill logic via the terminal. It guarantees that the Agent and the Human are always working from the same operational registry.
The MCP Server serves as the Intelligence Layer for your IDE.
- Skill Discovery: The server dynamically reads
.ai/skills/*.mdfiles and exposes them as tools. It uses theinternal: trueflag to hide support-only skills from primary discovery while keeping them available for implementation. - Telemetry & Metrics: Unlike the CLI, executions via the MCP are instrumented via Langfuse. This captures token usage, project attribution, and agentic decision-making for enterprise-grade analytics.
| Category | Storage | Purpose |
|---|---|---|
| Agent Skills | .ai/skills/ |
Core Brains: High-density instructions for the AI. Some are "Internal" support logic. |
| Dev Workflows | .agents/workflows/ |
User Orchestrations: Antigravity /slash commands or manual starting prompts. |
If using a web-based agent (Claude.ai, ChatGPT) or starting a fresh session without workspace access:
"Analyze the skills in /path/to/lead-stack/.ai/skills/. You are now a Tech Lead Agent equipped with these workflows. Use
rtk run <tool>for all tool executions."
Since lead-init has already linked the instructions to your project, simply prompt the agent in your workspace:
"Read the instructions in .ai/agents.md and follow the planning-expert workflow for this ticket."
Cursor: use install.sh --link . --ide cursor (or lead-init-cursor above)
so the same skills appear under your user ~/.cursor/skills/ as symlinks
into this repo. Your app repository does not get a .cursor/ folder from this
step. Invoke skills from Cursorβs skills UI (or the slash menu) like Antigravity
workflows.
Open your terminal and clone the tech-lead-stack repository to a permanent
location on your machine.
git clone https://github.com/your-username/tech-lead-stack.git ~/tech-lead-stack
cd ~/tech-lead-stackRun the install.sh script to set up dependencies and link the stack to your
current working directory.
./install.sh --link .Note: This will also output a JSON snippet for MCP configuration. Keep this handy for Step 6.
Open the Agent panel in Antigravity. Click the "Open Agent Manager" button at the top to find the Customizations menu.
In the Customizations panel, select the Workflows tab.
To make workflows available across all projects:
- Click the + Global button.
- Open any workflow file from
.agents/workflows/in your IDE (e.g.,audit-tech-debt.md). - Copy the entire content of the markdown file.
- Paste it into the Antigravity workflow editor.
- Give it a name (e.g.,
audit-tech-debt). You can optionally add a suffix like-tlsif you want to distinguish them. - Repeat for other workflows you wish to use globally.
Antigravity requires the MCP server to execute tools (like rtk).
- Go to Settings (Gear icon) -> MCP.
- Add a new MCP server named
tech-lead-stack. - Use the following configuration (replacing
/path/to/tech-lead-stackwith your actual absolute path):
{
"mcpServers": {
"tech-lead-stack": {
"command": "npm",
"args": [
"--prefix",
"/path/to/tech-lead-stack",
"--silent",
"run",
"mcp:start"
]
}
}
}You can now invoke these workflows in the Agent chat by typing / followed by
the workflow name.
name: workflow-clean-code-audit description: Clean Code Audit
// turbo
-
Phase 0: Tech-Stack Discovery (MANDATORY): Inspect the project root to identify the primary language and framework.
-
Call the tech-lead-stack.get_skills tool:
- skillName: "clean-code"
- projectName: "<YOUR_CURRENT_PROJECT_NAME>"
- model: "<YOUR_MODEL_NAME>"
- agent: "<YOUR_AGENT_NAME>"
-
Follow its workflow to audit architecture and recommend SOLID improvements.
Open your terminal and clone the tech-lead-stack repository to a permanent
location on your machine.
git clone https://github.com/your-username/tech-lead-stack.git ~/tech-lead-stack
cd ~/tech-lead-stackRun the install.sh script to set up dependencies and link the stack to your
current working directory.
./install.sh --link . --ide cursorNote: This will also output a JSON snippet for MCP configuration. Keep this handy for Step 4.
The install.sh script will automatically symlink the skills into your global
~/.cursor/skills/ directory. You can verify this by running:
ls -la ~/.cursor/skills/You should see symlinks to the .ai/skills/ directory of the tech-lead-stack
repo.
Cursor requires the MCP server to execute tools (like rtk).
- Open Cursor Settings (Gear icon) -> Cursor Settings.
- Select MCP from the sidebar.
- Click + Add New MCP Server.
- Name the server
tech-lead-stack. - Set the Type to
command. - Use the following configuration (replacing
/path/to/tech-lead-stackwith your actual absolute path):
{
"command": "npm",
"args": [
"--prefix",
"/path/to/tech-lead-stack",
"--silent",
"run",
"mcp:start"
]
}You can now invoke these skills in the Cursor chat by typing @ followed by the
skill name (e.g., @planning-expert). Cursor will suggest the skill from the
list of available global skills.
# Add this to your ~/.zshrc
alias lead-clean='bash /path/to/tech-lead-stack/scripts/cleanup.sh .'
If you want to remove the AI workflows and symlinks from a repository:
lead-clean
This repository uses GitHub Actions to validate:
- Skill Integrity: Ensures all
.mdfiles in.ai/skills/have valid YAML frontmatter. - Markdown Linting: Prevents malformed instructions that could confuse agents.
- Script Permissions: Ensures all tools in
scripts/remain executable.
If you get an error that the browser profile is "already in use," close your
active Chrome window or create a dedicated Profile for the Agent and update your
.env accordingly.
tech-lead-stack/
βββ .ai/
β βββ agents.md
β βββ skills/
β βββ agent-optimizer.md
β βββ code-review-checklist.md
β βββ mission-architect.md
β βββ planning-expert.md
β βββ regression-bug-fix.md
β βββ verification-auditor.md (Internal)
β βββ visual-verifier.md
βββ .github/
β βββ workflows/
β βββ agent-ci.yml
βββ scripts/
β βββ autoeval-check.js
β βββ cleanup.sh
β βββ gh-pr-create.sh
β βββ upload-evidence.py
βββ templates/
β βββ PULL_REQUEST_TEMPLATE.md
βββ .env
βββ .env.example
βββ .gitignore
βββ ONBOARDING.md
βββ install.sh
βββ package.json
βββ README.md
βββ requirements.txt
- https://skills.sh/
- https://github.com/AgriciDaniel/claude-seo
- https://skills.sh/dammyjay93/interface-design/interface-design
- https://github.com/orgs/firecrawl/repositories?q=sort%3Astars
- https://www.npmjs.com/package/autoevals
- https://github.com/pezzolabs/pezzo
- https://beyond.minimumcd.org/docs/team-chatbot/
- https://substack.com/home/post/p-187289110
- https://github.com/garrytan/gstack
- https://bryanfinster.substack.com/p/ai-broke-your-code-review-heres-how
- https://migration.minimumcd.org/docs/reference/practices/continuous-integration/
- https://agents.md/
- https://github.com/bdfinst/agentic-dev-team?tab=readme-ov-file#review-agents
- https://github.com/addyosmani/agent-skills
To maintain User Sovereignty and ensure Context Hygiene, the Tech-Lead Stack does not simply read static files. Access to all skills is brokered through the MCP Server, which enforces a strict priority of discovery:
- Project-Local Override:
.ai/skills/in your current working directory. - Global Fallback:
.ai/skills/in thetech-lead-stackrepository.
- Customization: Teams can "fork" a skill for a specific project without modifying the global repository.
- Auditability: Every skill retrieval is wrapped in a Langfuse Trace (Telemetry) to track which model, agent, and project are executing specific logic.
- Cost Control: The server captures and reports the "Budgeted Cost" of each skill to prevent uncontrolled LLM spend.
// src/mcp-server/fs-service.ts
async readSkill(safeSkillName: string) {
// Define Search Paths: Local Project has priority over Global Repo
const localSkillsDir = path.join(process.cwd(), ".ai/skills");
const searchDirs = [localSkillsDir, this.repoSkillsDir];
for (const dir of searchDirs) {
const skillPath = path.join(dir, `${safeSkillName}.md`);
try {
// Returns the first match found (Local Override logic)
const content = await fs.readFile(skillPath, "utf-8");
return { content, path: skillPath };
} catch {
// Continue to Fallback
}
}
}| Skill | Description | How it works | Use Case | Est. Context Footprint |
|---|---|---|---|---|
accessibility-auditor |
Specialized audit for Web Accessibility (A11y). Scans for contrast, semantics, and ARIA debt. | Static analysis via grep, visual scrutiny of CSS, and read-only runtime DOM inspection. |
Ensuring WCAG 2.1 compliance and multi-viewport accessibility. | ~650 tokens |
mission-architect |
Master Blueprint Engine. Orchestrates Strategy -> Research -> Plan -> Deliver for complex features. | Strategic extraction from roadmaps, deep codebase audit, and multi-stage planning via planning-expert. |
Designing and executing a major architectural change or multi-file feature. | ~1300 tokens |
planning-expert |
The complete Planning Expert. Orchestrates deep pattern discovery, vertical slicing, and safe incremental delivery. | Deep codebase audit followed by an atomic G-Stack blueprint and commit-ready task list. | Breaking down complex Jira tickets or architectural refactors into test-driven steps. | ~850 tokens |
planning-expert-quick |
Ultra-lean strategic planning. Optimized for speed, token efficiency, and rapid MVC delivery. | Anchors tech stack followed by a condensed W/W/H blueprint and rapid execution cycle. | Common, less complex, lite-weight tasks where velocity is the priority. | ~600 tokens |
regression-bug-fix |
Unified remediation engine for resolving QA, Design Review (DR), and Regression feedback. | Maps feedback to code impact, generates a localized remediation plan, and verifies the fix against regressions. | Fixing "Login button misaligned" or "API returning 500" after a QA pass. | ~1350 tokens |
code-review-checklist |
High-density pre-commit quality auditor for verifying functionality and G-Stack standards. | Analyzes local diffs against 4 gates (Spec, SOLID, A11y, Evidence), ensuring zero any types and compliance. |
Rapid local verification before running rtk run create-pr. |
~650 tokens |
clean-code |
Architectural auditor enforcing SOLID principles and programmatic standards (KISS, DRY, YAGNI). | Scans for "God Objects" and tight coupling. Recommends strategy patterns and colocation of code. | Checking a new feature branch before merging to prevent technical debt. | ~950 tokens |
security-audit |
Cross-platform security scanner detecting malware, prompt injection, and exfiltration. | Scans skills, scripts, and inputs for malicious patterns (curl | bash, eval()). |
Running on agent-generated scripts to ensure no backdoors are introduced. | ~550 tokens |
pr-automator |
Automates G-Stack Pull Requests with synthesized diffs and verification evidence. | Fetches visual proof (screenshots) and maps code changes to the original Strategic Mission. | Finalizing a feature branch into a professional, evidence-backed PR. | ~950 tokens |
visual-verifier |
Captures before/after screen evidence for visual smoke testing. | Runs local app via Playwright and captures Desktop/Mobile screenshots for the PR body. | Proving that a CSS fix works as intended across different viewports. | ~450 tokens |
changelog-generator |
Transforms Git history into user-facing release notes with strict noise filtering. | Ingests git log, groups by semantic commit type, filters noise, and formats to Markdown. |
Generating clean release notes for stakeholders. | ~750 tokens |
daily-standup |
Generates a daily status update by analyzing 48h of git activity and task progress. | Categorizes commits, assess blockers, and generates a rolling report using a professional standup template. | Automating your daily update or summarizing work for a sync meeting. | ~550 tokens |
ask |
Expert technical advisor providing architectural insights and precise code snippets for manual implementation. | Diagnostic research via Phase 0 discovery, followed by high-density technical advice and snippets. | Q&A about the codebase or "How would I change this?" queries. | ~450 tokens |
product-strategist |
Strategic roadmap auditor validating market positioning and Impact vs. Effort. | Scans metrics and positioning to ensure current implementation work maps to high-ROI customer goals. | Auditing a proposed feature list against the core product vision. | ~850 tokens |
feature-design-assistant |
Architectural discovery engine for pre-implementation prototyping. | Discovers existing patterns and generates technical specs before the first line of code is written. | High-level ideation for a new service or module. | ~800 tokens |
style-logic-exporter |
Extraction engine for transforming CSS/Tailwind logic into portable design tokens for Figma. | Scans style sheets and theme configurations to extract variables, colors, and typography metrics. | Syncing code-based styling with design systems or external documentation. | ~650 tokens |
technical-debt-auditor |
Scans codebase for anti-patterns, complexity hotspots, and architectural drift. | Metrics-driven analysis combined with G-Stack methodology to prioritize refactoring tasks. | Routine codebase maintenance and pre-refactoring audits. | ~950 tokens |
[!NOTE] The Est. Context Footprint listed above refers to the budgeted tokens required to load the skill's instructions into the agent's context. Live execution costs may vary depending on the LLM model and project size, but are tracked accurately via Langfuse telemetry to provide baseline reliability across the entire developer workforce.