The Lead Stack: Agent-Ambiguous Workflows

A high-performance repository of "Skills" and RTK-powered tools designed for Tech Leads. These workflows are Agent-Ambiguous, allowing any LLM agent (Gemini, Claude, GPT) to assist with implementation planning, code review, and automated testing.

Requirements

RTK (Runtime Toolkit): curl -fsSL https://raw.githubusercontent.com/rtk-ai/rtk/refs/heads/master/install.sh | sh
GitHub CLI (gh): Required for automated PR management.
Browsers (Playwright): npx playwright install chromium
Python Deps: pip install python-dotenv playwright
System: Access to your local Chrome User Data Directory.

Firecrawl API: (Optional) For the planning-expert to read external links.

Branching Strategy

This repository enforces Trunk Based Development with a rebase-first workflow and squash-and-merge PRs.

For detailed day-to-day workflow examples and guidelines for both developers and AI agents, please refer to the Branch Management Strategy document.

🚀 Quick Start

1. Installation

Clone this repo and link it globally for easy access:

# Add this to your ~/.zshrc
alias lead-init='bash /path/to/tech-lead-stack/install.sh --link .'

# Cursor: register skills globally (~/.cursor/skills/) without touching your app repo
alias lead-init-cursor='bash /path/to/tech-lead-stack/install.sh --link . --ide cursor'

2. Initialize a Project

Navigate to any repository you want to automate and run the new alias:

lead-init

🛠 Final Checklist

Permissions: Run chmod +x scripts/cleanup.sh in your toolbox repo.
Execution: You can now run lead-init to build it and lead-clean to tear it down.
Mission Control: Your lead-init will now automatically run a pre-flight check to make sure you didn't miss a step.

🧠 The Methodology: Three Pillars

The "Tech-Lead Stack" is built upon three foundational pillars of modern engineering excellence:

G-Stack (Modularity & Diagnosis-First): Inspired by the garrytan/gstack philosophy, this pillar mandates Diagnosis before Advice. Every skill begins with Phase 0: Tech-Stack Discovery. Agents must understand the project's language, framework, and constraints (by inspecting package.json, tsconfig.json, etc.) before proposing a single line of code.
MinimumCD (Atomic Batches & Continuous Verification): This pillar prioritizes small, atomic batches of work (<100 lines per task) and continuous automated verification. It is designed to prevent "Big Bang" integrations by enforcing vertical slicing and early detection of regression risks.
Agent Skills (Production-Grade Ethos): Based on Addy Osmani's agent-skills, this pillar treats AI agents as disciplined senior engineers rather than shortcut-taking assistants.

Production-Grade Ethos

Our methodology is reinforced by the Agent Skills ethos, ensuring AI agents default to high-discipline engineering rather than the shortest path:

Process over Prose: Skills are structured workflows (not vague advice) with specific verification gates.
Anti-Rationalization: It uses documented rebuttals to combat common AI excuses (e.g., "I'll add tests later" or "The fix seems right").
Verification is Non-Negotiable: Every task must end with hard evidence (tests, logs, or screenshots). "Seems right" is never an acceptable exit criterion.

[!NOTE] G-Stack is a Methodology, not a Stack: While the name implies a specific technology set, the Tech-Lead Stack treats "G-Stack" as an engineering philosophy centered on modularity, diagnosis-first planning, and robust verification. It is designed to work seamlessly with C#, Python, JavaScript, Java, Go, and any other ecosystem.

🛠 Technical Architecture: RTK & MCP Synergy

To maintain high performance and auditability, the Tech-Lead Stack uses a dual-layered architecture:

1. The Human-CLI Registry (`package.json`)

The rtk.tools section in package.json acts as the Single Source of Truth for tool execution.

The Human Side: When you run rtk run <tool>, the rtk-run.sh script specifically looks for that key in your local (or linked) package.json.
The Synergy: This ensures that even if you aren't using an AI agent, you can manually audit or trigger any skill logic via the terminal. It guarantees that the Agent and the Human are always working from the same operational registry.

2. The Agent-Knowledge Broker (MCP Server)

The MCP Server serves as the Intelligence Layer for your IDE.

Skill Discovery: The server dynamically reads .ai/skills/*.md files and exposes them as tools. It uses the internal: true flag to hide support-only skills from primary discovery while keeping them available for implementation.
Telemetry & Metrics: Unlike the CLI, executions via the MCP are instrumented via Langfuse. This captures token usage, project attribution, and agentic decision-making for enterprise-grade analytics.

3. Agent Skills vs. Dev Workflows

Category	Storage	Purpose
Agent Skills	`.ai/skills/`	Core Brains: High-density instructions for the AI. Some are "Internal" support logic.
Dev Workflows	`.agents/workflows/`	User Orchestrations: Antigravity `/slash` commands or manual starting prompts.

How to use in any project

3. Usage Options

Option A: The "Context Injection" (Universal)

If using a web-based agent (Claude.ai, ChatGPT) or starting a fresh session without workspace access:

"Analyze the skills in /path/to/lead-stack/.ai/skills/. You are now a Tech Lead Agent equipped with these workflows. Use rtk run <tool> for all tool executions."

Option B: The Symlink (Best for Antigravity/Cursor/Claude Code)

Since lead-init has already linked the instructions to your project, simply prompt the agent in your workspace:

"Read the instructions in .ai/agents.md and follow the planning-expert workflow for this ticket."

Cursor: use install.sh --link . --ide cursor (or lead-init-cursor above) so the same skills appear under your user ~/.cursor/skills/ as symlinks into this repo. Your app repository does not get a .cursor/ folder from this step. Invoke skills from Cursor’s skills UI (or the slash menu) like Antigravity workflows.

Antigravity Setup

Step 1: Clone the Repository

Open your terminal and clone the tech-lead-stack repository to a permanent location on your machine.

git clone https://github.com/your-username/tech-lead-stack.git ~/tech-lead-stack
cd ~/tech-lead-stack

Step 2: Run the Installer

Run the install.sh script to set up dependencies and link the stack to your current working directory.

./install.sh --link .

Note: This will also output a JSON snippet for MCP configuration. Keep this handy for Step 6.

Step 3: Access Antigravity Customizations

Open the Agent panel in Antigravity. Click the "Open Agent Manager" button at the top to find the Customizations menu.

Step 4: Navigate to Workflows

In the Customizations panel, select the Workflows tab.

Step 5: Register Global Workflows

To make workflows available across all projects:

Click the + Global button.
Open any workflow file from .agents/workflows/ in your IDE (e.g., audit-tech-debt.md).
Copy the entire content of the markdown file.
Paste it into the Antigravity workflow editor.
Give it a name (e.g., audit-tech-debt). You can optionally add a suffix like -tls if you want to distinguish them.
Repeat for other workflows you wish to use globally.

Step 6: Configure the MCP Server

Antigravity requires the MCP server to execute tools (like rtk).

Go to Settings (Gear icon) -> MCP.
Add a new MCP server named tech-lead-stack.
Use the following configuration (replacing /path/to/tech-lead-stack with your actual absolute path):

{
  "mcpServers": {
    "tech-lead-stack": {
      "command": "npm",
      "args": [
        "--prefix",
        "/path/to/tech-lead-stack",
        "--silent",
        "run",
        "mcp:start"
      ]
    }
  }
}

Step 7: Usage

You can now invoke these workflows in the Agent chat by typing / followed by the workflow name.

name: workflow-clean-code-audit description: Clean Code Audit

// turbo

Phase 0: Tech-Stack Discovery (MANDATORY): Inspect the project root to identify the primary language and framework.
Call the tech-lead-stack.get_skills tool:
- skillName: "clean-code"
- projectName: "<YOUR_CURRENT_PROJECT_NAME>"
- model: "<YOUR_MODEL_NAME>"
- agent: "<YOUR_AGENT_NAME>"
Follow its workflow to audit architecture and recommend SOLID improvements.

Cursor Setup

Step 1: Clone the Repository

Open your terminal and clone the tech-lead-stack repository to a permanent location on your machine.

git clone https://github.com/your-username/tech-lead-stack.git ~/tech-lead-stack
cd ~/tech-lead-stack

Step 2: Run the Installer with Cursor Flag

Run the install.sh script to set up dependencies and link the stack to your current working directory.

./install.sh --link . --ide cursor

Note: This will also output a JSON snippet for MCP configuration. Keep this handy for Step 4.

Step 3: Verify Symlinked Skills

The install.sh script will automatically symlink the skills into your global ~/.cursor/skills/ directory. You can verify this by running:

ls -la ~/.cursor/skills/

You should see symlinks to the .ai/skills/ directory of the tech-lead-stack repo.

Step 4: Configure the MCP Server in Cursor

Cursor requires the MCP server to execute tools (like rtk).

Open Cursor Settings (Gear icon) -> Cursor Settings.
Select MCP from the sidebar.
Click + Add New MCP Server.
Name the server tech-lead-stack.
Set the Type to command.
Use the following configuration (replacing /path/to/tech-lead-stack with your actual absolute path):

{
  "command": "npm",
  "args": [
    "--prefix",
    "/path/to/tech-lead-stack",
    "--silent",
    "run",
    "mcp:start"
  ]
}

Step 5: Usage

You can now invoke these skills in the Cursor chat by typing @ followed by the skill name (e.g., @planning-expert). Cursor will suggest the skill from the list of available global skills.

🧹 Resetting a Project

Tech-Lead Stack Cleanup Alias

# Add this to your ~/.zshrc
alias lead-clean='bash /path/to/tech-lead-stack/scripts/cleanup.sh .'

If you want to remove the AI workflows and symlinks from a repository:

lead-clean

🧪 CI/CD

This repository uses GitHub Actions to validate:

Skill Integrity: Ensures all .md files in .ai/skills/ have valid YAML frontmatter.
Markdown Linting: Prevents malformed instructions that could confuse agents.
Script Permissions: Ensures all tools in scripts/ remain executable.

Pro-Tip: The "Profile Locked" Error

If you get an error that the browser profile is "already in use," close your active Chrome window or create a dedicated Profile for the Agent and update your .env accordingly.

tech-lead-stack/
├── .ai/
│   ├── agents.md
│   └── skills/
│       ├── agent-optimizer.md
│       ├── code-review-checklist.md
│       ├── mission-architect.md
│       ├── planning-expert.md
│       ├── regression-bug-fix.md
│       ├── verification-auditor.md (Internal)
│       └── visual-verifier.md
├── .github/
│   └── workflows/
│       └── agent-ci.yml
├── scripts/
│   ├── autoeval-check.js
│   ├── cleanup.sh
│   ├── gh-pr-create.sh
│   └── upload-evidence.py
├── templates/
│   └── PULL_REQUEST_TEMPLATE.md
├── .env
├── .env.example
├── .gitignore
├── ONBOARDING.md
├── install.sh
├── package.json
├── README.md
└── requirements.txt

Resources 📚

🛠 Technical Overview: Skill Discovery & Priority

To maintain User Sovereignty and ensure Context Hygiene, the Tech-Lead Stack does not simply read static files. Access to all skills is brokered through the MCP Server, which enforces a strict priority of discovery:

Project-Local Override: .ai/skills/ in your current working directory.
Global Fallback: .ai/skills/ in the tech-lead-stack repository.

Why Go Through the MCP?

Customization: Teams can "fork" a skill for a specific project without modifying the global repository.
Auditability: Every skill retrieval is wrapped in a Langfuse Trace (Telemetry) to track which model, agent, and project are executing specific logic.
Cost Control: The server captures and reports the "Budgeted Cost" of each skill to prevent uncontrolled LLM spend.

Priority Logic Snippet:

// src/mcp-server/fs-service.ts

async readSkill(safeSkillName: string) {
  // Define Search Paths: Local Project has priority over Global Repo
  const localSkillsDir = path.join(process.cwd(), ".ai/skills");
  const searchDirs = [localSkillsDir, this.repoSkillsDir];

  for (const dir of searchDirs) {
    const skillPath = path.join(dir, `${safeSkillName}.md`);
    try {
      // Returns the first match found (Local Override logic)
      const content = await fs.readFile(skillPath, "utf-8");
      return { content, path: skillPath };
    } catch {
      // Continue to Fallback
    }
  }
}

Available Skills

Skill	Description	How it works	Use Case	Est. Context Footprint
`accessibility-auditor`	Specialized audit for Web Accessibility (A11y). Scans for contrast, semantics, and ARIA debt.	Static analysis via `grep`, visual scrutiny of CSS, and read-only runtime DOM inspection.	Ensuring WCAG 2.1 compliance and multi-viewport accessibility.	~650 tokens
`mission-architect`	Master Blueprint Engine. Orchestrates Strategy -> Research -> Plan -> Deliver for complex features.	Strategic extraction from roadmaps, deep codebase audit, and multi-stage planning via `planning-expert`.	Designing and executing a major architectural change or multi-file feature.	~1300 tokens
`planning-expert`	The complete Planning Expert. Orchestrates deep pattern discovery, vertical slicing, and safe incremental delivery.	Deep codebase audit followed by an atomic G-Stack blueprint and commit-ready task list.	Breaking down complex Jira tickets or architectural refactors into test-driven steps.	~850 tokens
`planning-expert-quick`	Ultra-lean strategic planning. Optimized for speed, token efficiency, and rapid MVC delivery.	Anchors tech stack followed by a condensed W/W/H blueprint and rapid execution cycle.	Common, less complex, lite-weight tasks where velocity is the priority.	~600 tokens
`regression-bug-fix`	Unified remediation engine for resolving QA, Design Review (DR), and Regression feedback.	Maps feedback to code impact, generates a localized remediation plan, and verifies the fix against regressions.	Fixing "Login button misaligned" or "API returning 500" after a QA pass.	~1350 tokens
`code-review-checklist`	High-density pre-commit quality auditor for verifying functionality and G-Stack standards.	Analyzes local diffs against 4 gates (Spec, SOLID, A11y, Evidence), ensuring zero `any` types and compliance.	Rapid local verification before running `rtk run create-pr`.	~650 tokens
`clean-code`	Architectural auditor enforcing SOLID principles and programmatic standards (KISS, DRY, YAGNI).	Scans for "God Objects" and tight coupling. Recommends strategy patterns and colocation of code.	Checking a new feature branch before merging to prevent technical debt.	~950 tokens
`security-audit`	Cross-platform security scanner detecting malware, prompt injection, and exfiltration.	Scans skills, scripts, and inputs for malicious patterns (`curl \| bash`, `eval()`).	Running on agent-generated scripts to ensure no backdoors are introduced.	~550 tokens
`pr-automator`	Automates G-Stack Pull Requests with synthesized diffs and verification evidence.	Fetches visual proof (screenshots) and maps code changes to the original Strategic Mission.	Finalizing a feature branch into a professional, evidence-backed PR.	~950 tokens
`visual-verifier`	Captures before/after screen evidence for visual smoke testing.	Runs local app via Playwright and captures Desktop/Mobile screenshots for the PR body.	Proving that a CSS fix works as intended across different viewports.	~450 tokens
`changelog-generator`	Transforms Git history into user-facing release notes with strict noise filtering.	Ingests `git log`, groups by semantic commit type, filters noise, and formats to Markdown.	Generating clean release notes for stakeholders.	~750 tokens
`daily-standup`	Generates a daily status update by analyzing 48h of git activity and task progress.	Categorizes commits, assess blockers, and generates a rolling report using a professional standup template.	Automating your daily update or summarizing work for a sync meeting.	~550 tokens
`ask`	Expert technical advisor providing architectural insights and precise code snippets for manual implementation.	Diagnostic research via Phase 0 discovery, followed by high-density technical advice and snippets.	Q&A about the codebase or "How would I change this?" queries.	~450 tokens
`product-strategist`	Strategic roadmap auditor validating market positioning and Impact vs. Effort.	Scans metrics and positioning to ensure current implementation work maps to high-ROI customer goals.	Auditing a proposed feature list against the core product vision.	~850 tokens
`feature-design-assistant`	Architectural discovery engine for pre-implementation prototyping.	Discovers existing patterns and generates technical specs before the first line of code is written.	High-level ideation for a new service or module.	~800 tokens
`style-logic-exporter`	Extraction engine for transforming CSS/Tailwind logic into portable design tokens for Figma.	Scans style sheets and theme configurations to extract variables, colors, and typography metrics.	Syncing code-based styling with design systems or external documentation.	~650 tokens
`technical-debt-auditor`	Scans codebase for anti-patterns, complexity hotspots, and architectural drift.	Metrics-driven analysis combined with G-Stack methodology to prioritize refactoring tasks.	Routine codebase maintenance and pre-refactoring audits.	~950 tokens

[!NOTE] The Est. Context Footprint listed above refers to the budgeted tokens required to load the skill's instructions into the agent's context. Live execution costs may vary depending on the LLM model and project size, but are tracked accurately via Langfuse telemetry to provide baseline reliability across the entire developer workforce.

Name		Name	Last commit message	Last commit date
Latest commit History 237 Commits
.agents		.agents
.ai		.ai
.github		.github
.husky		.husky
.rtk		.rtk
.vscode		.vscode
artifacts		artifacts
docs		docs
prisma		prisma
public		public
scripts		scripts
src		src
templates		templates
.env.example		.env.example
.gitignore		.gitignore
.markdownlint.json		.markdownlint.json
.prettierignore		.prettierignore
.prettierrc		.prettierrc
BRANCH_MANAGEMENT.md		BRANCH_MANAGEMENT.md
CLAUDE.md		CLAUDE.md
ONBOARDING.md		ONBOARDING.md
README.md		README.md
components.json		components.json
docker-compose.yml		docker-compose.yml
eslint.config.mjs		eslint.config.mjs
install.sh		install.sh
jest.config.js		jest.config.js
jest.setup.ts		jest.setup.ts
next.config.ts		next.config.ts
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
postcss.config.mjs		postcss.config.mjs
prisma.config.ts		prisma.config.ts
requirements.txt		requirements.txt
tsconfig.json		tsconfig.json

Folders and files

Latest commit

History

Repository files navigation

The Lead Stack: Agent-Ambiguous Workflows

Table of Contents

Running Agent Tasks

Requirements

Branching Strategy

🚀 Quick Start

1. Installation

2. Initialize a Project

🛠 Final Checklist

🧠 The Methodology: Three Pillars

Production-Grade Ethos

🛠 Technical Architecture: RTK & MCP Synergy

1. The Human-CLI Registry (package.json)

2. The Agent-Knowledge Broker (MCP Server)

3. Agent Skills vs. Dev Workflows

How to use in any project

3. Usage Options

Option A: The "Context Injection" (Universal)

Option B: The Symlink (Best for Antigravity/Cursor/Claude Code)

Antigravity Setup

Step 1: Clone the Repository

Step 2: Run the Installer

Step 3: Access Antigravity Customizations

Step 4: Navigate to Workflows

Step 5: Register Global Workflows

Step 6: Configure the MCP Server

Step 7: Usage

Cursor Setup

Step 1: Clone the Repository

Step 2: Run the Installer with Cursor Flag

Step 3: Verify Symlinked Skills

Step 4: Configure the MCP Server in Cursor

Step 5: Usage

🧹 Resetting a Project

Tech-Lead Stack Cleanup Alias

🧪 CI/CD

Pro-Tip: The "Profile Locked" Error

Resources 📚

🛠 Technical Overview: Skill Discovery & Priority

Why Go Through the MCP?

Priority Logic Snippet:

Available Skills

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

1. The Human-CLI Registry (`package.json`)

Packages