DELTA Lab

PM-DELTA

AI-Native Development Infrastructure

v1.0

DELTA Lab

Shanghai Innovation Institute · 2026

From Tools to Value

Value Delivery
Dev Infrastructure
Coding Tools
Foundation Models

PM-DELTA

Delivers business value, not just developer productivity.

Cursor, Copilot, Devin

Make individual developers faster. Single-task scope.

Claude, GPT, etc.

Provide intelligence. We build ON them.

The gap between tools and value was bridged by humans. We computationalize this gap.

Better Practice, Not Just Developer

AI as Developer

  • Writes code in isolation
  • No project awareness
  • Forgets everything per session
vs

AI as Project Manager

  • Orchestrates scope, context, quality
  • Multi-project coordination
  • Persistent organizational learning
Core Insight

The hard problem is context construction, not code generation.

Worker Layer: Engineering Discipline

delta-templates define per-project principles, skills, and commands

Rules as Code

Behavioral directives in .claude/rules/ that AI loads automatically every session.

Skills & Commands

Reusable capabilities: /start-task, /task-done, /code-review, /decompose.

File-Based State

Git-tracked proposals, inbox state machine, Linear as source of truth for tasks.

Quality Pipeline

Intent → Demand Review → Plan → Plan Review → Execute → Code Review. Three gates before code ships.

Interactive + Autonomous

Same SOP, different supervision level

Day Mode (Interactive)

Plan
Implement
Review
Ship

Human guides each step. Reviews plans before implementation.

Night Mode (Nightrun)

Plan
Gate
Impl
Gate
Ship

Quality gates replace human checkpoints. Fleet of parallel workers.

Every workflow improvement benefits both modes. You choose a supervision level, not a tool.

PM Layer: Multi-Project Orchestration

PM-DELTA
Meta-Repo
Project A
git subtree
Project B
git subtree
Shared Infra
delta-shared
Templates
.templates/
Portfolio
student work
Project C
local repo

Linear + GitHub Integration

Cross-project visibility via unified task tracking and automated PR workflows.

Worktree Isolation

Each task gets its own git worktree. Parallel nightrun workers, zero interference.

Team Project Management

Daily/weekly reports aggregate progress across all projects and portfolio students.

The Autonomy Risk

As AI autonomy increases, so does the blast radius of mistakes

Over-Trust

AI confidently produces plausible but wrong output. Bugs ship to production. The more autonomous the system, the later mistakes are caught.

Under-Trust

Humans review everything "just in case." AI capacity is wasted. The team gains autonomy on paper but not in practice.

Core Question

How do you know which tasks are safe to automate and which need human oversight?

Without systematic assessment, you are guessing.

SOTIF-Inspired Safety Review

Systematic automation risk assessment, borrowed from automotive safety

ARIL Scoring

Automation Risk Impact Level per module. Quantifies reliability risk of AI-driven changes.

Fractal Tree Traversal

Bottom-up risk propagation through CLAUDE.md hierarchy. Leaf modules aggregate to root.

Capability-Aware Gating

Match task complexity to current AI capability. Route high-risk tasks to human review.

Root — Aggregated Risk
Module — Local ARIL
Leaf — Source-Level

The system evaluates its own automation safety before executing.

The Context Crisis

AI starts every session with amnesia

Zero Memory

Each session starts from scratch. Decisions made yesterday are invisible today. The AI re-discovers what it already learned.

Knowledge in Heads

Team knowledge lives only in human memory. When someone is unavailable, the knowledge is gone. Unscalable.

Inconsistent Workers

Different AI workers get different context depending on who prompted them. Quality varies wildly across sessions.

The Bottleneck

The hard problem of AI-assisted development is context construction, not code generation.

Code generation is solved. Knowing what to build, why, and how it fits -- that is the gap.

Context Is All You Need

Four sources converge into unified task context

Auto-Context
Fractal CLAUDE.md (T1/T2/T3 loading)
Learnt Context
3-tier rules, insights, principles
Bitter Lessons
Anti-pattern capture + retrieval
Cognitions
Session journals + decision memory
Unified
Task
Context
Right Context, Right Time
Never too much, never too little
Organizational Memory
System improves through its own work
Cross-Session Coherence
Knowledge persists across time

Every AI worker starts with the full picture. No amnesia. No guessing.

Fractal CLAUDE.md

The right context at the right time -- never too much, never too little

CLAUDE.md T1 auto
.claude/rules/ T2 auto
conventions.md
insights.md
src/auth/CLAUDE.md T3 demand
src/auth/SPEC.md
SCOPE.md

T1: Session Start

Root CLAUDE.md auto-loads. Project identity, children map, data flow. Always available.

T2: Module Access

Per-module rules load when touching that module. Conventions, insights, local patterns.

T3: On Demand

Deep docs read explicitly. SPEC.md for architecture rationale. SCOPE.md for strategy.

Learning System

Manual corrections compound into organizational memory

Tier 1: Local Rules

Per-project behavioral directives. Immediate effect next session. Project scope

Tier 2: Project Insights

Cognitive patterns, auto-refined to 1500 chars max. Project scope

Tier 3: Team Principles

Cross-project review criteria. Requires human approval. Team-wide

Bitter Lessons Pipeline

Every code review finding is captured, tagged, and auto-retrieved when the same pattern recurs. Bug fixes trigger root-cause reflection.

Key Insight

This is not RLHF. It is explicit, auditable organizational learning.

Three-Layer Emergence

Quality emerges from disciplined context construction at three time scales

Structured Redundancy Minutes

Multi-agent planning. Iterative review. Gate-fix retry loops. More compute per task = more reliable output.

Evolutionary Selection Hours

Plan + code review select the best approach. Failed attempts inform future decisions.

Compound Learning Weeks

Human feedback routed to 3-tier system. The system improves through the work it enables.

Design Principle

Better prompts help one task.

Better structure helps every task, forever.

Progressive Autonomy

L0 AI-Assisted Human codes, AI autocompletes Industry
L1 AI-Driven Human reviews every step, AI codes Done
L2 AI-Autonomous Human reviews at gates, AI follows SOP Done
L3 AI Fleet Human reviews batches, AI executes queues In Progress
L4 AI Self-Planning Human sets direction, AI decomposes + executes Foundation
L5 Hypothesis-Driven Human sets hypotheses, AI designs experiments Vision
AI Infrastructure
SOP, gates, nightrun
Human Knowledge
Feedback, corrections
Harnessing Eng.
Conventions, isolation
LLM Advances
Each generation lifts all

Results & Trajectory

~50%
First-Pass PR Approval
3-5
Nightrun Tasks/Week
30 min
Review Time/PR
5+
Projects Managed
A
Inner Loop
Q2-Q3 2026

Current

B
Intent + KB
Q3-Q4 2026
C
Predictive
H1 2027
D
Autonomous Research
H2 2027
Metric Now Phase B Phase D
PR Approval ~50% 80% >95%
Tasks/Week 3-5 10-15 30+
Human Role Code + Plan Outcome Review Hypothesis

AI4AI4X

Use AI to build AI infra. Use AI infra to transform industries.

Industry Impact
AI Infrastructure
Foundation Models

The Loop

PM-DELTA manages its own development. Every improvement immediately improves the process that builds it.

Deep Cultivation

Value from domain depth, not breadth. AI+Finance, AI+Education -- each domain compounds learning.

The Flywheel

More Domains
Wider coverage
Richer Learning
3-tier compounds
Better Autonomy
Higher approval rate
More Capacity
More tasks per night

Compound Growth

The system that ran 100 tasks is meaningfully better than the one that ran 10.

AI Cost-Effectiveness Map

Internal framework guides domain selection: which industries benefit most from AI automation today.

Join Us

Building a team of domain-deep AI practitioners

AI+X PhD Students

Combine AI methods with domain expertise: finance, education, healthcare, manufacturing.

Full-Time Engineers

Build AI infrastructure that compounds. Shape the future of AI-native development.

Research Partners

Cross-institutional collaboration. Real data, real systems, real impact.

Our Mission

Not just model builders -- domain practitioners who wield AI as infrastructure.

DELTA Lab

PM-DELTA

AI-Native Development Infrastructure

github.com/SII-DELTA