The Framework

    Supply Chain of Intelligence™

    SCoI, the 10 layers of the generative AI stack

    One page. Six questions. The canonical reference.

    01

    The Definition

    What is it?

    Intelligence is a supply chain.

    A deeper, fuller view — beyond the older stack and value chain lenses

    The AI stack and the AI value chain are older, partial ways of seeing AI — each shows one slice and misses the rest. The Supply Chain of Intelligence is the deeper, fuller view of the same system: it surfaces what those older lenses structurally cannot — gatekeeping, absorption risk, currents, flywheels, vertical adjacency, and timing — so strategy can be reasoned about at the level where value is actually won or lost.

    Why the stack view, and the value chain view, are not enough

    A stack shows the parts. A value chain shows the flow. Only a supply chain of intelligence shows gatekeeping, bottlenecks, currents, flywheels, and absorption — the forces that decide who actually keeps the value.

    Seven things that go missing

    • Gatekeeping — who controls each chokepoint, and what they can charge to let traffic through. Neither a stack nor a value chain has a concept for this.
    • Bottlenecks above and below the visible layers — L−1 resources, L3 verification, L8 memory. Stack diagrams crop them out.
    • Currents that move value sideways — capital, demand, attention flow across layers and decide which defensible position becomes a business.
    • Flywheels that compound across sublayers — L5 → L1d → L8c is a loop, not a list of components.
    • Vertical adjacencies the Intelligence Cube exposes — the same layer behaves differently in Legal vs. Health vs. FinTech.
    • Absorption risk a platform poses to every layer beneath it — the stack shows neighbors, not predators.
    • Timing — when each layer commoditizes, and what survives the compression.

    A stack describes parts. A value chain describes flow. A supply chain of intelligence describes the whole system — gatekeeping, bottlenecks, currents, flywheels, absorption — which is the level at which durable AI strategy can actually be reasoned about.

    How it sits next to the AI stack

    The AI stack explains how intelligence is built. The Supply Chain of Intelligence explains where intelligence becomes economically defensible.

    02

    The Map

    Where does value accrue?

    A map of where value accrues, not where code runs.

    The 10 layers (L−1 Resources → L8 Memory) are the structural vocabulary. They group into three tiers — Substrate, Workflow, Surface — that compound on very different timescales. Three market currents (Demand Gravity, Attention Economics, Capital Flows) flow horizontally across all of them and decide whether a defensible position becomes a business. This is the map an AI stack diagram can't draw.

    Before the 10, the 3 tiers

    01
    SURFACEL7

    What users touch

    Easily replicated. Platforms ship this for free.

    02
    WORKFLOWL4 · L5 · L6

    What users live inside

    Sticky if deep. Survivable if owned.

    03
    SUBSTRATEL−1 · L0 · L1 · L2 · L3 · L8

    What users depend on

    Proprietary data, trust gates, compounding memory.

    Value escapes the surface and accumulates in the layers below. Own the lower layers, or rent them, and rent your future.

    The full taxonomy, one image

    Supply Chain of Intelligence™

    The 10 layers × 50 sublayers of the generative AI stack.

    L8
    Memory
    L8a
    Session & Short-Term Memory
    L8b
    User & Entity Profiles
    L8c
    Aggregated Network Learning
    L8d
    Institutional Knowledge
    L8e
    Learned World Models
    L7
    Surface
    L7a
    Conversational
    L7b
    Visual Interfaces & Media
    L7c
    Embedded & Embodied AI
    L7d
    Transaction Surface
    L7e
    Async & Ambient Surfaces
    L6
    Orchestration
    L6a
    Agent Loops
    L6b
    Human-in-the-Loop
    L6c
    Role Routing & Task Decomposition
    L6d
    Context & State Management
    L6e
    Runtime Assurance & Learning Loops
    L5
    Execution
    L5a
    Domain Execution & Tool Use
    L5b
    Decision Frameworks & Reasoning Scaffolds
    L5c
    Retrieval-Augmented Workflows
    L5d
    Operating Playbooks
    L5e
    Interaction Skills & Actuation
    L4
    Access
    L4a
    API & Integration Layer
    L4b
    Agent Interface Protocols
    L4c
    Access Governance & Agent Commerce
    L4d
    Real-Time Interaction Infrastructure
    L4e
    Agent Identity & Provenance
    L3
    Gates
    L3a
    Compliance & Export Controls
    L3b
    Quality Gates
    L3c
    Safety, Security & Provenance
    L3d
    Editorial Gates
    L3e
    Distribution Gates
    L2
    Models
    L2a
    Foundation & Multimodal Models
    L2b
    Specialized & Fine-Tuned Models
    L2c
    Embedding & Retrieval
    L2d
    Model Routing & Composition
    L2e
    Reasoning & World Models
    L1
    Data
    L1a
    Public & Open Data
    L1b
    Proprietary Data
    L1c
    Behavioral & Sensor Data
    L1d
    Outcome Data
    L1e
    Synthetic & Simulation Data
    L0
    Infra
    L0a
    Silicon & Memory
    L0b
    Data Centers
    L0c
    Interconnect Fabric
    L0d
    Compute & State Infrastructure
    L0e
    Edge & On-Device Compute
    L−1
    Resources
    L-1a
    Energy & Grid Interconnect
    L-1b
    Thermal & Water Management
    L-1c
    Fabrication & Foundry
    L-1d
    Critical Materials & Supply Chain
    L-1e
    Skilled Trades & Human Capital

    Print it. Mark it up. Map your own, or any company you cover.

    SupplyChainOfAI.com

    Why we call it a supply chain

    From gold in the ground to the ring on your finger

    Every layer transforms the output of the layer below it. Most companies only own one layer. The supply chain is only as strong as its weakest link.

    Gold Supply Chain
    Supply Chain of Intelligence
    The Ground Itself
    Land
    rock
    L-1
    Resources
    NextEra · TSMC fabs · MP Materials
    The Shovels & Mining Equipment
    pickaxe
    L0
    Infrastructure
    NVIDIA · AMD · TSMC
    The Raw Gold Ore
    rock
    L1
    Data
    Apollo.io · Bloomberg · ZoomInfo
    The Smelter & Refinery
    flame
    L2
    Models
    OpenAI · Anthropic · Google DeepMind
    The Hallmark & Assay Office
    shield
    L3
    Gatekeeping
    Vanta · Drata · OneTrust
    The Railroads & Transport
    railroad
    L4
    Access
    AWS · Snowflake · Supabase
    The Master Jeweler
    gem
    L5
    Execution
    Harvey · Sierra · 11x
    The Jewelry Store & Workshop
    storefront
    L6
    Orchestration
    LangChain · CrewAI · Zapier (at risk)
    Wearing the Jewelry
    The Moment of Experience
    ring
    L7
    Surface
    ChatGPT · Gemini · Copilot
    The Record Book
    Compounding Knowledge
    book
    L8
    Memory
    Sierra · Notion (partial) · Rewind AI
    rockL-1Resources

    The Ground Itself, Land, Power, Materials

    Before the gold rush, you need land, water rights, ore deposits, and the miners who work the seams. In AI: power generation, cooling water, foundry capacity, rare earths, and the electricians and technicians who physically build the boom. When demand spikes, this layer is the real bottleneck.

    L-1a
    Energy & Grid Interconnect

    Power generation, PPAs, transmission, and the multi-year grid-interconnect queue, the megawatts the stack consumes and the wait to get them switched on

    L-1b
    Thermal & Water Management

    Cooling systems, water access, immersion/liquid cooling, heat reuse, the thermodynamic ceiling on every GPU cluster

    L-1c
    Fabrication & Foundry

    Leading-edge chip fabrication capacity, EUV lithography, advanced packaging (CoWoS), the physical floor of L0

    L-1d
    Critical Materials & Supply Chain

    Rare earths, lithium, cobalt, gallium, specialty substrates, and the refining, logistics, and geopolitical chokepoints that gate them

    L-1e
    Skilled Trades & Human Capital

    Electricians, HVAC techs, data-center builders, fab process engineers, robotics technicians, the labor pool no model can synthesize

    NextEraTSMC fabsMP MaterialsVistraBechtelThe real bottleneck. Slow to build, impossible to fake.
    Deep dive on L-1
    pickaxeL0Infra

    The Shovels & Mining Equipment

    Before anyone finds gold, someone has to build the pickaxes, drill rigs, and mine shafts. In AI: NVIDIA builds the GPUs, CoreWeave builds the data centers, hyperscalers run the clouds. No shovels → no gold rush. Shovel sellers outlast most miners.

    L0a
    Silicon & Memory

    GPUs, TPUs, custom AI accelerators, plus HBM and high-bandwidth memory (SK Hynix, Micron, Samsung), the invisible bottleneck behind every chip cycle

    L0b
    Data Centers

    Physical facilities housing compute at scale

    L0c
    Interconnect Fabric

    Networking between chips, racks, regions, clouds

    L0d
    Compute & State Infrastructure

    On-demand compute, scheduling, and durable agent state (checkpointing, workflow state, runtime memory stores)

    L0e
    Edge & On-Device Compute

    Local inference on phones, vehicles, sensors, endpoints

    NVIDIAAMDTSMCCoreWeaveEquinixShovel sellers win every gold rush.
    Deep dive on L0
    rockL1Data

    The Raw Gold Ore

    The unrefined material pulled from the earth. Some mines have pure veins (proprietary data), others have common dirt (public data). Public data is already mined by everyone. The L1b test: if your data is public, the model layer wins.

    L1a
    Public & Open Data

    Common Crawl, Wikipedia, government data, open datasets

    L1b
    Proprietary Data

    Licensed, paywalled, or internally generated training corpora

    L1c
    Behavioral & Sensor Data

    Clicks, sessions, interaction logs, and camera, LiDAR, IMU, telemetry, and physical-world sensor streams for robotics and autonomy

    L1d
    Outcome Data

    Labels, results, conversions, win/loss, audit trails, what actually happened after the model acted

    L1e
    Synthetic & Simulation Data

    Machine-generated corpora and simulated environments (Isaac Sim, CARLA, Omniverse, world-sim) for training, augmentation, and embodied agent rollout

    Apollo.ioBloombergZoomInfoScale AIStructurally safe. API-first wins.
    Deep dive on L1
    flameL2Models

    The Smelter & Refinery

    Raw ore becomes pure gold through smelting. In AI: foundation, specialized, and reasoning models refine raw data into intelligence. Refining is expensive and only a few can do it at scale, but once refined, the gold is a commodity.

    L2a
    Foundation & Multimodal Models

    Large pre-trained generalists, GPT, Claude, Gemini, Llama, and vision-language-action and video models (Sora, Veo) that span text, image, audio, and motion

    L2b
    Specialized & Fine-Tuned Models

    Domain-tuned, distilled, and PEFT/LoRA-adapted models for specific verticals or tasks (BloombergGPT, Med-PaLM, Codestral)

    L2c
    Embedding & Retrieval

    Vector representations, search indices, reranking, and RAG infrastructure

    L2d
    Model Routing & Composition

    Selecting, chaining, ensembling, or mixture-of-experts routing across multiple models per task to balance cost, latency, and quality

    L2e
    Reasoning & World Models

    Extended chain-of-thought, planning, and multi-step inference, plus predictive world models (V-JEPA, Genie, Sora-as-simulator) that let agents and robots imagine outcomes before acting

    OpenAIAnthropicGoogle DeepMindMeta AIWinner-take-most. Commodity risk high.
    Deep dive on L2
    shieldL3Gates

    The Hallmark & Assay Office

    Before gold enters the market, the assay office verifies purity and the hallmark guarantees quality. In AI: compliance, evals, safety, editorial taste, and distribution control are the gates. Without the hallmark, no enterprise, and no app store, lets you in.

    L3a
    Compliance & Export Controls

    Regulatory, legal, and policy filters (HIPAA, GDPR, SOC 2, EU AI Act), plus chip export controls, model sovereignty, and data-residency regimes that decide where the stack is allowed to run

    L3b
    Quality Gates

    Accuracy, hallucination detection, output grading, eval harnesses, regression suites

    L3c
    Safety, Security & Provenance

    Harmful-content filtering, adversarial defense, prompt-injection protection, and content provenance (C2PA, watermarking, deepfake attestation) that proves what was generated and by whom

    L3d
    Editorial Gates

    Tone, brand voice, style, taste, the human judgment layer

    L3e
    Distribution Gates

    App store approval, ranking, marketplace curation, discovery control

    VantaDrataOneTrustApple App StoreEssential. More agents = more access control.
    Deep dive on L3
    railroadL4Access

    The Railroads & Transport

    Refined gold needs to move, by rail, armored truck, secure vault. In AI: APIs, MCP, real-time pipes, and agent identity move intelligence between systems. Grammarly survived because it had tracks into every workflow. Jasper had none.

    L4a
    API & Integration Layer

    REST/GraphQL endpoints, SDKs, webhooks connecting AI to systems

    L4b
    Agent Interface Protocols

    MCP, tool-use specs, agent-to-agent communication standards

    L4c
    Access Governance & Agent Commerce

    Who can use what, RBAC, scoping, audit trails, and agent-payment rails (Stripe/Visa/Mastercard agent-pay, spend limits, programmatic checkout, machine-to-machine billing)

    L4d
    Real-Time Interaction Infrastructure

    Streaming, voice pipelines, video, low-latency modality transport

    L4e
    Agent Identity & Provenance

    Verifying which agent acted, credential chains, trust signatures

    AWSSnowflakeSupabaseTwilioLoad-bearing walls. Invest accordingly.
    Deep dive on L4
    gemL5Execution

    The Master Jeweler

    A jeweler takes refined gold and crafts rings, necklaces, watches, each requiring specialized skill. In AI: domain skills, decision frameworks, and operating playbooks transform generic intelligence into specific capability. Harvey knows legal. Sierra knows CX.

    L5a
    Domain Execution & Tool Use

    Doing the actual work, legal drafting, code generation, diagnosis, underwriting, including function calling, code interpreter, browser/computer use, and structured tool invocation that turns a model into an operator

    L5b
    Decision Frameworks & Reasoning Scaffolds

    Structured thinking patterns, checklists, rubrics the agent follows

    L5c
    Retrieval-Augmented Workflows

    Grounding execution in retrieved context, knowledge, and documents

    L5d
    Operating Playbooks

    Company-specific SOPs, rules, preferences encoded for agents

    L5e
    Interaction Skills & Actuation

    Tone, empathy, negotiation, persuasion, and physical-world actuation (robotic control, valve/vehicle/device operation)

    HarveySierra11xCursorDurable if deep. Generic skills get absorbed.
    Deep dive on L5
    storefrontL6Orchestration

    The Jewelry Store & Workshop

    A single ring is useful. A curated collection with fitting and custom design is an experience. In AI: orchestration composes individual skills into multi-step workflows with human override and runtime assurance. One skill → one task. Orchestration → entire workflows.

    L6a
    Agent Loops

    Single-agent plan-act-observe cycles

    L6b
    Human-in-the-Loop

    Escalation patterns, approval workflows, human override design

    L6c
    Role Routing & Task Decomposition

    Breaking complex work into subtasks and assigning to the right agent

    L6d
    Context & State Management

    Maintaining working memory, session state, context windows across steps

    L6e
    Runtime Assurance & Learning Loops

    Post-deployment monitoring, evals, feedback pipelines, drift detection

    LangChainCrewAIZapier (at risk)Make (at risk)Contested. Becoming a feature, not a product.
    Deep dive on L6
    ringL7Surface

    Wearing the Jewelry, The Moment of Experience

    People see the ring on the finger, the surface, the sparkle, the emotional moment. In AI: chat, dashboards, copilots, and ambient agents are the surfaces. Beautiful, but the most exposed layer, unless you're embedded inside the workflow or own the moment of transaction.

    L7a
    Conversational

    Voice and chat interfaces, the talking layer

    L7b
    Visual Interfaces & Media

    Dashboards, generated images, video, rich media output

    L7c
    Embedded & Embodied AI

    AI woven into existing tools (IDE copilots, email assistants, in-app agents) and embodied in physical hardware (robots, devices, vehicles)

    L7d
    Transaction Surface

    Where the AI closes a deal, books an appointment, processes a payment

    L7e
    Async & Ambient Surfaces

    Background agents, notifications, proactive nudges, always-on monitoring

    ChatGPTGeminiCopilotElevenLabsModality = commodity. Context = moat.
    Deep dive on L7
    bookL8Memory

    The Record Book, Compounding Knowledge

    The jeweler keeps records: which designs sold, which metals each customer prefers. Over time, this memory makes every decision better. In AI: session, entity, network, institutional, and world-model memory compound. The system that remembers wins long-term.

    L8a
    Session & Short-Term Memory

    Within-conversation context, scratch state, working memory

    L8b
    User & Entity Profiles

    Persistent preferences, history, relationship context per user or account

    L8c
    Aggregated Network Learning

    Patterns learned across many users/customers, fleet intelligence

    L8d
    Institutional Knowledge

    What the organization knows, docs, decisions, tribal knowledge encoded

    L8e
    Learned World Models

    The system's accumulated causal understanding of how things work

    SierraNotion (partial)Rewind AIThe ultimate moat. Memory that compounds wins.
    Deep dive on L8

    ← Key insight: Each layer transforms the output of the layer below it. Land and power (L-1) feed the shovels (L0). Shovels mine the ore (L1). Ore is refined (L2), assayed (L3), transported (L4), crafted (L5), arranged (L6), and worn (L7), and none of it compounds without record-keeping (L8). The supply chain is only as strong as its weakest layer, and most companies only own one.

    On the word "agent"

    "Agent" is not a layer. It's a costume worn by L5.

    Every company shipping "an agent" in 2025 is selling the same structural package: L5 Execution + L6 Orchestration, usually wrapped in an L7 Surface, sometimes with L8 Memory, riding on L4 Access pipes. When you read "we launched an agent," decode it: name L5 + L6 first, then which of L4 / L7 / L8 it bundles.

    The Decoder

    • Agent + L1b Proprietary Data → fortress. (Sierra, Harvey, Klarna's internal stack.)
    • Agent + L4 Distribution → railroad. (Salesforce Agentforce, Microsoft Copilot agents.)
    • Agent + L8 Compounding Memory → memory moat. (Glean, Cresta, Decagon.)
    • Agent + nothing else → exposed L7 wrapper. Commoditizes the moment the underlying L2 ships the same loop.

    The forces acting on the map

    Three Currents flow across every layer

    The 10 layers describe how intelligence is produced and delivered, the supply side. Three market currents flow horizontally across every layer and decide whether a defensible position actually compounds into a business.

    Currents are market forces, not layers. Regulatory and geopolitical constraints live at their native layers (L−1 energy/fabs/materials, L3 compliance and export controls) and are not currents.

    C1

    Demand Gravity

    Where the budget actually sits, and what it pulls toward.

    As L2 prices collapse, demand moves toward outcomes (L5+L8), verification (L3), and proprietary data access (L1), not generation itself.

    Use it · Name the buyer, the budget line, and what they stop paying for once L2 is free.

    C2

    Attention Economics

    What becomes scarce when generation becomes infinite.

    Default placement, OS integration, habit loops, and on-ramp ownership decide who gets used. Apple, Google, Microsoft become L7 landlords charging rent in attention.

    Use it · Assume infinite supply. Ask: who owns the on-ramp, what does default placement cost?

    C3

    Capital Flows

    How funding rounds bend the chain they fund.

    Tens of billions into L2 created a generation glut; near-zero into L−1 created the energy and fab bottleneck constraining everything above it. Capital overheats the fashionable layer and starves the unglamorous one.

    Use it · Read the funding map as a distortion field, not as a value signal.

    03

    The Laws

    Why does it work?

    Four structural laws predict the future.

    Not opinions. Structural forces that explain why most AI products get compressed in the layer they were built in, and which counter-moves keep them durable as the platforms move. Each Law is falsifiable: name a counter-example mechanism and the Law has to be amended.

    I

    Intelligence Commoditizes Downward

    If your product depends only on generic model capability, the platform layer below you will eventually absorb it. Wrappers don't survive, wrappers become features.

    Jasper ($1.5B → ~$300M) was a wrapper on GPT. Once ChatGPT shipped, the value flowed to L2.

    Predicts WHO gets absorbed.

    II

    Value Accrues at Bottlenecks

    Durable value rarely sits in the model or the UI. It sits at the scarce layer, proprietary data, workflow control, verification, distribution, memory, compliance, or trust. Find the bottleneck. Own it.

    NVIDIA owns L0 silicon. Vanta owns L3 compliance. Bloomberg owns L1b data. Each is the bottleneck in their chain.

    Predicts WHERE value is going.

    III

    The Surface Captures Attention; the Chain Captures Power

    A beautiful UI may get users. But durable companies own a deeper layer of the intelligence chain, data, execution, memory, gates. Surface without depth rarely compounds.

    Gamma owns L7 surface. Replit owns agent + code-gen + hosting + auth + database (L4 + L5 + L6 + L8). Same prompt-to-output category. Different fate.

    Predicts WHO survives the platform era.

    IV

    Generation and Verification Must Be Separate

    Wherever output carries fiduciary, regulatory, safety, or reputational weight, the generator and the verifier must be separate economic entities. L3 above L2/L5 is structurally permanent, the model can't audit itself, the codegen can't certify itself, the drafter can't approve itself.

    Vanta (L3) over AWS/OpenAI. Snyk (L3) over Copilot. Big-4 audit over SAP. Ironclad over Harvey. The verifier survives every model generation.

    Predicts WHERE L3 is non-absorbable.

    Why the Laws matter

    Jobs To Be Done finds demand. Supply Chain of Intelligence finds defensibility.

    Jobs To Be Done
    Supply Chain of Intelligence™
    What does it answer?
    Why will users hire this product?
    Why won't a platform fire it next quarter?
    What does it find?
    Demand.
    Defensibility.
    Time horizon
    Today's user need.
    Tomorrow's structural position.
    Failure mode it catches
    Building something nobody wants.
    Building something everyone can copy or absorb.
    Audience
    PMs, designers, researchers.
    Founders, product leaders, investors, boards.
    Output
    Roadmap, features, positioning.
    Layer ownership, moat strategy, exit/defend/deepen call.
    04

    The Dynamics

    How does it evolve?

    Six patterns under the Laws. Six archetypes above them.

    The Laws say what is structurally true. The Dynamics describe how the stack actually moves: repeatable market patterns we see across hundreds of AI companies, and the six fates every SaaS company collapses into. Patterns earn promotion to Laws over time; archetypes describe where companies end up.

    Six structural patterns

    Pattern · 01
    L3GatesL5Execution

    The Two-Vendor Rule

    Enterprises will pay for two vendors when one vendor's mistake is unrecoverable. Codegen + code-security. Draft + review. Model + eval. Trade + clearing. The buyer pays the duplication tax to avoid the single-point-of-failure tax.

    • Cursor for codegen + Snyk/Semgrep for security review, no CISO accepts the same vendor doing both.
    • Harvey drafts contracts; Ironclad/Kira reviews them. The drafter is structurally not allowed to be the approver.
    Pattern · 02
    L3Gates

    Regulatory Half-Life

    The more regulated the industry, the longer L3 outlives L2 churn. A compliance gate written into law is a moat measured in decades, not quarters. Models cycle every 6 months; SOC 2, HIPAA, EU AI Act, FDA 510(k) cycle every 5–10 years.

    • Vanta and Drata are 4 model generations old and untouched. The frontier model labs are not certifying themselves.
    • Epic's L3+L4 position in healthcare predates the entire AI wave and will outlive GPT-7.
    Pattern · 03
    L2ModelsL3Gates

    The Bundling Asymmetry

    Foundation model labs will expand from L2 into L5/L6/L7, adjacent value, because the buyer accepts the same vendor doing both. They will not expand across the trust boundary into L3 above themselves. OpenAI will ship agents. OpenAI will not issue its own SOC 2 audit.

    • OpenAI shipped GPTs, the Apps SDK, Operator, and Codex, all L5/L6/L7 expansion. None of it is self-certification.
    • AWS ships hundreds of services but pays Vanta/Drata for compliance evidence. The platform respects the boundary.
    Pattern · 04
    L8MemoryL3Gates

    Memory Is Not Truth

    L8 memory of what happened, what the user said, did, preferred, is a clean moat. L8 claims about what is true, diagnoses, legal positions, financial valuations, require an L3 verifier above them. The moment memory makes a truth claim, it inherits a regulator.

    • Notion AI remembers your docs (L8b, defensible). It does not diagnose your patients.
    • An AI medical scribe (L8) is valuable; the same scribe issuing a diagnosis triggers FDA (L3) and an MD signature requirement.
    Pattern · 05
    L2ModelsL7Surface

    Distribution Eats Generation

    Once L2 commoditizes (and it always does), the surplus flows to whichever layer owns the user's moment of consumption, L7c (embedded copilot) or L7d (transaction surface). The model is generic; the context of use is not.

    • Cursor captures the codegen surplus, not the model underneath it. The model is interchangeable; the IDE moment is not.
    • Perplexity captures the answer surplus by owning the question moment. The model could be any of four, the surface is the moat.
    Pattern · 06
    L1DataL3GatesL7Surface

    The Gatekeeper Tax is Always Arbitraged

    Wherever a gatekeeper extracts rent between the marginal cost of supply and the perceived value of demand, an arbitrageur, API shim, cloud automation, open-source replacement, lateral integration, or regulatory appeal, will step into the gap. The gatekeeper's pricing power is bounded by the cost of the workaround. The arbitrageur lives at L7 and quietly reaches down into L5 to widen the margin further.

    • Dripify (L7) arbitrages LinkedIn's (L1+L3) connection-request bottleneck: cloud automation + proxies cost pennies, sales teams pay $39–$99/seat/month. Newer entrants now hook L5 open-source LLMs to auto-reply, compressing the last human cost.
    • Plaid (L4b) arbitraged the bank gatekeepers' API absence for a decade; the moment banks shipped their own APIs, Plaid's margin compressed and it had to migrate up into identity and data.
    Read the case study: Dripify vs LinkedIn, the L7 arbitrageur

    Six company archetypes

    The six fates of SaaS

    Every SaaS company collapses into one of these patterns over time.

    Data Refineries

    safe

    L1b ★ — proprietary data compounds. Apollo, Bloomberg.

    Infrastructure Rails

    safe

    L4b/L4e ★ — essential pipes & agent identity. Supabase, Twilio.

    Workflow Fortresses

    contested

    L5+L6b ★ — agent loops + human-in-loop. Salesforce, HubSpot.

    Domain Specialists

    safe

    L5a/b/d ★ + L8c ★ — Harvey, Sierra. Encoded expertise.

    Thin-Layer Graveyard

    dead

    L7a/L7b, no ★ — Gamma, Jasper, Chegg. Already dead.

    Full-Stack Juggernauts

    dominant

    L2a+L7c/d ★+L8c ★ — Claude, ChatGPT, Copilot.

    05

    The Applications

    How do I use it?

    Two instruments. Four self-serve tools.

    The framework comes with instruments — the Defensible Triangle and the Intelligence Cube — that turn the map into something you can place a company on. Plus self-serve tools you can run on your own product, portfolio, or thesis: a defensibility audit, the market maps, the playbook, and the live predictions.

    Instrument · The Defensible Triangle

    One common pattern, not the only way to win

    The Triangle (L1b + L5a/b/d + L8c/d/e) is a recurring fortress pattern across application- layer companies — Sierra, Harvey, Glean, BloombergGPT, Tempus all exhibit some version of it. It is not the only way to survive. A pure gatekeeper like Vanta wins on L3 alone; NVIDIA wins on L0; Snowflake on L4. Owning one layer deeply can be enough. What kills you is owning a thin sliver of a contested one.

    L1b

    Proprietary Data

    Data behind enterprise walls. No one else has it.

    L5a/b/d

    Deep Skills & Playbooks

    Domain execution, decision frameworks, company SOPs.

    L8c/d/e

    Compounding Memory

    Network learning, institutional knowledge, world models.

    Instrument · The Intelligence Cube™

    10 Functions × 10 Verticals × 10 Layers

    Volume in the Cube = structural durability. Height is layers, width is functions, depth is verticals. Thin single-layer plays compress fast; multi-layer stacks hold longer. The counter-move is always to add depth.

    Sierra — defensible stack

    Customer Care × 4 verticals × 3 layers (L1b + L5b + L8c). Memory compounds per customer.

    L1b ★L5a ★L8c ★

    Gamma — thin stack, exposed

    Product + PM × cross-industry × L7a only. Counter-move: add L1 proprietary data, L5 templates, or L8 per-team memory.

    L7a only
    06

    The Observations

    What's happening now?

    Live readings of the market.

    The framework does not change weekly. Which company sits in which layer does. Observations are time-bound applications of the map — current verdicts, predictions, case studies, market maps. Every reading carries a re-review date. This is the only section of the framework that has dates attached.

    Rule of thumb: if a claim has a date attached, it is an Observation. If changing it would force a Paper version bump, it is in the Map or the Laws. If changing it would mean the framework is wrong about what AI is, it is in the Definition.

    Shareable

    Want the poster?

    The full 10×5 grid and the square version, both downloadable.

    Open the poster gallery →