Comprehensive Business Plan · Confidential

Building the dojo for agentic AI.

Kojen is a full-stack training platform that turns intent into production-grade agentic models. This document outlines our strategy, market, product, financials, and roadmap.

$420B
TAM by 2030
$1.2M
ARR run rate
94%
Weekly retention
$6M
Seed raise
01 — Executive Summary

A new operating system for training AI.

Kojen is an end-to-end platform for training, evaluating, and continuously improving agentic AI systems. We give teams a single workflow — from problem statement to deployed model — guided by an AI trainer ("Sensei") that critiques and refines every step.

Today, fewer than 13% of ML projects reach production. Teams cobble together a dozen vendors, lose months to data plumbing, and watch deployed models silently regress. Kojen replaces this with a disciplined, opinionated dojo: dataset forge, training katas, evaluation, deployment, and continuous learning — in one system, with Sensei in the loop.

We are raising a $6M seed round to scale Sensei, expand our kata library, and bring Kojen to every team building agentic AI in the next 18 months.

Stage
Seed
Round
$6M
ARR
$1.2M run-rate
Retention
94% weekly
HQ
San Francisco, CA
Team
11 FTE
02 — Vision & Mission

Make training AI a craft anyone can master.

Vision: A world where every product team can train, evaluate, and ship an agentic model with the same confidence they ship code today.

Mission: Build the definitive training platform for agentic AI — opinionated, guided, and continuously learning — so practitioners spend their time on judgment, not plumbing.

Our north-star metric is models in production per active team. When that number rises, every other business metric follows.

03 — Problem

Training AI is artisanal, fragile, and lonely.

Modern AI teams operate in a fragmented stack: notebooks for experimentation, scripts for data, separate vendors for fine-tuning, evaluation, deployment, and monitoring. The result: brittle pipelines, slow cycles, and silent failure.

87% never ship

Most ML projects die between notebook and production.

12+ tools

The median team stitches a dozen vendors per model.

Silent drift

Models regress in production with no feedback loop.

Talent scarcity

ML engineers are bottlenecked on infrastructure work.

The cost of failure is enormous: average spend per failed model attempt exceeds $2M when you account for engineering time, GPU burn, and opportunity cost.

04 — Solution

One platform. Every step of the training journey.

Kojen unifies the training workflow into a single, opinionated dojo. Practitioners describe intent; Kojen generates the dataset schema, suggests architecture, runs training, evaluates against katas, deploys, and monitors for drift — with Sensei guiding every decision.

Intent → Model

Describe a problem. Kojen returns a complete, executable plan.

Forge & Katas

A library of repeatable training kata for every common task.

Sensei

AI trainer that reviews datasets, critiques runs, and suggests fixes.

Continuous Learning

Production data flows back into retraining automatically.

The result: cycle time drops from weeks to hours, model quality improves measurably, and teams ship more models per quarter than they previously shipped per year.

05 — Market Opportunity

A $420B opportunity by 2030.

Foundation models commoditized inference. The next wave of value is in training, fine-tuning, and continuous improvement — accessible to every team, not just frontier labs. Open weights (Llama, Mistral, Qwen), agentic workflows, and enterprise compliance demands are converging into a generational tailwind.

TAM
$420B

Global AI infrastructure & MLOps by 2030

SAM
$48B

Model training & fine-tuning platforms

SOM
$2.1B

Agentic AI training tools, 5-yr capture

Sources: Grand View Research, IDC AI Infrastructure Forecast 2025, internal modeling.

06 — Product

The Kojen platform.

Kojen ships as a unified web platform with a CLI and SDK. Every surface is built around the same primitive: a kata — a reproducible training routine with dataset, model, eval, and deploy spec.

01
Intent
02
Dataset
03
Train
04
Evaluate
05
Deploy
Sensei

Conversational AI trainer with chat and Intent → Model modes.

Forge

Visual dataset builder with schema generation and synthetic data.

Pipelines

Drag-and-drop training pipelines with versioning and rollback.

Deployments

One-click deploy to managed inference with autoscaling.

Continuous Learning

Drift monitoring, feedback logs, scheduled retrains.

Marketplace

Share and remix kata, datasets, and templates with the community.

07 — Competitive Landscape

A fragmented field. No unified dojo.

Today's incumbents specialize in slices: data labeling (Scale, Surge), training infra (Modal, Together), MLOps (Weights & Biases), and inference (Replicate, Fireworks). None cover the full agentic loop with an opinionated trainer.

PlayerFocusGap vs. Kojen
Weights & BiasesExperiment trackingNo agentic guidance, no deploy/retrain loop
Modal / TogetherTraining infrastructureCompute only — no opinions, no Sensei
Hugging FaceModel hub & librariesDIY workflow; no end-to-end training journey
Scale AIData labelingSingle layer; no model lifecycle
Replicate / FireworksHosted inferenceInference only; no training discipline

Kojen's moat: the kata library + Sensei + continuous learning loop. Each customer that trains on Kojen makes Sensei sharper, and every shared kata strengthens our marketplace.

08 — Business Model

Usage-based SaaS. Land small, expand fast.

Kojen monetizes through tiered subscriptions plus usage-based billing for compute and inference. Self-serve practitioners become team accounts, which become enterprise contracts.

Practitioner
$49/mo

Solo builders. Unlimited katas, 10 models, community Sensei.

Team
$499/mo

Up to 10 seats, shared workspaces, Sensei Pro, priority compute.

Enterprise
Custom

Private deploy, SSO/SAML, dedicated GPUs, SLA, audit logs.

Gross margin (target)
78%
Net revenue retention
138%
CAC payback
9 months
LTV : CAC
5.4x
09 — Go-to-Market

Bottom-up adoption. Top-down expansion.

We win the practitioner first. Kojen's free tier and open kata marketplace pull individual ML engineers in; viral mechanics (shared kata, public leaderboards, embedded Sensei reviews) create organic distribution. Once 2+ practitioners on a team adopt Kojen, an inbound team-tier conversation begins.

Developer relations

Open-source kata, docs, community, conference talks.

Content engine

Tutorials, model walkthroughs, Sensei case studies.

Enterprise outbound

Targeted ABM into Series B+ AI-forward companies.

Partner ecosystem

Integrations with cloud providers, vector DBs, observability.

GTM headcount in year one: 1 head of growth, 2 DevRel, 1 founding AE, 1 founding SE. We expect 70% of new ARR to originate from product-led signals in the first 18 months.

10 — Operations & Tech

Built on a modern, scalable stack.

Kojen runs as a cloud-native platform with multi-tenant isolation, GPU-aware scheduling, and a managed control plane. The customer surface is built on TanStack Start with edge SSR; the training plane orchestrates jobs across multiple cloud GPU providers with automatic spot/on-demand arbitrage.

Cloud
Multi-cloud (AWS, GCP, Lambda)
GPU partners
Lambda, CoreWeave, RunPod
Storage
Object store + vector index
Observability
OpenTelemetry, custom drift metrics
Security
SOC 2 Type II (in progress)
Compliance
GDPR, HIPAA-ready architecture

Internally, we ship daily, run weekly Sensei evaluations against a held-out kata suite, and review every customer training run in a shared support channel. Operational discipline is part of the brand.

11 — Team & Org

Operators who've shipped AI at scale.

Founder & CEO

Ex-AW3 venture studio. Shipped AI products to millions of users.

Head of AI

Former research lead. PhD applied ML. 40+ peer-reviewed papers.

Head of Product

Built developer tools used by 100K+ engineers at prior co.

Head of Engineering

Distributed systems veteran. Scaled infra to 10M+ jobs/day.

Today: 11 FTE (6 engineering, 2 AI research, 1 product, 1 design, 1 operations). Year-one plan: scale to 28 FTE, weighted toward AI research and developer relations.

Advisors and angels include operators from OpenAI, Anthropic, Hugging Face, and the AW3 venture studio network.

12 — Financial Plan

Capital-efficient path to $20M ARR.

Our financial plan models conservative bottom-up adoption with top-down enterprise expansion in year two. We reach default-alive cash-flow at $14M ARR, with a path to $20M ARR by end of year three on a single seed + Series A.

MetricY1 (2026)Y2 (2027)Y3 (2028)
ARR$3.4M$11.2M$24.8M
Paying customers6402,1004,800
Gross margin72%76%79%
Headcount285285
Cash burn / mo$420K$580K$310K
Runway30 mo22 moDefault alive

Use of funds skews toward AI research (Sensei) and engineering; GTM ramps in late year one as product-led signal proves repeatable.

40%
AI research & Sensei
30%
Engineering & platform
20%
GTM & DevRel
10%
Operations & runway
13 — Risks & Mitigations

Honest about what could go wrong.

Foundation model providers move into training tools.

Mitigation: We partner with model providers (not compete) and own the multi-model, opinionated workflow layer they will not build.

GPU supply or pricing shocks compress margins.

Mitigation: Multi-cloud arbitrage, spot/on-demand mix, and reserved capacity contracts with three GPU partners.

Open-source alternatives erode willingness to pay.

Mitigation: Sensei + the kata marketplace + managed continuous learning create proprietary value above any OSS substitute.

Long enterprise sales cycles starve early revenue.

Mitigation: Self-serve practitioner tier funds early growth; enterprise is upside, not the base case.

Talent scarcity in AI research slows roadmap.

Mitigation: Founder network, advisor pipeline, and a research residency program launching in Y1.

14 — Roadmap

The next 18 months.

Q2 2026
Sensei v2 + Continuous Learning GA

Launch reasoning-tuned Sensei, retrain scheduling, and drift alerts.

Q3 2026
Kata Marketplace public beta

Community-shared katas with revenue share for top contributors.

Q4 2026
Enterprise tier GA

SSO/SAML, audit logs, private VPC deploy, dedicated GPU pools.

Q1 2027
Multi-agent orchestrator

Train and evaluate multi-agent systems as first-class primitives.

Q2 2027
Series A

Raise on $11M+ ARR run rate to expand GTM and international.

15 — The Ask

Raising $6M seed.

We are partnering with funds and operators who want to shape how the next decade of AI is trained. The round is led by a committed lead; we have room for a small number of additional partners with relevant networks in AI infrastructure, developer tools, or enterprise GTM.

Begin Your Training

Ready to enter the dojo?

Reach out for a full data room, financial model, and customer references.

Kojen · Confidential← Back to Kojen