A new operating system for training AI.
Kojen is an end-to-end platform for training, evaluating, and continuously improving agentic AI systems. We give teams a single workflow — from problem statement to deployed model — guided by an AI trainer ("Sensei") that critiques and refines every step.
Today, fewer than 13% of ML projects reach production. Teams cobble together a dozen vendors, lose months to data plumbing, and watch deployed models silently regress. Kojen replaces this with a disciplined, opinionated dojo: dataset forge, training katas, evaluation, deployment, and continuous learning — in one system, with Sensei in the loop.
We are raising a $6M seed round to scale Sensei, expand our kata library, and bring Kojen to every team building agentic AI in the next 18 months.
Make training AI a craft anyone can master.
Vision: A world where every product team can train, evaluate, and ship an agentic model with the same confidence they ship code today.
Mission: Build the definitive training platform for agentic AI — opinionated, guided, and continuously learning — so practitioners spend their time on judgment, not plumbing.
Our north-star metric is models in production per active team. When that number rises, every other business metric follows.
Training AI is artisanal, fragile, and lonely.
Modern AI teams operate in a fragmented stack: notebooks for experimentation, scripts for data, separate vendors for fine-tuning, evaluation, deployment, and monitoring. The result: brittle pipelines, slow cycles, and silent failure.
Most ML projects die between notebook and production.
The median team stitches a dozen vendors per model.
Models regress in production with no feedback loop.
ML engineers are bottlenecked on infrastructure work.
The cost of failure is enormous: average spend per failed model attempt exceeds $2M when you account for engineering time, GPU burn, and opportunity cost.
One platform. Every step of the training journey.
Kojen unifies the training workflow into a single, opinionated dojo. Practitioners describe intent; Kojen generates the dataset schema, suggests architecture, runs training, evaluates against katas, deploys, and monitors for drift — with Sensei guiding every decision.
Describe a problem. Kojen returns a complete, executable plan.
A library of repeatable training kata for every common task.
AI trainer that reviews datasets, critiques runs, and suggests fixes.
Production data flows back into retraining automatically.
The result: cycle time drops from weeks to hours, model quality improves measurably, and teams ship more models per quarter than they previously shipped per year.
A $420B opportunity by 2030.
Foundation models commoditized inference. The next wave of value is in training, fine-tuning, and continuous improvement — accessible to every team, not just frontier labs. Open weights (Llama, Mistral, Qwen), agentic workflows, and enterprise compliance demands are converging into a generational tailwind.
Global AI infrastructure & MLOps by 2030
Model training & fine-tuning platforms
Agentic AI training tools, 5-yr capture
Sources: Grand View Research, IDC AI Infrastructure Forecast 2025, internal modeling.
The Kojen platform.
Kojen ships as a unified web platform with a CLI and SDK. Every surface is built around the same primitive: a kata — a reproducible training routine with dataset, model, eval, and deploy spec.
Conversational AI trainer with chat and Intent → Model modes.
Visual dataset builder with schema generation and synthetic data.
Drag-and-drop training pipelines with versioning and rollback.
One-click deploy to managed inference with autoscaling.
Drift monitoring, feedback logs, scheduled retrains.
Share and remix kata, datasets, and templates with the community.
A fragmented field. No unified dojo.
Today's incumbents specialize in slices: data labeling (Scale, Surge), training infra (Modal, Together), MLOps (Weights & Biases), and inference (Replicate, Fireworks). None cover the full agentic loop with an opinionated trainer.
| Player | Focus | Gap vs. Kojen |
|---|---|---|
| Weights & Biases | Experiment tracking | No agentic guidance, no deploy/retrain loop |
| Modal / Together | Training infrastructure | Compute only — no opinions, no Sensei |
| Hugging Face | Model hub & libraries | DIY workflow; no end-to-end training journey |
| Scale AI | Data labeling | Single layer; no model lifecycle |
| Replicate / Fireworks | Hosted inference | Inference only; no training discipline |
Kojen's moat: the kata library + Sensei + continuous learning loop. Each customer that trains on Kojen makes Sensei sharper, and every shared kata strengthens our marketplace.
Usage-based SaaS. Land small, expand fast.
Kojen monetizes through tiered subscriptions plus usage-based billing for compute and inference. Self-serve practitioners become team accounts, which become enterprise contracts.
Solo builders. Unlimited katas, 10 models, community Sensei.
Up to 10 seats, shared workspaces, Sensei Pro, priority compute.
Private deploy, SSO/SAML, dedicated GPUs, SLA, audit logs.
Bottom-up adoption. Top-down expansion.
We win the practitioner first. Kojen's free tier and open kata marketplace pull individual ML engineers in; viral mechanics (shared kata, public leaderboards, embedded Sensei reviews) create organic distribution. Once 2+ practitioners on a team adopt Kojen, an inbound team-tier conversation begins.
Open-source kata, docs, community, conference talks.
Tutorials, model walkthroughs, Sensei case studies.
Targeted ABM into Series B+ AI-forward companies.
Integrations with cloud providers, vector DBs, observability.
GTM headcount in year one: 1 head of growth, 2 DevRel, 1 founding AE, 1 founding SE. We expect 70% of new ARR to originate from product-led signals in the first 18 months.
Built on a modern, scalable stack.
Kojen runs as a cloud-native platform with multi-tenant isolation, GPU-aware scheduling, and a managed control plane. The customer surface is built on TanStack Start with edge SSR; the training plane orchestrates jobs across multiple cloud GPU providers with automatic spot/on-demand arbitrage.
Internally, we ship daily, run weekly Sensei evaluations against a held-out kata suite, and review every customer training run in a shared support channel. Operational discipline is part of the brand.
Operators who've shipped AI at scale.
Ex-AW3 venture studio. Shipped AI products to millions of users.
Former research lead. PhD applied ML. 40+ peer-reviewed papers.
Built developer tools used by 100K+ engineers at prior co.
Distributed systems veteran. Scaled infra to 10M+ jobs/day.
Today: 11 FTE (6 engineering, 2 AI research, 1 product, 1 design, 1 operations). Year-one plan: scale to 28 FTE, weighted toward AI research and developer relations.
Advisors and angels include operators from OpenAI, Anthropic, Hugging Face, and the AW3 venture studio network.
Capital-efficient path to $20M ARR.
Our financial plan models conservative bottom-up adoption with top-down enterprise expansion in year two. We reach default-alive cash-flow at $14M ARR, with a path to $20M ARR by end of year three on a single seed + Series A.
| Metric | Y1 (2026) | Y2 (2027) | Y3 (2028) |
|---|---|---|---|
| ARR | $3.4M | $11.2M | $24.8M |
| Paying customers | 640 | 2,100 | 4,800 |
| Gross margin | 72% | 76% | 79% |
| Headcount | 28 | 52 | 85 |
| Cash burn / mo | $420K | $580K | $310K |
| Runway | 30 mo | 22 mo | Default alive |
Use of funds skews toward AI research (Sensei) and engineering; GTM ramps in late year one as product-led signal proves repeatable.
Honest about what could go wrong.
Mitigation: We partner with model providers (not compete) and own the multi-model, opinionated workflow layer they will not build.
Mitigation: Multi-cloud arbitrage, spot/on-demand mix, and reserved capacity contracts with three GPU partners.
Mitigation: Sensei + the kata marketplace + managed continuous learning create proprietary value above any OSS substitute.
Mitigation: Self-serve practitioner tier funds early growth; enterprise is upside, not the base case.
Mitigation: Founder network, advisor pipeline, and a research residency program launching in Y1.
The next 18 months.
Launch reasoning-tuned Sensei, retrain scheduling, and drift alerts.
Community-shared katas with revenue share for top contributors.
SSO/SAML, audit logs, private VPC deploy, dedicated GPU pools.
Train and evaluate multi-agent systems as first-class primitives.
Raise on $11M+ ARR run rate to expand GTM and international.
Raising $6M seed.
We are partnering with funds and operators who want to shape how the next decade of AI is trained. The round is led by a committed lead; we have room for a small number of additional partners with relevant networks in AI infrastructure, developer tools, or enterprise GTM.
Ready to enter the dojo?
Reach out for a full data room, financial model, and customer references.
