Where chaos
becomes craft.

Transmuting code into confidence — a structured pipeline that orchestrates AI agents across any provider, scores every output, and heals itself when things go sideways.

pipeline.yaml
phases:
  spec          # define what to build
  behavioral    # write contracts
  adversary     # challenge them
  tests         # prove it works before it exists
  implement     # write the code
  review        # score it
  test          # verify it

transitions:
  review.fail: implement  # loop back, not forward

THE PROBLEM

AI is probabilistic.
Your pipeline shouldn't be.

AI agents are coin flips wrapped in confidence. Every run produces different output. Every output needs verification. Most teams either trust blindly or review manually. Neither scales.

The industry built orchestrators for routing — nobody built one for trust.

THE VISION

From probabilistic
to deterministic.

Orchemist is not another orchestrator. It is a trust engine.

01

Specs before code

Define what success looks like before writing a single line.

02

Adversarial review

Challenge assumptions before you build on them.

03

Tests before implementation

Prove it works before it exists.

04

Scoring and routing

Quantify confidence. Gate on thresholds. Route on results.

05

Bounded self-healing

Retry with feedback, not blindly. Surface blockers, don't hide them.

ARCHITECTURE

Any workload. Any sequence.
One engine.

A pipeline is a YAML state machine. You define phases, transitions, and quality gates. The engine follows the graph. Swap phases, change models, add gates — without touching code.

SPEC Define gates BUILD Execute phase SCORE Evaluate output GATE OK start done scored pass fail / retry with feedback escalate / re-spec

SHOWCASES

Proven in production.

THE MARKET

They orchestrate agents.
We orchestrate trust.

Seven frameworks compete on how elegantly they route agents. None compete on whether you can trust what those agents produced. LangGraph, CrewAI, Pydantic AI, Google ADK -- all solve communication. Orchemist solves verification.

WHAT THEY DO
WHAT WE ADD
Agent routing
Behavioral acceptance tests
Prompt chaining
Confidence scoring with routing
Multi-model support
Adversarial spec review
Workflow graphs
Bounded self-healing
Full audit trail

TRACTION

Numbers, not promises.

0
Tests passing
0
Pipeline runs
0
Execution modes
MIT
Licensed

GET STARTED

Five minutes to
your first pipeline.

Install. Scaffold. Launch. The engine handles the rest.

terminal
$ pip install orchemist

$ orch new my-pipeline

$ orch run my-pipeline.yaml --mode openrouter