Open-model inference, verifiable

Open-model inference
you can audit.

Point your OpenAI client at Orogen and your existing code runs unchanged, at roughly $0.40 per 1M output tokens for Llama-3.1-70B. Every response comes back with a signed receipt you can verify yourself, and validators re-run a sample of jobs on identical hardware to catch anyone cutting corners.

Open Playground Start mining How it works

OpenAI-compatible HTTP + SSE
A signed receipt on every response
Validator replay on identical hardware

the verification loop you · network · chain

Live on the Forge testnet The OpenAI-compatible gateway, the Python and TypeScript SDKs, the CLI wallet, and a public chain node are all running today. Mainnet pricing and capacity are on the roadmap.

Why Orogen

Inference you do not have to take on faith.

Most inference networks rent you a GPU and ask you to trust whatever comes back. You cannot tell which model actually answered, whether it was the quantization you paid for, or whether the hardware was what it claimed, so you reconcile an invoice and hope it was honest. Orogen works the other way around. The network verifies the work and hands you the proof, so you check the result rather than take the operator's word for it.

A receipt you can check yourself

Every response carries a signed receipt that binds the model, your input, the output, and a hardware attestation into one record. You verify it locally in a single call before you trust the result.

The network checks the work for you

Validators independently re-run a sample of jobs on hardware identical to the operator's. Any mismatch between what an operator claimed and what the replay produces opens a challenge.

Paid in dollar terms for real work

Operators are paid per completed, verified job at oracle spot, denominated against the dollar, with most of each payout auto-swapped to a stablecoin. Pay tracks served demand, not token price.

Issuance tracks revenue

Customers pay in stablecoins, the gateway burns OROG to settle each job, and new OROG is minted only against work that validators have verified. Supply follows real inference revenue.

How it works

A loop between you, the network, and the chain.

Verification is a loop, and you sit at the start of it. Read the full walkthrough in how it works.

1
You verify

You send a request, receive an answer and a signed receipt, and verify that receipt locally in a single call.
2
The network checks

Validators replay a sample of jobs on identical hardware and flag any mismatch, which opens a challenge.
3
The chain records

Each job's commitment is stored on-chain, so a disputed result has a record to settle against rather than two parties arguing from memory.

For builders

Drop-in for the OpenAI code you already wrote.

Set your base URL to the Orogen gateway and your existing code keeps working. There is no new SDK to learn and no rewrite to schedule. Reach for the Orogen SDK when you want to pick a hardware tier per request or pull the receipt object directly.

Open the Playground Open the docs

from openai import OpenAI

client = OpenAI(
    api_key="orog_...",
    base_url="https://gateway.orogen.network/v1",
)

response = client.chat.completions.create(
    model="llama-3.1-70b-instruct",
    messages=[{"role": "user", "content": "Summarise this contract clause..."}],
)

Pricing

Roughly $0.40 per 1M output tokens.

That is the standard datacenter tier for Llama-3.1-70B, against $10 to $15 for comparable closed-model APIs. You pick the hardware tier per request, you pay only for the tokens you use, and there is no subscription floor.

See the full tier matrix

For operators

Run an open-source inference server with three additions: hardware attestation, deterministic kernels, and the receipt sidecar. You are paid per completed job at oracle spot, denominated against the dollar, with most of each payout auto-swapped to a stablecoin. Pay tracks verified work, so honest operators are not diluted by emissions handed out for nothing.

Start mining

Why the token holds up

Customers pay in stablecoins, and the gateway burns OROG to settle each job. New issuance is minted only against work that validators have verified, so issuance tracks real inference revenue rather than a fixed inflation schedule. There is no foundation mint discretion and no halving.

Read the tokenomics

Sourced from prior networks

What we learned from the networks before us.

Orogen's design comes from studying what prior networks got right and what they left on the table.

Bittensor SN3 showed that a global operator base already exists, with more than 70 peers across three continents running on commodity links.

Bittensor SN3 public dashboards

Phala showed that TEE attestation works at scale, routing more than a billion tokens a day for OpenRouter clients.

Phala public dashboards

Akash showed that burn-and-mint can move issuance toward real revenue instead of a fixed inflation schedule.

Akash community reports

Orogen anchors all three to verified physical work. Like the orogeny that slowly builds a mountain range under pressure, durable infrastructure is the product of accumulated verified work rather than announcements.

Build on it, or get paid to run it.

Start building against the live gateway with the OpenAI client you already have, or run a node and get paid in dollar terms for verified work.

Open Playground Start mining

Open-model inference you can audit.