managed runtime
The runtime, operated.
Same runtime as the open-source Mercury — patched, scaled, and operated by us. Bring your own tools, your own model providers, your own code. We handle the plumbing.
cosmic stack cloud
Run agent fleets in production without operating the runtime yourself. Cosmic Stack Cloud is the same open-source runtime that powers Mercury — managed, scaled, and observable, on infrastructure designed for long-horizon tool use.
what cloud is
managed runtime
Same runtime as the open-source Mercury — patched, scaled, and operated by us. Bring your own tools, your own model providers, your own code. We handle the plumbing.
long-horizon scheduling
Agent jobs are not request/response. A single task can fan out across dozens of tool calls over minutes or hours. Cloud is built around that shape — checkpointed state, resumable on preemption, billed by useful work done, not wallclock.
observability built-in
Every tool call, every model call, every retry, every cost — instrumented as a single trace per task. Search by failure mode. Replay any run. Diff two model versions on the same input. No third-party APM required.
byo provider
Bring your OpenAI, Anthropic, Bedrock, Vertex, or self-hosted endpoints. We route, we fail over, we cache — we never proxy your tokens through our account or buy on your behalf.
compliance
SOC 2 Type II audit underway. HIPAA BAA available for healthcare customers. Choose data residency per workspace: US, EU, or APAC. Audit logs exported to your SIEM.
pricing
Per successful task, not per CPU-minute. Failed runs that didn't accomplish anything aren't billed. Pricing scales sublinearly with volume — large customers get genuinely lower unit costs, not 'enterprise pricing' theatre.
platform
| Regions | US · EU · APAC | Data residency per workspace |
|---|---|---|
| Uptime SLO | 99.95% | Tracked publicly at status.cosmicstack.ai |
| Compliance | SOC 2 · HIPAA | Type II audit in progress |
| Egress | Zero markup | Cloudflare-backed, pass-through pricing |
early access
Tell us about the workload. The more concrete you are about what you're building and what's currently in your way, the faster we'll get back to you.
faq