Pricing
Simple, usage-based pricing for compute and chat completions.
Chat Completions
Pay per token for chat completions.
| Model | Input | Output |
|---|---|---|
| tzafon.sm-1 | $0.20 / 1M tokens | $0.30 / 1M tokens |
| tzafon.northstar-cua-fast | $0.30 / 1M tokens | $0.50 / 1M tokens |
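As a rough illustration of how per-token pricing adds up, the sketch below estimates the cost of a single request from the rates in the table. The helper name `chat_cost_usd` and the token counts are hypothetical and used only for illustration; this is not part of any official SDK.

```python
from decimal import Decimal

# Prices in USD per 1M tokens (input, output), copied from the table above.
PRICES_PER_MILLION = {
    "tzafon.sm-1": (Decimal("0.20"), Decimal("0.30")),
    "tzafon.northstar-cua-fast": (Decimal("0.30"), Decimal("0.50")),
}


def chat_cost_usd(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request (hypothetical helper, not an official API)."""
    input_price, output_price = PRICES_PER_MILLION[model]
    million = Decimal(1_000_000)
    return (input_tokens * input_price + output_tokens * output_price) / million


# Hypothetical request: 12,000 input tokens and 3,000 output tokens on tzafon.sm-1.
print(chat_cost_usd("tzafon.sm-1", 12_000, 3_000))
```

With these example numbers the request comes to $0.0033: $0.0024 for input plus $0.0009 for output. `Decimal` is used so that small per-request amounts do not pick up floating-point rounding noise when summed over many requests.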
Compute
Pay only for what you use, billed per second.
| Resource | Rate |
|---|---|
| CPU | $0.38 / core-hour |
| Memory | $0.095 / GiB-hour |
A typical browser workload costs roughly $0.075 / hour.
How billing works
- CPU: based on actual core usage (2 cores at 100% = 2 core-seconds per second)
- Memory: based on actual memory used, not allocated
- Granularity: per-second billing, minimum 0.125 cores per container
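The rules above can be sketched as a small per-second metering calculation. This is only an approximation under stated assumptions: the function name `compute_cost_usd` and the sample workload are hypothetical, and the code assumes the 0.125-core minimum is applied as a floor to each second's CPU usage, which the text does not spell out.

```python
CPU_USD_PER_CORE_HOUR = 0.38     # from the rate table
MEMORY_USD_PER_GIB_HOUR = 0.095  # from the rate table
MIN_CORES = 0.125                # per-container minimum
SECONDS_PER_HOUR = 3600


def compute_cost_usd(samples):
    """Sum the cost of per-second (cores_used, gib_used) samples for one container.

    Assumption: the 0.125-core minimum acts as a floor on each second's CPU usage.
    """
    core_seconds = sum(max(cores, MIN_CORES) for cores, _ in samples)
    gib_seconds = sum(gib for _, gib in samples)
    return (
        core_seconds * CPU_USD_PER_CORE_HOUR
        + gib_seconds * MEMORY_USD_PER_GIB_HOUR
    ) / SECONDS_PER_HOUR


# Hypothetical hour: 10 minutes busy (2 cores, 1 GiB used),
# then 50 minutes near idle (0.05 cores, 0.25 GiB used).
samples = [(2.0, 1.0)] * 600 + [(0.05, 0.25)] * 3000
print(f"${compute_cost_usd(samples):.4f}")
```

In this example the near-idle seconds are billed at 0.125 cores rather than 0.05, so the minimum, not the measured usage, sets the CPU charge for most of the hour.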
What if I forget to terminate? Instances automatically terminate after a configurable inactivity period to prevent unexpected charges.
Are there setup fees? No setup fees, no monthly minimums. Pay only for what you use.
Volume discounts? Contact support@tzafon.ai for high-volume pricing.