---
title: Pricing | Lightcone
description: Simple, usage-based pricing for compute and chat completions.
---

## Chat Completions

Pay per token for chat completions.

| Model                       | Input             | Output            |
| --------------------------- | ----------------- | ----------------- |
| `tzafon.sm-1`               | $0.20 / 1M tokens | $0.30 / 1M tokens |
| `tzafon.northstar-cua-fast` | $0.30 / 1M tokens | $0.50 / 1M tokens |

See [Chat Completions](/guides/chat-completions/index.md) for usage details and code examples.

## Compute

Pay only for what you use, billed per second.

| Resource | Rate              |
| -------- | ----------------- |
| CPU      | $0.38 / core-hour |
| Memory   | $0.095 / GiB-hour |

A typical browser workload runs at roughly **$0.075 / hour**.

### How billing works

- **CPU**: based on actual core usage (2 cores at 100% = 2 core-seconds per second)
- **Memory**: based on actual memory used, not allocated
- **Granularity**: per-second billing, with a minimum of 0.125 cores per container

Always call `terminate()` when you’re done, or use a context manager so the instance is terminated and billing stops automatically.

## FAQ

**What if I forget to terminate?**
Instances automatically terminate after a configurable inactivity period to prevent unexpected charges.

**Are there setup fees?**
No setup fees, no monthly minimums. Pay only for what you use.

**Are there volume discounts?**
Contact us for high-volume pricing.
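
**How can I estimate a bill?**
The rates and billing rules on this page can be turned into a rough calculator. The sketch below is illustrative only and is not part of any Lightcone SDK: the function and constant names are hypothetical, and it assumes the 0.125-core minimum acts as a floor on billed CPU and that average usage over the period is a good enough stand-in for per-second metering.

```python
# Rates copied from the tables on this page (USD).
CPU_RATE_PER_CORE_HOUR = 0.38
MEMORY_RATE_PER_GIB_HOUR = 0.095
MIN_BILLED_CORES = 0.125  # per-container minimum (assumed to be a floor on billed CPU)

# (input, output) price in USD per 1M tokens.
CHAT_RATES_PER_MILLION = {
    "tzafon.sm-1": (0.20, 0.30),
    "tzafon.northstar-cua-fast": (0.30, 0.50),
}

SECONDS_PER_HOUR = 3600


def estimate_compute_cost(avg_cores: float, avg_memory_gib: float, seconds: float) -> float:
    """Approximate compute cost in USD for one container.

    Hypothetical helper: uses average usage over the period rather than
    true per-second metering, so the result is an estimate only.
    """
    billed_cores = max(avg_cores, MIN_BILLED_CORES)
    core_hours = billed_cores * seconds / SECONDS_PER_HOUR
    gib_hours = avg_memory_gib * seconds / SECONDS_PER_HOUR
    return core_hours * CPU_RATE_PER_CORE_HOUR + gib_hours * MEMORY_RATE_PER_GIB_HOUR


def estimate_chat_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Approximate chat completion cost in USD for the given token counts."""
    input_rate, output_rate = CHAT_RATES_PER_MILLION[model]
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


if __name__ == "__main__":
    # 2 fully busy cores and 4 GiB of memory for 10 minutes: about $0.19.
    print(round(estimate_compute_cost(2, 4, 600), 4))
    # 1M input tokens and 1M output tokens on tzafon.sm-1: about $0.50.
    print(round(estimate_chat_cost("tzafon.sm-1", 1_000_000, 1_000_000), 4))
```

Treat the output as a planning figure; the actual invoice depends on metered usage.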