---
title: Pricing | Lightcone
description: Simple, usage-based pricing for compute and chat completions.
---

## Chat Completions

Pay per token for chat completions.

| Model                       | Input             | Output            |
| --------------------------- | ----------------- | ----------------- |
| `tzafon.sm-1`               | $0.20 / 1M tokens | $0.30 / 1M tokens |
| `tzafon.northstar-cua-fast` | $0.30 / 1M tokens | $0.50 / 1M tokens |

See [Chat Completions](/guides/chat-completions/index.md) for usage details and code examples.
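
As a rough illustration of how the per-token rates translate into a request cost, here is a minimal sketch. The rates are copied from the table above; the `chat_cost` helper and the token counts are hypothetical and not part of any SDK.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the pricing table.
RATES = {
    "tzafon.sm-1": (Decimal("0.20"), Decimal("0.30")),
    "tzafon.northstar-cua-fast": (Decimal("0.30"), Decimal("0.50")),
}


def chat_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    input_rate, output_rate = RATES[model]
    million = Decimal(1_000_000)
    return (input_tokens * input_rate + output_tokens * output_rate) / million


# Example: 12,000 input tokens and 3,000 output tokens on tzafon.sm-1.
print(chat_cost("tzafon.sm-1", 12_000, 3_000))
```

`Decimal` is used here only to avoid floating-point rounding noise in small dollar amounts.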

## Compute

Pay only for what you use, billed per second.

| Resource | Rate              |
| -------- | ----------------- |
| CPU      | $0.38 / core-hour |
| Memory   | $0.095 / GiB-hour |

A typical browser workload costs roughly **$0.075 / hour**.

### How billing works

- **CPU** — based on actual core usage (2 cores at 100% = 2 core-seconds per second)
- **Memory** — based on actual memory used, not allocated
- **Granularity** — per-second billing, minimum 0.125 cores per container
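
The rules above can be sketched as a small estimator. This is an illustration, not the actual metering logic: it assumes usage is sampled once per second and that the 0.125-core minimum acts as a floor on billed CPU for each second. The `compute_cost` function and its input format are hypothetical.

```python
CPU_RATE_PER_CORE_HOUR = 0.38     # USD, from the rate table
MEMORY_RATE_PER_GIB_HOUR = 0.095  # USD, from the rate table
MIN_CORES = 0.125                 # minimum billed cores per container
SECONDS_PER_HOUR = 3600


def compute_cost(samples):
    """Estimate USD cost from per-second (cores_used, gib_used) samples.

    Assumption: the CPU minimum is applied as a per-second floor.
    """
    core_seconds = 0.0
    gib_seconds = 0.0
    for cores, gib in samples:
        core_seconds += max(cores, MIN_CORES)  # apply the CPU floor
        gib_seconds += gib                     # memory actually used
    hourly_units = (
        core_seconds * CPU_RATE_PER_CORE_HOUR
        + gib_seconds * MEMORY_RATE_PER_GIB_HOUR
    )
    return hourly_units / SECONDS_PER_HOUR


# One hour at 2 fully used cores and 4 GiB of memory.
busy_hour = [(2.0, 4.0)] * SECONDS_PER_HOUR
print(round(compute_cost(busy_hour), 4))
```

Under these assumptions, an idle container is still billed for 0.125 cores of CPU, plus whatever memory it is using.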

To stop billing, call `terminate()` when you’re done, or use a context manager so the instance is terminated automatically.
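
The context-manager pattern can be sketched generically as follows. Only the `terminate()` method name comes from the text above; `StubInstance` and `auto_terminate` are hypothetical stand-ins used to keep the example self-contained, and the real SDK may provide its own context manager.

```python
from contextlib import contextmanager


class StubInstance:
    """Hypothetical stand-in for an instance object with terminate()."""

    def __init__(self):
        self.terminated = False

    def terminate(self):
        self.terminated = True


@contextmanager
def auto_terminate(instance):
    """Call instance.terminate() on exit, even if the body raises."""
    try:
        yield instance
    finally:
        instance.terminate()


instance = StubInstance()
try:
    with auto_terminate(instance):
        raise RuntimeError("workload failed")  # simulate a crash mid-run
except RuntimeError:
    pass

print(instance.terminated)
```

The `finally` clause is what matters: termination runs whether the workload finishes normally or raises, so a crash does not leave a billed instance running.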

## FAQ

**What if I forget to terminate?** Instances automatically terminate after a configurable inactivity period to prevent unexpected charges.

**Are there setup fees?** No setup fees, no monthly minimums. Pay only for what you use.

**Are there volume discounts?** Contact <support@tzafon.ai> for high-volume pricing.