Usage & Billing
Every interaction with the Data Science Agent consumes AI tokens. Usage is tracked automatically and visible in your account settings.
How Tokens Work
Each conversation with the agent uses tokens in two directions:
- Input tokens — Your messages, system instructions, tool schemas, and tool results sent to the AI model
- Output tokens — The AI’s responses, SQL queries, Python code, and tool calls
A typical interaction (browse data + run a query) uses approximately 3,000-5,000 tokens in total, input and output combined.
Pricing
Costs depend on which AI model you use. Gemini is available by default; OpenAI and Anthropic require your own API key (BYOK).
| Model | Input (per 1M tokens) | Output (per 1M tokens) | Typical Query Cost |
|---|---|---|---|
| Gemini 2.5 Flash | $0.15/M | $0.60/M | ~$0.001 |
| GPT-4o Mini | $0.15/M | $0.60/M | ~$0.001 |
| GPT-4o | $2.50/M | $10.00/M | ~$0.01 |
| Claude Haiku 3.5 | $0.80/M | $4.00/M | ~$0.003 |
| Claude Sonnet 4 | $3.00/M | $15.00/M | ~$0.015 |
Example costs (Gemini 2.5 Flash):
- Exploring your datasets: ~$0.001
- Running a data query: ~$0.001-0.002
- Building a full model (browse + generate + preview): ~$0.002-0.005
- A full day of heavy usage (50 interactions): ~$0.05-0.10
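The per-interaction figures follow from the per-million-token prices in the table. The sketch below shows the arithmetic; the function name `estimate_cost` and the 4,000 input / 1,000 output token split are illustrative assumptions, since the actual split varies by conversation.

```python
# Prices in USD per million tokens (input, output), copied from the pricing table.
PRICES_PER_MILLION = {
    "Gemini 2.5 Flash": (0.15, 0.60),
    "GPT-4o Mini": (0.15, 0.60),
    "GPT-4o": (2.50, 10.00),
    "Claude Haiku 3.5": (0.80, 4.00),
    "Claude Sonnet 4": (3.00, 15.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one interaction (hypothetical helper)."""
    input_price, output_price = PRICES_PER_MILLION[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Assumed split for one interaction: 4,000 input + 1,000 output tokens.
per_interaction = estimate_cost("Gemini 2.5 Flash", 4_000, 1_000)

# 50 interactions, matching the heavy-usage example.
per_day = 50 * per_interaction

print(f"per interaction: ${per_interaction:.4f}, per day: ${per_day:.2f}")
```

Input tokens usually dominate the count because system instructions, tool schemas, and tool results are resent to the model, so the input price tends to matter more than the output price for total cost.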
Bring Your Own Key (BYOK)
To use OpenAI or Anthropic models, add your API key in Settings > AI Keys:
- Navigate to Settings > AI Keys
- Find the provider card (OpenAI or Anthropic)
- Paste your API key
- Click Test to validate that the key works
- Click Save Key
Once configured, the model selector dropdown in the agent header will show all available models for that provider. You can update or remove keys at any time.
When using BYOK models, you are billed directly by the provider — Vendo does not add any markup. Token usage is still tracked in Usage Metrics regardless of which model you use.
Exporting Data
Query results can be exported as CSV files:
- Run a query in the agent
- The results table appears in the right panel
- Click the CSV button in the table header
- The file downloads with the query title as the filename
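Once downloaded, an exported file can be read with standard tooling. This is a minimal sketch using Python's built-in `csv` module. It assumes the export is UTF-8 encoded with a header row, which is not confirmed here; the file name `monthly_revenue.csv` and its columns are made up to stand in for a real export.

```python
import csv
import tempfile
from pathlib import Path


def load_export(path: Path) -> list[dict[str, str]]:
    """Read a CSV file into a list of row dictionaries keyed by column header."""
    with path.open(newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))


with tempfile.TemporaryDirectory() as tmp:
    # Hypothetical stand-in for a file exported from the results table.
    sample = Path(tmp) / "monthly_revenue.csv"
    sample.write_text("month,revenue\n2024-01,1200\n2024-02,1350\n", encoding="utf-8")

    rows = load_export(sample)

print(len(rows), "rows; columns:", list(rows[0].keys()))
```

All values are read as strings, so numeric columns need an explicit conversion (for example `int(row["revenue"])`) before any arithmetic.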
Viewing Usage
Navigate to Settings > Usage Metrics to see:
Data Section
- Total storage across your BigQuery datasets
- Rows synced
- Per-dataset breakdown (source vs production data)
Compute Section
- Pipeline read/write operations
- Model compute costs
- Total compute usage
AI Agent Section
- Total Tokens — Input and output token counts
- Estimated Cost — Based on Gemini pricing
- Sessions — Number of chat interactions
- Tool Calls — Data queries and actions executed
- Usage by Model — Breakdown by AI model with token counts and costs
What Counts as an Interaction
Each time you send a message and the agent completes its response (including all tool calls), one interaction is logged. A single interaction may include multiple tool calls (browsing datasets, inspecting schemas, running queries) — these are all part of one interaction.
Limits
- Tool calls per interaction — The agent is limited to 15 steps per response
- Query results — Limited to 500 rows per query
- Query timeout — 30 seconds per query execution
- Table previews — Limited to 10 sample rows
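When post-processing results in your own scripts, it can help to keep these limits in one place. The sketch below records them as constants and adds a simple heuristic for spotting a result that may have been cut off at the row limit. The constant names and the `possibly_truncated` helper are illustrative assumptions, and whether the agent itself signals truncation is not stated here.

```python
# Limits as listed above; the names are hypothetical.
MAX_STEPS_PER_RESPONSE = 15
MAX_ROWS_PER_QUERY = 500
QUERY_TIMEOUT_SECONDS = 30
PREVIEW_SAMPLE_ROWS = 10


def possibly_truncated(row_count: int) -> bool:
    """Heuristic: a result that reaches the row limit may be missing further rows."""
    return row_count >= MAX_ROWS_PER_QUERY


for count in (42, 500):
    status = "may be truncated" if possibly_truncated(count) else "complete"
    print(f"{count} rows: {status}")
```

If a result reaches the limit, narrowing the query with a filter or an aggregation is one way to stay under 500 rows.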