Thunderbit Monitor — Workflow & LLM Observability Dashboard

Monitor

LLM Calls

Input Tokens

Output Tokens

Est. Cost

LLM Calls TrendiHourly count of LLM API calls.

loading

Token Usage TrendiHourly token usage broken down by input (user_prompt), output (response), and system (system_prompt).

loading

Est. Cost Trend ($)iHourly estimated LLM cost. Rough rates: ~$1/1M input tokens, ~$5/1M output tokens.

loading

Est. Cost by Model ($)iEstimated cost per model. Rough rates: ~$1/1M input tokens, ~$5/1M output tokens.

loading

Model Calls Over TimeiHourly call count broken down by provider and model.

loading

Avg Latency by Model (ms)iHourly average response latency per model group. Only successful calls.

loading

Model LatencyiAvg TTFT = average time to first token (response_start_latency). Avg Total = average full response time. Only successful calls.

loading

Model Success / Error RateiSuccess and error counts per model. Error Rate = error count / total calls per model.

loading

Usage by FeatureiDistribution of LLM calls grouped by feature field. Top 10 features.

loading

Error TrendiHourly count of failed LLM calls over time.

loading

Token Usage by ModeliTotal token usage (user_prompt + response) broken down by model. Top 10 models.

loading

Model Call DistributioniPie chart showing the share of LLM calls per model. Top 10 models.

loading