The Insights page provides organization administrators with detailed, real-time observability into LLM usage, token consumption, and financial spend. It helps you monitor costs, track performance, and drill down into usage at the workspace, agent, provider, and model level.

Accessing the dashboard

To open the Insights dashboard:
  1. Open your organization dashboard.
  2. In the left navigation sidebar, click Insights (above Settings).
Access to the Insights tab is restricted to users with the Admin role in the organization.

KPI strip

At the top of the dashboard, the KPI strip displays high-level metrics for the selected time window:
  • Spend: Total estimated spend in USD, based on standard list prices.
  • Tokens: Total token volume (Input + Output + Cache Read + Cache Write).
  • Sessions: Distinct user/agent sessions that recorded usage in the selected period.
  • Cache Rate: The percentage of prompt tokens served from cache (e.g., Anthropic Prompt Caching or OpenAI Cached Input).
  • Output Ratio: Output tokens as a share of total input + output tokens.
  • VS Previous Period: Percent change in spend compared to the previous equal-length time window (e.g., comparing the last 30 days to the 30 days before that). If no baseline data exists, this displays “No prior data”.
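The KPI definitions above can be sketched as simple arithmetic. This is an illustrative sketch, not the product's implementation: the `TokenCounts` type and function names are hypothetical, and the Cache Rate denominator (input + cache read + cache write, treated as disjoint counts) is an assumption, since the page does not spell out how "prompt tokens" are counted.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TokenCounts:
    """Hypothetical container for the four token types named in the KPI strip."""
    input: int = 0
    output: int = 0
    cache_read: int = 0
    cache_write: int = 0


def total_tokens(t: TokenCounts) -> int:
    # Tokens KPI: Input + Output + Cache Read + Cache Write.
    return t.input + t.output + t.cache_read + t.cache_write


def cache_rate(t: TokenCounts) -> Optional[float]:
    # ASSUMPTION: prompt tokens = input + cache_read + cache_write (disjoint counts).
    prompt = t.input + t.cache_read + t.cache_write
    return 100.0 * t.cache_read / prompt if prompt else None


def output_ratio(t: TokenCounts) -> Optional[float]:
    # Output tokens as a share (0..1) of input + output tokens.
    denom = t.input + t.output
    return t.output / denom if denom else None


def vs_previous_period(current_spend: float, previous_spend: Optional[float]) -> str:
    # With no baseline (missing or zero prior spend) there is nothing to compare against.
    if not previous_spend:
        return "No prior data"
    change = 100.0 * (current_spend - previous_spend) / previous_spend
    return f"{change:+.1f}%"
```

For example, a period costing 120 USD that follows a period costing 100 USD would read as a 20% increase under this formula.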

Usage over time

The Usage Over Time chart displays daily metrics in an interactive area chart.
  • You can toggle between Tokens (daily token volume) and Spend (daily USD spend) using the segmented control.
  • Hovering over any point on the chart displays a tooltip with the formatted daily value and date.
You can narrow down the metrics shown across the entire page using the filters in the header:
  • Time Window: Choose between 7d (last 7 days), 30d (last 30 days), or 90d (last 90 days).
  • Provider: Filter metrics to a specific LLM provider (e.g., Anthropic, OpenAI, Gemini).
  • Model: Filter metrics to a specific model (e.g., claude-sonnet-4-6, gpt-4o).
Active filters are displayed as removable chips below the header. Click the X on a chip to clear that filter, or click “Clear filters” to remove all active filters.
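To make the filter semantics concrete, here is a minimal sketch of how a time window and its comparison baseline might be computed, and how the Provider and Model filters could combine. The record fields (`day`, `provider`, `model`) and function names are hypothetical, and treating "last 7 days" as including today is an assumption.

```python
from datetime import date, timedelta
from typing import Optional, Tuple

# Window labels from the Time Window filter, mapped to a number of days.
WINDOW_DAYS = {"7d": 7, "30d": 30, "90d": 90}


def window_bounds(window: str, today: date) -> Tuple[date, date]:
    # ASSUMPTION: the window is inclusive and ends today.
    days = WINDOW_DAYS[window]
    return today - timedelta(days=days - 1), today


def previous_bounds(window: str, today: date) -> Tuple[date, date]:
    # The equal-length window immediately before the current one.
    days = WINDOW_DAYS[window]
    start, _ = window_bounds(window, today)
    prev_end = start - timedelta(days=1)
    return prev_end - timedelta(days=days - 1), prev_end


def matches(record: dict, start: date, end: date,
            provider: Optional[str] = None, model: Optional[str] = None) -> bool:
    # A filter left as None is inactive, like a cleared chip.
    return (start <= record["day"] <= end
            and (provider is None or record["provider"] == provider)
            and (model is None or record["model"] == model))
```

In this sketch, clearing a chip corresponds to passing `None` for that filter, and “Clear filters” corresponds to passing `None` for both.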

Usage breakdown

Below the chart, the breakdown panel splits organization spend and token counts across four tabs:

Workspaces

Shows spend, token count, and session count for each active workspace. Clicking on any workspace row drills down into its specific usage details.

Agents

Shows spend, token count, and session count attributed to specific versioned agents. Clicking on an agent row drills down into its specific usage details.
Usage from tool tasks or direct assistant tasks without a committed agent version is omitted from the agent breakdown but is captured in the organization/workspace totals.
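One consequence of this rule is that the agent rows need not sum to the organization total. The following sketch illustrates the idea under assumed names: each usage record is a dict with a `spend` value and an optional `agent_version` key, both hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def agent_breakdown(records: List[dict]) -> Tuple[Dict[str, float], float]:
    """Return (spend per versioned agent, total spend across all records)."""
    by_agent: Dict[str, float] = defaultdict(float)
    total = 0.0
    for r in records:
        # Every record counts toward the organization/workspace total.
        total += r["spend"]
        # Only records tied to a committed agent version appear in the breakdown.
        agent = r.get("agent_version")
        if agent is not None:
            by_agent[agent] += r["spend"]
    return dict(by_agent), total
```

Any gap between the sum of the agent rows and the total is usage that was not attributed to a versioned agent.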

Providers

Shows spend and total tokens grouped by inference provider.

Models

Shows spend and token counts grouped by LLM model name.
  • Rates & spend tooltip: Each model row features an information (i) icon next to its Spend value. Hovering over this icon displays the per-million-token list prices (Input, Output, Cache Read, Cache Write) and the exact calculated spend contribution for each token type.
  • Estimated pricing indicator: Models that Guild priced using the default list price display a warning indicator next to the model name. Hover over the indicator to see a tooltip, or expand the row to read the estimate note in the rate panel.
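The per-token-type spend contribution shown in the tooltip follows from per-million-token pricing: tokens divided by one million, multiplied by the rate. The sketch below assumes this straightforward formula. The rates in `EXAMPLE_RATES` are made-up placeholder values, not actual list prices for any model, and the function name is hypothetical.

```python
from typing import Dict

# HYPOTHETICAL rates in USD per million tokens, for illustration only.
EXAMPLE_RATES = {
    "input": 3.00,
    "output": 15.00,
    "cache_read": 0.30,
    "cache_write": 3.75,
}


def spend_breakdown(tokens: Dict[str, int], rates: Dict[str, float]) -> Dict[str, float]:
    # Contribution of each token type: (tokens / 1,000,000) * per-million rate.
    return {kind: tokens.get(kind, 0) / 1_000_000 * rate for kind, rate in rates.items()}


usage = {"input": 2_000_000, "output": 500_000, "cache_read": 1_000_000, "cache_write": 0}
contributions = spend_breakdown(usage, EXAMPLE_RATES)
estimated_spend = sum(contributions.values())
```

With these placeholder numbers, output tokens contribute the most to spend despite being the smallest non-zero volume, which is the kind of detail the tooltip is meant to surface.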

Drilling down (workspace and agent scopes)

Clicking on a row in the Workspaces or Agents breakdown navigates to a dedicated drill-in view:
  • The breadcrumb navigation changes to Usage / Workspace · <Name> or Usage / Agent · <Name>.
  • A scope pill below the header indicates the current drill-in filter.
  • The KPI cards, chart, and remaining breakdown tabs are automatically scoped to represent only the selected workspace or agent.