Skip to content

Playbook: Quick Triage

When to use: First step for any customer-reported issue. Establishes context before diving into specific playbooks.

  • Operator access token (ps_at_ prefix) — get one from an operator user
  • MCP client connected (Claude Code, Cursor, or direct API)
  • Verify connectivity: run whoami
MCP: lookup_tenant
query: "<customer name, email, or domain>"

Note the organization_id, workspace_id, and cluster_name from the response.

MCP: remote_debug
workspace_id: "<workspace_id>"
command: "agent_health"

Healthy response: status: connected, recent heartbeat, capability list present.

Unhealthy indicators:

  • status: disconnected — agent is down or network issue
  • Stale heartbeat (>60s) — agent may be stuck
  • Empty capabilities — agent version too old
MCP: get_system_health

Look for degraded components that could explain the customer’s issue.

MCP: query_audit_logs
workspace_id: "<workspace_id>"
limit: 20

Check for unusual patterns — repeated errors, failed operations, config changes.

FindingNext Playbook
Agent disconnectedAgent Connectivity
Analysis jobs failingAnalysis Failures
All systems healthy but customer sees issuesCheck frontend logs, ask for screenshots

If the issue cannot be identified via MCP tools, escalate with:

  • Organization ID and workspace ID
  • Output from steps 1-4
  • Customer’s exact error message or screenshot