Scrydon
DeploymentOperations

Operations

Day-2 runbooks for operating Scrydon — backup, restore, migrations, license rotation, observability, SIEM forwarding, supply-chain verification, and upgrades.

Day-1 covers getting Scrydon installed. Day-2 covers operating it — backup, restore, migrations, license rotation, observability, and the runbooks your on-call team needs.

Day-2 expectations

For a steady-state Scrydon deployment, plan on:

ActivityCadence
License heartbeat health checkDaily (automated)
Audit log review (focus events)Weekly
Backup verification (restore-from-yesterday drill)Monthly
Disaster recovery exerciseQuarterly
Minor version upgradeQuarterly
Major version upgradeAnnually
Vulnerability scan + patch cycleContinuous

What requires planned downtime

Most operations are non-disruptive. The exceptions:

  • Major version upgrades that touch the OLAP warehouse (StarRocks) typically require a brief read-only window while indexes rebuild.
  • Encryption-strategy changes (LOCAL → BYOK → HYOK) require re-encryption of secrets in place — typically a few minutes for normal-sized vaults.
  • Replacing PostgreSQL with a managed instance requires a one-time migration window.

Each is covered in the relevant runbook with the expected duration.

Where to look if something's wrong

  1. Check the audit log first — most failures show up there with a structured *_FAILED event.
  2. Check the platform metrics dashboard for the affected subsystem.
  3. Check the relevant subsystem's logs (workflow runtime, analytics, ontology, copilot).
  4. If the issue isn't visible in any of the above, contact Scrydon support with the audit log and dashboard screenshots.
On this page

On this page