zlay: add zlay-probe tool + relaxed liveness for cold-start
wraps the port-forward + curl + metrics-parsing dance behind
`just zlay probe {health|delivery|metrics|delta|sweep}` so the
operator diagnostic path is reproducible across sessions. the
hydrant smoke test recipe also gained a full-network flag, signature
verification, and PASS/FAIL stats parsing.
liveness/readiness probes in zlay-values.yaml are relaxed
(initialDelay 300s, timeout 15s, failureThreshold 20) to survive
the ~20min cold-start when the PDS subscriber spawn loop contends
with HTTP fibers. see docs/zlay-external-review-2026-04-09.md for
the full context; tighten again once the spawn/resolver path is
fixed.
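
a rough sketch of the restart budget these probe values imply, assuming
the usual Kubernetes approximation that a container is restarted after
about initialDelaySeconds + failureThreshold * periodSeconds of
back-to-back probe failures. periodSeconds is not stated in this commit,
so the values tried below are assumptions, and `probe_budget_seconds` is
a hypothetical helper, not part of zlay-probe:

```python
def probe_budget_seconds(initial_delay: int, period: int, failure_threshold: int) -> int:
    """Approximate seconds from container start until the kubelet gives up,
    assuming every probe attempt fails consecutively."""
    return initial_delay + period * failure_threshold


if __name__ == "__main__":
    # initialDelay (300s) and failureThreshold (20) come from this commit;
    # the periodSeconds candidates are assumptions for illustration only.
    for period in (10, 30, 45):
        budget = probe_budget_seconds(300, period, 20)
        print(f"periodSeconds={period}: ~{budget}s ({budget / 60:.1f} min)")
```

this is the worst case only: a single passing probe resets the
consecutive-failure count, so intermittent successes during cold-start
stretch the effective window further.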
.claude/skills/zlay-diagnose documents when to reach for each
probe subcommand. .claude/settings.local.json is gitignored since
it's a per-host permissions file.
Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com>