// what I verify immediately
→Is it current? Confirm timestamps and whether telemetry is stale, delayed, or duplicated
→Is it real outside the originating console? Validate with at least one independent signal
→What's the blast radius? One device vs many, one site vs global, one workflow vs multiple
→What's the failure mode? Down vs degraded, constant vs intermittent, user-facing vs backend
// obvious checks that prevent wasted work
→Resource saturation: disk, memory, CPU spikes, storage I/O queues, scheduled task overlap
→Service state: stopped services, failed startups, cert issues, DNS, auth and trust failures
→Known windows: backups, patching, scans, automation jobs, and scheduled reboots
→Workflow validation: confirm access path, identity, device, and step where failure occurs
exit criteria
Boxed problem statement exists · Scope and time window recorded with FKG + FKB · If a change is required: low-risk, reversible, clear verification. Otherwise → Mid Pass.