Documenting the remaining 2 unresolved issues for the next operator session, with the recovery paths from this session captured inline so the next agent doesn't repeat the same blind alleys: 1. **rootdisk QEMU flock** — every new launcher pod fails QEMU start with `Failed to get "write" lock` on the rootdisk Filesystem-mode disk.img. Stale flock from a previous force-deleted virt-launcher pod. Longhorn engine on rke2-agent2 needs to release the lock; `kubectl patch volume.longhorn.io/<pvc-name> spec.nodeID=""` is reverted by the Longhorn controller. Operator-level recovery only. 2. **SATA CDROM read timeout** — even with bootOrder=1 (windows-iso first), OVMF UEFI fails Boot0001 with "Time out" reading the SATA CDROM backed by the Filesystem-mode PVC. Block-mode DataVolume migration was attempted but blocked by CDI v1.65.0's upload pod running with `capabilities.drop: [ALL]` and `runAsUser: 107`, preventing direct block-device writes (`blockdev: cannot open /dev/cdi-block-volume: Permission denied`). See ISO PVC header docstring for 3 forward paths. Net commits during this session: -1c4145a: bootOrder swap (windows-iso=1, rootdisk=2) -87a7d7c: deprecated `running:` -> `runStrategy: Always` -0bf47df: ISO migration to Block-mode DataVolume (REVERTED) -9f6dc1a: revert to Filesystem PVC (CDI block-upload blocked) -1c4145a+87a7d7c+9f6dc1aare the live, correct configuration. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
19 KiB
19 KiB