diff --git a/apps/kubevirt-vms/ci1.yaml b/apps/kubevirt-vms/ci1.yaml index 3fa25a1..afad22a 100644 --- a/apps/kubevirt-vms/ci1.yaml +++ b/apps/kubevirt-vms/ci1.yaml @@ -307,6 +307,26 @@ spec: # Running and RunStrategy are mutually exclusive. # `Always` keeps a VMI running and restarts it if it crashes/exits — same # semantics as the old `running: true`. + # + # **2026-05-08 status: VM cannot start due to a stale QEMU flock on the + # rootdisk PVC** (qemu reports `Failed to get "write" lock` on + # `/var/run/kubevirt-private/vmi-disks/rootdisk/disk.img`). The flock was + # left by a previous QEMU process during a force-deleted launcher pod + # cycle. Recovery requires either (a) a Longhorn engine restart on + # rke2-agent2, (b) a Longhorn volume detach via the longhorn-manager API + # (kubectl patch on `volume.longhorn.io/` does not work — the + # spec.nodeID is reconciled back), or (c) a node reboot of rke2-agent2. + # + # **Confirmed working:** the bootOrder swap (windows-iso=1, rootdisk=2) + # and the runStrategy migration (above). The ISO PVC was successfully + # repopulated via virtctl image-upload pvc on the Filesystem-mode PVC. + # + # **Open: SATA CDROM read timeout** — even with bootOrder=1, OVMF reported + # `BdsDxe: failed to start Boot0001 ... Time out` reading the SATA CDROM + # backed by the Filesystem-mode PVC. A switch to Block-mode DataVolume + # was attempted but blocked by a CDI v1.65.0 upload-pod permission issue + # (capability drop prevents writing to the underlying block device). + # See header docstring on the ISO PVC. runStrategy: Always # LIVE — ISO uploaded 2026-05-08, password in 1P template: metadata: