<marmijo[m]>
dustymabe: thats good! The selinux policy was updated again recently and it looks like the builds are completing before timing out now.
<dustymabe>
marmijo[m]: yeah, but let me look at something
<dustymabe>
marmijo[m]: so 39.20230220.91.0 ran and passed in 1h7m (we got lucky it didn't timeout)
<dustymabe>
39.20230222.91.0 ran and passed in 22m50s
<dustymabe>
now.. I did update the azure libraries backing these processes yesterday, so that could account for some change
<dustymabe>
but let's focus on the package set change between 39.20230220.91.0 and 39.20230222.91.0 to see if we can pick out what we think "fixed" the problem
<dustymabe>
jlebon: WDYT about adding a "releasever" denylist entry (similar to osversion, but this would make it so we don't have to add a new variable to our FCOS configs): https://github.com/coreos/coreos-assembler/pull/2934
<marmijo[m]>
dustymabe: Alright, I'll start looking through the differences
<jlebon>
dustymabe: i guess another approach is to keep it as `osversion` in the denylist, and have kola look for `osversion` first, and fallback to `releasever`. it's a bit confusing, but if we doc it, it seems ok
bgilbert has joined #fedora-coreos
<dustymabe>
jlebon: true
Betal has joined #fedora-coreos
jcajka has quit [Quit: Leaving]
rsalveti has joined #fedora-coreos
<dustymabe>
travier[m]: can you replace the `\ ` in the cut command in your PR to `' '`
<travier[m]>
dustymabe: sure
<travier[m]>
done
<dustymabe>
had one more comment for you in the PR
<baude>
has the idea of using efi variables for ignition been kicked around here?
<dustymabe>
baude: not sure, but will note EFI isn't supported on 2/4 arches we operate on
<baude>
yeah, this is for a "new" platform
<baude>
trying to figure out how to weasel the ignition file in
jpn has joined #fedora-coreos
<apollo13[m]>
baude: oh that would be interesting:)
<baude>
it is a thing evidently
<apollo13[m]>
I have no idea how to stuff that into proxmox but I guess I can figure it out. 😉
bgilbert has joined #fedora-coreos
<jlebon>
dustymabe: updated!
<dustymabe>
marmijo[m]: were you able to figure out what update fixed the azure timeout issue?
<marmijo[m]>
dustymabe: I see that libselinux was updated in 39.20230218.91.0, which was the first rawhide build to not timeout. It was still a long run though, so could have just gotten lucky. There was a kernel update between the two latest builds, but I'm not sure if anything else would have 'fixed' the issue.
<marmijo[m]>
Container-selinux and selinux-policy were also updated in 38.20230216.91.0, but that build timed out. Every process is significantly shorter after you updated the azure libraries though, could it be a combination of that and the selinux related packages update?
<dustymabe>
jlebon: replied
jpn has quit [Ping timeout: 246 seconds]
<dustymabe>
marmijo[m]: hmm. well we do know that there are SELinux denials - do the logs from the newer runs have those denials in them still?
<dustymabe>
clarification: "we do know that there are SELinux denials in the old failure case"
<marmijo[m]>
Yes, there are still SELinux denials in the logs. It's interesting because the tests are passing in a reasonable amount of time meaning they're not waiting for the reboot
<dustymabe>
marmijo[m]: interesting for sure
<dustymabe>
marmijo[m]: if the denials still exist (and you can manually reproduce the issue by starting an instance and using the web interface to try to `reboot` and it does nothing) then we still need to open a BZ for this
<marmijo[m]>
dustymabe: Sounds good. I'll test that out
bgilbert has quit [Ping timeout: 252 seconds]
<dustymabe>
marmijo[m]: in the API update I use force delete - so maybe that could be the difference