dustymabe changed the topic of #fedora-coreos to: Fedora CoreOS :: Find out more at https://getfedora.org/coreos/ :: Logs at https://libera.irclog.whitequark.org/fedora-coreos
jdirte has joined #fedora-coreos
jdirte has left #fedora-coreos [#fedora-coreos]
Betal has joined #fedora-coreos
plarsen has quit [Remote host closed the connection]
jlebon has quit [Ping timeout: 248 seconds]
nb has quit [Ping timeout: 268 seconds]
gursewak has quit [Ping timeout: 265 seconds]
travisghansen has quit [Ping timeout: 250 seconds]
travisghansen has joined #fedora-coreos
paragan has joined #fedora-coreos
smitterl has joined #fedora-coreos
sentenza has quit [Remote host closed the connection]
Betal has quit [Quit: WeeChat 3.8]
saschagrunert has joined #fedora-coreos
jcajka has joined #fedora-coreos
apiaseck has joined #fedora-coreos
nb has joined #fedora-coreos
jpn has joined #fedora-coreos
darknao has quit [Quit: WeeChat 3.6]
darknao has joined #fedora-coreos
jpn has quit [Ping timeout: 240 seconds]
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 268 seconds]
jpn has joined #fedora-coreos
vgoyal has joined #fedora-coreos
dustymabe has quit [Quit: WeeChat 3.6]
jlebon has joined #fedora-coreos
smitterl has quit [Ping timeout: 268 seconds]
nalind has joined #fedora-coreos
mheon has joined #fedora-coreos
dustymabe has joined #fedora-coreos
<dustymabe> marmijo[m]: mind holding off on progressing the FCOS releases while I investigate something
<jlebon> o/
<dustymabe> after I get out of this meeting I'm going to try to look into https://github.com/coreos/fedora-coreos-tracker/issues/1455
<marmijo[m]> dustymabe: yeah I’ll hold off for a while. Let me know when I can continue
plarsen has joined #fedora-coreos
jcajka has quit [Quit: Leaving]
cyberpear has joined #fedora-coreos
smitterl has joined #fedora-coreos
gursewak has joined #fedora-coreos
Betal has joined #fedora-coreos
smitterl has quit [Remote host closed the connection]
saschagrunert has quit [Remote host closed the connection]
ravanell_ has joined #fedora-coreos
ravanelli has quit [Ping timeout: 260 seconds]
jpn has quit [Quit: Lost terminal]
jdirte has joined #fedora-coreos
jdirte has left #fedora-coreos ["later"]
paragan has quit [Quit: Leaving]
<dustymabe> jlebon: did you have a chance to look into https://github.com/coreos/fedora-coreos-tracker/issues/1455 ?
<dustymabe> from what I've read I'm thinking we just go forward with our current releases 4.4.4 will be in the next round
ravanell_ has quit [Remote host closed the connection]
ravanelli has joined #fedora-coreos
<dustymabe> marmijo[m]: I think we can move forward with the releases (unless jlebon comes to a different conclusion than I did in ^^)
* dustymabe goes for food
<marmijo[m]> dustymabe: sounds good to me
<jlebon> dustymabe, marmijo[m]: agreed
<dustymabe> jlebon: so I'm digging into this secureboot upgrade test failure
<dustymabe> seems like instances get to `37.20230322.2.0` and then hit a kernel panic
<dustymabe> jlebon: interestingly enough the failure looks like the one in https://github.com/coreos/fedora-coreos-tracker/issues/1382
* dustymabe bumps memory and re-runs in a local test
<jlebon> dustymabe: that's odd. yeah, worth trying for more info. though i don't know why it would require it for an upgrade but not in CI when we initially shipped this
<jlebon> unless old GRUB used memory less efficiently somehow?
<dustymabe> for #1382 I don't think we ever did anything (i think the problem just went away)
<dustymabe> yeah, it appears in my local test to be getting past the first upgrade now
<dustymabe> jlebon: maybe when there are two entries in grub it uses more memory?
<jlebon> ok fun. so possibly a memory bug in older grub + newer fatter kernel & initrd
<dustymabe> jlebon: I don't even think we can call it older grub
<dustymabe> in my local test it starts from `37.20221225.2.1`
<jlebon> ohhh, this makes me realize we didn't run the regular upgrade test in !bios
<jlebon> if we did, maybe we would've caught this with our 1G default
<dustymabe> maybe
<dustymabe> it's weird to me that grub would take up more than 1G of memory
<dustymabe> it's literally the first thing that happens on the system (i.e. you'd think low resources)
<dustymabe> and.. what's so special about secure boot?
<jlebon> it could be some kind of memory layout preparation thing
<jlebon> dustymabe: starting from what release does it work with the default 1G?
<dustymabe> maybe re-define your question :)
<dustymabe> they all seem to fail
<dustymabe> i.e. they upgrade until they select `37.20230322.2.0` and then once they try to boot `37.20230322.2.0` we get the
<dustymabe> error: ../../grub-core/kern/mm.c:376:out of memory.
<dustymabe> so if we start at f35 or f36 or f37
<jlebon> ahh OK. I misunderstood the context then. so if I take the latest e.g. stable and do a fake upgrade in secureboot mode with 1G, it should fail?
<dustymabe> jlebon: I *think* so
<jlebon> let me try that
<dustymabe> obviously it could be a specific update within 37 somewhere that introduced the problem
<dustymabe> but for now we can just assume all 37
<dustymabe> and see where it becomes not true
<dustymabe> jlebon: any luck?
<dustymabe> ravanelli: anything blocking https://github.com/coreos/fedora-coreos-config/pull/2336 ?
<jlebon> `kola run-upgrade --qemu-firmware uefi-secure` failed for another reason. i also couldn't reproduce it manually.
<jlebon> mind filing an issue for this with the exact output? and maybe for now we bump memory to 1.5G for secureboot
<dustymabe> "i also couldn't reproduce it manually" -> it should repro easy enough if you just boot a `37.20221225.2.1` with 1G of RAM
<dustymabe> it comes up and then does an update - on next reboot -> kernel panic
<jlebon> hmm, works here
<dustymabe> interesting
<dustymabe> what happens if you run: `cosa kola run --tag extended-upgrade --build=37.20221225.2.1 --qemu-firmware=uefi-secure` ?
<jlebon> --- PASS: ext.config.upgrade.extended (164.62s)
<jlebon> ahhh interesting
<jlebon> `dnf update` in my pet container shows there's a grub2 update available
<jlebon> well no, the EFI firmware i think comes from edk2-ovmf ?
<dustymabe> dang -did the pipeline go down?
<jlebon> dustymabe: i think at this point, we should file something and track it there to not lose the details
<ravanelli> dustymabe: I don't think so, I did some tests with rhcos/fcos and they seem ok
<jlebon> not loading here either, looking
apiaseck has quit [Ping timeout: 248 seconds]
<ravanelli> I was only waiting for the QE test part
<jlebon> and... the cluster console isn't loading either
<ravanelli> s/I merge/I can merge
<dustymabe> ravanelli: i'm good with it
<ravanelli> jlebon: cc ^
<dustymabe> opened https://github.com/coreos/fedora-coreos-tracker/issues/1456 for the grub2 OOM on secureboot issue
<dustymabe> jlebon: at the top of the web console I see: "This cluster is updating from 4.11.26 to 4.12.9"
<jlebon> ravanelli: also good with it!
<dustymabe> marmijo[m]: just try to re-run the jobs when things come back
<jlebon> dustymabe: weird, i don't see that
sentenza has joined #fedora-coreos
<dustymabe> jlebon: it's gone now
<dustymabe> it was a yellow banner at the top of the console
<jlebon> neat
<marmijo[m]> dustymabe: Will do. Things are still loading I suppose, so I'll give it a bit
<nirik> yeah, made the jump to 4.12.x :)
<dustymabe> jlebon: so.. you're telling me when you tried the `cosa kola` command I pasted you earlier it passed for you locally?
<jlebon> dustymabe: yup
<dustymabe> I created this issue for the problem: https://github.com/coreos/fedora-coreos-tracker/issues/1456
<dustymabe> you can see from that the number of scenarios that hit this problem
<jlebon> dustymabe: can you review https://github.com/coreos/fedora-coreos-config/pull/2263 when you have a chance?
<dustymabe> note that all of those are run with `--allow-rerun-success`, which means it had to fail twice in a row in order to truly fail
apiaseck has joined #fedora-coreos
travisghansen has quit [Quit: The Lounge - https://thelounge.github.io]
travisghansen has joined #fedora-coreos
gursewak has quit [Ping timeout: 246 seconds]
gursewak has joined #fedora-coreos
apiaseck has quit [Quit: Leaving]
bgilbert has joined #fedora-coreos
plarsen has quit [Quit: NullPointerException!]
plarsen has joined #fedora-coreos
plarsen has quit [Client Quit]
plarsen has joined #fedora-coreos
gursewak has quit [Read error: Connection reset by peer]
plarsen has quit [Client Quit]
plarsen has joined #fedora-coreos
troglodito has quit [Ping timeout: 265 seconds]
troglodito has joined #fedora-coreos
nalind has quit [Quit: bye for now]
<jlebon> :lgtm:
<jlebon> see y'all tmw!
jlebon has quit [Quit: leaving]
Guest65 has joined #fedora-coreos
Guest65 has quit [Client Quit]
vadaosman[m] has joined #fedora-coreos
gursewak has joined #fedora-coreos
vgoyal has quit [Quit: Leaving]
mheon has quit [Ping timeout: 255 seconds]