dustymabe changed the topic of #fedora-coreos to: Fedora CoreOS :: Find out more at https://getfedora.org/coreos/ :: Logs at https://libera.irclog.whitequark.org/fedora-coreos
nb has quit [Quit: The Lounge - https://thelounge.chat]
nb has joined #fedora-coreos
poppajarv has quit [Quit: Ping timeout (120 seconds)]
poppajarv has joined #fedora-coreos
saroy has joined #fedora-coreos
testing_ has joined #fedora-coreos
testing_ is now known as frytaped
frytaped has quit [Read error: Connection reset by peer]
frytaped has joined #fedora-coreos
saroy has quit [Remote host closed the connection]
testing is now known as crap
crap is now known as go4godvin
bgilbert has quit [Ping timeout: 268 seconds]
frytaped has quit [Quit: WeeChat 3.5]
paragan has joined #fedora-coreos
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 248 seconds]
gursewak has quit [Ping timeout: 256 seconds]
jpn has joined #fedora-coreos
gursewak has joined #fedora-coreos
jcajka has joined #fedora-coreos
gursewak has quit [Ping timeout: 256 seconds]
bgilbert has joined #fedora-coreos
jpn has quit [Ping timeout: 256 seconds]
bgilbert has quit [Ping timeout: 248 seconds]
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 255 seconds]
c4rt0 has joined #fedora-coreos
jpn has joined #fedora-coreos
Betal has quit [Quit: WeeChat 3.6]
jpn has quit [Ping timeout: 256 seconds]
jpn has joined #fedora-coreos
frytaped has joined #fedora-coreos
ravanelli has joined #fedora-coreos
wkawka has quit [Quit: Client closed]
wkawka has joined #fedora-coreos
crap has joined #fedora-coreos
frytaped has quit [Quit: WeeChat 3.5]
ravanelli has quit [Ping timeout: 256 seconds]
ravanelli has joined #fedora-coreos
wkawka has quit [Quit: Client closed]
jpn has quit [Ping timeout: 252 seconds]
nalind has joined #fedora-coreos
ravanelli has quit [Read error: Connection reset by peer]
jpn has joined #fedora-coreos
ravanelli has joined #fedora-coreos
jpn has quit [Ping timeout: 244 seconds]
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 268 seconds]
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 255 seconds]
mheon has joined #fedora-coreos
jpn has joined #fedora-coreos
<dustymabe> jlebon: it looks like we have an issue in our --rerun logic in mantle
<dustymabe> the recent --tag=reprovision work you did seems to throw it off
<dustymabe> it re-runs the failed test, but also all other tests (the ones that passed before)
ravanelli has quit [Ping timeout: 244 seconds]
<jlebon> dustymabe: indeed. fun
<jlebon> let me file something in cosa
arnulfo_7 has joined #fedora-coreos
arnulfo_7 has quit [Changing host]
arnulfo_7 has joined #fedora-coreos
ravanelli has joined #fedora-coreos
<dustymabe> jlebon: sweet - commented there
<jlebon> linked them
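[editor's note] The rerun behavior dustymabe expects above (re-run only the tests that failed, not the ones that already passed) can be sketched as below. This is a hypothetical illustration, not mantle's or kola's actual code: `KolaResult`, `select_rerun`, and the test names are assumptions made up for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class KolaResult:
    """Hypothetical record of one test outcome (not a real mantle type)."""
    name: str
    passed: bool


def select_rerun(results):
    """Return the names of failed tests only, in first-seen order, without duplicates."""
    seen = set()
    rerun = []
    for result in results:
        if not result.passed and result.name not in seen:
            seen.add(result.name)
            rerun.append(result.name)
    return rerun


if __name__ == "__main__":
    # Made-up test names, for illustration only.
    first_run = [
        KolaResult("example.test-a", True),
        KolaResult("example.test-b", False),
        KolaResult("example.test-c", True),
    ]
    print(select_rerun(first_run))
```

The bug described in the chat corresponds to the rerun set containing tests whose `passed` flag was true; the sketch filters on that flag explicitly.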
<dustymabe> jlebon: new topic: yesterday I put in https://github.com/coreos/fedora-coreos-config/commit/02c26bf719fffa8f997a402285172406cfdca83f but apparently the test is still failing
<dustymabe> passes locally
<dustymabe> ideas?
<dustymabe> the pipeline finally failed so I'll grab the logs from it now to see if there is anything
<dustymabe> oh interesting
<jlebon> yeah, let's attach logs to the tracker issue
<dustymabe> in the pipeline run this time the `coreos.boot-mirror.luks/detach-primary` failed on the first go round and in the rerun it was `coreos.boot-mirror/detach-primary`
<dustymabe> one message on the console that keeps scrolling by: "block device autoloading is deprecated and will be removed."
<dustymabe> that message is also in my local logs (where the test passed)
<jlebon> hmm, not sure what that's referring to
<dustymabe> but in my local logs it takes upwards of 6 minutes to reboot that machine
<dustymabe> so clearly the problem is there in my local tests too - just happens to be fast enough to not trigger the timeout
jcajka has quit [Quit: Leaving]
ravanelli has quit [Remote host closed the connection]
<dustymabe> jlebon: basically it appears that (in a RAID setup), after we delete the primary block device and then try to reboot, that reboot gets hung up and can take a really long time.
crobinso has joined #fedora-coreos
<dustymabe> jlebon: re: the autoloading message from earlier.. check out https://patchwork.kernel.org/project/linux-block/patch/20220104071647.164918-1-hch@lst.de/#24842631
<dustymabe> where the guy talks about mdraid
bytehackr has joined #fedora-coreos
plarsen has joined #fedora-coreos
bgilbert has joined #fedora-coreos
gursewak has joined #fedora-coreos
<walters> TIL centos ci is running 6 metal nodes in aws right now
ravanelli has joined #fedora-coreos
jpn has quit [Ping timeout: 244 seconds]
paragan has quit [Quit: Leaving]
<jlebon> MichaelArmijo[m]: ok, there's a simpler approach. try rerunning the job and use 'basic' for the KOLA_TESTS parameter
<MichaelArmijo[m]> jlebon: sounds good. I'll do that now
<jlebon> the thing with parallelizing it is that I think we've been hitting capacity limits in AWS for aarch64
<jlebon> dustymabe: haven't read scrollback yet. lots of meetings :)
<MichaelArmijo[m]> jlebon: test restarted
<dustymabe> I think the capacity limit doesn't have to do with `quota` though. it's just amazon running out of instances
jpn has joined #fedora-coreos
Betal has joined #fedora-coreos
jpn has quit [Quit: Lost terminal]
<MichaelArmijo[m]> dustymabe: jlebon: I removed ppc64le from the build jobs, should I also remove that arch from the release job?
<dustymabe> MichaelArmijo[m]: yes please
<MichaelArmijo[m]> sounds good. thanks
ravanelli has quit [Remote host closed the connection]
crobinso has quit [Ping timeout: 268 seconds]
<jlebon> dustymabe: re. quota, the comment was about whether we should change our cloud tests so that everything runs in parallel. that way, if e.g. a test in the regular kola run fails but passes in the rerun, we've still run all kola invocations and can choose to ignore the one failure
<jlebon> another approach is to remember the kola failure, but still keep going and still fail the overall job at the end
bytehackr has quit [Ping timeout: 255 seconds]
rsalveti has quit [Quit: Connection closed for inactivity]
<dustymabe> jlebon: ahh I see what you are saying.. IOW the "followup" tests where we launch on a specific instance type (or types) would run alongside the main tests?
<dustymabe> jlebon: gursewak: mind a review on https://github.com/coreos/fedora-coreos-config/pull/1926
<jlebon> dustymabe: yup exactly
<dustymabe> makes sense
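[editor's note] The alternative jlebon mentions (remember a kola failure, keep going, and still fail the overall job at the end) can be sketched roughly as follows. This is a minimal illustration under assumptions, not the actual pipeline code: `run_all`, `job_exit_code`, and the step names are hypothetical, and each step is modeled as a plain callable that raises on failure.

```python
def run_all(steps):
    """Run every (name, callable) step, even after a failure.

    Returns a list of (name, error message) pairs for the steps that raised.
    """
    failures = []
    for name, step in steps:
        try:
            step()
        except Exception as exc:  # remember the failure, but keep going
            failures.append((name, str(exc)))
    return failures


def job_exit_code(steps):
    """Return 0 if every step passed, 1 otherwise, after running all of them."""
    failures = run_all(steps)
    for name, message in failures:
        print(f"FAILED {name}: {message}")
    return 1 if failures else 0


if __name__ == "__main__":
    def passing_step():
        pass

    def failing_step():
        raise RuntimeError("simulated test failure")

    # Hypothetical step names; later steps still run after the failure.
    steps = [
        ("main-run", failing_step),
        ("followup-instance-type", passing_step),
    ]
    raise SystemExit(job_exit_code(steps))
```

The design point is that the failure decision is deferred: every invocation produces a result before the job is marked failed, so a person reading the results can decide whether a single failure is ignorable.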
jpn has joined #fedora-coreos
<dustymabe> jlebon: a few for the branched enablement:
nalind has quit [Quit: until next time]
jpn has quit [Ping timeout: 256 seconds]
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 256 seconds]
c4rt0 has quit [Ping timeout: 255 seconds]
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 268 seconds]