#fedora-coreos on 2022-08-23 — irc logs at libera.irclog.whitequark.org

2022-05-11 12:42 dustymabe changed the topic of #fedora-coreos to: Fedora CoreOS :: Find out more at https://getfedora.org/coreos/ :: Logs at https://libera.irclog.whitequark.org/fedora-coreos

02:35 nb has quit [Quit: The Lounge - https://thelounge.chat]

02:41 nb has joined #fedora-coreos

04:17 poppajarv has quit [Quit: Ping timeout (120 seconds)]

04:18 poppajarv has joined #fedora-coreos

04:29 saroy has joined #fedora-coreos

04:37 testing_ has joined #fedora-coreos

04:39 testing_ is now known as frytaped

04:43 frytaped has quit [Read error: Connection reset by peer]

04:44 frytaped has joined #fedora-coreos

05:10 saroy has quit [Remote host closed the connection]

05:14 testing is now known as crap

05:23 crap is now known as go4godvin

05:28 bgilbert has quit [Ping timeout: 268 seconds]

05:30 frytaped has quit [Quit: WeeChat 3.5]

06:09 paragan has joined #fedora-coreos

06:13 jpn has joined #fedora-coreos

06:18 jpn has quit [Ping timeout: 248 seconds]

06:25 gursewak has quit [Ping timeout: 256 seconds]

06:29 jpn has joined #fedora-coreos

06:32 gursewak has joined #fedora-coreos

06:45 jcajka has joined #fedora-coreos

06:57 gursewak has quit [Ping timeout: 256 seconds]

07:09 bgilbert has joined #fedora-coreos

07:13 jpn has quit [Ping timeout: 256 seconds]

07:26 bgilbert has quit [Ping timeout: 248 seconds]

07:28 jpn has joined #fedora-coreos

07:32 jpn has quit [Ping timeout: 255 seconds]

07:37 c4rt0 has joined #fedora-coreos

08:19 jpn has joined #fedora-coreos

08:48 Betal has quit [Quit: WeeChat 3.6]

09:32 jpn has quit [Ping timeout: 256 seconds]

09:51 jpn has joined #fedora-coreos

10:11 frytaped has joined #fedora-coreos

10:11 ravanelli has joined #fedora-coreos

10:25 wkawka has quit [Quit: Client closed]

10:29 wkawka has joined #fedora-coreos

10:49 crap has joined #fedora-coreos

11:08 frytaped has quit [Quit: WeeChat 3.5]

11:15 ravanelli has quit [Ping timeout: 256 seconds]

11:16 ravanelli has joined #fedora-coreos

11:27 wkawka has quit [Quit: Client closed]

11:51 jpn has quit [Ping timeout: 252 seconds]

12:00 nalind has joined #fedora-coreos

12:14 <dustymabe> easy review: https://github.com/coreos/fedora-coreos-config/pull/1924

12:17 ravanelli has quit [Read error: Connection reset by peer]

12:17 jpn has joined #fedora-coreos

12:17 ravanelli has joined #fedora-coreos

12:22 jpn has quit [Ping timeout: 244 seconds]

12:34 jpn has joined #fedora-coreos

12:39 jpn has quit [Ping timeout: 268 seconds]

12:50 jpn has joined #fedora-coreos

13:02 jpn has quit [Ping timeout: 255 seconds]

13:07 mheon has joined #fedora-coreos

13:07 jpn has joined #fedora-coreos

13:14 <dustymabe> jlebon: it looks like we have an issue in our --rerun logic in mantle

13:14 <dustymabe> the recent --tag=reprovision work you did seems to throw it off

13:15 <dustymabe> it re-runs the failed test, but also all other tests (the ones that passed before)

13:15 <dustymabe> see https://jenkins-fedora-coreos-pipeline.apps.ocp.fedoraproject.org/blue/organizations/jenkins/build/detail/build/72/pipeline

13:20 ravanelli has quit [Ping timeout: 244 seconds]

13:22 <jlebon> dustymabe: indeed. fun

13:23 <jlebon> let me file something in cosa

13:23 arnulfo_7 has joined #fedora-coreos

13:23 arnulfo_7 has quit [Changing host]

13:23 arnulfo_7 has joined #fedora-coreos

13:27 <jlebon> https://github.com/coreos/coreos-assembler/issues/3041

13:28 ravanelli has joined #fedora-coreos

13:31 <dustymabe> jlebon: sweet - commented there

13:31 <jlebon> linked them

13:33 <dustymabe> jlebon: new topic: yesterday I put in https://github.com/coreos/fedora-coreos-config/commit/02c26bf719fffa8f997a402285172406cfdca83f but apparently the test is still failing

13:34 <dustymabe> passes locally

13:34 <dustymabe> ideas?

13:35 <dustymabe> the pipeline finally failed so I'll grab the logs from it now to see if there is anything

13:36 <dustymabe> oh interesting

13:37 <jlebon> yeah, let's attach logs to the tracker issue

13:37 <dustymabe> in the pipeline run this time the `coreos.boot-mirror.luks/detach-primary` failed on the first go round and in the rerun it was `coreos.boot-mirror/detach-primary`

13:39 <dustymabe> one message on the console that keeps scrolling by: "block device autoloading is deprecated and will be removed."

13:41 <dustymabe> that message is also in my local logs (where the test passed)

13:42 <jlebon> hmm, not sure what that's referring to

13:42 <dustymabe> but in my local logs it takes upwards of 6 minutes to reboot that machine

13:43 <dustymabe> so clearly the problem is there in my local tests too - just happens to be fast enough to not trigger the timeout

13:45 <dustymabe> jlebon: a quick google search leads to https://patchwork.kernel.org/project/linux-block/patch/20220103190342.146980-1-hch@lst.de/

13:58 jcajka has quit [Quit: Leaving]

14:01 ravanelli has quit [Remote host closed the connection]

14:08 <dustymabe> jlebon: basically it appears (in a RAID setup) after we delete the primary block device and then try to reboot that reboot gets hung up and can take a really long time.

14:13 crobinso has joined #fedora-coreos

14:23 <dustymabe> jlebon: re: the autoloading message from earlier.. check out https://patchwork.kernel.org/project/linux-block/patch/20220104071647.164918-1-hch@lst.de/#24842631

14:23 <dustymabe> where the guy talks about mdraid

14:24 bytehackr has joined #fedora-coreos

14:52 plarsen has joined #fedora-coreos

15:04 bgilbert has joined #fedora-coreos

15:06 gursewak has joined #fedora-coreos

15:09 <walters> TIL centos ci is running 6 metal nodes in aws right now

15:14 ravanelli has joined #fedora-coreos

15:25 jpn has quit [Ping timeout: 244 seconds]

15:36 paragan has quit [Quit: Leaving]

15:37 <jlebon> MichaelArmijo[m]: ok, there's a simpler approach. try rerunning the job and use 'basic' for the KOLA_TESTS parameter

15:37 <MichaelArmijo[m]> jlebon: sounds good. I'll do that now

15:37 <jlebon> the thing with parallelizing it is that I think we've been hitting capacity limits in AWS for aarch64

15:38 <jlebon> dustymabe: haven't read scrollback yet. lots of meetings :)

15:40 <MichaelArmijo[m]> jlebon: test restarted

15:40 <dustymabe> I think the capacity limit doesn't have to do with `quota` though. it's just amazon running out of instances

15:52 jpn has joined #fedora-coreos

16:19 Betal has joined #fedora-coreos

17:03 jpn has quit [Quit: Lost terminal]

17:56 <MichaelArmijo[m]> dustymabe: jlebon: I removed ppc64le from the build jobs, should I also remove that arch from the release job?

17:57 <dustymabe> MichaelArmijo[m]: yes please

17:57 <MichaelArmijo[m]> sounds good. thanks

18:14 ravanelli has quit [Remote host closed the connection]

18:20 crobinso has quit [Ping timeout: 268 seconds]

18:41 <jlebon> dustymabe: re. quota, the comment was around whether we should change our cloud tests so that everything runs in parallel so that if e.g. a test in the regular kola run fails but passes in the rerun, we've still run all kola invocations so we can choose to ignore the one failure

18:41 <jlebon> another approach is to remember the kola failure, but still keep going and still fail the overall job at the end

18:46 bytehackr has quit [Ping timeout: 255 seconds]

20:05 rsalveti has quit [Quit: Connection closed for inactivity]

20:21 <dustymabe> jlebon: ahh I see what you are saying.. IOW the "followup" tests where we launch on a specific instance type (or types) would run alongside the main tests?

20:22 <dustymabe> jlebon: gursewak: mind a review on https://github.com/coreos/fedora-coreos-config/pull/1926

20:23 <jlebon> dustymabe: yup exactly

20:24 <dustymabe> makes sense

20:40 jpn has joined #fedora-coreos

20:55 <dustymabe> jlebon: a few for the branched enablement:

20:55 <dustymabe> https://github.com/coreos/fedora-coreos-pipeline/pull/616

20:56 <dustymabe> https://github.com/coreos/fedora-coreos-config/pull/1927

21:00 nalind has quit [Quit: until next time]

21:28 jpn has quit [Ping timeout: 256 seconds]

23:06 jpn has joined #fedora-coreos

23:10 jpn has quit [Ping timeout: 256 seconds]

23:12 c4rt0 has quit [Ping timeout: 255 seconds]

23:39 jpn has joined #fedora-coreos

23:43 jpn has quit [Ping timeout: 268 seconds]