dustymabe changed the topic of #fedora-coreos to: Fedora CoreOS :: Find out more at https://getfedora.org/coreos/ :: Logs at https://libera.irclog.whitequark.org/fedora-coreos
sentenza__ has quit [Ping timeout: 246 seconds]
sentenza__ has joined #fedora-coreos
ravanelli has joined #fedora-coreos
hotbox has quit [Ping timeout: 246 seconds]
plarsen has quit [Remote host closed the connection]
hotbox has joined #fedora-coreos
samuelbernardo has quit [Ping timeout: 250 seconds]
samuelbernardo has joined #fedora-coreos
_whitelogger has joined #fedora-coreos
saschagrunert has joined #fedora-coreos
jpn has joined #fedora-coreos
paragan has joined #fedora-coreos
orowith2os has joined #fedora-coreos
jpn has quit [Ping timeout: 245 seconds]
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 246 seconds]
vgoyal has joined #fedora-coreos
<walters> Just thinking about how the periodic race conditions between koji tagger and yum repo sync would be fixed if we versioned rpm-md repositories as OCI artifacts instead (hit this in https://jenkins-coreos-ci.apps.ocp.fedoraproject.org/blue/organizations/jenkins/ostree/detail/PR-2926/2/pipeline )
ravanelli has quit [Remote host closed the connection]
ravanelli has joined #fedora-coreos
plarsen has joined #fedora-coreos
<vrothberg> dustymabe: the change to the ready.service of podman-machine considerably improved the flake rate (see github.com/containers/podman/issues/17403). But ... I am still seeing the flake. What I observe is that the first SSH attempt into the machine always fails. That happens right after starting the machine (and receiving the signal from the ready.service).
<vrothberg> Adding a sleep etc. resolves the issue. It seems there is still a race. Do you have any other tricks up your sleeve? I am tempted to just do an exponential backoff to "wait" for SSH in the machine to be ready. But that feels like patching symptoms rather than fixing the underlying issue.
cyberpear has joined #fedora-coreos
job[m] has joined #fedora-coreos
<dustymabe> vrothberg: I don't have any other ideas without spending more time digging into the problem in depth. It all depends on where the error is happening. Is it actually reaching the sshd process or is port 22 somehow not yet bound? If it's the former we'd need to look at the source code.
<dustymabe> sometimes a sleep/retry is ugly, but it's practical. and at least the delay here should be really small.
<vrothberg> dustymabe: Thanks for checking! I agree that a client-side check is probably the best way forward at the moment.
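[editor's note] The client-side retry discussed above could look roughly like the following sketch. It is not podman's implementation: the names probe_ssh and wait_for_ssh, the state labels, and the delay values are hypothetical and chosen only for illustration. Each attempt also records whether the TCP connection failed outright (port possibly not yet bound) or opened without an SSH identification string arriving (sshd reached but not answering), which is the distinction dustymabe asks about. It assumes the server sends its "SSH-" identification line first, as OpenSSH does.

```python
import socket
import time


def probe_ssh(host, port, timeout=1.0):
    """Classify a single connection attempt (hypothetical helper).

    "unreachable": the TCP connection could not be opened (refused or timed out).
    "no-banner":   the connection opened, but no "SSH-" identification arrived.
    "ready":       the peer sent data starting with "SSH-".
    """
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            sock.settimeout(timeout)
            try:
                data = sock.recv(256)
            except OSError:
                # Covers read timeouts and connection resets after connect.
                return "no-banner"
    except OSError:
        return "unreachable"
    return "ready" if data.startswith(b"SSH-") else "no-banner"


def wait_for_ssh(host, port, attempts=6, base_delay=0.1, max_delay=2.0,
                 probe=probe_ssh, sleep=time.sleep):
    """Retry the probe with exponential backoff.

    Returns (ok, last_state). The delay doubles after each failed attempt
    and is capped at max_delay; no sleep happens after the final attempt.
    """
    delay = base_delay
    state = "unreachable"
    for attempt in range(attempts):
        state = probe(host, port)
        if state == "ready":
            return True, state
        if attempt < attempts - 1:
            sleep(delay)
            delay = min(delay * 2, max_delay)
    return False, state
```

A caller would run something like `ok, state = wait_for_ssh("127.0.0.1", 22)` before the first real SSH command. Logging `state` on the failed attempts would show which side of the race is being hit.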
jpn has joined #fedora-coreos
<dustymabe> vrothberg: sorry I couldn't be of more help :(
<dustymabe> marmijo[m]: mind a review on: https://pagure.io/fedora-infra/ansible/pull-request/1505
<vrothberg> dustymabe: you were of great help! You reassured me that a client-side retry is a valid approach, so I am happy :)
<dustymabe> walters: we put a mitigation for this in the pipeline (adamP used it as a learning opportunity): https://github.com/coreos/fedora-coreos-pipeline/pull/863
<dustymabe> as a followup we were going to try to make it more generic and maybe hoist it up into coreos-ci-lib so the upstream CI projects would have the workaround
nalind has joined #fedora-coreos
saschagrunert has quit [Remote host closed the connection]
Betal has joined #fedora-coreos
Betal has quit [Client Quit]
Betal has joined #fedora-coreos
jpn has quit [Ping timeout: 260 seconds]
<dustymabe> marmijo[m]: I think the vexxhost issues settled since yesterday
jpn has joined #fedora-coreos
miabbott has joined #fedora-coreos
miabbott[m] has joined #fedora-coreos
<marmijo[m]> dustymabe: That's good. I see we've had some successful kola openstack jobs.
<marmijo[m]> bgilbert: It looks like the new release of ignition in rawhide is attempting to install `hv_utils` on ppc64le and s390x and causing the builds for those arches to fail.
<dustymabe> aaradhak anthr76 apiaseck davdunc dustymabe guidon gursewak jaimelm jbrooks jcajka jdoss jlebon jmarrero lorbus miabbott nasirhm quentin9696[m] ravanelli saqali walters
<dustymabe> FCOS community meeting in #fedora-meeting-1
<dustymabe> If you don't want to be pinged remove your name from this file: https://github.com/coreos/fedora-coreos-tracker/blob/main/meeting-people.txt
HristoMarinov[m] has joined #fedora-coreos
mheon has joined #fedora-coreos
paragan has quit [Ping timeout: 250 seconds]
jpn has quit [Ping timeout: 252 seconds]
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 250 seconds]
<bgilbert> marmijo[m]: yeah, makes sense. :-(
<marmijo[m]> bgilbert: I saw your message in Slack. Thank you!
<bgilbert> +1
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 245 seconds]
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 246 seconds]
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 245 seconds]
sentenza has joined #fedora-coreos
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 245 seconds]
sentenza has quit [Ping timeout: 246 seconds]
vgoyal has quit [Quit: Leaving]
sentenza has joined #fedora-coreos
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 264 seconds]
mheon has quit [Ping timeout: 240 seconds]
sentenza_ has joined #fedora-coreos
sentenza has quit [Ping timeout: 240 seconds]
<dustymabe> ravanelli: let me know when to do another round of reviews
cyberpear has quit [Quit: Connection closed for inactivity]
cyberpear has joined #fedora-coreos
sentenza_ has quit [Remote host closed the connection]
sentenza has joined #fedora-coreos
nalind has quit [Quit: bye for now]
plarsen has quit [Remote host closed the connection]
mboddu_ has joined #fedora-coreos
travisghansen9 has joined #fedora-coreos
ninjanne has joined #fedora-coreos
travisghansen has quit [*.net *.split]
mnaser__ has quit [*.net *.split]
flokli has quit [*.net *.split]
HappyMan has quit [*.net *.split]
mboddu has quit [*.net *.split]
justJanne has quit [*.net *.split]
travisghansen9 is now known as travisghansen
mnaser__ has joined #fedora-coreos
flokli has joined #fedora-coreos
HappyMan has joined #fedora-coreos