plarsen has quit [Remote host closed the connection]
<orowith2os>
<marshallford> "Hello all, I'm running into what..." <- can't you specify that the container should wait for network to be available before doing anything?
ravanelli has joined #fedora-coreos
ravanelli has quit [Remote host closed the connection]
ravanelli has joined #fedora-coreos
marshallford has joined #fedora-coreos
ravanelli has quit [Remote host closed the connection]
ravanelli has joined #fedora-coreos
ravanelli has quit [Ping timeout: 245 seconds]
marshallford has quit [Ping timeout: 245 seconds]
misuto has quit [Quit: Leaving]
misuto has joined #fedora-coreos
sentenza has quit [Remote host closed the connection]
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 245 seconds]
jpn has joined #fedora-coreos
jasperw[m] has joined #fedora-coreos
jpn has quit [Ping timeout: 245 seconds]
hartan[m] has quit [Remote host closed the connection]
samcday[m] has joined #fedora-coreos
<samcday[m]>
Hello! I'm trying to get stable CoreOS to boot on an old Librem 13 I have lying around. After GRUB screen I see the messages for loading kernel + initrd and then ... nothing. Screen goes blank and nothing seems to be happening.
<samcday[m]>
I saw some related issues in Fedora workstation forums, but the suggested `module_blacklist=ucsi_acpi` kernel option didn't help. I also tried disabling CPU mitigations, adding `rd.shell rd.debug`, nothing seems to work. The same ISO is booting okay on my other laptop.
<samcday[m]>
I thought maybe it was a newer kernel issue, but latest Arch ISO with 6.3.9 is booting fine. Fedora 38 Workstation is also booting fine and showing both early con and full graphical environment.
<samcday[m]>
Any advice/suggestions appreciated!
lack has quit [Read error: Connection reset by peer]
jpn has joined #fedora-coreos
lack has joined #fedora-coreos
jpn has quit [Ping timeout: 246 seconds]
sentenza has joined #fedora-coreos
jpn has joined #fedora-coreos
dustymabe has quit [Read error: Connection reset by peer]
ravanelli has quit [Remote host closed the connection]
ravanelli has joined #fedora-coreos
<samcday[m]>
Colin Walters: Ah yes, I also tried the suggestions on that page to no avail. Although maybe I've been trying the wrong thing. I haven't been customizing the kernel args per the suggestions on that page, but instead hitting TAB on the GRUB splash and editing the kernel arguments there. I tried to append `console=tty0` using that method with no effect
ravanelli has quit [Remote host closed the connection]
<samcday[m]>
* that method, with, * no effect. Would you expect that method to work?
ravanelli has joined #fedora-coreos
<walters>
Yes. You don’t see any logs at all after grub?
<samcday[m]>
Zip. The similar issue reported by folks in Fedora Discourse suggest a single, sad, solitary `_` shows on the screen. In my case the screen goes completely blank. The backlight is still on but absolutely no output.
<samcday[m]>
Right after it finishes the "Loading initrd..." there's a brief ... effect. Kinda like a rapid scroll-up of the kernel + initrd loading messages, but with lots of tearing, and then completely blank.
<samcday[m]>
(But to reiterate, I can get a live Fedora Workstation 38 ISO to boot on this same machine)
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 272 seconds]
plarsen has joined #fedora-coreos
jpn has joined #fedora-coreos
jpn has quit [Ping timeout: 272 seconds]
marshallford has joined #fedora-coreos
jpn has joined #fedora-coreos
marshallford has quit [Remote host closed the connection]
marshallford has joined #fedora-coreos
jpn has quit [Ping timeout: 245 seconds]
marshallford has quit [Quit: Leaving]
mbff has joined #fedora-coreos
<mbff>
Hello all, I'm running into what I think is a race condition on boot between a quadlet container pull and networking/DNS being available. Any tips? I suspect this is an issue with systemd and not podman itself.
<mbff>
For a touch more context: I've tried After/Wants with network-online.target, systemd-resolved.service, nss-lookup.service, etc
<pemensik[m]>
I think podman usually does not include systemd inside container. So would not include systemd-resolved there.
<pemensik[m]>
I am not sure how podman propagates resolv.conf to its containers. Do you start podman containers right after boot? Do they have After=nss-lookup.service?
<mbff>
Take podman out of it (I'm using podman to generate systemd services that run containers) -- how in CoreOS can I guarantee that a service starts after DNS/networking is up?
<mbff>
The exact error I'm getting is "Error: initializing source docker://certbot/dns-route53:v2.6.0: pinging container registry registry-1.docker.io: Get "https://registry-1.docker.io/v2/": dial tcp: lookup registry-1.docker.io: no such host"
<pemensik[m]>
After=nsss-lookup.target should be enough on the same machine. Can you check what /etc/resolv.conf contents do they have?
<mbff>
The /etc/resolv.conf looks reasonable
<mbff>
I've tried nss-lookup.target (both as a Wants and Requires)
<pemensik[m]>
does getent ahosts registry-1.docker.io return addresses from it?
<pemensik[m]>
don't use Wants or Requires. They should be started anyway. You need just After=
<mbff>
yes it returns an address (if I login to the machine, but that point the networking/DNS seems up). Tempted to add an exec pre with a sleep for 10s but that feels hacky.
<mbff>
I should also note I use butane/ignition to set up static networking which could be causing issues?
<pemensik[m]>
It should be possible as podman run -ti fedora:latest getent hosts fedoraproject.org
<mbff>
Not sure I follow... once the host has networking/DNS I should be good to go to pull a container image
<mbff>
Is there a way to wait until ignition has setup static networking? Is it possible that there is a blip between switching from DHCP (auto) to static on that interface?
<mbff>
Or does that setup happen on the initial boot anyway (before any systemd services I've defined run)
<pemensik[m]>
so it is failing not inside the container, but still on the host preparing that container?
<pemensik[m]>
I don't know anything about ignition, sorry. cannot help with that
<mbff>
correct
<mbff>
the container pull/download is failing
<pemensik[m]>
it should work with NM or systemd-networkd
<mbff>
What do you mean?
<pemensik[m]>
Try comparing error time with time of "systemctl status nss-lookup.target" got started.
<pemensik[m]>
waiting for network-online.target has to be implemented by network configuration system. I have no idea whether ignition tries to do that.
<mbff>
good idea on that first message
<mbff>
As for network-online.. wouldn't that be up to NM to implmement that not ignition?
<mbff>
The log "Reached target nss-lookup.target" happened on the same sec as "Trying to pull docker.io/certbot/dns-route53:v2.6.0..."
<mbff>
Let me try again with nss-lookup.target
<pemensik[m]>
well, maybe. It is generating NM configuration. but not sure it works correctly with method=manual
<mbff>
Still logging on the same second
<mbff>
With After=network-online.target nss-lookup.target Wants=network-online.target nss-lookup.target
<mbff>
Wonder if I shouldn't try After and Wants with NetworkManager-wait-online.service and NetworkManager.service?
<mbff>
Nevermind, NM-wait-online is WantedBy=network-online.target.
<mbff>
pemensik[m], still around? I've been troubleshooting some more with restart=on-failure, but that solution doesn't work for oneshot services.
<mbff>
Bottom line: I'm convinced that NetworkManager-wait-online.service starts too quickly that services running after network-online.target may fail if they rely on network/DNS.
apollo13 has quit [Remote host closed the connection]
jpn has quit [Ping timeout: 245 seconds]
apollo13 has joined #fedora-coreos
mbff has quit [Ping timeout: 246 seconds]
<pemensik[m]>
Its possible. I expect most software should retry few times before giving up. I admit it might be difficult to analyze it properly. Recording queries by wireshark might help
plarsen has quit [Remote host closed the connection]