dl9pf changed the topic of #yocto to: Welcome to the Yocto Project | Learn more: http://www.yoctoproject.org | Join the community: http://www.yoctoproject.org/community | Channel logs available at https://www.yoctoproject.org/irc/ and https://libera.irclog.whitequark.org/yocto/ | Having difficulty on the list, or with someone on the list? Contact YP community mgr Nicolas Dechesne (ndec)
alejandr1 has joined #yocto
nerdboy_ is now known as nerdboy
nerdboy has joined #yocto
nerdboy has quit [Changing host]
sakoman has quit [Read error: Connection reset by peer]
jpuhlman__ has joined #yocto
jpuhlman_ has quit [Ping timeout: 258 seconds]
sakoman has joined #yocto
sakoman has quit [Quit: Leaving.]
camus has joined #yocto
georgem has quit [Quit: Connection closed for inactivity]
Vonter has quit [Ping timeout: 265 seconds]
camus1 has joined #yocto
camus has quit [Remote host closed the connection]
camus1 is now known as camus
paulg has quit [Ping timeout: 272 seconds]
davidinux1 has joined #yocto
davidinux1 is now known as davidinux
manuel_ has quit [Ping timeout: 246 seconds]
rob_w has joined #yocto
jonah1024 has joined #yocto
goliath has joined #yocto
Schlumpf has joined #yocto
camus1 has joined #yocto
camus has quit [Read error: Connection reset by peer]
camus1 is now known as camus
Guest32 has joined #yocto
<Guest32> Hi,
davidinux has quit [Ping timeout: 265 seconds]
<Guest32> I tried to upgrade the Yocto version from Dunfell to Hardknott. But after upgrading to hardknott I don't see the libpcre2 package under /usr/lib/. Below are the missing so files with hardknott installed:
mckoan|away is now known as mckoan
<mckoan> good morning
<Guest32> libpcre2-8.so.0
davidinux has joined #yocto
<Guest32> libpcre2-posix.so.2 libpcre2-8.so.0.9.0 libpcre2-posix.so.2.0.3
<Guest32> good morning
<Guest32> any input on this?
frieder has joined #yocto
Guest3216 has joined #yocto
Guest3216 has quit [Client Quit]
RKBH has joined #yocto
Guest32 has quit [Ping timeout: 246 seconds]
rfried has quit [Quit: The Lounge - https://thelounge.github.io]
rfried has joined #yocto
cquast has joined #yocto
florian has joined #yocto
zpfvo has joined #yocto
prabhakarlad has joined #yocto
manuel_ has joined #yocto
florian has quit [Ping timeout: 252 seconds]
tnovotny has joined #yocto
RKBH has quit [Quit: Client closed]
Schlumpf has quit [Quit: Client closed]
leon-anavi has joined #yocto
manuel_ is now known as Manuel1985
ant__ has quit [Remote host closed the connection]
Manuel1985 is now known as manuel1985
leonanavi has joined #yocto
leon-anavi has quit [Ping timeout: 258 seconds]
leonanavi is now known as leon-anavi
leon-anavi has quit [Client Quit]
leon-anavi has joined #yocto
Schlumpf has joined #yocto
leon-anavi has quit [Remote host closed the connection]
leon-anavi has joined #yocto
mihai has joined #yocto
<RP> paulbarker: that patchset makes things worse I'm afraid: https://autobuilder.yoctoproject.org/typhoon/#/builders/83/builds/2293
<RP> paulbarker: looks like it is on the older distros
mranostaj has quit [Remote host closed the connection]
mranostaj has joined #yocto
<paulbarker> RP: At least it's a quick failure now!
<paulbarker> RP: Is there a quick way to tell which python version bitbake is running under?
<paulbarker> As Debian 8 will be using the buildtools tarball, I guess older distro actually means newer python
frieder has quit [Ping timeout: 268 seconds]
<kanavin> why is the debian 8 builder even still active?
frieder has joined #yocto
<RP> paulbarker: right, a lot of those (all?) would be using buildtools, yes
<paulbarker> RP: I'll see if I can grab the latest buildtools tarball and run a build with that locally
<RP> paulbarker: you can see which one it is using from the helper
<RP> (it says which hosts at the end too)
<paulbarker> RP: Thank you! I'll grab that and set it up here
<paulbarker> It'll be on opensuse-15.3 but I think the issue here may be due to the Python version so as long as that matches I should hopefully see it fail
<paulbarker> If it all works fine I guess it's time for a Debian 8 VM, though that will take longer
<RP> paulbarker: I'm not sure what the trigger was but that seems the logical place to start...
<RP> paulbarker: could be as simple as something missing from buildtools :/
<paulbarker> RP: Just to confirm - is the hashserv instance running from the same commit of bitbake during these tests? Or is that running from a known-good commit?
<RP> paulbarker: the hashserv is a single instance autobuilder-wide and unchanged during these tests - that would be upgraded separately as it is standalone
<RP> unless tests use a local one
<paulbarker> RP: Ok, that will narrow down where the failure could be
zyga-mbp has joined #yocto
gourve_l has quit [Ping timeout: 258 seconds]
zyga-mbp has quit [Read error: Connection reset by peer]
zyga-mbp has joined #yocto
gourve_l has joined #yocto
florian has joined #yocto
frieder has quit [Ping timeout: 272 seconds]
<kanavin> rburton, RP: zstd decompresses 10 times faster than xz. I'll look into switching rpm compression to that in 4.17 timeframe, as we could get drastically faster do_rootfs and do_populate_sdk from it.
<kanavin> (rpm 4.17 that is, currently in rc)
camus1 has joined #yocto
<kanavin> compression times are similar
<RP> kanavin: sounds nice! :)
<rburton> awesome
<kanavin> RP, rburton: I tested with a 2.5 GB tarball, 6 GB uncompressed
camus has quit [Ping timeout: 272 seconds]
camus1 is now known as camus
<RP> rburton: I just did some stats collection. 18 ptest AB-INT failures, 8 only ever seen on arm host :/
<kanavin> xz took 110 seconds, zstd 10 seconds (!!!)
<rburton> nice!
<rburton> RP: ouch
<rburton> RP: load, i imagine? weaker host?
<RP> rburton: the stats are deceptive as where it occurred once on x86 I didn't mark as arm specific but the arm failures are much more frequent :/
<RP> rburton: I'm not sure of the cause, we did back off the load on the arm worker but it didn't seem to improve things
<RP> kanavin: that is pretty neat.
<RP> rburton: we need some kind of a plan for the ptest issues as they're about 40% of the open AB-INT issues
<rburton> glancing at the list, i'm half debugging 14244 so i'll finish that off
<perdmann_> I want to create a lib from the min protocol, but I get: ERROR: libmin-1.0-r0 do_install: oe_soinstall: libmin.so.1.0 is missing ELF tag 'SONAME'.
<rburton> your makefile is broken
frieder has joined #yocto
* RP has closed 11 of the AB-INT bugs, down to 46 of them now
<perdmann_> rburton: i dont have a makefile ... https://dpaste.org/Z2WR
<RP> rburton: in the interests of closing bugs - https://bugzilla.yoctoproject.org/show_bug.cgi?id=13999 - the remaining issue is overlap of files. Does the sstate code not detect that? Maybe it didn't due to the quoting issue?
<rburton> perdmann_: please write a makefile, and delete most of that recipe
<rburton> perdmann_: there's a perfectly good cmakelists in the repo you're cloning, why are you building by hand?
davidinux has quit [Ping timeout: 272 seconds]
<rburton> perdmann_: your recipe can most likely be just SRC_URI/S assignments, and inherit cmake
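A sketch of the slimmed-down recipe rburton is describing; SRC_URI, SRCREV and the license checksum are placeholders to fill in for the real min repo:

```
SUMMARY = "min protocol library"
LICENSE = "MIT"
LIC_FILES_CHKSUM = "file://LICENSE;md5=<fill-in>"

SRC_URI = "git://github.com/example/min.git;protocol=https;branch=master"
SRCREV = "<fill-in>"
S = "${WORKDIR}/git"

inherit cmake
```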
davidinux has joined #yocto
bunk has joined #yocto
<RP> rburton: hmm, you're right about the directory race :/
zyga-mbp has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]
<RP> rburton: code even says " # We can race against another package populating directories as we're removing them so we ignore errors here." :/
florian_kc has joined #yocto
<RP> that isn't enough :(
<rburton> what race?
<rburton> obviously, i'm always right
<RP> rburton: https://bugzilla.yoctoproject.org/show_bug.cgi?id=13999 - sstate can be running sstate_clean_manifest whilst another task extracts files
<RP> rburton: always :)
<rburton> oh that, yeah
<rburton> just bite the bullet and put a read/write lock on pkgdata
<rburton> or sstate in general
<RP> rburton: its a general sstate problem though :(
<RP> rburton: the read/write locks are painful on performance
<rburton> worth benchmarking though?
<RP> rburton: I have before a long time ago
<RP> rburton: imagine a build where all the setscene tasks end up serialised :/
zyga-mbp has joined #yocto
camus1 has joined #yocto
camus has quit [Read error: Connection reset by peer]
camus1 is now known as camus
<perdmann_> rburton: that cmake file does not create a .so
<perdmann_> rburton: ohhh it is, i see... i am sorry. i will inherit cmake and try again, thanks
<zedd> RP: 5.13 dropped, so I'm finalizing the libc-headers and reference recipes now ... I'll triple check that our LTP and rcu stall changes are there (I've been testing them on 5.13 already), since they won't be mainline quite yet.
<rburton> perdmann_: delete 99% of the recipe in the process, most of that recipe is redundant or actively harmful
<RP> zedd: thanks. We managed to close a lovely number of AB-INT bugs with those :)
<RP> zedd: 58 down to 46
<rburton> is that rcu lock the core problem? and now it will stall on load but not crash and die?
<zedd> awesome. and I hope the remaining are less annoying, :D hopefully no more kernel ones.
<zedd> but we obviously should document those as "why the yocto AB stress testing helps the world"
<RP> rburton: we'll get warnings now but not hangs, the hang was the problem
<rburton> right
<rburton> as the stalls are not massively unexpected when on heavy load, that's fine
<RP> rburton: exactly
<RP> we can live with the odd stall. Locking up the VM is antisocial though
<perdmann_> rburton: so i don't need SOFILE and the SO-related stuff?
<RP> zedd: it means I can't ignore the "bitbake server timeout" issue for much longer :(
<RP> zedd: I'd rather debug the kernel than try and fix that :(
<rburton> perdmann_: no, that's all default
<rburton> and the insane skips were because you were building wrong
<RP> zedd: I've closed most of the qemu weirdness bugs on the basis we should reopen new ones with "good" data
<zedd> RP: indeed. I'm thinking it is even harder to reproduce for debugging as it's in the guts of things
<RP> zedd: I know what the bitbake server issue is. I could just increase the timeout as it is an IO problem. I just don't like doing that :/
<zedd> RP: yah, that's the most efficient way to get real ones to pop back up.
<RP> Really the whole bitbake server thing needs rewriting
<zedd> aha
<RP> zedd: torn between hacking around it or doing some nasty rewrite
* zedd is always tempted by rewrites :D
* RP remembers the large number of races we fixed in this code already
<perdmann_> rburton: thanks...
zyga-mbp has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]
georgem has joined #yocto
zyga-mbp has joined #yocto
dmoseley has quit [Quit: ZNC 1.8.2 - https://znc.in]
argonautx has joined #yocto
<perdmann_> rburton: ok, i removed evything and added the line "inherit cmake"
<perdmann_> But then bitbake tells me its missing an install task, so i readded the install task but then i get some SONAME Error
<rburton> inherit cmake will provide an install task
<rburton> pastebin your recipe?
<perdmann_> rburton: of course
<rburton> maybe the cmakelists is broken too
<rburton> you just need to respect CC CPPFLAGS CFLAGS LDFLAGS etc, all in the environment
<rburton> cmake does that normally, but people can write bad cmakefiles that explicitly don't
<rburton> only so much you can do when people actively break stuff
<rburton> RP: think i fixed the util-linux one
dmoseley has joined #yocto
<rburton> perdmann_: you can remove FILESEXTRAPATHS and all your FILES_
<rburton> cmake.bbclass definitely has an install task
<rburton> unless the cmake doesn't have an install action, which is what you mean
<perdmann_> rburton: | ninja: error: unknown target 'install'
<rburton> yeah their cmake doesn't provide an install then
<perdmann_> rburton: yes, so do_install just calls this install section, which i dont have
<rburton> and they didn't set a soname in the library either
<perdmann_> That's why it only builds an .a file?
<rburton> oh if it only builds a .a then that's exactly why there's no soname, just install the .a
<rburton> not using the soinstall as that's for Shared Objects, not archives
<perdmann_> ok, i had the idea that i wanted to link that dynamically
<rburton> fix the cmakelist to build a shared library then
<perdmann_> with a patch?
<rburton> yeah
<rburton> easier, and you get to send it upstream too
<perdmann_> rburton: sounds like a good idea
<perdmann_> i will, i just need to find out how to do that in cmake
<rburton> iirc you just add SHARED in the build library statement
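Roughly what that CMake patch would look like (target and source names are invented); the VERSION/SOVERSION properties are what give the library the SONAME that oe_soinstall was asking for:

```cmake
# build a shared library instead of a static archive
add_library(min SHARED min.c)
# produces libmin.so.1.0 with SONAME libmin.so.1
set_target_properties(min PROPERTIES VERSION 1.0 SOVERSION 1)
```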
<rburton> RP: turns out util-linux ptest wasn't testing most of util-linux
<RP> rburton: Why am I not surprised :(
<rburton> this was a relatively recent change but it should have been spotted in ptest regressions
<perdmann_> rburton: yes. Lets see. :)
<rburton> ooh util-linux can now build with meson
<RP> rburton: I worry we don't handle regression tests correctly :(
<rburton> me too
<rburton> i think the lack of a decent machine readable format doesn't help
<rburton> qa should be able to generate a table of all the tests and their results
<perdmann_> rburton: it works
zyga-mbp has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]
<perdmann_> install still missing... what's your suggestion: a CMake patch adding install, or a do_install task?
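Patching the CMakeLists to add an install target is usually the cleaner option, since `inherit cmake` then provides a working do_install with no recipe code; a sketch assuming a target named `min`:

```cmake
include(GNUInstallDirs)
install(TARGETS min
        LIBRARY DESTINATION ${CMAKE_INSTALL_LIBDIR}
        ARCHIVE DESTINATION ${CMAKE_INSTALL_LIBDIR})
install(FILES min.h DESTINATION ${CMAKE_INSTALL_INCLUDEDIR})
```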
<RP> rburton: we have a machine readable format?
<rburton> well, sort of :)
<RP> rburton: the hard part is finding the one to compare against automatically
<rburton> you do quite often see mangled test names as it got all confused
<RP> rburton: they should at least get mangled consistently
zyga-mbp has joined #yocto
paulg has joined #yocto
zyga-mbp has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]
zyga-mbp has joined #yocto
<RP> rburton: I think https://bugzilla.yoctoproject.org/show_bug.cgi?id=14379 is related too
davidinux has quit [Ping timeout: 268 seconds]
davidinux has joined #yocto
zyga-mbp has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]
<perdmann_> rburton: IT WORKED! thanks a lot.
Schlumpf has quit [Quit: Client closed]
camus1 has joined #yocto
camus has quit [Ping timeout: 272 seconds]
camus1 is now known as camus
Falital has joined #yocto
<jonesv[m]> Is it not possible to have recipes point to private git repos? I was hoping that bitbake would just try to use my user ssh key, but it appears it does not 😕
<jonesv[m]> Or maybe it does not work with a passphrase-protected key?
<jonesv[m]> <jonesv[m] "Or maybe it does not work with a"> oooh, if I use `ssh-agent` it works. It just does not want to ask for my password apparently
<rburton> yeah, agents work, escaping from several layers of abstraction to ask for a password less so
<rburton> also agents work in automated builds, asking a non-existent user to enter a password doesn't work well
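For reference, a SRC_URI for a private repo fetched over ssh looks something like this (host and path are placeholders); the key just needs to be usable non-interactively, e.g. loaded into ssh-agent before bitbake runs:

```
SRC_URI = "git://git@git.example.com/private/repo.git;protocol=ssh;branch=main"
SRCREV = "<fill-in>"
```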
otavio has quit [Remote host closed the connection]
sakoman has joined #yocto
otavio has joined #yocto
zyga-mbp has joined #yocto
zyga-mbp has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]
tlwoerner has quit [Remote host closed the connection]
tlwoerner has joined #yocto
tnovotny has quit [Quit: Leaving]
sakoman has quit [Remote host closed the connection]
sakoman has joined #yocto
jonah1024 has quit [Quit: Connection closed for inactivity]
jonah1024 has joined #yocto
davidinux has quit [Ping timeout: 268 seconds]
davidinux has joined #yocto
BCMM has joined #yocto
manuel1985 has quit [Quit: Leaving]
<override> morning, can someone link me to some systemd service recipe templates? About to write my first one, so I need something to go off of.
<override> just a template that'll help me figure out what to inherit and all maybe
<override> thanks!
<override> mckoan: thanks!
<override> mckoan: what's a good way to figure out if i've got systemd enabled by default on my image, as opposed to SysV init?
zyga-mbp has joined #yocto
argonautx has quit [Ping timeout: 252 seconds]
<mckoan> override: seen from the Yocto build point of view or from the target system?
argonautx has joined #yocto
<override> build pov, mckoan:
<mckoan> override: if you have DISTRO_FEATURES_append = " systemd"
<override> got it, thanks!
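For completeness, making systemd the actual init manager (rather than merely enabling the feature) usually takes the full set of distro settings, of which mckoan's line is the key one:

```
DISTRO_FEATURES_append = " systemd"
VIRTUAL-RUNTIME_init_manager = "systemd"
DISTRO_FEATURES_BACKFILL_CONSIDERED = "sysvinit"
VIRTUAL-RUNTIME_initscripts = ""
```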
frieder has quit [Remote host closed the connection]
<jonesv[m]> There is something I don't get yet. An image is a group of packages. I can define multiple different images, and build them with `bitbake flavor1-image` or `bitbake flavor2-image`, right? And I define flavor1 in say `meta-flavor1`, and flavor2 in say `meta-flavor2`. So in my bblayers.conf, I have both those layers included. And if they both contain a `bbappend` (say they both create a config file for hostapd), then those two bbappend conflict
<jonesv[m]> with each other.
<jonesv[m]> Is there a way to not have meta-flavor1 look into the meta-flavor2 layer?
<jonesv[m]> My guess is that they should be both in the same layer, say `meta-myproject`, and there I should define two images: `recipes-flavor1` and `recipes-flavor2`. But in my case, I have images that are quite different from each other (i.e. different projects), so they don't feel like they belong to the same layer. However, I don't want to checkout a new poky setup for each project, because then it will take a ton of disk space and I need to rebuild
<jonesv[m]> everything from scratch for each new project 😕
zyga-mbp has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]
frieder has joined #yocto
frieder_ has joined #yocto
frieder has quit [Ping timeout: 265 seconds]
zpfvo has quit [Remote host closed the connection]
frieder_ has quit [Remote host closed the connection]
mckoan is now known as mckoan|away
florian_kc has quit [Quit: Ex-Chat]
* paulg is almost afraid to ask if the autobuilder is ok, or still spitting out random RCU implicated spews...
florian has quit [Ping timeout: 272 seconds]
Vineela has joined #yocto
jonah1024 has quit [Quit: Connection closed for inactivity]
<jonesv[m]> I tried to formalize my question here, if somebody is interested: https://stackoverflow.com/questions/68167244/image-specific-layers
<Tartarus> JPEW: Hey, mingw tangent question. Is SDK_ARCHIVE_TYPE expected to be set in local.conf ? It's not in BB_ENV_EXTRAWHITE_OE under scripts/oe-buildenv-internal
<Tartarus> or is tar.xz really just easy enough to work with in Windows these days it doesn't matter? I haven't shuffled + rebooted for this quick PoC I built yet :)
<JPEW> Tartarus: It should be set in local.conf... not sure if it should be in EXTRAWHITE
<Tartarus> OK, easy enough, thanks.
<JPEW> Last I checked, tar.gz had some troubles on Windows, but TBH we don't use either that *or* zip and have a self-extracting python file.... which I _still_ need to upstream
<override> anyone know what layer oe keeps nginx recipe under?
<Tartarus> JPEW: I was a little surprised there wasn't an installer like Linux, but only a little. Just doing a PoC for a quote for a customer atm anyhow
<RP> paulg: much happier
<JPEW> Tartarus: Ya. I wanted to do a unified replacement for the installer that used python so it would be the same on both MinGW and Linux
<JPEW> But still a work in progress
<RP> paulg: I closed about 12 open bugs on the basis that several were related...
<JPEW> Tartarus: The basic idea is to create a tar.gz file with the SDK contents, then use Python to extract it and do the pre/post processing
<JPEW> Tartarus: There is a pretty interesting trick that Python has where if the archive is a zip file, it will extract the contents to a temporary directory and execute __main__.py from them; this would allow you to efficiently package the SDK tar.gz with the extraction script in a single file
<Tartarus> Neat
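The trick JPEW describes is easy to demo: pass a zip archive containing a `__main__.py` to the interpreter and Python executes it directly, so an SDK payload plus extraction script can ship as one file. A self-contained sketch (the "payload" here is just a print):

```python
# Build a zip with a __main__.py, then run it with the interpreter.
import os
import subprocess
import sys
import tempfile
import zipfile

with tempfile.TemporaryDirectory() as tmp:
    archive = os.path.join(tmp, "installer.zip")
    with zipfile.ZipFile(archive, "w") as zf:
        # A real installer would also bundle the SDK tar.gz here and
        # have __main__.py unpack it; this one just prints a marker.
        zf.writestr("__main__.py", "print('extracting SDK payload...')\n")
    # Python runs __main__.py straight from the archive
    out = subprocess.run([sys.executable, archive],
                         capture_output=True, text=True)

print(out.stdout.strip())
```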
<override> rburton: in a service file for systemd, would something like Requires=nginx.service work, or would I have to use a variable or something for nginx?
<override> the service i'm working with has a lot going on for nginx, so i'm trying to see how that would work
<paulg> RP, well that is good news.
<RP> paulg: yes, its made me a lot happier
Falital has quit [Ping timeout: 272 seconds]
<override> basically when I bring in nginx using a recipe, can I be writing services with stuff like Requires=nginx.service?
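For reference, a unit shipped by one recipe can depend on another recipe's unit by name; no variable is needed. A hypothetical example:

```
[Unit]
Description=My web application
Requires=nginx.service
After=nginx.service

[Service]
ExecStart=/usr/bin/myapp

[Install]
WantedBy=multi-user.target
```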
<paulg> RP, was that 12 just for RCU dain-bramage alone, or also including earlier LTP/cgroup wreckage?
<JPEW> override: Recipes with systemd support will list the services they install in the SYSTEMD_SERVICE variables
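A sketch of the recipe side, using the pre-honister override syntax of this era (unit name and paths are hypothetical):

```
inherit systemd

SRC_URI += "file://myapp.service"

SYSTEMD_SERVICE_${PN} = "myapp.service"
SYSTEMD_AUTO_ENABLE = "enable"

do_install_append() {
    install -d ${D}${systemd_system_unitdir}
    install -m 0644 ${WORKDIR}/myapp.service ${D}${systemd_system_unitdir}
}

RDEPENDS_${PN} += "nginx"
```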
<JPEW> Hmm, I had several jobs timeout when trying to share DL_DIR over NFS. It looks like they all deadlocked trying to flock the .lock file
<JPEW> Anyone else see such a thing?
BCMM has quit [Ping timeout: 258 seconds]
creich has quit [Remote host closed the connection]
camus has quit [Read error: Connection reset by peer]
camus has joined #yocto
ant__ has joined #yocto
xmn has joined #yocto
camus1 has joined #yocto
camus has quit [Ping timeout: 268 seconds]
camus1 is now known as camus
<chrfle> Does anyone know if block devices which are not mounted are automatically synced upon reboot?
Guest15 has quit [Quit: Client closed]
<rburton> if they're not mounted... how will they have pending writes?
<chrfle> rburton: e.g. dd if=my_fancy_firmware of=/dev/sda6
Spooster has joined #yocto
florian has joined #yocto
<marc1> chrfle: systemd will call sync() at shutdown which in turn will flush all fs and block devices caches, see: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/fs/sync.c#n123
<chrfle> marc1: do you know where in systemd that call is made?
<chrfle> systemd-shutdown I presume, will go have a look
yates has joined #yocto
davidinux has quit [Read error: Connection reset by peer]
davidinux has joined #yocto
<smurray> chrfle: one option there is to tell dd to use direct writes
mattofak has quit [Remote host closed the connection]
<marc1> chrfle: look at shutdown.c in SD sources
<chrfle> marc1: yeah, found it in async.c called from shutdown.c, thanks
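The combination marc1 and smurray describe can be shown in miniature: an fsync on the written file makes that data durable, and a system-wide sync (what systemd does at shutdown) flushes everything else. Temp files stand in for the real block device here:

```python
# fsync one file's data, then flush everything else system-wide.
import os
import tempfile

image = b"firmware image contents"
fd, path = tempfile.mkstemp()
try:
    os.write(fd, image)
    os.fsync(fd)   # data for this descriptor is durable before close
finally:
    os.close(fd)
os.sync()          # like sync(1): flush all remaining dirty buffers
written = os.path.getsize(path)
os.unlink(path)
print("flushed", written, "bytes")
```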
BCMM has joined #yocto
Falital has joined #yocto
Falital has quit [Client Quit]
jpuhlman__ has quit [Quit: Leaving]
jpuhlman has joined #yocto
Vineela has quit [Quit: Leaving.]
Vineela has joined #yocto
zyga-mbp has joined #yocto
leonanavi has joined #yocto
leon-anavi has quit [Ping timeout: 272 seconds]
camus has quit [Ping timeout: 265 seconds]
camus has joined #yocto
zyga-mbp has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]
davidinux has quit [Ping timeout: 268 seconds]
<Spooster> I added a kernel cfg fragment... and it looks like the recipe picked it up... and I see that it showed up in the workdir... aside from verifying the behavior by running the kernel... is there another way to verify that the new kernel was built with the options?
<Spooster> my fear is I just copied a random file that ends in .cfg, and it won't do anything
davidinux has joined #yocto
<smurray> Spooster: look at the .config in the kernel build directory under ${WORKDIR}?
<Spooster> I see my .cfg fragment, a file named defconfig.cfg, and a couple others popping up in ./build/tmp/work/raspberrypi4_64-poky-linux/linux-raspberrypi/1_5.4.72+gitAUTOINC+5d52d9eea9_154de7bbd5-r0
<Spooster> but I don't know enough about the meta-raspberrypi recipe to know if "that's it" or if that's the ${workdir}
<smurray> there's a build output directory under there, for linux-raspberrypi, it'll be something like linux-raspberrypi4_64-standard-build, in there will be the final .config
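Two quick checks, roughly (recipe name and config option are examples): grep the final .config, or let the kernel-yocto class audit whether the fragments took effect:

```shell
# from the linux-*-build directory smurray mentions:
grep CONFIG_MY_OPTION .config

# or run the config audit, which warns about options that didn't apply:
bitbake linux-raspberrypi -c kernel_configcheck -f
```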
rob_w has quit [Read error: Connection reset by peer]
<RP> paulg: covered both but ltp was 2-3 of that total
<RP> abelloni: looks like there is an arm ltp hang again
<RP> paulbarker: Looks like a prserver hang: https://autobuilder.yoctoproject.org/typhoon/#/builders/87/builds/2268 :/
<RP> paulbarker: is there any debug we want from that?
<Spooster> +1 smurray tyvm. Found and confirmed what I was hoping to see
goliath has quit [Quit: SIGSEGV]
<paulbarker> RP: Damn. I have no idea where it is hanging now if there's no backtrace at all. May be worth a look at the cookerdaemon log at least
leonanavi has quit [Remote host closed the connection]
leonanavi has joined #yocto
florian has quit [Ping timeout: 272 seconds]
<RP> paulbarker: no traceback, last command completed successfully, last command looked to be "bitbake -R conf/prexport.conf -p"
camus has quit [Ping timeout: 272 seconds]
camus has joined #yocto
<RP> paulbarker: that looks like something bitbake-prserv-tool would run
<paulbarker> RP: Ok, so that bitbake command completed successfully but the corresponding test (likely test_import_export_override_db) never finished
<RP> paulbarker: The cooker log says the command completed, I'm not sure the server exits
<RP> paulbarker: agreed, yes
<RP> $ ps ax | grep prser
<RP> 1441773 ? S 0:00 /bin/sh -c bitbake-prserv-tool export /home/pokybuild/yocto-worker/oe-selftest-ubuntu/build/build-st-507461/export.inc
<paulbarker> So it's probably stuck waiting for the server to shutdown, somewhere where there is no timeout
<RP> paulbarker: it kind of looks like bitbake's main loop thinks something is still active
<paulbarker> RP: Is there any way to figure out which bitbake pid is the prservice server? Maybe run `lsof` with the path to prserv.sqlite3
<RP> sh(1441773)───bash(1441776)───KnottyUI(1443232)───{KnottyUI}(1444017)
<RP> paulbarker: pstree -p 1441773
<RP> paulbarker: what looks bad is that there are a ton of parser worker zombie processes
<RP> 1444977 ? Z 0:03 [Parser-2] <defunct>
<RP> 1444980 ? Z 0:03 [Parser-3] <defunct>
<RP> but 58 of them
<paulbarker> Ouch
<paulbarker> So I'm guessing the main bitbake process is stuck at http://git.yoctoproject.org/cgit/cgit.cgi/poky/tree/bitbake/lib/prserv/serv.py?h=master-next#n365
<RP> paulbarker: I can see 1444905 is the prserv (or it at least has the sqlite open)
<paulbarker> If you kill that pid I'd like to see if the bitbake server shuts down cleanly
<RP> so you want me to kill it?
<paulbarker> Yes just that pid
<RP> paulbarker: now also a zombie
Spooster has quit [Remote host closed the connection]
Spooster has joined #yocto
<paulbarker> Well that's disappointing
florian has joined #yocto
Spooster has quit [Ping timeout: 258 seconds]
<paulbarker> RP: So I started with the assumption that the way cooker spawns hashserv (http://git.yoctoproject.org/cgit/cgit.cgi/poky/tree/bitbake/lib/bb/cooker.py#n389) is well validated
<paulbarker> But that doesn't get exercised on the autobuilder as it uses a separate hashserv daemon
<RP> paulbarker: correct
<paulbarker> That code creates the asyncio loop in the main process then runs it (??) in a subprocess
<paulbarker> I wonder if the next step is to rip that out for prserv so the subprocess is started first then all the prserv work (opening database, initialising asyncio loop, etc) occurs within the subprocess
<RP> paulbarker: this is what we used to do, as it is hard to ensure the subprocesses don't hold the wrong resources. It's not very pythonic though :/
<RP> paulbarker: what is odd is that there is one parser thread that is still "alive" :/
<paulbarker> Hanging code isn't very pythonic either haha
cquast has quit [Ping timeout: 268 seconds]
<RP> paulbarker: I get a lot of complaints about the fact we use old fashioned fork() calls ;-)
<RP> but yes, I like old/simple in many ways for this reason
<paulbarker> What does bother me is that I've never been able to replicate the issue here. I guess it's due to a lower level of parallelism
<RP> the autobuilder does seem to find things at scale that most people don't :/
<paulbarker> I tried to run oe-selftest in parallel but it triggered the OOM killer
<paulbarker> `oe-selftest -j12 ...`, BB_NUMBER_THREADS=12 and PARALLEL_MAKE=-j12.
<paulbarker> Ate through 64GB RAM, 8GB swap and then the kernel started chomping processes
<paulbarker> I saw a load average >700 on this 6 core/12 thread machine
<RP> paulbarker: ok, I installed python3-dbg and we have some backtraces on the processes. Let me try and dump this into an email
Vineela has quit [Ping timeout: 272 seconds]
florian has quit [Ping timeout: 252 seconds]
<RP> paulbarker: I've mailed it over to you. It looks to me like when bitbake forks off the parser worker threads, the worker threads are inheriting the asyncio in progress from the parent :/
<paulbarker> RP: Ah that would definitely break everything!
argonautx has quit [Quit: Leaving]
florian has joined #yocto
<RP> paulbarker: looking more closely I'm wrong about that. It is a parser thread sitting in async client connection code
camus has quit [Read error: Connection reset by peer]
camus1 has joined #yocto
<paulbarker> RP: I'll take a look at those dumps tomorrow
Spooster has joined #yocto
camus1 is now known as camus
<RP> paulbarker: it's sitting in prserv_dump_db() but since we killed the server now, I'm not sure what it would do. It is data at least, happy to have the stack traces
florian has quit [Ping timeout: 258 seconds]
leonanavi has quit [Quit: Leaving]
<RP> abelloni, rburton: with the arm worker ltp bug, I ssh'd in and it was stuck on proc01. I installed strace, attached to the stuck process and it unblocked it and everything started running again
<RP> I lost the log as it scrolled off my terminal buffer :(
<RP> it is reading /proc/kmsg
<jonesv[m]> hmm I thought I could use `${IMAGE_BASENAME}` to enable my bbappend only for a specific image, but that does not seem possible... (details here: https://stackoverflow.com/questions/68167244/image-specific-layers)
<abelloni> yeah, so proc01 is an issue on arm
<abelloni> and it is blocked on a read
camus1 has joined #yocto
camus has quit [Ping timeout: 268 seconds]
camus1 is now known as camus
<jonesv[m]> Would it make sense to add a COMPATIBLE_IMAGE variable, similar to COMPATIBLE_MACHINE or COMPATIBLE_HOST, that would allow me to write bbappends that are ignored on incompatible images?
Vineela has joined #yocto
fullstop_ has joined #yocto
fullstop has quit [Ping timeout: 244 seconds]
fullstop_ is now known as fullstop
abelloni has quit [Ping timeout: 244 seconds]
abelloni has joined #yocto
xantoz has quit [Ping timeout: 244 seconds]
xantoz has joined #yocto