16:29:39 #startmeeting fedora_coreos_meeting 16:29:39 Meeting started Wed Nov 23 16:29:39 2022 UTC. 16:29:39 This meeting is logged and archived in a public location. 16:29:39 The chair is dustymabe. Information about MeetBot at https://fedoraproject.org/wiki/Zodbot#Meeting_Functions. 16:29:39 Useful Commands: #action #agreed #halp #info #idea #link #topic. 16:29:39 The meeting name has been set to 'fedora_coreos_meeting' 16:29:43 .hello siosm 16:29:44 travier: siosm 'TimothΓ©e Ravier' 16:29:45 #topic roll call 16:29:51 .hi 16:29:52 dustymabe: dustymabe 'Dusty Mabe' 16:30:14 .hi 16:30:15 fifofonix: fifofonix 'Fifo Phonics' 16:30:18 .hi 16:30:18 c4rt0: c4rt0 'Adam Piasecki' 16:30:23 .hi 16:30:24 walters: walters 'Colin Walters' 16:30:41 #chair travier fifofonix c4rt0 walters 16:30:41 Current chairs: c4rt0 dustymabe fifofonix travier walters 16:30:46 .hello mnguyen 16:30:47 mnguyen: mnguyen 'Michael Nguyen' 16:30:56 #chair ravanelli mnguyen 16:30:56 Current chairs: c4rt0 dustymabe fifofonix mnguyen ravanelli travier walters 16:31:02 .hi 16:31:03 lorbus: lorbus 'Christian Glombek' 16:31:20 #chair lorbus 16:31:20 Current chairs: c4rt0 dustymabe fifofonix lorbus mnguyen ravanelli travier walters 16:31:51 .hello2 16:31:52 jlebon: jlebon 'None' 16:32:03 .hello 16:32:03 anthr76[m]: (hello ) -- Alias for "hellomynameis $1". 16:32:14 .hello anthr76 16:32:15 anthr76[m]: anthr76 'Anthony Rabbito' 16:32:18 #chair jlebon anthr76[m] 16:32:18 Current chairs: anthr76[m] c4rt0 dustymabe fifofonix jlebon lorbus mnguyen ravanelli travier walters 16:32:21 welcome all 16:32:34 #topic Action items from last meeting 16:32:47 Looks like there were no action items from last meeting, so we can move on 16:32:55 #topic FOSDEM 2023: Image Based Linux dev room 16:32:59 #link https://github.com/coreos/fedora-coreos-tracker/issues/1348 16:33:05 * dustymabe passes floor to travier 16:33:11 Heya 16:33:19 Hi all 16:33:37 πŸ‘‹ 16:33:41 The UAPI group / image based summit crowd is organizing a dev room at FOSDEM 16:34:09 Feel free to submit a talk about FCOS (or anything else related to the topic) there if you intend to be there 16:34:13 that's it :) 16:34:26 (I likely won't be at FOSDEM this year) 16:35:11 yeah, since devconf moved to the summer it makes going to FOSDEM for non-Europe based harder 16:35:47 .hi 16:35:49 ravanelli: ravanelli 'Renata Ravanelli' 16:35:54 thanks for sharing travier 16:36:04 πŸ‘ 16:36:07 #chair ravanelli 16:36:07 Current chairs: anthr76[m] c4rt0 dustymabe fifofonix jlebon lorbus mnguyen ravanelli travier walters 16:36:24 #topic amd-gpu-firmware no longer being installed between stable to testing 16:36:29 #link https://github.com/coreos/fedora-coreos-tracker/issues/1345 16:36:58 For this one the linux-firmware package broke out the GPU firmware it was previously shipping in the main package into subpackages 16:37:17 in f36 the subpackages were requires, in f37+ they are recommends 16:37:25 so when we moved to f37 we dropped them out 16:37:47 the question is: should we add them back? 16:37:57 I think the answer is probably yes 16:38:02 how many new subpackages fall in this category? 16:38:14 but I think it's worth us discussing how much we think they are used 16:38:48 jlebon: amd-gpu-firmware intel-gpu-firmware nvidia-gpu-firmware 16:39:01 $ rpm -qi amd-gpu-firmware-20221109-144.fc37.noarch | grep Size 16:39:01 Size : 22564513 16:39:47 is that ~ 22MiB ? 16:40:05 $ rpm -qi intel-gpu-firmware-20221109-144.fc37.noarch| grep Size 16:40:05 Size : 8329555 16:40:05 $ rpm -qi nvidia-gpu-firmware-20221109-144.fc37.noarch| grep Size 16:40:05 Size : 1262337 16:40:26 dustymabe: looks like it 16:40:35 22 + 8 +1 16:41:07 definitely not insignificant 16:41:52 I think we can probably skip intel & nvidia 16:42:05 it's unlikely someone will use them right now 16:42:25 travier: can you expand on that? 16:42:34 travier: What is your reasoning? 16:42:52 ehh, I think we probably shouldn't split hairs - probably should do all or nothing 16:43:07 yeah, agreed 16:44:17 this is technically a regression, so i'm inclined to add them back before it hits stable even if we decide later on to remove them with some window 16:44:28 this is for GPU/OpenCL workloads 16:44:39 NVIDIA will require the binary drivers 16:44:57 jlebon: yeah, that's a good point 16:45:00 I don't think there is Intel cards for that although we have the ARC ones now 16:45:24 so I really think we don't need the NVIDIA ones 16:45:35 intel maybe, amd yes 16:46:06 travier: couldn't some users be pulling in the drivers themselves though? 16:46:20 what do you mean? 16:46:24 FWIW I know quite a few folks using iGPUs with Intel way more then AMD in fact. https://github.com/intel/intel-device-plugins-for-kubernetes 16:46:55 The real question (to me) is do all GPUs require direct firmware loading? If not which ones don't 16:47:21 travier: are the nvidia drivers easily available in a way that we could expect some users to layer them? 16:47:57 jlebon: you get them from RPM Fusion and usually you need to freeze the kernel as it moves too fast in Fedora for NVIDIA 16:48:19 Also, IIRC these really aren't drivers more or less the layer before the driver. mesa (a driver for example) can't start if the firmware isn't initialized. 16:48:25 I really don't see users running NVIDIA workloads on nouveau 16:48:34 that would be a complete waste of maney 16:48:36 money* 16:49:20 (but if you want to add all firmware I won't mind, just trying to reduce the size of what we add) 16:49:37 travier: ack ok, so it's possible they do pull it in, but unlikely 16:49:52 yeah even so i think nvidia is working on making the open source side of things better and I'd prefer not to track progress there. I think I'd rather just include it especially since it's the smallest of the 3 16:49:52 yeah, the 1M cost doesn't seem worth being inconsistent 16:50:02 the binary NVIDIA drivers do not use those firmwares files 16:50:11 they use their own 16:50:18 * dustymabe wishes all of the drivers were just open source 16:50:27 soon! 16:50:29 :) 16:50:38 ℒ️ 16:50:49 fifofonix: actually experimenting with nvidia's open kernel for the kernel interface piece presently. :o) 16:50:53 so OK - maybe we should open a new issue to really discuss what we think we should do in the future here 16:51:23 but for now maybe we should do what jlebon suggests and get them back in before we ship the next `stable` 16:51:31 +1 16:52:13 +1 16:52:28 do we have numbers for image size between f36 & f37? 16:52:47 +1 16:53:05 #proposed we will add the amd-gpu-firmware intel-gpu-firmware nvidia-gpu-firmware packages back to FCOS to restore the previous functionality we had and open a new ticket to discuss the merits of these packages and if we ever want to remove them in the future. 16:53:26 travier: we could compare the meta.json size values 16:54:01 1.55G vs 1.57G for the uncompressed qemu qcow2 it looks like 16:54:25 ack to proposal 16:54:33 ack 16:55:49 any other inputs? 16:55:55 vote 16:56:52 * dustymabe will +1 16:57:03 anthr76[m]: nice work noticing this before it gets to stable! 16:57:09 #agreed we will add the amd-gpu-firmware intel-gpu-firmware nvidia-gpu-firmware packages back to FCOS to restore the previous functionality we had and open a new ticket to discuss the merits of these packages and if we ever want to remove them in the future. 16:57:15 yes thank you anthr76[m] 16:57:24 Happy to help and be here :) 16:58:01 we generally do a thorough analysis of added/removed packages (see https://github.com/coreos/fedora-coreos-tracker/issues/1221) but I just didn't have time to do it this cycle 16:58:27 #topic open floor 16:58:37 this one is tricky as it's not removed, it's "not added" :) 16:59:04 travier: right, but rpm-ostree db diff would report on it anyway 16:59:04 as it's a new sub package 16:59:11 so I would have caught it 16:59:17 would it? 16:59:21 I don't think so 16:59:26 absolutely it would 16:59:26 travier: the subpackages are in f36 16:59:39 this ^^ 16:59:40 oh, my bad then 16:59:47 πŸ‘ 16:59:50 travier: sorry, you would have been right otherwise 16:59:57 πŸ‘ 17:00:36 https://fedoraproject.org/wiki/Changes/Linux_Firmware_Minimization 17:00:37 ? 17:00:46 note that soon we'll try to start evaluating the change proposals for f38 17:01:11 travier: interesting.. maybe we overlooked that 17:01:32 https://koji.fedoraproject.org/koji/buildinfo?buildID=2089044 > I can see them here indeed 17:01:39 weird. I don't see it in https://github.com/coreos/fedora-coreos-tracker/issues/1222 17:01:56 I saw the breakout on my desktop, but wasn't aware of the change proposal. So I didn't think much of it on my FCOS nodes until GPUs stop initializing :) 17:02:28 hmm. it's not in the list of 37 changes: https://fedoraproject.org/wiki/Releases/37/ChangeSet#Fedora_Linux_37_Accepted_System-Wide_Changes 17:02:29 https://src.fedoraproject.org/rpms/linux-firmware/blob/rawhide/f/linux-firmware.spec#_23 17:02:42 It's a F37 change :) 17:03:12 travier: isn't there a difference in a proposed change and an accepted change ? 17:03:29 it should be indeed 17:03:32 looks like it was withdrawn based on bcotton's edit here: https://fedoraproject.org/w/index.php?title=Changes/Linux_Firmware_Minimization&diff=649159&oldid=649137 17:03:32 it's really weird 17:04:05 in the spec the change is there and it impacts us as we don't ship recommends 17:04:17 so they did not do the full change, just a part of it 17:05:01 so it's indeed both not a F37 change and a F37 change :D 17:05:02 yeah the change proposal talks about using a `Supplements metadata` to automatically install appropriate firmware based on the hardware of the machine 17:05:09 well, I'll see myself out :) 17:05:34 i imagine that the all of the change wasn't done, but some pieces they thought were harmless were 17:05:44 πŸ‘ 17:05:59 i.e. changing to recommends is pretty harmless for most users, but we're obviously doing something that most users aren't 17:06:28 anyway :) - we're on top of it now 17:06:44 and our "stream" model is working great at giving us clues about what's to come 17:07:05 +1 17:07:25 for example: I would bet that not a lot of people use this firmware given it took til this late in the cycle 17:07:26 dustymabe: weird we didn't notice it when we moved testing-devel to f37 17:07:33 though I guess that could just mean not a lot of people try `next` 17:07:56 it is in the git diff indeed of https://github.com/coreos/fedora-coreos-config/commit/08ace84d0104661970047642dc49c91bc764943e 17:08:10 we didn't move testing devel over to f37 until after `testing` had shipped 17:08:55 https://github.com/coreos/fedora-coreos-config/pull/2083#issuecomment-1312052842 17:09:20 summary: we'll do better next time :) 17:09:42 hopefully we aren't migrating our internal build systems to a whole new model next release too 17:10:29 anything else for open floor? 17:10:37 dustymabe: right, not arguing that we should've caught this for testing or next. it probably came in through bump-lockfiles which we don't inspect really. for testing-devel, we could've had CI that detected this in that PR. 17:11:00 though really even for bump-lockfile, one thing I've wanted to do was defer to PR the result if the pkgset changed 17:11:20 i think there's an issue somewhere for that 17:11:29 yeah that would be nice 17:11:39 I have something about native containers.. 17:11:41 but I think where we really want to catch it is in rawhide 17:12:00 dustymabe: +1 indeed 17:12:13 but I think that would imply using lockfiles for rawhide too, which we could do 17:12:16 anthr76[m]: go for it! 17:12:28 we could compare against the previous rawhide build 17:13:48 I just wanted a sanity check here before I open an issue (I suppose against ignition?)... (full message at ) 17:14:14 * dustymabe will copy the rest of that here so the meeting log will have it 17:14:21 I'm building a derivation like so: https://github.com/anthr76/infra/blob/de2d532938eb168be6672ef035d58a9ccec611ab/armature/prod/fcos-layers/k8s-node/Dockerfile#L16-L19 17:14:26 Basically applying this big butane file. https://github.com/anthr76/infra/blob/main/armature/prod/fcos-layers/k8s-node/config.bu.yaml 17:14:31 Though, when rebasing units that are enabled aren't actually getting enabled. For example: https://github.com/anthr76/infra/blob/de2d532938eb168be6672ef035d58a9ccec611ab/armature/prod/fcos-layers/k8s-node/config.bu.yaml#L159 17:14:35 Sorry :) how does that show on IRC? 17:14:35 I assume the symlink is getting lost somewhere.. 17:14:50 anthr76[m]: matrix posts a link to the full message 17:14:57 Is it more preferred to attend meetings on IRC? 17:15:08 not really, i'm happy either way 17:15:23 anthr76[m]: yeah, i would file an issue 17:15:23 i just wanted the full message to be in the meeting log 17:15:46 might be some interaction between layering and systemd's firstboot preset logic 17:16:20 or an issue with ignition-liveapply 17:17:43 ack'd will do. It's simple to reproduce in a vm, and you can probably even reproduce it in a container with enough fiddling 17:18:34 any other topics for open floor? 17:18:46 short-term hack, you can `systemctl enable` the units directly in your Dontainerfile 17:18:51 whoops 17:18:54 Dockerfile* 17:19:12 had started to type Containerfile and then saw it was a Dockerfile 17:19:51 I do believe I tried that at some point and it was still no good. I like to try to actually make the symlink to multi-user.target 17:19:55 and thus Dontainerfile was born 🌟 17:20:18 I'm all for de-dockering everything but it's really hard :P 17:20:40 any other topics here? 17:20:48 I gave up on Containerfile when I had to hack renovatebot around it all the time :/ and instructing LSPs and what not 17:20:54 Good on my side 17:21:06 * dustymabe closes out the meeting in 60s 17:21:22 thanks 17:21:23 anthr76[m]: interesting. add that bit to the issue too. `systemctl enable` is used in some of the the coreos-layering-examples 17:22:04 Sure. I will double check before hand and post it up in about a hour 17:22:11 #endmeeting