16:30:46 #startmeeting fedora_coreos_meeting 16:30:46 Meeting started Wed Sep 28 16:30:46 2022 UTC. 16:30:46 This meeting is logged and archived in a public location. 16:30:46 The chair is dustymabe. Information about MeetBot at https://fedoraproject.org/wiki/Zodbot#Meeting_Functions. 16:30:46 Useful Commands: #action #agreed #halp #info #idea #link #topic. 16:30:46 The meeting name has been set to 'fedora_coreos_meeting' 16:30:51 #topic roll call 16:30:54 .hi 16:30:55 dustymabe: dustymabe 'Dusty Mabe' 16:31:51 .hello siosm 16:31:52 travier: siosm 'Timothรฉe Ravier' 16:32:08 .hello jasonbrooks 16:32:10 jbrooks: jasonbrooks 'Jason Brooks' 16:32:39 .hi 16:32:41 jmarrero: jmarrero 'Joseph Marrero' 16:32:45 #chair travier jbrooks jmarrero 16:32:45 Current chairs: dustymabe jbrooks jmarrero travier 16:33:50 .hi 16:33:51 ravanelli: ravanelli 'Renata Ravanelli' 16:34:04 .hello2 16:34:05 jlebon: jlebon 'None' 16:35:19 #chair ravanelli jlebon 16:35:19 Current chairs: dustymabe jbrooks jlebon jmarrero ravanelli travier 16:35:40 #topic Action items from last meeting 16:35:48 #info Looks like there were no action items from last meeting! 16:36:01 #topic alias quay.io/fedora/fedora-coreos:stable to :latest in quay 16:36:06 #link https://github.com/coreos/fedora-coreos-tracker/issues/1309 16:37:13 looks like this is a request for a `:latest` tag in our quay repo for the FCOS OSContainer 16:38:20 it'd be cool if you could mark a tag as default in a repo and runtime tools picked up on that 16:38:34 yeah 16:38:49 i like how clean it is that we only have a tag per stream, but to be practical having a :latest tag makes sense 16:38:54 i have some concerns 16:39:31 I think we have a nice delineation today and forcing people to be thoughtful about which stream they are using makes sense IMO 16:40:34 This isn't a normal container image - it is an OS that is versioned/released in a unique way and I think forcing them to know what tag to choose also forces them to have some understanding of what they are doing 16:40:42 I think most people expect to get something when they try to pull an image without a tag, I would expect the project maintainer to give me their default flavor. 16:41:23 jmarrero: but what context are you operating in? 16:42:33 `sudo rpm-ostree rebase --experimental ostree-unverified-registry:quay.io/fedora/fedora-coreos:stable` is nicely explicit and it's clear what "stream" we are following 16:42:55 what would `sudo rpm-ostree rebase --experimental ostree-unverified-registry:quay.io/fedora/fedora-coreos` even mean? how do we derive the stream information from that? 16:43:42 it'd be in the image metadata and commit metadata, but yeah point taken that it's nice to have it in the ref 16:44:01 it'd be the only place the stream would show up in a plain `rpm-ostree status` 16:44:23 .hi 16:44:24 aaradhak: aaradhak 'Aashish Radhakrishnan' 16:44:31 #chair aaradhak 16:44:31 Current chairs: aaradhak dustymabe jbrooks jlebon jmarrero ravanelli travier 16:44:34 I think if someone cares about the stream, version they will go find it out. If we are trying to use fcos a container that we can use in a dockerfile or run it to do quick test I think we should behave like all other containers. 16:45:33 Do we expect the latest -> stable alias to ever change? I don't think it will (unlikely) 16:45:43 to me it's just a shortcut 16:46:04 ehh. I kind of feel like we should behave like other remotes here.. i.e. there is no `latest` analogue for the OSTree repo 16:46:26 hi :) 16:46:33 travier: probably not. but you never know 16:46:34 ๐Ÿ‘‹ 16:46:51 #chair Nemric 16:46:51 Current chairs: Nemric aaradhak dustymabe jbrooks jlebon jmarrero ravanelli travier 16:48:58 i'm not too concerned about users not typing `:stable`, but do think it should be easy to tell that that's what they got. in the rpm-ostree context, we'd have to enhance `status` to surface it better 16:49:53 if someone asks me for FCOS on a USB stick, i'd give them stable. they don't have to know all about our streams upfront. :) 16:50:14 if other places where we push FCOS had a concept of defaults, it'd make sense to default to stable there too 16:50:29 that might be the case already in clouds that have a "family" concept, not sure 16:50:51 in GCP the image family is section off by stream 16:51:01 so there are different image families per stream.. 16:51:13 in all cases today you have to make a stream choice for FCOS up front 16:51:44 I guess the download page is special 16:51:55 in that `stable` is shown to you by default 16:51:58 right, but if we had more access points where defaults were a thing, would we make use of them? 16:52:27 I don't personally think so (bgilbert and lucab usually have strong opinions about stuff like this) 16:52:58 For FCOS I think it's important people know about the streams and what is available to them. It's part of how our community supports itself 16:53:24 a counterpoint is `coreos-installer download` defaults to stable 16:53:40 yes, but the resulting artifact knows that it is following the stable stream 16:53:55 that'd be the case here too, right? 16:54:06 would it? 16:54:14 the stream is in the commit metadata. that's what zincati uses today 16:54:19 it's following a container in a registry with `:latest` 16:54:53 Agree with that, default should be stable, and aother strems to a bit more advanced users ... so dowload page show stable first and other stream are shown to and it's fine 16:55:01 right, but the artifact does know it's stable 16:55:23 i do agree though the lack of visibility in `status` would be something that'd need to be addressed 16:55:58 we could even have rpm-ostree do a thing where it just auto-dereferences and uses :stable 16:56:23 "auto-dereferences" how? 16:56:32 hardcoded and only works for FCOS? 16:57:24 it could look for the canonical tag name in the image metadata and substitute it 16:57:33 not an FCOS-specific thing 16:58:21 but that assumes the tag exists in the container repo 16:58:32 dustymabe: to be fair, i'm mostly playing devil's advocate here. i do see value in having users being more thoughtful upfront. :) 16:58:37 it might not always be the case (especially for people's derived container images) 16:58:59 I guess my thing here is that I see a lot of downsides and not a lot of benefit 16:59:21 i see this being useful to mesh better with existing container tooling 16:59:34 there are a lot of open questions about rpm-ostree displaying the status, possibly making assumptions, and how zincati will work (which we still haven't figured out for CoreOS Layering) and could complicate things there. 16:59:35 we could also defer and re-evaluate as layering actually gains more traction 17:00:39 yes - I think that may be a good idea. I'm not totally opposed to doing this eventually, but I'm concerned about some of the implications, especially early on in the lifecycle of us using containers for delivering FCOS 17:01:07 does anyone want to make a `#proposed` here? 17:01:14 people using it at this stage know what they're getting into so there's less pressure to appeal to the kicking-the-tires crowd 17:01:28 where this might be seen as friction 17:02:13 note: if there was a mechanism where container tooling would pull `quay.io/fedora/fedora-coreos` and get `quay.io/fedora/fedora-coreos:stable` automagically then I'd be for it 17:02:23 i.e. almost like a redirect 17:03:25 .hello2 17:03:26 walters: walters 'Colin Walters' 17:03:36 ๐Ÿ‘‹ walters 17:03:43 #chair walters 17:03:43 Current chairs: Nemric aaradhak dustymabe jbrooks jlebon jmarrero ravanelli travier walters 17:04:10 should we try to come to a conclusion on this or let walters catch up and join the discussion? 17:04:27 only skimming back but one thing I'd say is that if we'd done container-native from the start, I suspect our stable stream would have been *called* `:latest` 17:05:03 what would be the logical distinction between a redirect and "re-tagging"? 17:05:26 I imagine at a technical level it could indeed be a HTTP 304 but to the user, that's the same thing 17:05:38 unless podman/docker printed something when they got a 304 but... 17:06:36 I'm not sure what exactly "re-tagging" is (might need to clarify), but to me a redirect would tell the tooling "you want to use this tag" and the resulting pulled image would be `quay.io/fedora/fedora-coreos:stable`, not `quay.io/fedora/fedora-coreos` or `quay.io/fedora/fedora-coreos:latest` 17:06:43 anyways my 2c is I don't care really strongly about doing this right now, it's fine to just have it documented 17:06:45 I feel like "re-tagging" involves making assumptions 17:07:20 client side 17:07:29 oh, I see. yeah I could imagine something like that being added to the container stack 17:07:50 ๐Ÿ‘ 17:09:20 typing up something now - one moment 17:10:01 #proposed While this would provide benefit to new users who just want to pull FCOS and see what's inside we could see some potential problems with it and would like to defer implementing something like this until our update story for CoreOS Layering (https://github.com/coreos/fedora-coreos-tracker/issues/1263) is figured out. 17:10:17 I'll go into more detail in the issue comment 17:10:24 I'll go into more detail/summary in the issue comment 17:11:16 aside: it would be nice if container registry redirects worked in general.. would make moving repos around easier (think like moving a git repo and old links still work) 17:11:21 ack 17:11:36 sure, +1 to that 17:11:59 +1 17:12:05 ack 17:12:08 (i don't fully see how this intersects with the update story FWIW, but let's leave that for the issue) 17:12:29 ack 17:12:32 #agreed While this would provide benefit to new users who just want to pull FCOS and see what's inside we could see some potential problems with it and would like to defer implementing something like this until our update story for CoreOS Layering (https://github.com/coreos/fedora-coreos-tracker/issues/1263) is figured out. 17:12:51 I don't see where we would have issues but it's not a priority either now so we can wait 17:13:04 jlebon: +1 - maybe it doesn't cause any problems, but maybe it does - we just haven't fleshed out that full story yet 17:13:20 #topic Document /boot requirements and constrains when installing/upgrading kernels 17:13:24 #link https://github.com/coreos/fedora-coreos-tracker/issues/1247 17:14:11 ok - new update on this one.. turns out the zstd change doesn't appear to be a silver bullet for the new ppc64le artifacts we are generating 17:14:28 see https://github.com/coreos/fedora-coreos-tracker/issues/987#issuecomment-1249794907 17:15:02 I'm wondering if we could/should also add priority to https://github.com/ostreedev/ostree/issues/2670 to facilitate making this less of a problem 17:15:23 (added a comment to the bottom of that issue earlier today) 17:16:00 yeah probably not too hard though I would still like to investigate shrinking the initramfs too 17:16:54 aside: do we know why ppc64le has heavier kernel+initrd? 17:17:09 i wonder if we're shipping something in the initrd there that we don't need 17:17:26 I'm +1 for shrinking the initramfs, but definitely prefer options with minor complexity (i.e. finding generic "decrease size of static binary" wins) 17:17:42 jlebon: good question - i'd love for someone to investigate 17:18:19 everyone has the powerpc to do it 17:18:30 ๐Ÿคฃ 17:18:45 heh 17:18:55 but really.. anyone should be able to do this on their laptop (just files, no specific hardware required) 17:19:13 I know walters knows that, just making sure we don't confuse anyone 17:19:55 ok so immediate steps: 17:20:14 1. investigate why ppc64le initramfs is larger than say x86_64 or aarch64 17:20:32 2. maybe someone can start working on https://github.com/ostreedev/ostree/issues/2670 ? 17:20:47 3. we can look at options for reducing the size of the initramfs 17:21:08 "generically reducing the size" 17:21:34 dustymabe: in the current state, can that system you pasted from even update? given that the zstd size seems greater than available 17:21:50 jlebon: :) 17:22:24 We still have the option to make making /boot larger on first boot easier 17:22:30 that was after the last update so I have tried with that exact configuration.. but what I will tell you is that for this system I've been deleting the rollback deployment before updating it 17:22:56 which is why I knew this was a problem to begin with 17:23:16 and blocked sending ppc64le to production streams until it's fixed 17:23:29 this is our ppc64le builder (dogfooding is good) 17:23:43 *haven't tried with that exact configuration" 17:23:44 ouch, ok right. so this is blocking ppc64le then 17:24:26 jlebon: I think after the next update (i.e. the rollback and booted deployment have ZSTD compressed initrd) we might be able to update, but just barely 17:24:37 it's too close for comfort 17:25:07 yeah, was looking at that, but not even sure it could. we'd get 9M, so 110M free, but the kernel+initrd is 112M 17:25:40 right.. i was thinking we'd save more like 20M rather than ~10M with the zstd compression, but that doesn't seem to be the case 17:25:54 travier: indeed 17:26:11 ok, let's signal that in the tracker issue 17:26:30 jlebon: in https://github.com/coreos/fedora-coreos-tracker/issues/1247 ? 17:26:55 dustymabe: yeah. probably should retitle it at this point 17:27:11 +1 - suggestions for new title? 17:27:19 i don't think we were aware this was effectively blocking FCOS ppc64le 17:27:52 yeah, it wasn't really at first, but just happened to be working on ppc64le at the same time and stumbled on it 17:28:51 maybe something like "Boot partition can easily run out of space" ? 17:29:00 once we figure out a mechanism to make the initramfs smaller we should make a test to verify the generated initramfs for each arch is under a certain threshold size, so we get fair warning of future issues 17:29:31 jlebon: what about: "Boot partition can easily run out of space on upgrade"? 17:29:51 SGTM 17:30:07 still hoping long-term we can bump the partition size 17:30:31 i'll try to put a summary in the ticket of this discussion and link to it 17:30:45 #topic tracker: Fedora 37 changes considerations 17:30:50 #link https://github.com/coreos/fedora-coreos-tracker/issues/1222 17:30:55 ok I updated the list this morning 17:30:56 my thoughts are: right now, only ppc64le is impacted. we need to make improvements for all arches but we can make a short term "increase /boot" workaround for ppc64le until we have better options 17:31:18 and it's not like we have a large ppc installation base 17:31:48 there were a few changes that got removed from F37 17:32:00 I need to check, but I recall a bug in ppc64le where it took a lot of time to boot rhcos, with some loop and a bunch of logs it could be related. + it uses petiboot instead of grub for boot things 17:32:01 and then one that was added (late I guess?) 17:32:22 subtopic 228. Minizip Renaming 17:32:45 #info we don't ship minizip in FCOS, so this should be a noop for us 17:32:51 :) 17:33:03 nice :) 17:33:09 anything else Changes related, that we should discuss? 17:33:41 I don't think so 17:33:44 #topic open floor 17:33:44 (we're over time) 17:33:49 sorry we are over time 17:34:26 anyone know if the NM team has "daemonize in initramfs" on their radar? re https://issues.redhat.com/browse/OCPBUGS-1327 17:34:34 #info there is a FCOS 37 test day happening tomorrow! 17:34:40 #link https://lists.fedoraproject.org/archives/list/coreos@lists.fedoraproject.org/message/4J5XDZN6L2UL5U4W5SN7RM5J7U65YAQK/ 17:35:10 walters: NM already runs as a daemon in the initramfs in Fedora and RHEL9 IIUC 17:35:49 oh hmm 17:35:54 `nm-initrd.service` 17:36:05 Nemric: did you have something for open floor? 17:37:09 who all can come and help us with the test day tomorrow? 17:37:13 dustymabe: nice, would have to look at that on RHEL 9 17:37:21 thank you jbrooks for helping to organize 17:37:24 jbrooks++ 17:37:57 we're going to have a video/screen share session tomorrow at 9:30 AM - 11 AM EDT (1:30 PM - 3:00 PM UTC) 17:38:04 ๐Ÿ‘ 17:38:43 should be able to join 17:39:00 jlebon++ 17:39:00 dustymabe: Karma for jlebon changed to 1 (for the current release cycle): https://badges.fedoraproject.org/tags/cookie/any 17:39:07 come one come all :) 17:39:48 I'll force mnguyen_ to join too (via threats of violence) :) 17:40:00 * dustymabe closes the meeting in 30s if inactivity ensues 17:40:02 i'll come 17:40:39 #endmeeting