16:30:03 #startmeeting fedora_coreos_meeting 16:30:03 Meeting started Wed Jun 10 16:30:03 2020 UTC. 16:30:03 This meeting is logged and archived in a public location. 16:30:03 The chair is dustymabe. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:30:03 Useful Commands: #action #agreed #halp #info #idea #link #topic. 16:30:03 The meeting name has been set to 'fedora_coreos_meeting' 16:30:08 #topic roll call 16:30:10 .hello2 16:30:11 dustymabe: dustymabe 'Dusty Mabe' 16:30:39 .hello2 16:30:40 .hello2 16:30:40 lucab: lucab 'Luca Bruno' 16:30:43 slowrie: slowrie 'Stephen Lowrie' 16:30:46 .hello2 16:30:47 miabbott: miabbott 'Micah Abbott' 16:30:51 .hello sohank2602 16:30:55 skunkerk: sohank2602 'Sohan Kunkerkar' 16:31:41 .hello2 16:31:42 jlebon: jlebon 'None' 16:32:44 #chair lucab slowrie miabbott skunkerk jlebon 16:32:44 Current chairs: dustymabe jlebon lucab miabbott skunkerk slowrie 16:32:53 #topic Action items from last meeting 16:33:09 * dustymabe and jlebon (and anyone else who wants to) will review test 16:33:10 cases linked from the various test day pages and try to fixup things 16:33:12 that need fixing 16:33:14 * dustymabe to send coreos-status post with relevant information about 16:33:16 the change to fedora 32 16:33:46 #info we had the FCOS test day and I did send out a coreos-status post with relevant information 16:33:58 though the mail archives don't display html :( 16:34:12 #topic meeting agenda items for today 16:34:36 any large items that we should discuss that need immediate attention? 16:34:47 otherwise we'll go through my short list and then meeting tickets 16:34:57 .hello2 16:34:58 jdoss: jdoss 'Joe Doss' 16:35:02 #chari jdoss 16:35:04 sigh 16:35:07 #chair jdoss 16:35:07 Current chairs: dustymabe jdoss jlebon lucab miabbott skunkerk slowrie 16:35:33 * dustymabe waits 30 seconds 16:35:33 .hello2 16:35:34 cyberpear: cyberpear 'James Cassell' 16:35:39 #chair cyberpear 16:35:39 Current chairs: cyberpear dustymabe jdoss jlebon lucab miabbott skunkerk slowrie 16:36:52 #topic FCOS test day on 06/08 16:36:57 #link https://github.com/coreos/fedora-coreos-tracker/issues/491 16:37:03 so we had a test day! 16:37:27 looks like we had quite a few people contribute and run test cases 16:37:31 🎉 16:37:34 #link http://testdays.fedorainfracloud.org/events/84 16:37:43 yeah, i was pleasantly surprised by the turnout 16:38:05 looks like we might have some gaps on the cloud platforms 16:38:19 yeah, was going to say. no love for DO 16:38:21 i'll probably try to run through the digitalocean test case sometime soon 16:38:33 i think jdoss said he would run an AWS test :) 16:38:49 It's on my list! 16:39:02 I need to go through and look at the feedback in the test day app to see what is actionable 16:39:19 I know we got some feedback in the IRC channel about documentation improvements 16:39:30 some stuff around static partitioning 16:39:46 - need warning/docs that reconfiguring root storage doesn't work 16:40:09 docs are not updating since last week btw, known issue on fedora-infra side 16:40:25 A note: I tried to fill out an entry for a test I did with QEMU and the form / site timed out yesterday. I need to go try again today. 16:40:27 - need warning/docs about /home -> /var/home (fcct could spit out warning about toplevel mount points) 16:40:43 jdoss: yeah, probably because of the infra datacenter move 16:40:51 +1 16:40:57 i also filed/investigated some UX issues in zincati and rpm-ostree 16:41:25 yep and I found one with the displayed output from the coreos-ssh-key checker console message helper script 16:41:51 one thing the test day did highlight - we have no docs for openstack 16:41:58 they symlinks have always been a pain point... is a bind-mount not tenable? 16:41:59 would anyone like to volunteer to add some? 16:42:31 I tried to tag the tickets I've seen with the proper label. If I missed some, please retro-tag 16:43:43 #info we need documentation for openstack deployments 16:44:15 #info we had a productive test day with a lot of feedback and a few cosmetic issues found, we're still processing the feedback 16:44:22 anything else for the test day topic 16:44:55 (did we ever make openstack afterburn "just work" with metadata service?) 16:45:06 (sorry, I didn't make it to the test day...) 16:45:26 i think that's still pending 16:45:27 cyberpear: no worries :) - you can still participate if you find something that hasn't been done yet 16:45:40 right we've decided we want to do it, but need the work to be done 16:46:03 https://github.com/coreos/fedora-coreos-tracker/issues/422#issuecomment-607513602 16:46:24 i'm not sure when it will get worked on 16:46:43 anybody want to volunteer to work on ^^ ? 16:47:32 If it's not a complex change then maybe someone who isn't yet familiar with the codebase could try 16:48:00 * dustymabe will move on to the next topic in 30s 16:48:55 I don't currently have an openstack environment to check that. Once I get there, I will tackle that if nobody does first. 16:49:22 #topic F32 rebase tracker for changes discussion 16:49:29 #link https://github.com/coreos/fedora-coreos-tracker/issues/372 16:49:35 we're using this ticket to discuss the rebase to f32 16:49:38 lucab: let's chat after 16:50:00 so our `testing` stream has been on f32 for a week now 16:50:10 I don't think I've seen any major issues, though some medium ones 16:50:44 and those were mentioned in the last update in this issue 16:50:55 so I think for now we're on schedule to ship to stable next week 16:51:22 one change that we are considering making is documented in https://github.com/coreos/fedora-coreos-tracker/issues/525 16:52:47 TL;DR we build FCOS with the fedora modular repos as inputs, but we have those repos disabled by default on the running FCOS systems. We're proposing to just turn off the modular repos for the FCOS builds as well. Doing this will make package layering on top of f32 more reliable and also will hopefully make rebases to f32 less likely to fail. 16:53:12 #info we build FCOS with the fedora modular repos as inputs, but we have those repos disabled by default on the running FCOS systems. We're proposing to just turn off the modular repos for the FCOS builds as well. Doing this will make package layering on top of f32 more reliable and also will hopefully make rebases to f32 less likely to fail. This is documented in 16:53:13 https://github.com/coreos/fedora-coreos-tracker/issues/525 16:54:00 so we'd want to turn off those repos and respin `testing` so that we can make that change for next week's `stable` 16:54:38 that's the current plan, hopefully the Fedora infra DC move will stabilize soon so we can achieve that goal. 16:54:49 any other topics before we move on? 16:56:33 makes sense on the modular, since there's no default modules in F32 (a big win!) 16:57:04 ahh, hmm I might have to do some reading up on that 16:57:11 cyberpear: link? 16:57:44 and also, who is that a "big win" for? /me trying to gain context 16:57:59 big win as in you don't have any headache coming from the modules 16:58:02 (by default) 16:58:07 kk 16:58:08 cyberpear: as in by default no modular packages ship on the media? 16:58:30 sorry, we can chat about this back in #fedora-coreos 16:58:34 jlebon: I believe that is the case, yes, such as for Server media, workstation spin 16:58:36 don't have it right off hand, but it was a FESCo decision, IIRC 16:58:40 yeah I'll move on, but now I need to learn more :) 16:58:42 * cyberpear yields to next topic 16:58:49 #topic Need dnsmasq for podman to create CNI networks 16:58:55 #link https://github.com/coreos/fedora-coreos-tracker/issues/519 16:59:23 .hello2 16:59:24 lorbus: lorbus 'Christian Glombek' 16:59:27 so it sounds like podman needs dnsmasq for some use cases but doesn't have a dep on it 16:59:30 o/ sorry I'm late :) 17:00:03 #chair lorbus 17:00:03 Current chairs: cyberpear dustymabe jdoss jlebon lorbus lucab miabbott skunkerk slowrie 17:00:09 .fire lorbus 17:00:09 adamw fires lorbus 17:00:16 .chair lorbus 17:00:16 lorbus is seated in a chair with a nice view of a placid lake, unsuspecting that another chair is about to be slammed into them. 17:00:22 :P 17:00:43 well, dustymabe didn't respond to an @ from yesterday... 17:00:46 .fire dustymake 17:00:46 adamw fires dustymake 17:00:48 a lake, or a plashet? 17:00:51 close enough! 17:01:08 adamw: darn :( - sorry about that - maybe ping me again and I'll try to respond today 17:01:16 hehe, nbd 17:01:34 so I guess the question is 1. should podman at least recommend dnsmasq and 2. should we include it in FCOS 17:02:13 I agree and disagree with luca's statement in https://github.com/coreos/fedora-coreos-tracker/issues/519#issuecomment-639582552 17:02:16 :) 17:02:49 I agree that we don't users relying on dnsmasq in the host, but I do think that we want user's relying on features of the container runtime (which is in the host) 17:03:28 so if it were strictly so that users could rely on dnsmasq for some application then I would be inclied to say no 17:03:40 well, that's quite a bit of corner cutting on podman side 17:03:42 but if it's for the container runtime, i lean the other way 17:04:54 lucab: want to elaborate? do you think that podman should not be using dnsmasq (i.e. could do it in a different more native way)? 17:05:08 indeed 17:05:37 CNI bridging does not require dnsmasq 17:05:55 mheon's comment says it's for name resolution between containers 17:06:31 they likely wanted to introduce some new feature, but didn't bother actually implementing it in CNI 17:06:54 so they just shell out 17:07:07 so maybe we should reach out to them to get more perspective ? 17:08:33 to reconsider the CNI plugin design, yes 17:09:20 in general i'm not opposed to shelling out to other things that have a specific job to do - we do it all the time, but in this case if it's a small part of dnsmasq they are using and it could easily be implmented in a more appropriate way somewhere else, then I'd like to explore that as an option 17:09:54 well, yes, but those are in the category of "utilies" 17:09:58 *utilities 17:10:11 lucab: vs services ? 17:10:43 otherwise you can shell out to httpd any time you need an HTTP support 17:11:49 at some point I think they'll have to produce a config for dnsmasq, and have some version requirement on it 17:12:07 everyone agreed to discuss it with the podman team and circle back? 17:12:32 WFM 17:12:32 which means this CNI plugin now suddenly wants to be the one deciding the dnsmasq version 17:13:06 +1 17:15:07 #topic F33 feature/change proposal SwapOnZRAM by default 17:15:13 #link https://github.com/coreos/fedora-coreos-tracker/issues/509 17:15:50 so currently we don't configure any swap for our instances (IIRC) 17:16:29 there a few questions this topic brings up 17:16:42 1. should we be configuring swap by default 17:17:06 2. even if we don't configure swap by default, should we use zram if the user sets up swap? 17:18:14 for 1. I think the institutional knowledge in me says that we don't have swap configured because if your system needs more memory then you should get a bigger system or you should scale out your fleet to handle the extra load 17:19:13 anybody else have any historical knowledge for why swap is typically not enabled in cloud images? 17:19:20 it also conflicts with most assumptions in higher-level container orchestration schedulers 17:20:28 is it safe to say that we're probably not going to change and "add swap" to our disk images? 17:21:33 jlebon: slowrie miabbott cyberpear jdoss ^^ 17:21:53 I think so. OKD/k8s will likely not work anymore. Even if you hammer that in place, pods QoS will behave weirdly. 17:21:58 I'd take the Fedora lead and keep zram swap if Fedora makes it the default 17:22:14 cyberpear: right, i think that's point 2. 17:22:18 +1 to not enabling swap 17:22:27 +1 to no swap on cloud instances 17:22:45 cyberpear: so for 2. the question is - if a user enables swap should it use zram by default 17:22:55 which I think is fine 17:23:47 I think yes, keep zram, even if there's other swap, but again, follow Fedora. (The proposed zram swap is higher priority than disk-swap, IIUC) 17:23:52 * cmurf says hi 17:24:13 about upgrades... 17:24:53 i'm confident it's safe but there's a good chance this is for new installations only and we back off on doing it for upgrades 17:25:06 #info since fedora coreos images don't currently deliver any swap partition, it conflicts with how most people run things in cloud environments, and container orchestration platforms assuming no swap, we'll stick with what we're currently doing 17:25:15 cmurf: context ^^ 17:25:31 cmurf: now the next question was related to "what do we do if a user configures swap" 17:26:08 #proposed if a user configures swap via ignition we'll use the fedora defaults (changing in f33 to use SwapOnZRAM by default) 17:26:36 i think that's up to the user no? or are you thinking in the context of sugar in fcct? 17:26:37 (I think swap on cloud can make sense, but maybe zram swap solves that case, so I won't argue now for adding a swap partition to cloud.) 17:26:39 all - does that align with what we discussed above? 17:27:04 what does "user configures swap" actually mean? 17:27:15 jlebon: good question 17:27:36 i assume there is still some part of a hard drive somewhere that is dedicated to swap 17:27:56 either they use ignition to mkswap, or they drop some configs somewhere for swap-on-zram 17:27:57 cmurf: you can set me straight here ^^ 17:28:06 (or both?) 17:28:16 https://github.com/systemd/zram-generator 17:28:48 separate question: do we want to enable SwapOnZRAM early for F32-based FCOS if it's accepted for Fedora 33? 17:29:07 ahh ok so I was slightly mistaken I think 17:29:36 I was thinking swaponzram augmented existing swap parititions and essentially made them more effective 17:29:49 nope 17:29:56 by balancing part of the swaped pages into ram and some in the actual swap partition 17:29:57 no disk based swap at all 17:30:03 memory block device only, using compression 17:30:27 interesting 17:30:27 I think the flow would be 1) write zram conf in Ignition 2) write a formatting + mounting in Ignition 3) let the generator create the device 4) let the unit format the swap 17:30:55 zram-generator being included in your image will not cause swap-on-zram to be setup, it also requires a configuration 17:31:05 so you can choose to opt out entirely; or you could just not ship a configuration 17:31:27 it's basically a "formattable disk device in RAM" which can be used as a target for swap 17:31:27 I see 17:31:34 correct 17:31:39 it's just a ram disk that compresses 17:31:39 so I think it makes sense to include it, with the default configuration, and document having ignition masking the zram config if you don't want it 17:31:48 but we're gonna format it as swap 17:31:57 ok good info 17:32:05 thanks for bringing me up to speed 17:32:29 i wonder if we can configure it such that it's only activated for some small threshold of memory 17:32:36 the overhead is actually less than the upstream doc suggests, if the provisioning is such that memory never gets close to full to trigger paging (swapping) 17:32:37 it would be nice to see how it behaves with 1) mount namespaces 2) non-swap FS because it seems like a interesting alternative to loop0 17:32:50 i.e., if your system has < 6G of ram then we enable it by default, else we don't 17:33:21 #undo 17:33:21 Removing item from minutes: 17:33:34 #undo 17:33:34 Removing item from minutes: INFO by dustymabe at 17:25:06 : since fedora coreos images don't currently deliver any swap partition, it conflicts with how most people run things in cloud environments, and container orchestration platforms assuming no swap, we'll stick with what we're currently doing 17:33:42 there is such an option in the config 17:33:45 we should probably have a look at fixing their TODO note about /usr/lib defaults, because I foresee some fight on the default values 17:33:57 IIUC, the proposed Fedora Change would limit the uncompressed size to 25% of RAM or 4GiB, whichever is smaller -- for a 2:1 compression ratio this would occupy 12% or 2GiB of ram, but it's only allocated as-needed 17:34:09 right now it's none (no limit, i.e. 128G you still get 4G of swaponzram) but you could ship your own config with your own defaults 17:34:33 cyberpear: is there a way to configure it such that zramonswap won't happen at all if you're above a certain threshold of memory? 17:34:46 dustymabe: anaconda does this today, so it's possible 17:34:49 i.e., "I've got 32G of memory, I don't need swap" 17:35:16 very good info 17:35:27 cmurf: which component owns the default value/algorithm/logic in vanilla Fedora? 17:35:30 the "in defense of swap" article makes a very compelling argument for having some swap in all cases 17:35:37 ok I think since we're over time I'll try to summarize this discussion in the ticket and we can circle back on it next time 17:35:55 lucab: not decided and btw i am seriously stupid when it comes to packaging 17:36:10 also on devel@ lennart points out that default configs shouldn't be in /etc 17:36:10 seriously? :) 17:36:15 seriously 17:36:21 haha 17:36:24 like impressive, not in a good way, stupid 17:36:27 #url https://chrisdown.name/2018/01/02/in-defence-of-swap.html 17:36:36 well thanks cmurf for fixing my misconceptions 17:37:34 cmurf: yes, I echo Lennart 17:37:49 right so carving out everyone's needs? i think it can be done because i've seen it elsewhere, i just don't know exactly how it's done 17:37:50 ok i'll move to open floor for now 17:38:07 cmurf: don't worry we'll figure it out if/when we make that move 17:38:16 just knowing it's possible helps us make the decision 17:38:23 #topic open floor 17:38:28 anyone have anything for open floor? 17:40:07 * dustymabe counts down to 0 17:40:55 #endmeeting