16:30:35 #startmeeting fedora_coreos_meeting 16:30:35 Meeting started Wed Oct 14 16:30:35 2020 UTC. 16:30:35 This meeting is logged and archived in a public location. 16:30:35 The chair is dustymabe. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:30:35 Useful Commands: #action #agreed #halp #info #idea #link #topic. 16:30:35 The meeting name has been set to 'fedora_coreos_meeting' 16:30:40 #topic roll call 16:30:43 .hello2 16:30:44 dustymabe: dustymabe 'Dusty Mabe' 16:30:48 .hello2 16:30:49 .hello2 16:30:49 cyberpear: cyberpear 'James Cassell' 16:30:49 .hello2 16:30:52 lucab: lucab 'Luca Bruno' 16:30:55 aoei: aoei 'Joanna Doyle' 16:30:55 .hello redbeard 16:30:58 red_beard: redbeard 'Brian 'redbeard' Harrington' 16:31:02 .hello2 16:31:04 bgilbert: bgilbert 'Benjamin Gilbert' 16:31:28 .hello2 16:31:31 lorbus: lorbus 'Christian Glombek' 16:31:38 .hello2 16:31:39 nasirhm: nasirhm 'Nasir Hussain' 16:32:23 #chair cyberpear lucab aoei red_beard bgilbert lorbus nasirhm 16:32:23 Current chairs: aoei bgilbert cyberpear dustymabe lorbus lucab nasirhm red_beard 16:32:38 .hello2 16:32:39 jlebon: jlebon 'None' 16:32:39 .hello sohank2602 16:32:42 skunkerk: sohank2602 'Sohan Kunkerkar' 16:32:54 #chair jlebon skunkerk 16:32:54 Current chairs: aoei bgilbert cyberpear dustymabe jlebon lorbus lucab nasirhm red_beard skunkerk 16:33:02 nice crowd today :) 16:33:13 #topic #topic Action items from last meeting 16:33:16 #undo 16:33:16 Removing item from minutes: 16:33:23 #topic Action items from last meeting 16:33:38 we had one re-action from last meeting 16:33:40 * jlebon to reach out to the rpm maintainers to see if the relocation of 16:33:42 the rpmdb path is something they're willing to own for F34 16:34:36 oh shame 16:35:31 how about i do this right now during the meeting :) 16:35:58 WFM :) 16:36:09 ok moving on to topics 16:36:24 #topic daylight savings time changes and meeting time 16:36:57 For some countries daylight savings time is changing over the coming month 16:37:00 Hmm, What'll be the next meeting time ? 16:37:45 In the past we have stayed on UTC time, so it affects all locals equally. You just translate to your local time from UTC and it's up to you to handle the change (the fedora calendar also helps) 16:37:47 (In UTC) 16:37:49 is the meeting not sticking to UTC? or is this just informational? a few cycles ago i made sure the event in my calendar was UTC. 16:38:08 red_beard: I propose we stick with UTC 16:38:15 seconded 16:38:22 thirded 16:38:23 time is a constant. humans muck it up. 16:38:42 +1 16:38:51 +1 16:39:13 #action dustymabe to send an email to the coreos list (not status) with a PSA about the meeting time sticking with UTC 16:39:22 sound good? 16:39:36 +1 :) 16:39:47 ok moving on 16:39:48 +1 (well, really -1h for me) 16:40:26 #topic tracker: Fedora 33 rebase work 16:40:43 #link https://github.com/coreos/fedora-coreos-tracker/issues/609 16:40:46 just a few updates here 16:41:14 for the systemd-resolved migration of existing systems that upgrade we have a proposal 16:41:26 outlined here: https://github.com/coreos/fedora-coreos-tracker/issues/646#issuecomment-707984576 16:42:08 Thanks to a user identifying a potential issue we know some users will be affected, but we're planning a coreos-status post to detail the steps that need to be taken 16:42:34 the vast majority of users will be able to migrated automatically 16:42:41 does our fcct sugar not handle masking units? 16:43:07 cyberpear: see the last three comments in the thread :) 16:44:15 dustymabe: one last question about this i had: should we just make this dumber and match what the fedora scriptlet is doing? 16:44:33 i.e. only care about `systemctl is-enabled` and not look at the contents of `/etc/resolv.conf` at all 16:44:57 the only thing I worry about there is if somehow the user was managing their own resolv.conf 16:45:21 jlebon: but I wouldn't be opposed if we think that's the right thing to do 16:46:38 * dustymabe was trying to be careful 16:46:58 yeah, understood. i just like the elegance of having a super simple contract for this, and also matching the rest of Fedora 16:47:21 any others want to argue one way or the other? 16:47:22 if users want to manage their own resolv.conf they probably really want to disable resolved too 16:47:46 so being more strict like that will flush those use cases out, if any 16:47:55 jlebon: right, but.. disabling resolved does require manual intervention at this point 16:48:23 so in the past if we could auto migrate 90% maybe it drops down to 80% 16:49:09 the only part of the change I don't like is replacing /etc/resolv.conf with a pointer to 127.0.0.53 by default. and especially for problematic for upgrades 16:49:43 (I think the recent devel list thread was 200 messages) 16:49:53 I think we should follow the rest of Fedora, but still try not to break our users 16:50:17 cyberpear: i think that's what we're trying to do here, right? 16:50:21 follow fedora, not break users 16:50:26 yep! 16:50:40 dustymabe: i'm good with the current strategy i think, just wanted to explore this a bit more 16:50:53 basically /etc/resolv.conf gets pointed to a file managed by systemd-resolved 16:51:13 though cyberpear does bring up a good point we should try to consider 16:51:21 which is, rollback 16:51:45 jlebon: that change to /etc/resolv.conf would or would not get rolled back? 16:52:10 yup, /etc is bound to the deployment 16:52:30 ok i'll run through that test just to make sure when we get this implemented 16:52:54 the only other update I had for f33 is that we tracked one test failure down to a SELinux bug 16:53:13 https://github.com/coreos/fedora-coreos-tracker/issues/643#issuecomment-708417208 16:53:30 and that `next` is now based on F33 so please try to use it and give some feedback on what's broken 16:53:55 ok next topic 16:54:02 #topic F33 feature/change proposal SwapOnZRAM by default 16:54:12 #link https://github.com/coreos/fedora-coreos-tracker/issues/509 16:54:27 now that we have fedora 33 out we can consider the swaponzram change 16:55:07 I think the running proposal is that we don't enable swaponzram by default because of requires of kubernetes and other clustered offerings on top that don't work by default with it enabled 16:55:25 so I can up with two options for adding it but not having it actually on by default 16:55:33 https://github.com/coreos/fedora-coreos-tracker/issues/509#issuecomment-708408426 16:56:00 In one case we include both packages (including the default file) and override with a file in /etc/ that can be easily deleted 16:56:20 In the other case we just include the one package and then the user can create a zram conf file on their own 16:56:38 I think the 2nd option is probably most natural 16:56:38 dustymabe: I would like to ask, does SwapOnZRAM has use cases on Containerized Workloads ? 16:57:06 nasirhm: i would assume it has use cases in any scenario you're running low on RAM 16:57:23 dustymabe: I share the same feeling 16:58:07 dustymabe: Hmm, makes sense, We can add the one package and let the user create a zram conf file and document it in our docs. 16:58:22 just anecdotally - i have a laptop with 8G RAM. it was starting to run sluggish over time and would often lock up a bit with a lot running on it. After upgrading to F33 it's pretty darn smooth and no OOMs 16:58:28 I'd advotate adoping the SwapOnZRAM change in full, provided we don't break OKD, or if it would break them, see if it can be unbroken OKD-side. 16:59:45 cyberpear: yeah, it would require trying to work with the various upstreams (OKD is one, typhoon another, etc..) to make sure it gets properly disabled 17:00:03 ...or make them run properly with it enabled 17:00:09 but I think for right now maybe we should include, but not default to on, then re-evaluate for f34 switch 17:00:23 cyberpear: yeah there is an open issue against k8s to have it properly handled 17:00:28 dustymabe: +1 17:00:37 https://github.com/kubernetes/kubernetes/issues/53533 17:01:06 dustymabe++ thanks for linking the upstream k8s swap issue! 17:01:12 that issue has been open for a while 17:01:25 so I wouldn't bank on it spontaneously resolving itself :-( 17:01:44 bgilbert: yep. open source is the long game 17:01:47 right 17:01:58 so.. I'll make a proposal 17:03:10 #proposed we'll include the zram-generator package for now, which will allow users to drop down a config file to enable swaponzram. In the future we'll re-evaluate if creating a swaponzram device by default, is the right thing for us to do. 17:03:22 oh also, we'll add docs to show users how to do this. 17:03:34 ^^ will add to the #agreed if the proposal passes. 17:04:16 +1 17:04:17 makes sense to me 17:04:20 +1 17:04:32 SGTM 17:04:39 +1 17:05:25 #agreed We'll include the zram-generator package for now, which will allow users to drop down a config file to enable swaponzram. Additionally we'll add docs to show users how to do this. In the future we'll re-evaluate if creating a swaponzram device by default, is the right thing for us to do. 17:05:30 thanks :) 17:05:47 #topic no cloud agents: gcp 17:05:51 #link https://github.com/coreos/fedora-coreos-tracker/issues/67 17:06:08 welcome back to one of the original issues of all time :) 17:06:47 I have a periodic meeting with Google engineering and they ask about this. 17:07:04 Basically OSLogin support is considered by them to be essential to their platform. 17:07:23 Should we re-visit having this functionality in FCOS for GCP? 17:07:52 If so what do we think would be the best way to implement it? Using Google's code, or our own minimal re-implementation? 17:08:02 "Long term: Replace the platform agents, where possible, with our own minimal implementations. Ship those as part of the OS." https://github.com/coreos/fedora-coreos-tracker/issues/12#issuecomment-408112263 17:08:27 there's a couple pieces 17:08:41 OS Login is an NSS module and a PAM module and I think some glue 17:09:04 it's pretty minimal IIRC. the issue for us is that we'd need to conditionally swap out nsswitch.conf and pam configs 17:09:23 on CL we had a firstboot systemd unit that ran a shell script that did this 17:09:24 ...for a single platform 17:09:31 red_beard: yes, exactly 17:09:32 bgilbert: you mean it's not a static change? 17:10:19 jlebon: on CL we let users disable it 17:10:27 maybe we don't want to do that in FCOS 17:10:28 .hello2 17:10:29 davdunc: davdunc 'David Duncan' 17:10:39 👋 davdunc 17:10:43 :D 17:10:46 but either way, we need to enable the NSS/PAM modules only on GCP 17:10:48 davdunc 👋 17:11:05 so that's either a platform-specific Ignition base config or a platform-conditional unit 17:11:33 but all of the OS Login stuff is notionally separate from the question of whether we run an agent 17:11:38 which IMO we should not 17:11:38 hmm. so that's at least two solutions to the problem and neither sound that bad 17:11:59 as far as i'm concerned, unless google has provided (even a reference implementation) the server side pieces to make this generally useful for others in the future, they can pound sand. 17:12:05 I know there's _some_ integration between OS Login and the agent but I think it's only around enablement of OS Login 17:12:18 platform-conditional unit sounds better to me so you could have a more similar image for each platform 17:12:22 red_beard: that's generally not the standard 17:12:41 e.g. there's no reference implementation of the DigitalOcean metadata server but we still need to ask it for an IP address 17:12:58 boldly go 17:13:29 * bgilbert shrugs 17:13:41 bgilbert: it sounds like (from the previous CL work) we already know what modifications we'd need to make to the system to get this to work? 17:13:47 it's issues like this which continue to drive platform specific behavior. is there a line or is it a free for all? 17:14:02 red_beard: we generally try to hold a line, but it's ad-hoc 17:14:03 from the user side, have we actually had requests to make OS login work? 17:14:20 dustymabe: ajeddeloh did that work, so... not really 17:14:25 * dustymabe naively wishes most clouds were just based on openstack, then everyone can contribute to one platform and it helps them all 17:14:28 (he's no longer on the project) 17:14:54 dustymabe: but we can probably get some guidance from our GCP contacts 17:14:56 bgilbert: right, but hopefully we could glean something from what he did 17:15:22 GCP has since ported their agent to Go and it's possible that there are more/different interdependencies now 17:15:29 got ya 17:15:37 also on CL we shipped the agent in, I think, a rkt fly container 17:15:41 so not the same situation 17:16:13 but in general we think we should be able to get away with a platform dependent systemd unit that modifies a few files 17:16:30 jlebon: not AFAIK, but there's been an understanding that we need to support this eventually to be in the primary list of recommended OSes 17:16:30 and then magic.. oslogin works? 17:16:52 there was at least one issue reported I think about OSLogin 17:17:41 and my searching is failing 17:18:08 I think I found it 17:18:11 https://github.com/coreos/fedora-coreos-tracker/issues/405#issuecomment-596363000 17:18:53 So several users in there. I think at least a few of them were wanting OS Login 17:19:36 heh, if google is pushing for that in the web UI, we'll keep having users hitting that 17:19:46 ok so I think we should probable create a new issue specifically for OS Login and probably close out the gcp no agents ticket 17:20:22 anything else we'd like to discuss here before moving to open floor? 17:20:58 dustymabe: you may have a similar request on the ec2-instance-connect at some point. 17:21:14 I'll help you get you what you need if that becomes a request. 17:21:16 equivalent of OS Login for AWS EC2 ? 17:21:23 similar. 17:21:26 +1 17:21:51 davdunc: WDYT about rebasing EC2 on OpenStack? 17:21:59 🤣 17:22:00 :D 17:22:12 #topic open floor 17:22:18 I'll hold my comments on that one. :P 17:22:18 anyone with anything for open floor today? 17:22:47 as always here's a reminder to add the `meeting` label to issues you'd like to discuss, or comment in the issue if you don't have permissions to add that label 17:22:56 Nothing from my side, I'll be working on docs to add core user info in different Cloud providers pages. 17:23:04 thanks nasirhm! 17:23:10 we'll be glad to have that 17:23:26 dustymabe: Welcome ^^ :) 17:24:03 nasirhm: if you have anything you want to collaborate/review on the AWS side, let me know. 17:24:26 anybody here got some local servers?? Assuming they're not critical, switch over to `next` and let us know what breaks :) 17:24:32 Sure, Thanks davdunc . 17:25:31 #endmeeting