16:28:39 #startmeeting fedora_coreos_meeting 16:28:39 Meeting started Wed Apr 26 16:28:39 2023 UTC. 16:28:39 This meeting is logged and archived in a public location. 16:28:39 The chair is dustymabe. Information about MeetBot at https://fedoraproject.org/wiki/Zodbot#Meeting_Functions. 16:28:39 Useful Commands: #action #agreed #halp #info #idea #link #topic. 16:28:39 The meeting name has been set to 'fedora_coreos_meeting' 16:28:55 #topic roll call 16:28:58 .hi 16:28:59 dustymabe: dustymabe 'Dusty Mabe' 16:29:07 .hello c4rt0 16:29:08 c4rt0: c4rt0 'Adam Piasecki' 16:29:10 ./hi 16:29:26 .hi 16:29:27 fifofonix: fifofonix 'Fifo Phonics' 16:29:37 .hi 16:29:38 jmarrero: jmarrero 'Joseph Marrero' 16:29:43 .hello2 16:29:44 jlebon: jlebon 'None' 16:30:15 #chair fifofonix jmarrero jlebon 16:30:16 Current chairs: dustymabe fifofonix jlebon jmarrero 16:30:41 .hi 16:30:42 bgilbert: bgilbert 'Benjamin Gilbert' 16:30:50 hi 16:32:00 #chair rishabhsaini bgilbert 16:32:00 Current chairs: bgilbert dustymabe fifofonix jlebon jmarrero rishabhsaini 16:32:10 will get started here in a minute 16:32:44 .hello siosm 16:32:45 travier: siosm 'Timothée Ravier' 16:32:46 .hello marmijo 16:32:48 marmijo[m]: marmijo 'Michael Armijo' 16:33:56 #topic Action items from last meeting 16:34:03 #chair travier marmijo[m] 16:34:03 Current chairs: bgilbert dustymabe fifofonix jlebon jmarrero marmijo[m] rishabhsaini travier 16:34:15 * bgilbert to write up a comment in the bug 16:34:17 :) 16:34:39 .hi 16:34:42 ravanelli: ravanelli 'Renata Ravanelli' 16:34:43 #info bgilbert made a beautiful comment to explain things in https://github.com/coreos/fedora-coreos-tracker/issues/1474#issuecomment-1517289046 16:34:48 thanks bgilbert 16:34:49 <3 16:34:56 #chair ravanelli 16:34:56 Current chairs: bgilbert dustymabe fifofonix jlebon jmarrero marmijo[m] ravanelli rishabhsaini travier 16:35:13 that was all for action items.. 16:35:18 ❤️ 16:35:51 full disclosure - most of us in the FCOS team are in weeklong meetings this week so I'll only focus on meeting tickets that are time sensitive 16:36:02 +1 16:36:12 of which I think this one is: 16:36:20 #topic CIFS Mounts Failing On Next 38.20230322.1.0 due to SELinux Issue (keyutils) 16:36:29 #link https://github.com/coreos/fedora-coreos-tracker/issues/1447 16:36:46 It's good to see fifofonix here (thank you for opening this issue) 16:36:55 yw 16:37:46 as I understand it this is a prolem that prevents CIFS filesystem shares access (blocked by selinux denials) 16:37:53 s/prolem/problem 16:38:26 but not all CIFS shares, right? https://github.com/coreos/fedora-coreos-tracker/issues/1447#issuecomment-1520500744 16:38:27 yes, but i think it is actually restricted to mounting DFS-based shared 16:38:29 fifofonix: do I understand correctly that it's anyone using CIFS shares on FCOS that would be affected, or does the configuration need to be specific? 16:38:42 ahh, OK 16:38:43 how common is the affected configuration likely to be? 16:38:53 * dustymabe is not a CIFS expert 16:39:54 fifofonix: would you know the answer to ^^ 16:40:01 DFS is a popular MSFT thing that notionally provides HA mounts. 16:40:15 HA == high availability ? 16:40:35 does that require special server-side config, e.g. multiple backend servers? 16:40:44 Yes. No expert here either. Concept is that there is a single name that will resolve to active backend. 16:41:03 aside: /me wonders if we should add a samba test (I think Amazon has a service we could hook into for this) 16:41:26 When we say CIFS we really mean SMB. 16:41:33 fifofonix: correct, me too 16:41:40 dustymabe: +1 16:41:49 I think CIFS predates SMB but you still notionally mount as CIFS. 16:42:09 fifofonix: oh, I thought it was the other way around, but that was just me assuming things 16:42:24 I think one interesting part is how widely is this used, and the second is what are available workarounds. 16:42:37 so a) it's not just SMB but HA SMB b) there's a workaround (disable HA) c) there's currently no proper fix 16:42:56 right. I think c) is the big one that's a concern for me 16:43:04 (I think CIFS is the name for a particular point-in-time snapshot of the protocol that was documented) 16:43:12 In my set up I am able to workaround by specifically addressing the primary share server (obviously no good in event of failover). 16:43:18 if we decided to hold the promotion to stable it would be nice to know how long (i.e. not have it be unbounded) 16:44:03 This is also the same issue right? https://discussion.fedoraproject.org/t/fc38-mount-cifs-mount-smb-mount-smb3-error-126/81271 16:44:21 * dustymabe clicky 16:44:30 bgilbert: for b), is that something the client can decide? 16:44:46 That does look the same. 16:45:01 i.e. is the HA configured client side or server side? 16:45:23 so we got more people hitting this 16:45:32 so several people/groups are starting to hit this 16:45:33 jlebon: that was my understanding from fifofonix's GH comment 16:45:37 yeah 16:45:47 this is broader than fcos too obvs 16:45:59 fifofonix, could you confirm that the workaround is client-side? 16:46:17 jmarrero: I'll try to link that discussion thread to the BZs where we are tracking this already 16:46:45 bgilbert: +1 - that's good information to find out if fifofonix has it 16:46:59 so, i can modify the mounts on all my hosts to point to a primary server rather than a DFS name and then move forward. 16:47:22 so workarounds include: 16:47:34 1. some client configuration change in CIFs configuration 16:47:45 2. set SELinux to permissive 16:47:59 3. hold back updates on your server/fleet 16:48:06 on balance, I'd be inclined not to delay the promotion to stable 16:48:10 i guess 3 isn't a workaround per-se 16:48:13 with or without a coreos-status post 16:48:18 4. it should work to customize the policy too I think 16:48:26 jlebon: i.e. server side? 16:48:31 dustymabe: client-side 16:48:32 with all of the above it requires an alert to the community because action is required. 16:48:35 oh no, you mean selinux policy 16:48:41 right, yeah 16:48:52 could we ship a generic client policy update? 16:49:08 dustymabe: can you expand? 16:49:24 i.e. bake a policy with exceptions into the image we ship 16:49:32 or is it so specific 16:49:41 i.e. to the mountpoints or something 16:49:58 dustymabe: ahhh gotcha. we could modify our policy, yes. 16:50:13 but that's circumventing evaluation by selinux folks 16:50:20 of course, yes 16:50:27 (i was trying to get a .cil file to apply to do this - but don't know how to gen one from the audit2allow output in BZ) 16:50:55 jlebon: but it would be better than say if we were to decide to set selinux to permissive for all (which we wouldn't do) 16:51:15 fifofonix: yeah, i think it'd require building the module in a container maybe and then `semodule -i` it on the hosts 16:51:38 dustymabe: definitely 16:52:05 that is something that would require some investigation as I don't think we've ever done that specifically before 16:52:32 so 4. is "it should work to customize the policy too I think, client side" 16:52:49 and 5. would be for us to ship a customized policy in our images 16:53:15 ok at least we have the options listed out 16:53:51 ideas on path forward? it is interesting to note that so far we've only got one report of this from our users (thanks fifofonix) 16:53:56 fifofonix: how (un)acceptable is option 1 in your opinion? 16:54:09 only 4 people subscribed to https://github.com/coreos/fedora-coreos-tracker/issues/1447 16:54:26 another workaround would be to set the one problematic domain into permissive 16:54:35 but maybe they are subscribed to the issue and just passively watching 16:54:42 not the entire node, just one domain 16:54:55 dustymabe: 4 participating. subscribed count isn't shown AFAIK 16:55:11 bgilbert: ok yeah, there is a distinction there 16:55:26 travier: is that something you can easily at runtime without a policy update? 16:55:31 easily do* 16:55:40 yes, if we had semanage but we don't 16:55:45 i think (i) is acceptable to me but i'm aware of the issue and its extend and have had time to warm to it. 16:55:47 :) 16:55:54 I'm looking into how that works 16:56:17 travier: 1 was client configuration on the CIFS side, not selinux 16:56:26 but, for someone not running next/test ... i think they need some notice so they hold updates / can test changes / etc. 16:56:42 fifofonix: I think we'd need to send a coreos-status email - so hopefully they would see that 16:56:51 and then evaluate their options 16:56:58 dustymabe: it's to replace your 2 option 16:56:59 +1 16:57:18 travier: ahh, so then you are talking about 4. I think 16:57:39 well, we don't have to do it. people impacted by it can do it 16:57:42 it's a workaround 16:57:45 correct 16:58:01 ok thoughts on coming to a conclusion/decision? 16:58:12 #proposed We'll send a coreos-status email with a recommended workaround, and ship F38 to stable as scheduled. 16:58:12 back to 1. when you say client CIFS config I'm interpreting as systemd mount config change on all vms. 16:58:36 when is f38 stable? 16:58:39 fifofonix: right. whatever steps you took to change the config client side to stop having the problem 16:58:40 fifofonix: yeah 16:58:43 fifofonix: next week 16:58:52 F38 is stable already, but the FCOS stable rebase is next week 16:58:57 right, next week was the original scheduled date 16:59:20 ok well deserves 'IMPORTANT' flagging. 16:59:24 though if the SELinux folks get the info they need and produce a fix by then, we could possibly fast-track 16:59:35 +1 to proposed 16:59:40 we'd want to send the email out this week 16:59:49 ack to proposed 16:59:50 bgilbert: agree - if we get a fix soon then we could re-evaluate 16:59:53 yes, as much notice as possible. 17:00:01 should we reflect that in the proposal? 17:00:03 +1 to proposed 17:00:06 fifofonix: are you able to help with a reproducer in https://bugzilla.redhat.com/show_bug.cgi?id=2182643 ? 17:00:06 +1 to proposed 17:00:09 +1 to proposed 17:00:17 +1 17:00:18 (the "might reevaludate if a fix becomes available" part) 17:00:29 yeah, let me edit 17:00:31 jlebon: SGTM 17:00:50 bgilbert: the biggest component of a reproducer seems to be having a DFS share (i'm not aware that any work). 17:01:10 fifofonix: yeah, ideally they could get a step-by-step for that 17:02:10 #proposed We'll send a coreos-status email with a recommended workaround, and ship F38 to stable as scheduled. If a fix becomes available, we'll evaluate it for fast-tracking into stable. 17:02:19 +1 17:02:23 fifofonix: I think you're probably in the best position to help them 17:02:26 aye to proposal 17:02:47 we'll also need fifofonix help with example workaround steps (if he's willing) 17:02:48 bgilbert: unfortunately, i don't know how to create one of those. happy to run more detailed logging on my infra. 17:02:51 if that's actively being worked on, we may want to delay the coreos-status email a bit, but I opted not to put that in the proposed 17:03:10 "actively being worked on" -> the bug? 17:03:14 yeah, the bug 17:03:35 I haven't seen any activity this week. I emailed a few people earlier today, but I think it was already late in the day for them 17:03:43 fifofonix: hmm, okay. I don't think anyone in this meeting knows how to set that up then 17:03:49 since more and more people are seeing it in Fedora I think we might be able to get it prioritized, though 17:03:54 fair 17:04:12 #agreed We'll send a coreos-status email with a recommended workaround, and ship F38 to stable as scheduled. If a fix becomes available, we'll evaluate it for fast-tracking into stable. 17:04:28 should we target a day to try to send that email ^^ 17:04:37 not later than Friday I assume 17:04:43 bgilbert: +1 17:04:44 i.e. maybe give it one more day to see if anything pops up in the BZ and then send it 17:04:49 dustymabe: +1 17:05:00 ok so try to send thursday, latest Friday 17:05:07 not hopeful on BZ. has been super quiet. 17:05:13 yeah 17:05:26 volunteers to write the post? 17:05:40 Ondrej asked for a reproducer last week, but sounds like that's hard to do 17:06:02 yeah, maybe we could find a hosted service that shows the problem (but that would be work itself) 17:06:10 bgilbert: i can try if no one else wants to 17:06:20 dustymabe: +1 17:06:27 dustymabe: i can help you too 17:06:41 #action dustymabe to write coreos-status post about CIFS issue for f38 rebase heading to stable 17:07:18 I'll lean on you fifofonix for some client side workaround steps if you don't mind 17:07:37 #topic open floor 17:07:38 dustymabe: will do my best. 17:07:47 anyone with anything for open floor today? 17:08:02 OR are there any more time sensitive topics that we should discuss? 17:09:34 ok I'll close it out 17:09:37 +1 17:09:40 #endmeeting