19:00:14 #startmeeting Fedora Infrastructure Ops Daily Standup Meeting
19:00:14 Meeting started Mon Nov 28 19:00:14 2022 UTC.
19:00:14 This meeting is logged and archived in a public location.
19:00:14 The chair is nirik. Information about MeetBot at https://fedoraproject.org/wiki/Zodbot#Meeting_Functions.
19:00:14 Useful Commands: #action #agreed #halp #info #idea #link #topic.
19:00:14 The meeting name has been set to 'fedora_infrastructure_ops_daily_standup_meeting'
19:00:15 #chair nirik nb
19:00:15 Current chairs: nb nirik
19:00:15 #meetingname fedora_infrastructure_ops_daily_standup_meeting
19:00:15 The meeting name has been set to 'fedora_infrastructure_ops_daily_standup_meeting'
19:00:15 #info meeting is 30 minutes MAX. At the end of 30, it stops
19:00:15 #info agenda is at https://board.net/p/fedora-infra-daily
19:00:15 #info reminder: speak up if you want to work on a ticket!
19:00:17 #topic Tickets needing review
19:00:19 #info https://pagure.io/fedora-infrastructure/issues?status=Open&priority=1
19:00:22 here
19:00:42 * smooge was going to get some coffee
19:01:43 yeah, I need to do that too... but after meeting I guess.
19:01:48 same here
19:02:18 .ticket 11013
19:02:19 nirik: Issue #11013: Redeploy openQA workers with consistent storage configuration - fedora-infrastructure - Pagure.io - https://pagure.io/fedora-infrastructure/issue/11013
19:02:34 low low ops? or low trouble, med gain ?
19:03:28 from my past dealing with openqa and installing for it: med trouble, med gain ops
19:04:29 if adamw can help work out a kickstart or similar setup it will make it easier.. but the networking and some other things tend to make it a 'try this, ok no reinstall, try this'
19:05:11 and the PPC boxes I would put at high trouble :)
19:05:22 .ticket 11014
19:05:22 ok
19:05:22 zodbot: ping
19:05:22 pong
19:05:22 this is my outage ticket... med/med/ops/outage
19:05:25 nirik: Issue #11014: Planned Outage - Updates / Reboots - 2022-11-30 21:00 UTC - fedora-infrastructure - Pagure.io - https://pagure.io/fedora-infrastructure/issue/11014
19:05:40 +1 on med/med/ops/outage
19:05:57 smooge: I have a kickstart that works now I think... but needs tweaked for whatever storage we decide to finally do
19:06:06 ah ok cool.
19:06:19 I think you or I got lagged nirik
19:06:31 I think it was me... :(
19:06:47 ok, on to releng... there's some unretires. All low/low I think
19:07:53 the unretires low/low.
19:08:16 epel9 branch may be low low also?
19:08:33 I think it's solved now and that can be close.
19:08:35 closed.
19:09:10 ok
19:09:36 .releng 11154
19:09:37 I took a naked ping as an oncall earlier today and asked for 11154 to be opened
19:09:37 nirik: Issue #11154: bodhi haven't sent some packages to stable - releng - Pagure.io - https://pagure.io/releng/issue/11154
19:09:55 * nirik looks
19:10:03 I was guessing it might be something from the bodhi code updates?
19:10:06 ah..
19:10:14 nope. All those are failing gating tests.
19:10:28 The submitter needs to either waive them or fix them.
19:10:40 ah ok
19:11:46 I can close with explain
19:12:53 thanks. the only other oncall thing I had was some spam on lists this weekend
19:13:28 banhammer the addresses. they had been opened over a year ago it seemed but finally got used by whoever
19:13:28 ah spam, so fun
19:13:53 #topic work plans / upcoming tasks
19:13:54 #info everyone should note what things they are hoping to work on over the next day / week
19:14:30 Still working on Splunk work
19:14:58 So, gonna be a busy week. ;) I'm planning on updating staging hosts today, a bunch of non outage causing ones tomorrow and then outage on wed. Around that I want to upgrade some hosts from f36 to f37 (koji and builders for sure). Catch up on PR's and fight down tickets.
19:15:15 aheath1992-mobil: thanks for working on that. will be interesting to see how it looks. ;)
19:15:20 ah yeah that (splunk) came up last week but I didn't want to do any changes while you were out and I was less than capable to volunteer
19:15:48 I think a firewall change is needed and possibly a routing issue but aheath1992-mobil put in a ticket for that
19:16:18 nirik, I got log01 last week on updates so it should be a quick update/reboot for that box
19:16:21 aheath1992-mobil: Any update??
19:16:24 Yep working with IT Networking on the ticket
19:16:39 I also went through and got rkhunter dealt with on all but a couple of boxes
19:16:48 smooge: awesome, thanks.
19:16:48 Okeyy
19:17:39 the openqa-ppc box went into some sort of weird lock when I tried to run rkhunter --propupd but I was able to get the other ones
19:17:55 the others look to need some updates to a config to deal with containers
19:17:58 right before the holiday I also cleaned up a bunch of old composes and stuff on fedora_koji volume. It's down a good deal now (it was heading for the 100T limit)
19:17:59 and that was it
19:18:22 yeah, there's a new .containersomething man page. ;(
19:18:25 nirik, cool
19:18:30 we just need to allow it.
19:18:42 down to 80T now: ntap-iad2-c02-fedora01-nfs01a:/fedora_koji 80T 78T 1.7T 98% /mnt/fedora_koji
19:19:15 I was not sure how much of the space was 'real' or .snapshots
19:19:27 sounds like a lot of real
19:19:41 it's mostly real.
19:20:24 Snapshot Spill 2.57TB 3%
19:20:27 well time for more archiving I guess. F35 will finally be gone
19:20:33 so, 2.5TB are snapshots currently
19:20:38 I saw some ssl cert expiration alerts..
19:20:40 dang that's not a lot
19:21:04 yeah, I want to move composes/ over to another volume, but that's lower pri
19:22:05 saibug[m]: yeah, smooge fixed those. Basically it just needed a playbook run and ansible renewed them via letsencrypt.
19:22:41 We do need to renew getfedora.org soon... it's a digicert cert currently. We could switch it to letsencrypt
19:22:46 yeah I thought the tag httpd/certificates would grab them, but the coreos use a completely different tag for each one
19:22:57 Ohh yes, with -t httpd
19:23:24 smooge: perhaps we should add a common tag to all the certs so we can just get them all if needed?
19:23:34 so for the coreos ones, you need to do -t status.updates.coreos.fedoraproject.org,raw-updates.coreos.fedoraproject.org,status.raw-updates.coreos.fedoraproject.org
19:23:50 much easy...
19:23:54 nirik, I was going to ask because it seemed they had been done different for a reason
19:24:06 and I didn't know the reason :)
19:24:21 :/
19:24:52 I don't know.... I've slept since then? I would say keep those in case you want to target a specific one, but have a higher level tag for all certs... ?
19:24:54 saibug[m], the reason could have been "It's 2am and I need to commit this." or it could be "openshift needs a different thing"
19:25:24 so if there isn't a "openshift needs special care" then I am ok with adding the tag :)
19:25:36 s/openshift/coreos-stuff/
19:25:46 and can do that shortly
19:25:55 or have someone else who is wanting an easyfix
19:26:03 It makes sense
19:27:12 ok, anything else to discuss? or shall we call it a standup?
19:27:40 nope I have said more than my share =)
19:28:07 I'm good
19:28:44 ok, thanks for coming everyone!
19:28:48 #endmeeting