15:00:52 #startmeeting Infrastructure (2020-09-24)
15:00:52 Meeting started Thu Sep 24 15:00:52 2020 UTC.
15:00:52 This meeting is logged and archived in a public location.
15:00:52 The chair is mkonecny. Information about MeetBot at http://wiki.debian.org/MeetBot.
15:00:52 Useful Commands: #action #agreed #halp #info #idea #link #topic.
15:00:52 The meeting name has been set to 'infrastructure_(2020-09-24)'
15:00:52 #meetingname infrastructure
15:00:52 The meeting name has been set to 'infrastructure'
15:00:52 #chair nirik pingou smooge cverna mizdebsk mkonecny abompard siddharthvipul mobrien
15:00:52 #info Agenda is at: https://board.net/p/fedora-infra
15:00:52 #info About our team: https://docs.fedoraproject.org/en-US/cpe/
15:00:52 Current chairs: abompard cverna mizdebsk mkonecny mobrien nirik pingou siddharthvipul smooge
15:00:53 #topic aloha
15:01:05 .hello zlopez
15:01:05 good morning everyone.
15:01:05 mkonecny: zlopez 'Michal Konečný'
15:01:09 hello
15:01:11 Goood morning \o
15:01:21 Hi everyone, I hope you have a good day
15:01:50 .hello2
15:01:51 mobrien: mobrien 'Mark O'Brien'
15:02:18 .hello siddharthvipul1
15:02:19 siddharthvipul: siddharthvipul1 'Vipul Siddharth'
15:02:42 Is everybody eager to see what today's meeting has for us?
15:02:49 you bet!
15:02:55 OHH YEAHH
15:03:25 So let's go to the first topic
15:03:51 Whoever is here regularly already knows what will follow
15:03:54 #topic New folks introductions
15:03:54 #info This is a place where people who are interested in Fedora Infrastructure can introduce themselves
15:03:54 #info Getting Started Guide: https://fedoraproject.org/wiki/Infrastructure/GettingStarted
15:04:17 And it is here, so does anybody new want to say hello?
15:05:38 It looks like nobody new today
15:06:07 Let's move to the next topic
15:06:41 It's on fire and you have fun if you sit on it. What is it?
15:06:59 #topic Next chair
15:07:00 #info magic eight ball says:
15:07:00 #info 2020-09-24 - mkonecny
15:07:00 #info 2020-10-01 - nirik
15:07:00 #info 2020-10-08 - mobrien
15:07:01 #info 2020-10-15 - siddharthvipul
15:07:13 * nirik nods.
15:07:27 As I see it's pretty full for next few weeks
15:07:31 we are quite ahead here.. if we don't get some volunteer who is not a regular for this, I would recommend skipping
15:07:36 (for next chair)
15:08:02 siddharthvipul: Good idea, I was thinking the same
15:08:14 :D
15:08:52 So next topic will be something that will make your brain explode
15:09:06 Some of you may already know what I'm talking about
15:09:17 Let's go for it
15:09:23 #topic announcements and information
15:09:23 #info CPE Sustaining EU-hours team has a Monday through Friday 30 minute meeting going through tickets at 0830 UTC in #centos-meeting
15:09:23 #info CPE Sustaining NA-hours team has a Monday through Friday 30 minute meeting going through tickets at 1800 UTC in #fedora-admin
15:09:23 #info Datacenter move is over, but some items still need to be done: see https://fedoraproject.org/wiki/Infrastructure/2020-post-datacenter-move-known-issues
15:09:25 #info F33 Beta freeze is in effect
15:10:09 Do we want to announce anything else or is your brain already exploded :-D
15:11:02 not much more to announce currently... oh...
15:11:11 Don't be shy, just announce something
15:11:13 #info F33 beta go/no-go is later this morning
15:11:45 #info F33 beta go/no-go at 17:00 UTC
15:12:10 #info smooge is back from PTO and causing problems
15:12:40 not problems, learning opportunities
15:12:58 mobrien - the man of marketing :P
15:13:01 haha
15:13:10 We are looking forward to the problems caused by smooge; last time the datacenter got moved :-)
15:14:12 Forward to the next topic
15:14:36 This one is the scary one, where nobody wants to volunteer
15:14:48 Let's welcome
15:14:49 #topic Oncall
15:14:49 #info https://fedoraproject.org/wiki/Infrastructure/Oncall
15:14:50 #info mobrien is oncall for 2020-09-17 to 2020-09-24
15:14:50 #info nirik is oncall for 2020-09-24 to 2020-10-01
15:14:50 #info ??? is oncall for 2020-10-01 to 2020-10-08
15:14:52 * nirik plays halloween music
15:15:03 .oncalltakeus
15:15:03 nirik: Kneel before zod!
15:15:14 nirik: You are the next victim
15:15:26 And who wants to go after him?
15:15:37 Who is brave enough to take the oncall?
15:15:50 mkonecny, I can
15:16:09 It's all yours
15:16:19 #info siddharthvipul is oncall for 2020-10-01 to 2020-10-08
15:16:52 Let's look at what horrible things the current oncall saw
15:16:54 #info Summary of last week: (from current oncall)
15:17:06 mobrien: The floor is yours
15:17:10 Quiet week this week
15:17:19 just one thing really
15:17:34 ipsilon was giving 500 when trying to log in to AWS
15:17:55 a clearing of the browser cache solved it in this case
15:18:04 Oh, this sounds scary
15:18:05 it is an intermittent problem
15:18:32 I think it's a fas issue actually... it returns bad/incomplete data to ipsilon...
15:18:53 hopefully noggin will solve this then
15:19:20 That's about it from on call
15:19:22 yep
15:20:37 Next in the meeting you can see horrible things, like servers getting hammered or unexpected outages
15:20:44 But now
15:20:46 #topic Monitoring discussion [nirik]
15:20:46 #info https://nagios.fedoraproject.org/nagios
15:20:46 #info Go over existing out items and fix
15:21:24 fun. ;)
15:21:26 let's see.
15:21:51 so, we have some down aarch64 machines still... and some machines we need to get firewall rules changed for
15:22:19 one question/discussion item that came up:
15:22:55 Do we want to monitor staging from our production nagios? or do we want to make a staging one? or do we just want to wait until we redo monitoring and start in staging...
15:23:31 If we are definitely redoing monitoring then it makes sense to start in staging
15:23:33 +1 for experimenting with new monitoring in staging
15:24:11 yeah... just not sure when we are going to be doing it...
15:24:53 anyhow, most of the rest are ones we have seen, just haven't fixed yet.
15:24:58 although there is one...
15:25:08 "Check proxies for oversubscription"
15:25:30 I think this is monitoring fedmsg-relay?
15:25:51 it seems to be alerting on all the new proxies (including some of the iad2 ones)
15:26:05 would be nice to fix that or remove the check.
15:26:13 I think this may be related to the script being written in python2
15:26:26 could be wrong though ..
15:26:34 oh wait... it's not fedmsg
15:26:36 it's haproxy
15:26:49 from proxy01 (where it is green): HAPROXY SUBS OK: 0.56% subscribed. 115 current of 20480 maxconn.
15:27:00 yeah, could be indeed.
15:27:20 mobrien: can you dig into it? I'm happy to help out...
15:27:34 Ya, sure
15:27:50 cool.
15:28:10 I'm happy to move on now, unless there are any alerts or monitoring anyone wants to discuss...
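Editor's note: the "Check proxies for oversubscription" output quoted above (115 current of 20480 maxconn, 0.56% subscribed) can be reproduced with a few lines of Python 3. This is only a minimal sketch of how such a check might compute its result, not the actual script used in Fedora Infrastructure. The function name `subscription_status` and the 80%/90% thresholds are assumptions made for illustration; the exit codes follow the usual Nagios plugin convention (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN).

```python
# Nagios plugin exit codes (standard plugin convention).
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def subscription_status(current, maxconn, warn_pct=80.0, crit_pct=90.0):
    """Return (exit_code, message) for a haproxy connection-subscription check.

    warn_pct and crit_pct are hypothetical thresholds; the real check's
    thresholds are not stated in the meeting log.
    """
    if maxconn <= 0:
        return UNKNOWN, "HAPROXY SUBS UNKNOWN: maxconn must be positive."

    # Python 3 "/" is true division, so small ratios are not truncated to 0.
    pct = 100.0 * current / maxconn

    if pct >= crit_pct:
        code, label = CRITICAL, "CRITICAL"
    elif pct >= warn_pct:
        code, label = WARNING, "WARNING"
    else:
        code, label = OK, "OK"

    message = (
        f"HAPROXY SUBS {label}: {pct:.2f}% subscribed. "
        f"{current} current of {maxconn} maxconn."
    )
    return code, message


if __name__ == "__main__":
    # Values taken from the proxy01 line quoted in the log.
    exit_code, text = subscription_status(115, 20480)
    print(text)
```

With the proxy01 numbers from the log, 100 * 115 / 20480 is about 0.5615, which rounds to the 0.56% shown in the green check.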
15:28:16 I'll have a look and may ping you later this evening for help/advice if needed
15:28:32 advice: burn it with fire
15:28:53 help: a can of kerosene, a box of matches, and some gun cotton
15:29:11 * nirik has come back from PTO spoiling for a fight. ;)
15:29:21 sorry, that should be: smooge has...
15:29:30 that will be you in a couple of weeks
15:29:46 just put if 1==1 everywhere in the script, should be fine
15:30:20 and if it doesn't.. we really need to look at that system
15:30:22 while true { exit 0; };
15:31:10 If there isn't anything else, I would gladly take you on a journey
15:31:11 ok I have nothing more.. my plan for staging monitoring was just to put in a rule for various templates that if a system was in the staging group.. skip it
15:31:34 then the new monitoring can be built around what is in that group
15:32:09 Journey to the depths of the infrastructure, the fearful place of proxies
15:32:21 Let's give the floor to our guides
15:32:48 #topic learning topic
15:32:48 #info proxies [nirik/mobrien] for 2020-09-24
15:32:53 ok I can start here and nirik can fill in the blanks or correct me
15:33:02 sounds good! go for it.
15:33:18 A number of the proxies in AWS (the non-NA ones) are not yet in use. I have them up and configured; they just need some DNS configuration to be put in place. I was hoping to have that done this morning but realised I missed something so got sidetracked. They should be done soon.
15:33:49 so this explanation is assuming that work is done
15:33:57 We have proxies all over the world, a large number in our DCs in North America as well as some in APAC, EU, SA and AFR
15:34:25 When someone tries to hit any of our webapps covered by the proxies they will get directed to a proxy in their region, which is determined by GeoIP, and then will either be served static content from the proxy or redirected
15:35:13 (or just proxied into an application via a vpn connection from the proxy back to our main datacenter)
15:35:23 Incoming traffic is dealt with by apache. Some static web content is synced to the proxies at regular intervals (not sure what the intervals are) from backend servers and served directly; haproxy is used to direct any other requests
15:35:45 as nirik says those redirects are done over vpn
15:36:20 We use varnish to cache content as well
15:37:44 We have this which shows the status of backend services for each of our proxies https://admin.fedoraproject.org/haproxy/proxy01
15:37:59 just change the proxy number in the url to the desired proxy
15:38:40 looks like we need to make a notifs-web02 and a mbs-frontend02 ?
15:39:01 smooge: we could yeah.
15:39:12 or take them out of haproxy?
15:39:22 sorry.. I meant that for #fedora-noc
15:39:28 mobrien, my apologies
15:39:33 no prob
15:39:43 note on the haproxy page: we get complaints from time to time that we have an open http port there (not https). But I am not sure what we want to do about it or care...
15:40:33 I was wondering, is all traffic between the backend servers and the proxies https?
15:40:58 If it's over vpn it may not need to be
15:41:05 not in all cases, nope. It's http over vpn
15:41:38 ok so I guess the proxies work as a TLS termination point for a lot of our backend servers too then?
15:42:03 most of the time we terminate ssl on the proxies. Sometimes we do have https from proxy -> backend... like openshift apps do that
15:42:10 yep
15:42:38 ok cool, was wondering that.
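Editor's note: the request flow described above (GeoIP picks a regional proxy, which either serves synced static content itself or passes the request to a backend, usually as plain http over the VPN and occasionally as https) can be summarised in a small Python model. This is a toy sketch for illustration only and not how apache, haproxy or varnish are actually configured. Every hostname, path prefix and function name below is hypothetical; only the region names and the static/proxied split come from the discussion.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Backend:
    name: str
    tls_to_backend: bool  # False: plain http over the VPN, TLS ends at the proxy


# Hypothetical region -> proxy hostnames; real hostnames are not in the log.
REGION_PROXIES = {
    "NA": ["proxy-na-1.example.org", "proxy-na-2.example.org"],
    "EU": ["proxy-eu-1.example.org"],
    "APAC": ["proxy-apac-1.example.org"],
    "SA": ["proxy-sa-1.example.org"],
    "AFR": ["proxy-afr-1.example.org"],
}

# Hypothetical path layout: static content is synced to the proxy itself.
STATIC_PREFIXES = ("/static/",)

# Hypothetical backends reached through the proxy.
BACKENDS = {
    "/app/": Backend("app-backend", tls_to_backend=False),
    "/openshift-app/": Backend("openshift-app", tls_to_backend=True),
}


def pick_proxies(region):
    """Return candidate proxies for a GeoIP region (falling back to NA is an assumption)."""
    return REGION_PROXIES.get(region, REGION_PROXIES["NA"])


def handle(path):
    """Describe what a proxy does with a request path in this toy model."""
    if path.startswith(STATIC_PREFIXES):
        return "static: served from content synced to the proxy"
    for prefix, backend in BACKENDS.items():
        if path.startswith(prefix):
            scheme = "https" if backend.tls_to_backend else "http"
            return f"proxied: {scheme} to {backend.name}"
    return "not found"


if __name__ == "__main__":
    print(pick_proxies("EU"))
    print(handle("/static/logo.png"))
    print(handle("/app/login"))
    print(handle("/openshift-app/"))
```

The `tls_to_backend` flag captures the point made in the discussion that TLS is usually terminated on the proxy, with https from proxy to backend being the exception.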
I can't think of anything else off the top of my head
15:42:47 Anyone have any questions about it?
15:42:57 #topic Learning topic discussion
15:42:58 oh, one other thing: we also run our mirrorlist servers on the proxies.
15:43:23 it's a rust app. It gets new data every hour from mm-backend01 (pushed from there)
15:43:39 it was a container.. then they made it better
15:44:12 * smooge is really looking for a fight today it would seem
15:45:24 smooge is scaring plenty of people right now
15:45:47 halloween (my favourite holiday) is in 37 days
15:46:10 so I should put the scaring to next month
15:46:19 Let's go to the final topic of today. The one you are looking forward to
15:46:22 anyway.. back to looking at opendkim
15:46:27 we have an even scarier thing in 39 days in the us, but I digress.
15:46:28 #topic Open Floor
15:46:54 There is one thing from siddharthvipul for the Open Floor
15:47:20 #info Call for projects to participate in Hacktoberfest [siddharthvipul]
15:47:54 that just involves marking the ticket/issue with #hacktoberfest? or ?
15:48:28 nirik, yeah, pretty much! there is also an event that you can organise on October 1 to invite people and tell them more about the org
15:48:47 that pulls contributors as well
15:49:00 nice
15:49:49 one issue I am thinking is that all the projects that we have on github are pretty mature and are not really beginner friendly..
15:49:50 I will go through the issues for Anitya and see if I can mark something
15:49:58 mkonecny, thank you
15:50:16 does it have to be on github?
15:50:51 yeah
15:50:54 that's the issue
15:51:01 ok.
15:51:17 there was a ticket to update mote...
might be in line with this
15:51:44 https://github.com/fedora-infra/mote
15:51:59 nirik, oh yes, that sounds great
15:52:03 .ticket 9317
15:52:04 nirik: Issue #9317: Pull Request review and update deployment for Mote (IRC meeting log web app) - fedora-infrastructure - Pagure.io - https://pagure.io/fedora-infrastructure/issue/9317
15:54:04 so, request is to please tag some issues and if you can volunteer to talk about the Fedora project for 30 minutes on Oct 1, even better :)
15:54:54 * nirik nods
15:57:26 that's all from me
15:57:32 We are near the time that we have for this meeting
15:58:04 I hope everybody had fun this time and I'm looking forward to seeing you in the future
15:58:18 #endmeeting