19:00:17 #startmeeting Infrastructure (2011-04-14) 19:00:17 Meeting started Thu Apr 14 19:00:17 2011 UTC. The chair is nirik. Information about MeetBot at http://wiki.debian.org/MeetBot. 19:00:17 Useful Commands: #action #agreed #halp #info #idea #link #topic. 19:00:17 #meetingname infrastructure 19:00:17 The meeting name has been set to 'infrastructure' 19:00:17 #topic Robot Roll Call 19:00:17 #chair goozbach smooge skvidal codeblock ricky nirik 19:00:17 Current chairs: codeblock goozbach nirik ricky skvidal smooge 19:00:19 rbergeron: Error: Can't start another meeting, one is in progress. 19:00:21 ohh 19:00:29 * CodeBlock here 19:00:29 humf 19:00:30 oops 19:00:32 * rbergeron always does this to you guys 19:00:43 if anyone pops in, send them to f-m-1 19:00:43 wait, what? 19:00:45 that's one, not L 19:00:46 what 19:00:46 :) 19:00:58 hah! 19:01:00 ok ok 19:01:02 * goozbach here 19:01:04 remember L 19:01:08 :) 19:01:09 * jpattonwx here, lurking 19:01:49 * jsmith lurks 19:02:40 * sijis is around 19:03:00 * crashmag says hi 19:03:20 sorry about that... 19:03:27 somehow I didn't realize they were the same time. ;( 19:03:36 well we have it first 19:03:47 who needs a release anyway 19:04:11 ok, lets gets started I guess. 19:04:20 #topic New Folks Intros 19:04:28 any new folks lurking here this week? 19:04:36 yap :) 19:04:37 want to say hi and such? 19:04:45 * jsmith hasn't said hi in a long time :-p 19:05:04 crashmag: welcome. 19:05:05 I will be able to help people get into infrastructure after this release I believe 19:05:33 I can sponsor some people after the semester ends, too 19:05:43 * skvidal is here 19:05:46 sorry I was late 19:05:52 skvidal: no. unexcusable. 19:05:53 I think I will wait until after semester here too before worrying about sponsorship 19:05:55 crashmag: any thoughts on areas you are interested in? do lurk in #fedora-admin and/or #fedora-noc and ask questions and chime in when you can. 19:06:07 CodeBlock: I know - I shall go flagellate myself 19:06:08 skvidal: just like my bad grammer. inexcusable* 19:06:45 * nirik points new folks to https://fedoraproject.org/wiki/Infrastructure/GettingStarted lots of good info there. 19:07:07 nirik, well I work for an ITSM company. I realise all of this is based upon trust, and that takes time. So I was hoping that theres a way at working on "lower level" tickets to begin with. 19:08:02 crashmag: sure. ;) Do look at our trac instance and https://fedoraproject.org/wiki/Infrastructure_Cleanup_Tasks_2011 19:08:16 there are some tasks that don't need much access at all... 19:08:25 started using linux around 02. my geek factor can be measured by my 3 servers at home. was hoping to help out and learn by doing so :) 19:08:56 excellent. 19:09:27 often lurking in the above channels can get you a feel for things going on, and people are usually happy to explain, etc. 19:09:55 oooh itsm I just closed some tickets on that 19:10:08 ok, shall we touch on Beta tickets? 19:10:18 #topic Fedora 15 Beta tickets. 19:10:58 so beta will be released tuesday. 19:11:05 we have several tickets to make sure that goes well. 19:11:15 .ticket 2704 19:11:16 nirik: #2704 ([Fedora 15 Beta] New Website) - Fedora Infrastructure - Trac - https://fedorahosted.org/fedora-infrastructure/ticket/2704 19:11:45 * nirik sees sijis is answering for websites in the other meeting right now. ;) 19:11:46 * CodeBlock looks at sijis 19:12:39 we can come back to this one. 19:12:45 ok. 19:12:45 .ticket 2707 19:12:46 nirik: #2707 ([Fedora 15 Beta] Permissions on mirrors) - Fedora Infrastructure - Trac - https://fedorahosted.org/fedora-infrastructure/ticket/2707 19:12:59 we need content first, then we can check this. 19:13:16 .ticket 2708 19:13:17 nirik: #2708 ([Fedora 15 Beta] Mirror manager repository redirects) - Fedora Infrastructure - Trac - https://fedorahosted.org/fedora-infrastructure/ticket/2708 19:13:27 will need to check with mdomsch on that one. 19:13:56 sijis: any changes in size or anything for the Beta website? any other notable changes? 19:14:41 nope. the size will increase a little but not significant 19:14:48 about 5-10mb (if that) 19:14:55 its because of the countdown banner 19:15:02 ok, cool. 19:15:31 .ticket 2706 19:15:32 nirik: #2706 ([Fedora 15 Beta] Release Day) - Fedora Infrastructure - Trac - https://fedorahosted.org/fedora-infrastructure/ticket/2706 19:15:43 .ticket 2710 19:15:45 nirik: #2710 ([Fedora 15 Beta] Infra change freeze) - Fedora Infrastructure - Trac - https://fedorahosted.org/fedora-infrastructure/ticket/2710 19:15:45 thats yours as boss 19:16:03 Thats for release day issues, and a reminder that we are in freeze. ;) 19:16:06 smooge: yep. ;) 19:16:12 :P 19:16:23 I think thats all the beta tickets. Anyone have anything else Beta related? 19:16:26 I closed the disk space one. we are way under 1TB at the moment 19:17:21 excellent. 19:17:47 #topic Upcoming items and outages 19:17:52 I will be starting a sync of F13 over to archives soon and a sync of secondary1 over to the netapp also 19:18:12 I have a list in the post on the items I know of that are coming up after this freeze. 19:18:27 smooge: ok. 13 has a bit more life, but not too much. 19:18:48 whats left on secondary1? staging stuff, some video content from old fudcons, etc? 19:18:49 smooge: not sure what that entails, but if you need some help and feel like teaching me, by all means, would love to learn :) 19:19:35 yeah I sync stuff over just to make it easier for when it is EOL 19:19:44 The end of next week we will be having an outage to move db03, nfs01, and pkgs01 (so it will be an outage for building/packagers). 19:20:10 and fas01 19:20:43 oh, I didn't have that one on my list. That was a rhel6 guest on a xen5 host we wanted to move? 19:21:02 correct 19:21:14 it will cut a bunch of issues down 19:21:27 as it silently reboots a lot and sometimes not so silent 19:21:37 it just needs to be built onto virthost03 19:21:38 ok. will add to the list. We will figure out exact timing on them and I will send out an outage notice. 19:22:05 fas01 contains various certs and such so when it is going down it will degrade other items 19:22:39 ok. We will need to determine that. Ideally we can sync it to a new one in advance so the outage window is small. 19:23:03 db03 will take the longest, as it will be a dump/reload on new machine. 19:23:24 indeed 19:23:32 pkgs01 may be the simplest 19:23:47 yeah, it should be just shutdown in one place and bring up in another. ;) 19:24:18 db02 can be done like that too right? 19:24:25 sorry db03 19:24:43 well, no, as we are switching it from rhel5 older db version to rhel6 19:24:57 it's now a rhel5 box with a custom postgresql on it. 19:25:07 it will go to rhel6 with the normal version. 19:25:43 eventually we will move the other db's over to rhel6 too... but not next week 19:26:22 after next week we have puppet upgrade and fpca landing, then after final we are going to upgrade nagios/noc01. ;) 19:26:35 yep 19:26:38 anyone have any other upcoming items they would like to discuss planning on. 19:27:02 * CodeBlock still wants to start playing with moving hosted to EL6, but that is looking like a summer project 19:27:32 yeah, that would be great to get done... 19:27:33 someday. 19:27:51 ok, meeting tickets... 19:27:56 #topic Meeting tickets 19:28:00 https://fedorahosted.org/fedora-infrastructure/query?status=new&status=assigned&status=reopened&group=milestone&keywords=~Meeting&order=priority 19:28:42 a grab bag of things. ;) 19:29:05 #info on cvs01, I am waiting to hear back from the last 2 projects on there. They may move to fedorahosted, or fedorapeople. 19:29:43 Anyone see a meeting ticket they would like to comment on? 19:31:11 what ever happened with moving blogs off of infra? 19:31:31 (i.e. ticket 2591) 19:32:22 it's not really moving forward. ;) I think ricky was going to look into it more when he had time... 19:32:38 it needs someone talking to wordpress.com or the like... and then talking to the blog users. 19:32:41 yeah. it needs a key person to push it forward. 19:32:45 sorry slow type 19:32:46 yeah, i'm not sure what's left on that. 19:33:06 I would say if its not in the ticket.. then everything is left on that :) 19:33:17 * jsmith has some friends at wordpress.com, but they're busy cleaning up from the recent security incident 19:33:19 is it contact wp and see if they will host that stuff for us and contacting users and seeing if they need help moving (or are we just moving stuff for them) 19:33:23 I'd be happy to make some introductions 19:33:33 oh yeah. i read that this morning 19:33:34 jsmith: That would be cool. 19:34:05 yeah, contact them and see what they can do for us... then once we know that, contact all our blog users and tell them about it... 19:34:26 I think it will depend on what they want to do if we can help move them or if users move their own. 19:34:37 and we will need to set a sunset date... 19:34:56 i believe they can export all their content with a single click. 19:35:07 yeah, should be pretty easy. 19:35:37 we also want to explain to them why we are doing this and such... so no one is blindsided or confused. 19:35:56 good point. its noted 19:36:27 so, if anyone would like to drive that forward, please feel free. 19:36:46 #topic Other tickets 19:36:56 Any other tickets folks would like to bring up or note? 19:37:32 nagios upgrade? skvidal was telling last time it didn't work due to some issues 19:37:47 ranjibd: nirik and CodeBlock have the knowledge there 19:37:57 yeah, ran into issues with our irc bot, and backed out the upgrade. 19:38:08 CodeBlock is planning on doing it again after f15 final. 19:38:17 Hopefully it will go smoothly this time. ;) 19:38:39 indeed 19:38:50 puppet upgrade ? can i help any way? 19:38:53 is that why we have zodbot and fedbot? 19:39:45 possibly.. it was looking at what we could break out 19:39:48 sijis: nope. ;) fedbot is one I run locally, mostly for #fedora. It's got some non packaged plugins and such, also it's locked down to not respond to lots of things, which we wouldn't want zodbot to do. 19:40:48 ranjibd: it should be pretty easy. Basically just upgrade the master, upgrade a 5 and 6 client and test... it should go smoothly. 19:41:13 I don't think splitting zodbot is going to help any. 19:41:22 nirik: all the clients have same version ? 19:41:43 splitting zodbot won't help the migration - but it will make the bot in general make a lot more since where it should live 19:42:03 having our meetingbot on our nagios server makes no sense to me 19:42:22 skvidal: yeah... but where should the meeting part go? 19:42:35 nirik: away 19:42:47 nirik: away from our nagios server 19:42:50 it makes me nervous 19:42:59 ranjibd: yeah. 19:43:08 For what it's worth, there's no reason that zodbot has to be on the same machine as noc01 to get nagios notifications 19:43:24 So moving zodbot to value or something is easy 19:43:27 skvidal: ok, we can figure something out. 19:43:37 yeah, value might make some sense. 19:43:59 we do need to serve the meetbot html stuff. 19:44:24 * nirik makes a note. 19:44:40 splitting it won't help the lag on restart unless we split it out by channels... 19:45:02 we may be able to prune the channels it's joining... 19:45:43 Sorry, I missed the beginning due to internet issues - what does splitting zodbot mean? 19:46:00 it would be two bot.. zod and bot :) 19:46:19 ok. i need a break. (lame joke) 19:46:26 there was talk about splitting it into fasbot, nocbot, meetbot 19:46:33 smoogebot 19:46:35 sijis: :) 19:46:51 I suggested splitting it logically 19:46:55 into its functional components 19:47:03 and it's authenticated-components 19:47:09 nocbot has more info thant meetboot 19:47:21 fasbot has different access than meetbot or nocbot 19:47:39 conflating all 3 seemed like a recipe for problems 19:47:42 isn't that just more stuff to oversee than 1 bot? 19:47:57 also - from a services and meta-services standpoint 19:48:18 it does not make sense to have a user-facing service (zodbot) living on a meta-service server (noc01) 19:48:34 just like it wouldn't make sense to run a webserver for the wiki on our dhcp server 19:48:41 * nirik nods. 19:48:51 that way if noc01 goes down 19:48:58 we don't have to worry about it impacting user-facing services 19:49:00 I think moving it is a good idea. Splitting it I am more meh on. 19:49:38 I like the idea of splitting it out, but that's me. 19:49:57 Same - supybot is pretty simple/stable, and it only really touches data that's public anyway 19:50:06 So while I wouldn't mind splitting it out, I don't see it as a huge priority or a huge gain 19:50:20 is everyone on board with moving it OFF of noc01? 19:50:24 yes 19:50:25 yes. 19:50:27 yes 19:50:36 great 19:50:37 let me rephrase, YES 19:50:38 i do agree that it doens't make sense that its on noc1. it should be in value1/2 as ricky mentioned 19:50:45 value## seems like a winner 19:50:52 Yup, only change would be firewall changes to accept supybot-notify + pointing nagios at the new IP 19:51:02 Ideally it should not be on a VPN host so that we get notifications if VPN goes down 19:51:04 so, how about we move it sooner rather than later... and if someone wants to drive splitting it down the road, they can 19:51:30 value already has a webserver for http://meetbot.fedoraproject.org/ 19:51:32 ? 19:51:59 We currently treat value like an app server, so we could proxy that through the proxies like we normally do if we want 19:52:13 Or it'd be fine to poke a hole for it too. 19:52:34 ok. 19:53:01 does someone want to take the task of moving it? I guess I can... 19:53:05 heh, less to worry about for the noc01/noc03-tmp switchover then 19:53:11 nirik: I can if you don't want 19:53:32 CodeBlock: sure, feel free. :) It will require doing it when no meetings are around... 19:53:41 when are we looking to do it? And what kind of an outage notif needs sent out? 19:53:51 bah.. I think we time it for rbergeron's most important meeting of the week 19:54:02 Probably an outage notification to devel-announce + infrastructure 19:54:10 well, we could add it to the pile next week, but thats getting lots of stuff. 19:54:16 And as a college student, I'm sure you'll be around at 3 AM some days, right? :-D 19:54:20 it shouldn't be frozen, so I suppose we could do it most anytime. 19:54:29 oh, that reminds me... 19:54:38 (half-kidding - on one hand, changes at 3 AM aren't that great - on the other hand, zodbot's dead simple) 19:55:40 What would folks think of the following addition to outage notifications: Once you write up one, have one other person review and approve before sending. It's so easy to mess up timezone stuff, or the like. 19:56:02 that would be good 19:56:03 Good idea :-) 19:56:09 ricky: Yeah 3AM is perfectly fine for me ;D 19:56:10 I have a counterproposal 19:56:11 it would make most of my emails though less funny 19:56:13 Doesn't have to be formal or anything, just an "OK" in IRC 19:56:20 how about we make all timezones Eastern Standard 19:56:20 Right? 19:56:29 so we don't have to mess around with silly things like timezones at all 19:56:30 :) 19:56:32 yeah, no red tape, just another pair of eyes. 19:56:33 * skvidal kids 19:56:39 skvidal: fine with me. ;) 19:56:44 ;D 19:57:02 +1 for UCT 19:57:06 anyone object to the above? will add to sop... 19:57:21 +1 for second set of eyes 19:57:52 cool. 19:57:57 #topic Open Floor 19:58:01 Anything for open floor? 19:58:39 3 weeks or so left in my semester then I'll have a lot more time to be around :) 19:58:39 Wow, perfect timing... this has got to be a first! 19:58:50 * CodeBlock assumes same for ricky 19:59:15 Yup, will have a lot more time this year 19:59:36 cool. ;) 19:59:59 ok, if nothing else, will close out the meeting in a minute here. 20:00:49 Thanks for coming everyone! 20:00:52 #endmeeting