17:59:59 #startmeeting Infrastructure (2012-06-07) 17:59:59 Meeting started Thu Jun 7 17:59:59 2012 UTC. The chair is nirik. Information about MeetBot at http://wiki.debian.org/MeetBot. 17:59:59 Useful Commands: #action #agreed #halp #info #idea #link #topic. 18:00:00 #meetingname infrastructure 18:00:00 The meeting name has been set to 'infrastructure' 18:00:00 #topic Is anyone there? 18:00:00 #chair smooge skvidal CodeBlock ricky nirik abadger1999 lmacken dgilmore mdomsch threebean 18:00:00 Current chairs: CodeBlock abadger1999 dgilmore lmacken mdomsch nirik ricky skvidal smooge threebean 18:00:06 yah yah yah 18:00:09 * relrod here 18:00:14 * pingou here 18:00:42 * marcdeop here 18:00:49 * ianweller_ here 18:01:28 ok, lets go ahead and start in... 18:01:28 * fcami__ here 18:01:29 here 18:01:37 * threebean is here 18:01:57 #topic New folks introductions and Apprentice tasks. 18:01:57 If any new folks want to give a quick one line bio or any apprentices 18:01:57 would like to ask general questions, they can do so here. 18:02:00 * rossdylan is here 18:02:16 any new folks like to introduce themselves? or any general apprentice questions or concerns? 18:02:39 I am a new red hat intern working on integrating open badges into fedora infrastructure 18:02:45 Hi everyone 18:02:48 welcome rossdylan 18:02:53 Hi :) 18:02:55 cool on the badges. ;) 18:02:56 welcome rossdylan :) 18:03:12 thanks :D 18:03:51 ok, I guess lets move along then... 18:03:53 #topic two factor auth status 18:04:03 any news on this this week? 18:04:06 nothing really. I tried to get ahold of wolfkit 18:04:14 not much 18:04:20 yeah, I've not seen him on irc of late much 18:04:34 I get the idea we are close to ready to setup a staging/test 18:04:39 we are 18:04:55 which will be very nice. ;) 18:04:57 it's round-tuits and a little bit of code 18:05:01 also - I wanted to ask 18:05:14 has anyone in here used http://code.google.com/p/mod-auth-external/ 18:05:16 and 18:05:20 http://code.google.com/p/pwauth/ 18:05:26 Yeah I haven't talked to wolfkit much recently either, I can try poking him via other means (I have contact with him outside of Fedora) and get him to check in 18:05:28 it's somewhat related 18:05:37 * nirik hasn't 18:05:37 relrod: that'd be great 18:05:48 so the gist of it is 18:05:55 we use mod-auth-external 18:06:01 coupled with pwauth 18:06:20 to do auth to fas2 for things like hosted trac, nagios, etc etc 18:06:32 w/o needing a direct db connection to fas2 18:06:41 yeah, and that gets them to the same level as other places where the fas servers do not need to be up 18:06:53 yah 18:07:14 (or the database as the case may be) 18:07:19 I've read the security warnings about both 18:07:27 and they don't seem more dramatic than anything else 18:07:41 are they packaged up yet? 18:07:43 yes 18:07:44 in epel 18:07:47 all ready to roll 18:08:04 cool. Sounds like a good thing to test out and see if there are any gotchas. 18:08:14 nod 18:08:19 it would, of course, HAVE to be ssl'd 18:08:23 but I think we assume that now anyway 18:08:25 I was looking at mod_auth_openid... but it doesn't solve that problem (still need the fas openid provider to login) 18:08:45 so the only thing I am hazy on is this 18:08:51 pwauth needs to be setuid 18:09:00 so it can make the pam check at all 18:09:06 so the gist of it is 18:09:33 apache -> mod-auth-external -> pwauth -> pam -> talks to local pam auth 18:09:42 we might run this idea by the RH security folks too? 18:09:43 then you can make a fall through to check for a group membership 18:10:04 maybe... 18:10:29 okay - that's all. 18:10:37 ok, sounds good. 18:10:44 if anyone wants to setup those and test them - feel free to let me know 18:10:48 otherwise roundtuits 18:10:55 #info will look at testing mod_auth_external / pwauth for nagios/trac 18:11:33 #topic Applications status / discussion 18:11:46 Any application news this week? 18:11:52 * pingou has not much 18:12:01 oo, oo. me! 18:12:05 https://admin.stg.fedoraproject.org/updates (bodhi) and http://community01.dev.fedoraproject.org/tagger (tagger) are emitting zmq messages now. 18:12:06 I hear that we had actual messages going accross fedmsg bus in staging this week? :) 18:12:19 :) 18:12:23 You can see them by logging onto a staging machine and running: "$ fedmsg-tail --really-pretty" while poking at the apps. 18:12:39 and that means it's time for me to start work on datanommer ;) 18:13:00 I'm splitting up the config into /etc/fedmsg.d/ now (because /etc/fedmsg-config.py is ballooning). 18:13:03 threebean: does that need to be any specific machine? or just anything? 18:13:04 After that, I'll try standing up mediawiki+fedmsg in staging. 18:13:20 nirik: should be anything in puppetEnvironment == 'staging' 18:13:24 cool. 18:13:38 tres cool 18:14:12 speaking of mediawiki... smooge / ianweller_: whats next for mediawiki 119? 18:14:34 * relrod has a few small things - I worked with abadger1999 last night on making a new python-fedora release which adds a method to create FAS groups, which I needed for the fedorahosted app. Also did some testing with him making sure fas01.dev is working fine. 18:14:52 i think since we're still working on mw118, we do that 18:14:53 I need to build it in EPEL. I need to get the plugins reviewed 18:14:57 wait 18:14:59 we are doing 119 18:15:00 that's right 18:15:05 * ianweller_ shuts up 18:15:19 I can help with plugins. 18:15:27 would be nice to get it updated. ;) 18:15:29 right now the plugins are the ones we use in our private idaho 18:16:11 my steps would be then would be update in a .dev. environment to see if it will work cleanly. Then do so in staging and then production 18:16:21 yeah. 18:16:23 ianweller_, did the last "update" so I will need info from him 18:16:30 As far as the fedorahosted app itself, I'm working on getting the various fedorahosted SCMs processable via the CLI. Git requests should almost kindasorta work. Mailing lists should get created via ansible when we move lists to hosted-lists01 (I need to test the ansible bits locally, but it *should* work as-is). 18:16:38 we can probibly just go to stg with it... do it on one of the apps there. 18:16:47 that's fun, because i don't remember doing the last updat e;) 18:17:03 relrod: that's awesome 18:17:14 excellent. 18:17:29 smooge, nirik: if it winds up in app*.stg, let's just communicate so aren't both fighting over mediawiki at the same time. :) 18:17:29 ianweller_: I'm sure we can sort it out. ;) 18:17:32 smooge: are you wanting to work on this stuff today? 18:17:35 smooge: or when 18:17:45 threebean: yeah, agreed. Please coordinate. 18:18:34 GNOME server went tango uniform again.. will be focusing on that for a few minutes 18:18:51 smooge: I hear if you just reseat everything it'll be fine 18:18:52 ;) 18:19:04 In other application news, we moved fas db to it's own db server. This should prevent it from getting hit by load issues on other apps. 18:19:20 w00t 18:19:23 smooge: ;( 18:19:28 ok, any other application news? 18:19:30 we also gave db02.stg more ram 18:19:38 so hopefully it will stop crapping out occasionally 18:19:39 oh yeah. 18:19:51 rossdylan is making rapid progress on badges for week #1. 18:20:00 skvidal: are we overcommited on any virthosts now? 18:20:05 (on mem) 18:20:18 umm lemme look 18:20:35 threebean / rossdylan: cool. Is there any place with a plan/overview of the badges setup? or place were folks could step up to help? 18:20:37 indeed I am, you can now award badges to people (based on email) by listening on the fedmsg bus 18:20:59 nirik: running 18:21:00 current dev is hosted on github.com/rossdylan/fedmsg 18:21:31 nirik: nothing written down :/ 18:21:52 rossdylan: let's definitely do that so infra peeps can review it 18:21:56 also, dunno if it's worth looking at to see if you could pull in, but our askbot instance has badges... 18:22:00 nirik: virthost10.phx2.fedoraproject.org:-479 18:22:07 still a little short there - but I bet we never hit it 18:22:24 http://ask.fedoraproject.org/badges/ 18:22:29 nirik: oh and virthost06.phx2.fedoraproject.org:-814 18:22:29 skvidal: ok, good to know. 18:22:45 hmm 18:22:58 releng01, ask01, bapp02, packages01 18:23:08 releng01 is bvirthost06. ;) 18:23:19 nirik, we were thinking about creating scripts to dump existing badges (or existing activities that need awarding) into the newer open badges system 18:23:44 rossdylan: sounds reasonable. 18:23:53 ok, shall we move on then? 18:23:55 nirik: right - sorry - bad grep 18:24:13 #topic Upcoming Tasks/Items 18:24:27 here's what I have on my list for upcoming: 18:24:29 #info 2012-06-08 OOW: osuosl01.fedoraproject.org 18:24:29 #info 2012-06-11 remove people with pkgdb bugzilla issues. 18:24:29 #info 2012-06-14 23UTC class A and B reboots 18:24:29 #info 2012-06-17 OOW: sign-vault02.phx2.fedoraproject.org 18:24:30 #info 2012-06-21 to 2012-07-04 Kevin is off on trains and boats. 18:24:32 #info 2012-06-26 Fedora 15 end of life. 18:24:34 #info 2012-06-28 Seth at jury duty. 18:24:46 I'm going to ask fesco about the people who have bugzilla email issues... 18:24:53 get permission to just remove them all. 18:25:02 next wed we will be doing mass reboots. 18:25:14 anything else folks would like to schedule or note? 18:26:14 we need to reschedule the lists.fedorahosted move sometime. 18:26:35 does anyone here have any dyndns or round-robin dns mgmt experience? 18:26:58 I was looking for some input on if there is a vastly simpler way to do something we're currently doing in a kludgy, horrible way 18:26:59 :) 18:27:20 * nirik has avoided those setups in the past. ;) 18:27:35 well we're stuck with roundrobin 18:27:38 nirik: I don't have the date handy but -- date that RH is turning of old bugzilla compatibility should probably be noted 18:27:40 I do not know a way around it for our proxies 18:27:47 skvidal: oh, we might ask letoto, he's mr dns. 18:27:59 abadger1999: good idea. I think I saw it somewhere, let me look. 18:27:59 nirik: I thought he only cared about dnssec 18:28:01 skvidal, I do 18:28:52 smooge: so right now the process we use for updating dns for hte proxies is.... 18:28:56 ridiculously complicated 18:28:59 1. edit in 12 locations 18:29:01 2. edit serial 18:29:03 #topic Dns 18:29:05 3. commit to git 18:29:08 4. push 18:29:15 5. force-puppet on ns\* 18:29:17 6. wait 18:29:22 7. do what you want to do 18:29:30 8. do those again to put it all back 18:29:33 wow, that's a lot of steps 18:29:46 sorry 12 is an exaggeration 18:29:52 it's only 6 locations 18:29:56 yeah.. pretty much standard fair for doing the "we want people in X to use this proxy over that proxy" 18:30:06 so here's what I was thinking 18:30:16 listening 18:30:24 we use the proxies just as fedoraproject.org and wildcard.fedoraproject.org 18:31:03 in most cases the sites are CNAMES to wildcard 18:31:21 (admin.fp.o, for example or www) 18:31:36 yes 18:31:36 what I was thinking is that we setup a subdomain 18:31:43 dyn.fedoraproject.org 18:32:06 so we can edit that one domain - for changes to the proxies 18:32:15 and bump only the serial in that domain 18:32:31 the website names, of course, don't change 18:32:35 that's the point of the cnames 18:32:42 so if I go to www.fedoraproject.org 18:32:49 it currently goes to wildcard.fedoraproject.org 18:33:05 which RR dns to one of our proxy ips - dependent on which GEOip you're coming from 18:33:22 all we'd be changing is instead of going to wildcard.fedoraproject.org 18:33:28 it would be wildcard.dyn.fedoraproject.org 18:33:50 and instead of hand-editing the zones in puppet/git and pushing them out 18:34:01 we could script the creation/updating of that zone 18:34:26 just create a dictionary of ips and which regions they should exist in 18:34:32 I think this is worth persuing. 18:34:36 and populate the various region-specific files that way 18:34:47 skvidal, that sounds good. 18:34:47 so I guess what I'm wondering is.. 18:34:52 has someone else already done this? 18:35:02 not only for less editing/stuff when we change things, but allowing for automation (example nagios pulls a site thats down from dns for us) 18:35:36 it also seems like 18:35:39 well we were using cobbler to do zones for building stuff and editing bind files that way. At another spot we had a shell script which did pretty much what you said for a limited domain 18:35:42 it would make the process less error prine 18:35:51 s/prine/prone/ 18:35:56 yes. 18:36:28 also, ideally I'd prefer if there were a nag when it's not in the 'normal' config state. Currently, we take something out, and when it's back up, we forget to re-add it 18:37:19 okay.. 18:37:22 anyhow, I don't know of anything done like this before, but we could look around on the net? 18:37:23 here's what I might do 18:37:33 there's no reason to NOT setup the dns zone 18:37:37 and just not USE it for anything 18:37:45 ie: just test populating/updating the zones 18:37:54 but nothing is ever referred to inside it 18:37:55 right? 18:37:59 that seems harm-free to me 18:38:11 yep. Should be fine. 18:38:20 so I may look into that as a start 18:38:26 unless someone else wants to take this on 18:39:05 * marcdeop is not sure has the knowledge need but wants to help skvidal out 18:39:08 * nirik listens to the silence. ;) 18:39:19 * marcdeop s/need/needed 18:39:28 marcdeop: do you have any dns experience? 18:39:35 #info skvidal to work on more automated/dynamic zone updates for our primary zone 18:39:35 a little bit, yes 18:40:03 marcdeop: well - all we're talking about is a script that lets us generate a pretty simple dns zone 18:40:11 it's really just a template 18:40:22 I bet I could use python jinja to most of it 18:40:33 the only tricky part is state maintenance 18:40:42 and making the commands make sense to someone updating it 18:40:53 marcdeop: I'll sketch up some pseudocode 18:40:58 and post to infra for complaints 18:41:04 if you want to work on fleshing it out - feel free 18:41:10 sounds good. 18:41:39 #topic Open Floor 18:41:48 Any other items anyone has? questions? comments? 18:41:58 how about the clean up issues we encountered this week? 18:42:08 I will be out next Thursday 18:42:13 smooge: have fun! 18:42:22 skvidal: sure, would be good to note those... 18:42:23 I am going to a security conference in Santa Fe for a couple of hours 18:42:33 should be learning some new things 18:42:52 hopefully not all Windows related :) 18:42:58 smooge: cool. 18:43:00 nirik: well I was thinking of these items: 18:43:10 couple of hours = next week? wooow 18:43:12 :P 18:43:32 1. all of sign-* hosts redo/work/update 18:43:35 2. old puppet crap 18:43:55 3. firewalls? 18:43:57 3. httpd::site vs httpd::website and porting things around 18:44:02 (firewalls, yes) 18:44:06 4. fakefas reinstall 18:44:36 5. it's been a while since the last time we changed global root pws 18:44:58 5 I was thinking about the other day and agree with. 18:45:10 right. So, on 1. I am wondering if we should consider a late summer FAD for infrastructure security stuff. We could gather somewhere and work on redoing our sign stuff, finish implementing the 2factor auth stuff, talk about signing commits, talk about any other secuity deliverables we might want to have. 18:45:32 we could also do 5 at that time with many of us in person? 18:45:47 nirik: root pw change is triviall, actually 18:45:53 well, yeah, true. 18:45:54 i have a func script which can do it globally 18:46:08 speaking of that I made a change to our global ssh configuration this week 18:46:16 * marcdeop is sorry but has to leave. Hopes to read any stuff left in the log :) 18:46:22 such that it is now possible for us to allow root to login - but only using ssh keys 18:46:28 marcdeop: no problem. 18:46:32 however, I have not enabled any authorized_keys 18:46:55 I'm ok with a rootpw change anytime. 18:47:32 nirik: okay I'll jot it down 18:47:53 * nirik was also wanting to change the default IMM passwords/other places thats used. 18:47:58 but thats more work 18:48:07 nirik: when/where are you thinking about the FAD? I was thinking 'VFAD' but you said in-person 18:48:14 nirik: can we do that via the ssh connection? 18:48:20 nirik: b/c that would be WAY easier 18:48:21 skvidal: not sure. 18:48:24 relrod: VFADs suck 18:48:29 relrod: b/c people are still distracted 18:48:45 let me rephrase 18:48:50 vfads do not suck 18:48:53 they suck for things like this 18:48:54 relrod: yeah, if we can get funding... it occured to me that no one does FAD's anymore... but perhaps infra could put together a compelling one. 18:49:11 * pingou likes the idea 18:49:14 Yeah. I think it'd be cool 18:49:17 in person, relatively isolated could work for hammering out something. 18:49:19 not that I like traveling, but I bet we could get a lot done in a weekend face to face with less distractions. 18:49:31 or better yet! not a weekend! :) 18:49:34 or at least a lot of plans to bring back 18:49:38 sure, that too. 18:49:43 a long week-end ? :) 18:50:02 anyhow, it's just an idea. 18:50:12 I'll run it by the list and see what interest there is. 18:50:16 nirik: in your idea, where would it be held ? 18:50:16 * relrod likes it - if I can help plan it somehow let me know 18:50:29 pingou: good question. 18:50:37 we are pretty spread out... 18:50:39 nirik: US is the basic answer, but east/west cost ? 18:50:54 I bet we could get a conference room in the RDU office for free 18:50:57 I suspect it would come down to costs... 18:51:19 where can we get space/net/etc... where is cheap to fly people into...where is cheap to have people stay 18:52:03 * relrod likes the idea a lot 18:52:05 let us do one in Phoenix :). In the summer 18:52:09 another possibility would be to try something next to fudcon... have folks come in a few days sooner or leave a few days later. 18:52:17 but thats next year 18:53:13 smooge: we could all sit next to the servers in the datacenter. ;) 18:53:25 \รณ/ 18:53:32 nice and cold 18:53:41 but unfortunately, loud. 18:53:46 incredibly loud 18:53:56 hear plugs ? :) 18:53:59 anyhow, I'll send out a thing to the list and see what folks think... 18:53:59 I've never been in a quiet datacenter 18:54:05 nirik: cool 18:54:06 and we can chat on irc :) 18:54:24 I'll note it's important for FAD's to have a list of specific deliverables... 18:54:36 not just "work on stuff" 18:54:39 right 18:54:42 I think that list is not hard 18:54:51 sign* fixup/redo/document/process 18:54:56 2fa 18:54:59 yeah, I have a long list of things I would like to have done/do/plan 18:55:11 well we could focus it down to only auth/security if it would help 18:55:21 abadger1999: webapp FAD ? 18:55:22 yeah, thats what I was thinking too. 18:55:34 Maybe Colorado is the closest to all of infra-sysadmin :-) 18:55:51 sorry i almost missed the meeting - I was thinking it was #PM EDT :) 18:55:53 abadger1999: we could make a graph. ;) 18:55:58 3PM* 18:55:58 pingou: We could. I bet Boston area is most central for those. 18:56:07 anyway - to the list? 18:56:12 abadger1999: likely 18:56:27 webapp/fedora-apps FAD would be cool too, yes :P 18:56:31 yeah. (I would be happy with colorado, because then I wouldn't have to fly) 18:56:40 anyhow, anything else? or shall we call it a meeting? 18:56:44 nirik: you could drive to raleigh if you really wanted to 18:56:58 yeah. 18:57:16 possibly could do train too, but I bet I would have to go via chicago or something silly. 18:57:48 ok, thanks for coming everyone. 18:57:58 #endmeeting