18:00:00 #startmeeting Infrastructure (2012-06-14) 18:00:00 Meeting started Thu Jun 14 18:00:00 2012 UTC. The chair is nirik. Information about MeetBot at http://wiki.debian.org/MeetBot. 18:00:00 Useful Commands: #action #agreed #halp #info #idea #link #topic. 18:00:01 #meetingname infrastructure 18:00:01 The meeting name has been set to 'infrastructure' 18:00:01 #topic Good morning Fedora 18:00:01 #chair smooge skvidal CodeBlock ricky nirik abadger1999 lmacken dgilmore mdomsch threebean 18:00:01 Current chairs: CodeBlock abadger1999 dgilmore lmacken mdomsch nirik ricky skvidal smooge threebean 18:00:11 * mdomsch is here 18:00:20 who all is around for a exciting, thrilling, wonderous, fedora infrastructure meeting? 18:00:21 * skvidal is 18:00:23 * lmacken 18:00:27 * rossdylan is here 18:00:28 * ingm4r is 18:00:31 * threebean is here 18:00:48 Hi Team. I am Sudhir Menon from India (irc:sdrfed17). with 3yrs of experience in Linux Sys Administration and QA. Would like to contribute to Infrastructure Group. 18:01:02 welcome sdrfed17 18:01:28 * ingm4r would like to join the team, too 18:01:33 thankyou nirik 18:01:34 #topic New folks introductions and Apprentice tasks. 18:01:35 If any new folks want to give a quick one line bio or any apprentices 18:01:35 would like to ask general questions, they can do so now. Anyone? 18:01:56 Hi everone I am sumit rai from India, (irc: sumitrai), I have RHCSA, and I would love to be a part of fedora community 18:01:57 sdrfed17 / ingm4r: were you more interested in sysadmin tasks? or application development? 18:02:16 sysadmin task niik 18:02:41 Short Line from me: My name is Ingmar (I'm from Germany), I'm working as a Sysadmin since ~5 years 18:02:55 excellent. Lots of new folks today. ;) 18:03:02 so I\m interested in sysadmin tasks, too :) 18:03:02 sysadmin task is the thing that i am more interested in, would also like to have my hands on application development as well 18:03:23 I am interested in sysadmin task too. 18:03:26 For the sysadmin side of things, take a look at https://fedoraproject.org/wiki/Infrastructure_Apprentice and if that sounds of interest to you, I can set you up after the meeting (see me in #fedora-admin) 18:03:49 for application development, we have a number of apps we work on, and there's a list of easyfix items to look at: 18:03:58 http://fedoraproject.org/easyfix/ 18:04:58 so, we can get you all setup after the meeting. ;) 18:05:06 any general questions right now? 18:05:15 will do nirik 18:05:21 not yet :) 18:05:40 looks good to me the apprentice part 18:05:42 nirik 18:06:27 great. I can get you setup after the meeting. ;) 18:06:41 that would be fine. 18:06:41 do chime in with questions and comments as they come to you, and again welcome. 18:07:15 thank you we will be joining you @ fedora-admin after this meeting 18:07:20 #topic Applications status / discussion 18:07:36 abadger1999 / threebean / lmacken / pingou / relrod: any application news this week? 18:08:05 nirik: Sorta sysadminy -- we're about to retire app01.dev :-) 18:08:06 nothing exciting 18:08:17 abadger1999: hurray. 18:08:19 the apps that were being tested on it have moved to pkgdb01.dev and fas01.dev 18:08:46 is everyone happy with how staging works these days? is it better than when we had a staging branch? 18:08:46 in the process, I updated them to use passwordless sudo and made the login/sudo group the commit group for the applications 18:09:05 which were things we'd talked about migrating our dev boxes to do. 18:09:09 Seems to be working out fine. 18:09:28 worked a bit on HK and our student seems to make some progress as well 18:09:34 we did have a upgrade to koji this week. ;) 18:09:46 nirik: btw, what about python-bz ? any news ? 18:09:48 pingou: cool. 18:09:57 I think lmacken hit his first stg-was-nonintuitive issue this week (or last week) 18:10:17 nirik: i have found one bug in koji i need to get fixed 18:10:31 someone else had added an explicit stg module for something and then we couldn't figure out why committing to master wasn't showing up on stg. 18:10:40 (in the modules/ directory) 18:10:47 abadger1999: yup, that was confusing at first. 18:10:59 pingou: there is a 0.7.0 version. We should retest our stuff with it. 18:11:01 Not sure if we can do anything about that except remember to check for that. 18:11:09 abadger1999: ah yeah. I wonder if there's anything we can do about that... . 18:11:39 nirik: release? 18:11:41 in testing ? 18:12:06 pingou: packages in fedora updates-testing. Looks like it's not been built for epel yet. 18:12:14 ok 18:12:21 * pingou notes this on his todo 18:12:22 I can do a scratch build if anyone wants 18:13:00 ok, cool. 18:13:10 dgilmore: what was the bug? in login? 18:14:05 #topic Sysadmin status / discussion 18:14:06 nirik: yeah the web login issue 18:14:15 its adding a :80 to the url 18:14:24 I thought I'd add a section about what sysadminy things we have done over the last week too. 18:14:26 https:// doesnt work well on port 80 18:14:29 dgilmore: ah, bummer. ;( 18:14:43 we just finished last night a mass reboot... so everything should be up to date. 18:15:05 skvidal revamped out dns this week. :) Please read the readme in the dns repo 18:15:26 if anyone wants to updats our dns SOP 18:15:28 skvidal: that's huge 18:15:37 to point to the readme in the dns git repo 18:15:38 thanks for your effort there 18:15:40 please feel free 18:15:52 mdomsch: thanks for that - I hope it will make us all less pained when it comes to proxy time 18:15:54 * nirik can do that. 18:15:59 s/proxy/proxy rotation/ 18:16:04 yeah, I think it's less error prone for proxys. 18:16:10 thanks for working on it skvidal 18:16:13 and it should be less error prone in general 18:16:21 it is VERY hard to get an invalid zone file past it 18:16:36 I hope 18:16:37 :) 18:16:41 we also wiped out community01.dev and made a packages01.stg, which I think is mostly working now. 18:16:52 I've got the S3 mirrors functional in 3 zones now (us-east-1, us-west-1, and us-west-2) 18:16:55 yeah, which is something that had happened to us in the past. ;( 18:17:11 and spent another few nights trying to beat hardlink handling into s3cmd sync 18:17:38 mdomsch: any luck with that? 18:17:41 once that's working, need to parallelize it on 2 dimensions: 1) multiple uploads per upload target 18:17:56 2) scan the local file system once, then multiple upload targets in parallel 18:18:06 as it stands, we're beating the crap out of the netapps on every sync 18:18:18 as it calculates md5sums on each file before checking in with S3 to see if it has it 18:18:46 which is the last thing it needs right now - local tree md5sum caching 18:18:53 perhaps we could pregenerate that? or get it from the repodata? 18:19:02 does it have to be md5? 18:19:20 unfortunately, yes, md5 only. S3 returns that as the ETAG 18:19:35 mdomsch: can you generate an md5sum file and go off of datestamp? 18:19:49 mdomsch: b/c datestamp should be a simple stat() hit and not a full file read like md5sum 18:19:54 though I did just add stashing the md5 in the S3 per-file metadata, so could conceivably add another hash type 18:20:18 mdomsch: then you can assume the md5sum is the same, if the datestamp on the file is older than the last time you ran 18:20:33 (unless someone intentionally set the file mtime/ctime back) 18:21:01 maybe.... 18:21:11 definitely open to ideas to speed things up 18:21:39 problem is, mtime/ctime is an easy stat() call locally, but it requires a full HTTP HEAD call for each target remote 18:21:48 mdomsch: you don't need to compare it to remote 18:21:48 to get it out of the metadata 18:21:54 mdomsch: you just compare it to the last time you ran 18:21:57 MD5 we get "for free" from the bucket list command 18:22:12 any file with a timestamp > than the last time you ran 18:22:15 you take an md5 of 18:22:25 s/file/local file/ 18:22:31 that way you're not hitting EVERY file on the netapp 18:22:38 only those newer than the last execution of your script 18:22:40 skvidal: ah, yes 18:22:45 and when you're done 18:22:50 you store the md5sum of that file 18:22:56 so - if you need it for any reason 18:22:58 you have it 18:23:05 yes, that's completely feasible 18:23:05 w/o rereading it from the file itself 18:23:47 yeah 18:23:52 that's exactly in line with what I was thinking 18:24:30 cool 18:24:31 cool. Sounds like a number of optimizations possible... 18:24:40 ok, moving on? 18:24:46 so, if there are any new folks 18:24:49 apprentices etc 18:24:59 who know python and have time to monkey with it 18:25:06 I'm very open to the help... 18:25:13 excellent. 18:25:26 #info assistance with s3 mirroring welcome. 18:25:39 any other sysadmin stuff to note from this last week? 18:26:18 the bind managed-keys crap? 18:26:27 if anyone is familiar with named and dnssec 18:26:47 and can figure out why on every startup named belches out that it cannot find some managed-keys in dynamic/ 18:26:55 i would be OVERJOYED to see a solution 18:27:11 #info help welcome to track down managed-keys dns warnings 18:27:11 grep for managed-keys in the messages log of any of the nameservers 18:27:13 and you can see 18:27:49 yeah, it's an odd one. ;( 18:28:34 an named epylog module 18:28:37 if anyone wants to write one 18:28:47 I'm sure we'd be happy to be a tester of it 18:29:22 #info epylog named module welcome to parse named logs. 18:29:33 * nirik should file some of these for apprentice folks. ;) 18:29:53 #topic FAD ? 18:30:18 So, I sent out an email the other day to judge interest in holding a FAD (Fedora Activity Day). 18:30:27 sounds like there is some interest. 18:30:38 we need to try and isolate place and time and see who all can make it. 18:30:40 I bet there is :) 18:31:08 so, what I might do is make a wiki page, and ask people to sign up there and note their place/time prefs. 18:31:24 and possibly ballpark costs of flying them to place X or something. 18:31:46 nirik: you know - fudcon in paris - we could colocate a fad w/that 18:31:51 anyone have any further thoughts/ideas on this? is security a good topic? 18:31:56 skvidal: +1 18:31:56 nirik: I'm sure I could convince eunice that we need to go to paris in the fall. 18:32:07 skvidal: ha. yeah! 18:32:16 but we should still be able to do one before if we like 18:33:06 we should be able to get a room there 18:33:10 If folks know of spaces that would be very low cost/free for us to gather at, we could consider them too. 18:33:13 doubtful I could attend or be of much value for a security-focused FAD 18:33:23 mdomsch: you're always useful 18:33:53 I think security is too broad though. I'd like to see a "we will accomplish 1, 2, and with a lot of luck, 3, in 2 days" 18:34:05 mdomsch: you're always welcome. ;) 18:34:06 yeah... 18:34:10 I love the ideas on the list so far 18:34:31 just trim it down to something achievable with a few people who can Get It Done 18:34:53 * pingou wonders about a webapp component 18:34:53 yeah, I listed a bunch of possible things... 18:34:58 but then it would be 2 groups 18:35:07 I think the list is too long to get done all at once there. 18:35:16 nirik: Smooge had the idea of just getting two-factor auth done. 18:35:30 That seemed like it was a good focus for a FAD. 18:35:34 imho 2 factor should have the priority 18:35:43 we might want to focus on things that we could either a) be confident of getting done, b) need to discuss in person more to come up with a plan. 18:35:47 abadger1999: yeah. 18:37:06 well, I will see about whipping up a web page where we can collect costs and time/place prefs. 18:37:16 and we can narrow scope down 18:37:54 #action nirik will make a web page to collect possible attendees, flight costs and location / time prefs. 18:38:14 #topic Upcoming Tasks/Items 18:38:23 #topic Upcoming Tasks/Items 18:38:24 #topic 2012-06-18 remove people with pkgdb bugzilla issues. 18:38:24 #topic 2012-06-21 to 2012-07-04 Kevin is off on trains and boats. 18:38:24 #topic 2012-06-26 Fedora 15 end of life. 18:38:24 #topic 2012-06-28 Seth at jury duty. 18:38:24 #topic 2012-07-05 nag fi-apprentices 18:38:26 #topic 2012-07-12 drop inactive apprentices. 18:38:28 #topic 2012-08-07 to 2012-08-21 F18 Alpha Freeze 18:38:30 #topic 2012-08-21 F18 Alpha release. 18:38:32 ugh. 18:38:34 misskey 18:38:36 oh well. 18:38:38 lots of topics. ;) 18:38:43 (those were supposed to be infos) 18:38:49 holy cow 18:39:01 DOS...to get back into security :) 18:39:30 #topic Upcoming Tasks/Items 18:39:30 #info 2012-06-18 remove people with pkgdb bugzilla issues. 18:39:30 #info 2012-06-21 to 2012-07-04 Kevin is off on trains and boats. 18:39:30 #info 2012-06-26 Fedora 15 end of life. 18:39:30 #info 2012-06-28 Seth at jury duty. 18:39:30 #info 2012-07-05 nag fi-apprentices 18:39:32 #info 2012-07-12 drop inactive apprentices. 18:39:34 #info 2012-08-07 to 2012-08-21 F18 Alpha Freeze 18:39:36 #info 2012-08-21 F18 Alpha release. 18:39:46 anyhow, as noted there, I will be gone the next two meetings. ;) 18:40:06 if anyone needs anything from me, please ask me to do it before next thursday. 18:40:18 nirik: I need you to not be gone, kthx 18:40:23 nirik: :) 18:40:30 does anyone have any other upcoming tasks or things they would like to note on the schedule? 18:40:49 skvidal: working on it. ;) Looking forward to be sitting on the train reading a book looking out the window. ;) 18:41:32 hrmph 18:41:36 Oh, our private cloud hardware is supposedly in the datacenter somewhere. We just need it to be located and racked and wired and we can start setting it up. 18:41:44 nirik: and the networking setup 18:42:08 yeah. 18:42:21 I'm not sure if thats just one switch or two. 18:42:44 and of course whatever it means for ips 18:42:56 yeah. we do have an external class C ready for this. ;) 18:43:22 sorry, wayyyyy late, but I'm here. 18:43:31 oh, also, we are hopefully getting a osuosl02 box... will be good to have 2 machines there so we can HA them or whatever we need. 18:45:16 #topic Open Floor 18:45:26 any questions, comments, ideas for open floor/ 18:46:10 well since I missed the app discussion, quick update on fedorahosted automation app stuff 18:46:18 Fedora badges is coming along pretty well, working on building the nessicary rpm's of the python modules i have been working on 18:46:31 relrod: sure... 18:46:35 rossdylan: cool. 18:47:20 * nirik noted the ubuntu badges thing thats incompatible with open badges had a 0.2 release the other day. 18:47:46 The web side of the fedorahosted automation app is pretty much done, and the CLI I'd say is 75% done. The CLI (at least to the point where I can test it locally) can fully process git requests and Hg requests. I need to get it processing bzr and svn. 18:48:33 relrod: how much pain would it be if we had a hosted-agilo01 instance that was just for projects that needed agilo trac plugin/ 18:48:33 Flask still isn't packaged for el6 yet though. The maintainer is having some issues with the Flask tests not passing on el6 18:49:23 ah 18:49:38 nirik: Probably not too much pain, you'd just run the CLI on -agilo01 instead of hostedXX 18:49:46 ok 18:50:22 item: cgit vs gitweb-caching? 18:50:40 did we come to a conclusion there? 18:50:42 #topic cgit? 18:51:05 not that I know of. I was going to ask gnome.org folks what they thought of cgit (since they use it there) 18:51:13 but I didn't get around to it. 18:51:22 I got an internal email 18:51:25 on the subject 18:51:27 which said 18:51:39 'cgit is much better' 18:51:42 (more or less) 18:51:51 yeah... from looking it seems that way to me. 18:52:04 so, I'm fine moving to it. 18:52:17 the main downside is broken links. 18:52:20 +1 for cgit from me. 18:52:24 but there's some redirect rules that could help. 18:52:41 But I've never been as concerned about the broken links as other people. 18:53:06 yeah, it doesn't worry me overly. I don't think those links are used much... 18:53:14 abadger1999: I think I am inline with that now 18:53:17 if someone hits an old bug with a gitweb link, too bad. 18:53:18 I used to worry about the links 18:53:20 but screw it 18:53:25 it's just how things fall down sometimes 18:54:10 I'd be ok adding the redirects to try and make it somewhat nicer, or if we want just try and redirect all those gitweb things to a page that explains we are using cgit and how to search for what they were looking for. 18:54:56 worksforme 18:55:03 oh - I have another sysadmin-y task 18:55:04 so, does someone want to lead this? if not, I can add it to my list. ;) 18:55:06 that is a touch herculean 18:55:17 * nirik notes we could test cgit on hosted01/02 18:55:32 nirik: might be easier to test cgit on fedorapeople 18:55:43 nirik: then again maybe those ~ repos will be tricky on fedorapeople 18:56:04 there's also pkgs01.stg 18:56:11 nod 18:56:14 anyhow, you had another topic? 18:56:30 yah 18:56:32 so 18:56:41 our httpd::websites, etc module in puppet 18:56:42 is 18:56:44 to say the least 18:56:45 complicated 18:56:57 a while back when we moved infra.fp.o to be standalone 18:57:04 I wrote a new httpd::site class 18:57:16 which simplifies how websites can be setup in puppet 18:57:20 it doesn't involve any templates 18:57:28 and makes me less likely to scream 18:57:43 yeah, +1 on that 18:57:45 so 18:57:49 we need to move to that more 18:57:51 #topic http::websites 18:58:01 yeah, fine with me. 18:58:03 we need to convert sites over and whittle our way off of the other one 18:58:08 just takes people 18:58:17 yeah. 18:58:26 another item that is on my todolist but... 18:58:28 and getting puppet to do the right thing. 18:58:29 well it's a todolist from hell 18:58:38 iptables templates 18:59:08 my plan is to break iptables templates up into stg/prod templates 18:59:19 this separation is mainly to make sure we keep stg from talking to prod 18:59:40 sure... and possibly "untrusted vpn" ? 18:59:41 the idea is for the template to use the heredoc trick 19:00:00 so we have a standard preamble 19:00:14 then if iptables.$iptables_group for that node exists - it gets include 19:00:23 and if iptables.$iptables_datacenter exists - that gets included 19:00:37 and if iptables.$fqdn exits - that gets included 19:00:45 (actually reverse the first two 19:00:49 datacenter, group, fqdn 19:01:01 yep. just like the other sane places you already converted to that. ;) 19:01:13 so that we end up being able to add arbitrary rules, per host or per group of hosts (or per datacenter) 19:01:25 w/o having to deal with the defintion problem for iptables 19:01:30 that we deal with in puppet all the time 19:01:41 the other alternative, which I am not advocating but I am throwing out there 19:01:44 right. so would we remove the custom rules from nodes then? 19:01:50 yes 19:01:58 we would remove custom rules from node files 19:02:01 sounds good to me. 19:02:04 and put them in simple iptables 19:02:07 so the other alternative 19:02:10 that I want to mention 19:02:15 that the dns thing this week made me think about 19:02:35 we could setup iptables in templates - just the dns zone template 19:03:01 in a separate git repo, etc 19:03:11 construct per host and have puppet just run the update-iptables 19:03:15 which sucks down via git, etc 19:03:19 like I said 19:03:22 not advocating 19:03:24 just thinking about it 19:03:30 we could, but we don't often change iptables and it doesn't have serial numbers and such... not sure it's worth it. 19:03:51 nod - thr advantage I was thinking of was being able to validate iptables 19:04:19 which is... difficult with the pieces of iptables we'd have to work with in puppet 19:04:33 validating iptables in general is difficult. ;( 19:04:44 nirik: true 19:05:00 especially if you can have a valid iptables config that just block yourself :) 19:05:18 I'm happy to simplify and split out what we have now tho for sure... 19:05:27 since if you make a mistake now, it affects ALL machines. 19:05:30 the second advantage would be the speed at which we could deploy an iptable 19:05:34 change 19:06:20 yeah, currently we don't update often... but there are use cases I suppose. 19:06:27 right 19:06:50 anyway it's something I'm going to be working on so I figured it would be worth mentioning it 19:06:58 yeah, sounds good. 19:06:59 if anyone wants to work on it and is familiar with iptables - enjoy 19:07:20 #info iptables folks welcome to help with iptables revamp 19:07:37 #topic Open Floor (^2) 19:07:44 hah 19:07:48 any other items for open floor or other questions, comments? ;) 19:08:19 just a basic question, if thats ok 19:08:27 ingm4r: sure, fire away 19:08:35 stg ist staging and prd is productive? 19:08:44 production 19:08:53 ok 19:09:27 ingm4r: yeah... 19:09:42 so we try and test things like new package versions and changes in our staging setup... 19:09:53 then when they appear fine there, they go to production machines. 19:10:05 jup, thought so. Just wanted to be sure about the naming. 19:10:16 our staging setup is not a complete 1 to 1 mapping, but it has many copies of productions stuff. 19:10:26 nirik: so are apprentice guys allowed to work on both staging and production? 19:11:13 sdrfed17: sure, the way it works is that apprentices can login to machines and check out a read only copy of our puppet repo... so any changes you make need to be sent throug someone that has commit access. 19:11:25 so that way you can see how things are setup and propose patches for review. 19:11:55 nirik: ok 19:11:55 anyhow, happy to discuss more over in #fedora-admin... 19:12:04 we are over time, so lets go ahead and close out... 19:12:09 #endmeeting