19:00:01 #startmeeting Infrastructure (2013-04-04)
19:00:01 Meeting started Thu Apr 4 19:00:01 2013 UTC. The chair is nirik. Information about MeetBot at http://wiki.debian.org/MeetBot.
19:00:01 Useful Commands: #action #agreed #halp #info #idea #link #topic.
19:00:01 #meetingname infrastructure
19:00:01 The meeting name has been set to 'infrastructure'
19:00:02 #topic welcome y'all
19:00:02 #chair smooge skvidal CodeBlock ricky nirik abadger1999 lmacken dgilmore mdomsch threebean
19:00:02 Current chairs: CodeBlock abadger1999 dgilmore lmacken mdomsch nirik ricky skvidal smooge threebean
19:00:19 * skvidal is here
19:00:25 * pingou here
19:00:26 * lmacken here for 15m
19:00:33 * relrod here
19:00:37 * threebean is here
19:00:52 * fchiulli is listening
19:01:18 #topic New folks introductions and Apprentice tasks.
19:01:19 is here
19:01:29 any new folks or apprentices with questions or comments?
19:01:41 oddshocks: you wanted to introduce yourself? ;)
19:01:44 * oddshocks is new!
19:01:52 :P
19:02:09 Hey everyone, I just sent out a mailing list introduction, I'm an IT student and free software hacker from RIT hoping to get involved with things around here :)
19:02:17 potential Red Hat intern for the Summer
19:02:19 oddshocks: welcome
19:02:21 nice to meet you all :)
19:02:23 welcome :)
19:02:28 hey oddshocks :p
19:02:28 welcome oddshocks
19:02:30 nice, welcome :)
19:02:38 you more interested in sysadmin or software devel?
19:03:35 I'm still kinda on the fence, but systems administration catches my eye a bit more
19:03:51 sure, you could also help out in both, time and focus permitting. ;)
19:04:16 I like writing code and I also like working with systems, so yeah
19:04:20 anyhow, welcome and do see us after the meeting in #fedora-admin / #fedora-apps and we can point you at where to get started.
19:04:27 Perfect, thanks!
19:04:33 #topic Applications status / discussion
19:04:46 any applications news? Note that we are in alpha freeze right now.
19:04:55 wanna talk about gsoc here or open floor?
19:05:18 * pingou worked on pkgdb2 still
19:05:21 either way. ;)
19:05:24 I let threebean mention tagger :)
19:05:32 * threebean mentions tagger
19:05:47 * nirik will have a separate topic about freeze discussions after apps/sysadmin ones.
19:06:01 :P
19:06:11 that was easy ;)
19:06:28 GSoC is Google Summer of Code. It's coming up soon and Fedora has applied. We've already had students interested in working on infra come to find us in irc. What we really need to pull this off, though, is mentors.
19:06:30 tagger dev work is getting close to something we could put in staging. early next week at the earliest.
19:06:52 cool.
19:07:04 abadger1999: yeah, most of us are pretty strapped for time tho. ;(
19:07:13 Most of the usual suspects are busy this year (lmacken, myself, nirik, skvidal, pingou, etc).
19:07:15 threebean: btw we should update the dev instance with the python-fedora from abadger1999 so we can start checking it
19:07:27 abadger1999: additionally i'm a terrible mentor
19:07:33 pingou: oh yeah, let's set that up in #fedora-apps post meeting
19:07:45 threebean: cool
19:07:49 skvidal: really? :)
19:07:59 abadger1999: some of the items are not really directly needing mentors from us... the gitlab thing could well be mostly packaging.
19:08:01 I sent a request to the infra list for mentors to raise their hands if they want to work on something but so far no replies.
19:08:02 pingou: no need for you to mock
19:08:18 ...chain.py!
19:08:47 nirik: although... being useful to infra would make it more likely to be accepted.
19:08:51 wrt GSoC: tbh, there have been some people who looked very motivated, it kind of makes me want to help them
19:09:01 it's a bit difficult in that ideas on the page are submitted by anyone without infra really looking or saying they would like them.
Perhaps next year we could be more proactive and prune ideas to ones we really would like and that might get more mentors involved.
19:09:21 Yeah, biker has shown up a number of times and been pretty enthusiastic.
19:09:40 skvidal: honestly, I wasn't
19:09:46 yeah, agreed.
19:09:49 He wondered about doing a unittesting GSoC... I'd be willing to partially mentor that.
19:10:03 biker was also interested in any other ideas too... wasn't set on the ideas page ones.
19:10:07 pingou: SSSSSSSSSSSSSURRE you weren't :)
19:10:13 skvidal: no really :)
19:10:32 GSoC: pkgdb2? Elections?
19:10:32 I've seen a lot of interest in gitlab and unit testing for fedpkg...
19:10:48 start porting to ansible?
19:11:02 MirrorManager?
19:11:02 elections, fchiulli is managing... pretty far along -- but from the hours that he's available I don't think he can mentor.
19:11:31 I don't think mdomsch has time to mentor for mm... and not sure anyone else is up to speed enough yet.
19:11:33 * fchiulli is not available much during the day. Nights vary
19:11:49 mirrormanager needs a new driver since mdomsch has less time. Without a driver, gsoc contributions are likely to not get merged.
19:12:19 would anyone be willing to co-mentor with abadger1999 on unittesting?
19:12:21 ansible would be good -- but that would need a sysadmin mentor... and you guys are all pretty tapped, right?
19:12:23 perhaps that's one we could do?
19:13:21 abadger1999: well I did work quite a bit trying to port MM to TG2
19:13:23 the problem with the ansible side is that a lot of it is just deciding how we want to organize things... I'm not sure what GSoC folks could do there until we have more things in place and then not sure how exciting it would be. and yeah, time...
19:13:40 pingou: Want to take ownership?
19:14:01 we still need to get mm1.4 into production.
19:14:06 and fix issues with it.
19:14:06 pingou: heh -- if you aren't careful, you'll end up with ownership of pkgdb as well ;-)
19:14:18 abadger1999: I can deal w/ pkgdb more than MM :)
19:14:36 abadger1999: but don't think you're out of the loop ;-)
19:14:50 darn! ;-)
19:15:25 so, where does that leave us? ;)
19:15:30 anyhow -- if anyone wants to step up to mentor, I can co-mentor unittesting python-fedora and some other things.
19:15:50 can't do it alone.
19:15:57 pkgdb2 is an option?
19:16:08 nirik: it would be awesome for some mm love
19:16:31 sure, but without a driver on it, not sure GSoC would help us any
19:16:46 yeah
19:16:53 pingou: yes and no. If you want to mentor that I can help you --- but I wouldn't want to be primary mentor for that this summer.
19:16:54 * Smoother1rOgZ is around
19:17:26 abadger1999: and I shouldn't (not that I wouldn't)
19:17:30 hehe :-)
19:17:34 understood :-)
19:17:50 ok, well, let's keep discussing and see if we can offer anything...
19:17:54 any other application news?
19:18:10 python-fedora alpha is in infra; surprisingly stable.
19:18:21 cool :) stg or prod?
19:18:22 I just sent a change request to fix a few bugs.
19:18:28 threebean: prod
19:18:30 awesome
19:18:50 working on a pkgdb update (pingou's slimming of the api and db)
19:18:51 cool.
19:18:58 will go to stg next week I hope
19:19:00 that would be after alpha?
19:19:04 (for prod
19:19:05 )
19:19:16 and figuring out how to make otp fit into the python-fedora api
19:19:22 nirik: correct.
19:19:25 fun times. ;)
19:19:41 good news: I've heard no negative feedback after tuesday's freeze break request - http://lists.fedoraproject.org/pipermail/infrastructure/2013-April/012724.html
19:19:42 ok, moving along to sysadmin...
19:19:53 threebean: yeah, seems ok, but the None's are weird.
19:19:57 * threebean nods
19:20:14 those are omitted from the ircbot messages in git... coming down the pipe.
19:20:15 #topic Sysadmin status / discussion
19:20:35 threebean: yeah
19:20:41 so, on the sysadmin side of things...
19:21:08 we have 3/4ths of our arm SoCs online and I will be redoing the other one soon when switch ports are changed on it.
19:21:18 I created a sop for dealing with the arm socs:
19:21:27 http://infrastructure.fedoraproject.org/infra/docs/arm.txt
19:21:56 nirik: :) many thanks
19:22:11 no problem. Nice to see them finally cranking away.
19:22:36 I will likely request a freeze break in a bit for updating postgresql... might need a short outage for it.
19:23:21 bnfs01 (our backup storage box) is hosed again. smooge was looking into it.
19:23:41 we are finalizing our new netapp storage now, hopefully will have some in a while.
19:23:54 it is not looking good
19:24:01 smooge: ;(
19:24:22 any other sysadmin stuff?
19:24:29 * skvidal has been working on ansible-y stuff
19:24:38 the cron-runner
19:24:40 postgres update -- no dump and reload needed right?
19:24:41 oh, I was thinking we should add monitoring for the arm boxes at some point. I might file a ticket on that and let someone else do it. ;)
19:24:46 the freezes thing (which uncovered a bug)
19:24:51 abadger1999: correct. just update and restart
19:24:56 cool.
19:25:34 skvidal: so, were we going to do cron there? or triggered by commit? (or jury still out)?
19:25:45 nirik: well the script is the same, ultimately
19:25:52 so...
19:25:58 yeah.
19:25:59 one way or the other we need to be able to say
19:26:09 'these hosts or groups' - run the playbook for them and don't overlap
19:26:14 I don't know if we can get ansible to say 'this commit affects these hosts'
19:26:17 run them forked off - multiple in parallel at a time
19:26:25 yeah.
19:26:26 nirik: we can for playbooks, we can't, easily, for files
19:26:46 ah. ok.
19:26:48 nirik: for tasks it is.... theoretically... possible - but there will be a lot of grepping involved
19:26:56 speaking of ansible, do we finally have an idea how we want to handle stg/prod?
19:27:08 yeah, we will just need to look at where we want to draw the line probably.
19:27:36 pingou: we did come up with a plan. ;) But it's not sure yet that it will work, so we need to test it...
19:27:41 :)
19:27:50 basically similar to how we do it in puppet... a path for things.
19:28:05 that plugin should work - right now the issue is this
19:28:16 skvidal: that was pulled into 1.1? or no?
19:28:20 nirik: pretty sure
19:28:22 lemme verify
19:28:37 yep
19:28:40 first_found lookup plugin
19:28:58 so - there are 2 things here, really
19:29:08 * nirik did update lockbox01 to 1.1
19:29:23 the playlets idea that I came up with is.... not very attractive unless we can make it easy to use in an existing playbook
19:29:42 in many cases we can make do w/o needing those playlets - just with included tasks
19:29:55 but I'm trying to figure out the cases where tasks will make it more confusing to use
19:29:56 yeah.
19:30:11 in most of the cases I can think of
19:30:11 yeah, I figure we will need to try things and adjust.
19:30:20 it's just where the variable includes would be ugly in every playbook
19:30:24 and you really just want to say
19:30:37 "you know that last group of hosts you ran against?"
19:30:48 "run this playbook over here, against them and only them"
19:31:38 if anyone wants to talk about this with me sometime
19:31:47 I think I could really use someone to bounce this back and forth with
19:32:04 http://infrastructure.fedoraproject.org/cgit/ansible.git/tree/playlets/README
19:32:06 this is the idea
19:32:18 cool -- I'll queue it up for reading
19:32:24 and then
19:32:41 yeah, lots of ways to do things... ;)
19:33:20 http://paste.fedoraproject.org/6712/10399613/
19:33:36 ok, let's keep poking at it. ;) I'd like to add arm-releng to ansible repo soon so I can try and work out in reality how things will look
19:33:45 yah
19:33:59 did we already do cloud stuff?
19:34:00 any other sysadmin stuff anyone has?
19:34:03 not yet. ;)
19:34:05 ok
19:34:10 #topic Private Cloud status update / discussion
19:34:14 now. go!
:)
19:34:18 heh
19:34:28 so I'd like to add a new ip range to the openstack cloudlet
19:34:42 sounds good. are we ready to add more compute nodes too?
19:34:42 I'm almost positive that the command I want to run won't break the existing network ip range ;)
19:34:47 nirik: we are
19:35:02 if it does... oh well, we do the best we can. ;)
19:35:02 fed-cloud05 and 07 need to be reformatted
19:35:16 but they are not doing anything now
19:35:33 so anyone in sysadmin-main who would like to kick them should feel free
19:35:47 cool.
19:35:59 also
19:36:02 adding compute nodes is super easy too... nova.conf and a few packages and away you go.
19:36:16 01 and 03 should be ready to be turned over to grizzly if anyone would like to do that
19:36:29 excellent.
19:36:32 this is sorta-cloud
19:36:52 I played with systemd-nspawn a bit and what it can do
19:37:12 oh yeah. f18?
19:37:16 and I think if anyone would like to tinker with infrastructure/scripting/etc for that on a big fedora 18 cloud instance
19:37:27 it would be something that will eventually be useful to us
19:37:31 us == fedora infra
19:37:37 yeah, I agree.
19:37:45 it is also completely bluesky afaict
19:37:51 so if you want to noodle around with it
19:37:54 * nirik nods.
19:37:58 bluesky?
19:38:07 threebean: go, play, doesn't matter if nothing comes of it
19:38:10 threebean: no constraints
19:38:15 not ready for real use right now, but someday... perhaps...
19:38:24 ok, thanks
19:38:29 so a nspawn is a container
19:38:35 forming one up is easy
19:38:47 I think the big things around it would be: managing them, and networking.
19:38:51 yum --installroot=/some/path install @core
19:39:09 systemd-nspawn -D /some/path -b
19:39:17 and then you're in the chroot, lickety-split
19:39:51 yep.
19:40:00 anyway - if anyone wants to do that
19:40:04 yell at one of us
19:40:10 and we can spin up an f18 instance quickly
19:40:20 sounds good.
19:40:26 #topic Freezes
19:40:29 brrr
19:40:35 so, we are in a freeze for alpha now...
19:40:43 and the topic came up how we determine what's frozen, etc.
19:40:47 (see post on list)
19:41:29 would anyone like to make a case for more hosts NOT being frozen than dev/stg ? or some way to decide those?
19:41:48 nirik: there's one example of a host specifically marked as freezes:false
19:41:51 people03.fedoraproject.org
19:42:01 adding additional ones like that to the inventory is really easy
19:42:22 nirik: actually - this brings up a question
19:42:32 nirik: right now we have the inventory inside the public ansible repo
19:42:33 sure... but I was meaning from a standpoint of criteria.
19:42:38 for alpha/beta or final or both?
19:42:45 abadger1999: I think for all
19:42:50 abadger1999: both, either... ;)
19:43:07 I'd like to have a criterion we can hold a host up to and see if it should be frozen or not.
19:43:26 a very simple one would be: is the host *stg* or *dev* ?
19:43:35 but that means a lot more things are frozen than were before.
19:43:55 yeah... I don't really like more things frozen.
19:44:10 nirik: I think there is room for more non-distro-releasing-impacting things being unfrozen
19:44:29 people, hosted, hosted-lists,
19:44:31 yeah, but what criteria can we use. ;)
19:44:34 the items you mentioned in your email
19:44:38 I posted some other attempts...
19:44:42 nirik: 'does it impact the ability to release the distro'
19:44:54 well, hosted actually does impact the distro. ;)
19:45:07 nirik: how?
19:45:07 comps is pulled from there.
19:45:40 sigh
19:45:43 comps isn't mirrored into a package?
19:45:50 anyhow, we could continue discussion on list too. :)
19:45:57 i guess it's repodata so no.. :-(
19:46:02 abadger1999: no, it's pulled from git by composes...
19:46:09 also spin-kickstarts is there.
19:46:25 now, we could look at moving those, but right now they are there. ;)
19:46:42 brb
19:47:22 yah.... I see
19:47:31 * skvidal grumbles...
19:47:34 okay fine
19:47:40 hosted-lists!
19:47:42 nirik: and branched is pulled from git on fedorahosted nightly
19:48:06 yeah, rel-eng repo too...
19:48:27 so - back to my earlier question
19:48:35 anyhow, let's continue that discussion on list... for purposes of this alpha freeze, we just keep using the picture. ;)
19:48:37 do we want to split out the inventory subdir of the ansible repo - into its own repo
19:48:47 skvidal: what does that get us?
19:48:51 so we can have people editing the inventory who might not be able to commit to ansible itself
19:49:13 not sure that's a very common thing is it?
19:49:18 true enough
19:49:28 this is why I ask these things ;)
19:49:30 I'd prefer to keep number of repos down if we can. ;)
19:50:05 commit on ansible is going to be the same as commit on puppet right?
19:50:15 which is what... sysadmin?
19:50:20 lemme look
19:50:34 yes
19:50:34 sysadmin
19:50:36 yah
19:50:38 you're right
19:50:43 I withdraw the suggestions
19:51:12 :) of course they must be in a group that has shell on lockbox01 too...
19:51:21 #topic Upcoming Tasks/Items
19:51:33 here's what I have for upcoming:
19:51:35 #info 2013-04-02 to 2013-04-16 ALPHA infrastructure freeze
19:51:36 #info 2013-04-15 - remove legacy-openid
19:51:36 #info 2013-04-15 - announce openid change on hosted.
19:51:36 #info 2013-04-16 F19 alpha release
19:51:36 #info 2013-04-17 - switch to new sigul servers.
19:51:37 #info 2013-04-19 - switch hosted to openid
19:51:38 #info 2013-05-07 to 2013-05-21 BETA infrastructure freeze
19:51:40 #info 2013-05-21 F19 beta release
19:51:42 #info 2013-05-31 end of 1st quarter
19:51:44 #info 2013-06-11 to 2013-06-25 FINAL infrastructure freeze.
19:51:46 #info 2013-06-25 F19 FINAL release
19:51:48 any other items people would like to schedule or note?
19:52:34 #topic Open Floor
19:52:41 anyone have items for open floor?
19:52:58 nirik: one question
19:53:24 nirik: what were the future points for determining what to do with hosted
19:53:31 nirik: and making it more-better
19:53:44 as in - it's running right now
19:53:46 yeah... so I think we decided on at least a short term plan...
19:53:51 do we just sit on it?
19:53:57 ah - okay -
19:54:02 well, we could do that too I guess...
19:54:10 I was thinking we could move it to osuosl02...
19:54:28 (or at least all the git ones)
19:54:31 okay - so that's something we do plan to do.
19:54:40 did you decide if you wanted osuosl02 to be cloud-y or not?
19:55:15 I was really on the fence there. I think cloud would be nice from a 'figure more stuff out' standpoint, but on the other hand, I don't know how much time we want to spend on hosted really...
19:55:40 Ideally we can get it to a stable-ish thing and not have to deal with it again for a while.
19:56:11 let me try and write up a plan and we can tweak it and see what all of it we want to do.
19:56:11 nirik: so... what does that mean?
19:56:13 ok
19:56:18 sounds round
19:56:19 I can put it on that hosted wiki page with all the ideas.
19:56:21 thank you
19:56:45 hosted02 is running ok now, but I think I'd like to move it anyhow, even if we do nothing else but move it.
19:57:19 cool
19:57:32 * nirik adds to his list. ;)
19:57:37 ok, anything else ?
19:58:08 ok, thanks for coming everyone!
19:58:09 not from me :)
19:58:14 #endmeeting
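
The simple freeze criterion floated at 19:43:26 ("is the host *stg* or *dev* ?"), combined with a per-host marking like the freezes:false example given for people03.fedoraproject.org, could be sketched roughly as follows. This is only an illustration under stated assumptions: the function name `is_frozen`, the `UNFROZEN_PATTERNS` tuple, the `overrides` dict and the example.org hostnames are all hypothetical, and none of this reflects the actual inventory format or the real freeze script.

```python
from fnmatch import fnmatchcase

# Assumption: hosts whose names match these patterns are never frozen
# (the "*stg* or *dev*" criterion from the discussion).
UNFROZEN_PATTERNS = ("*stg*", "*dev*")


def is_frozen(hostname, overrides=None):
    """Return True if changes to `hostname` should wait for a freeze break.

    `overrides` is a hypothetical dict of hostname -> bool standing in for
    a per-host marking such as freezes:false in the inventory.
    """
    # An explicit per-host marking wins over the name-based rule.
    if overrides and hostname in overrides:
        return overrides[hostname]
    # Otherwise a host is frozen unless its name looks like stg or dev.
    return not any(fnmatchcase(hostname, p) for p in UNFROZEN_PATTERNS)


if __name__ == "__main__":
    overrides = {"people03.fedoraproject.org": False}
    hosts = (
        "people03.fedoraproject.org",
        "app01.stg.example.org",  # hypothetical staging host
        "db01.example.org",       # hypothetical production host
    )
    for host in hosts:
        state = "frozen" if is_frozen(host, overrides) else "not frozen"
        print(host, state)
```

As the log notes, a purely name-based rule freezes more hosts than before, which is why the override table is kept separate: a criterion such as 'does it impact the ability to release the distro' would have to be recorded per host rather than derived from the hostname.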