20:00:27 #startmeeting Infrastructure 20:00:27 Meeting started Thu Jun 3 20:00:27 2010 UTC. The chair is mmcgrath. Information about MeetBot at http://wiki.debian.org/MeetBot. 20:00:27 Useful Commands: #action #agreed #halp #info #idea #link #topic. 20:00:31 #topic Who's here? 20:00:37 * jsmith lurks 20:00:38 doh 20:00:57 * nirik is lurking around in the back. 20:01:41 * sgallagh listens with one ear 20:01:48 * sijis is here 20:01:55 Doesn't look like there are any meeting tickets to discuss so we'll just get started. 20:02:03 #topic Upgrade to RHEL5.5 20:02:08 smooge: how's that going? 20:02:22 woops sorry 20:02:26 ranting on eng-list 20:02:43 ok RHEL-5.5 was updated on stging systems and publictest systems. 20:03:03 smooge: how'd our applications hold up? 20:03:16 app07 and 2 live systems also got updated 20:03:41 there are some problems we found today with mod_python and mod_wsgi I guess 20:03:47 need to clean and fix those up 20:03:56 * mdomsch 20:04:06 smooge: yeah that's a minor change but something to watch out for. 20:04:06 the app systems seemed to work but I didn't go through a formal checklist to confirm 20:04:19 jokajak actually changed that package so mod_wsgi is commented out by default. 20:04:34 so we just need to make sure to deploy a /etc/httpd/conf.d/wsgi.conf that loads the module 20:04:40 smooge: If you don't know where to start on that, I might be able to help ( I fixed the mod_wsgi/mod_python problem for RHEL-5.4 for infra) 20:04:55 smooge: are you planning on scheduling downtime sometime soon for rebooting and whatnot? 20:05:09 abadger1999, I will need to talk with you in a bit. 20:05:14 20:05:18 * mmcgrath has some downtime to do soon. 20:05:25 mmcgrath, next tuesday would be our scheduled patch day 20:05:34 for production systems. 20:05:52 smooge: sounds good, send a note to the list to have everyone double check their apps in staging by then. 20:05:59 ok will do so 20:06:03 which list 20:06:03 I think I'm going to schedule my downtime for tomorrow (will explain in a bit) 20:06:08 fedora-infrastructure 20:06:15 ok webgroup? 20:06:48 sure 20:07:28 smooge: anything else on that? 20:07:40 not at the moment 20:07:43 k 20:07:50 #topic bastion1 is back baby! 20:07:56 the big isuse was dealing with the various --exclude for various projects 20:08:01 oh sorry 20:08:04 bastion1 20:08:08 So I've had a request from the network team to alter bastion's external IP address. 20:08:16 so I'm working on getting that in order. 20:08:23 smooge: what --excludes have you been needing to use? 20:08:52 mediawiki mostly 20:09:10 ah, k. 20:09:16 So yeah, bastion still runs our vpn and mail services. 20:09:21 bastion1's been dead for some time. 20:09:30 shouldn't ibe bastion01 :) 20:09:32 I'm going to re-create it (already have) then get production traffic moved to it. 20:09:39 sijis: ugh 20:10:00 the whole 0 vs non 0 thing has been a disaster, it's been half a year and we're still not all on 0whatever. 20:10:07 anywho .. :) 20:10:34 I'm going to schedule downtime for tomorrow, I'm hoping for a quick blip but since the vpn is involved who knows. I've done small failovers with success for testing. 20:10:53 I *really* want a full featured solution but it's remarkably complicated to design in our specific infrastructure. 20:10:58 it's a split brain problem. 20:12:08 that's all I have on that, any questions or comments? 20:12:55 alrighty 20:12:58 #topic CDN 20:13:00 nb: you around? 20:13:59 k, well I'll take this. At some point soon we'll be sending people in Europe to Europe servers, people in the states will go to servers in the states. 20:14:10 this should provide a better browsing experience for people. 20:14:29 I'd also like to spend more time analyzing our caching setup. 20:14:33 we've had some odd things happening 20:14:40 are we using some sort of geo-dns? 20:14:48 sijis: yeah 20:15:37 sijis: bind + some weird configs. 20:15:47 I'm a little worried about performance but it's a low risk thing because we can always revert. 20:16:00 anyone have any questions or comments on that? 20:16:08 its pretty quiet today. 20:16:13 uhm 20:16:15 * Schmidt says hi then... 20:16:44 is this just for mirrors or all content? 20:16:52 all fedoraproject.org websites. 20:16:57 the mirrors already have that. 20:17:01 nothing for mirrors changes 20:17:04 the mirrorlist server doesn't though :) 20:17:07 but will 20:17:07 and how much savings do people get when stuff still has to make a long haul back to PHX2 for the db's? 20:17:34 for that stuff not much, for all css, image, js and static / cached content (like the wiki) it could be significant. 20:17:55 also, with most of our applications the data coming in and out of the database is small, but the formatting of it is much larger. 20:17:58 which has network savings. 20:18:11 but we'll need to re-do some of our haproxy setup. I'm not totally sure how on that yet without getting messy. 20:18:23 I'm thinking about having a /etc/hosts entry for localapp1.fedoraproject.org 20:18:32 smooge: is your concenr the wiki data? 20:18:35 so if the proxy server is in the same location as an app server, it gets there. 20:19:18 sijis, wiki data, upcoming zikula apps, etc 20:19:33 oh actually zikula would also have significant savings. 20:19:36 is smolts related too? 20:19:38 it'd also be cached at the proxy layer. 20:19:46 some smolts pages would see savings 20:19:55 ah ok 20:19:58 not submission, though there's new code coming down for that. 20:20:43 Anyone have anything else? 20:21:00 * mmcgrath will get with nb to see hwo far he is wrt getting the dnssec signed geo zones in place. 20:21:12 AFAIK they're there, just need a quick alteration and a named.conf change. 20:21:31 alrighty. 20:21:39 well I think that's all I had for this meeting so 20:21:43 #topic Open Floor 20:21:48 did anyone have anything else they'd like to discuss? 20:22:00 I had one minor note, starting not next week but the week after I'm not going to be around much. 20:22:09 Training the first week, summit the next week. 20:22:49 mmcgrath: what's the weird caching thing you mentioned? 20:22:50 not at the moment 20:24:06 sijis: well, when trying to build out the new internetx site, we ran into some oddities. 20:24:14 basically trying to download some files we got some errors 20:24:20 but bypassing our normal proxy layer caused it to work fine. 20:24:40 it was just some generally odd things. 20:24:50 gotcha. 20:25:19 also, how important is it renaming stuff to include the 0? i could help with that. 20:25:39 i understand is updating the apps, servers too, not just the configs 20:25:44 puppet ones 20:26:10 sijis: eh, that's the thing. Renaming was always kind of asthetic and it's a pain in the ass to actually do. 20:26:38 yeah, we run into that here... naming scheme changes. 20:26:45 yeah 20:26:47 it is always easiert o to do it on a new build 20:26:55 Anyone have anything else there? 20:27:05 if not we'll close the meeting in 30. 20:27:55 #endmeeting