15:33:12 #startmeeting RELENG (2016-04-11) 15:33:12 Meeting started Mon Apr 11 15:33:12 2016 UTC. The chair is dgilmore. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:33:12 Useful Commands: #action #agreed #halp #info #idea #link #topic. 15:33:12 The meeting name has been set to 'releng_(2016-04-11)' 15:33:12 #meetingname releng 15:33:12 #chair dgilmore nirik tyll sharkcz bochecha masta pbrobinson pingou maxamillion 15:33:12 The meeting name has been set to 'releng' 15:33:12 Current chairs: bochecha dgilmore masta maxamillion nirik pbrobinson pingou sharkcz tyll 15:33:15 #topic init process 15:33:18 morning 15:33:22 * sharkcz is here 15:34:28 * pbrobinson waves 15:34:36 lets get started 15:34:41 there is no tickets 15:34:52 #topic Fedora 24 status 15:35:05 Just wanted to go over where things are 15:35:26 I worked with lsedlar friday 15:35:40 pungi is now making ostree repos as runroot tasks in koji 15:35:46 nice! 15:35:57 currently only enabled for rawhide, need to do f24 todat 15:35:59 today 15:36:21 .hello maxamillion 15:36:22 maxamillion: maxamillion 'Adam Miller' 15:36:25 sorry, was getting coffee 15:36:31 I merged in what should be the last fix to getting the ostree installer dvd working 15:36:44 and I need to get the atomic cloud images enabled 15:36:46 cool 15:37:15 we need some pungi aand koji work to polish it and make it more functional 15:37:33 I also want to try make the nightly two week compose using pungi 15:37:57 it will remove some of teh fragility of the existing way 15:38:04 that would be nice 15:38:19 +1 15:38:23 #info ostree deliverables for a first pass in pungi nearly complete, polish to come 15:38:47 #info 32 bit arm docker base image enabled 15:39:10 I also enabled on Friday building 32 bit arm docker base images 15:39:37 need to work with maxamillion and poke docker upstream on how to ship it 15:39:57 dgilmore can you work with me too, I have that problem to solve on all other arches 15:40:05 pbrobinson: sure 15:40:27 hopefullyi the same process for armhfp works for all the other arches also 15:40:31 dgilmore: +1 15:40:44 I'd definitely like to add that to the list of stuff we're pushing 15:40:55 need to also make sure that when we put the x86_64 image in the fedora docker registry we put the arm one there also 15:41:05 possibly also the secondary arches 15:41:06 dgilmore: yep, but I want to know all the details of said process.... 15:41:13 would enable some interesting things like kubernetes/openshift on arm .... (pending we can get some of the weird segfaults sorted) 15:41:34 pbrobinson: sure. right now we have no clue, maxamillion asked a few weeks back and got no answer 15:41:34 maxamillion: weird segfaults on arm or in general? 15:41:41 pbrobinson: on arm, for openshift 15:41:50 maxamillion: cool 15:41:57 pbrobinson: i build openshift, it died because it could not figure out the cpu speed 15:42:14 built even 15:42:33 jim perrin from centos did as well on some arm hardware, it also blew up in spectacular fashion but differently than what dgilmore was seeing 15:42:40 I tried building kubernetes on aarch64 the other day and it died in some weird way so I moved on to other golang horrors instead 15:42:48 hopefully we can get all of it upstream and sort that stuff out 15:43:24 #info intresting problems to sort around docker registries and non wx86_64 support 15:44:02 I think the one big remaining problem for f24 is s390 15:44:16 but I doubt we can solve that here in 5 minutes 15:45:44 #info we also need to sort out building and signing windows binaries for live usb creator 15:45:45 maxamillion: I've fixed up a bunch of dependencies on some of the lower packages, I've got a feeling I know what some of the issues might be (sadly, yes!) 15:46:30 pbrobinson: huzzah! 15:47:31 does anyone have any f24 thinsg they want to bring up? 15:48:17 we're just over a week out of freeze right? 15:48:42 The storage upgrade that was going to be at alpha release time that got postponed, is now likely to be next tuesday (beta freeze start) 15:48:44 https://fedoraproject.org/wiki/Releases/24/Schedule says April 19th 15:48:59 so yeah, next Tuesday 15:49:24 nirik: that should be okay 15:49:44 I am not sure it's set yet, they said it was likely then... we will see. 15:49:53 okay 15:50:11 maxamillion: so layered images 15:50:17 nirik: that should be non distruptive or total outage? 15:50:22 dgilmore: yo 15:50:24 maxamillion: will it be in prod by then? 15:50:39 dgilmore: I sure hope so 15:51:11 dgilmore: if I can figure out what's broken with my docker registry auth behind the proxy, I think I could get it all done in stage properly, but I'm still a bit at a loss as to what's going on here 15:51:26 pbrobinson: well, they need to throttle us, so that might make things slower, we are looking at locking master mirror rsync to tier1 mirrors for that time... but the actual event is supposed to be non disruptive. (no outage) 15:51:28 maxamillion: okay, I will try look this arvi 15:51:30 arvo 15:51:36 dgilmore: arvo? 15:51:43 afternoon 15:51:57 nirik: ACK, I figured as much, thanks 15:52:10 dgilmore: ah ok, thanks 15:52:26 dgilmore: yeah, I have a meeting right after this one so I likely won't get back around to it until after lunch 15:52:43 maxamillion: okay, lets try look at it then 15:52:47 dgilmore: sounds good 15:53:16 #topic Secondary Architectures updates 15:53:19 #topic Secondary Architectures update - ppc 15:53:25 lets jump into ppc 15:53:30 statistics: {'older': 158, 'local_only': 13, 'remote_only': 366, 'same': 17552, 'newer': 3, 'total_missing_builds': 357} 15:53:30 pbrobinson: how is it? 15:53:38 pretty close 15:53:38 we're looking pretty good 15:54:04 builds mostly there, blocking on a a couple of core packages, both now filed as FE to get some actual traction on them 15:54:20 the two I filed affect all secondary platforms 15:54:28 :) okay 15:54:48 not good that we have bugs only on all secondary arches 15:54:59 likely means someone did something silly 15:55:01 I suspect it's all gcc6 fall out 15:55:29 okay 15:55:32 nope, their code knows only x86 and arm, eg. 386-ds-base 15:55:36 amuses me that v6 was suppose to be a minor bump and fully compatible with 5 as it's been one of the worst bumps for a while 15:55:48 sharkcz: yes, on that one I agree 15:55:49 always fun 15:56:00 but they've often been x86ist 15:56:29 probably as it is due to visibility 15:56:32 we've got some golang bits to close out (and a few for aarch64 too) around container stuff but other than that I think ppc is pretty feature complete 15:56:47 we never really did a great job of letting people know when things fail on secondaries 15:56:48 and not looking too bad 15:57:08 #info ppc is looking good right now 15:57:10 ack 15:57:11 #topic Secondary Architectures update - s390 15:57:11 dgilmore: one of the many reasons I'd like the koji convergence :) 15:57:19 sharkcz: so how goes s390? 15:57:39 we do need to figure out how to make the runroot tasks work somehow 15:57:39 we're ready to go for pungi stuff, blocking on the storage team poking the netapp over exports 15:57:45 after pbrobinson installed the f23 based x86 builder for newRepos we are getting closer again 15:58:07 dgilmore: once I can get pungi working (netapp export issue) I'll be looking closer at the runroot side of things 15:58:29 mean I'll close 1300669 as an infra issue 15:58:29 pbrobinson: okay 15:58:48 pbrobinson: I have one builder ready to be used as runroot one 15:59:12 sharkcz: OK, I'll sync with you offline on that 15:59:24 pbrobinson: ok 15:59:29 sharkcz: on that we have set install_weak_deps=0 everywhere 15:59:47 pbrobinson: whats the status on the ppc8 rebuilds? 15:59:47 something you were kinda getting for free 16:00:24 dgilmore: yep, but we needed to get over the older builder that required weak deps installed 16:00:44 nirik: getting there, they dropped back a bit while I dealt with other more pressing things 16:01:11 ok. right now ansible is a bit confused. ;) Since all the ones in inventory are .ppc.fedoraproject.org and it can't reach them at that. 16:01:25 so they don't appear and their guests don't appear as reachable 16:02:07 nirik: I'll try and get to that again shortly 16:02:18 np, just wanted to mention it. 16:02:24 let me know if I can assist any 16:02:31 nirik: yep, am aware 16:06:08 * nirik has a few small items for open floor. 16:06:22 #topic Secondary Architectures update - arm 16:06:28 pbrobinson: how is arm 16:06:41 statistics: {'older': 62, 'local_only': 1, 'remote_only': 315, 'same': 17702, 'newer': 0, 'total_missing_builds': 134} 16:06:52 aarch64 is closer than ppc 16:06:59 stats wise looking good, same packages issues as ppc 16:07:22 #info same issues as ppc but slightly closer build wise 16:07:22 spent some time over the weekend fixing up other stuff 16:07:48 should hopefully have cloud/docker images enables for tomorrow's nightlies 16:08:07 #info docker and cloud images to be enabled soon 16:08:39 and also "disk images" hacked in too as we need to use lmc which doesn't have koji integration yet (appliance builder doesn't work there) 16:08:55 this week for all of them one way or the other! 16:09:22 nothing much else to report there 16:09:41 pbrobinson: cloud images are just a different type of disk image, maybe they can just be done with imagefactory 16:09:53 dgilmore: possibly 16:09:58 just using a different kickstart 16:10:05 I did think about that as an option 16:10:14 just enable raw-xz only 16:10:31 may be simpler 16:10:53 * linuxmodder pops in to brush up on arm 16:11:03 anything else? 16:11:20 not from me 16:11:31 #topic Open Floor 16:11:39 nirik: you have some things? 16:11:48 yeah, just a few quick ones. ;) 16:12:10 1. we have some outages this week... tomorrow for build stuff, wed for the rest... updates/reboot cycle 16:12:20 I was thinking I'd update the builders later this week/weekend... 16:12:34 okay 16:12:50 nirik: there is a manual schema update needed on koji 16:12:50 nirik: ignore ppc/arm and I'll do them, the former will be done as part of the rebuilds anyway 16:12:56 and per our discussion the other day, I am going to add some more buildvm's to stg with nested virt enabled. ;) 16:12:58 builders that is 16:13:18 nirik: primary koji has had it, but teh other koji's have not 16:13:19 dgilmore: ok, you want to do that part? or I can if you tell me what. 16:13:26 pbrobinson: ok. 16:13:31 or I guess pbrobinson can do it. ;) 16:13:36 nirik: either way, just need to be aware of it 16:14:05 it is not in a schema file 16:14:27 dgilmore: so where is it then? 16:14:41 a pull req somewhere? 16:14:44 pbrobinson: you want me to do hubs tomorrow? or you want to do them when doing builders later? or ? 16:14:51 alter table repo add signed BOOLEAN DEFAULT 'false'; 16:15:04 pbrobinson: it is part of jays signed repo changes 16:15:16 dgilmore: I'd figured a much 16:15:38 it is in the schema file, but there is no update schema file 16:15:53 nirik: I'll upgrade the koji + db changes on the hubs, can you do the patch/reboots on them? 16:16:16 sure. Sounds good. 16:16:26 outage tomorrow starts at 21UTC... 16:17:04 nirik: I'll do the secondary hubs tonight or tomorrow morning so I'm out of your way :) 16:17:13 Oh, last thing... jberkus is looking at our koji db issues. I gathered a bunch of info for him and he's going to look and get back to us later today. 16:17:34 hopefully we will find out why primary is at least getting so hammered 16:17:58 * nirik doesn't know if secondary has seen slowdowns 16:18:27 nirik: awesome, thanks for getting the data to him 16:19:14 nirik: they're mostly OK, did you apply the DB optimise bits to them? 16:19:22 yep. 16:19:26 they likely get hit a lot less than primary 16:19:32 yeah. 16:19:36 nirik: seems s390 hub is running fine, which slowdowns primary sees? 16:19:51 sharkcz: the db gets heavily loaded and things like kojira can't work 16:20:11 https://admin.fedoraproject.org/collectd/bin/index.cgi?hostname=db-koji01.phx2.fedoraproject.org&plugin=load×pan=604800&action=show_selection&ok_button=OK 16:20:34 nirik: is the primary DB on shared storage or local? 16:21:01 it's on iscsi... 16:21:40 but it was fine until about january... 16:21:43 https://admin.fedoraproject.org/collectd/bin/index.cgi?hostname=db-koji01.phx2.fedoraproject.org&plugin=load×pan=31622400&action=show_selection&ok_button=OK 16:22:52 nirik: yes, but you've got a lot more up in the air, 1gb links, switch ports, network queues etc etc 16:22:54 anyhow, we should know more later today. 16:23:06 nirik: I suspect it was when we updated to support content generators 16:23:06 it's actually 10G. ;) 16:23:41 and the db schema is not optimised 16:23:47 or the queries are horrible 16:23:57 yeah. 16:24:25 I know there's some gigantic queries... might be people doing dumb things with the web interface, or crawling things 16:24:36 and that 16:25:54 * nirik had nothing else. 16:26:09 Hi, I have a question 16:26:37 a rumour has reached me that releng wants to drop alpha releases starting from F26 - any info about this plans ? Or is it just a false info ? 16:29:13 jkurik: we have said it a lot we plan to do it 16:29:24 jkurik: we have talked to adamw about it 16:29:44 * nirik has heard about it, but I haven't seen any formal proposal 16:29:55 jkurik: until we are sure that the nightly testing is sufficent to keep things always at an alpha level there will be no changes 16:30:02 we would need to figure out what happens to the milestones around that 16:30:07 so far it is something we would like to do 16:30:33 but we can not propose it until we are sure that we can deliver at a high quality without it 16:30:54 jkurik: so for right now it is just an idea 16:30:59 not set in stone 16:31:04 dgilmore: ok, thanks for the explanation; please keep in in the loop if there are any changes regarding milestones 16:31:18 It does mean less press... 16:31:26 but not sure how big a deal that is 16:31:41 jkurik: as soon as we have something concrete we will be sure to let you know 16:31:44 * adamw is all for it. 16:32:08 great, that is all I had 16:32:10 it boils down to can we do it and not comprimise the release quality 16:33:17 one quick thing from herr 16:33:18 here 16:33:40 https://careers-redhat.icims.com/jobs/51819/software-engineer/job 16:33:42 https://careers-redhat.icims.com/jobs/51274/software-engineer/job 16:34:04 two jobs in Release Engineering at red hat 16:34:25 the first is in Westford, the second is in India 16:34:32 cool. ;) 16:34:34 +1 16:34:40 they will have osme level of interaction in fedora 16:34:48 the first will have quite a bit 16:35:02 there is alos another one coming that will be based in Brno 16:35:40 that is all I have 16:35:49 looks like releng wants to implement the "follow the sun" availability 16:35:51 if nothing else I will wrap up 16:38:46 #endmeeting