15:30:51 #startmeeting releng 15:30:51 Meeting started Mon Jun 2 15:30:51 2014 UTC. The chair is masta. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:30:51 Useful Commands: #action #agreed #halp #info #idea #link #topic. 15:30:51 #meetingname releng 15:30:51 #topic === Releng Roll Call === 15:30:51 #chair dgilmore dwa nirik tyll sharkcz bochecha 15:30:51 The meeting name has been set to 'releng' 15:30:51 Current chairs: bochecha dgilmore dwa masta nirik sharkcz tyll 15:31:02 morning 15:31:08 * masta waves 15:31:36 * sharkcz is here 15:31:46 we have a few folks running late, so please grab coffee or whatever you have to do... 15:33:55 hola 15:34:30 Hi 15:36:19 hi 15:36:33 ok I guess we can get started 15:36:51 #topic === releng Announcements === 15:38:14 what should be announced here? 15:38:20 hehe 15:38:46 ok, lets move on to tickets then 15:38:50 next monday is a public holiday in Germany 15:38:56 maybe we need to move next meeting? 15:39:05 this could be an announcement 15:39:34 I don't know whether it is an holiday in the USA as well 15:39:54 tyll_: its not, last monday was a us holiday 15:40:02 win 40 15:40:16 #topic === releng Tickets === 15:40:16 #link https://fedorahosted.org/rel-eng/report/10 15:41:15 https://fedorahosted.org/rel-eng/ticket/5877 (Schedule and coordinate mass rebuild for F21) 15:41:47 are we all set for it? 15:41:57 last meeting it came up that the scripts need to be looked at 15:42:12 I remember some problems reporting FTBFS during last cycle 15:42:46 maybe links to build logs were wrong 15:42:48 we should try and make the reporting better 15:42:52 tyll_: yeah, me too, but I don't recall details. ;) 15:43:07 the issue with the ftbfs bugs was that the urls to the logs were wrong 15:43:10 ah yeah, links. 15:43:22 and they only live a short time anyway 15:43:36 btw. will be rebuild in a side tag? 15:43:38 really we shouldnt put links to koji tasks in 15:43:40 will we 15:43:52 tyll_: yes it will all land in f21-rebuild 15:43:59 ill merge it to f21 when done 15:44:09 mass_rebuild_file_bugs.py ? 15:44:13 that way we dont get hundreds of newRepos 15:44:17 we could copy failed build logs to fedorapeople or attach them to bugzilla 15:44:20 I see 15:44:21 yeah 15:44:39 tyll_: scripting attaching to bugzilla is really really hard 15:45:02 you need to get back the bug number and attach them seperately 15:45:04 #info rebuild will happen to f21-rebuild and will me merged into f21 to avoid newRepo tasks 15:45:19 the api doesnt allow you to upload attachments when creatinga bug 15:45:21 it would be nice to attach them, because then they would always exist. 15:45:29 but yeah. 15:45:37 mdomsch always attached them in the past 15:45:42 it could be done, just likely a fair bit of work 15:46:04 maybe we can find his scripts and see how he did it 15:46:11 maybe we can first move them to fedorapeople to save them and attach them to the bugs if we find the time 15:46:26 an option 15:46:29 bugzilla --user=foo '--password=bar' attach -f "$filename" -d "$filename" -t text/plain $bug_id 15:46:39 ^^ this used to work to attach files in bz 15:47:14 sharkcz: but you need to know the bug number 15:47:25 which i think was kinda hard to get 15:47:32 we should try do it 15:47:34 mass_rebuild_file_bugs.py is the script right? 15:47:37 at least look 15:47:42 nirik: yeah 15:48:06 would anyone have time to work on it? I could see if any of the infra folks might... 15:49:23 can try and make time, though i suspect that I wont have any until next week 15:49:39 * nirik also ask around. 15:49:41 we dont need to run it until we merge the tag back in 15:49:48 which i expect to do next monday 15:49:59 we also need a f21 mass rebuild wiki page right? 15:50:08 yeah 15:50:15 I usually do that last minute :( 15:50:49 #action fix mass_rebuild_file_bugs.py to attach build logs to bugs 15:51:00 #action add f21 mass rebuild wiki page 15:51:58 #info rebuild will happen to f21-rebuild and will me merged into f21 to avoid newRepo tasks 15:52:35 shouldn'z zodbot say something if I use a #keyword? 15:52:42 on the plus side we have all builders up and running now... so we should have even quicker time than last time. 15:52:49 tyll: i believe it doesnt 15:52:58 tyll: it doesn't... design decision from the plugin author. 15:53:03 last time was 46 hours 15:53:49 do we re-run the script on the failed builds, to sorta shake-out bad builds? 15:53:50 #info mass rebuild will start friday evening US Central time 15:54:09 masta: nope 15:54:29 the maintainers are responsible for dealling with all failures 15:55:04 #action add f21 mass rebuild wiki page 15:55:06 ok, then afterwards I'll try to triage with the maintainers of any failed builds, or just try them myself 15:55:13 #action fix mass_rebuild_file_bugs.py to attach build logs to bugs 15:56:00 so I pulled a f20 ftbfs bug up, but the links seem ok in it... 15:56:10 oh wait, I see where they are wrong. 15:56:29 nirik: they were pointing at parent tasks not the child that had the logs 15:56:54 right. the base one was ok, but the logs were all wrong. 15:57:37 but ideally we would just attach them. 15:58:20 yeah, just need to fetch them and get the bug number to attach them to 15:59:17 I believe createbug() already returns a bug object with the number 16:00:05 ok folks, I'd like to steer the meeting topic to the next ticket soon. Any final things to discuss on the mass-rebuild? 16:01:07 https://fedorahosted.org/rel-eng/ticket/5870 (rawhide signing) 16:02:10 The current status is in the ticket 16:02:36 I still need access to secondary sigul and become koji admin to be able to do everything 16:02:56 =) 16:02:58 #topic 16:03:14 gahh 16:03:42 #topic https://fedorahosted.org/rel-eng/ticket/5870 16:03:52 the recent builds on Rawhide should already be signed 16:04:01 But there some design decissions left 16:04:33 Since fedmsg is not meant to be reliable, I believe it would be best to use fedmsg messages only as a trigger to sign all unsigned builds 16:04:53 this is currently not easy to do, because there are too many builds 16:04:56 tyll: ill try get the secondary sigul bits sorted out later, we will need Red Hat IT to open up some ports for us 16:05:24 tyll: never going to be a good way to sign the whole tag 16:05:45 tyll: but one way to do it would be to use calls to koji to get builds since the last run 16:05:46 well, if we tell rawhide mash to check the key it can tell us if we missed any... 16:05:46 also the idea is to use a gating tag, e.g (f21-pending or f21-unsigned) and then move signed builds to f21 16:05:53 then sign all those builds 16:06:06 dgilmore: it might be easier with the gating tag I guessed 16:06:13 tyll: could be 16:06:25 Ideally it is only one build all the time 16:06:33 should be a smaller number of builds 16:07:05 but if we miss a fedmsg, we wouldn't know to move that build would we? 16:07:33 nirik: well if when you get teh next one you list the builds in teh staging tag you would 16:07:45 nirik: if always all builds from f21-unsigned are signed and then moved to f21, nothing would be missed 16:08:18 can we create a f21-unsigned tag already but use inheritance to not disturb rawhide compose? 16:08:24 yeah, but if we do that I would like a log/warning. ;) 16:08:29 this would allow to develop and test this 16:08:41 since when you move builds you need to do it per build you will have to list all builds in the tag with inhertance off to get the nvrs to move 16:08:48 ie, in our past tests we have not had any fedmsg lossage, but would be good to know if it started happening. 16:08:55 so you would sign those nvrs then move them 16:09:20 nirik: it is not so easy to detect, because there might be builds were a message is not processed 16:09:53 tyll: well, we have datalogger/datagrepper listening/recording them all too... so we could check against thaat 16:10:19 nirik: yes, some kind of delayed reporting is possible 16:10:30 * nirik nods. 16:11:14 in the past, tflink and I ran a script that 1) listened for koji messages and counted them, and then 2) once an hour, asked koji for all the latest builds. 16:11:16 ah, I just rememeber, I asked threebean whether we could add a fedmsg message for signed builds, to detect problems 16:11:39 it compared the two lists and if there was any discrepancy, it warned us. aside from some early hiccups where we had typos and stuff, there were no dropped messages over a two week period. 16:11:47 but -- that's a strategy for testing if you want to try it 16:11:50 * nirik nods. 16:11:59 then the tag messages could be compared with the signing messages and discrepancies reported 16:12:29 tyll:  16:12:37 gahh 16:12:47 well, if we just sign/move the build(s) we just got a message for and then check that the tag has nothing left in it, that would check too right? 16:13:00 nirik: yep 16:13:10 plenty of ways to skin the cat 16:13:38 nirik: not really, because there might be a message in the queue for the left builds 16:13:43 ie, sign foobar-1.0, move it to the f21 tag, then see that there's baz-1.0 still in f21-unsigned, but we didn't see a fedmsg for it... alert! 16:13:43 if we list the builds so we know what to sign and move we would know also 16:13:55 nirik: at least currently the script only does stuff if it processes a message 16:14:01 if there was two builds in the tag but one on the fedmsg 16:14:17 ah I see... if 10 landed and you were processing... hum. 16:15:31 especially for a first cut, I would be happy with something that just let us know and we could manually fix things... 16:15:55 perhaps time based... 16:15:57 nirik: should be easy to just get t all signed 16:16:18 list the tag, sign the builds, move them 16:17:00 i guess some of it will come down to do we get race conditions from lots of builds landing at once 16:17:21 nirik: time based would work, e.g. sign all builds a message was received for and check for builds that are older than e.g. 15 minutes still left in the tag and report them 16:17:48 I will figure something out 16:18:20 yeah. we also could do some nagios checks on it... 16:18:37 but back to the gating tag, can we introduce it already to be able to add the tag moving feature? 16:19:06 I don't think we can set it up as we will want it to be... until we have something that moves them. 16:19:47 tyll: we can 16:19:58 but could we have buildrawhide move them? 16:20:00 it will effect rawhide 16:20:04 until the script is ready? 16:20:21 nirik: I hoped inheritance can make moving uneccesary 16:20:27 nirik: maybe yeah 16:20:37 e.g. if f21 inherits from f21-unsigned 16:20:44 tyll: but then we need to change it later when the script is ready... 16:20:53 tyll we do not use inheritance in composing 16:21:13 can we tag stuff in two tags at once? 16:21:21 and in this case it wouldnt work 16:21:30 e.g. both into f21 and f21-unsigned? 16:21:42 we would have to have f21-signed inherit from f21 and compose from f21-unsigned 16:21:53 we can tag a build in as many tags as we want 16:22:14 tyll: you just want it to test against? perhaps we could make the tag and just manually have you tag builds into it for testing? 16:22:25 and then hook it up later. 16:23:52 Looks like we got one more ticket, and then open floor. 16:24:19 nirik: hm, it could test the single method for this manually, but since it is triggered by fedmsg it would be inconsistent during a complete run 16:24:23 masta: tickets, secondary arch updates and open floor is the order 16:24:50 dgilmore: ah, thx... will update my notes 16:24:56 tyll: true. Perhaps we can get our staging koji usefull... thats one thing I wanted to work on in the fad this week. ;) 16:25:45 nirik: that would be useful. I need to write some more koji patches for images. need a test koji to test them on 16:27:05 nirik: yes, but then some builds and sigul need to get there too to make it complete 16:27:07 dgilmore: yeah, lots of questions around how to do it, but we can talk about them when you get here. ;) 16:27:24 tyll: yep 16:27:48 #topic https://fedorahosted.org/rel-eng/ticket/5914 16:27:53 tyll: the build side is easy, the sigul harder 16:28:06 (Move fedmsg based blocking service to Fedora Infrastructure) 16:29:06 for this we should setup a user in koji or reuse one like masher 16:29:23 probably have it run as a service on releng04 16:29:31 the the fedmsg based blocking service I wanted to discusss whether we can make it based on dead.package commits 16:29:33 unless we want to have it isolated 16:30:42 tyll: id honestly rather not. but only because it means we have to watch the contents of commits 16:31:01 there is 3 things that need to happen to retire a package 16:31:11 dgilmore: all information is already in the fedmsg message 16:31:16 dead.package, retire n pkgdb and block in koji 16:31:46 people tend to forget to retire in pkgdb, because it requires addtitional credentials 16:32:05 dgilmore: btw. the fedpkg patches need to be merged to fix this 16:32:19 tyll: i merged your patches on saturday 16:32:37 dgilmore: \o/ 16:32:48 just need to build an update 16:33:07 however, I have code to detect dead.package commits already ready 16:33:11 * dgilmore will drop off soon. descent just started 16:34:15 tyll: so im less concerned how it all happens than that it all happens. but I feel like koji blocking should be tied to pkgdb retiring 16:34:59 tyll: so how would you envison it works if it was dead.package based 16:35:04 dgilmore: ah, I would also retire the package in pkgdb if a dead.package commit is detected 16:35:08 would teh service to pkgdb also? 16:35:13 yes 16:35:49 tyll: id rather that the owner do that. as a check that the owner is the one doing it and not someone deciding to go retire the package 16:36:37 dgilmore: in case there is a dispute, it is very cheap to undo the retirement 16:36:55 dgilmore: Also I don't think it will happen that often 16:37:03 maybe we can convince pkgdb to allow koji certs for auth for retiring packages 16:37:27 dgilmore: It would be fas_openid... and I'm not sure that's a good idea. 16:37:34 We still can't encrypt koji certs on disk. 16:37:37 not sure how hard that would be to implement 16:37:47 I mean -- without annoying maintainers out of their minf. 16:38:02 abadger1999: even if we could people will always be able to undo it 16:38:11 Yewah 16:38:43 what about a cron job? 16:39:23 abadger1999: well tyll wanst to have it so if somoene commits a dead.package to git that the service would block in koji and retire in pkgdb 16:39:24 I guess the trigger would have to be in pkgdb if we want to be assured of who requested the retirement. 16:40:42 my concern is that someone could come along and retire somoeone elses package 16:41:12 that someone could also commit bad contents 16:41:21 true 16:41:25 e.g. it is alreay privileged acccess 16:41:35 and retiring is not final but undoable 16:41:44 they would have to be a proven packager 16:41:53 Maybe if fedmsg is detailed enough, have something watch for a message about dead.package being added. Then trigger a retirement in pkgdb and block in koji. The triggered script could check that the git commit was from a pkgdb owner. 16:42:26 abadger1999: but what do we want to do if it was not the pkgdb owner? 16:42:27 We'd also want an infrequent cron job that checked that everything was consistent (in case fedmsg drops a notification.) 16:42:33 i guess emails will eb sent and someone could speak up 16:42:47 yeah 16:43:03 the cron job is required anyhow, because it is all inconsistent already 16:43:26 end goal is to make sure retirement is one step and works 16:44:03 (at least regarding pkgdb, dist-git and koji) 16:44:13 tyll: so using dead.package the person starting the service will have to enter their credentials for pkgdb? 16:44:58 dgilmore: this would be a possibility 16:45:16 just wondering how it would all work 16:45:35 dgilmore: it would also be possible to create a designated account for it 16:45:57 but the service needs credentials to pkgdb for it - but it also needs a certificate for koji 16:47:04 tyll: the koji bit is pretty easy to handle 16:47:09 tyll: So.. if we're requiring pkgdb credentials... maybe it makes more sense to submit it to pkgdb and have pkgdb manage the git and koji retirement. 16:47:13 the pkgdb side is trickier 16:47:13 I would actually prefer to have a designated account that has only hte necessary pkgdb cretdentials 16:47:55 abadger1999: yes, it would require more coding, though 16:48:02 16:48:08 i need to run now. laptop has to be turned off 16:48:13 in a few moments... let us move on to Secondary arch updates, and then open floor. (we are running over already) 16:48:33 dgilmore: thanks and have a great day =) 16:48:58 tyll: any final thoughts on this ticket? 16:49:20 masta: no, thank you, we need to find a consent 16:49:56 alright, then we can move along.... 16:50:10 #topic === releng Secondary Architectures === 16:50:24 #topic PPC64 16:50:43 dwa or sharkcz: any things to update? 16:50:51 yeah 16:51:00 ppc64 is catching up primary to get to the state when we can merge with ppc64le 16:51:07 #info ppc64 is playing whack-a-mole to catch up to primary with deps to prepare for merging ppc64le 16:51:23 #info ppc64 may though delay doing the mass rebuild so we can get ppc64le imported and do it all at once 16:51:56 #info ppc64 has completed updating all builders to Fedora 20 now, huzzah 16:52:37 also for any who didn't already know, this is my last week at Red Hat. masta (and dennis and nirik) will have the keys to the ppc64 kingdom :) 16:52:38 cool 16:53:14 dwa: good luck with your endevors... it's been great working with you! 16:53:35 nirik: thanks, same here :) 16:54:02 I think that does it for me 16:54:16 dwa: try not to charge too high a consulting fee if I come looking for you to ask questions. ;-) 16:54:35 lol. I've told johnf that I'll happily answer questions :) 16:55:01 happy and sad all at once. 16:55:31 #topic aarch64 16:55:39 ok, so that is me. 16:56:26 we are set for the rebuild, I think maybe the usual scramble to patch things just prior where possible. 16:56:57 alright... 16:57:00 I found out aarch64 koji is missing fedmsg integration, are there any plans to change it soo? 16:57:37 tyll: sure, we can probably do that. I was unaware. 16:58:02 tyll: I'll syncup with you later or tomorrow about that. ok? 16:58:17 masta: threebean can help you more proabably 16:58:26 masta: I noticed it only because of the Rawhide signing 16:58:28 tyll: there are plans. dgilmore has asked me about it recently but we just haven't gotten time to do it. 16:58:39 tyll: ack 16:58:50 it needs to be handled 'by hand' since the secondary arch koji instances are not under infrastructure ansible control (afaik) 16:59:04 I see 16:59:08 #topic === releng Open Floor === 16:59:23 masta: you missed s390 :-) 16:59:28 what! 16:59:29 ok 16:59:32 s390 is catching up primary after the glibc ABI breakage, we went from the end of January to end of April during May, so it goes quite well, naturally some packages don't like us and needs some attention :-) 16:59:45 #topic S390 17:00:02 #info s390 is catching up primary after the glibc ABI breakage, we went from the end of January to end of April during May, so it goes quite well, naturally some packages don't like us and needs some attention :-) 17:00:17 eom 17:00:34 #topic === releng Open Floor === 17:00:41 sharkcz: my sry's 17:00:46 masta: np 17:01:08 I want to propose to handle add a list of the usual meeting topics to the wiki 17:01:42 good idea 17:01:54 also I have ideas for two more generic topics: recap important changes since the last meeting 17:02:28 and to announce current/upcoming tasks to make it more transparent what is happening 17:02:53 * nirik has to leave now... will catch the rest in logs. 17:03:07 our meetings seems to consume much time already. 17:03:20 but these are good ideas tyll 17:03:23 and as a short last item, discuss the next meeting date, e.g. when the next one will happen 17:04:27 great idea 17:04:56 hm, then we should decide what we want to use the meetings for - maybe discussing the tickets in too much detail is not so good 17:05:03 I've added all of these to my meeting notes... the doc I go by to conduct a meeting... which I just wrote as we held this meeting... it could become the wiki doc tyll 17:06:39 tyll: well the problem is there was good discussion... it's hard to predict how good our meetings will be... so best not to schedule things immediately after releng irc. 17:07:31 alrighty, good stuff... I'm going to shut it down. 17:07:38 ok 17:07:38 thanks all! 17:07:45 #endmeeting