13:01:26 #startmeeting 13:01:26 Meeting started Mon Aug 17 13:01:26 2015 UTC. The chair is mvollmer. Information about MeetBot at http://wiki.debian.org/MeetBot. 13:01:26 Useful Commands: #action #agreed #halp #info #idea #link #topic. 13:01:31 .hello mvo 13:01:32 mvollmer: mvo 'Marius Vollmer' 13:01:40 #topic Agenda 13:02:00 * Multipath 13:02:12 * journalctl regression 13:02:24 * Further testing fires 13:03:15 * hubbot disk usage 13:03:33 .hello dperpeet 13:03:36 dperpeet: dperpeet 'Dominik Perpeet' 13:03:46 .hello stefw 13:03:47 stefw: stefw 'Stef Walter' 13:03:59 * Makefile conventions 13:04:07 dperpeet, rpmquery is available 13:04:10 * Internet Explorer 11 13:04:12 petervo, thanks! 13:04:43 * Moving things out of the shell 13:04:55 * Moving things out of cockpit-wrapper 13:06:38 ok, let's start 13:06:45 #topic Multipath 13:06:52 So I have started playing with this 13:07:09 some notes here: https://github.com/cockpit-project/cockpit/wiki/Storage---Multipath 13:07:17 [cockpit] stefwalter opened pull request #2590: Update cockpit.spec for changes from RHEL (master...rhel-spec-updates) http://git.io/v3pmW 13:07:46 should be almost straightforward to get Cockpit "multipath safe" 13:07:59 more work to make it "multipath nice" 13:08:07 nice research 13:08:14 how much of the work should go into storaged? 13:08:17 [cockpit] dperpeet pushed 1 new commit to master: http://git.io/v3pmd 13:08:17 cockpit/master 967318a Marius Vollmer: test: Only skip check-journal with systemd 219-21.... 13:08:23 i think we should offer UI for setting multipath up and configuring it 13:08:38 storaged should expose the mpath configuration 13:08:45 basically the multipathd state 13:08:46 makes sense 13:09:08 that is, storaged would need to wrap a d-bus api around multipathd 13:09:37 the first bit of info we need is to identitfy which of the multiple block devices is the 'leader' 13:10:12 phatina, have you seen the research mvollmer has done on multipath? 13:10:13 initially, I'll do some regexp matching to find the device mapper device, but storaged should do that more directly 13:11:02 I also found a bug in Cockpit re a misunderstanding of the storaged Drive property. 13:11:21 stefw: yes 13:12:32 phatina, I just learned about the sysfs slaves/ and holders/ construct 13:12:46 maybe storaged could expose that? 13:13:06 but I tink storaged should also natively know about multipath 13:13:20 and then we don't need that generic slaves/ holders/ thing 13:13:52 phatina, https://www.redhat.com/archives/dm-devel/2006-February/msg00091.html 13:15:06 so anyway, I continue playing with this to learn more about which pieces can be relied upon. 13:16:14 next topic? 13:16:29 yip 13:16:45 #topic journalctl regression 13:17:27 A regression was committed up stream 13:17:28 to systemd 13:17:33 where journalctl -f -t unmatched 13:17:34 no longer works 13:17:39 219-21 13:17:47 and that patch was backported to 219-21 13:17:51 and then pushed through f22 updates 13:18:00 systemd git master now contains my fix to patch this 13:18:05 but it's not yet fixed in F22 13:18:21 https://bugzilla.redhat.com/show_bug.cgi?id=1253649 13:18:32 Tests should be failing with this 13:18:41 but we've shut off journal tests 13:18:46 when that systemd version is present 13:19:02 we need to remove the workaround once that bug is fixed 13:19:35 that's it 13:19:37 http://files.cockpit-project.org/hubbot/bug_status.html 13:19:44 should be listed there ^^ 13:19:59 cockpit itself is also broken by the regression, of course. 13:20:06 yup 13:20:23 but not completely: as soon as the display is not empty, it will work 13:20:33 i hope that is a rare thing 13:20:52 should we extend the tests? 13:20:55 and catch the issue 13:21:03 / handle it 13:21:04 well, we do... :-) 13:21:28 ahh, you mean workaround in the cockpit code? 13:21:37 yeah 13:21:41 detect a non-blocking journalctl? 13:21:45 I don't think it's that necessary 13:21:47 I'd say no. 13:22:04 what would we do, poll? 13:22:19 we certainly shouldn't fix our journal code to do that 13:22:29 there's no practical way 13:22:33 we should only do it when f22 drags their feet too long 13:22:43 we should raise it with FESCO if they do 13:22:47 i'll prepare a patch to the rpm build 13:22:51 yeah 13:22:57 that makes more sense 13:23:08 why did they include the patch in the first place? 13:23:25 probably to fix the bug that I have reported way back... :-) 13:24:21 https://bugs.freedesktop.org/show_bug.cgi?id=71548 13:25:09 next topic? 13:25:39 #topic Further testing fires 13:26:02 phantomjs is quite often crashing with check-login, no? 13:26:11 yeah it's the first time i've s tarted seeing that 13:26:15 was it due to disk space issues? 13:26:26 right, might be 13:26:42 we can start a blanket retest of failed tests tonight 13:26:46 /home was 100% full. 13:26:49 when it doesn't block that much 13:27:14 this was due to a reluctance to automatically delete images 13:27:23 mvollmer and I fixed this today 13:27:24 yeah, my fault 13:27:38 hubbot now cleans up instead of pushing things into the basement 13:28:02 we still use about 120 GiB 13:28:09 how much space do we have left? 13:28:23 let me check 13:28:26 daily master checks are a pretty significant drain 13:28:34 /dev/mapper/vg_dhcpclient06-lv_home 204G 137G 57G 71% /home 13:28:39 we could automatically delete those as soon as a test finishes 13:28:45 heh, not so useful 13:28:50 indeed 13:28:52 57G available 13:29:34 we haven't changed the wildcard regarding images to keep yet 13:29:44 right now I believe all of a pull requests images are kept 13:29:55 while we only really need the newest one 13:29:58 36G cockpit-data 13:30:05 46G hubbot 13:30:12 not as bad as I thought 13:30:37 9.5G /srv/testdata/ 13:30:37 we can tweak this a bit 13:30:59 or even change behavior to more aggressively prune when disk space is running low 13:31:18 one idea is to only keep the last run for a given commit 13:31:31 now we keep them all, mostly because lazy 13:31:55 anyway 13:31:57 we probably need to do both 13:31:58 other fires? 13:32:02 i'll also try to look at more fires, and fix them 13:32:04 there's this one: 13:32:12 https://github.com/cockpit-project/cockpit/issues/2591 13:32:22 the thing that keeps me going is that so many of these are real bugs 13:32:25 both in cockpit and elsewhere 13:33:16 yep 13:33:42 mvollmer, can i assign that one to you? 13:33:48 yes 13:34:21 dperpeet, have you discovered anything about Travis? 13:34:30 is it time to move away from Travis if it keeps falling over? 13:34:33 or should we disable our root tests 13:34:35 the tests seem to work again 13:34:51 the thing is, I have the feeling they want to move away from the type of test we use 13:34:52 but i still see broken output: 13:34:52 https://travis-ci.org/cockpit-project/cockpit/jobs/75573396 13:34:54 towards non-root containers 13:35:00 i can move it to non-root 13:35:03 but it means disabling certain tests 13:35:06 that never get run anywhere 13:35:21 all the tests I started after last friday afternoon seemed fine 13:35:30 what about this one? 13:35:30 https://travis-ci.org/cockpit-project/cockpit/jobs/75573396 13:35:39 i can restart it 13:36:07 that's three days ago 13:36:09 I think 13:36:12 from the date 13:36:31 mvollmer, what was the other ci system you linked? 13:36:41 semaphoreci.com 13:37:09 they have caching 13:37:12 from one non-open-source to another 13:37:13 wheeee 13:37:51 I can test how much effort it would be to move to travis' new structure 13:37:57 why do we use travis in the first place? 13:38:03 because it was so easy 13:38:11 could we extend hubbot to do the job now? 13:38:27 it's just make check et cetera 13:38:30 we could have a dedicated vm 13:38:40 and you've invented jenkins 13:38:45 well already done that with hubbot 13:38:46 adds load to our server 13:39:04 a distinct service adds variety to the testing 13:39:25 how about I see how much effort it would be to get our tests to run in the new travis? 13:39:32 with containers 13:39:54 they require non-root but supposedly cache better 13:40:06 it's just one line change 13:40:18 i can make that change 13:40:20 not a problem 13:40:24 well two lines 13:40:32 unless something doesn't work :) 13:40:40 well, go for it 13:40:48 if something crops up, we can start putting out the fires 13:42:30 hmmm, no it's not thath easy 13:42:37 "If you require sudo, for instance to install Ubuntu packages, a workaround is to use precompiled binaries, uploading them to S3 and downloading them as part of your build, installing them into a non-root directory." 13:42:56 like I said, non-root 13:43:27 is running the unit tests in mock good enough? 13:43:37 the output is hidden, but we can polish that up 13:44:02 'make distcheck' 13:44:05 'make check-memory' 13:44:09 in mock on travis or with hubbot? 13:44:13 and a full autogen.sh 13:44:14 hubbot 13:44:17 all those things need to be tested 13:44:23 to have a similar feature set 13:44:33 we could have a vm and do what travis does 13:44:45 except we cache the updated base system 13:44:49 also clang builds 13:45:15 but I'd say we didn't get much value from those, or did we? 13:45:16 yup 13:45:25 well often one would fail and not the other 13:45:30 we'd be putting more eggs into the hubbot basket 13:45:45 and load on the server 13:46:18 ok, time for the next topic? 13:46:29 one last question 13:46:38 should we arrange more discussion about this? 13:46:39 stefw, do you want to look into travis changes? 13:46:45 just to be clear on the action 13:46:47 yes doing that now 13:46:50 ok 13:46:55 /eot 13:47:16 #topic hubbot disk usage 13:47:26 we have covered that 13:47:37 #topic Makefile conventions 13:47:54 dperpeet? 13:47:55 background on this is that we have quite a few variables in the Makefile.am files 13:48:00 for the individual packages 13:48:06 and it's not that easy to write or review 13:48:15 to see if files are missing, and which file goes into what 13:48:33 do we want to simplify this or somehow write up a mini-guideline? 13:48:40 or do we even have one and I missed it 13:48:51 with debugdata, gz, min, data, ... 13:48:57 bundle 13:49:13 we need a guideline 13:49:20 unfortunately it seems hard to make it simpler than it is 13:49:22 * stefw cries 13:50:02 maybe we can add a sanity check 13:50:13 compare with files present 13:50:18 or even write one if necessary 13:50:27 and those files need to be in the variables somehow 13:50:45 we can agree on writing it into the wiki or hacking.md 13:50:56 [cockpit] stefwalter opened pull request #2592: Use Travis container based infrastructure (master...travis-container-based) http://git.io/v3puJ 13:51:10 I guess I got some details wrong/forgot some file and that is what broke Travis 13:51:48 well because travis runs 'make distcheck' 13:52:01 that passed for me locally 13:52:05 for me too 13:52:39 also difference between --enable-debug 13:52:40 and not 13:52:43 so it's good to have these checks 13:52:56 enable-debug is probably the culprit 13:53:21 of course it is - I can start writing the guidelines 13:53:40 #action dperpeet to start wiki page on Makefile.am conventions for packages 13:54:20 next? 13:54:47 #topic Internet Explorer 11 13:54:56 * mvollmer rushes a bit, sorry 13:55:15 I was able to reproduce the error message we saw in https://github.com/cockpit-project/cockpit/pull/2566 13:55:29 and I couldn't see it locally after a patch, waiting on tests to run 13:56:05 functionality seems fine 13:56:18 so after merging current cockpit should work fine with IE11 13:56:21 /eot 13:57:03 # Moving things out of the shell / out of the cockpit wrapper 13:57:13 i did some mechanical work there 13:57:19 lots more still to do 13:57:38 one goal could be to first get the shell into a clean state 13:57:48 and move the legacy stuff somewhere else 13:58:01 and then continue to clean this up, as we feel like it 13:58:33 I think refactoring is more important than actual cleanup at this point 13:58:53 a clean structure makes it easier to clean the smaller parts as we touch them 13:59:26 moving stuff out of cockpit-wrapper is important 13:59:29 as it lets us run as a container 14:00:43 yep 14:01:01 I like the level of change you put into it so far 14:01:02 ok, my plan is to return to this after multipath 14:03:17 sounds good 14:03:37 with priority on cockpit-wrapper 14:04:04 yeah 14:04:12 #topic Any other biz 14:05:10 we need to do a release today 14:05:19 everything marked priority is targetted to go in 14:05:55 ok, I'll do https://github.com/cockpit-project/cockpit/pull/2589 14:06:13 subin, when #2572 is ready, i can review again 14:06:23 and i'll try to merge #2566 when it's done testing 14:06:25 mvollmer, i'm in the process of doing that 14:06:36 stefw , working on it 14:06:38 oh, sorry. 14:06:41 great 14:07:59 stefw, the fixup goes into the second commit (fix race condition) 14:08:02 in #2566 14:08:09 ok 14:08:15 if it works :) 14:08:28 ok 14:08:32 #endmeeting