12:01:59 #startmeeting Weekly Community Meeting 31/Aug/2015
12:01:59 Meeting started Wed Aug 31 12:01:59 2016 UTC. The chair is rastar. Information about MeetBot at http://wiki.debian.org/MeetBot.
12:01:59 Useful Commands: #action #agreed #halp #info #idea #link #topic.
12:01:59 The meeting name has been set to 'weekly_community_meeting_31/aug/2015'
12:02:20 #topic Rollcall
12:02:35 * kshlm \o_
12:02:37 * ndevos _o/
12:02:46 * aravindavk is here
12:03:01 * jdarcy o/
12:03:09 * kkeithley is here
12:03:17 * rjoseph o/
12:03:33 o/
12:03:41 * jiffin1 is here
12:04:02 today's meeting will be co-chaired by jiffin
12:04:07 #chair jiffin
12:04:07 Current chairs: jiffin rastar
12:05:04 will wait for two more mins
12:05:37 * msvbhat is here
12:06:34 Hello everyone. We will start now.
12:06:44 * partner here
12:06:50 * atinm is here too
12:06:55 o/
12:07:06 #topic Next week's meeting host
12:07:42 Do we have any volunteer?
12:08:09 Me.
12:08:20 I already put my name in the pad.
12:08:24 #info kshlm will host the next meeting.
12:08:38 #topic GlusterFS-4.0
12:09:22 Nothing from me. On vacation, plus multiplexing work.
12:09:33 Not much from me either.
12:10:10 I've been doing some refactoring of the transaction framework. And I need to push it out.
12:10:23 Also, last week gRPC-1.0 was announced.
12:10:36 I'm thinking of giving it a go once again.
12:11:08 Cool.
12:11:22 Moving on.
12:11:32 #topic GlusterFS-3.9
12:12:09 aravindavk: any updates?
12:12:20 i don't see pranith or dblack
12:12:22 today is last day to get patches in
12:12:43 Pranith sent mail to followup with feature owners
12:13:09 I see that most of the owners replied back.
12:13:25 If any of you haven't replied, please do.
12:13:28 * obnox arrives late
12:14:12 Just an update from testing side
12:14:55 We have tests/features/lock_revocation.t hanging on NetBSD
12:15:06 which has caused the build pipeline to increase.
12:15:33 If you have a patch that needs to be merged in 3.9, please monitor NetBSD results and follow up.
12:15:42 Contact nigelb or kshlm or rastar
12:15:52 keep aravindavk aware about it
12:16:10 do we know since when this test started to hang?
12:16:35 not exactly, but the guess is around 3 weeks
12:16:59 are we marking this as a bad test?
12:17:01 I can get a number post-meeting.
12:17:06 hagarth: that is done.
12:17:10 rastar: ok
12:17:11 But pretty sure it's longer than 3 weeks.
12:17:12 hagarth: Just merged the patch to do so. http://review.gluster.org/#/c/15374/
12:17:15 hagarth: patch needs to be merged though
12:17:21 thanks jdarcy
12:17:30 great, thanks jdarcy!
12:17:37 I have posted possible fix at http://review.gluster.org/#/c/15375
12:17:45 The patch that added it was merged in mid-july
12:17:52 I just hit the merge button. Others did the work. ;)
12:18:05 that is based on a hunch that dd wasn't complete while the test proceeded to append
12:18:27 anyways, it was just an update to people blocked by it.
12:18:28 rastar: many test unit deletions with 15375?
12:18:32 http://www.gluster.org/pipermail/maintainers/2016-August/001262.html
12:18:37 jdarcy: teamwork ftw! ;)
12:18:40 so I reported it on 19th August
12:18:55 hagarth: that is a test, so that I can re-run it many times in less time.
12:19:09 hagarth: if that works, I will update the patch to restore all tests.
12:19:12 rastar: ok, good idea :)
12:19:22 hagarth: easier than having a NetBSD VM on laptop.
12:19:33 rastar: right
12:19:58 any more updates?
12:20:56 Moving on.
12:20:57 my understanding (after discussing with rastar) is that the test is actually a bad one and needs improvement, there is nothing wrong in the code, right rastar ?
12:21:13 atinm: not sure yet.
12:21:26 if 15375 passes we will be sure
12:21:37 rastar, yes that's what I assumed too :)
12:21:46 it looks a little bad that a single test can hange a whole VM...
12:22:06 I would expect the cleanup function to get the VM back into running
12:22:11 ndevos: it is deadlock test
12:22:13 that's how the test case is written
12:22:18 ndevos: so kind of expected.
12:22:49 sure, but should we not improve the cleanup to be able to recover from it?
12:23:13 ndevos: it does not proceed to cleanup
12:23:14 BTW during the multiplexing work I found a case where cleanup was not getting called. I should send a separate patch for that.
12:23:29 ndevos: there is a line with & at the end
12:23:43 ndevos: which is a dd process. cleanup is not aware of the process.
12:23:56 Perhaps for the most general case we should run tests under a timer. What's the longest any test legitimately takes?
12:24:12 jdarcy: around 600 seconds
12:24:23 rastar: we need to improve cleanup then, there is a 'jobs' command that would show background processes, I think
12:24:53 #action improve cleanup to control the processes that test starts
12:24:55 Not sure if "jobs" works from a non-interactive shell. Let me test.
12:25:08 It's with great fear and self-loathing I've added a `pkill gluster` into the Jenkins job. I'd love to be able to remove it.
12:25:09 let us not delay any more.
12:25:57 please use the bug 1369401 for more suggestions
12:25:58 Bug https://bugzilla.redhat.com:443/show_bug.cgi?id=1369401 high, high, ---, bugs, NEW , NetBSD hangs at /tests/features/lock_revocation.t
12:26:14 #topic GlusterFS-3.8
12:26:47 all on track
12:27:01 nothing that I heard needs special mentioning
12:27:17 schedule is to release 3.8.4 around the 10th of September
12:27:30 Nice, boring release management is a good thing.
12:27:35 we're having major issues with all the 3.8 version when used with libglusterfs and libvirtd.. :(
12:27:53 partner: got a bug reported for that?
12:28:01 partner, Do you mean libgfapi?
12:28:07 ie. trying to use glusterfs as a volume backend for openstack..
12:28:14 sorry, libgfapi yes
12:28:30 no ticket yet since we don't know what to blame..
12:28:57 but that combo does not work fine with latest'ish centos 7 (7.2), works better with 7.1, no segfaults there
12:29:16 just fyi, no need to dig into deeper here
12:29:24 partner: if you think it is a gluster issue, file a bug early and keep updating its progress, we can move bugs between components if needed
12:30:03 partner: check it is related to https://bugzilla.redhat.com/show_bug.cgi?id=1352632
12:30:03 Bug 1352632: unspecified, unspecified, ---, bugs, CLOSED CURRENTRELEASE, qemu libgfapi clients hang when doing I/O
12:30:06 *if
12:30:13 partner: also make sure to use the lastest 3.8 release, there have been some fixes for gfapi
12:30:18 it is fixed in 3.8.1
12:30:28 we've tried 3.8.1 through 3.8.3
12:30:47 trying to find some working combo by downgrading other components such as libvirtd
12:31:03 partner: ok, please file and bug then and we will assign it to right group when we triage.
12:31:15 yup, thanks
12:31:37 * post-factum is late
12:31:40 nothing else for 3.8 I guess.
12:31:47 post-factum: hi!
12:31:53 rastar: o/
12:32:04 moving on 3.7
12:32:12 #topic GlusterFS-3.7
12:32:43 I tagged 3.7.15 yesterday.
12:33:08 .15 has about 30 fixes over .14.
12:33:12 fedora packages are building now. Debian, Ubuntu, SuSE shortly
12:33:20 I'm waiting on packages to show up before doing the announcement.
12:33:25 kkeithley, Thanks!
12:33:26 Yay!
12:33:33 That's it.
12:34:00 #topic GlusterFS-3.6
12:34:33 We've finished the 3.6 bug-screen.
12:34:53 Thank you everyone who helped.
12:35:09 All set for EOL then.
12:35:16 Yep.
12:35:39 #topic Project Infrastructure
12:36:44 atinm: do you want to notify nigelb about lost smoke reports?
12:36:58 * misc want to remind people about the planned reboot on the 1st
12:37:28 misc: reboot of?
12:37:35 formicary, I believe.
12:37:52 the mail I sent on gluster-dev, jenkins and gerrit
12:37:58 to snapshot them and for security update
12:38:13 rastar, sure
12:38:15 we don't have update on mail with subject Reboot of jenkins/gerrit for upgrade and snapshot
12:38:34 nigelb, I guess we discussed about it couple of days back when you figured out we have a missing mkdir -p
12:38:37 from any of the maintainers
12:39:14 ok, no objections for reboot.
12:39:31 #info Reboot of jenkins/gerrit for upgrade and snapshot on September 1st
12:39:35 atinm: yeah, but that was outright failure rather than lcak of reporting, right?
12:39:45 in the middle of the night? For some definition of middle and night?
12:40:01 nigelb, yes, there are two different issues
12:40:15 atinm: okay, file a bug and I'll dig tomorrow.
12:40:28 nigelb, I believe we don't know about the reason on why it failed to vote back
12:40:35 Indeed not.
12:40:54 I've been working on a prototype for somethign to show test failure trends
12:40:57 rastar, if you have the link, can you file it? I've lost the other one which happened two days back
12:40:57 I got this far - http://www.gluster.org/pipermail/gluster-infra/attachments/20160826/ad3e45eb/attachment-0001.png
12:41:14 atinm: I will check with itisravi
12:41:15 Needs more work for sure, but it's getting there during free hours between debugging failures in the CI system.
12:41:17 kkeithley: well, more doing during end of the day for me
12:41:47 nigelb: that is really nice.
12:41:48 is there a reason why the "vote string" (i.e. the ssh command used to register a vote) that used to be at the end of every regression and other test is no longer there?
12:41:54 nigelb++
12:41:55 atinm: nigelb's karma is now 2
12:42:12 misc: that's probably the middle of the night for someone
12:42:17 kkeithley: Only regressions had them, as far as I know.
12:42:27 and they should still have it.
12:42:36 do they? okay. nm then
12:42:43 smoke tests are voted directly by gerrit after aggregating multiple tests.
12:42:48 *directly by jenkins
12:42:54 kkeithley: japan, yep
12:43:22 rastar: I'm still not happy with it. Needs more work. Hopefully, I can have a demo by the end of the month.
12:43:43 kkeithley and I have been talking about getting strfmt test from centos-ci into our ci as a smoke test.
12:43:49 nigelb: it will always be "in progress". It is a nice start though.
12:43:58 there's a thread on gluster-devel that we've kicked off. So, please voice your opinions there.
12:44:12 It'll run on master to begin with and will be non-voting until we get all the failures to pass.
12:44:21 and then it'll start being a normal smoke test which will vote on failure.
12:44:55 And the last update: I'm talking to centos CI folks to see how we can figure out some sort of regression tests for performance.
12:45:03 I'd like to run it as close to a production setup as possible.
12:45:17 i.e. with 4 servers and 4 clients or something like that.
12:45:33 (on hold because I've been away yesterday and I'm technically away today)
12:45:37 hmm, the strfmt test passes currently? https://ci.centos.org/view/Gluster/job/gluster_strfmt/
12:45:41 When I have something, I'll report back on glsuter-devel.
12:46:15 ok, thanks nigelb
12:46:16 I just noticed a lot of warnings in a scratch build log of 3.7.15
12:46:20 ndevos: hrm, kkeithley said they were creeping up.
12:46:21 strfmt warnings
12:46:52 how does the test in CentOS CI decide there were no warnings?
12:47:09 different compiler version?
12:47:13 e.g. used %lu for a size_t
12:47:17 grep, I think?
12:47:27 grep -E ".*: warning: format '%.*' expects( argument of)? type '.*', but argument .* has type 'ssize_t" build.log | tee -a warnings.txt
12:47:30 grep -E ".+: warning: format '%.+' expects( argument of)? type '.+', but argument .+ has type 'size_t" build.log | tee -a warnings.txt
12:47:38 https://github.com/gluster/glusterfs-patch-acceptance-tests/blob/master/centos-ci/gluster_strfmt/run-test.sh
12:48:26 ok, next toic
12:48:28 *topi
12:48:30 and who of our devs ever looks at the CentOS CI results?
12:48:30 *topic
12:49:15 kkeithley: that test sends an email to the maintainers list if it fails, with the warnings in an attachment (+URL to them)
12:49:18 we need to do something about that.
12:49:47 okay
12:49:52 ndevos: kkeithley should we cc devel?
12:50:07 uh, -devel is already pretty noisy
12:50:17 (this is why I want to make it smoke)
12:50:24 so the relevant dev gets notification instantly
12:50:30 rastar: should not be needed, maintainers are expected to catch these before they merge the patches - this is just a 'oops, fix it!' mail
12:50:57 nigelb: I like that idea.
12:51:22 ok, next topic
12:51:31 #topic Ganesha
12:51:35 nigelb: yeah, but that means more load on the slaves, and it needs to be a 32-bit build
12:51:42 2.4 RC1 was tagged on Friday
12:52:36 no other updates then.
12:52:40 expect GA in 2-3 weeks. There's some 11th hour clean up of some of the headers so that they'll compile with C++, for reasons that aren't related to glusterfs.
12:53:10 but if you want to know, ping me in -devel
12:53:15 #info NFS-Ganesha GA around 3rd week of September
12:53:17 or I can tell you here
12:53:44 if anyone is interested
12:53:47 #info NFS-Ganesha 2.4 GA around 3rd week of September
12:54:06 #topic Samba
12:54:20 samba 4.5rc3 was tagged no Aug 29
12:54:30 not that it currently matters much gluster.
12:54:37 semi-related, for 3.10 I expect we will make gnfs a build option and separate RPM package
12:55:18 performance work (md-cache...) is ongoing in gluster to speed up samba
12:56:05 in samba proper, multi-channel work is going on, driven by people also in this community.
12:56:10 #info Samba 4.5rc3 tagged on Aug 29th
12:56:35 jiffin will take over now.
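Editor's sketch of the strfmt check discussed between 12:46:52 and 12:47:38: the two grep patterns quoted in the log can be folded into one regular expression. This is an illustration in Python, not the contents of run-test.sh. The function name `strfmt_warnings` and the sample compiler lines in the comments are hypothetical, and the pattern assumes the compiler prints plain ASCII quotes, as the quoted grep commands do.

```python
import re

# Same shape as the two grep -E patterns quoted in the log, merged so that
# "s?size_t" covers both the size_t and the ssize_t variants.
STRFMT_WARNING = re.compile(
    r".+: warning: format '%.+' expects( argument of)? type '.+', "
    r"but argument .+ has type 's?size_t"
)


def strfmt_warnings(build_log_lines):
    """Return the lines of a build log that look like size_t/ssize_t format warnings."""
    return [
        line.rstrip("\n")
        for line in build_log_lines
        if STRFMT_WARNING.search(line)
    ]


# Hypothetical usage against a saved build log:
# with open("build.log") as log:
#     for warning in strfmt_warnings(log):
#         print(warning)
```

On typical 64-bit Linux builds size_t and unsigned long are the same type, so `%lu` with a size_t argument draws no warning there, which is probably why the log mentions that the check needs a 32-bit build.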
12:57:00 rastar: thanks for ur time
12:57:20 #topic Last weeks AI
12:57:23 thanks jiffin
12:57:52 #topic pranithk/aravindavk/dblack to send out a reminder about the feature deadline for 3.9
12:58:01 It feels like we're in a football match where jiffin gets some play minutes just before the end wistle
12:58:11 Pranith sent mail
12:58:19 aravindavk: i saw that
12:58:26 can u paste me the link
12:58:30 ndevos: I thought my next meeting was midway this one. It is in 2 minutes from now. :)
12:58:56 ndevos: substitution rastar out jiffin in
12:59:11 aravindavk: for the mail?
12:59:15 it's okay, jiffin will get the last batsman out and win the game
12:59:28 jiffin: searching
12:59:28 #link http://www.gluster.org/pipermail/gluster-devel/2016-August/050627.html
12:59:41 ndevos: thanks
13:00:01 #info pranithk send out a reminder about the feature deadline for 3.9 in ML
13:00:20 there are no more AIs
13:00:28 moving to next topic
13:00:31 and also the release check-list at http://www.gluster.org/pipermail/gluster-devel/2016-August/050675.html
13:00:45 #topic Open Floor
13:01:02 Just posted some brick-multiplexing status to gluster-devel, if anyone's interested.
13:01:21 Should show up for others in about half an hour. ;)
13:01:34 thanks jdarcy for the update
13:02:08 it just arrived. But the email I sent to -devel 35 minutes ago still hasn't
13:02:19 great
13:02:36 kkeithley: it just did, for me.
13:02:37 kkeithley: I have the email from you
13:02:43 and jdarcy's too
13:02:44 * jiffin wonders why it is taking long
13:03:45 times is already up
13:03:54 Mailing-list split brain.
13:04:11 * jdarcy almost typed "splat brain"
13:04:19 so do we need to discuss anything else?
13:04:33 strfmt already discussed
13:04:52 longevity is just for people to see the latest numbers, no need to discuss
13:05:02 ditto for RHEL/CentOS RPMs
13:05:05 Anyone planning extended vacation in September?
13:05:13 there was a Gluster Summit topic, but now it's gone
13:05:15 k then, i shall wind up the meeting
13:05:44 kkeithley:i had put that topic
13:05:51 * ndevos will not be on 'vacation' as such, but travelling almost the whole month
13:06:02 Good to know.
13:06:04 ndevos and I will be in BLR starting the 12th
13:06:12 Ooh.
13:06:17 I'll be in town that weekend.
13:06:32 IMO we can discuss it next week, after publishing the schedule
13:06:35 I arrive the 10th, not sure when ndevos arrives
13:06:46 11th, late in the evening
13:07:06 just providing gentle reminder
13:07:11 #topic Announcements / Reminders
13:07:32 If you're attending any event/conference please add the event and yourselves to Gluster attendance of events: http://www.gluster.org/events (replaces https://public.pad.fsfe.org/p/gluster-events)
13:07:41 #end-meeting
13:07:49 no hyphen
13:07:59 sorry for typo
13:08:05 #endmeeting
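
Editor's sketch of the 12:23:56 suggestion to run tests under a timer, combined with the 12:23:43 point that cleanup does not know about a backgrounded dd. This is a minimal illustration for POSIX systems, not the project's test harness. The function name `run_with_timeout` is hypothetical. The idea is to start the test in its own process group so that a single kill on timeout also reaches background children the test started with `&`.

```python
import os
import signal
import subprocess


def run_with_timeout(cmd, timeout_s):
    """Run cmd in its own process group; kill the whole group on timeout.

    Returns (exit_code, timed_out). POSIX only, because of os.killpg.
    """
    # start_new_session=True makes the child a session and process-group
    # leader, so background jobs it spawns inherit its process group ID.
    proc = subprocess.Popen(cmd, start_new_session=True)
    try:
        return proc.wait(timeout=timeout_s), False
    except subprocess.TimeoutExpired:
        # The group ID equals the leader's PID; this signals the leader and
        # every process still in the group, including background children.
        os.killpg(proc.pid, signal.SIGKILL)
        return proc.wait(), True


# Hypothetical usage: the 900 second limit is an assumption, chosen to sit
# above the "around 600 seconds" quoted in the log for the longest test.
# code, timed_out = run_with_timeout(
#     ["prove", "-v", "tests/features/lock_revocation.t"], 900)
```

This sketch only kills the group when the timer fires. A test that exits normally but leaves a background process behind would still need the cleanup improvement recorded in the #action item.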