16:30:04 #startmeeting fedora_coreos_meeting 16:30:04 Meeting started Wed Oct 10 16:30:04 2018 UTC. 16:30:04 This meeting is logged and archived in a public location. 16:30:04 The chair is miabbott. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:30:04 Useful Commands: #action #agreed #halp #info #idea #link #topic. 16:30:04 The meeting name has been set to 'fedora_coreos_meeting' 16:30:10 #topic roll call 16:30:10 .hello2 16:30:11 lorbus: lorbus 'Christian Glombek' 16:30:13 .hello2 16:30:14 .hello2 16:30:14 dustymabe: dustymabe 'Dusty Mabe' 16:30:15 .hello2 16:30:17 ajeddeloh: ajeddeloh 'Andrew Jeddeloh' 16:30:20 slowrie: slowrie 'Stephen Lowrie' 16:30:20 geoff-: Geoff Levand 16:30:22 .hello2 16:30:26 yzhang: yzhang 'Yu Qi Zhang' 16:30:32 .hello2 16:30:33 mskarbek: mskarbek 'None' 16:30:40 #chair lorbus dustymabe ajeddeloh slowrie geoff- yzhang mskarbek 16:30:40 Current chairs: ajeddeloh dustymabe geoff- lorbus miabbott mskarbek slowrie yzhang 16:30:43 .fas jasonbrooks 16:30:44 jbrooks: jasonbrooks 'Jason Brooks' 16:30:45 .hello sinnykumari 16:30:47 ksinny: sinnykumari 'Sinny Kumari' 16:31:06 #chair jbrooks ksinny 16:31:06 Current chairs: ajeddeloh dustymabe geoff- jbrooks ksinny lorbus miabbott mskarbek slowrie yzhang 16:31:23 .hello2 16:31:24 jlebon: jlebon 'None' 16:31:45 #chair jlebon 16:31:45 Current chairs: ajeddeloh dustymabe geoff- jbrooks jlebon ksinny lorbus miabbott mskarbek slowrie yzhang 16:31:50 .hello2 16:31:51 bgilbert: bgilbert 'Benjamin Gilbert' 16:31:57 #chair bgilbert 16:31:57 Current chairs: ajeddeloh bgilbert dustymabe geoff- jbrooks jlebon ksinny lorbus miabbott mskarbek slowrie yzhang 16:32:09 .hello2 16:32:10 bhavin192: bhavin192 'Bhavin Gandhi' 16:32:15 #chair bhavin192 16:32:15 Current chairs: ajeddeloh bgilbert bhavin192 dustymabe geoff- jbrooks jlebon ksinny lorbus miabbott mskarbek slowrie yzhang 16:32:20 .hello rfairleyredhat 16:32:21 rfairley: rfairleyredhat 'Robert Fairley' 16:32:28 #chair rfairley 16:32:28 Current chairs: ajeddeloh bgilbert bhavin192 dustymabe geoff- jbrooks jlebon ksinny lorbus miabbott mskarbek rfairley slowrie yzhang 16:32:46 where I can fix that "None" in the name? I have my full name in FAS. :/ 16:32:51 .hello2 lucab 16:32:52 kaeso: Sorry, but you don't exist 16:33:05 .hello lucab 16:33:06 kaeso: lucab 'Luca Bruno' 16:33:11 #chair kaeso 16:33:11 Current chairs: ajeddeloh bgilbert bhavin192 dustymabe geoff- jbrooks jlebon kaeso ksinny lorbus miabbott mskarbek rfairley slowrie yzhang 16:33:20 we'll get started in about 1m 16:33:36 (I'm briefly afk) 16:34:18 mskarbek: i'm not actually sure 16:34:30 :( 16:34:30 #topic Action items from last meeting 16:34:41 bgilbert to write up a proposal in the ticket (i.e. your description + modifications based on our conversations) and we can all read it and vote on it in the ticket #22 16:35:14 .hello jligon 16:35:15 jligon: jligon 'Jeff Ligon' 16:35:26 #chair jligon 16:35:26 Current chairs: ajeddeloh bgilbert bhavin192 dustymabe geoff- jbrooks jlebon jligon kaeso ksinny lorbus miabbott mskarbek rfairley slowrie yzhang 16:35:50 bgilbert: it looks like that action got completed with ticket #22? 16:36:50 yup 16:36:51 dustymabe: do you know? ^^ 16:36:55 ah, ok 16:37:01 moving on then 16:37:02 #info https://github.com/coreos/fedora-coreos-tracker/issues/22#issuecomment-427218716 16:37:08 #topic set up logging for #fedora-coreos IRC channel #11 16:37:13 #link https://github.com/coreos/fedora-coreos-tracker/issues/11 16:37:28 dustymabe: assigned to you ^^ 16:37:43 I just added a comment to the ticket 16:37:59 https://github.com/coreos/fedora-coreos-tracker/issues/11#issuecomment-428639803 16:38:07 dustymabe: seems a bit unclear? 16:38:29 dustymabe: legal says that avoiding long-term storage would help, but we should get someone else to do it? 16:38:29 bgilbert: mostly: 'steering clear of hosting our own instance as part of the Fedora project is still the recommendation from legal' 16:38:58 mostly that it "might help". I think they'd have to do some more research 16:39:07 .hello2 mnguyen 16:39:08 mnguyen_: Sorry, but you don't exist 16:39:15 .hello mnguyen 16:39:16 mnguyen_: mnguyen 'Michael Nguyen' 16:39:23 ^ love that error message 16:39:25 #chair mnguyen_ 16:39:25 Current chairs: ajeddeloh bgilbert bhavin192 dustymabe geoff- jbrooks jlebon jligon kaeso ksinny lorbus miabbott mnguyen_ mskarbek rfairley slowrie yzhang 16:39:35 what's the next step then? 16:39:46 I think there is a decent level of 'we're scared of what we don't know and this can only bring us pain' involved 16:39:50 dustymabe: is this something where we can ask "but seriously, can we do this" 16:40:14 yep. we can start more formal conversations around the topic 16:40:28 there are a few things we need to figure out 16:40:32 1. can we do it legally 16:40:40 because to my knowledge there aren't any other 3rd party ones that I know of 16:40:42 2. would fedora infra be willing to host an instance 16:40:49 3. if not are we willing to host our own? 16:41:03 hosting our own is worth it imo 16:41:13 if fedora infra wont 16:41:30 for the uninitiated, what does "hosting our own" mean in the context of Fedora? some individual going and doing it? 16:42:00 dustymabe, my team could provide a vm for hosting, if fedora infra won't 16:42:02 long running cloud instance that reboots at some time for updates? 16:42:23 it's not like it needs a particularly beefy instance 16:42:26 bgilbert: I think host our own would mean we "coreos group" hosts own infra/server for this purpose 16:42:42 another option is an individual chooses to implement it themselves 16:43:20 i don't think the hosting/implementation is really the hard part at all 16:43:25 it's mostly the legal aspects 16:43:27 if Fedora Infra were to host tho, wouldnt that be preferable? 16:43:42 lorbus: yeah, i mean anyone to manage something we don't have to :) 16:43:45 i think so, yes 16:43:49 and with Fed Infra, we'd probably get the best legal scrutiny 16:44:24 otherwise (personal instance) someone would personally be responsible for it...which I wouldnt do 16:44:25 i wonder if you could setup a cron job to just start/end a meeting in the channel every day... 16:45:15 lorbus: if we can get a container that does it, just set up a cl instance and once we have a stable fcos switch to that 16:45:21 test of the "set and forget" 16:45:32 maybe. I don't know if that would actually be a good UI though 16:45:35 yeah, I'm still wondering why recording meetings is okay 16:45:55 I'll start an email thread with bgilbert + ajeddeloh and legal 16:46:05 +1 16:46:08 sure, but legally, someone would still be responsible for any wrongdoings under GDPR 16:46:31 #action dustymabe start an email thread with bgilbert + ajeddeloh and legal about logging #fedora-coreos in fedora infra or otherwise 16:46:49 #topic Major release and update cycle for Fedora CoreOS #22 16:46:55 #link https://github.com/coreos/fedora-coreos-tracker/issues/22 16:47:21 nobody assigned, but bgilbert made a detailed comment most recently 16:47:31 s/comment/proposal/ 16:48:57 i guess we need folks to read it and leave feedback? 16:48:59 bgilbert: one could argue that meeting is a "formalized" activity you are choosing to participate and by that agree to the conditions - also recording, normal irc conversation dosn't have any formal requirements to participate 16:49:42 mskarbek: maybe. we do warn people in the topic that the channel is logged though 16:49:58 Something to note: we change the details of what the CL release channels mean occasionally. I think we ought to reserve the right to do the same with FCOS 16:49:58 miabbott: sure, or we can discuss here 16:50:15 because we probably won't get it perfect from the start 16:50:29 let's disucss 16:50:33 discuss* 16:50:40 ajeddeloh: yeah. I mentioned that the cadences are not contractual but I should broaden that language 16:50:54 so one bikeshed - on stable/testing/next names 16:51:19 i wonder if we should rename testing to something else so as to not confuse with bodhi updates-testing 16:51:48 I'm fine with that 16:51:50 bgilbert: re. deprecation, weren't we planning to have chokepoint and dead-leaves via cincinnati? (to simplify the graph and avoid year-long deprecations) 16:51:55 bgilbert: +1 I think just maybe draw a distinction between cadence and channel meaning 16:51:59 *chokepoints 16:52:18 ajeddeloh: ? 16:52:28 i also think it might be beneficial to rename the "development refs" from "updates" to "bodhi-updates" and "updates-testing" to "bodhi-updates-testing" 16:52:47 just to try to further remove any confusion there 16:52:48 dustymabe: candidate? 16:52:50 bgilbert: cadence refers to frequency, but not necessarily the meaning of the channels 16:53:24 ajeddeloh: not sure if I'm understanding. I was thinking of saying that both cadence and definition are non-contractual 16:53:47 sgtm, we're in agreement 16:54:13 kaeso: I realized after I wrote the comment that it doesn't take Cincinnati into account at all 16:54:17 if we can't think of another really good name to replace 'testing' then let's just leave it at 'testing' :) 16:54:31 kaeso: I have a mostly-written draft issue for update rollouts 16:54:57 kaeso: I'd say let's only consider stream structure for now and think about rollouts separately 16:55:07 bgilbert: ack, I'll check that later on then 16:55:20 dustymabe: not sure if it's a good alternative, but it could be renamed to "candidate" 16:55:38 (or something similar) 16:56:04 lorbus: that could work, but a little long :) 16:56:57 rc..but that term could be confusing ppl 16:57:21 rc ? 16:57:22 I'll think on it a bit, even if we don't come up with a name now 16:58:01 it kinda is a release-candidate stream isnt it? 16:58:16 I (don 16:58:46 't want to get into name bikeshedding now) but also thing we should avoid names that have other connotations like alpha, beta, rc, etc 16:58:59 oh oh rc == release candidate 16:59:02 got it :) 16:59:07 I dont feel strongly about this, tho. I'd be fine with leaving it as is 16:59:17 ajedeloh +1 16:59:18 other non-naming comments? 16:59:23 ajeddeloh +1 16:59:36 yeah. testing is good - especially if we rename the dev ref to 'bodhi-updates-testing' 16:59:53 * ksinny is +1 for bodhi-updates and bodhi-updates-testing 17:00:03 * dustymabe done with naming comments :) 17:00:31 but wonders which Fedora version we are talking, at a time there can be bodhi updates for multiple Fedora stream 17:00:42 latest stable 17:01:10 okay, that's reasonable 17:01:33 the proposal says the Fedora version currently tracked by `testing` 17:02:02 +1 17:02:16 on the theory that if users have a problem with stable, they try testing, and if they still have a problem, they might try bodhi-* 17:02:48 sounds like we have good agreement on the proposal. do we want to keep it open (on GH) for further discussion? 17:03:02 +1 for the bodhi- prefixes 17:03:17 there are a couple tweaks from this discussion 17:03:20 But bodhi-* could be tracking, today f28 or f27, but I'm assuming we care only about the latest bodhi-stable release 17:03:25 I'll add the comment about the bodhi prefixes to the ticket 17:03:34 after that, I can go directly to design doc PR, or we can do another round here 17:03:53 dustymabe: no need, I'll update the proposal from this discussion and add a comment 17:04:32 #action bgilbert update ticket #22 with naming choices for development refs (bodhi-*) etc. 17:04:44 jbrooks: for clarity, were you responding to my comment? 17:05:02 #topic Determine how to handle automatic rollback #47 17:05:08 #link https://github.com/coreos/fedora-coreos-tracker/issues/47 17:05:36 bgilbert, Yes, I think ksinny was pointing out that there are two stable fedoras out at any one time 17:05:55 I'm assuming we're always talking about the more recent one 17:05:57 yeah 17:06:01 jbrooks: right, and I was saying we'd track the one referenced by `testing` 17:06:10 which may or may not be the current one 17:06:15 oh, interesting 17:07:35 so rollback: lorbus you worked on some boot-counting stuff right? Do you think there's a reason to give an install more than 1 try? 17:09:01 install == an upgraded deployment ? 17:09:04 ajeddeloh: I don't know honestly. It was a feature that had been desired by different people for some time 17:09:13 dustymabe: yeah 17:09:33 bgilbert: keep me honest, but with CL we only have one shot 17:09:43 i think that was configurable based on the work lorbus was doing (i.e. boot counting) 17:09:52 the number of tries is just a variable 17:09:54 ajeddeloh: don't we have 3 retries? 17:09:58 we do? 17:10:00 can/should be able to be configured 17:10:37 yeah, esp. if we allow user-configured health checks 17:10:41 It is intended to eliminate any flakes/things that would usually go right, but just do not in that boot attempt 17:10:47 ajeddeloh: I was somehow arguing for a "more than one" on the basis that a node reboot / power failure / cosmic ray can happen 17:10:47 pretty sure CL is one shot, yeah 17:11:11 kaeso: exactly that 17:11:58 ok, that's a good reason, what happens if there's a race though? 17:12:02 using grub2, it'll be possible to configure how many tiems to try 17:12:11 times* 17:12:14 like an install comes in, loses, wins, then loses 17:12:42 as implemented in grub2/greenboot it is not an atomic op, though 17:13:15 ajeddeloh: a win would be sticky I'd say 17:13:17 on the first win, then we stop retrying, right? 17:13:44 yes, sticky win 17:14:21 lorbus: have you been able to use the grub2 patches and see boot counting in action? 17:14:28 sgtm, just wanted to be clear. 17:14:30 essentially, to use the grub2 boot counting feature, one must set boot_success=0 and boot_counter=x grubenv vars after rpm-ostree upgrade, before rebooting 17:14:31 https://github.com/projectatomic/rpm-ostree/issues/1598#issuecomment-428218699 17:14:55 lorbus: those changes are all in grub configs and not the grub executables, yes? 17:15:22 .hello2 17:15:23 walters: walters 'Colin Walters' 17:15:47 jlebon: no. arithmetic ops weren't actually supported in grub2 until last week, when pjones implemented it. My code was merged a bit prematurely and was actually ignored by grub2-mkconfig 17:16:04 #chair walters 17:16:04 Current chairs: ajeddeloh bgilbert bhavin192 dustymabe geoff- jbrooks jlebon jligon kaeso ksinny lorbus miabbott mnguyen_ mskarbek rfairley slowrie walters yzhang 17:16:14 lorbus: ahh heh, fun 17:16:26 there is a PR open, that is supposed to finally get it working: https://github.com/rhboot/grub2/pull/42 17:16:47 but for now the script has been removed from grub in the current package build 17:17:17 one of the things we've discussed previously is having a static, handwritten grub config that knows how to pick which install to use 17:17:31 ajeddeloh: yes. its all grubenv vars that are being changed 17:17:48 somewhat related, does anyone disagree that we don't want the "skip the bootloader unless failure" part? 17:17:50 hmm, i'm wondering if there's a way to use more grubenv vars to not have to manually set boot_success=0 after `upgrade`... e.g. maybe also store some deployment marker for which boot_success=1 was set 17:17:56 (for servers) 17:18:16 walters: i don't want to ever skip the bootloader for servers 17:18:22 IMHO 17:18:26 and the in the grub config if the marker doesn't match, we know it's a new deploymenht 17:18:27 walters +1 17:19:16 walters: "skip the bootloader" meaning the menu? 17:19:23 right 17:19:32 TBH I think grub is a monster and it's a good thing they want to replace it in the long term 17:19:33 if so, then yeah we should just never hide it 17:20:17 that doesn't help us right now, tho. and with grub not being able to edit the BLS filenames, grubenv is really the only thing I can see solving this right now 17:20:48 flag files, potentially 17:21:22 boot_success was actually introduced with the menu_hide feature. they want to use a 2min timer unit to set it on the Workstation 17:22:01 but the counting feature doesnt rely in that code 17:22:06 on* 17:22:18 yeah that's the primary driver of the logic today, which is why i lean towards backing it all out until we have a tested design 17:22:40 (backing it out for FCOS/FAH, not Workstation obviously) 17:22:52 sorry, network is dying 17:23:58 I believe it is also going to be used in Fedora IoT, which could be nice testing ground for us as well 17:24:10 yes that will be nice 17:24:12 being rpm-ostree based and all 17:24:34 not sure if this message made it through: lorbus thoughts on having a static, handwritten grub config then storing all state in the grubenv 17:24:53 +1 to that from me 17:25:22 and what bits of state does an install need? 17:25:22 ajeddeloh: I'm not opposed. You dont actually need boot_counter before using it, too. 17:25:37 (more importantly IMO, removing os-prober) 17:25:44 ...we've got about 5m left for this meeting... 17:26:03 jlebon: how would we do this atomically, i.e. without explicitly setting boot_counter? 17:26:22 are writes to grub-env atomic? 17:27:01 * walters thinks about sector sizes here 17:27:05 ajeddeloh: not sure. 17:27:20 i.e. do we need to worry about what happens if we lose power in the middle of trying to write to it 17:27:22 btw there was a fsdevel thread about this 17:27:32 walters: that's important, yes. Grub2 cant increase the size 17:27:55 https://marc.info/?l=linux-fsdevel&m=153741350128439&w=2 17:28:12 lorbus: yeah, that's an issue. e.g. if grub is interrupted between updating boot_success and some e.g. boot_success_marker 17:28:36 we can noodle on it some more in issues 17:28:49 +1 on noodling in issues 17:28:59 +1, also next cabal 17:29:07 I think we ought to tack down what state we need for each install 17:29:14 or maybe we actually have some way forward by then 17:29:16 and how that should be represented in the grubenv 17:30:03 ajeddeloh: yes. during GSoC we made a state machine diagram 17:30:21 I'll update that to the current implementation asap 17:30:26 folks, we're at scheduled time for the meeting 17:30:47 lorbus: keep in mind we also need to have some state that can differentiate two successful installs 17:31:30 * lorbus will think about this some more 17:31:31 do we want to move to open floor? or just close the meeting? 17:31:51 * ajeddeloh has nothing for OF 17:32:07 #topic open floor 17:32:13 * lorbus neither 17:32:15 if you have anything, now's the time :) 17:32:19 I have something 17:32:28 there is a meeting happening now to discuss coreos-assembler 17:32:45 https://lists.fedoraproject.org/archives/list/coreos@lists.fedoraproject.org/thread/T44VRBNIB7GN3ESQDR2DYLQEOEOFS2MS/ 17:34:38 last call for open floor...closing meeting in 1m 17:35:34 #endmeeting