#openstack-meeting-alt log

22:02:54 <jeblair> #startmeeting zuul
22:02:55 <openstack> Meeting started Mon Jun 26 22:02:54 2017 UTC and is due to finish in 60 minutes.  The chair is jeblair. Information about MeetBot at http://wiki.debian.org/MeetBot.
22:02:56 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
22:02:58 <openstack> The meeting name has been set to 'zuul'
22:03:06 <jeblair> #link agenda https://wiki.openstack.org/wiki/Meetings/Zuul
22:03:21 <jeblair> #link previous meeting http://eavesdrop.openstack.org/meetings/zuul/2017/zuul.2017-06-12-22.04.html
22:03:41 <jeblair> #topic Status updates: Standard job library
22:04:24 <jlk> o/
22:04:30 <jeblair> i don't think we've made substantial progress on this since last meeting, though pabelanger did do a lot of work to get gearman running with ssl which we will need for secrets support
22:05:08 <jeblair> mordred and i are planning on pushing pretty hard on this in the next week or two, i believe
22:06:03 <jeblair> once we get going, hopefully we can get the bonnyci folks, and software factory too, looking at the shared jobs repo
22:06:11 <Shrews> ++
22:06:58 <jeblair> anything else to add on this?
22:08:18 <jlk> Ready to review! :)
22:08:57 <jeblair> #topic Status updates: Documentation
22:09:21 <jeblair> i did a lot of work on documentation over the past week
22:09:25 <jeblair> #link https://review.openstack.org/463328
22:09:31 <jeblair> #link https://review.openstack.org/475928
22:09:42 <mordred> yah. them's a lot of words
22:09:45 <jeblair> those are the biggest changes, but they are at the top of a stack.
22:10:08 <jlk> I just spent a significant chunk of today (re)reading the first one.
22:10:12 <SpamapS> very excited to see docs starting to form
22:10:17 <mordred> ++
22:10:18 <mordred> me too
22:10:19 <Shrews> i LOVE when others write much needed documentation
22:10:24 <jeblair> the whole stack is ready for review, and it was green earlier today, though i merged tristanC's config default change which conflicts with some of them.  they will be easy to fix, so expect new revs tomorrow.
22:10:51 <jeblair> i know there's still a lot more to do (especially we need more examples, some of which should be in-line, some of which should be appendices)
22:11:03 <jeblair> and i think there's still some polish that could happen on higher-level overview stuff
22:11:31 * Shrews read that as "Polish"
22:11:32 <jeblair> but i'm hoping that's enough of a skeleton for us to build on  (i mean, it's more than a skeleton, it is pretty substantial, but you get the idea)
22:11:41 <jeblair> Shrews: reverse polish?
22:11:56 <jlk> jeblair: hopefully you can refrain from rebasing the changes, right? So that sub-patchset level comparisons can still be used if/when you make changes?
22:12:40 <jeblair> jlk: i will need to rebase most of the stack in order to fix the conflicts, but i will do a two-step rebase to make it easier
22:12:54 <jeblair> jlk: i'll rebase without changes first, and annotate that so you can ignore that inter-patchset diff
22:12:57 <jlk> kk
22:13:04 <jeblair> jlk: then i'll fix the conflicts in its own chaneg
22:13:23 <jeblair> maybe.  if possible.
22:13:46 <jeblair> if i can't, i'll just make sure that i only change the minimum to fix conflicts in the rebase.
22:14:11 <jlk> appreciated
22:14:54 <jeblair> i've tried to build a narrative structure in each of the user/admin guides so we're sort of taking users or admins through a learning process.
22:15:10 <jeblair> i'm sure it's not fully there yet, but that's what i was aiming for
22:15:59 <jlk> it's a good read
22:16:04 <jlk> They're good words, Jom.
22:16:06 <jeblair> i'm very eager to land these asap, and then resume our practice of changing docs with code.  :)
22:16:27 <mordred> yes. this will be quite nice
22:17:30 * Shrews will review Jom's words tomorrow
22:18:10 <jeblair> the second change is all kinds of reorg, much of which changes again later.  i don't know how to make that better, sorry.
22:18:54 <jeblair> also, keep in mind that the docs job renders everything, so folks may enjoy reading the final rendered form in later changes.  :)
22:19:10 <jeblair> i've consulted it several times today
22:19:27 <jeblair> anything else?
22:19:54 <clarkb> if you pull the second change and look at it locally I think git itself might be able to better show you the file moves?
22:19:57 <clarkb> maybe not
22:21:57 <jeblair> clarkb: yeah, i'm not sure.  i took notes while writing it.  :|
22:22:24 <jeblair> #topic Status updates: Github parity
22:23:07 <jeblair> jlk: do you have any outstanding changes related to this?
22:23:28 <jlk> There are a couple open.
22:23:45 <jlk> Depends-On for github, and a fix to support push events better (non pull-request)
22:24:02 <jlk> I also noticed while reading through docs that we could do a better job of supporting statsd metrics in the github driver
22:24:20 <jeblair> good point; now's probably a good time to add that
22:24:37 <jlk> Oh, and Depends-On is partial, it does not do the helpful thing of back-searching to discover needed-by changes to re-enqueue.
22:25:01 <jlk> the back searching is going to be limited by what github supports in search, as we discussed at AnsibleFest.
22:25:37 <jlk> We might be able to get partially there just by checking our own cache of change information, but that could lead to non-deterministic behavior.
22:26:16 <jeblair> yeah, we still need to decide how to handle that.  we brainstormed some ideas about that; i think our choices are roughly: (a) don't support it; (b) move depends-on into pull request messages rather than commits; (c) perform some kind of local memoization so we just ask ourselves for the info
22:26:29 <jeblair> (c) is growing on me
22:26:46 <jeblair> the amount of data we would need to keep is tiny, i think.
22:26:54 <mordred> I believe b and c are better outcomes than a
22:26:57 <jlk> C feels fairly easy to implement, if we ignore lost cache during restarts.
22:26:57 <jamielennox> c is useful as we should know, but aren't we already doing (b)
22:26:59 <clarkb> bigger problem is probably making sure that you've collected all the data right?
22:27:03 <clarkb> for c
22:27:26 <jeblair> jlk: i think we can have the scheduler go ahead and store the data on disk to persist between restarts.
22:27:27 <jamielennox> i don't think it makes sense to have depends-on in commit messages for github
22:27:29 <mordred> clarkb: well - the only thing that could be blocked on a depends-on is a previously-gotten event
22:27:32 <jlk> clarkb: so far, we are. Every change we runtime cache we grab every commit message of every commit on the change.
22:27:47 <mordred> ah - different question. jlk's answer is better than mine
22:28:07 <jlk> jamielennox: you prefer that they be read from pull request comments instead?
22:28:38 <mordred> I thought we did not get an event when a PR summary changes?
22:28:55 <jamielennox> i thought we were doing from the pull request body, i mean you have to merge the full PR
22:28:58 <jlk> I'll have to test.
22:29:09 <jamielennox> but not comments on a PR
22:29:10 <clarkb> jlk: right but hat happens when you take a 4 hours downtime and github changes a bunch?
22:29:21 <mordred> btw - I think we should do c even if we do b - because gh rate-limiting means if we can avoid the extra query it would be valuable
22:29:42 <jlk> well.... lets take a step back here
22:29:42 <clarkb> or are you saying that its ok because next eent you'd just repopulate complete cache?
22:29:49 <jlk> clarkb: (yes that)
22:29:57 <mordred> ++ to that
22:30:07 <mordred> if there's a zuul downtime people are going to need to recheck anyway
22:30:21 <jlk> If we listen to jamielennox , and shift the data source to pull comments
22:30:22 <jeblair> (or admin enqueue events or something)
22:30:30 <jlk> we can avoid some of this problem
22:30:51 <mordred> well- I think pull comments are the wrong place - pull summary would be the more similar place, no?
22:30:56 <jlk> (it'll introduce some new problems, but not insurmountable)
22:31:03 <jamielennox> mordred: right, not arbitrary comments added to the PR
22:31:04 <mordred> summary goes into merge commit message
22:31:06 <jamielennox> just the summary
22:31:06 <mordred> jamielennox: ++
22:31:07 <jlk> hah, well, okay
22:31:19 <mordred> I thought we did not get events when that updates?
22:31:24 <jlk> yes, we'd just have to be aware when that changs.
22:31:26 <jlk> es
22:31:28 <jlk> one sec
22:31:34 <jamielennox> not sure
22:31:57 <jamielennox> also, it seems right to me on the PR, but i'm used to gerrit so all my PRs are one commit long
22:32:17 <jamielennox> but there are some great/shocking examples of 200 commits to a PR that i think mean PR is about the only place they make sense
22:32:22 <jlk> Changing the PR summary generates a comment event with an action of "edited"
22:32:40 <mordred> cool. so we can react to that and notice a depends-on has been added or removed
22:33:07 <jlk> sure, we'd just have to ensure we're only looking at the "special" comment
22:33:10 <jlk> which is the PR summary
22:33:34 <jeblair> jamielennox: i think the behavior is that if any of those 200 events have a depends-on footer, then the pr depends on that.  but putting it in the commit lets you say "this commit depends on this functionality", and that's something that will transit through multiple git repos, etc.  it's a property of the commit, rather than the (somewhat more transient) pull request.
22:34:36 <mordred> jeblair: yah - I think I'm coming to the other pov on that ... for github users, the PR is the unit of work, not the commit - removing a depends-on footer if it stops being true would also become potentially hard
22:34:54 <mordred> for folks who do not rebase their branches but instead push new patches on top of a branch
22:34:55 <clarkb> especially since old commits and comments go away when you rebase right?
22:34:56 <jamielennox> yea, ok, i see it's useful to reference it in the commit that uses it. but everything else we do is via PR, like you can't address a commit in a PR like BonnyCI/zuul#32@1 so your PR is really your unit in GH
22:34:59 <jlk> The PR Summary is implemented as the first comment of the PR, we'd have to identify this and make it "special"
22:35:29 <jamielennox> jlk: there's nothing that identifies the PR body as special at all in an event?
22:35:41 <mordred> clarkb: old commits do - old comments stay around
22:35:41 <jlk> not that I've seen :(
22:36:04 <clarkb> mordred: only top level comments though right?
22:36:23 <jlk> ugh
22:36:27 <mordred> clarkb: hrm. I thought in-line comments also stuck around and were marked as "for a previous revision" with a red x
22:36:35 <jlk> haha
22:36:38 <jlk> guys you're on a tangent
22:36:43 <mordred> we are indeed
22:36:56 <jlk> those are "different" as they're "review comments"
22:36:59 <jlk> not to be confused with Pull Request Review comments
22:37:03 <jlk> There's comments to the PR which you see on the front page
22:37:07 <mordred> ah
22:37:13 <jlk> there's review comments which are made in-line with the source code
22:37:23 <jlk> and then there's Pull Request Review comments
22:37:33 <jamielennox> which GH implemented as a special type of issue, which confuses this more - but digression
22:38:09 <jlk> From the webhook data I'm looking at, I don't even see an identifier for _which_ comment was edited.
22:38:29 <mordred> jlk: you can edit comments other than the summary?
22:38:35 <jlk> yes
22:38:44 <jamielennox> jlk: so you could still say a comment was editted on this PR and fetch it again? not greatest but it would catch it
22:39:01 <jlk> http://paste.openstack.org/show/613760/ is hte payload
22:39:03 <mordred> oh wow. you really can edit comments. that's so weird to me
22:39:12 <jlk> jamielennox: yeah, we could fetch and then re-examine the "first" comment.
22:40:05 <jeblair> okay, so there's some support for moving this to PR summary.  how about we dive deeper into this and make sure that doing so is both workable and a solution to the needed-by problem, and regroup?
22:40:09 <jamielennox> hmm, crazy that the PR body is only considered a comment
22:40:15 <mordred> ++
22:40:20 <mordred> to both jeblair and jamielennox
22:40:21 <jamielennox> ++
22:40:30 <jlk> ++
22:41:06 <jeblair> jlk: okay if i action you with that?
22:41:27 <jlk> worksforme
22:41:32 <jeblair> #action jlk look into moving depends-on to pull-request summary to see if it is both workable and a solution to the needed-by problem
22:42:06 <jeblair> cool, thanks.. let's move on to make sure we don't run out of time
22:42:17 <jeblair> #topic Status updates: (web) console streaming
22:42:48 <Shrews> oh, hey. that's an agenda item now
22:44:01 <Shrews> not sure what to say except that i've hit a roadblock and had to swallow my pride and ask for help from the original author and Dear Leader
22:44:29 * mordred is also going to help swear at things
22:45:08 <clarkb> have the problems been described anywhere yet?
22:45:32 <Shrews> clarkb: not publicly, no
22:46:05 <jeblair> hopefully we can figure out in the next couple days if the problems are "merely technical" or something more fundamental that would cause us to re-evaluate our approach
22:46:48 <jeblair> i think we're at the "this doesn't seem to work; not sure what's going on" stage
22:46:56 <Shrews> yeah. if it's a stupid programmer (aka, me) error, we can move forward. if it's an architectural thing, then public input may be needed
22:48:11 <jeblair> i'm going to skip some topics and move on to...
22:48:18 <jeblair> #topic Removing py2 support from v3 nodepool (Shrews)
22:48:31 <SpamapS> One thing I've never liked about async "futures" style coding is that it is often just hard to see wtf is going on inside the machine.
22:48:43 <jeblair> #link https://review.openstack.org/476683
22:48:43 <SpamapS> (that was regarding the web streaming)
22:48:58 <Shrews> mordred and I were discussing https://review.openstack.org/476683 this morning, and it came up that maybe we should stop supporting py2 in feature/zuulv3 nodepool
22:49:35 <jeblair> SpamapS: [me too; part of why i want to keep that kind of code in its own sandbox]
22:49:40 <mordred> yah - if we don't support py2 in zuulv3 of zuul, I'm not sure why we should support it in nodepool - of course, we need to start running infra's nodepool in v3 before we drop it
22:49:58 <clarkb> Shrews: mordred my concern with that right now is python35 integration test does not work on nodepool
22:50:07 <mordred> clarkb: that's a great concern!
22:50:08 <clarkb> I think we need to get the testing solid on python3 first then make that assertion
22:50:11 <mordred> clarkb: we sohuld, you know fix that :)
22:50:14 <mordred> ++
22:50:17 <clarkb> because right now python2 is what works based on testing
22:50:19 <Shrews> clarkb: that's a very valid point
22:50:27 <jeblair> clarkb: the devstack one, or the zuul one?
22:50:32 <clarkb> (I think it may work on feature branch but not master)
22:50:36 <clarkb> jeblair: the devstack one
22:50:45 <mordred> oh. yah - I only mean on feature/
22:51:03 <clarkb> mordred: so I think you have to do both at the same time actually
22:51:10 <mordred> but we should _definitely_ make sure testing is solid before doing anything
22:51:14 <clarkb> and the reason for that is we have a habit of fixing things on feature first then maybe backporting to master
22:51:22 <clarkb> and you can't backport anymore if different pythons are supported
22:51:52 <Shrews> devstack, from what i've seen. weird errors, like: http://logs.openstack.org/23/476223/2/check/gate-dsvm-nodepool-py35-src-nv/af51324/logs/devstacklog.txt.gz#_2017-06-26_17_03_17_402
22:52:09 <clarkb> Shrews: ya it has to do with setuptools and permissions and stuff I think
22:52:12 <clarkb> Shrews: its good python fun
22:52:16 <mordred> so - let's maybe reframe this to "before we release nodepool v3, we should drop python2 support" - largely because I dont' think we want to grow new python2 based users that makes it hard to drop support
22:52:47 <Shrews> mordred: yes, that's a good way to put it. A goal to move towards
22:52:52 <mordred> and between now and then, we should definitely make sure that all the python3 testing is solid and that infra is running nodepool v3 on python3
22:53:09 <mordred> since those are both just good ideas for sanity and consistency
22:53:23 <jeblair> mordred: i'm okay with that, though i note that it doesn't address the immediate need, which was driven by "we want to land some code in v3 that is much simpler (like, a hundred lines plus an external dependency) with py3
22:53:36 <SpamapS> does this maybe up the priority of shimming?
22:53:41 <mordred> jeblair: indeed
22:54:21 <Shrews> jeblair: to be fair, i *really* like harlowja's change, but it isn't something that's needed RIGHT NOW.
22:54:29 <clarkb> I'm ok with doing it I just think it has to be done on both branches together with working tests
22:54:39 <clarkb> Shrews: also that, it may save like a second on shutdown
22:54:48 <harlowja> i save all the seconds, lol
22:54:48 <clarkb> not super urgent but a nice code cleanliness thing
22:54:49 <Shrews> well, could be several seconds, but yeah
22:55:13 <jeblair> SpamapS: i think the main gains of shimming were load testing plus ease of transition for openstack-infra; so i don't see this as driving a further need for that.
22:55:44 <SpamapS> Oh I thought maybe shimming would let us get onto feature/ sooner.
22:55:54 <SpamapS> so we wouldn't be backporting so much
22:56:23 <jeblair> SpamapS: oh, hrm.  i guess it would?
22:56:35 <jeblair> like, we could go ahead and merge feature back into master
22:57:20 <jeblair> but it sounds like our immediate needs aren't important enough to push that though
22:57:56 <jeblair> we can shelve harlowja's change until we're ready to do that anyway, and we'll survive
22:58:05 <harlowja> wfm
22:58:14 <jeblair> i will buy harlowja a cookie
22:58:37 <Shrews> so, is it a goal that we remove py2 support from nodepool before release then?
22:58:55 <jeblair> +1
22:59:14 <jeblair> #topic Surprise bonus topic: should we meet next week?
22:59:36 <jeblair> i just realized i'm going to be sitting around watching a smoker all day next monday
23:00:00 <jeblair> (it's the day before a holiday, so i will probably make it a 4 day weekend)
23:00:09 <Shrews> i'm considering taking all of next week off, so +1 from me
23:00:14 <jeblair> would folks like to cancel the meeting, or someone else want to chair?
23:00:37 <jlk> I'm okay w/out a meeting
23:00:57 <jlk> I'll report my investigations by way of a gerrit change
23:01:35 <jeblair> okay, i'll wait for a volunteer chair until tomorrow and without one, i will send a cancellation notice
23:01:39 <jeblair> thanks all!
23:01:40 * SpamapS is +0
23:01:46 <jeblair> #endmeeting