17:00:00 <ildikov> #startmeeting cinder-nova-api-changes
17:00:01 <openstack> Meeting started Mon Oct  3 17:00:00 2016 UTC and is due to finish in 60 minutes.  The chair is ildikov. Information about MeetBot at http://wiki.debian.org/MeetBot.
17:00:02 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
17:00:04 <openstack> The meeting name has been set to 'cinder_nova_api_changes'
17:00:10 <ildikov> scottda ildikov DuncanT ameade cFouts johnthetubaguy jaypipes takashin alaski e0ne jgriffith tbarron andrearosa hemna erlon mriedem gouthamr ebalduf patrickeast smcginnis diablo_rojo gsilvis  xyang1 raj_singh
17:00:12 <mriedem> o/
17:00:40 <cFouts> hi
17:00:42 <scottda> hi
17:00:49 <jgriffith> present
17:01:15 <ildikov> hi All :)
17:01:49 <mriedem> i have a quick update once we're done waiting for people
17:02:07 <ildikov> should I do a quick ping on the Cinder channel?
17:02:14 <mriedem> shrug
17:03:17 <ildikov> ok, pinged a few guys
17:03:29 <ildikov> mriedem: I think we can move to your update
17:03:50 <mriedem> ok, we have a working nova-initiated swap volume test in tempest now http://logs.openstack.org/73/374373/2/check/gate-tempest-dsvm-neutron-full-ubuntu-xenial/cb37dbc/console.html#_2016-10-03_16_12_03_518757
17:04:01 <mriedem> not merged yet
17:04:04 <mriedem> stack starts here: https://review.openstack.org/#/c/374373/
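For context, the swap that the new test exercises goes through Nova's volume-attachment update API. A minimal sketch with python-novaclient, assuming an already-built keystoneauth session and the legacy volumes manager (the names below are an assumption about the client, not taken from the patch):

    # Sketch: ask Nova to swap an attached volume (what the tempest test drives);
    # update_server_volume wraps
    # PUT /servers/{server_id}/os-volume_attachments/{old_volume_id}
    from novaclient import client as nova_client

    nova = nova_client.Client('2.1', session=keystone_session)  # session assumed
    nova.volumes.update_server_volume(server_id, old_volume_id, new_volume_id)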
17:04:26 <ildikov> mriedem: wow, that looks great!!!
17:04:43 <hemna> doink
17:04:46 <jgriffith> sweet
17:05:01 <hemna> ooh nice
17:05:17 <mriedem> it's gotten decent review in tempest though so i expect that to be merged this week
17:06:17 <ildikov> mriedem: that sounds pretty good
17:06:39 <ildikov> scottda: mriedem: did we have other tests in mind that we would need?
17:06:39 <mriedem> the only other thing i had was i did the thorough review on removing volume_api.check_attach in nova https://review.openstack.org/#/c/335358/
17:06:54 <mriedem> ildikov: scottda's tempest change for retype doesn't trigger volume swap in nova
17:06:55 <ildikov> I mean in general, not for the new API
17:06:59 <mriedem> i'm not really sure why
17:07:02 <mriedem> but i don't know much about retype
17:07:18 <scottda> ildikov: Yes, we need to run tempest test on 2-node cinder to call swap_volume
17:07:38 <scottda> retype from lvm->lvm on the same node won't call swap_volume
17:07:50 <scottda> Just some "smarts" in lvm driver code.
17:07:55 <ildikov> scottda: ok, I'll add notes to the etherpad
17:08:21 <mriedem> scottda: will it be a retype test?
17:08:25 <mriedem> or volume migration?
17:08:31 <scottda> But I *think* we can make it work for 2-node. Sounds like it would duplicate coverage of swap_volume. So useful, but not imperative
17:08:42 <scottda> mriedem: Yes, it would be cinder retype-with-migration
17:08:47 <scottda> so both
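For reference, a retype that has to cross backends (and therefore drives the Cinder-initiated swap_volume path in Nova) can be forced with the migration policy. A sketch with python-cinderclient, assuming a configured session and a second volume type ('lvm-on-other-node' is a placeholder) that lands on a different backend:

    # Sketch: retype with forced migration so Cinder has to move the volume,
    # which in turn calls Nova's swap_volume while the volume is attached
    from cinderclient import client as cinder_client

    cinder = cinder_client.Client('2', session=keystone_session)  # session assumed
    cinder.volumes.retype(volume_id, 'lvm-on-other-node', 'on-demand')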
17:08:50 <mriedem> ok
17:08:55 <mriedem> not a total swap_volume duplicate,
17:09:06 <mriedem> because the test above that i'm working on is nova-api initiated, not cinder
17:09:10 <mriedem> so there is less back and forth
17:09:24 <scottda> ok, cool. So still useful for calls to novaclient, etc.
17:09:29 <mriedem> yes
17:09:57 <ildikov> mriedem: I've just gotten around to checking your comments
17:10:11 <ildikov> I don't know the answer to all questions you raised
17:10:49 <ildikov> like what happens with unreserve_volume if we never called reserve, but end up with the cleanup path
17:11:27 <ildikov> does unreserve do a volume status check?
17:11:27 <ildikov> scottda: hemna: ^^
17:11:52 <mriedem> i checked
17:11:53 <mriedem> and it does
17:11:56 <mriedem> so i think that would fail
17:11:59 <mriedem> with a conflict
17:12:06 <ildikov> :(
17:12:16 <hemna> cinder volume has to be in-use
17:12:27 <hemna> https://github.com/openstack/cinder/blob/master/cinder/volume/api.py#L635-L638
17:12:45 <hemna> in-use or available I think by the looks of that
17:13:08 <ildikov> hemna: what if something goes wrong after calling reserve and then we call unreserve to clean things up?
17:13:21 <ildikov> as reserve moves it to 'attaching' IIRC
17:13:22 <mriedem> if it allows available,
17:13:30 <hemna> meaning the volume is in error state before calling unreserve ?
17:13:35 <mriedem> then it seems unreserve would be fine if you never called reserve
17:13:55 <scottda> expected status must be "attaching"
17:13:59 <hemna> if an error happens between reserve and unreserve....then...it's in error state and it's borked.
17:14:24 <hemna> scottda, oh yah you are right.  I didn't see line 632
17:14:25 <ildikov> hemna: we are talking more about issues on Nova side because of which we want to undo the steps like volume attach
17:14:28 <mriedem> yeah https://github.com/openstack/cinder/blob/master/cinder/volume/api.py#L632
17:14:49 <hemna> my bad
17:15:04 <scottda> And that conditional update stuff is specifically to eliminate races. So it cannot change on the cinder side.
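To make the failure mode concrete, here is a self-contained sketch of the gate being discussed; the real Cinder code does this as an atomic conditional update against the database, and every name below is illustrative only:

    class ConflictError(Exception):
        pass

    def reserve_volume(volume):
        # reserve: 'available' -> 'attaching'
        if volume['status'] != 'available':
            raise ConflictError('volume must be available to reserve')
        volume['status'] = 'attaching'

    def unreserve_volume(volume):
        # unreserve only succeeds while the status is still 'attaching', which is
        # why calling it without a prior reserve fails with a conflict
        if volume['status'] != 'attaching':
            raise ConflictError('expected status attaching, got %s' % volume['status'])
        volume['status'] = 'in-use' if volume['attachments'] else 'available'

    vol = {'status': 'available', 'attachments': []}
    reserve_volume(vol)      # available -> attaching
    unreserve_volume(vol)    # attaching -> available
    # unreserve_volume(vol)  # would raise ConflictError: never (re-)reserved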
17:16:13 <mriedem> so the reason ildikov is asking,
17:16:17 <mriedem> is because in this change https://review.openstack.org/#/c/335358/17/nova/compute/api.py
17:16:27 <mriedem> based on the version of the compute, the nova-api may or may not reserve the volume
17:16:54 <mriedem> on the compute it's unreserving the volume https://review.openstack.org/#/c/335358/17/nova/compute/manager.py
17:16:58 <ildikov> the history is that we identified one place where reserve_volume was not called
17:17:12 <ildikov> and by removing check_attach it now matters
17:17:13 <mriedem> in case _prep_block_device fails
17:17:47 <ildikov> or matters more, and we're now trying to cover the failure scenarios for cleanup, i.e. unreserve
17:17:53 <mriedem> my question is more clearly asked here https://review.openstack.org/#/c/335358/17/nova/conductor/manager.py
17:18:46 <hemna> so looking at the cinder api call
17:18:54 <hemna> if it's not in attaching, you'll get an error
17:19:04 <mriedem> btw, is that a 400 or 409?
17:19:57 <hemna> I'll try and track that down
17:20:00 <hemna> it's not obvious
17:20:07 <scottda> I'd like to make sure that with the new API we're not creating all these exceptional cases that are all shoved into one API call.
17:20:52 <ildikov> mriedem: also have you checked the other calls of _check_attach_and_reserve_volume regarding unreserve? I kinda thought those are covered, but surely haven't checked all scenarios
17:21:18 <mriedem> ildikov: no that's why i asked in https://review.openstack.org/#/c/335358/17/nova/compute/api.py@3394
17:22:13 <ildikov> ok, I'll try to check then, I read your comment more like you're leaning towards it's not done
17:22:56 <mriedem> ildikov: i believe it's because with attach_volume in the compute api,
17:23:04 <mriedem> it reserves the volume in the api, then casts to the compute to attach,
17:23:14 <mriedem> and if the attach on the compute fails, the compute unreserves
17:23:38 <ildikov> I can fix additional occurrences if they're missing, but it would be pretty odd
17:23:38 <mriedem> however, if the cast to the compute failed for some reason, we'd leave the volume orphaned and you'd have to reset the state to get it back
17:23:43 <ildikov> yeap, that's right
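A minimal sketch of the flow just described, with stand-ins for the Cinder volume API and the compute RPC (all names are placeholders); the point is where unreserve_volume happens and where it can be missed:

    def attach_volume_api_side(volume_api, compute_rpcapi, context, instance, volume_id):
        volume_api.reserve_volume(context, volume_id)        # status -> 'attaching'
        try:
            # fire-and-forget cast to the compute host
            compute_rpcapi.attach_volume(context, instance, volume_id)
        except Exception:
            # if even the cast fails, clean up here; otherwise the volume is
            # stuck in 'attaching' until an admin resets its state
            volume_api.unreserve_volume(context, volume_id)
            raise

    def attach_volume_compute_side(volume_api, driver, context, instance, volume_id):
        try:
            driver.attach_volume(context, instance, volume_id)
        except Exception:
            # compute-side failure: unreserve so the volume becomes usable again
            volume_api.unreserve_volume(context, volume_id)
            raise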
17:23:57 <hemna> anyway, I can't make sense of the response to the conditional_update failure
17:24:05 <hemna> and how the wsgi converts that into a failure
17:24:08 <hemna> 400 vs 409
17:24:10 <mriedem> ildikov: this attempts to fix that https://review.openstack.org/#/c/290793/
17:24:14 <mriedem> hemna: yeah i couldn't either
17:24:21 * hemna isn't happy about it
17:24:54 <scottda> gorka would have the answer, I think. But it shouldn't require a genius to figure it out.
17:25:01 <hemna> scottda, +1
17:25:07 <hemna> it's stupid that it's this hard to figure it out
17:25:46 <ildikov> mriedem: ah ok, I fully get what you mean now, but that's kinda independent from the check_attach removal; it's a generic question of how to handle reserve/unreserve
17:26:15 <mriedem> ildikov: yes
17:26:20 <mriedem> just more garbage that needs cleaning up
17:27:04 <hemna> https://github.com/openstack/cinder/blob/master/cinder/api/openstack/wsgi.py#L600
17:27:06 <hemna> maybe that ?
17:27:07 <hemna> dunno
17:30:26 <mriedem> ok, so anyway, ildikov's check_attach patch needs some work
17:30:31 <mriedem> jgriffith: updates on your changes?
17:30:45 <ildikov> mriedem: as far as I can see the patch you linked is taken care of, but maybe it needs more eyes to figure things out
17:30:53 <jgriffith> mriedem: yeah, I pushed another update yesterday...
17:31:15 <jgriffith> mriedem: so there's some things to be fixed with migrate and unshelve updating correctly
17:31:17 <ildikov> mriedem: yeap, tnx for the review, I'll answer/fix this week
17:31:46 <jgriffith> mriedem: I also started working on the idea of ack'ing the create/terminate connection phases as an option
17:32:07 <jgriffith> mriedem: that interestingly started to make things look a lot like today's initialize--->attach :(
17:32:27 <mriedem> ildikov: _attach_volume_shelved_offloaded is also busted, it doesn't unreserve the volume in case attach fails
17:32:53 <jgriffith> mriedem: so  I started an experiment to just overhaul/improve that type of flow (better names, not so many optional args) that uses attachment_ids and stores the connectors
17:33:02 <ildikov> mriedem: is there a bug report for that? we might track these issues, just to not get lost in this
17:33:13 <scottda> maybe we'd be better off with a cleanup_failed_attach API to unreserve regardless of state.
17:34:06 <jgriffith> mriedem: scottda are we talking about the same thing still?
17:34:21 <jgriffith> mriedem: scottda or is this another topic that you're discussing with ildikov?
17:34:40 <scottda> We were earlier talking about cleaning up failed attaches...
17:35:13 <scottda> But the call to unreserve only works if the state is "attaching". So there were issues with ildikov's patches to remove calls to reserve
17:35:14 <jgriffith> ok, carry on then
17:35:23 * jgriffith goes back to what he was doing
17:36:00 <mriedem> ildikov: i'll open a bug
17:36:03 <jgriffith> scottda: FWIW you could just change that :)
17:36:27 <scottda> jgriffith: Yeah, but the conditional update stuff to eliminate races would get busted.
17:36:39 <ildikov> jgriffith: we finished with that, just mriedem brought up one more example of the issue :)
17:36:51 <scottda> jgriffith: Anyway, didn't want to drag us back. You should carry on with your update.
17:36:54 <jgriffith> scottda: by allowing it to unreserve an "error-attaching"?  I don't see how
17:36:55 <mriedem> jgriffith: have you had a clean ci run yet?
17:37:14 <jgriffith> mriedem: nope, down to eight failures
17:37:16 <mriedem> at least for the attach flow
17:37:24 <mriedem> ok, shelve is all sorts of weird
17:37:31 <mriedem> and not well tested probably
17:37:32 <jgriffith> mriedem: I've discovered that :)
17:37:54 <jgriffith> mriedem: there's something kinda funky when it calls os-brick
17:37:58 <jgriffith> will get to that today
17:37:59 <mriedem> yeah, 'save all of my vm state in the db and take care of it, but destroy my guest and don't charge me until i want it back'
17:38:01 <mriedem> super pet scenario
17:38:08 <jgriffith> mriedem: yeah!
17:38:22 <jgriffith> mriedem: frankly I've come to hate the *feature* :)
17:38:29 <mriedem> as have several of us
17:38:38 <jgriffith> mriedem: I particularly hate the concept of attaching a volume to a shelved instance
17:38:49 <mriedem> yes, that's what i was just looking at
17:38:54 <mriedem> it's not attached until unshelve
17:39:10 <jgriffith> mriedem: well in reality correct, but from Cinder's point of view :)
17:39:16 <mriedem> yeah
17:39:41 <mriedem> i'm not sure how well we handle detaching those when we delete a shelved offloaded instance either
17:39:45 <jgriffith> mriedem: anyway, I got most of it all worked out, just something in brick maybe privsep was puking
17:39:51 <mriedem> because i think that assumes terminate_connection will work
17:39:56 <jgriffith> mriedem: so that part is actually covered which is nice
17:40:08 <jgriffith> mriedem: it does a cleanup of everything in the BDM table
17:40:33 <jgriffith> mriedem: oh that part :)
17:40:43 <jgriffith> mriedem: so my patch added some things, because we were cheating
17:40:58 <jgriffith> mriedem: my patch has a no_connect flag in the connector
17:41:11 <jgriffith> mriedem: so now nova and cinder both know that it's not actually being used yet
17:41:12 <mriedem> ok yeah i think we might have fixed this in the api anyway
17:41:23 <mriedem> yeah _get_stashed_volume_connector should return None
17:41:28 <mriedem> so we don't attempt to terminate the connection
17:41:35 <jgriffith> mriedem: exactly
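In other words, the guard in the delete path amounts to something like the following; this is a paraphrase of the behaviour described above, with stand-in parameters rather than the actual Nova code:

    def maybe_terminate_connection(volume_api, context, bdm, instance,
                                   get_stashed_connector):
        # skip terminate_connection when the volume was attached while the
        # instance was shelved offloaded and never physically connected
        connector = get_stashed_connector(bdm, instance)
        if connector is not None:
            volume_api.terminate_connection(context, bdm.volume_id, connector)
        # with no stashed connector there is nothing to tear down on the backend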
17:42:07 <jgriffith> mriedem: anyway, I'm working on an alternate proposal
17:42:27 <jgriffith> mriedem: I want to post this today or tomorrow and get some folks to weigh in on both of these options
17:42:34 <mriedem> ok
17:42:37 <jgriffith> mriedem: see what folks think is going to be best long term
17:43:12 <ildikov> jgriffith: that sounds good, thanks
17:44:42 <ildikov> jgriffith: when I checked your spec the last time I saw comments from johnthetubaguy
17:45:08 <jgriffith> ildikov: yes, the addition of a confirmation that nova finished the connection
17:45:47 <jgriffith> ildikov: My latest update added an acknowledge option to the create_attachment call
17:46:03 <ildikov> jgriffith: yeap, we touched on that briefly today too
17:46:03 <jgriffith> ildikov: so you'd leave the state in attaching until Nova comes back and says "finalize" it
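Roughly, the two-phase flow jgriffith describes would look like this; every name here (create_attachment, finalize_attachment, the acknowledge flag) is a placeholder for whatever the spec ends up calling it:

    def attach_with_ack(cinder, context, volume_id, instance_id, connector):
        # phase 1: record the attachment, volume stays in 'attaching'
        attachment = cinder.create_attachment(context, volume_id, instance_id,
                                              connector=connector,
                                              acknowledge=True)
        # ... Nova does the os-brick connect on the compute host here ...
        # phase 2: Nova confirms, Cinder flips 'attaching' -> 'in-use'
        cinder.finalize_attachment(context, attachment['id'])
        return attachment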
17:47:11 <ildikov> jgriffith: and if Nova never comes back it remains in attaching state?
17:47:14 <mriedem> isn't that basically what we have today with os-reserve and os-attach?
17:47:33 <mriedem> os-reserve reserves the volume for attach, and os-attach changes to in-use when we're done
17:48:12 <smcginnis> mriedem: We kind of realized that this morning.
17:48:14 <smcginnis> Or at least I did.
17:49:52 <ildikov> smcginnis: kinda same here, I didn't put those together in a flow until now
17:50:05 <mriedem> you need my expert 3rd party input
17:50:07 <mriedem> :P
17:50:13 <scottda> I think johnthetubaguy had pointed out a race last week...
17:50:20 <smcginnis> But this would still clean up what we have now.
17:50:25 <scottda> where the physical attach to the nova host failed.
17:50:46 <scottda> And in the shared-attach situation the cleanup would kill a new attachment that was shared to the same host.
17:51:03 <scottda> And that's why he thought this Ack would be useful.
17:51:17 <ildikov> mriedem: lol :) that we always need :)
17:53:33 <ildikov> scottda: shared-attach as multi-attach?
17:53:56 <scottda> ildikov: no, for those drivers that share a connection for >1 attach to the same host
17:54:04 <scottda> so, multi-attach is required.
17:54:13 <ildikov> scottda: ah, right, got it
17:54:16 <scottda> But it's specific to drivers
17:54:49 <jgriffith> sorry, got pulled away :)
17:54:54 <jgriffith> but looks like you all have it
17:54:59 <scottda> If I had an intern, I would request that they write out the entire state machine for volume attach, including all the weird corner cases.
17:55:23 <ildikov> jgriffith: slowly getting to the same page :)
17:55:37 <jgriffith> scottda: I've already done that :)
17:55:50 <jgriffith> scottda: the only *weird* cases are customizations in the drivers
17:56:07 <jgriffith> this stuff really isn't that hard, until you have 70+ backends to figure out
17:56:11 <scottda> jgriffith: By weird I also mean shelve_offload and nova live_migration
17:56:14 <ildikov> scottda: there are internship programs in OpenStack, so you can have one :)
17:56:34 <jgriffith> scottda: those aren't really that weird in the old sense though, other than the concept
17:56:38 <scottda> and also drivers that share connections to the same compute host. And multi-attach
17:57:01 <jgriffith> scottda: right... and that's a driver/backend complication
17:58:27 <ildikov> smcginnis: mriedem: BTW before we run out of time completely, I saw on the cross-project etherpad that our topic is a bit too specific for that track
17:58:51 <smcginnis> :[
17:59:00 <scottda> ildikov: Yes. It seems that in the past they've preferred we do a cinder-nova sync for stuff like this.
17:59:16 <scottda> Just 2 projects doesn't seem to count for cross-project
17:59:38 <smcginnis> mriedem: I need to finalize the Cinder sessions yet. Do you have enough to dedicate one to cinder-nova, or should I look at doing that?
17:59:43 <ildikov> smcginnis: mriedem: I saw overlaps between the Nova and Cinder sessions, so I would guess we can pick one
18:00:07 <mriedem> smcginnis: we have plenty of room
18:00:18 <mriedem> smcginnis: i already penciled in a nova/cinder session friday morning
18:00:20 <mriedem> in the nova room
18:00:30 <smcginnis> mriedem: Awesome, thanks.
18:00:40 <ildikov> jgriffith: I would guess that with a bit simpler and generic API the specifics could be handled by the drivers
18:00:59 <jgriffith> ildikov: +1
18:01:00 <mriedem> smcginnis: https://etherpad.openstack.org/p/ocata-nova-summit-ideas
18:01:02 <ildikov> mriedem: sounds great
18:01:09 <ildikov> mriedem: tnx!!!
18:01:11 <scottda> smcginnis: I think we wanted a cinder session for the new API that went before syncing with Nova, didn't we?
18:01:29 <ildikov> scottda: I think that would be beneficial
18:01:38 <mriedem> we're over time,
18:01:42 <smcginnis> scottda: Yeah, we had discussed that. Since that is Friday morning, shouldn't be a problem to fit it in somewhere.
18:01:52 <mriedem> but i'd like to come into the summit with, this is the cinder api(s), and how nova is going to use them
18:01:58 <ildikov> scottda: I felt earlier that there are still Cinder specific items where we don't have an agreement
18:02:02 <mriedem> so we can get consensus in person that those are the right things
18:02:28 <ildikov> mriedem: huge +1
18:03:19 <ildikov> mriedem: do we need any info sharing before the summit with at least a rough plan?
18:03:52 <ildikov> mriedem: just so people have a basic idea of what we're planning to do, and we could focus a bit more on some of the details
18:03:53 <mriedem> umm
18:03:56 <ildikov> mriedem:
18:04:00 <mriedem> yeah probably
18:04:04 <ildikov> just an idea/question
18:04:07 <mriedem> i don't want to come into the summit blind
18:04:14 <mriedem> could we do a hangout next week?
18:04:40 <ildikov> mriedem: sure!
18:04:50 <mriedem> basically from a nova pov i'd like the cinder team saying, yup we're on board with the proposed api and think it will work and we understand why
18:05:23 <scottda> mriedem: yes to hangout
18:05:28 <mriedem> if there are warts or unknowns we should talk those through in a hangout
18:05:37 <smcginnis> jgriffith: Think you'd be in good shape to discuss patches in a hangout next week?
18:05:41 <scottda> mriedem: And we have been prodding the cinder team on this. We can put something on this week's agenda as well.
18:05:49 <jgriffith> sure
18:06:10 <ildikov> mriedem: would that be a Cinder-Nova team Hangout? or this team and we might stream it to a broader audience?
18:06:23 <smcginnis> I would think this team.
18:06:29 <smcginnis> To start
18:06:49 <ildikov> ok, we will figure out the info sharing later then
18:07:24 <ildikov> I can either put up a Doodle poll or we re-use the meeting slot for a Hangout call
18:07:35 <mriedem> let's just use this slot i think
18:07:38 <mriedem> if it works for people
18:07:55 <smcginnis> That should work for me.
18:08:16 <ildikov> I'm happy to do both, but reusing the slot is easier than finding another one that works for mostly everyone
18:08:23 <scottda> now is good
18:08:53 <ildikov> then I'll just post a Hangout link here at the meeting time next week
18:08:54 <scottda> or an hour ago, to be exact
18:09:51 <ildikov> scottda: 167 hours later to be super exact :)
18:10:32 <ildikov> is there anything else that we should discuss today?
18:11:21 <ildikov> ok, I take it as a no :)
18:12:13 <ildikov> scottda: let's try to have this topic on the Cinder team's meeting agenda to get the team on the same page and move towards the inter-project agreements
18:12:39 <ildikov> and next Monday 1700UTC we will have a Hangouts call as opposed to the regular IRC meeting
18:12:48 <ildikov> thanks all!!!
18:13:11 <ildikov> #endmeeting