14:00:18 #startmeeting cinder 14:00:19 Meeting started Wed Aug 5 14:00:18 2020 UTC and is due to finish in 60 minutes. The chair is rosmaita. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:00:20 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:00:22 The meeting name has been set to 'cinder' 14:00:29 o/ 14:00:30 рш 14:00:32 hi 14:00:33 hi 14:00:35 o/ 14:00:36 hi 14:00:36 o/ 14:00:39 Hi 14:00:42 hi 14:00:48 o/ 14:01:01 good turnout 14:01:05 #link https://etherpad.openstack.org/p/cinder-victoria-meetings 14:01:28 not much on the agenda today, so i'll wait another minute for stragglers 14:01:56 o/ 14:02:35 hi! o/ 14:02:35 ok, guess that's everybody 14:02:42 hi 14:02:45 ok, *now* that's everybody 14:02:48 :) 14:02:57 #topic updates - virtual mid-cycle 14:03:26 it's next week on Wednesday at this time, so we won't have a team meeting 14:03:36 actually, we will, it will just be the midcycle 14:03:52 anyway: 12 August 2020 14:00-1600 UTC 14:04:16 we'll do it in bluejeans, connection info is in the email 14:04:26 #link http://lists.openstack.org/pipermail/openstack-discuss/2020-August/016289.html 14:04:44 don't forget to add topics to the etherpad 14:05:08 #link https://etherpad.opendev.org/p/cinder-victoria-mid-cycles 14:05:40 #topic updates - monthly video weekly meeting survey 14:06:02 everyone who took the survey said we should to a video meeting again 14:06:12 all 4 of them 14:06:27 so our last meeting of August will be video + IRC again 14:06:39 #topic updates - gate excitement 14:06:58 you may have noticed that the gates were flakey earlier this week 14:07:05 i believe they are back to normal now 14:07:24 smcginnis put up a patch that got the cinder gate moving for a while 14:07:36 and then a longer-term fix was found 14:07:50 for the lower-constraints job failure 14:07:54 #link http://lists.openstack.org/pipermail/openstack-discuss/2020-August/016295.html 14:08:04 and, grenade was blowing up in stable/ussuri 14:08:11 looks like that is fixed now too 14:08:21 #link https://review.opendev.org/#/c/744753/ 14:08:38 hopefully that will be all the excitement for this week 14:08:55 that's all from me -- anyone have any news they'd like to share? 14:09:55 #topic Release notes for bugs 14:10:02 geguileo: that's you 14:10:09 thanks 14:10:28 we tend to add bug numbers in the release notes 14:10:32 but not always 14:10:53 and some of us just add the number, and others also add the link to the LP bug 14:11:12 I think we should reach a consensus on what we want to have 14:11:17 for consistency purposes 14:11:21 IMO, link to launchpad and some brief description will be great 14:11:31 i agree 14:11:45 ok, whoami-rajat pointed it out in some of his reviews 14:11:45 would it make sense to see if reno could parse and expand the launchpad link automatically from a keyword like lpnnnnn ? 14:11:53 (maybe it's already there) 14:12:07 tosky: oh, if that existed it would be great 14:12:28 i don't think it's already there, but it's a good idea 14:12:29 but in the meantime I think we kind of agree that having the link is what we want 14:12:36 but for now, let's do it by hand 14:12:37 geguileo, yep, rosmaita pointed that out in my reviews and I've been using that since :) 14:12:46 that may get messy if it's a bug on multiple projects 14:12:58 ok, then I think we can start doing that and mentioning it in the reviews 14:13:24 I think we have a reno page in our docs 14:13:35 if we do, I'll mention this resolution there 14:13:44 sounds good 14:13:48 while we are on this topic 14:13:48 ok 14:14:23 i try to point this out in reviews, but it would be good for driver change renos to begin with the driver name 14:14:34 would make it easier for operators to find it 14:14:41 something like 14:14:53 Foo driver: fixed the bar in the baz. 14:15:10 sounds good 14:15:20 and we could do that in the first line of commit messages as well 14:15:21 or Foo driver: `Bug #1234 `_: fixed the bar in the baz 14:15:22 bug 1234 in Launchpad itself "Gina is an unmaintainable mess of command line options, environment variables and shell scripts" [Medium,Fix released] https://launchpad.net/bugs/1234 - Assigned to Daniel Henrique Debonzi (debonzi) 14:15:32 makes sense 14:15:51 well at least bug 1234 is fixed 14:15:59 :D 14:16:01 lol 14:16:04 lol 14:16:14 ok, that will really improve the release notes 14:16:17 thanks, geguileo 14:16:23 any other observations? 14:16:35 I'll propose a format in the patch and we can decide how we exactly want it 14:16:40 thanks everyone :-) 14:16:43 excellent 14:17:11 #topic what to return when a request fails because the default volume type doesn't exist 14:17:20 not a 500 :) 14:17:22 this is something whoami-rajat is working on now 14:17:36 i think this is a legitimate 500 though 14:17:43 it's a server side mis-configuration 14:17:44 specifically create volume request fails 14:18:00 500 sounds right for that case 14:18:01 or anything that needs to access the default volume type 14:18:20 the situation is that we are now requiring the default_volume_type to be set 14:18:25 but if the operator has a typo 14:18:30 then it can't be found 14:18:40 and since we don't allow untyped volumes since traing 14:18:42 *train 14:18:47 we fail the request 14:18:56 we've got InvalidConfigurationValue, 400 (https://github.com/openstack/cinder/blob/master/cinder/exception.py#L249) 14:19:35 but 400 doesn't looks good too:( 14:19:40 400 indicates that the problem is with the request from the user, which doesn't fit this 14:20:01 eharney: +1. that's why I don't like this option 14:20:17 i guess you could stretch things and say if the user calls the admin and complains before making the next request, that's a 400 14:20:55 but it really seems it's a server-side thing 14:21:02 rosmaita: +1 14:21:23 hopefully it won't happen a lot 14:21:25 I don't like 500 errors but I can't propose a good enough solution :( 14:21:34 providing a wrong volume type in request is good for a 400 but sadly returns 404 14:21:42 here 500 makes sense to me as well 14:22:03 no, if the requestor makes a mistake, should be 4xx 14:22:31 because they can fix the request 14:23:22 yep, that was a different topic, just using that to explain why 500 makes sense here 14:23:33 ok, i misunderstood 14:23:47 ok, looks like there's support for this to be a 500 ? 14:23:57 yes 14:23:59 +1 14:24:35 ok, well hopefully operators will be careful in their configuration and won't see this very often 14:25:15 #topic Support volume re-image 14:25:25 #link https://review.opendev.org/#/c/606346/ 14:25:29 rambo_li: that's you 14:25:40 yes 14:26:09 this is a blast from the past 14:26:47 yeah 14:26:57 but the nova rebuild is need this 14:28:49 I well hopefully this reimage api will merged in Victoria 14:30:11 i don't think anyone is philosophically opposed to this, so it's mostly a matter of getting reviews 14:31:06 we should probably add some tests to cinder-tempest-plugin to verify the functionality 14:31:44 i am still skeptical about allowing the reimage operation on volumes with status "error" 14:32:00 that does seem a bit odd 14:32:08 i think it will just not work in many cases 14:34:18 eharney: the volume status is "error" means the volume data is bad, so we can reimage this, any problem? 14:35:07 it could mean many different things, including, there isn't actually a volume 14:35:50 "error" indicating anything about the data on the volume is probably the rare case 14:36:05 rambo_li: what's the status of the nova side of this? 14:37:50 the volume is reserved 14:38:05 when do the rebuild action 14:38:13 ok, let's discuss this more at the midcycle next week 14:38:31 i don't remember that stein spec at all and need to read through it 14:38:41 and also the comments on the nova spec 14:38:51 #link https://review.opendev.org/#/c/739349/11 14:39:26 anything else? 14:40:17 #topic generic NFS online extend 14:40:18 * jungleboyj sneaks in late. 14:40:22 lseki: you're up 14:40:24 o/ 14:41:04 so, currently generic NFS fails to extend an attached volume 14:41:28 and I'm working on a fix that lets Nova do the actual extend operation 14:41:49 the problem is: what if nova fails to do this extend operation 14:42:12 the volume would remain with the original size, but the volume size would get inconsistent in Cinder DB 14:42:50 Lee Yarwood from Nova proj told that Nova creates an action 14:42:54 #link https://github.com/openstack/nova/blob/49c3ac7dfa7aca5504f9d19958ff82a40e47f5a1/nova/compute/api.py#L5316-L5323 14:43:14 which Cinder could poll for, to check if it succeeded or not 14:43:37 I'm not familiar with polling Nova things from Cinder 14:43:59 so I'd like to know if there's a similar code that I could take as an example 14:44:09 We generally don't. We don't want to have tight coupling with Nova, as much as possible. 14:44:11 i'm not sure that cinder currently has any code that consumes events from nova 14:45:21 we do have code to trigger events there in the other direction, presumably doing this would use a similar infrastructure 14:47:55 i think we do some sort of polling for snapshot of attached volumes? 14:48:17 in that case, nova updates a value that cinder can see in the db, we don't have to poll nova 14:49:23 can we do the same thing here? 14:49:30 i am not familiar with that code 14:50:07 we shouldn't copy the way the snapshot update happens, because it's a hacky mess 14:50:17 would probably be easier to just the nova events API 14:50:47 ok, i am anti-hacky-mess 14:51:43 https://github.com/openstack/cinder/blob/master/cinder/volume/drivers/remotefs.py#L1677-L1721 14:52:19 in this case, what is it we're waiting for nova to do exactly? 14:52:20 but as Eric said, we should do the it optimally 14:53:19 i guess libvirt is just extending the file 14:53:39 lseki: could we let cinder give nova a callback, when nova finished, it call callback, and the cinder update DB? need to let nova handle callback 14:55:32 hmm do Cinder & Nova have such callback mechanism? 14:55:43 eharney, waiting for the blockRebase or blockCommit to complete? i think it updates a field in snapshot (progress) to 90 when it completes 14:55:55 whoami-rajat: that's for snapshots, not for extend 14:56:18 eharney, oh, i thought you asked for snapshots case, my bad 14:56:33 lseki: can you map out how you see this working on an etherpad or something, and we can discuss more next week? (you can send something to the ML when your etherpad is up, and we can look earlier and all be ready for a discussion) 14:56:55 please test whether this works with encryption and whether it works with volumes attached to instances that are shutoff 14:56:56 sounds like on the nova side, lyarwood at least is in favor of polling the server actions API 14:57:01 iirc the encryption case is already known to have problems 14:57:27 rosmaita: sure, I will write the alternatives 14:57:30 lseki: the extend is initiated from the cinder side, is that correct? 14:58:11 so we could reject unsupported configurations 14:58:48 rosmaita: correct 14:59:02 ok, that sounds good 14:59:16 sorry everyone, looks like we are out of time for open discussion 14:59:30 thanks! 14:59:31 feel free to talk in #openstack-cinder if you have something on your mind 14:59:32 I rememer resize is initiate from Nova, extend volume is initiate from cinder 14:59:48 ty folks! 14:59:53 #endmeeting