14:00:26 #startmeeting cinder 14:00:26 Meeting started Wed Feb 16 14:00:26 2022 UTC and is due to finish in 60 minutes. The chair is rosmaita. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:00:26 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:00:27 The meeting name has been set to 'cinder' 14:00:36 #topic roll call 14:00:46 o/ Kind-of. :-) 14:00:47 hi 14:01:13 Hi 14:02:01 hi 14:02:05 o/ 14:02:24 o/ 14:02:57 hi 14:03:11 ok, let's get started 14:03:16 hi 14:03:19 hi 14:03:27 #link https://etherpad.opendev.org/p/cinder-yoga-meetings 14:03:39 #topic announcements 14:03:41 hi 14:03:50 Festival of XS Reviews on Friday, 1400-1600 UTC 14:04:13 you can find links and info in the top of the agenda etherpad 14:04:21 in the "Spotlight Links" 14:04:45 upcoming stuff: 14:04:51 cinderclient release next week 14:05:41 rosmaita: do we have a open questions moment in the Cinder meetings? 14:05:55 rafaelweingartner: sure, at the end of the meeting 14:06:48 there are a few open patches, not much, though: project:openstack/python-cinderclient status:open branch:master 14:07:10 if you want your patch to be considered, make sure you quickly address any negative votes, and that it's passing zuul 14:07:44 we're adding at least one new mv to cinder this cycle, so look for the patch about that 14:07:59 on that topic, btw, the cinder-side patch I'm talking about is: 14:08:25 mv stand for? 14:08:33 microversions 14:08:38 i have it open somewhere 14:09:03 #link https://review.opendev.org/c/openstack/cinder/+/822040 14:09:26 ok, thanks 14:10:01 I'm also working on the reimage feature, will work on getting the client patch passing zuul 14:10:18 whoami-rajat: remind me, did that entail a new mv? 14:10:26 yes 14:10:34 it's a new volume action 14:10:45 ok, then let's start an etherpad for cinderclient release 14:10:57 we need those 2 cinder patches on there, because they need to merge first 14:11:36 #link https://etherpad.opendev.org/p/yoga-cinderclient 14:11:59 hi 14:12:40 whoami-rajat: do you have a link to your cinder change? 14:12:53 sure, 1 sec 14:13:21 rosmaita, https://review.opendev.org/c/openstack/cinder/+/606346 14:13:40 thanks 14:16:08 ok, we also have the cinder FEATURE FREEZE next week 14:16:21 it applies to cinder and drivers 14:16:32 we are working from these blueprints: 14:17:15 #link https://blueprints.launchpad.net/cinder/yoga 14:17:40 please update your BP with links to the code patches and update your status 14:18:49 if your feature is *not* ready for review now, it's unlikely it will be reviewed, revised, and merged by next thursday 14:19:12 so ping me in irc and i will move it to the 'zed' series 14:19:28 #action rosmaita - set up zed series in launchpad 14:20:01 if you have a driver feature, please help us out in the reviews 14:20:16 make sure that your patch has passed your CI 14:20:29 do a fresh run if you need to, we need to logs to be there 14:20:59 would be helpful if you link into your logs to show that your feature is working 14:21:21 reviewers can do that themselves, but it takes extra time, which means slower reviews 14:21:36 so if you want to speed things up, do what you can to help us out 14:22:09 so the review priorities over the next week are: 14:22:22 #link https://etherpad.opendev.org/p/yoga-cinderclient 14:22:25 and 14:22:33 #link https://blueprints.launchpad.net/cinder/yoga 14:22:58 and of course, critical bugs and security bugs, but those can be fixed after feature freeze 14:23:07 so please concentrate on feature reviews 14:23:35 finally, next week's meeting will be in videoconference 14:23:51 usual time & place, i will make sure connection info is on the agenda 14:24:04 #topic os-brick release status 14:24:16 you may have seen the email i sent about this yesterday 14:24:24 #link http://lists.openstack.org/pipermail/openstack-discuss/2022-February/027296.html 14:25:12 i think yuval's patch should be just about ready, i think he pushed a new patch set this morning 14:25:15 #link https://review.opendev.org/c/openstack/os-brick/+/823654 14:25:33 the other one is https://review.opendev.org/c/openstack/os-brick/+/811718 14:25:52 What is the remaining issue? 14:25:54 geguileo was looking at it and discovered a different but related bug 14:26:21 We have no gerrit notice on that 14:26:24 geguileo: what are your thoughts on https://review.opendev.org/c/openstack/os-brick/+/811718 14:26:26 the problem I found is that we removed the call to nvme disconnect 14:26:44 which is wrong for some cases 14:27:03 we have 2 variables depending on the driver/storage 14:27:16 1. whether the targets are shared (subsystem) 14:27:29 2. whether the storage system support namespace AER 14:28:04 for the LVM driver that doesn't do shared targets and doesn't support NS AER, the removal of the disconnect creates a problem 14:28:33 after disconnect the nvme kernel driver keeps trying to connect to the namespace 14:28:39 we did not remove a disconnect in that patch... 14:28:41 and it only gives up after 600 seconds 14:28:47 it was on another one 14:28:57 and I was trying to find out if we could have race conditions 14:29:46 if that condition exists it is upstream of this patch. 14:29:50 and during those 600 seconds we cannot reconnect that volume to that same host 14:30:08 chuckpiercey_: let me look for the patch where this was introduced 14:30:48 https://review.opendev.org/c/openstack/os-brick/+/800014 14:30:59 in this patch that is being proposed there is a rescan added 14:31:12 and if the backend doesn't support NS AER, then we have a race condition 14:31:58 and for 600 seconds after the rescan the previously removed device (that got back in due to the rescan) will not be able to connect (I think) 14:32:50 so basically with all the changes to the nvme driver what we are doing is supporting just storage systems that support NS AER 14:33:08 and not supporting those that don't support it 14:33:08 this isn't our patch - we'll take a look and respond with our thoughts in that patch. That one is a big patch. 14:33:34 these are 2 different things, but they are both related 14:33:46 both are related to NS AER 14:34:04 OK, we have to release yoga os-brick tomorrow 14:34:10 due to that older patch we cannot connect recently removed volumes to the sam host 14:34:22 811718 is small and is intended to verify current connections before reconnect 14:34:28 i guess we need a "known issues" note 14:34:46 in this newer patch we would be introducing a new race condition for storage systems that share targets and don't support NS AER 14:35:43 ok, but we already have the disconnect problem for systems that don't support NS AER 14:35:51 rosmaita: that would be an acceptable compromise from my perspective 14:36:21 ok, i will put up a release notes patch 14:36:41 basic idea is, if you are using nmve that doesn't support NS AER, you are SOL 14:37:19 rosmaita: yup 14:37:25 and geguileo, please file a bug about the disconnect issue, i can link to that 14:37:43 so where does that leave us with https://review.opendev.org/c/openstack/os-brick/+/811718 14:38:44 rosmaita: I would say that the code is fine 14:39:08 ok, if you can drop your -1 from that patch, that will be helpful 14:39:10 because rescan is the recommended mechanism by the kernel nvme driver instead of a restart 14:40:02 rosmaita: done 14:40:23 i have got to say, i do not feel comfortable +2ing any of these nvme patches any more 14:40:50 i look back at these patches that have caused regressions and bugs, and i do not have a good track record here 14:41:13 so: my question for cinder-core members right now is: 14:41:24 who will review https://review.opendev.org/c/openstack/os-brick/+/811718 ? 14:41:45 we have to release tomorrow 14:42:19 rosmaita: I will review it, and I'll most likely give the +2 14:42:46 thank you 14:43:07 we need a second person ... 14:43:15 rosmaita: after all theirs is the only driver using multiple namespaces for a subsystem 14:43:15 rosmaita: let me try to run our CI for it 14:43:26 e0ne: that would be very helpful 14:43:34 e0ne: what nvme storage are you using? 14:46:25 hmmm, e0ne may have been disconnected 14:46:44 it seems we don't have CI now :( 14:46:50 ouch :-( 14:46:54 it's the mellanox SPDK CI, is that right? 14:47:16 I'll try to catch my colleagues tomorrow 14:47:19 rosmaita: yes 14:47:42 ok, thanks 14:47:55 we are sort of relying on that CI to prevent regressions in the nvmeof connector 14:48:14 we need to have a discussion about nvme at the PTG 14:48:58 we have kioxia, lightbits, dell/emc, and pure all wanting to implement stuff 14:49:01 rosmaita: I'm currently looking into making a new nvmet target for the LVM driver 14:49:16 If I can help somehow please let me know 14:49:18 one that uses shared targets and returns the new connection information type 14:49:33 geguileo: so we could use regular Zuul CI to detect regressions, is that correct? 14:49:47 rosmaita: some of them at least 14:50:18 and then we could use the LVM driver for both kind of connections: sharing subsystem and not sharing 14:51:58 ok, i will let the next PTL figure out a strategy for coordinating and testing nvme-of efforts 14:52:01 geguileo how can we test it without nvme storage? 14:52:14 Yaron would be happy to help on that @brian 14:52:20 e0ne: right now we can test NVMe with RDMA and TCP using the LVM driver 14:52:44 I even have a patch up for review to do this automatically with devstack 14:52:57 yes, but t require NVMe SSD doesn't it? 14:52:58 Nice. 14:53:07 chuckpiercey_: ack 14:53:47 ack 14:53:53 e0ne: I don't think it does 14:54:09 ok, we will discuss this more at the PTG 14:54:11 I'm running it in side a VM 14:54:37 ok, moving on 14:54:47 #topic release cadence proposal 14:55:20 proposal is to keep 6 month cycles, but go to "tick-tock" releases, where you can upgrade from tick to tick 14:55:29 look at the proposal to make sense of what i just said 14:55:37 #link https://review.opendev.org/c/openstack/governance/+/828777 14:56:49 the idea is that tick-to-tick could require downtime; our rolling upgrades would only have to support tick-to-tock or tock-to-tick upgrades 14:57:02 that is the idea, i don't know that it's that simple 14:57:29 hopefully there will be a ptg discussion about this before the TC approves it 14:57:49 #topic Call for Outreachy OpenStack mentors - May 2022 round 14:57:57 #link http://lists.openstack.org/pipermail/openstack-discuss/2022-February/027239.html 14:58:14 that email has some useful links about being a mentor and what the commitment is 14:58:27 :D 14:58:40 i think there is some cinder stuff here: https://wiki.openstack.org/wiki/Internship_ideas 14:58:48 probably needs to be updated 14:59:04 enriquetaso_: can you talk a little more about this next week? 14:59:17 yes! 14:59:20 (because we are running out of time) 14:59:25 sure 14:59:29 #topic stable releases 14:59:38 looks like everything has merged? 14:59:43 yes 14:59:52 cool, so nothing to do here 14:59:57 apart from the dell patches in Xena which doesn't has a reply since past 2-3 weeks 15:00:03 well, nothing for us, whoami-rajat has to post the patches 15:00:05 so we will be skipping those in this release 15:00:15 seems fair enough 15:00:26 we are out of time 15:00:29 will post the patches and update in the etherpad 15:00:29 Abount Xena, we would like to ask you guys about the patch: https://review.opendev.org/c/openstack/cinder/+/812685. Would you guys ind reviewing it? 15:00:30 thanks everyone! 15:00:38 also thanks for updating the etherpad rosmaita 15:00:41 mind* 15:01:04 Thank you! 15:01:52 rafaelweingartner: ack, we have several issues around that, so i will be interested to see what you have come up with 15:02:07 #endmeeting