14:00:04 #startmeeting cinder 14:00:05 Meeting started Wed Jan 15 14:00:04 2020 UTC and is due to finish in 60 minutes. The chair is rosmaita. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:00:06 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:00:09 The meeting name has been set to 'cinder' 14:00:22 #link https://etherpad.openstack.org/p/cinder-ussuri-meetings 14:00:27 #topic roll call 14:00:34 o/ 14:00:40 hi 14:00:43 hi 14:00:43 hi 14:01:00 hi :) 14:01:12 hi 14:01:19 O/ 14:01:23 hi 14:01:37 hi 14:01:39 o/ 14:01:43 hi! o/ 14:01:44 looks like a good turnout! 14:03:05 #topic announcements 14:03:15 some upcoming deadlines 14:03:21 hi 14:03:23 spec freeze: 31 Jan 2020 (23:59 UTC) 14:03:36 new driver/new target driver must be merged by 13 February 2020 (23:59 UTC) 14:03:48 i should probably send out a reminder to the ML about that one 14:04:01 #action rosmaita send email about driver merge deadline 14:04:18 that same week we have Ussuri milestone-2: 13 February 2020 14:04:41 other news 14:04:49 virtual mid-cycle part 1 next week on Tuesday 1300-1500 UTC 14:05:04 thanks to everyone who participated in the poll to select the time/date 14:05:33 we'll be holding the meeting in bluejeans like we did for the virtual PTG 14:05:48 info is here: 14:05:48 #link http://lists.openstack.org/pipermail/openstack-discuss/2020-January/011968.html 14:06:13 also, if you have specific items you want to discuss, please add them to the etherpad: 14:06:21 #link https://etherpad.openstack.org/p/cinder-ussuri-mid-cycle-planning 14:06:42 ok, final announcement is a reminder 14:06:52 Voting for the 2020 Individual Directors of the OpenStack Foundation Board of Directors is now open and will remain open until Friday, January 17th, 2020 at 11:00am CST/1700 UTC 14:07:11 to vote, you have to use the link that is sent to you in your email 14:07:20 so i can't put a link here 14:07:32 and that's all the announcements 14:07:39 on to real business 14:07:41 Thanks for the reminder. ;) 14:07:49 :-) 14:07:52 np 14:08:21 #topic Spec: Volume local cache 14:08:21 #link https://review.opendev.org/#/c/684556/ 14:08:31 LiangFang: that's you! 14:08:49 i see you pushed an update, i haven't had time to look at it yet 14:09:08 I've added more comments on the previous patch 14:09:08 yes, Gorka give lots of comments 14:09:20 in response to LiangFang responses 14:09:26 most of the schedule things 14:10:16 corrently in flavor, no space for volume type 14:10:43 so they cannot schedule it correctly? 14:10:51 yes 14:10:57 some work around 14:11:01 then if os-brick just fails, it will be a nightmare 14:11:19 users attaching volumes to nodes without cache won't know why they can't attach 14:11:44 boot from cinder vol will fail until it lands on a node with cache (if there is one) 14:12:07 they can create the VM to the servers with cache capability 14:12:30 so they could define a flavor that has a cache? 14:12:48 I got the new laptop, and no dictionary installed yet:( 14:13:16 gibi said we can design the schedule later 14:13:48 you may answer this question on the current version, but i will ask it anyway 14:13:51 corrently use zone 14:14:16 currently use zone 14:14:23 or aggragate 14:14:40 my concern is that this is going to be complicated for customers 14:14:41 so the VM can be scheduled to correct zone 14:15:34 is there a reason why a cacheable volume type *must* be cached? could we have it be cached if it lands on a compute that supports it (user picks the flavor for the VM), and not cached if user picks non-cache flavor? 14:16:01 I think that's what's best right now 14:16:21 I would think if someone requested a volume to be cached, it could be confusing to them to end up with a volume that is not cached. 14:16:34 Especially if their service provider charges more for the cached volume types. 14:16:48 I talked with Alex (Nova core), he suggest to fail the request if not supported. 14:17:01 i was thinking of the caching as an attribute of the compute, not the volume 14:17:08 LiangFang: do they have a way to report to the users why it failed? 14:17:15 like we do with the user messages? 14:17:18 so the provider would charge more for a flavor with a cache 14:17:31 rosmaita: makes sense to me 14:17:33 Not sure, but I could see that being a "premium" option. 14:18:00 because the nice thing about LiangFang's choice of open-cas (as opposed to bcache) is that it doesn't have to modify the volume at all 14:18:11 geguileo: we plan the throw exception, 14:18:14 so the volume itself could be attached anywhere 14:18:32 LiangFang: that's useless for users 14:18:43 only useful for admins 14:19:26 rosmaita: I think you were onto something when you said cacheable should be a Nova attribute or something 14:19:26 i think some basics are missing about how this interacts w/ cinder features -- if using a write-back cache, how do you reliably use cinder snapshots, as a user? 14:19:50 eharney: and live migrations must be disabled somehow 14:20:03 it would also affect consistency groups, backups, etc.. 14:20:15 eharney: Good point, we would need a hook for flushing cache. 14:20:21 but for snapshots, you don't have a way to even know if the data you want to snapshot was written 14:21:11 eharney: I believe that's the case for migrations as well, because the hypervisor would just flush and expect that to do what it's meant to do 14:21:14 write-back indeed have data integrity issues 14:21:18 geguileo: right 14:22:01 for data integrity scenarios, they should choose write-through 14:22:29 i think we have to assume that data integrity is a priority in general 14:23:51 Kind of a base requirement for a storage service. 14:24:23 yes, so in most case, write-through is selected by default 14:24:26 well, if you use a cache, you sometimes want to trade off speed for data integrity 14:25:02 write-through is also the default cache mode of open-cas 14:25:04 but we could decide to support only those modes that ensure data integrity 14:25:45 so the speed boost would be mostly for subsequent reads 14:25:52 which is still a plus 14:25:55 rosmaita: sure, but to enable that trade-off we need a mechanism to make sure that things like live migration don't fall apart 14:26:15 eharney: ++ 14:26:41 and i think the spec doesn't cover such concerns yet 14:27:47 should we write this note in manual page? 14:28:15 e.g. tell operator, if you choose write-back, then cannot live migration 14:28:49 LiangFang: that's not good enough, because users and admins are different people 14:28:52 either that, or we don't allow anything other than write-through for now 14:28:57 geguileo: ++ 14:29:30 LiangFang: if we only allowed write-through mode, would that make this feature useless? 14:29:49 rosmaita: then only read io could be boost 14:29:57 rosmaita: we wouldn't know the mode they have actually selected 14:30:08 because it's configured outside of openstack 14:30:16 geguileo: that's true 14:30:28 but we can document why those modes are not supported 14:30:38 yes, that's what i was going to say 14:31:01 so we leave it up to the operator how to handle this 14:31:08 if we document it then admins can ignore it and just be careful in those situations (I'm fine with that) 14:31:22 because, if they do a flavor, they could have cache in xxx-mode, another flavor cache in yyy-mode 14:31:40 and then the user picks which flavor to use depending on how risk-averse they are 14:32:27 that changes the approach we had 14:32:32 but I like it 14:32:43 not being a Cinder volume thing 14:32:56 except for the cacheable True part 14:32:58 right, we are back to just 'cacheable' for volume-type 14:33:06 because there are backends that don't support caching in OpenStack 14:33:53 I really think the way to go is that 'cacheable' means that this volume *could* be cached if it lands on the correct hypervisor 14:33:57 so in the end Nova decides where to schedule, what mode it wants, etc 14:34:13 rosmaita: on Cinder side yes 14:34:25 rosmaita: it would be True by default and we would return False on RBD and RemoteFS 14:34:30 having "cacheable True" seems like it assumes we could swap a different caching tool in later when we support a second one, do we know if that's the case? (it's not if part of your data is stored in an open-cas writeback cache..) 14:35:34 it really seems like we need some "live" discussion of this 14:35:40 eharney: you are talking about in-use retype? migrate? 14:36:02 geguileo: talking about making sure we don't get stuck when we try to add another cache system into Cinder later 14:36:04 Definitely a midcycle topic I think. 14:36:15 eharney: because cacheable only means, from Cinder perspective, that there is a /dev/xyz device that can be used by the cache system 14:36:50 eharney: I don't think we would get stuck on a second one 14:36:52 geguileo: ok, so the implication for the future is that even if we have multiple cache drivers, you can only use one in a deployment, then? 14:37:16 eharney: wouldn't that be a Nova thing? 14:37:23 geguileo: i'm not sure it would be 14:37:30 how so? r:-?? 14:37:51 are we using this cache during reimage? image->volume? backup? 14:38:17 if the cache mode is hard coded to write-through, then even in future there's a different cache software, it still works 14:38:27 LiangFang: when is the nova spec deadline? 14:38:32 i think the spec needs to spell out more about the interactions between cinder/nova and the cache 14:38:56 eharney: +1 14:39:01 rosmaita: sorry, I know very near, but don't know the date 14:39:06 eharney: ++ 14:39:12 i couldn't find the date either 14:39:48 i think we really need to discuss this at the midcycle so we can give you specific things that need to be addressed, and so that we can determine what these specific things are 14:40:03 eharney: the cache is only in compute node 14:40:19 LiangFang: if so, that has implications for how backups work 14:40:25 rosmaita: ++ 14:40:31 rosmaita: ok. thanks 14:40:36 and reimage, etc 14:41:02 Feb 10 - Feb 14: Ussuri-2 milestone, nova spec freeze 14:41:09 https://wiki.openstack.org/wiki/Nova/Ussuri_Release_Schedule 14:41:15 i think eric is bringing up some good points that we need to think about whether they impact your spec or not 14:41:17 geguileo: thanks 14:41:47 if the cache mode is write-through, then from cinder point of view, seems can ignore it, but all the data is backend volume 14:41:58 so it looks like if we can get this discussed on Tuesday, we can get it hammered out next week and there will still be some time on the nova side 14:42:30 especially if we push the idea of putting the cache as a flavor instead of into the scheduler 14:42:38 but we can discuss on Tuesday 14:42:55 LiangFang: i believe you will be available for the midcycle on tuesday? 14:43:07 yes:) 14:43:11 great! 14:43:19 it's my first time to join 14:43:30 I tried this afternoon:) 14:43:46 were you able to access bluejeans? 14:44:02 https://bluejeans.com/3228528973 14:44:06 yes, as guest 14:44:15 ok, great 14:44:30 but I don't know how to register an account 14:44:48 LiangFang: i think you have to be a guest 14:44:56 OK 14:44:57 it's a paid service 14:45:05 ok 14:45:10 (our use is courtesy of Red Hat) 14:45:57 ok, let's wrap up with everyone please read through the spec before the midcycle so we all are ready to discuss 14:46:05 thanks the discussion today, I will read the chat log later, because I may have not got all the points 14:46:25 ok, great, and you can ask in IRC also if things aren't clear 14:46:32 ok 14:46:51 #topic Update on community goal "Drop Python 2.7 Support" 14:47:11 as is my wont, i put together an etherpad for this: 14:47:14 #link https://etherpad.openstack.org/p/cinder-ussuri-community-goal-drop-py27-support 14:47:43 is xuanyandong here by any chance? 14:48:14 anyway, if you look at the etherpad, we are mostly OK except for cinderclient 14:48:38 the cinder-tempest-plugin question is interesting 14:48:51 eharney: yes, thanks for the reminder about that 14:49:02 i was hoping tosky might know 14:49:11 Tempest is dropping support too. 14:49:23 So though it's branchless, that's part of why they are still tagged. 14:49:26 so do we mark the plugin to only work with sufficiently new versions of tempest, or, what? 14:49:51 i guess we can just do that in requirements.txt 14:50:16 Yeah, since we don't have a lower-constraints file, I guess it would be there. 14:50:25 but the plugin is branchless, so then you have to pin jobs for stable branches too i guess? 14:50:29 But I'm not sure it needs to be strictly enforced in code. 14:50:52 It's a little on the consumer to get the right tempest plugin for the given tempest version. 14:51:07 Tagging of those are done to be able to match up what versions go together. 14:51:15 let's say we're the consumer in the stable/stein gate jobs :) 14:51:46 It *should* be using the stein plugin with the stein tempest, but I'm not entirely sure there. 14:51:55 eharney: no, tempest is going to run with python3 even on older branch 14:52:03 and the plugins are going to be used from master as well 14:52:04 oh 14:52:19 Oh right, because the tempest runtime doesn't matter. 14:52:20 tempest runs in its own venv, and the plugins are installed there 14:52:25 That's separate from the service runtime. 14:52:26 that works then 14:53:08 but yeah, the support for py2 should be removed as soon as all the consumers of cinder-tempest-plugin are not testing with py2 anymore 14:53:31 The python-cinderclient functional failure is odd. Looks like the project isn't being installed so it's not finding the cinder command. 14:53:50 yeah, ignore that result 14:54:01 i am having duelling patch sets with xuanyandon 14:54:05 re python-cinderclient: if we run out of time, just revert to the in-tree playbook for now and we can figure it out later 14:54:27 yeah, i am going to do what tosky suggests this afternoon 14:55:19 smcginnis: i fixed the missing /bin/cinder 14:56:22 ok, so it looks like the cinder-tempest-plugin is already using py3 as its basepython 14:57:03 i think we are ok there 14:57:23 Does this mean that drivers should also start abandoning py2? 14:57:42 whfnst17: They can, but there isn't a priority to remove compatibility code. 14:57:54 ok 14:58:09 only remaining issue is Thing 5 on that etherpad 14:58:20 update setup.cfg to *require* py36 14:58:43 that's the thing nova did in november that broke everything 14:58:53 but we should be in a better place now 14:59:13 it does sound like the right thing to do, but i don't know a lot about possible side effects 14:59:17 but i imagine that will be one of those bot-proposed items for all repos 14:59:27 so i don't think we need to do it? 14:59:36 I don't think so. 14:59:43 ok, cool 14:59:52 well, sorry that we are out of time for open discussion 15:00:01 see everyone on Tuesday at the virtual mid-cycle! 15:00:03 I think there's a difference between saying we don't support py2, and actually enforcing in code that it's absolutely disallowed. 15:00:09 Or even for py35. 15:00:23 smcginnis: that makes sense 15:00:27 #endmeeting