16:00:10 #startmeeting nova 16:00:10 Meeting started Tue Sep 7 16:00:10 2021 UTC and is due to finish in 60 minutes. The chair is gibi. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:00:10 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 16:00:10 The meeting name has been set to 'nova' 16:00:53 o/ 16:01:43 o/ 16:03:31 #topic Bugs (stuck/critical) 16:03:46 no critical 16:03:50 #link 13 new untriaged bugs (-4 since the last meeting): #link https://bugs.launchpad.net/nova/+bugs?search=Search&field.status=New 16:03:57 we have 1 bugs marked with xena-rc-potential tag #link https://bugs.launchpad.net/nova/+bugs?field.tag=xena-rc-potential 16:04:03 https://bugs.launchpad.net/nova/+bug/1942345 fix is going through the gate 16:04:33 and we have the placement transaction scope issue https://storyboard.openstack.org/#!/story/2009159 that is being worked on in https://review.opendev.org/c/openstack/placement/+/807014 16:04:54 I guess we should mark that as RC critical too 16:05:34 I think melwitt's solution is OK but we cannot really make a reproduction test working in the functional env 16:05:40 so I started running nova jobs against the fix 16:05:58 in https://review.opendev.org/c/openstack/nova/+/807558 16:06:16 so far the runs there are not producing the error 16:06:48 so probably we will land that fix without the reproduction test 16:07:24 is there any other bug we should consider as RC critical? 16:07:37 you're saying that we can't repro the failure in functional, 16:07:51 but that's kinda expected and normal for these kinds of load-based heisenbugs right? 16:07:52 * bauzas is now back and around 16:08:25 dansmith: yes, it is a race that is hard to reproduce in a clean env (it needs mysql and it needs parallel transactions) 16:08:29 dansmith: the problem is about mysql 16:08:34 gibi: yeah 16:08:36 hah, jinxed 16:08:55 difficult to write a correct parallel test 16:09:11 yeah, just sounded like gibi was expecting we wouldn't land until we had a repro test 16:09:18 and I'm saying I'd have expected that to be impossible 16:09:26 we tried the repro test but we faild 16:09:32 so I'm OK to land this without a repro 16:10:13 "I guess we should mark that as RC critical too" > yes, please 16:10:42 bauzas: OK, I will. It is already in the tracking etherpad https://etherpad.opendev.org/p/nova-xena-rc-potential 16:11:19 so if there is no other bug for the RC then I have a question about https://bugs.launchpad.net/nova/+bug/1942709 16:11:35 do we support providing the adminPassword to the guest via the metadata service? 16:11:40 I see that the metadata service trying to fetch the password from instance.system_metadata 16:11:43 https://github.com/openstack/nova/blob/402fe188b4e7ff76109e8a5ea1f24a5e915eaa09/nova/api/metadata/password.py#L37 16:12:07 but as far as I see we dont store the password in instance.system_metadata on master 16:12:20 and I was not able to track down changes around this in the git history 16:12:28 * bauzas dunoo 16:13:05 I don't remember if this works with libvirt 16:13:09 I want to say no 16:13:30 we need to look at the code honestly 16:13:52 bauzas: please, I got stuck tracking done if it ever worked 16:14:04 OK moving on then 16:14:15 any other bug we need to discuss? 16:15:19 #topic Gate status 16:15:24 Nova gate bugs #link https://bugs.launchpad.net/nova/+bugs?field.tag=gate-failure 16:15:29 The nova-next job should be passing now after the new osc-placement release on Friday 16:15:42 Allocation deletion conflict still can happen, and it is being worked on in https://review.opendev.org/c/openstack/placement/+/807014 16:15:58 gibi: wish me good luck for finding about the crap in sysmetadata 16:16:07 bauzas: good luck! :) 16:16:33 I'm not tracking other high frequency bugs in the gate I think we are able to merge code to master 16:16:43 Placement periodic job status #link https://zuul.openstack.org/builds?project=openstack%2Fplacement&pipeline=periodic-weekly 16:16:49 placement periodics are green 16:17:11 any other gate issue we need to talk about? 16:18:20 #topic Release Planning 16:18:24 Release tracking etherpad #link https://etherpad.opendev.org/p/nova-xena-rc-potential 16:18:28 We are in Feature Freeze. 16:18:41 I'm not tracking any FFEs 16:18:57 and not tracking any feature patches approved but not landed yet 16:19:06 We need to produce RC1 latest on 17th of September which is in less than 2 weeks. 16:19:17 * bauzas raised fist at reno btw. 16:19:30 yeah, bauzas writing the reno prelude as you see 16:19:32 :) 16:19:32 I can't upload the prelude change because of a weird issue with my rst file 16:19:45 (actually, yaml one) 16:19:57 bauzas: push it up and tomorrow I can look at it 16:19:58 hope to fix it sooner than later 16:20:15 gibi: yeah, if I can't find it, I'll throw it to the gate 16:20:29 the above etherpad has the links to RC bugs and other TODOs 16:20:35 probably something obvious enough that I missed. 16:20:37 any question about the coming release? 16:21:43 #topic PTG Planning 16:21:47 every info is in the PTG etherpad #link https://etherpad.opendev.org/p/nova-yoga-ptg 16:22:06 we don't have much topics yet, but there is still plenty of time 16:22:11 If you see a need for a specific cross project section then please let me know 16:22:40 any question about the PTG? 16:22:58 is there way to set extra_specs to an existing instance without modifying the db? 16:22:59 I guess you can turn to bauzas about those as well as he is the PTL elect :)P 16:23:10 could we have a yoga mat for the PTG ? :D 16:23:15 mloza: let get back to you at the Open Discussion section 16:23:29 bauzas: sure you can, please buy one in decatlon 16:23:33 :) 16:23:49 * bauzas misses swag 16:24:13 #topic Stable Branches 16:24:20 elodilles_pto is off 16:24:25 so we don't have status update on the wiki 16:24:32 but I guess stable is OK :) 16:24:54 any stable issue? 16:24:59 either way, we're entering a pretty quiet period about stable 16:25:27 I have one news we pushed stable os-vif releases today 16:25:38 https://review.opendev.org/c/openstack/releases/+/807694 16:25:41 https://review.opendev.org/c/openstack/releases/+/807696 16:25:44 victoria and ussuri 16:25:53 anyhow moving on 16:26:05 #topic Sub/related team Highlights 16:26:11 Libvirt (bauzas) 16:26:13 ? ;) 16:26:22 nothing to say 16:26:30 #topic Open discussion 16:26:37 nothing on the agenda 16:26:42 mloza: your turn 16:26:52 mloza: do you mean flavor extra_specs? 16:26:59 yup 16:27:17 you have to resize the vm 16:27:20 embedded flavors can't be changed 16:28:44 unless you resize, indeed 16:29:54 right now live migrating is failing because the instance doesn't have extra_specs that the flavor has. I cannot afford to the resize since it incurs a downtime 16:30:33 I don't think that's the reason for a live migration failure, right? 16:31:37 i have host aggregates with metadata 16:31:52 the metadata is associated to the flavors 16:32:10 I see, so it's the aggregate not the flavor 16:32:23 force migrate should skip that, no? bauzas ? 16:33:05 wait, was distracted, sorry 16:33:05 force still runs the scheduler isn't it? 16:33:22 gibi: that depends on the microversion you ask 16:33:25 true 16:33:38 with old enough microversion you can skip the scheduler 16:33:42 mloza: which exact command do you run ? 16:33:43 I was going to say, I thought there was a way.. I mean that's how you break your AZ right? :) 16:34:23 host= force=True microversion <= 2.67 16:34:32 openstack server migrate --os-compute-api-version 2.30 --live-migration $id 16:34:49 with no target ? 16:34:59 yep with no taret 16:35:02 target* 16:35:04 you need < 2.30 and you nee d target 16:35:04 if so, this goes to the scheduler 16:35:13 which checks this indeed 16:35:17 https://docs.openstack.org/api-ref/compute/?expanded=live-migrate-server-os-migratelive-action-detail#live-migrate-server-os-migratelive-action 16:35:30 the scheduler is doing what you asked, in that case.. not sending the host to the aggregate that requires specific specs 16:35:35 mloza: if you dont pass a target then it will use the az form the request spec 16:35:37 gibi: or you can use 2.30 with the explicit force flag 16:35:52 bauzas: yes, or <=2.67 and force=True 16:35:54 (which was deprecated later) 16:35:59 that, yes 16:36:35 I'd say, the contract is granted. 16:36:47 you're asking to migrate something you shouldn't migrate 16:37:02 I know this is weird 16:37:34 if you for the migration will that bypas the aggreate extra spec affintiy filter 16:37:42 i assume that is the one that is blocking the host 16:37:47 id did not think it would 16:37:49 but if all your hosts are within aggregates that set metadata and you have a filter that verifies this, then you explicitly write a contract 16:38:45 i guess with force we will just check the host exist and skip everything else with the old microverion? 16:39:05 that would leave the vm in a state the future move operation will try to move ti out of the aggreate you moved it into 16:39:32 got it to live migrate after passing a target host 16:40:01 OK, I guess that is solve then 16:40:03 :0 16:40:04 :) 16:40:17 any other topic for today? 16:42:10 then lets closed this 16:42:17 thanks for joining 16:42:23 #endmeeting