14:08:46 #startmeeting neutron_l3 14:08:47 Meeting started Wed Jun 17 14:08:46 2020 UTC and is due to finish in 60 minutes. The chair is liuyulong. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:08:49 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:08:51 The meeting name has been set to 'neutron_l3' 14:08:59 Sorry, a bit late... 14:09:17 slaweq, haleyb, ping 14:09:23 hi 14:09:30 hi 14:09:36 hi 14:10:12 Alright, let's start 14:10:13 #topic Announcements 14:11:09 #link http://lists.openstack.org/pipermail/openstack-discuss/2020-June/015368.html 14:11:28 This is the ptg summary from the Virtual PTG. 14:12:27 Thanks slaweq for the detailed summary. 14:13:16 #link http://kaplonski.pl/images/Virtual_PTG_2020/photo_3.png 14:13:56 I saw you handsome guys. 14:14:24 #link http://eavesdrop.openstack.org/meetings/networking/2020/networking.2020-06-16-14.00.log.html#l-13 14:15:38 This is the announcements from the team meeting yesterday. 14:16:29 We are in Victoria devloping cycle now, so each spec should be moved to Victoria folder. 14:16:54 OK, no more from me now. 14:17:04 :) 14:17:25 Neutron CI is down, any idea? 14:18:41 #link https://bugs.launchpad.net/neutron/+bug/1883601 14:18:41 Launchpad bug 1883601 in neutron "ovn based neutron gate jobs failing 100% of times" [Critical,In progress] - Assigned to Jakub Libosvar (libosvar) 14:19:14 This is new bug, but seems the real problem is not fixed either. 14:20:11 OK... 14:20:13 #link https://review.opendev.org/#/c/735536/ 14:20:31 This is the gatefix 14:20:53 Next topic 14:20:56 #topic Bugs 14:21:26 #link http://lists.openstack.org/pipermail/openstack-discuss/2020-June/015178.html 14:21:31 #link http://lists.openstack.org/pipermail/openstack-discuss/2020-June/015323.html 14:21:38 #link http://lists.openstack.org/pipermail/openstack-discuss/2020-June/015442.html 14:21:43 We have a long list.... 14:24:45 First one 14:24:48 #link https://bugs.launchpad.net/neutron/+bug/1880969 14:24:48 Launchpad bug 1880969 in neutron "Creating FIP takes time" [Low,New] 14:25:23 IMO, the times spent by the server is ok 14:25:30 c#2 of this LP 14:25:41 (only the Neutron server times) 14:26:38 ralonsoh, yes, agreed. The HTTP response time from the neutron server log should be considered first. 14:27:54 "GET /v2.0/ports?network_id=55c74232-825a-4a4a-b53d-5b4b7aa4ad74&device_owner=network%3Adhcp HTTP/1.1" status: 200 len: 1272 time: 0.0676231 14:28:07 A simple case from my deployment. 14:28:37 A pattern for logstash should be useful. 14:28:56 #link https://bugs.launchpad.net/neutron/+bug/1880532 14:28:56 Launchpad bug 1880532 in neutron "[RFE]L3 Router should support ECMP" [Wishlist,New] - Assigned to XiaoYu Zhu (honglan0914) 14:29:08 I have reviewed the spec one time. 14:29:18 #link https://review.opendev.org/#/c/729532/ 14:29:41 I have to review this spec too 14:31:24 In general, the final use scenarios looks limited to the loadbalancer. The main point is not in the Neutron side. 14:31:44 So let's continue the discussion on the gerrit. 14:31:59 yes, there are some suggestions that it can be done with existing neutron API IIRC 14:32:31 My current plan is to add a new function to extraroutedb.py to handle this requirement. 14:32:52 Hi, you are here. 14:33:01 "XiaoYu Zhu" it's you? 14:33:07 and use api like :/v2.0/routers/27757e09-fb6a-4196-957d-cdce604f087e/remove_ecmps 14:33:11 yes 14:33:20 I am 14:33:23 Welcome 14:36:23 ZhuJoseph, if there are some existing code or POC, you may submit it in parallel, that could also be useful for the upstream team to understand your real requirement. 14:37:12 And do not forget to add the link to the spec. 14:37:29 One more thing, you should move specs/ussuri/l3-router-support-ecmp.rst, to the Virtual folder. 14:37:45 s/Victoria 14:37:49 ok 14:39:16 OK, next 14:39:22 #link https://bugs.launchpad.net/neutron/+bug/1881995 14:39:22 Launchpad bug 1881995 in neutron "Centralized SNAT failover does not recover until "systemctl restart neutron-l3-agent" on transferred node" [Medium,In progress] - Assigned to Ann Taraday (akamyshnikova) 14:39:54 We already have some discussion on the LP, and here is a workaround fix: 14:40:03 #link https://review.opendev.org/#/c/734070/ 14:41:10 For the fix, IMO, it partially revert the fix of the original fix of https://review.opendev.org/#/c/692352/ 14:41:12 IMO this is a workaround 14:41:17 in some case 14:41:51 but if accepted and does not clash with any other part of the code 14:41:53 I'm ok 14:42:31 you know better this code... 14:42:32 The main problem is in the namespace deletion based on my current research. 14:43:05 #link https://bugs.launchpad.net/neutron/+bug/1881995/comments/7 14:43:05 Launchpad bug 1881995 in neutron "Centralized SNAT failover does not recover until "systemctl restart neutron-l3-agent" on transferred node" [Medium,In progress] - Assigned to Ann Taraday (akamyshnikova) 14:43:12 #link https://bugs.launchpad.net/neutron/+bug/1881995/comments/8 14:43:48 I will add some log for this issue as a start. 14:45:01 good finding in c#7 14:45:02 ralonsoh, the pyroute2 namespace deleting could be related. I may need your help. : ) 14:45:08 sure 14:45:20 but where is this called? 14:45:34 Wait a sec 14:45:40 no no 14:45:42 I mean 14:45:46 in this executing 14:45:54 why the namespace is deleted? 14:46:03 *execution 14:46:25 #link https://github.com/openstack/neutron/blob/master/neutron/agent/linux/ip_lib.py#L705 14:46:42 #link https://github.com/openstack/neutron/blob/master/neutron/agent/linux/ip_lib.py#L906 14:47:03 yes and the ns is deleted, so that's ok 14:47:11 but why the ns was deleted? 14:47:56 And finally, https://github.com/openstack/neutron/blob/master/neutron/privileged/agent/linux/ip_lib.py#L542 14:48:17 the qrouter namespace was not deleted successfully. 14:48:30 bug/1881995/comments/7 14:50:22 Or maybe it is concurrent query and deleting. 14:50:44 Delete namespace does not have much log now, I will add some. 14:53:02 OK, next one 14:53:05 #link https://bugs.launchpad.net/neutron/+bug/1882860 14:53:05 Launchpad bug 1882860 in neutron "after FIP is assigned vm lost network connection" [Undecided,Incomplete] 14:53:31 It's a ovn-router related report. 14:54:44 Jakub has left a potential fix of the issue and some questions, no response for now. 14:55:21 Next 14:55:23 #link https://bugs.launchpad.net/neutron/+bug/1883321 14:55:23 Launchpad bug 1883321 in neutron "Neutron OpenvSwitch DVR - connection problem" [High,New] 14:55:56 This is really a complicated issue. 14:57:14 As I said in the fix, there are tons of cases for the real deployment, for instance, DVR, DVR + HA, openflow firewall, network node mixed compute services... 14:57:26 I have a long list. 14:58:04 Let's continue the talk on LP bug. 14:58:08 Last one 14:58:28 #link https://bugs.launchpad.net/neutron/+bug/1883089 14:58:28 Launchpad bug 1883089 in neutron "[L3] floating IP failed to bind due to no agent gateway port(fip-ns)" [Medium,In progress] - Assigned to LIU Yulong (dragon889) 14:58:29 reported by me 14:58:48 I have two patches. 14:59:04 #link https://review.opendev.org/#/c/735432/ 14:59:10 #link https://review.opendev.org/#/c/735762/ 14:59:43 The test case should be simple, just create a fake external network, and create router/network/subnet/VM. 15:00:24 Then just see the changes of fip-namespace on hosts and DvrFipGatewayPortAgentBinding in DB. 15:00:41 #link https://review.opendev.org/#/c/702547/ 15:01:10 IMO, this fix just missed that DVR related clean up action. 15:01:16 OK, we are out of time. 15:01:28 #endmeeting