14:08:46 <liuyulong> #startmeeting neutron_l3
14:08:47 <openstack> Meeting started Wed Jun 17 14:08:46 2020 UTC and is due to finish in 60 minutes.  The chair is liuyulong. Information about MeetBot at http://wiki.debian.org/MeetBot.
14:08:49 <openstack> Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
14:08:51 <openstack> The meeting name has been set to 'neutron_l3'
14:08:59 <liuyulong> Sorry, a bit late...
14:09:17 <liuyulong> slaweq, haleyb, ping
14:09:23 <slaweq> hi
14:09:30 <haleyb> hi
14:09:36 <liuyulong> hi
14:10:12 <liuyulong> Alright, let's start
14:10:13 <liuyulong> #topic Announcements
14:11:09 <liuyulong> #link http://lists.openstack.org/pipermail/openstack-discuss/2020-June/015368.html
14:11:28 <liuyulong> This is the ptg summary from the Virtual PTG.
14:12:27 <liuyulong> Thanks slaweq for the detailed summary.
14:13:16 <liuyulong> #link http://kaplonski.pl/images/Virtual_PTG_2020/photo_3.png
14:13:56 <liuyulong> I saw you handsome guys.
14:14:24 <liuyulong> #link http://eavesdrop.openstack.org/meetings/networking/2020/networking.2020-06-16-14.00.log.html#l-13
14:15:38 <liuyulong> This is the announcements from the team meeting yesterday.
14:16:29 <liuyulong> We are in Victoria devloping cycle now, so each spec should be moved to Victoria folder.
14:16:54 <liuyulong> OK, no more from me now.
14:17:04 <slaweq> :)
14:17:25 <liuyulong> Neutron CI is down, any idea?
14:18:41 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1883601
14:18:41 <openstack> Launchpad bug 1883601 in neutron "ovn based neutron gate jobs failing 100% of times" [Critical,In progress] - Assigned to Jakub Libosvar (libosvar)
14:19:14 <liuyulong> This is new bug, but seems the real problem is not fixed either.
14:20:11 <liuyulong> OK...
14:20:13 <liuyulong> #link https://review.opendev.org/#/c/735536/
14:20:31 <liuyulong> This is the gatefix
14:20:53 <liuyulong> Next topic
14:20:56 <liuyulong> #topic Bugs
14:21:26 <liuyulong> #link http://lists.openstack.org/pipermail/openstack-discuss/2020-June/015178.html
14:21:31 <liuyulong> #link http://lists.openstack.org/pipermail/openstack-discuss/2020-June/015323.html
14:21:38 <liuyulong> #link http://lists.openstack.org/pipermail/openstack-discuss/2020-June/015442.html
14:21:43 <liuyulong> We have a long list....
14:24:45 <liuyulong> First one
14:24:48 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1880969
14:24:48 <openstack> Launchpad bug 1880969 in neutron "Creating FIP takes time" [Low,New]
14:25:23 <ralonsoh> IMO, the times spent by the server is ok
14:25:30 <ralonsoh> c#2 of this LP
14:25:41 <ralonsoh> (only the Neutron server times)
14:26:38 <liuyulong> ralonsoh, yes, agreed. The HTTP response time from the neutron server log should be considered first.
14:27:54 <liuyulong> "GET /v2.0/ports?network_id=55c74232-825a-4a4a-b53d-5b4b7aa4ad74&device_owner=network%3Adhcp HTTP/1.1" status: 200  len: 1272 time: 0.0676231
14:28:07 <liuyulong> A simple case from my deployment.
14:28:37 <liuyulong> A pattern for logstash should be useful.
14:28:56 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1880532
14:28:56 <openstack> Launchpad bug 1880532 in neutron "[RFE]L3 Router should support ECMP" [Wishlist,New] - Assigned to XiaoYu Zhu (honglan0914)
14:29:08 <liuyulong> I have reviewed the spec one time.
14:29:18 <liuyulong> #link https://review.opendev.org/#/c/729532/
14:29:41 <slaweq> I have to review this spec too
14:31:24 <liuyulong> In general, the final use scenarios looks limited to the loadbalancer. The main point is not in the Neutron side.
14:31:44 <liuyulong> So let's continue the discussion on the gerrit.
14:31:59 <slaweq> yes, there are some suggestions that it can be done with existing neutron API IIRC
14:32:31 <ZhuJoseph> My current plan is to add a new function to extraroutedb.py to handle this requirement.
14:32:52 <liuyulong> Hi, you are here.
14:33:01 <liuyulong> "XiaoYu Zhu" it's you?
14:33:07 <ZhuJoseph> and use api like :/v2.0/routers/27757e09-fb6a-4196-957d-cdce604f087e/remove_ecmps
14:33:11 <ZhuJoseph> yes
14:33:20 <ZhuJoseph> I am
14:33:23 <liuyulong> Welcome
14:36:23 <liuyulong> ZhuJoseph, if there are some existing code or POC, you may submit it in parallel, that could also be useful for the upstream team to understand your real requirement.
14:37:12 <liuyulong> And do not forget to add the link to the spec.
14:37:29 <liuyulong> One more thing, you should move specs/ussuri/l3-router-support-ecmp.rst, to the Virtual folder.
14:37:45 <liuyulong> s/Victoria
14:37:49 <ZhuJoseph> ok
14:39:16 <liuyulong> OK, next
14:39:22 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1881995
14:39:22 <openstack> Launchpad bug 1881995 in neutron "Centralized SNAT failover does not recover until "systemctl restart neutron-l3-agent" on transferred node" [Medium,In progress] - Assigned to Ann Taraday (akamyshnikova)
14:39:54 <liuyulong> We already have some discussion on the LP, and here is a workaround fix:
14:40:03 <liuyulong> #link https://review.opendev.org/#/c/734070/
14:41:10 <liuyulong> For the fix, IMO, it partially revert the fix of the original fix of https://review.opendev.org/#/c/692352/
14:41:12 <ralonsoh> IMO this is a workaround
14:41:17 <liuyulong> in some case
14:41:51 <ralonsoh> but if accepted and does not clash with any other part of the code
14:41:53 <ralonsoh> I'm ok
14:42:31 <ralonsoh> you know better this code...
14:42:32 <liuyulong> The main problem is in the namespace deletion based on my current research.
14:43:05 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1881995/comments/7
14:43:05 <openstack> Launchpad bug 1881995 in neutron "Centralized SNAT failover does not recover until "systemctl restart neutron-l3-agent" on transferred node" [Medium,In progress] - Assigned to Ann Taraday (akamyshnikova)
14:43:12 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1881995/comments/8
14:43:48 <liuyulong> I will add some log for this issue as a start.
14:45:01 <ralonsoh> good finding in c#7
14:45:02 <liuyulong> ralonsoh, the pyroute2 namespace deleting could be related. I may need your help. : )
14:45:08 <ralonsoh> sure
14:45:20 <ralonsoh> but where is this called?
14:45:34 <liuyulong> Wait a sec
14:45:40 <ralonsoh> no no
14:45:42 <ralonsoh> I mean
14:45:46 <ralonsoh> in this executing
14:45:54 <ralonsoh> why the namespace is deleted?
14:46:03 <ralonsoh> *execution
14:46:25 <liuyulong> #link https://github.com/openstack/neutron/blob/master/neutron/agent/linux/ip_lib.py#L705
14:46:42 <liuyulong> #link https://github.com/openstack/neutron/blob/master/neutron/agent/linux/ip_lib.py#L906
14:47:03 <ralonsoh> yes and the ns is deleted, so that's ok
14:47:11 <ralonsoh> but why the ns was deleted?
14:47:56 <liuyulong> And finally, https://github.com/openstack/neutron/blob/master/neutron/privileged/agent/linux/ip_lib.py#L542
14:48:17 <liuyulong> the qrouter namespace was not deleted successfully.
14:48:30 <liuyulong> bug/1881995/comments/7
14:50:22 <liuyulong> Or maybe it is concurrent query and deleting.
14:50:44 <liuyulong> Delete namespace does not have much log now, I will add some.
14:53:02 <liuyulong> OK, next one
14:53:05 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1882860
14:53:05 <openstack> Launchpad bug 1882860 in neutron "after FIP is assigned vm lost network connection" [Undecided,Incomplete]
14:53:31 <liuyulong> It's a ovn-router related report.
14:54:44 <liuyulong> Jakub has left a potential fix of the issue and some questions, no response for now.
14:55:21 <liuyulong> Next
14:55:23 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1883321
14:55:23 <openstack> Launchpad bug 1883321 in neutron "Neutron OpenvSwitch DVR - connection problem" [High,New]
14:55:56 <liuyulong> This is really a complicated issue.
14:57:14 <liuyulong> As I said in the fix, there are tons of cases for the real deployment, for instance, DVR, DVR + HA, openflow firewall, network node mixed compute services...
14:57:26 <liuyulong> I have a long list.
14:58:04 <liuyulong> Let's continue the talk on LP bug.
14:58:08 <liuyulong> Last one
14:58:28 <liuyulong> #link https://bugs.launchpad.net/neutron/+bug/1883089
14:58:28 <openstack> Launchpad bug 1883089 in neutron "[L3] floating IP failed to bind due to no agent gateway port(fip-ns)" [Medium,In progress] - Assigned to LIU Yulong (dragon889)
14:58:29 <liuyulong> reported by me
14:58:48 <liuyulong> I have two patches.
14:59:04 <liuyulong> #link https://review.opendev.org/#/c/735432/
14:59:10 <liuyulong> #link https://review.opendev.org/#/c/735762/
14:59:43 <liuyulong> The test case should be simple, just create a fake external network, and create router/network/subnet/VM.
15:00:24 <liuyulong> Then just see the changes of fip-namespace on hosts and DvrFipGatewayPortAgentBinding in DB.
15:00:41 <liuyulong> #link https://review.opendev.org/#/c/702547/
15:01:10 <liuyulong> IMO, this fix just missed that DVR related clean up action.
15:01:16 <liuyulong> OK, we are out of time.
15:01:28 <liuyulong> #endmeeting