21:01:11 #startmeeting networking 21:01:11 Meeting started Mon Mar 2 21:01:11 2015 UTC and is due to finish in 60 minutes. The chair is mestery. Information about MeetBot at http://wiki.debian.org/MeetBot. 21:01:12 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 21:01:14 The meeting name has been set to 'networking' 21:01:21 #link https://wiki.openstack.org/wiki/Network/Meetings Agenda 21:01:25 mestery: you should see it as a chance to pick new blood 21:01:26 o/ 21:01:29 #topic Announcements 21:01:32 salv-orlando: lol :) 21:01:42 We continue our march towards Kilo-3! 21:01:47 #info Feature Proposal Freeze: March 5 21:01:55 #info Feature, String, and Dependency Freeze: March 19 21:01:58 #info RCs: April 9-23 21:02:02 #info Kilo release: April 30 21:02:07 #link https://wiki.openstack.org/wiki/Network/Meetings 21:02:16 I know I paste these each week, but does anyone have any questions on these dates? 21:02:47 I should highlight that approved specs with no code by this Thursday will be out of Kilo unless they can get a FFE 21:02:59 I know there are a fair number in this state having just looked at the LP page 21:03:08 #link https://launchpad.net/neutron/+milestone/kilo-3 21:03:49 OK, lets move on then. 21:03:56 #topic Bugs 21:04:17 enikanorov enikanorov__: Here? 21:04:26 * dougwig dons his cage suit. 21:04:26 I have a bug to discuss anyways 21:04:27 https://bugs.launchpad.net/neutron/+bug/1426543 21:04:28 Launchpad bug 1426543 in neutron "Spike in DBDeadlock errors in update_floatingip_statuses since 2/27" [High,In progress] - Assigned to Kevin Benton (kevinbenton) 21:04:28 hi 21:04:39 I know kevinbenton and armax have been working that, with plenty of reviews. 21:05:00 enikanorov__: See ^^^ for what was a critical gate bug last week 21:05:31 yeah, seen that 21:05:39 Also, this one: https://bugs.launchpad.net/neutron/+bug/1426397 21:05:41 Launchpad bug 1426397 in neutron "tempest.scenario.test_network_v6.TestGettingAddress.test_slaac_from_os intermittently fails with MismatchError" [Undecided,New] - Assigned to Henry Gessau (gessau) 21:05:46 HenryG is working a fix on the tempest side for that one 21:05:46 What was the question about it? 21:05:55 kevinbenton: No question, just raising awareness :) 21:06:16 Oh okay :) 21:06:28 mestery: thanks for doing it for me ;) 21:06:33 enikanorov__: lol ;) 21:06:37 enikanorov__: Did I miss anything else? 21:06:54 nope. other critical gate issue is already fixed 21:07:02 (I mean the bug with func test) 21:07:08 enikanorov__: Cool! 21:07:25 Anyone else have a bug to discuss this week? 21:07:36 yes 21:07:57 there is that bug with db reties and corresponding patch 21:08:05 let me find a link 21:08:20 https://review.openstack.org/#/c/149261/ 21:08:27 https://bugs.launchpad.net/neutron/+bug/1382064 21:08:28 Launchpad bug 1382064 in neutron "Failure to allocate tunnel id when creating networks concurrently" [High,In progress] - Assigned to Eugene Nikanorov (enikanorov) 21:08:43 so is there a consensus that we want to go with retries? 21:08:51 i'm a bit tired of that neverlasting discussion 21:08:53 salv-orlando: ? 21:09:17 enikanorov__: data tell me we should do as afazekas says, in general 21:09:39 Maybe we should dedicate a summit session to it? 21:09:43 but since what I found out in data contrast with a general mandate against use of lock for updatre 21:10:00 I would say that for fixing that bug we should just make sure we retry with a different vni id 21:10:15 salv-orlando: ++ 21:10:16 salv-orlando: ok, cool 21:10:18 ++ 21:10:18 sorry to interupt you guys 21:10:28 (i'll take everything going after 'but') 21:10:30 :-) 21:10:58 but would really welcome your advice on how to setup devstack with neutron networking as a VM on virtualbox or VM Workstation 21:11:04 so right now the plan is to use the decorator from oslo.db 1.5.0 21:11:21 femtox: CAn you jump over to #openstack-neutron? 21:11:26 femtox: #openstack-neutron 21:11:29 femtox: This is the weekly neutron team meeting 21:12:05 yes, will do thanks. sorry to break in mestery. 21:12:11 femtox: No worries 21:12:13 salv-orlando: what do you mean it contrasts with the mandate against "lock for update" 21:12:26 i think he means 'select for update' and galleria. 21:12:30 galera. 21:12:46 kevinbenton: I mean that there is a general trend against replacing its usage with alternatives, such as queries which do update with a where 21:13:08 dougwig: galleria might be a name for a vpn software 21:13:18 salv-orlando: Or a mall in Minnesota 21:13:22 salv-orlando: i thought we had to replace them because they were broken on galera clusters anyway? 21:13:27 salv-orlando: also evil deadlocks 21:13:48 salv-orlando, mestery: yeah, that was auto-correct, being it's usual self. 21:13:49 mestery: I think there are malls called Galleria every where in the world. There's one in Hatfield near my place too. 21:14:03 lol 21:14:11 and one in NY 21:14:15 'lock for update' gives false promise of serialization in case of gallera. that's one of the reasons 21:14:20 (after deadlocks) 21:14:20 kevinbenton: broken in the sense that they do not work as expected there 21:14:23 normally i try to throw things into the weeds. this time was not my fault, i swear. :) 21:14:43 write-intent locks are always enforced on the local node 21:14:56 they're not propagate on write-sets. Simply because that would not make any sense at all. 21:15:13 ok, thanks for your feedback 21:15:15 so the way galera reacts to a situation where you acquired a lock but you shouldn;t 21:15:23 is by triggering a deadlock error. 21:15:57 Therefore the argument is whether one should retry the transaction on deadlock or design algorithms which avoid the deadlock 21:16:10 salv-orlando: i see 21:16:15 another problem is that the select for update lock doesn't work with postgres in HA 21:16:41 I am seeing problems in production because of that 21:16:43 salv-orlando: shouldn't we try algorithms anyway since we have an abusive relationship with our currentl sql driver which we refuse to leave? 21:16:45 rossella_s: do you have such environment to test? 21:17:02 salv-orlando: we can't avoid the awful 60 seconds to timeout on a held lock 21:17:12 kevinbenton: that's what we're trying to do 21:17:15 hence retries 21:17:32 kevinbenton: that is another good point. But I am afraid that it is a point that here adds only white noise. May I suggest to have a discussion on the already open ML thread 21:17:36 kevinbenton: well the abusive relationship is our creation :) 21:17:38 enikanorov__ I don't have one available right now but I can create one. Anyway customers are reporting that kind of problem, I mean the lock simply is not locking anything and we get duplicate entry errors 21:17:52 kevinbenton: btw, the write set replication deadlock does not have the 50 second timeout 21:17:55 it's different 21:18:06 rossella_s: what's the failure mode? 21:18:33 salv-orlando: i know, but it's also a side effect of "lock for update". i'll take discussion offline 21:18:41 rossella_s: i'm curious because we usually test against galera single writer/multireader 21:19:09 salv-orlando if you have 2 neutron servers they both try to write on the same table even if there's the select for update lock. So one gets the duplicate entry error 21:19:18 kevinbenton: you're right. The whole point of doing write intent locks is that we were too lazy and too afraid to impleent a minimal level of coordination in software 21:19:48 rossella_s: so that would be HA in active/active mode on pg_sql? 21:20:26 salv-orlando drbd 21:21:20 sorry, what is the right way to replace 'select for update' in HA environment of both mysql and pg_sql? 21:21:42 rossella_s: thanks. I think the last time I used drbd my age still started with a '2'. I need to document a bit 21:21:55 lol 21:21:59 OK, shoudl we move this discussion to the ML now? 21:22:02 mestery: I think we're diverging a bit 21:22:04 per salv-orlando's advice? 21:22:07 gongysh: by not using it :) 21:22:23 ok 21:22:27 gongysh: generally "UPDATE WHERE" statements have been popular 21:22:44 gongysh: there is no alternative. there is no serialization available for distributed dbs with multiwriter mode 21:22:58 OK, I think we've exhausted the bugs discussion, lets move on in the agenda now. 21:23:17 emagana isn't here so we'll skip our docs update for the week. 21:23:30 #topic Plugin Decomposition Update 21:23:43 armax isn't here, but I wanted to give folks a chance to ask questions in this slots, etc. 21:24:00 Also, armax has a series of patches moving the documentation of the stauts of the effort into the neutron git repo 21:24:12 I encourage folks to add to that (as Plumgrid and midokura have done) when decomposing 21:24:29 https://review.openstack.org/160109 21:24:37 ^^ doc for vendor decomp status 21:25:02 dougwig: Nice! Thanks for the link 21:25:10 #link https://review.openstack.org/160109 21:25:18 OK, lets move on. 21:25:25 #topic nova-network to neutron migration 21:25:28 anteaya markmcclain: Anything this week? 21:26:26 I'll take that as a likely no. 21:26:31 nothing new to share 21:26:34 Cool 21:26:41 #topic LBaaS in Kilo: Update from the other side 21:26:44 dougwig: You're on 21:26:54 let's see if i can bust the rate limiter... 21:26:57 so, we're shipping this lbaasv2 thing in kilo. 21:26:57 it has a new object model and TLS termination/offload support. 21:26:57 models, migrations, extension, plugin, basic ref driver, client/CLI, devstack (in-tree), and api docs have merged. 21:26:57 tempest tests (in-tree), TLS support, scalable-ish agent driver, and vendor drivers are in gerrit. 21:26:57 horizon support won't happen until Liberty, though we're also thinking of seeing if we can do it in-tree instead of in horizon. 21:26:57 we are beginning to assemble thoughts on next steps here: 21:26:58 https://etherpad.openstack.org/p/lbaas-vancouver-planning 21:26:58 we still need to bring the operators into that discussion as well. 21:26:59 any questions/additions/comments? 21:28:01 that's all i've got. 21:28:27 * mestery digests LBaaS 21:28:30 awesome 21:28:36 Nice work dougwig and team 21:29:17 #topic VPNaaS 21:29:20 pc_m: You're up! 21:29:27 Just four items to update folks on... 21:29:44 Plan is to have VPN discussions as part of Neutron on-demand topics 21:30:09 Strong Swan driver is out for review. Need reviewers: https://review.openstack.org/144391 21:30:24 Working on functional job for StrongSwan as well. 21:30:50 pc_m: Is the plan to make StrongSwan the default in Kilo? 21:30:59 Started refactoring for VPN to remove dependency on L3 agent. First commit is out for review https://review.openstack.org/#/c/160179/ 21:31:12 pc_m: so the vpnaas meeting is moving into the on-demand agenda as needed? 21:31:27 mestery: No, but we should in L and then start to deprecate OpenSwan. 21:31:43 OpenSwan is no longer packaged with Ubuntu (14.10+) 21:31:49 dougwig: yes 21:32:15 dougwig: Not much interest/participation in meetings for the past month or so. 21:32:39 we've been discussing something similar for lbaas after v2 ships. 21:32:40 Will work with Carl on the refactoring... I have a few questions. 21:32:48 dougwig: ++ 21:33:06 That's all I have, unless there are more Qs 21:33:12 Thanks pc_m. 21:33:22 Sure. Please help on reviews... 21:33:31 #topic Open Discussion 21:33:41 at least trim down lbaas meetings (octavia, neutron-lbaas) 21:34:07 blogan: You don't like all those meetings? :) 21:34:40 thats a trick question, as i am currently in a meeting 21:35:25 OK, thanks for attending this week folks! 21:35:31 bye 21:35:37 thanks and bye 21:35:37 Remember FPF is this Thursday. 21:35:38 bye 21:35:40 So get those patches out there! 21:35:41 #endmeeting