16:00:16 #startmeeting Mistral 16:00:16 Meeting started Mon Dec 1 16:00:16 2014 UTC and is due to finish in 60 minutes. The chair is rakhmerov. Information about MeetBot at http://wiki.debian.org/MeetBot. 16:00:18 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 16:00:21 The meeting name has been set to 'mistral' 16:00:31 hi 16:01:04 hi ! 16:02:30 NikolayM, is Jenkins now working fine? 16:02:44 yes 16:02:55 two patches is already merged 16:03:04 ok 16:03:55 hi all 16:04:03 I wonder why devstack gate failed for https://review.openstack.org/#/c/137763/ 16:04:09 Nikolay, please take a look 16:04:14 hi dzimine 16:04:21 ok, let's start 16:04:38 rakhmerov, it is just ok 16:04:45 why? 16:05:10 was it before Nastya's patch merged in or after that? 16:05:23 before 16:05:35 ok, then it should be fine 16:05:37 agenda: https://wiki.openstack.org/wiki/Meetings/MistralAgenda 16:05:39 after Nastya's patch devstack works fine 16:05:44 ok 16:05:55 #topic Review Action Items 16:06:07 we had no previous action items 16:06:28 #topic Current status (progress, issues, roadblocks, further plans) 16:08:25 my status: I took PTO last Friday. Almost finished fixing that race condition, today I changed the design a little bit and just want to do more testing. There's still something that I need to think about it though but I may want to postpone it for a while and switch to "join" 16:09:15 nothing on my side or winson's : long break last week in the US 16:09:53 I'm a little confused. Was it really a whole week break? 16:09:55 added advanced tests on workflow-resume, fixed resume algorithm, fixed "Application context not found" error in tests, fixed bug when we didn't see error at action class init 16:10:09 I thought thanksgiving is just one day 16:10:50 yes, but as it's Thu, everyone is taking Fri off. 16:11:11 NikolayM, very good ) I'm so glad we fixed that "Application context not found" error 16:11:17 it was a pain in the butgt 16:11:27 dzimine, ok 16:11:48 NikolayM, I think that's why nobody answered us in ML about WSME 16:12:12 #topic Release "Kilo-1" progress 16:12:23 I also think so :) 16:13:10 rakhmerov, what is left for Kilo-1? 16:13:26 yep, just a sec 16:13:47 ok, as far as "kilo-1" release we now have effectively 2 BPs that didn't start yet 16:13:54 "pause-before" and HA 16:14:21 Bryan was going to start "pause-before" this week as far as I remember 16:14:29 and it's pretty simple to do I think 16:14:42 if he has some problems we'll help him 16:14:52 is "join" in Kilo-1? 16:15:08 #action rakhmerov, get in touch with Bryan to see how his work's going 16:15:13 dzimine, yes 16:15:29 I'll continue to work on it tomorrow 16:16:06 by the end of this week most likely I'll get it done 16:17:08 Nastya is also about to start doing HA testing, she's now figuring out what's needed: where and how we can do it 16:17:39 I don't think this is something that could be done just till the end of this cycle 16:17:41 though 16:17:49 we have mistral 0.2 planning doc at https://docs.google.com/a/mirantis.com/document/d/1ycCOBS4gj5-3vbHjBEThTJ9gSNOOWGpX0kgdcfVw9IY/edit# 16:17:52 before or after rakhmerov's fix? 16:17:57 IMO, this is an ongoing piece of work 16:18:03 don't dorget about it :) 16:18:20 we won't 16:18:25 dzimine, what do you mean? 16:19:14 regarding HA testing I mean that we'll do some basic testing for now to make sure our main functionality works at scale 16:19:28 but we'll keep working on that moving on 16:19:45 I mean any load may be problematic without the fix that you have on a review. 16:19:56 Anyways it'll take care of itself :) 16:20:02 by exposing the bugs. 16:20:10 ooh, I hope it will be in in 1 day 16:20:16 in the worst case in 2 days 16:20:30 yes :)) 16:20:46 Nastya do you have a plan on what/how to test? 16:21:26 we had a weird bug when workflow locked on 2 cpu VM. I'll ask Winson to report it. 16:21:35 btw, dzimine, if you have your own requirements on what needs to be tested first (I mean your interest) please share 16:21:56 yes, you told me but it's really weird 16:22:12 would like to see some details (may be logs?) 16:22:25 I think Nastya is not here today 16:23:04 yes, I think he managed to repro it on his dev env, I have just asked him to file it. 16:23:24 we already discussed the general plan and she definitely has a high-level understanding 16:23:34 ok 16:23:42 would be helpful 16:24:24 btw, dzimine, did you try Mistral in multiprocess configuration on some of your tasks? 16:24:52 it's when you have separate engines and executors (may be even api's) 16:25:11 no. 16:25:17 ok 16:28:04 ok, I think we're mostly on target. Productivity after summits is always not the best but I feel it's not increasing :) 16:28:14 #topic Open Discussion 16:28:25 any questions to discuss from your side? 16:28:30 ok, little thing: 16:28:38 some concerns, roadblocks? 16:28:38 yes 16:28:49 task defaults with reverse workflows, obviously don't work, right? 16:29:02 ....it's now increasing.... 16:29:10 the question is how to set up a default failure policy with reverse workflow. 16:29:32 dzimine, we need to check it 16:29:35 with "join" implemented it may not be pressing, but at least need t odocument. 16:29:38 I'm not sure it doesn't work 16:30:04 I guess policies should work with task defaults in case of reverse workflows 16:30:07 honestly I didn't check if it can work, was just curious 16:30:12 let us check that pls 16:30:14 let's check. 16:30:18 yep 16:30:56 Next: I am thinking we (I?) should do a bit more in writing specs. 16:30:58 #action akuznetsova: check if task-defaults work for reverse workflows (policies) 16:31:03 We do write blueprints, 16:31:16 but not in hi level of details. 16:31:22 agree 16:31:36 it's all about priorities, you know 16:31:39 and than, when feature is implemented 1) not all corner cases are there, 16:31:50 and 2) we dont know if we are done or not. 16:31:59 yes 16:32:01 Example: for-each. 16:32:13 It's so good to have it in, 16:32:20 strictly speaking, "for-each" is not 100% ready 16:32:27 now I have a bag of feedback on this. 16:32:40 result ordering, sequential processing are not done 16:32:46 for-each for now has a set of limitations 16:33:26 dzimine, can you pls share that feedback in some form? BPs, specs? 16:33:50 NikolayM, please create new BPs for what's left on "for-each" 16:33:55 well that's my point, "when"? is it ready? what is left? where do we track this? all this project management routines that keep it safe 16:34:15 #action: NikolayM, create new BPs for what's left on "for-each" 16:34:30 ok 16:34:43 So the smallest step I think is to put more details into the blueprint, that describes in more details on what needs to be done, how it is expected to work, etc. 16:35:06 https://blueprints.launchpad.net/mistral/+spec/mistral-dataflow-collections 16:35:26 rakhmerov and I have discussed pretty deep corner cases of for-each, I can help capture them in a blueprint. 16:35:49 let's just have a discussion in the BP itself (I know it's not very convenient but there isn't a better way now I think) 16:36:01 ok 16:36:35 And a new one that we thought of and people just asked to do is to introduce a "parallel" count to "throttle" the execution. parallel=1 makes it sequential. 16:36:41 I am ok with creating new BPs if you find it convenient 16:37:11 you still mean "for-each" thing? 16:37:16 yes 16:37:40 ooh, you're saying it's not just a binary choice "parallel" or "sequential" 16:37:43 ok 16:37:57 I see 16:38:20 NikolayM, it's going to be even much more interesting than we thought ) 16:38:49 I think it's a nice feature 16:39:48 I'll type it up in the etherpad referenced from the blueprint. 16:39:51 because if we, for example, have an array of a thousand elements and need to create a thousand parallel iterations then it may be an overage for just one task 16:39:58 ok 16:40:11 this is very good 16:40:29 But back to a general comment, let's do a write-up on what we implment, and if there are any cut corners call them out after implementation. 16:40:51 ok 16:41:31 the use case for parallel throttling is also concern on the target system. In our example it's VMWare vCenter and the user wants to keep it at no more than 5 new VM requests at a time. 16:41:55 yes, I'm actually thrilled to get "join" and "for-each" asap 16:42:19 yeah, you told me 16:43:03 but when we were talking about it I was thinking about a system-wide throttling rather than a "for-each" parameter 16:43:11 it should be both eventually though 16:44:17 #action dzimine, put information about additional features/requirements for "for-each" in the etherpad 16:44:26 ^^ done. 16:44:32 ok 16:44:44 one question that I have 16:45:01 do we need to support Keystone API v2 in Mistral? 16:45:06 any input on that? 16:45:09 Nikolay, how do you track what's not done in for-each? E.g., ordering of results, handling multiple arrays, etc? 16:45:31 Keystone v2 - anyone asked for it? 16:45:39 yes, that's the point 16:45:49 what's the context& 16:45:50 ? 16:45:55 2-3 people last week asked about it 16:46:45 the point is: they have Icehouse OpenStack where they don't have Keystone API v3 enabled because they think it's still experimental 16:47:08 my understanding 16:47:15 dzimine, it is in my notepad 16:47:30 unfortunately, I don't have a comprehensive info on that topic at the moment 16:48:06 so they said: can we have Mistral work with v2 or create some workaround for that? 16:48:19 the problem is that we use trusts 16:48:36 which actually don't work well too for us 16:49:06 but at least the concept of trusts seems to be helpful for us 16:49:46 so, I'm not expecting us to answer that right now but we need to figure that out within days 16:50:00 already create a BP (in discussion status) 16:50:03 created 16:50:43 you can leave your thoughts in it: https://blueprints.launchpad.net/mistral/+spec/mistral-keystone-api-v2-support 16:50:43 what will it take to support v2? did someone looked at it? NikolayM you know keystone pretty well? 16:51:42 the whole problem is in auth token expiration 16:52:12 for example, when a trigger runs a workflow we don't have user credentials 16:53:16 there are only two ways: authenticate using Mistral's system credentials (from config) or use a previously trust (basically impersonated credentials) 16:53:32 the first way is ugly because it may be a security breach 16:54:36 honestly, I don't see a good solution at this point 16:55:06 technically, if we come up with a solution it should be easy to implement 16:55:58 Dmitri, maybe you could discuss this thing with your guys 16:56:11 we don't use much of keystone. 16:56:34 yeah, but some of you may have seen a similar problem before 16:56:41 ok 16:56:43 and the problem with credentials is not limited to keystone. Talk to Fuel guys, they'll need sets of credentials to all sorts of targets 16:56:54 I know 16:56:55 yes 16:57:45 I heard that keystone folks were planning to drop trusts completely, btw, and introduce a different thing 16:57:55 but didn't see any detailed description so far 16:58:14 ok, let's finish for today 16:58:28 about a minute and a half left 16:58:43 thanks for coming 16:59:03 Dmitri, let's be in touch with you. We need your input 16:59:12 bye 16:59:29 #endmeeting