15:01:24 #startmeeting Scheduler 15:01:25 Meeting started Tue Oct 15 15:01:24 2013 UTC and is due to finish in 60 minutes. The chair is alaski. Information about MeetBot at http://wiki.debian.org/MeetBot. 15:01:26 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:01:28 The meeting name has been set to 'scheduler' 15:01:43 hi 15:01:50 hi 15:02:15 So we don't have an official agenda today since I'm subbing in 15:02:26 But we were going to continue the discussion from last week about summit sessions 15:02:33 We have some discussion queued up from the ML 15:02:39 that too 15:02:42 hi 15:02:58 yes, I figured we could get to ML after a summit discussion 15:03:03 OK 15:03:13 I think we were going to get back to scheduler session too - if @PhilDay is here 15:03:26 Hi Folks 15:03:33 here he is 15:03:38 Is https://etherpad.openstack.org/p/IceHouse-Nova-Scheduler-Sessions erroring for anyone else? 15:03:53 #topic Summit sessions 15:04:08 I had trouble with it in Chrome - but could get to it in IE 15:04:20 me too 15:04:21 I assume some problem with the Etherpad changes 15:04:25 Yes - it is for me @alaski 15:05:02 ok, works in FF but not Chrome for me 15:05:21 PhilD: do you want to recap your suggestion for sessions? 15:05:21 works for me in Chrome.. 15:05:29 it works for me in chrome, version 30.0.1599.6 15:05:40 As for content, I like Phil's revised trio 15:05:51 for the case where we get only 3 sessions 15:05:58 Sure. We have three guaranteed slots from Russell - so I was trying to make sure that we cover all of the major topics 15:06:20 Group scheduling, Making other Metrics available, and performance 15:06:43 And then I tried to map teh proposed sessions (from both the EP and submitted) into those 15:07:08 Some other things will need to take thier chance against the general pool of sessions 15:07:24 ok, so smart resource placement and instance group model would be combined? 15:07:46 yes 15:07:52 I think the best we can do is say to RB "If there are only three slots - please cut it this way" - if tehre is an extra slot, please split this one up 15:07:56 I don't like the idea of combining 15:08:06 alaski volunteered to help with building the schedule, so if he understands the priorities of this group, we should be set :) 15:08:09 It is a lot of big topics combined into a single session 15:08:34 i'm pulling in some folks to help with some of the hard decisions ... 15:08:37 @alaski - its not great I agree, but the alternatiove is that we don;t proivde any steer, and they they get combined in some other way instead 15:08:39 Like Phil said, our top ask would be to split that session 15:08:42 btw, i really appreciate the hard work you guys are doing with planning this stuff 15:09:02 Thanks Russell 15:09:04 russellb: thanks 15:09:44 What surprised me was that Boris doesn't seem to have submitted a performance session yet ? 15:09:58 yeah, I'm a little surprised too 15:10:06 probably he forgot 15:10:08 I'll try to ping him about that 15:10:15 boris-42: are you here? 15:10:26 alaski always here=) 15:10:32 boris-42: :) 15:10:53 are you going to submit a session around scheduler performance, or should someone else? 15:11:12 alaski we almost finish work around refactoring scheduler 15:11:25 alaski and yes I will get results before & after 15:11:45 boris-42: but a design summit session, did you plan to submit one? 15:12:02 russellb there is already for scheduler 15:12:16 PhilD: Debo and I felt combining Group APIs, Smart Resource Placement, Heat and Scheduling into a single session, it is going to be hard to give justice to everything.. 15:12:24 russellb it seems that we have at least 3-4 concepts =) 15:12:27 which, this one? http://summit.openstack.org/cfp/details/34 15:12:34 boris-42: yep 15:12:41 oh 15:12:45 Yathi: I think it would just be group apis and smart resource placement 15:12:50 russelbb I don't thing that this is right session 15:13:02 boris-42: which session are you referring to then? 15:13:13 russellb wait a sec i will found 15:13:15 ok 15:13:20 find* 15:13:40 russellb omg 15:14:17 @Yathi - I underdstand the concern, really I do, but as always we have more material / ideas / proposed sessions than slots - so whatever way we cut this will be a compromise. 15:14:22 russellb oh I thought that there is a session 15:14:25 grayk ping 15:14:25 I tend to agree with Yathi that having an in-depth discussion on all of that would be a challenge. ideally, we can narrow-down the scope to something that we actually plan to implement in Icehouse timeframe.. 15:14:27 garyk ping 15:14:45 boris-42: garyk isn't here today 15:14:53 alaski omg 15:14:59 =) 15:15:07 russellb there is a big thread in mailing list 15:15:18 boris-42: do you want to propose a session for this? 15:15:22 Smart resource placement has a lot of people interested, and there might be even something to demo, using cross services theme 15:15:24 alaski yes 15:15:32 alaski we could discuss all approaches 15:15:33 boris-42: right ... well these guys have been working hard trying to organize schedulre sessions, so that's why they were wondering 15:15:49 Yathi: I would be happy to demo what my group has done 15:15:53 russellb I would like to be on their session imho 15:15:55 boris-42: cool. Can you do it today or tomorrow? 15:15:57 So which of teh other 2 sessions would you drop, scheduler performance or genearlised metrics / ceilometer ? 15:16:15 alaski russellb I will add just a session and link to that thread?) 15:16:31 Both of those topics also have a lot of interest, and a lot to discuss as well 15:16:37 BTW, I also plan to submit a proposal on adding support for multiple scheduler configurations, as indicated in the Etherpad, to discuss the dilemas identified during the work on the blueprint in Havana 15:16:48 boris-42: definitely link the thread if it has good information. But please also summarize it in the proposal 15:16:50 MikeSpreitzer: Same here, I will be happy to demo too, what our group is doing in this area. 15:17:17 probably doesn't fit into either of the 3 themes 15:17:19 alaski Okay will do today but little bit later 15:17:30 boris-42: that works, thanks 15:17:44 @glikson - that's fine, I think there can eb other sessiosn that take thier chance against all of th eother porpoosals. What we're trying to do here is work out what the three "must have" scheduler sessions are 15:17:46 alaski russellb are you interested in benchmark results? 15:17:57 alaski russellb I am testing OpenStack agaisnt 1k servers 15:18:02 PhilD: sure, makes sense 15:18:13 With Boris submitting a session, PhilD's revised trio still looks good, right? 15:18:29 boris-42: benchmarks are good. everyone likes real numbers 15:18:36 Yep - I'd assumed that Boris would submit a session;-) 15:18:44 alaski actually I don't want to speak about numbers 15:18:52 ? 15:18:54 MikeSpreitzer: makes sense to me 15:19:08 alaski I would like to speak about how to get such results on your PC or small amount of servers using Rally 15:19:09 =) 15:19:24 alaski https://wiki.openstack.org/wiki/Rally 15:19:55 oh 15:19:58 even if you don't spend a lot of time on numbers it would be good to hear what you saw 15:20:36 yes, we all are interested in that. 15:20:41 One thing we have to try and get smart on is how to do the cross team sessions (Nova/Heat), Nova/Celiometer, etc 15:21:05 are these cross team sessions - separate from the three you are trying to pick here ? 15:21:07 boris-42: I think we were anticipating a session on how to improve scheduling, not how to measure the improvement 15:21:19 Feels like those are always hard to schedule - and generally slow because both groups have to come up to speed. 15:21:52 MikeSpreitzer: agreed. That's what I'm expecting 15:22:17 I'm asking the question / trawling for ideas really on what we can do to make those effective. 15:22:40 MikeSpretizer I am speaking about another one 15:22:41 MikeSpreitzer: better have some numbers to compare those methods on how to improve scheduling 15:22:55 PhilD: My thinking has been that we should start with what needs to be done in each project, and limit the cross team concerns 15:23:03 PhilD: cross anything is tough, you first have to spend more time than you expected learning each other's vocabulary and way of thinking 15:23:04 MikeSpreitzer scheduler is one thing, OpenStack benchmark system another 15:23:33 boris-42: OK, two sessions, got it 15:23:42 Yes, I definitely like measuring! 15:23:56 MikeSpreitzer yeah we have to simplify it 15:24:18 boris-42: now I'm lost, simplify what? 15:24:33 Ok - so then we can exclude the Heat aspects from the Group scheduler session and the Celiometer aspects from the metrics one ? 15:24:48 boris-42 do you mean benchmark system to be in scheduling sessions or elsewhere? 15:24:48 MikeSpreitzer simplify benchmarking of openstack deployments. Then we will be able to start continues working around performance 15:24:57 PaulMurray NOO 15:24:59 =) 15:25:05 PaulMurray I will bring only numbers=) 15:25:13 boris-42: understand remark about simplification; thanks 15:25:23 boris-42 thanks :) 15:25:36 PhilD: I think that makes sense if we're trimming to the minimum possible 15:26:09 So just to summarize, we're looking at "Group scheduling, Making other Metrics available, and performance" 15:26:23 group sheduling would be instance groups and smart resource placement 15:26:31 boris-42: will open a performance session 15:26:33 I'm not sure I understand the remark about trimming #2 15:26:35 Yep - that would be my core trio 15:27:00 What is the proposed in scope, what is out, for #2 ? 15:27:05 alaski so should I make 2 session? 15:27:08 sessions* 15:27:39 And if there is a chance of a 4th core scheduler session we split group scheduling into "API" and "scheduling implementation" - rather than pick a 4th topic 15:27:40 boris-42: it sounds like it. We want a session on improving scheduler performance, with numbers. And then you can open a session on Rally 15:27:55 alaski Nice thanks 15:28:01 is "making other metrics available"   Generic Scheduler Metrics and Celiometer? 15:28:02 Ok. if there are extra sessions available it will be good to see the API discussion separately 15:28:13 Yathi: agreed 15:28:15 PhilD: that will be good 15:28:22 In fact, quite desired 15:28:27 I hope alaski pushes hard for it 15:29:23 I'll be ready to go with that. But I haven't seen how many sessions are competing for slots right now 15:29:37 I would like more than 3 scheduling sessions if possible 15:29:48 Smart resource placement topic is too big involving several areas, and also covers cross-services. I hope we have a good start with this during the session allotted 15:30:15 @shenewangs1 yes - although it sounds as if were agreed to go with focusing on what can be done inside Nova first, rather than risk a cross-team session 15:30:39 Maybe we should go for a two week summit next time ;-) 15:30:43 Keep in mind that what we really need to cover is work that can be expected to be done in IceHouse, which may help limit scope 15:30:48 @Phil thanks. 15:31:30 alaski: +1 :-) 15:31:42 alaski: Agreed. Our discussions so far in the ML and in scheduler meeting, is to start simple.. get somethign that works within Nova first 15:31:52 maybe we can try arranging additional 'unconference' sessions to deep-dive into the more complicated topics.. 15:31:57 @Phil Maybe we should have nova summit, ceilometer summit:) 15:32:27 Yathi: cool, that seems the best approach 15:32:34 Right. But with an eye towards going holistic. 15:32:47 I am thinking these 'unconference' sessions will be where we will get to demo our initial progress 15:33:05 @glikson I like that idea 15:33:05 But they conflict with official sessions, right? 15:33:28 MS: yes 15:33:44 It is my first summit.. don't know how they work 15:33:52 sorry, networking glitch, I missed what shanewang1 likes from glikson 15:33:58 So are we pretty much settled and ready to move on? 15:34:15 what did glikson say that s. likes? 15:34:24 "unconference" 15:34:33 thanks 15:34:43 if you want to grab space there, do it early 15:34:47 it fills up fast 15:34:47 my first summit too. Will stop whining now. 15:35:03 my second:) 15:35:32 how long do the official sessions last ? 15:35:41 20-30min? 15:35:52 Yathi: 50-60 min I think 15:36:18 How does one get an unconference slot? 15:36:21 scheduler sessions will all get 50-60 mins ? 15:36:22 :-X 15:36:49 Yathi: each session will get that time, yes 15:37:12 could also make sense to try agreeing on acceptable time slots for the unconference before the summit, among the 'core' interested folks (maybe using doodle) 15:38:04 We have to make sure we get interested people to attend the interesting 'unconference' sessions 15:38:23 sounds like an interesting idea 15:39:00 probably would need to happen during the last week before the summit, after the design tracks agenda is finalized 15:39:32 how do you reserve a 'unconference' session? Is it on the spot ? 15:39:41 Yathi: yes, a whiteboard 15:39:47 do we have enough rooms for "unconference", and can two scheduler "unconference"s happen at one time? 15:39:53 when does the whiteboard open? 15:40:12 I would not want two competing unconferences for us 15:40:17 MS: when the summit begins 15:40:42 So why don't we come back to this after some/all of the sessions have been scheduled. Then we can see what might need to go into an unconference 15:40:43 omg 15:40:55 alaski: sounds good to me 15:41:01 russellb: do you know how many 'unconference' rooms will be available? one or more? 15:41:08 shane-wang: ? 15:41:09 alaski: agree 15:41:23 omg "when the summit begins" 15:41:31 ? 15:41:45 alaski: sure. probably would make more sense to have follow-ups rather than completely new topics 15:41:49 MikeSpreitzer1: how would you title the current ML discussion? I'm not exactly sure where it left off 15:41:58 I bet a lot of competition 15:42:24 glikson: not sure 15:42:37 have to run for lunch, ttyl guys! 15:42:41 Well, there have been a few topics in ML. I am most directly involved in discussion of API (including model) for policy to inform joint decision-making 15:43:04 (at first just joint in nova) 15:43:24 #topic APIs for Smart Resource Placement 15:43:51 this is step 1 on garyk's three step roadmap; later steps include supporting lower level APIs and implementation. 15:44:15 I think we are pretty converged on the model 15:44:31 Biggest open question is whether Policy gets its own lifecycle... 15:44:44 MikeSpreitzer1: I just want to clarify that the reason to target Nova first, or any project really is just to get some work done. This topic has come up before but never really got beyond discussion, except for the current Instance Group work 15:44:48 I do not think it ruins anything, I just think it's an unnecessary nuisance. 15:44:59 alaski: +1 15:45:19 I am willing to live with Policy having its own lifecycle, if that's what it takes to get an agreement. 15:45:46 The other question about Policy is scope. That's really an airy thing, at most a matter of naming, at this point. 15:46:45 It is the idea of a Policy model applying outside the scope of an INstance group 15:46:56 Hence the suggestion for a Policy to have its own lifecycle 15:47:21 Not sure if we want to start that discussion here. 15:47:22 yes 15:47:37 In terms of the ML discussions for identifying 15:47:41 Well, does anybody have anything to add? 15:47:54 this is a good place for it. we can hash things out more quickly than on the ML 15:47:55 'unconference' topics - are we do that here ? 15:48:28 Yathi: I think we should see what scheduler sessions are accepted and then look at the unconference 15:48:31 alaski: agreed.. the last 3 - 4 scheduler meetings we have made good discussions, and I really liked it 15:48:59 alaski: ok 15:49:20 So, anything to add to the ML discussion about Policy objects? 15:49:41 hi 15:49:42 so do you like the latest update to the model diagram 15:49:48 I'm new to th group 15:50:00 I've read the desc of policy 15:50:12 i agee with yathi 15:50:25 that it's indepdent and can have its own cycle 15:50:38 Like I expressed in the email, let's limit the INstanceGroupPolicy to have only a reference to a policy that is persisted separately and with a lifecycle of its own 15:51:00 this common "Policy" object can be used by other InstanceGroups if applicable.. 15:51:14 Yathi: +1 15:51:35 Yathi: I understand your argument, and can live with your revised model. If nobody has any additional considerations, I move for unanimous consent on that model. 15:52:09 alaski: do you have any comments ? 15:52:15 (except that maybe there is a little redundancy that I mentioned in private email) 15:52:49 Unfortunately other work considerations have kept me from completely understanding the ML discussion right now 15:53:00 (both policy_uuid and policy_id field in reference object) 15:53:04 so I'm okay with the general consensus 15:53:34 MikeSpreitzer1: I think we have a general consensus 15:54:13 regarding the other objects in the API model - I hope they look okay too? 15:54:48 Yathi, MS: do we have a clear view of instance groups may evolve to include things other than servers? 15:54:50 Yes, the rest of the model is OK. I sent a note about generalizing attachment from groups to allow resources, don't know if anybody noticed or cared. 15:55:14 glikson: great question, I'm also interested to understand that 15:55:16 glikson: I think it's easy, just exapnd the set of resources that can appear in a group 15:55:18 yeah the instance like I mention in the document, can refer to any resource 15:55:41 ..only that no other resources exist in Nova DB :-) 15:55:45 an InstanceGroupMember has a member_id and this id can refer to any resource's UUID 15:55:56 glikson: we are starting within Nova 15:56:05 glikson: right, expanding kinds of resources allowed is connected to later generalizing beyond nova 15:56:08 but we expect to have a global state repository - 15:56:12 Is there a path to breaking this out of Nova though 15:56:28 this is part of the Smart Resource Placement bigger idea 15:56:30 ooh, good question 15:56:36 but starting within Nova first 15:56:58 I think it can be relatively easy to evolve the API, it will be harder with the implementation. 15:57:06 yeah, so the question is whether merging all the schedulers into one is the only way going forward.. I was wondering whether a more distributed approach would make sense. 15:57:29 Nobody is proposing to change the existing schedulers, I think 15:57:32 one small question: where does user add Connection & Metadata to a group? 15:57:34 glikson: we have not fully come to that conclusion yet 15:57:45 yeah, the API is what i'm interested in. What does that need to look like as this moves out of Nova 15:57:46 also, the existing schedulers will continue to work as is 15:57:47 I saw POST instance, but not connection 15:57:47 I note that the existing schedulers can all (more or less) be told a decision already was made by client 15:58:10 Yathi: I don't see how the two can co-exist.. can you elaborate? 15:58:18 MikeSpreitzer1: sort of. They can, but there are no guarantees that it will listen 15:58:30 s/it/they/ 15:58:40 alaski: I see making that fully true as a relatively small and easy evolution 15:59:19 glikson: no concrete thoughts yet, but the idea is for a pluggable scheduler, depending on the current request, hence the existing schedulers can exist 15:59:34 alaski: I mean, let us (when we add holistic scheduling) make the individual existing schedulers all willing to really accept a decision from the client 15:59:41 the POC code I pushed for a smart solver-based scheduler is a new driver that works within Nova 16:00:01 MikeSpreitzer1: agreed. In my opinion I would like to rework that API though. get right of scheduler hints and design a better api, since ripping out scheduler hints isn't much at this point 16:00:41 guys I think we will need to continue this next time or in the ML 16:00:41 alaski: OK, refining APIs is good. But later in our roadmap. For now I am satisfied to expect it can be done. 16:00:47 well, we're at time 16:00:47 alaski: any chance you can come up with a brief summary of your thoughts on this? 16:01:19 glikson: which one? multiple schedulers? 16:01:23 I'll be watching the ML 16:01:41 alaski: no, reworked API, hints, etc 16:02:18 glikson: basically I find the hints api lacking. Based on current work let's see what sort of placement api makes sense and move towards that 16:02:21 we are yet to get to the implementation details behind how the API work will lead to the actual scheduling 16:02:33 but we should give up the channel now 16:02:39 #endmeeting