17:00:44 #startmeeting ironic
17:00:44 Meeting started Mon Jan 12 17:00:44 2015 UTC and is due to finish in 60 minutes. The chair is devananda. Information about MeetBot at http://wiki.debian.org/MeetBot.
17:00:45 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
17:00:47 The meeting name has been set to 'ironic'
17:00:50 \o
17:00:52 #chair NobodyCam
17:00:53 Current chairs: NobodyCam devananda
17:01:06 g'morning / afternoon / evening, all
17:01:20 morning here :)
17:01:21 o/
17:01:24 o/
17:01:31 o/
17:01:33 as usual, the agenda can be found here -
17:01:35 #link https://wiki.openstack.org/wiki/Meetings/Ironic
17:01:36 o/
17:01:39 hi
17:01:42 morning
17:01:43 oh rloo has a v6 ip neat
17:01:59 :)
17:02:11 I do? (oh yeah, I'm hip...)
17:02:29 I left several of the items from last week's meeting on there - it was our first post-holiday meeting, and I'd like to at least touch on everything again since many of the usual folks weren't there
17:02:56 #topic announcements
17:03:30 probably the only announcement from me is the midcycle(s)
17:03:56 devananda: is there a list of who is going to which meetup?
17:04:21 * NobodyCam assumes both are official now?
17:04:33 NobodyCam: not yet - but there are meetup sites that folks should be signing up at
17:04:45 (apologies for being a bit slow - my links are missing ...)
17:04:52 is the SF one actually going to happen then?
17:05:03 yep
17:05:10 yes!
17:05:17 Rackspace is hosting another sprint in the SF bay area the week after the Grenoble sprint
17:05:25 all y'all should come visit us
17:05:28 since there are a lot of folks who apparently can't make it
17:05:41 Shrews: the j's are holding it weather its official or not as I understand it
17:05:45 lol
17:06:14 awesome
17:06:20 jroll: didn't ya'll post something on the ML ?
17:06:32 devananda: I think just the wiki
17:06:35 I could email
17:06:41 oh - right
17:06:48 that's why I cant find the link
17:06:51 I don't have an eventbrite or anything, unclear if I should
17:06:56 jroll: ++ That would be good I think
17:07:11 #link https://wiki.openstack.org/wiki/Sprints/IronicKiloSprint
17:07:12 details are there
17:07:27 grenoble meetup: https://www.eventbrite.com/e/openstack-ironic-kilo-midcycle-sprint-in-grenoble-tickets-15082886319
17:07:36 SF meetup: https://www.eventbrite.com/e/openstack-ironic-kilo-midcycle-sprint-in-san-francisco-tickets-15184923515
17:07:39 oh, there is an eventbrite!
17:07:40 awesome
17:07:43 I can try and add more information to that wiki if desired, or if anyone has questions about the area feel free to ask in channel and we'll help.
17:07:46 phew :) I knew I'd seen it
17:07:56 jroll: suggested accomodations would be great
17:08:08 Shrews: ++
17:08:11 JayF: ^
17:08:14 I'll be going to both, but that's not expected of anyone else
17:08:30 Shrews: I'll make a note to add those after the meeting. We have a couple of suggested hotels but anything near transit or in SoMA/Financial District will be close :)
17:09:21 #info both Grenoble and SF Bay area meetups are on. Details on the above link.
17:09:27 that's all for my announcements
17:09:48 #topic subteam status reports
17:09:49 nothing from /me this time
17:10:31 dtantsur: thanks for updating the bug stats
17:10:33 looks like a couple of the updates didn't make it to the white board this time
17:10:45 #link https://etherpad.openstack.org/p/IronicWhiteBoard
17:10:51 np, sorry for being inactive with bugs for some time
17:11:09 also, there're some bugs, that require deep though, I left them as NEW for now
17:11:16 dtantsur: you are allowed to take some PTO .... now an then
17:11:28 :)
17:11:33 NobodyCam: I cleared the whiteboard this time... some sections were feeling stale
17:11:45 wanyen: congrats on getting the nova spec landed. is there code up for that on both nova and ironic sides?
17:11:46 jroll: ++
17:11:51 jroll: ++
17:12:19 deva, yes. code changes to nova ironic virt driver is up for review.
17:12:30 wanyen: link?
17:12:41 let me find the link
17:13:21 NobodyCam, https://review.openstack.org/#/c/141012/
17:13:36 #link https://review.openstack.org/#/c/141012
17:13:36 jroll, jayf: I see the IPA's API breaking change mentioned here. has that landed now?
17:13:39 Ty Nisha
17:13:45 NobodyCam, https://review.openstack.org/141010
17:13:49 devananda: yes, it's landed
17:14:02 #link https://review.openstack.org/141010
17:14:16 devananda: yes, after some appreciated input and review from a few interested parties but the design stayed mostly the same
17:14:19 thanks for those who reviewed that
17:15:00 #info non-backwards-compatible change in ironic-python-agent API landed, as per discussion from prior week's meeting. thanks to all who provided feedback.
17:15:06 link for nova code change for passing flavor to ironic https://review.openstack.org/#/c/141012/
17:15:41 #info need more eyes on bugs - the backlog is growing, and many bug reports are getting stale // need to be re-triaged
17:15:56 ^^^ +100
17:16:26 last meeting, there was also general agreement that we should have a bug day
17:16:33 dtantsur: would you like to organize that?
17:16:53 dtantsur: do you have a list of the bug that require deeper thought?
17:17:00 no problem. e.g. tue and wed work for me
17:17:00 *bugs
17:17:10 NobodyCam, http://ironic-bugs.divius.net/ shows then as NEW
17:17:24 (maybe not all of them, but most)
17:17:30 dtantsur: Ack :)
17:18:00 dtantsur: ok, thanks much :)
17:18:04 WDYT about bug day on Wed some common time?
17:18:23 dtantsur: ++ works for me
17:18:34 I'll be offline most of wednesday. any other day this week wfm
17:19:03 then only Tue works for me. what about tomorrow?
17:19:17 I could also make that
17:19:20 e.g. the same time as this meeting?
17:19:27 is it ok for most folks?
17:19:36 I could make that
17:19:40 wfm
17:20:00 wfm
17:20:07 :) sweet
17:20:17 ++
17:20:23 #info bug day Tue, 12 Jan 17:00 UTC
17:20:28 correct?
17:20:33 * dtantsur has problems with time zones
17:20:37 it isn't the whole day, right?
17:20:39 think so
17:20:51 rloo: hopefully it wont take all day :)
17:20:59 dtantsur: looks correct
17:21:01 due to time zones, it can't be the whole day
17:21:15 let's move on - we're over 10 minutes for this section
17:21:24 ++
17:21:34 #topic current state machine
17:21:52 NobodyCam: I left this on the agenda since there are still open reviews - anything you'd like to discuss?
17:22:26 nope we are getting eyes on it
17:22:32 s/it/them/
17:22:41 great
17:22:51 ofc the more the merrier
17:22:52 #link https://review.openstack.org/#/q/status:open+project:openstack/ironic+topic:bp/new-ironic-state-machine,n,z
17:23:13 anyone have anything to discuss on this right now?
17:23:26 I'll give it another minute, then move on otherwise
17:24:02 :) we landed two last week. would love to land the rest this week
17:24:46 ok, moving on
17:24:55 #topic IPA Hardware Manager
17:25:10 oops, skipped one
17:25:12 #undo
17:25:13 Removing item from minutes:
17:25:16 this was discussed last week, it's been done
17:25:18 oh
17:25:19 #topic stable branch maintenance
17:25:33 this was discussed last week
17:25:57 folks present agreed that we aren't actively supporting Icehouse at this point, but that we should be
17:26:07 publishing our team's stable branch policy officially
17:26:16 and I have that on my TODO list (but not done yet)
17:26:41 just sharing the info - nothing to discuss here on my end
17:26:42 devananda: we came up with supprt for two cycles basicly
17:26:43 we *should* be supporting icehouse? ugh.
17:26:50 er
17:26:57 I mean we should be publishing our policy
17:27:05 rloo: not that we should be supporting icehouse
17:27:22 devananda: ok, cuz i think it would be hard to support icehouse
17:27:35 rloo: indeed. and I dont think we should be.
17:28:20 #info devananda to draft a formal policy of support for two cycles, starting with Juno, and publish to wiki
17:28:27 #topic IPA's hardware manager
17:29:01 This must be an agenda item that never got removed last week.
17:29:15 There shouldn't be a need for further discussion.
17:29:34 ack
17:29:43 skipping it then
17:29:52 #topic driver-specific periodic tasks
17:29:53 dtantsur: hi!
17:29:59 :)
17:30:20 just wanted to ask, if folks are ok with landing periodic tasks temporary using greenthread.spawn_n
17:30:40 and then once we get oslo.service work done for parallel periodic tasks, switch to it
17:30:48 (that may be pretty late in K cycle)
17:30:51 wdyt?
17:31:11 dtantsur: I agree with jroll's comment -- as long as that doesn't require changes in the drivers themselves, it should be OK
17:31:30 devananda: jroll: ++
17:31:54 devananda, you mean it does not break existing drivers? well, it does not
17:31:58 oh - not just "no change in driver API" -- I would prefer to see no change in drivers code, actually
17:32:05 right
17:32:20 so there should be some base function to register a peridic task for a driver
17:32:23 drivers use that
17:32:37 the implementation can be changed without drivers knowing
17:32:41 if we add an interface for drivers to register their periodic task, and then later we change how ConductorManager is creating or tracking those tasks
17:32:43 is how I see this working
17:32:45 yeah
17:32:48 that shouldn't require a driver author to change their code a second time
17:32:58 jroll: ++
17:33:21 not even adding 'parallel=True' to e.g. decorator signature? hmm...
17:34:11 dtantsur: that's optional, though, that should be fine
17:34:33 right?
17:34:34 yea I think a change of that nature would be ok
17:34:42 thats just a bit flip
17:34:50 the issue is that we'll probably default to non-parallel periodic tasks, so the worst thing to happen is that they become non-parallel after this switch.
17:35:04 and will require adding 'parallel=True' (or something) to signature
17:35:12 is it an issue?
17:35:34 why default to non-parallel?
17:35:36 dtantsur: you'll first add with parallel = false then just flip it?
17:36:06 jroll, it's how it is done right now..
17:36:17 to preserve current behavior?
17:36:26 ok
17:36:28 just asking :)
17:36:29 I think that's fine
17:36:36 well, ok, we can start with having noop parallel=True in these tasks, and then it'll become a proper implementation.
17:36:39 I agree
17:36:42 I think I understand now
17:36:48 if the first pass of this functionality doesn't support parallel tasks
17:37:09 then we shouldn't be publishing an interface which makes it appear that it does
17:37:23 either don't expose a "parallel" parameter, or error if =True is passed
17:37:26 for now
17:37:43 I'm fine with *adding* features to it when we switch to oslo
17:37:52 but don't *require* a driver author to change their code at that point
17:37:53 sorry just joined... +1 for preserve current behavior. How is the work in oslo going? was it accepted to have parallel periodic tasks there?
17:38:16 I was putting it wrong. we can for now simulate parallel=True with spawn_n, that's what I wanted to say
17:38:22 so drivers won't be changed
17:38:26 sorry for the confusion
17:38:26 ah, ok
17:38:31 oh
17:38:44 lucasagomes, it's blocked by oslo.service graduation
17:38:52 and it's not moving fast
17:38:59 dtantsur: can we simulate parallel=False with spawn_n?
17:39:02 hmm I see
17:39:11 jroll, yes, by not using it :)
17:39:27 just call the function directly - and you get parallel = False
17:39:34 dtantsur: oh, right
17:39:39 ok, so why not support both?
17:39:49 and when we switch to oslo that will also support both?
17:39:50 now I realize that we should, yes
17:39:52 ok
17:39:53 cool
17:39:55 +1
17:39:59 I was somewhat confused myself :)
17:40:04 thanks all for the feedback!
17:40:09 :)
17:40:44 dtantsur: sounds good to me
17:40:53 anything else on this topic?
17:41:04 nothing from me
17:41:05 +1 supporting both seems good
17:41:14 * lucasagomes catchs up with the agenda
17:41:23 #topic open discussion
17:41:32 wow - almost 20 minutes left for open discussion today! ;)
17:41:34 rameshg87: go
17:41:37 NobodyCam, yes
17:41:43 :)
17:41:52 this has been an exceptionally productive meeting :D
17:42:06 all, i had mentioned a spec to share boot images for ilo drivers https://review.openstack.org/#/c/137291/
17:42:16 NobodyCam: to your question on the grenoble meetup, I have not had as many signups on eventbrite as I would hope for
17:42:30 NobodyCam: but there are several folks who have msg'd me to confirm but not yet signed up, too ...
17:42:33 I started a conversation in channel just before the meeting about the ref_counter on https://review.openstack.org/#/c/137291/8/specs/kilo/ilo-drivers-share-boot-images.rst
17:42:46 instances using same images will have the same kernel/ramdisk and hence can share the same boot iso
17:42:59 devananda: ack :)
17:43:25 to track how many instances are currently using the image, i am using a ref counter in Swift and it's modification in Ironic is protected by a lock
17:43:52 my concern was that ref_counter could get out of sync
17:44:32 and with the iso name being a hashed value a operator would nover know the image was not needed
17:44:37 so my biggest issue here is this. say changing the ref counter fails or gets out of sync. 1) how does an operator detect this? 2) how does an operator fix this? an operator can't get the same lock the conductor uses, except maybe twiddling the db.
17:44:59 3) what happens if ref counter goes to 0, ironic cleans up, something is still using it
17:45:27 oh I hadn't even gotten to think of #3
17:45:38 yeah
17:45:43 and it could happen pretty easily
17:46:06 rameshg87: I'd like to understand the use-case here. it doesn't seem like the added complexity is worth it for saving a small amount of space in Swift
17:46:12 devananda, mentioned enabling deduplication in swift, if the space is what is concerned here
17:46:23 decrement succeeds, http response is dropped, ironic thinks decrement fails, either fails the tear down or decrements again
17:46:26 I think it worth investigation, generating the ISO with the same content result in the same output?
17:46:40 if so, deduplication would be the right way to do it IMO too
17:46:43 devananda, yeah space was the concern here .. if 100 instances were using the same image, you ideally need to store only one
17:47:09 rameshg87: how large is the image? 50 MB ?
17:47:26 lucasagomes, yeah we can safely assume
17:47:34 sorry i meant to devananda :)
17:48:01 np :)
17:48:19 lucasagomes, i am not sure if generating the same iso image twice will result in exactly same copy, thought it should :)
17:48:27 saving a few GB of space in a petabyte-sized swift cluster doesn't seem like a reason to do anything
17:48:49 so 50m * 100 instances is not that much space in todays world of multi TB drives
17:49:20 rameshg87: disk storage is cheap. why the concern over saving a few GB ?
17:49:21 * jroll wants to share devananda's petabyte cluster with him
17:49:24 rameshg87, right, we could investigate that... And if that's the case we could include in our documentation about how to enable deduplication in swift (or a link to one)
17:50:01 devananda, lucasagomes, yeah may be i could check on deduplication
17:50:02 lucasagomes: ++ fpr letting swift handle that
17:50:15 *for even
17:50:50 devananda, probably i will check and get back then ..
17:51:25 rameshg87: awesome Thank you
17:51:31 sounds good, thank you rameshg87
17:51:43 thanks everyone :)
17:52:27 any other OD topics?
17:52:31 lucasagomes, wanna discuss MANAGE[D]?
17:52:39 oh, yeah that would be good
17:52:42 ah yea
17:52:45 if nobody else has any other topic
17:52:48 * lucasagomes gets the link
17:53:02 #link https://review.openstack.org/#/c/146452/
17:53:18 so it's about making the states name in the state machine more consistent
17:53:49 +1
17:53:59 in the spec we say that we use states names endiging with -ED and -ING for active states (states that transition from other state without an api call)
17:54:01 though there's no MANAGING or MANAGED state, right?
17:54:09 the only exception is the MANAGED state with is a passive state
17:54:21 devananda, no
17:54:29 cause it goes from INSPECTING to MANAGED
17:54:35 I suggest MANAGEABLE instead :)
17:54:36 or ENROLL -> MANAGED
17:55:38 so to avoid confusion, we don't want passive states to end in 'ED', right?
17:55:38 oh, on a related topic, I'd like to s/NOSTATE/AVAILABLE/ this week
17:55:46 so, what you guys think? Should we rename it to MANAGE or MANAGEABLE or ...
17:55:57 rloo, yeah
17:56:21 I like manage
17:56:26 +1 for MANAGEABLE :)
17:56:26 devananda, +1 for the AVAILABLE, NOSTATE is complicated...
17:56:52 NobodyCam, re MANAGE, I feel a bit strange using verbs for static state
17:56:54 specially because the value of it is None in the states.py :)
17:56:56 manageable seem out of place with the other states to me
17:56:56 lucasagomes: yea. without breaking the API. I may want some help from you
17:57:06 devananda: wrt NOSTATE->AVAILABLE, does that break backwards compatibility?
17:57:06 deva, +1 for available
17:57:09 lucasagomes: the change needs to work with current Nova driver
17:57:12 NobodyCam, AVAILABLE looks similar to me
17:57:17 devananda, +1 cool yeah no problem we can work on that
17:57:25 rloo: it needs to NOT break compat. that's the challenge
17:57:31 dtantsur: ah good point
17:57:50 devananda: good if it doesn't break compat!
17:57:53 for MANAGED[D], what about UNAVAILABLE ?
17:58:29 devananda, sounds like an error...
17:58:32 ?
17:58:34 right. never mind
17:58:45 two minutes
17:58:46 yeah sounds like it's off or something
17:59:02 so we go from ENROLL to XYZ (which allows one to do ...) to AVAILABLE
17:59:31 I think MANAGEABLE is good, but we can iron out the name after the meeting too. If we agree here that we will change MANAGED its good already
17:59:38 rloo: both MANAGE and RESCUE are verbs, which I think is where your objection stems from?
17:59:47 managed means we can talk to the node?
18:00:00 manageable sounds fine
18:00:04 devananda: just wondering if there is another word than MANAGE. But yeah, RESCUE is a bit odd too.
18:00:08 rloo, yeah that's how I understand it
18:00:15 credentials are set and all
18:00:20 lucasagomes: I agree there's room for improvement in the words, until we actually implement them, at which point, changes will break compat
18:00:28 ok - we're out of time
18:00:31 thanks all!
18:00:36 thanks!
18:00:38 #endmeeting
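
Editor's sketch 1 (driver-specific periodic tasks, 17:30-17:40 above): the idea discussed is a registration interface that drivers only touch once, while the conductor is free to swap the implementation (greenthread.spawn_n today, oslo.service later). The sketch below is only an illustration of that shape under assumed names - "driver_periodic_task", the "parallel" flag, and the "spacing" keyword are not taken from the patch under review.

```python
# Minimal sketch, assuming eventlet is available; names here are
# illustrative, not the merged Ironic API.
import functools

from eventlet import greenthread


def driver_periodic_task(parallel=False, spacing=60):
    """Mark a driver method as a periodic task.

    parallel=True simulates parallelism by running the task in its own
    green thread so a slow task cannot block the conductor's periodic
    loop; parallel=False calls the function inline, which preserves the
    current (non-parallel) behaviour. Drivers never need to change when
    the underlying mechanism is replaced.
    """
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            if parallel:
                greenthread.spawn_n(func, *args, **kwargs)
            else:
                func(*args, **kwargs)
        # Attributes the conductor could use to discover and schedule the
        # task; how they are tracked can change without touching drivers.
        wrapper._is_driver_periodic_task = True
        wrapper._periodic_spacing = spacing
        return wrapper
    return decorator


class ExampleVendorInterface(object):
    @driver_periodic_task(parallel=True, spacing=120)
    def _check_hardware_state(self, manager, context):
        # Hypothetical driver task; runs in its own green thread.
        pass
```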
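
Editor's sketch 2 (the ref_counter concern from the iLO boot-image discussion, 17:42-17:50 above): if the counter is kept as metadata on the shared ISO object in Swift, every update is a read-modify-write over HTTP, which is where the "out of sync" worry comes from. This snippet only illustrates that failure mode with python-swiftclient calls; the container name, object name, and metadata key are made up, and the spec under review may store or lock the counter differently.

```python
# Illustration of why a metadata-based ref counter is fragile; error
# handling is deliberately elided to keep the race visible.
from swiftclient import client as swift_client


def adjust_ref_count(conn, container, obj, delta):
    """Read-modify-write a reference counter kept in object metadata.

    Between head_object() and post_object() another conductor could bump
    the same counter, and if the POST response is lost Ironic cannot tell
    whether the new value was applied - retrying would double-count.
    If the count wrongly reaches 0, the shared boot ISO gets cleaned up
    while an instance may still be using it.
    """
    headers = conn.head_object(container, obj)
    count = int(headers.get('x-object-meta-ref-count', 0)) + delta
    conn.post_object(container, obj,
                     headers={'x-object-meta-ref-count': str(count)})
    return count


# conn = swift_client.Connection(authurl=..., user=..., key=...)
# adjust_ref_count(conn, 'ilo-boot-isos', '<hashed-iso-name>', +1)
```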
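
Editor's sketch 3 (the s/NOSTATE/AVAILABLE/ question, 17:55-18:00 above): the constraint raised is that the rename must keep working with the already-released Nova ironic driver, which treats a provision_state of None (NOSTATE) as "node is free". One hedged way to do that is to use the new name internally and translate it back to the old wire value for clients that predate the rename; everything below, including where the translation happens, is an assumption about how compat could be kept, not the approach that was actually merged.

```python
# Sketch only: constants mirror ironic's states.py convention, but the
# compatibility shim and its call site are hypothetical.

NOSTATE = None            # historical value still expected by old clients
AVAILABLE = 'available'   # proposed new name for the same passive state

# States that pre-rename clients (e.g. the released Nova ironic driver)
# only know by their old value.
_PRE_RENAME_MAP = {
    AVAILABLE: NOSTATE,
}


def provision_state_for_client(state, client_supports_available=False):
    """Return the provision_state value to expose to a given client.

    New clients see AVAILABLE; old ones keep seeing None, so their
    "provision_state is None means the node is free" logic keeps working
    during the transition.
    """
    if client_supports_available:
        return state
    return _PRE_RENAME_MAP.get(state, state)
```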