05:00:33 #startmeeting ironic 05:00:34 Meeting started Tue Jan 6 05:00:33 2015 UTC and is due to finish in 60 minutes. The chair is devananda. Information about MeetBot at http://wiki.debian.org/MeetBot. 05:00:35 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 05:00:37 The meeting name has been set to 'ironic' 05:00:38 hyn all 05:00:43 \o 05:01:06 happy new year, and good evening/morning/afternoon/other time of day :) 05:01:24 :-p 05:01:27 as usual, our agenda is up on the wiki here: https://wiki.openstack.org/wiki/Meetings/Ironic 05:02:00 apologies in advance if I'm typing slower than usual, it's late for me -- but I'm glad to see a couple folks that I don't usually see :) 05:02:26 #chair NobodyCam 05:02:27 Current chairs: NobodyCam devananda 05:02:29 #topic announcements 05:02:43 * NobodyCam will also be slow 05:03:09 only announcement for me is just a reminder to folks 05:03:33 that I've posted details for an early February meetup / sprint to the mailing list 05:03:44 during the break, so I wanted to draw attention to it in case anyone missed it 05:03:49 #link http://lists.openstack.org/pipermail/openstack-dev/2014-December/053617.html 05:04:18 and some further thoughts, since it seems like a lot of US folks may not make it, are posted here 05:04:25 #link http://lists.openstack.org/pipermail/openstack-dev/2014-December/053618.html 05:04:46 are their any details on the SF meetup? like where should I look for a hotel? 05:05:01 remtoe dial-in is available? 05:05:24 NobodyCam: no. TBD. I just floated the idea and it seems people liked it, so need to arrange that this week 05:05:39 lol ack :) 05:05:43 wanyen: it will be a developer sprint, not a planning meeting 05:06:28 devananda: NobodyCam: fwiw, we have plenty of room at our office in soma 05:06:50 I think russell even booked a room, plus we have this big open area we could hack at 05:06:58 free to use it if you'd like 05:07:05 jroll: let's iron out the details of that so we can announce at next week's (more US-time-friendly) meeting 05:07:12 devananda: yep, just a heads up :) 05:07:17 jroll: ack. ty 05:07:21 devananda: If I had an issue to be discussed with core members, it it possible to attend SF meeting via WebEX or something? 05:07:21 jroll: with a white board or two too? 05:07:44 NobodyCam: yessir 05:07:53 and TVs for code pairing 05:08:15 devananda: I can provide WebEX. 05:08:48 naohirot: I'm trying specifically to avoid having the sprint be(come) a planning meeting 05:09:18 naohirot: so if there are design issues, we should try to discuss those via IRC, email, etherpad, etc -- all the usual means 05:09:35 deva, what will be covered in the sprint? 05:10:07 devananda: Yeah, Okay. What does "sprint" mean? 05:10:08 wanyen: it's a sprint - who ever is there will work on writing code for the open specs. or something like that, I hope :) 05:10:18 write all the code. 05:10:23 hack the planet 05:10:28 jroll: I see. 05:10:28 It's much less interesting than I suspect you all expect it will be :) 05:10:37 ah, i see. I assume everyone knows what a "code sprint" means 05:11:18 let's move on for now, and come back to this if there's time at the end 05:11:23 #topic subteam status reports 05:11:25 +1 05:11:35 #link https://etherpad.openstack.org/p/IronicWhiteBoard 05:11:55 hm, bug stats look quite old 05:12:09 I count 19 NEW bugs right now :( 05:12:20 "dtantsur on PTO/holidays, back on Jan 5th" 05:12:25 yea... 05:12:32 he was in meetings all day today, probably didn't catch a break to count 05:12:34 * devananda makes a note to go do bug triage 05:12:36 devananda: yes, I raise some bugs during my testing 05:13:16 devananda: I see some old but high bugs, are these still import for Ironic? 05:13:42 lintan: I suspect they are miscategorized 05:13:56 ieek about 20 + NEW bugs 05:14:08 launchpad doesn't separate "impact" from "urgency" 05:14:15 at least people are filing them :) 05:14:16 so we end up with bugs that are high impact but low urgency 05:14:29 s/urgency/priority/ 05:14:39 we should poke dtantsur in the morning and see if he wants help, maybe a bug day in the next couple weeks 05:15:14 re-visiting all the bugs and re-assessing them would be good 05:15:23 yeah 05:15:28 probably some stale, definitely a lot of new ones 05:15:58 wanyen: any updates on third-party CI ? 05:16:32 #info it looks like we still need to get reviews on the iRMC and AMT drivers 05:16:58 deva: tried to setup 3rd-party CI but found out that we need more hardware. 05:16:59 devananda: Yes, please 05:17:14 #info many new bugs, and lots of stale bugs. we should clean this up before kilo-3 (at the latest) 05:17:14 devananda: code, spec's or both 05:17:23 NobodyCam: code, it looks like 05:17:36 iirc iRMC has a pending spec 05:17:46 at least one 05:18:07 AMT driver has both.... 05:18:08 https://review.openstack.org/#/c/134865/ yeah; I've been looking at this. Others should too. 05:18:27 wanyen: hm, I see 05:18:52 NobodyCam: ok, both it is. 05:19:00 :p 05:19:10 any other status updates? 05:19:46 I have a patch for IPA we should talk about; but I called it out specifically on the agenda. Other than that nothing notable for IPA 05:20:00 JayF: ack, I'll make sure we get to it 05:20:01 JayF: I appreciated your view :) 05:20:35 ok, thanks everyone for keeping the status page up to date! 05:20:38 moving on ... 05:20:47 #topic new state machine code reviews 05:20:54 NobodyCam: that's you. well, and me ... 05:21:05 NobodyCam: but you put it on the agenda :p 05:21:46 I was looking to get eyes on the state machine reviews 05:22:03 * jroll makes a note to review those this week 05:22:11 NobodyCam: what's the topic for those? 05:22:25 I must admit I forgot to look them over today with all the catch up 05:22:30 #link https://review.openstack.org/#/q/status:open+project:openstack/ironic+branch:master+topic:bp/new-ironic-state-machine,n,z 05:22:32 jroll: it's linked from agenda 05:22:35 thanks 05:22:45 NobodyCam: I haven't posted a new rev since the break 05:22:45 oh, maybe I should look at the agenda then :P 05:22:54 * jroll thought he was just here for moral support :P 05:23:23 short version - these are laying the foundation for the Finite State Machine work that we need to move forward 05:23:29 I will look them over first thing tomorrow over coffee 05:23:34 but so far, there have been very few reviews on the code I wrote 05:23:53 mostly rloo and NobodyCam, with a few from Shrews 05:24:15 it would be good to have more eyes on it, since it is refactoring some central parts of the ConductorManager 05:24:33 ++ 05:24:42 devananda: In case of iRMC deploy, is it enough to follow the implementation of https://review.openstack.org/#/c/140883/? 05:25:30 naohirot: excellent question -- this is one reason I'd really like to get more eyes on it, and hopefully land that 05:25:40 devananda: I'm wondering how new state machine affects iRMC deploy implementation. 05:26:25 naohirot: in my opinion, yes, but other cores must approve of it as well 05:26:51 devananda: Yes, of course 05:27:02 ok. moving on because of time 05:27:05 approve of using process_event() in iRMC? or approve of iRMC following that patch in the chain? 05:27:46 jroll: the implementation pattern. my patches are affecting drivers 05:27:52 ok, yeah 05:27:55 jroll: so naturally anyone who is writing a driver now is affected ... 05:28:01 #topic stable branch maintenance 05:28:07 #link http://lists.openstack.org/pipermail/openstack-dev/2014-December/053366.html 05:28:07 naohirot: assuming that patch goes through as-is, yes, it will follow 05:28:14 I posted this question to the list last month 05:28:35 should we officially say we're no longer supporting icehouse? 05:28:38 +1 05:28:43 jroll: Okay, I got it, thank! 05:28:48 :) 05:29:05 and how long should we commit to supporting juno? 05:29:10 but didn't get any replies on the ML 05:29:57 opinions now, or discuss on the ML ? 05:30:12 I have seen question in channel about Icehouse support at least a few days before the break 05:30:18 with my trunk-chaser hat on, I don't care much for stable maintenance... I think we should support juno for two cycles like other projects do 05:30:28 icehouse seems fairly useless to support 05:31:07 jroll: so juno until "L" is released? 05:31:25 devananda: I guess so, yeah, that's what other integrated projects do, yes? 05:31:39 also, fwiw, I think we need adam_g to weigh in on this, as he's been doing the lion's share of stable maint for us so far 05:31:47 and it'd be great to have >1 person doing that 05:31:48 indeed 05:32:17 jroll: yea, generally I think it's 2 cycles after a release 05:32:30 so that distros and users have a reasonable migration window 05:32:41 where reasonable is somehow defined as "one year" 05:33:11 right 05:33:21 I don't really understand it, but willing to do the right thing :) 05:33:33 jroll: apt-get install 05:33:35 jroll: that's why 05:33:44 i would agree with where reasonable is somehow defined as "one year" 05:33:57 devananda: I know that, I still don't understand :P 05:34:15 jroll: heh :) 05:34:28 I think if distros want Juno maintained for a year, they should help do it :) 05:34:38 lb -s /bin/apt-get /usr/bin/pip 05:34:44 ln not lb 05:34:46 JayF: they do, to be fair 05:35:03 I know; it's just I wish we were able to spend all our time on the future instead of the past :) 05:35:49 jroll: hm, but to be precise, they aren't doing stable maint for ironic, afaik 05:36:01 anyway 05:36:38 I can draft something more formal-like and sjhare at the next meeting 05:37:04 at least no one seems to have initially objected :) 05:37:09 moving on .. 05:37:14 #topic AMT spec 05:37:19 lintan: hi! 05:37:32 Hi all guys 05:37:51 I need your opinions about the design 05:38:18 One thing is to discuss about where to put amt_boot_device 05:38:26 in driver_info or extra 05:39:11 If it's driver specific, wouldn't it belong in driver_info? 05:39:24 it shouldn't be exposed to users via either JSON field, though 05:40:11 lintan: i'm not sure why it needs to be stored at all 05:40:36 AMT/vPro only accept the first boot device and ignore the rest if we send multiple _set_boot_device_order requests to AMT nodes. 05:40:53 devananda: AMT doesn't support persistent=true, and doesn't have a "get boot device" command, AIUI 05:40:58 lintan: the hardware ignores repeated requests? 05:41:07 jroll: right. I think that can be worked around 05:41:07 yes 05:41:23 but if the hardware only accepts the FIRST one 05:41:31 then we can't change it before the next reboot 05:41:33 that's a problem 05:41:36 lintan: is ^ what you mean? 05:42:09 devananda:I mean it doesn't support 05:42:18 I don't see that as a problem, when do you need to change it twice between boots? 05:42:35 my question is, if you do "set boot device pxe", and then "set boot device pxe", does it pxe boot? 05:43:09 jroll: it does pxe boot in your case 05:43:26 jroll: if I do "set boot device hdd" then "set boot device pxe" then "reboot" -- which one does it boot? 05:43:33 oops, lintan ^ 05:43:37 devananda: why would you do that 05:44:04 as long as this works: "set boot device hdd" then "reboot" then "set boot device pxe" then "reboot" 05:44:06 should be fine 05:44:36 * devananda waits for lintan's answer before stating why that would bork things 05:44:43 jroll: but other drives support "set boot device hdd" then "set boot device pxe" then "reboot" 05:45:04 when/why do we do that? 05:45:12 our API exposes set-boot-device, so if a user (for what ever reason) issues such a command manually, it would, i 05:45:12 why would you set bootdev to hdd if you're not going to boot from hdd 05:45:15 oh. 05:45:17 that. 05:45:20 yep 05:45:45 jroll: another critical concern is for persistent boot 05:45:47 which this whole "remember the boot device" doesn't help with 05:46:02 in pxe boot processing, we have two pxe boot 05:46:06 lintan: that's a different issue that can be solved generically, I think 05:46:46 lintan: I'm sad that AMT hardware doesn't support changing this option multiple times 05:47:21 another point to make here: Haomeng managed to find ipmi hardware that doesn't support persistent=true 05:47:31 lintan: can it be worked around in the hardware somehow, eg. by issuing another command just before hand to "erase" a previous request? 05:47:33 jroll: YES 05:47:34 which means the latter problem isn't just an AMT problem 05:47:43 oh :( 05:47:48 ok then 05:47:50 jroll: for some hardware, it ignore persistent=true 05:47:57 that's awesome 05:48:00 Haomeng: right 05:48:02 so awesome 05:48:09 for most case it should work right ? 05:48:12 on the plus side, it forces us to solve this generically 05:48:21 most isn't good enough, unfortunately 05:48:34 ok - lintan, can you propose that as a separate change? 05:48:51 for persistent issue? 05:48:51 jroll: I just tested with two machine, not working with pxe set set bootdev persistent=true 05:48:56 I think we'll need a new table to store "requested but not applied changes" 05:49:07 or "persistent things we need to set every time" 05:49:08 or something 05:49:12 Haomeng: yeah, I believe you. hardware is bad. 05:49:24 jroll: :) 05:49:35 jroll: maybe:) 05:50:15 devananda: uggh :( 05:50:15 lintan: I think the general approach you have is fine, but this shouldn't be saved in a JSON field like driver_info or extra 05:50:27 lintan: and it needs to be available for other drivers to leverage as well 05:50:29 Haomeng, is this the problem for bios or uefi mode? 05:50:31 ok, so someone (maybe lintan) should propose a spec to add a table or something to deal with this 05:50:41 jroll: I understand some hardware does not implement all ipmi stand to support more options such as persistent=true 05:50:45 OK, I am willing to do that 05:50:53 wanyen: bios mode 05:51:12 Haomeng, I see. 05:51:19 #agreed we need a generic way to store a user-requested persistent boot device settign which has not been applied yet, and then only apply it during the reboot phase 05:51:45 #note this issue affects some IPMI-based hardware as well as AMT hardware 05:51:45 I have another issue for AMT 05:51:48 do we have time for JayF? 05:51:50 mmm 05:52:00 devananda, do we expect the user to reboot through Ironic only every time ? 05:52:05 lintan: what's the issue 05:52:08 lintan: we're almost out of time - can you be quick? 05:52:12 During PXE deploy processing, the target machine will boot by itself. 05:52:13 Not having time for me is not an option after staying up :) 05:52:30 deva,for hardware that does support persistent boot then no need to use the table. So the use of table is optional. right? 05:52:43 rameshg87: that's an interesting question, I don't think we can expect that, unfortunately :( 05:52:48 AMT Driver has to call ensure_next_boot_device again in_continue_deploy(). 05:53:02 wanyen: I think that's irrelevant 05:53:02 wanyen: dunno. we'll discuss that on the relevant spec, when it's proposed 05:53:08 jroll, and ironic cli doesn't have an option for reboot or soft-reboot 05:53:23 rameshg87: node-set-power-state reboot (also nova reboot) 05:53:34 jroll, oh :D 05:53:46 ok - I am going to need to cap this so we can get to JayF 05:53:46 devananda: another one -should ironic support force-delete to follow nova commad? 05:53:48 jroll, soft reboot is missing afaik 05:53:51 rameshg87: though both are hard power off, power on 05:53:53 yeah 05:54:00 I'd love to add this 05:54:06 * jroll wants to #topic 05:54:07 jroll, +1 me too 05:54:12 #note need to discuss this more 05:54:17 #topic breakign change for IPA 05:54:20 #link http://lists.openstack.org/pipermail/openstack-dev/2014-December/053662.html 05:54:26 jroll: you want soft poweroff? 05:54:29 JayF: gogogo 05:54:36 NobodyCam: yes 05:55:14 So tl;dr: when looking at splitting up our hardware manager into smaller, more sharable pieces, I realized our hardware manager loading didn't work the wqy I expected it to -- or the way josh and I modeled it at the summit 05:55:31 so I proposed https://review.openstack.org/#/c/143193 to allow for multiple simultaneous hardware managers 05:55:42 the only downside is this would be a breaking-api change for any out of tree hardware managers 05:55:58 Of which; the only one I know exists today is the one I maintain, heh 05:56:12 So I wanted to generally get visibility on this and a general "OK" from the group for doing this 05:56:37 JayF, very soon i would like to use this - for raid configuration 05:56:46 #note If you are using Ironic-Python-Agent with an out-of-tree hardware manager, please respond to JayF's email (linked above) 05:57:08 Also reviews very much appreciated; particularly by those who may want to consume this interface later 05:57:09 I've voiced my opinion to JayF elsewhere, but again here: posted to mailing list, no response, should be ok 05:57:27 rameshg87: is there any reason you will not submit your hardware manager upstream? 05:57:49 devananda, there isn't. i will want to submit it to upstream :) 05:57:55 devananda: you think highly hardware specific managers should go upstream? 05:58:14 JayF: yes 05:58:38 devananda: what about in cases (almost all, in cases I've seen) that they require a proprietary tool to work? 05:58:41 I tend to agree 05:58:51 JayF: if Ironic supports a given vendor's hardware, IPA ought to as well ... 05:58:53 that's always been what has slowed me down when looking at upstreaming things 05:58:55 JayF, adding ironic-agent element in dib solves this 05:58:57 but when it comes to firmware things, that will almost certainly be downstram in most cases 05:59:16 though you could just say "requires crappy-vendor-tool.sh" 05:59:17 JayF, i can build a ironic-python-agent ramdisk by including my custom element 05:59:38 rameshg87: sure; that's what I do now; but should we ship code that doesn't work without a custom element that we probably couldn't even open source 05:59:43 if the separation is clean -- custom element is just "include proprietary utility.sh" 05:59:54 and the interface to taht is in IPA -- I think it's good 05:59:57 rameshg87: it's also straightforward to build IPA with a custom manager without DIB 06:00:07 JayF: yes. we do that in Ironic today. 06:00:13 JayF: look at all the third-party drivers 06:00:14 JayF, iLO teamis interested in adding Proliant hardware manager to IPA 06:00:32 devananda: so you're ok with IPA shelling out to "utility.sh", where utility.sh is unspecified? 06:00:42 wanyen: rameshg87: The big thing this change (Which you should apparently review) that matters is that you can do small HardwareManager pieces 06:00:47 jroll: or is specified by the hardware manager 06:01:01 i.e. I'd rather see 5 hardware managers for 5 components than a single, large hardware manager targetted at specific hardware mixes 06:01:03 but that's just my vision :) 06:01:07 JayF, sure i will take a look asap 06:01:11 JayF: ++ 06:01:17 Thanks all 06:01:18 devananda: sure, but is quanta-modelxyz-hwmanager ok? this could grow extremely large 06:01:27 and keep in mind everything in tree bloats the ramdisk 06:01:36 I'm not saying we shouldn't, it's just a somewhat hard proiblem to solve 06:02:01 it shouldn't install all the driver's req's all teh time -- that does need to be configurable in some way 06:02:13 or -- on large platforms, maybe that's fine 06:02:16 we're over time 06:02:20 anyway, we're over time 06:02:26 right 06:02:27 JayF, the bottomline is to allow vendor to add hardware management functionalitites. We will take a look at your proposal. 06:02:32 Good meeting, thanks all, see you tomorrow 06:02:35 ok, thanks everyone :) 06:02:39 please go check out the change / respond on the ML if you're interested in IPA and hardware managers 06:02:41 ty all 06:02:46 ok 06:02:47 thanks all! see you next time 06:02:55 see you 06:02:56 see you 06:02:59 #endmeeting