17:00:17 #startmeeting ironic 17:00:18 Meeting started Mon Apr 6 17:00:17 2015 UTC and is due to finish in 60 minutes. The chair is devananda. Information about MeetBot at http://wiki.debian.org/MeetBot. 17:00:19 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 17:00:22 The meeting name has been set to 'ironic' 17:00:30 #chair NobodyCam 17:00:31 Current chairs: NobodyCam devananda 17:00:36 hi folks! 17:00:40 mornign 17:00:40 heya 17:00:42 mornin' 17:00:43 :-p 17:00:53 o/ 17:01:08 I expect several folks are still on holiday 17:01:13 o/ 17:01:21 o/ 17:01:29 o/ 17:01:31 o/ 17:01:43 as usual, our agenda is posted on the wiki - https://wiki.openstack.org/wiki/Meetings/Ironic 17:01:53 o/ 17:01:56 mostly today though, we'll be talking about the release status 17:02:14 couple quick announcements / reminders 17:02:17 o/ 17:02:20 #topic announcements / reminders 17:02:28 - feature freeze is still in effect 17:02:51 - my goal is to tag a release candidate this week, as soon as reasonably possible 17:03:17 I'll also be taking a quick holiday thursday and friday, so we kinda need to do it before then :) 17:04:15 also, summit planning for all projects has started 17:04:17 #link https://wiki.openstack.org/wiki/Design_Summit/Planning 17:04:36 (posting that just to make sure everyone's aware) 17:04:51 that's it for me -- anyone else? 17:05:04 Drivers page (http://stackalytics.com/report/driverlog?project_id=openstack%2Fironic) needs to be updated so we have an accurate listing 17:05:20 will open L spec folder after theats cut? 17:05:30 s/will/we'll/ 17:05:34 oh wait, not quite done - I tagged a 0.5.0 release of python-ironicclient earlier today. I still need to send an announce to the ML (will do after this meeting) 17:05:46 NobodyCam: yes, we can open L for both specs and code once the RC is tagged 17:05:56 :) 17:06:02 woot 17:06:09 devananda: I have opening L tagged on my calendar 17:06:17 #info python-ironicclient 0.5.0 tagged today 17:06:35 devananda: is there a change log for it? 17:06:45 NobodyCam: yah. look at the pip 17:07:00 ack :) 17:07:04 hm. or the tag here ... 17:07:24 #link http://git.openstack.org/cgit/openstack/python-ironicclient/tag/?id=0.5.0 17:07:39 ahh TY :) 17:08:00 #topic subteam status reports 17:08:37 I see a couple notes from IPA and iLO teams that their docs are up and/or need reviews 17:08:45 oh cleaning is now enabled? 17:08:52 and I see a note from dmitry that discoverd has devstack support 17:08:55 NobodyCam: yah 17:09:14 :) w00 h00 17:09:37 devananda: that was from lastweek but I left it there for this meeting too 17:09:42 NobodyCam: ah, ok 17:10:28 ok, moving on in a minute ... 17:10:39 devananda: By docs do you mean specs? 17:10:46 For Liberty? 17:10:48 I've also seen many DOC patches this last week... Awesome !! keep'um comming 17:10:50 jlvillal: no, i mean documentation 17:10:51 jlvillal: docs. 17:10:55 devananda: Thanks 17:11:11 jlvillal: subteam status reports are at https://etherpad.openstack.org/p/IronicWhiteBoard 17:11:17 if you want to follow along 17:11:22 jroll: Bookmarking 17:11:33 (among a ton of other useful stuff) 17:11:53 #topic Release Candidate Status 17:12:01 #link https://launchpad.net/ironic/+milestone/kilo-rc1 17:12:19 that launchpad page is the canonical reference (no pun intended) to the status 17:12:37 as this week progresses, either we fix things or we discuss accepting that a bug just won't be fixed in Kilo 17:12:39 heh 17:12:40 and I untarget it 17:13:03 BadCub & others have been keeping track of things in flight on a whiteboard here 17:13:07 #link https://etherpad.openstack.org/p/IronicReviewDay 17:14:03 we still have a lot of bugs merely in confirmed / triaged status (iow, no one actively working on them) 17:14:03 if we have a review that we like eyes on should it be brought up here? 17:14:32 I'm inclined to simply defer those bugs now, unless someone wants to take them on today 17:14:38 https://bugs.launchpad.net/bugs/1433727 17:14:39 Launchpad bug 1433727 in Ironic "partial upgrade not possible, 'reason': u'Unknown argument: "configdrive" (HTTP 400)'" [High,Confirmed] - Assigned to Adam Gandelman (gandelman-a) 17:14:49 https://bugs.launchpad.net/bugs/1301279 17:14:50 Launchpad bug 1301279 in OpenStack Compute (nova) "Changing node's properties in Ironic after node is deployed will count as available resources in Nova" [Medium,Confirmed] - Assigned to Nha Pham (phqnha) 17:15:01 https://bugs.launchpad.net/bugs/1412561 17:15:02 Launchpad bug 1412561 in Ironic "IPA/DIB is not compatible to create the uefi capable images" [Medium,Confirmed] 17:15:07 https://bugs.launchpad.net/bugs/1177655 17:15:08 Launchpad bug 1177655 in Ironic "kernel boot command line for baremetal assumes block device UUID is correct root" [Medium,Triaged] 17:15:12 https://bugs.launchpad.net/bugs/1322179 17:15:14 Launchpad bug 1322179 in Ironic "Add check on total disk size before deploying" [Medium,Triaged] - Assigned to Sandhya Ganapathy (sandhya-ganapathy) 17:15:17 https://bugs.launchpad.net/bugs/1327260 17:15:19 Launchpad bug 1327260 in Ironic "Disk layout is not being verified on re-provisioning" [Medium,Triaged] 17:15:23 https://bugs.launchpad.net/bugs/1407820 17:15:24 Launchpad bug 1407820 in Ironic "server will not boot from pxe if it does not support command 'chassis bootdev pxe ' with 'persistent' option" [Medium,Triaged] 17:15:35 https://bugs.launchpad.net/bugs/1326269 17:15:37 Launchpad bug 1326269 in Ironic "Developer docs should include information about fake drivers" [Low,Triaged] - Assigned to Shraddha Pandhe (shraddha-pandhe) 17:15:59 I'm ok with giving 1326269 more time 17:16:06 yea, those two "Low" bugs I'm just bumping 17:16:13 the first one seems a bit important (15533727) 17:16:29 though if we punt on it, it goes away 17:16:59 wow, that number is not at all right 17:17:02 1433727 17:17:11 lol 17:17:20 i would like to bring https://bugs.launchpad.net/ironic/+bug/1422723 to notice 17:17:21 Launchpad bug 1422723 in Ironic "Ironic sets instance as active even if bootloader installation fails when boot_option is 'local'" [Critical,In progress] - Assigned to Ramakrishnan G (rameshg87) 17:17:40 it has tight dependency with it's corresponding dib patch 17:17:43 rameshg87: have the DIB changes landed? 17:17:46 which has already landed 17:17:56 jroll, i think the only solution for that is to document that nova's force_config_drive needs to be False before/during upgrades 17:18:05 ouch, yea 17:18:15 adam_g: ok, mind putting up a patch for that? 17:18:22 jroll, ya, was planning on doing that 17:18:24 rameshg87: so ironic + dib + localboot is broken today? 17:18:27 adam_g: awesome, ty 17:18:28 so in-effect either "ironic + dib should both go in", OR "both shouldn't go in (means reverting back dib patch)" 17:18:32 jroll: yes :( 17:18:36 :| 17:18:41 rameshg87: @!(*& :( 17:19:00 rameshg87: wait. that's what we use in the gate ... it shouldn't be breakable 17:19:10 devananda: we don't use localboot 17:19:12 devananda: we don't test localboot on gate 17:19:14 ahh, right 17:19:28 which is silly imo but it is what it is 17:19:46 who wants to create a tempest scenario test for localboot? 17:20:00 seems like, you know, something we should have done when we added that feature 17:20:06 * rameshg87 will give a try if no one else raises hands before 17:20:11 * jroll nominates lucas :P 17:20:19 jroll: lol 17:20:48 rameshg87: I'll review 169419 after the meeting 17:20:48 so, tempest not testing that aside... i'm concerned with the patch to ironic 17:20:58 let's make reviewing that a priority post-meeting 17:21:04 devananda: yes, please ... 17:21:09 +1 17:21:19 thanks NobodyCam 17:21:26 #link https://review.openstack.org/#/c/169419/ 17:21:27 does it look like https://review.openstack.org/#/c/164369/ will be making it in this week? 17:21:45 thats the one I wanted eyes on 17:22:11 i'd like to get https://review.openstack.org/#/c/167038/ proposed as a stable freeze exception for 2014.2.3 (out thursday), but cant really do that until 164369 is in 17:22:12 adam_g: that one's been somewhat contentious 17:22:12 the last comment will invoulve hiding more fields? 17:22:52 adam_g: I think you can do https://review.openstack.org/#/c/167038/ now 17:23:05 adam_g: I just pushed 0.5.0 client release today, which adds the microversion headers by default 17:24:01 devananda, so v1.1 is / will be the equiv of the juno API? it sounded like there was some contention over whether that'd be v1.0 or 1.1 17:24:13 adam_g: indeed, there still is contention over that 17:24:29 adam_g: or rather, whether NOVERSION = Juno, v1.1 is some point in the beginning of kilo 17:24:39 or whether we actually have a supported v1.0 that was Juno 17:25:11 * NobodyCam just want to deside and move on :/ 17:25:24 I know what I want -- because I proposed it in https://review.openstack.org/#/c/164369/ 17:25:51 I mean, I don't see anyone that isn't ok with that patch 17:26:00 rloo just has some more work to do, seems valid 17:26:04 jroll: I haven't seen anyone +2 it 17:26:17 sure 17:26:22 I need to review it, still 17:26:23 which tells me no one is OK with it 17:26:38 (no one that reviewed it, anyway) 17:27:01 * NobodyCam is okay with it. 17:27:09 :-p 17:27:13 *ducks* 17:27:35 back-tracking a moment... 17:28:01 #info https://review.openstack.org/#/c/169419/ needs priority reviews: critical bug: pxe drivers + dib + local-boot option is broken right now 17:28:53 jroll: is anyone going to be NOT-OK if I abandon https://review.openstack.org/#/c/164369/ and we just keep the APIs as they are now? 17:30:12 That means custom or non-upgraded clients are broken by default unless they send the correct header, in the general case, yes? 17:30:32 devananda: we'll be fine, if that's what you're asking 17:31:27 devananda: humm and just leave maintenance reason field exposed? 17:31:27 JayF: I dont think so. not sending the header means they get v1.1 API 17:31:33 JayF: no, it means things like maintenance_reason etc (that were added before microversions) will be there 17:31:38 JayF: which is not exactly the same as Juno, but didn't have any backwards-incompatible changes 17:31:46 aha, okay, then I'm okay with that 17:31:47 JayF: basically juno vs $(when microversions were added) 17:31:56 eg, the NOSTATE->AVAILABLE rename is the really big thing that breaks folks, and that comes after v1.1 17:32:20 devananda: theoretically I'd prefer to have that patch, in reality I don't care that much 17:32:26 is my opinion 17:32:28 so if my patch doesn't land, I think it generally doesn't break anyone. it's just not as _clean_ as I'd like 17:32:48 jroll: cool. then it's not worth us discussing much longer, since there's still no concensus on it 17:32:55 * devananda abandons 17:33:06 whee. 17:33:27 * NobodyCam closes the ever open tab for 164369 17:34:23 here's some more bugs 17:34:31 https://bugs.launchpad.net/ironic/+bug/1431929 17:34:32 Launchpad bug 1431929 in Ironic "Ironic failing to gracefully handle: ipmi error "insufficient resources for session"" [Medium,In progress] - Assigned to Julia Kreger (juliaashleykreger) 17:34:49 TheJulia: you seem to hav ea fix, but you're stuck on unit tests, right? 17:35:10 the patch on that is wipped on a timing loop? 17:35:26 yea, challenges mocking time calls 17:35:31 devananda: basically yes. I've got 3 fo 4 tests working just fine, the ask in a review was to actually mock the timeout 17:35:32 quick question on that one: the bug says ipmi tuning didn't help... do we rate limit ipmi commands by node or by host that it connects to 17:35:58 IOW, is the real bug that we don't do rate limiting correctly for double bridging? 17:36:00 jroll: just made it worse :( 17:36:19 TheJulia: I'd be fine landing it sans that 4th test then 17:36:29 I would agree, but really the bug beyond that is that the ipmitool client should be retrying on it's own 17:36:56 TheJulia: ipmitool CLI does retry. but it's pretty brutal in my experience 17:37:20 i added all the retry / timeout guards in our ipmitool.py module to try to work around issues with ipmitool's CLI 17:37:35 https://bugs.launchpad.net/bugs/1436791 17:37:36 Launchpad bug 1436791 in Ironic "boot_mode is overwritten in node properties/capabilities when secure boot is requested for iLO drivers" [Medium,In progress] - Assigned to Shivanand Tendulker (shivanand-tendulker) 17:37:36 yeah, its treating this as a hard failure, hence the retry and string search 17:38:06 TheJulia: string search is not really good. what if someone's LOCALE is different? 17:38:17 LAST_CMD_TIME.get(driver_info['address'], 0)) # curious if this address is different per node or per bridge, that could be the issue here 17:38:30 jroll: it's the same for all devices in a chassis 17:38:32 the bridge is getting overloaded because we're spamming it with a thing for each node 17:38:35 oh? hrm 17:38:43 jroll: right. this code SHOULD guard against it 17:38:50 so wth 17:38:51 devananda: That is a really good point 17:38:57 welp. some things are flakey ... 17:39:15 context seems to be per node, not per bridge address 17:39:21 fwiw, I have a new OOB mgmt library in the works which might fix some of these issues for our moonshot 17:39:29 woot 17:39:31 but that's totally off topic. IPMI sanity is important too 17:40:07 re the ilo bug above, I suppose we'll leave it to the ilo driver folks, but we should get reviews on that one 17:40:31 https://review.openstack.org/167952 17:41:21 seems like a lot of code :/ 17:42:02 * devananda hasn't reviewed it yet 17:42:19 https://bugs.launchpad.net/bugs/1436824 17:42:21 Launchpad bug 1436824 in Ironic "Lost control of nodes with amt driver after user reboot " [Medium,In progress] - Assigned to Tan Lin (tan-lin-good) 17:42:38 lintan_: how's ^ going? 17:42:57 devananda: i think code is submitted. just the doc is pending 17:43:03 devananda: He might be asleep at this time. 17:43:17 devananda: https://review.openstack.org/#/c/167975/ 17:43:20 it's merged 17:43:24 oh, i see 17:43:27 * devananda closes bug 17:43:35 * rameshg87 notes devananda +Aed :) 17:43:40 wait what 17:44:04 oh, I see 17:44:05 there's a 5-line doc patch pending 17:44:16 kind of an odd fix, "only use localboot" 17:44:18 but alas 17:45:14 https://bugs.launchpad.net/bugs/1271317 17:45:16 Launchpad bug 1271317 in Ironic "XML support not required for ironic" [Low,In progress] - Assigned to Devananda van der Veen (devananda) 17:45:27 mrda did most of the work, then I redid it this morning 17:45:32 should be an easy +2 for folks 17:45:47 https://bugs.launchpad.net/bugs/1282836 17:45:48 Launchpad bug 1282836 in Ironic "PXE driver does not check if tftpd is running" [Low,In progress] - Assigned to Chris Krelle (nobodycam) 17:46:09 this one seems like an optimization to me 17:46:27 I guess what I mean is, low-ish value, high amount of distraction 17:46:30 NobodyCam: I reviewed ^ this morning. needs a bit of work. It's close to ready, but also very low-impact if we bump it 17:46:31 We should be careful that however we fix that honors dhcp_provider=None 17:46:45 JayF: +1 17:47:02 given that, I'm pretty tempted to bump it 17:47:05 TheJulia: you run dhcp_provider=None, but still run tftpd on the conductor yes? 17:47:09 devananda: yes and teh dependent patches have string changes 17:47:11 jroll: yes 17:47:12 jroll: yes 17:47:18 that's fun. 17:47:42 I'm ok with bumping the whole chain 17:47:45 k 17:47:45 TheJulia: devananda: that makes for some interesting things, we should talk about that more in L 17:48:48 NobodyCam: is https://bugs.launchpad.net/bugs/1431929 something we will actually mark as completed soon? 17:48:49 Launchpad bug 1431929 in Ironic "Ironic failing to gracefully handle: ipmi error "insufficient resources for session"" [Medium,In progress] - Assigned to Julia Kreger (juliaashleykreger) 17:48:55 erm, bad link 17:49:00 https://bugs.launchpad.net/bugs/1323589 17:49:00 Launchpad bug 1323589 in Ironic "Installation guide needs updating" [Medium,In progress] - Assigned to Chris Krelle (nobodycam) 17:49:04 that's the one! 17:49:41 NobodyCam: like is there a particular thing wrong in the installation guide that you're fixing? 17:50:02 just by the title we can probably leave that open forever 17:50:04 I need to see how that got assigned to me 17:50:14 oh. huh 17:50:15 I did a couple of patches 17:50:17 * rameshg87 sees 10 mins left 17:50:18 jroll: indeed 17:50:25 NobodyCam: looks like it's old. /me closes 17:50:35 ack 17:50:57 enough bugs for now -- I'll keep focusing on these for the next couple days 17:51:04 and we'll tag RC when the page is clear 17:51:05 thanks all! 17:51:09 #topic open discussion 17:52:40 https://review.openstack.org/#/c/166192/ - grub2 bootloader support for UEFI Boot Mode was on the list 17:53:15 BadCub: i guess stendulker is not here 17:53:41 rameshg87: indeed 17:53:44 BadCub: unfortunately i don't have full details on it :( 17:54:15 BadCub: but i think he wanted to share some finding on switching default boot loader from elilo to grub2 17:54:32 BadCub: don't have much info on it 17:54:40 I believe that is correct as well. But neither do I 17:54:43 rameshg87: ya that was the concern 17:54:59 That seems like a change that shouldn't go into K though, right? 17:55:10 Changing bootloaders a few days before RC seems unwise 17:55:23 JayF: no. i don't think so. 17:55:28 JayF: +100 17:55:41 Then why is it worth talking about here? Shouldn't it be spec'd for L like anything else going into L? 17:55:44 JayF: I think stendulker came up with a way to support both 17:55:57 I believe this was briefly discussed in earlier meetings and decided to hold this off for K. 17:56:57 as far as i can tell, this is incomplete feature work from the uefi-secure-boot blueprint 17:57:01 The patch is attached to the BP that was given a FFE for K. 17:57:06 meaning that was not actually completed 17:57:43 https://blueprints.launchpad.net/ironic/+spec/uefi-secure-boot 17:57:58 This was a PoC tries to address pxe-i,o secure boot Grub2 migraton concern 17:58:13 the FFE was to land the code that was in-flight at that time 17:58:19 which was marked completed when https://review.openstack.org/#/c/165907/ landed on 3/19 17:58:27 devananda: correct 17:58:36 https://review.openstack.org/#/c/166192/ was proposed a day later 17:58:46 * 2 minutes 17:58:57 wanyen: if it was known that this was going to be a problem, why wasn't https://review.openstack.org/#/c/166192/ proposed sooner? 17:59:29 wanyen: seems like very poor timing to ask for a FFE, then propose new code after that Feature is marked completed ... 17:59:41 wanyen: 166192 is only PoC? 17:59:55 devananda, this PoC was implemented after upstream raised concerns about bootloader migration to grub2 18:00:22 *time* 18:00:26 let's continue this in channel 18:00:35 great meeting THank you all 18:00:42 thanks all! keep up the great work closing bugs! 18:00:48 #endmeeting