17:00:17 #startmeeting ironic 17:00:18 Minutes (text): http://eavesdrop.openstack.org/meetings/ironci/2016/ironci.2016-11-21-17.00.txt 17:00:19 Log: http://eavesdrop.openstack.org/meetings/ironci/2016/ironci.2016-11-21-17.00.log.html 17:00:20 o/ 17:00:21 Meeting started Mon Nov 21 17:00:17 2016 UTC and is due to finish in 60 minutes. The chair is jroll. Information about MeetBot at http://wiki.debian.org/MeetBot. 17:00:21 o/ 17:00:22 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 17:00:23 o/ 17:00:24 o/ 17:00:25 The meeting name has been set to 'ironic' 17:00:25 o/ 17:00:26 o/ 17:00:26 \o 17:00:29 o/ 17:00:30 <_milan_> o/ 17:00:31 o/ 17:00:33 o/ 17:00:36 as always the agenda is here 17:00:40 #link https://wiki.openstack.org/wiki/Meetings/Ironic#Agenda_for_next_meeting 17:00:48 o/ 17:01:23 #topic announcements and reminders 17:01:25 o/ 17:01:38 reminder pike PTG signups are open, and limited to 500 people, so sign up early 17:01:40 #link https://www.eventbrite.com/e/project-teams-gathering-tickets-27549298694 17:01:54 nova spec on portgroups merged 17:01:57 also thanksgiving is this week in the US, so expect lots of people off toward end of week 17:02:04 o/ 17:02:28 anything else? 17:03:08 #topic subteam status reports 17:03:13 as always those are here 17:03:15 #link https://etherpad.openstack.org/p/IronicWhiteBoard 17:03:19 starting around line 84 17:03:24 (priorities at line 74) 17:03:28 o/ 17:03:43 I've requested Newton releases for all projects, they'll hopefully be processed soon 17:03:54 thx dtantsur 17:04:02 oh yes, awesome 17:04:02 also Liberty reaches EOL as we speak 17:04:20 sambetts, vdrok: is that the LAST ironic spec for portgroups support? /me is getting tired of... 17:04:46 lol 17:04:48 rloo: you mean that update? I think so, yes 17:04:54 s/think/hope 17:04:59 :D 17:05:07 w00t 17:05:12 vdrok: /me hopes too! :) 17:05:32 lucasagomes: is that a cheer for eol'ing liberty? 17:05:48 rloo, for vdrok, but yeah works for liberty as well :-) 17:05:56 lucasagomes: :) 17:06:04 lucasagomes, don't be too glad, I think we still support it downstream :D 17:06:16 dtantsur, we do yeah 17:06:16 dtantsur: /o\ 17:06:17 sambetts: what's the status of the attach/detach patches? 17:06:36 New update for the Ironic side should be going out today :) 17:06:50 sambetts: great! thx 17:08:21 so i guess the deploy steps will be 'on hold' for a few weeks. mat128 is away for a bit but i forgot how long. 17:08:35 yep 17:08:47 the goal there is just to get the spec done, so I think that will be fine 17:09:45 jroll: what are this week's priorities? 17:10:00 rloo: still ingesting all the updates 17:10:06 oh yeah, root device hints are done, thx lucasagomes! 17:10:22 yeah, the last bit was only docs :-) 17:11:00 rloo: looks like we could just keep iterating on last week's stuff? 17:11:06 looks like they all got some action 17:11:27 we could. jroll do you think there will be anything for #5 this week? given your short week etc 17:11:48 rloo: yeah I hope to have a code update today or tomorrow 17:12:14 jroll: ok 17:12:38 #6 should be pretty quick to merge I think 17:12:54 * dtantsur hopes so 17:12:58 +1 17:13:14 so i see three? specs that need work: rolling upgrades, ipa REST API, fault support. are they at a state where we could meet to go over them, or should we wait another week or more? 17:14:02 I'm honestly not sure 17:14:03 i think fault support should wait at least another week 17:14:14 ok, let's wait then :) 17:14:20 JayF may correct me :) 17:14:39 I plan on having a reviewable vesion of that up by EOW 17:14:55 although I suspect it'll need more than a little review back-and-forth 17:15:32 Ok, I'll bring it up again in 2 weeks (if i remember) :) 17:16:12 cool 17:16:15 anything else on this? 17:17:33 #topic Drop QA/CI meeting and just handle any topics in this meeting? 17:17:36 should be super quick 17:17:46 in the QA meeting wednesday we agreed we don't need a separate meeting 17:17:54 can just use subteam updates and such 17:17:56 any objections here? 17:18:18 jroll: did you discuss with john? 17:18:52 rloo: I have not, good point 17:18:57 was hoping he'd be here I guess :) 17:19:09 jroll: he might be out this whole week 17:19:14 right, I know 17:19:28 i'm fine with dropping it as long as john is :) 17:19:31 if there's no objections here I'll check it with him and then do the thing 17:19:36 I propose we skip it this week thuogh :) 17:19:45 +1 to skipping it! 17:19:46 I'm all for it, as I usually cannot make it to the second evening meeting 17:20:07 krtaylor: mind sending a skip notice to the ML ? 17:20:12 yeah, same for me 17:20:40 if the ppl that attend the meeting agrees on dropping it, i've no objections 17:20:47 All for skipping this week and dropping it since it should bring more clarity to this meeting. 17:21:04 cool, seems everyone agrees, I'll email john 17:21:07 thanks! 17:21:11 #topic open discussion 17:21:14 who's got a thing? 17:21:27 jroll: i have something under rfe review 17:21:33 4 things actually 17:21:52 rloo: they were added within 20 minutes of the meeting, then :/ 17:21:58 * jroll refreshes 17:22:01 #topic RFE review 17:22:03 jroll: yeah. my bad. supposed to be 2 days or so before 17:22:11 fire away :) 17:22:33 so i am slowing going through our rfes. checking to see which need specs or not. here are the oldest four i picked to discuss 17:22:43 https://bugs.launchpad.net/ironic/+bug/1588177 17:22:43 Launchpad bug 1588177 in Ironic "RFE: Allow ilo drivers to choose the ports to be inspected" [Wishlist,In progress] - Assigned to Shivanand Tendulker (stendulker) 17:23:00 i don't think it needs a spec but ?? 17:23:18 I would like it to converted to "Allow to choose the ports to be inspected 17:23:27 because I just got the same complains about pxe_drac inspection 17:23:46 I'm fine with an RFE only as long as its generic and not iLO specific 17:24:07 then, however, it's going to be funny to implement for ironic-inspector (requires API update), but we'll handle it :) 17:24:11 dtantsur: makes sense to me. 17:24:16 sounds more like "specify nics to not be managed by ironic", no? 17:24:17 dtantsur: the generic part. 17:24:28 the funny part, i dunno. i'm not laughing yet anyway :) 17:24:30 jroll, to not present to Nova 17:24:43 dtantsur: yeah, that too I suppose 17:24:47 we don't "manage" ports IIRC, but nova picks the random one for provisioning 17:24:53 ok, agree about generic 17:25:04 we have the pxe_enabled field now 17:25:12 we should use those ports 17:25:14 ++ for generic... And, should we use the pxe_enabled for it? 17:25:17 sambetts, =+ 17:25:19 ++* 17:25:21 sambetts: this is about inspection 17:25:23 sambetts++ does nova always use it? 17:25:25 so that data isn't there yet 17:25:27 right? 17:25:36 dtantsur, not at the moment, I think 17:25:50 this feature is easy without inspection, the user just doesn't add the ports 17:26:06 yep. but inspection adds all ports 17:26:21 dtantsur: this can be suppressed, right? 17:26:21 (except for ironic-inspector which hacks around it, sigh) 17:26:22 right, so pxe_enabled doesn't exist yet, so we can't use it for this 17:26:37 pas-ha, for OOB inspection? 17:27:01 I remember there was something like do not add ports if found new... 17:27:43 or is it for discovery phase we are talking about? 17:28:02 * jroll comments his thoughts on the rfe 17:28:41 pas-ha, I think you're talking about ironic-inspector 17:28:58 pas-ha, and here we're discussing OOB inspection as implemented by (at least) pxe_ilo and pxe_drac 17:29:10 ok, now I got this 17:30:02 IMO I think inspector should firewall off all ports that have pxe_enabled=False, we can't do anything about ports we don't know about yet 17:30:25 that's... a different topic, though 17:30:25 sambetts, let's keep on topic 17:30:30 otherwise it will go crazy 17:30:32 this isn't about inspector or existing ports 17:30:39 still have 3 more RFEs 17:30:48 so do we want a spec? or is it sufficient if they can clarify in the rfe? 17:30:49 anyone disagree on approving a generic version of this one? 17:31:02 * jroll is fine without a spec, if it's generic 17:31:08 ++ for generic 17:31:13 ++ 17:31:17 ++ for generic 17:31:23 ++ 17:31:23 ++ 17:31:36 seems good enough to me 17:31:41 rloo: next! 17:31:51 https://bugs.launchpad.net/ironic/+bug/1596421 17:31:51 Launchpad bug 1596421 in Ironic "RFE: Increase size of data base entry for instance_info to allow configdrives larger than 64KiB" [Wishlist,In progress] - Assigned to John L. Villalovos (happycamp) 17:31:51 ++ 17:32:20 there's discussions in that rfe, and patches. but i don't know that we have decided on anything? 17:32:28 oh did we not? 17:32:45 jroll: i don't know. i can't tell from looking at the rfe. 17:32:45 So I thought the discussion w/r/t solving that problem about configdrive size 17:32:55 was to stop storing a configdrive at all. 17:33:05 JayF, that's my understanding as well 17:33:10 No other nova driver stores and re-uses a configdrive on rebuild. 17:33:21 ah, yeah, that was a thing 17:33:27 but our rebuild does not recreate nor offer it upon our rebuild 17:33:28 tho... we will need to change our API 17:33:29 In fact, today, the fact you can't re-pass configdrive into ironic when doing a rebuilt is an API change from other nova drivers on rebuild 17:33:39 to allow rebuild to have a configdrive parameter 17:33:42 err, our nova driver 17:33:48 lucasagomes: ++ 17:33:55 lucasagomes: ++ 17:34:54 folks, I'd like some opinion on the discussion started here: https://review.openstack.org/360330/ 17:35:31 gabriel-bezerra: please wait for open discussion 17:35:35 oh, sorry 17:35:50 I know folks who have modified ironic to automatically build and attach config drives 17:35:52 TheJulia: JayF: lucasagomes: is there someone that can write down that decision and carry it forward to rfe/spec approval? 17:36:03 jroll: I thought someone had done that, in a bug somewhere 17:36:06 * JayF looks 17:36:07 i thought I had seen the "open discussion" topic 17:36:13 JayF: not that I'm aware of 17:36:14 JayF: don't we need to store it at least for the period before we write it to the node?? 17:36:20 gabriel-bezerra: yeah, that was a misfire, no worries 17:36:28 JayF: I thought the same had already been done, since there was >1 related bugs 17:36:29 https://bugs.launchpad.net/ironic/+bug/1575935 17:36:29 Launchpad bug 1575935 in Ironic "Rebuild should also accept a configdrive" [Medium,Confirmed] - Assigned to Stephane Miller (stephaneeee) 17:36:39 We should unassigns that 17:36:45 unassign 17:37:11 TheJulia, ++ 17:37:11 JayF: so we should reject the other rfe and point to that, then 17:37:13 right? 17:37:15 +1 17:37:19 ok 17:37:59 * jroll has done it 17:38:02 lucasagomes: removed assignment, since I've talked to cinerama in the past few weeks 17:38:05 what's next rloo :) 17:38:09 ty 17:38:11 https://bugs.launchpad.net/ironic/+bug/1599425 17:38:11 Launchpad bug 1599425 in Ironic "[RFE]: Add few more capabilities to ironic inspection" [Wishlist,Confirmed] 17:38:29 * dtantsur wants a spec on that 17:39:00 There already is one if memory serves 17:39:03 we've been way too relaxed with adding capabilities. so now pxe_ilo detects something, ironic-inspector detects something different. 17:39:15 and other drivers are probably not detecting any 17:39:36 https://review.openstack.org/#/c/338138/ 17:40:19 TheJulia: oh, that's the spec for that rfe. ok. i'll just tag it as 'needs-spec' then :) 17:40:20 right 17:40:24 +1 17:40:53 last but not least and easy i think: https://bugs.launchpad.net/ironic/+bug/1626106 17:40:53 Launchpad bug 1626106 in Ironic "[RFE] Install optional dependencies when running unit tests" [Wishlist,Confirmed] 17:41:08 uhhh 17:41:15 Just the title makes me a little concerned 17:41:22 lol 17:41:33 ah, this one. yes 17:41:36 TheJulia: ? bells ringing? 17:41:52 IMO third party CI should be covering this 17:41:55 we got a problem when our tests for one of the drivers worked perfectly with mocks, but not with actual thing installed 17:42:02 * jroll doesn't see any reason not to do this 17:42:04 sambetts, 3rdparty CI to run unit tests? 17:42:06 so the upside is "catches problems with optional dependencies" and the downside is "will make g-r hate us"? 17:42:14 I think we should just do it 17:42:17 mgould, yes 17:42:18 dtantsur: hmm good point :/ 17:42:22 dtantsur: that is exactly the reason why to do it 17:42:28 why would this make g-r hate us? 17:42:42 jroll: none of the driver-requirements are in g-r or u-c 17:42:44 this doesn't affect g-r 17:42:45 so? 17:42:49 rloo: My only nit would be add "python" to the subject :) 17:42:51 this has nothing to do with g-r, good or bad 17:42:56 we wouldn't put the driver requirements in g-r 17:43:03 oh, good, I must have misunderstood rloo's comment 17:43:07 TheJulia: oh. ha ha. 17:43:15 what is g-r? 17:43:17 jroll, they may pull incompatible requirements to our unit tests 17:43:24 gabriel-bezerra, global requirements 17:43:25 if it was system level packages, I would be very concerned 17:43:33 dtantsur: not if we install them with u-c enabled 17:43:37 :) 17:43:41 gabriel-bezerra: https://github.com/openstack/requirements#global-requirements-for-openstack-projects 17:43:46 thanks 17:43:51 TheJulia: updated! 17:43:52 jroll, if we don't add these requirements to g-r, they may contradict u-c 17:43:55 \o/ 17:44:01 which will break our unit tests without ability to recover them 17:44:13 TheJulia: for system level, there's http://docs.openstack.org/infra/bindep/index.html which I think we want to move to 17:44:14 this is why everything is put into u-c 17:44:19 dtantsur: if that's a real concern, we can keep it non-voting, and file bugs when that happens 17:44:23 gabriel-bezerra: tl;dr all OpenStack projects must have compatible requirements.txt files 17:44:28 jroll, this will work, I guess 17:44:34 got it 17:45:04 yeah, as non-voting extra unit test job - I'm fine with that 17:45:14 it almost seems like a feature, not a bug 17:45:22 that it could expose g-r + driver-requirements incompatibilities 17:45:26 ++ ^ 17:45:26 JayF: as long as it's non-voting yeah :) 17:45:37 because then we can get he maintainers of the requirement to make it compat :) 17:45:44 agree 17:45:53 maybe make it voting once all in-tree drivers are compatible 17:46:44 mariojv, I doubt it. We don't even have ironic-inspector voting on ironic :D 17:46:49 * ironic-inspector job 17:46:51 ok, any objections to approve it? 17:46:59 None here 17:47:01 ++ for approving 17:47:04 nop 17:47:40 thx all. that wasn't too painful. i'm going to add 10 next time (j/k) 17:48:14 awesome 17:48:16 rloo: just add one per week ;) 17:48:18 open discussion? 17:48:22 thanks rloo, this is useful 17:48:46 ok, 4 or fewer :) 17:48:53 open discussion would be good, I think gabriel-bezerra had a question 17:48:56 As I would introduce earlier, I'd like some opinion on this: https://review.openstack.org/360330/ 17:49:12 the point is: oneview needs the machine to be turned off for it to set the boot device 17:49:13 #topic open discussion 17:49:34 agent driver tries to set the device while it is turned on 17:49:56 2 ways to fix it: change the agent driver or change oneview driver set-boot-device to shut the machine off 17:50:07 that patch goes the second way 17:50:38 what's the reasoning behind the first? 17:50:40 So I think this is a oneview centric thing, so it seems like it would need to be addressed in the oneview driver 17:51:04 can't we make it generic? add some flag to driver_info? 17:51:13 i agree with TheJulia, without having read the code. is there any disadvantage to the second option? 17:51:26 the first: changing the agent to turn the amchine off before changing the boot device? 17:51:29 I would suggest if you need to change the order of operations in the agent deploy interface you inherit it and create a oneviewagentdeployinterface 17:51:30 jroll: ^ 17:51:42 anyway, what's the question? how to address it? 17:51:47 gabriel-bezerra: yeah, what would be the advantage of that? 17:51:54 the second option also allows the oneview driver to error if a user tries to explicitly change the boot device via the api, which right now, I guess would silently fail... 17:52:11 we have a klugde today that we inherit the driver and copy-paste most of the code 17:52:25 whenever it changes, our driver breaks together 17:52:49 O_o 17:53:06 gabriel-bezerra: so that seems like a second problem we need to solve as well 17:53:10 the problem I see with the existing patch is that if we change the agent driver to do a inband operation after setting the boot device then it'll break because you've powered off the node 17:53:13 gabriel-bezerra, if you set the boot device when the machine is on it fails ? Or it suceeds and it gets applied later after the machine reboots ? 17:53:20 i mean.. part of the agent code is copy-pasted, so that we put the shut off code there 17:53:25 yes 17:53:32 lucasagomes: it fails 17:53:36 hmm 17:53:43 sambetts: I don't see us doing that though :P 17:54:02 another option i'd put is: add a hook to the agent driver and we could just put the shut off there. 17:54:08 gabriel-bezerra, what if you save the choosen boot device inside driver_internal_info and applies it when the machine is powered off ? 17:54:24 so ironic can lazily do it for oneview (instead of relying on the hardware for it) 17:54:44 yep, we do it for ipmitool 17:54:46 power_off() -> checks if there's a pending boot device -> applies/skip -> power off 17:55:00 I mean power off -> apply/skip 17:55:09 sounds like an option 17:55:31 sounds better than doing a hard power off when you call "node-set-boot-device" for me 17:56:22 I guess driver specific docs could highlight that difference if a user tries to explicitly set the boot device 17:57:14 something the patch shows is: even shutting the machine off during a set-boot-device, all tempest tests pass.. 17:57:39 the patch doesn't bring it back to the previous state 17:58:14 seems like all use cases so far for setting boot devices is followed by a reboot or power on 17:58:38 except the api :) 17:58:40 .. or that is obvious, and I'm just saing bs here 17:58:43 yeah.. 17:58:46 :) 17:59:17 yeah, idk if I have a good answer, need to think on this more 17:59:45 and time is up 17:59:45 small review request - couple of small patches - https://review.openstack.org/396355, if you have some time 17:59:52 to gabriel-bezerra's patch! 17:59:55 thanks y'all 17:59:57 #endmeeting