17:02:12 #startmeeting ironic_qa
17:02:13 Meeting started Wed Oct 12 17:02:12 2016 UTC and is due to finish in 60 minutes. The chair is jlvillal. Information about MeetBot at http://wiki.debian.org/MeetBot.
17:02:14 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
17:02:16 The meeting name has been set to 'ironic_qa'
17:02:21 o/
17:02:21 o/
17:02:24 o/
17:02:30 Sorry I'm late. Got distracted fixing a bug :(
17:02:32 <[1]cdearborn> o/
17:03:12 As always the agenda is at:
17:03:16 * jlvillal fights with Synergy
17:03:17 \o
17:03:30 https://wiki.openstack.org/wiki/Meetings/Ironic-QA
17:03:38 #topic Announcements
17:04:04 #info Xenial fix for ironic-lib was done. Thanks vsaienk0
17:04:34 ++
17:04:43 #info New version of ironic-lib broke ironic unit tests. https://bugs.launchpad.net/ironic/+bug/1632762 Almost fixed
17:04:44 Launchpad bug 1632762 in Ironic "Unit tests fail with ironic-lib 2.2.0" [Critical,New]
17:04:59 is that related to slow iscsi io?
17:05:11 clarkb: no, bad mocks in unit tests
17:05:13 ah
17:05:16 clarkb: No. That was because we got sort of wonky unit tests.
17:05:21 clarkb: the iscsi thing turned out to be MTU problems
17:05:23 and is fixed
17:05:35 clarkb: What jroll said on MTU
17:05:40 thanks
17:06:01 clarkb: FYI: https://review.openstack.org/385433
17:06:12 #info patch proposed to enable Xenial for ironic-lib https://review.openstack.org/385433
17:06:21 Anything else?
17:06:37 Okay. Moving on then.
17:06:43 #topic Multi-node Grenade
17:07:34 #info Multiple patches are in the queue to get reviewed. Last I checked the multinode test was green with the entire patch series.
17:08:14 I don't have much to add there. I am working on setting up a local test environment for multi-node so I can test the patches and then be ready for adding Grenade to it.
17:08:19 Any questions/comments?
17:09:06 Okay moving on
17:09:14 #topic 3rd Party CI from krtaylor
17:09:24 krtaylor: The floor is yours.
17:09:47 sambetts: Can you start us off as I don't see krtaylor
17:10:03 sambetts: jroll: On the agenda are some questions from watanabe_isao
17:10:16 * jroll looks at agenda
17:10:35 So, there were several breakages last week caused by the SUBNETPOOL_PREFIX issue in devstack
17:10:52 the Cisco CI still hasn't recovered fully
17:10:53 mostly I think we should make it clear, and I have written my advice in the agenda.
17:11:09 watanabe_isao: I agree, I've been working on a draft for some of this
17:11:16 I'm sorry I've been slow about it
17:11:35 they have merged a fix into devstack to be able to optionally disable subnetpools now so that should help
17:11:42 sambetts: I know clarkb has been pushing on getting a fix for that.
17:12:04 ++
17:12:09 I think the neutron team has a better understanding of the issue after the most recent discussion so hopefully we can get to a place that makes them happy
17:12:19 awesome
17:12:49 jroll, what I am most worried about is that we don't have a tool to check the CI reply time.
17:13:30 watanabe_isao: yeah, it's sad. someone could build one, or extend ci-watch. :)
17:13:49 I don't expect us to be super strict about runtime
17:14:09 with regards to the result turnover for a third party CI, I don't think we yet enforce a return time on the ironic ones, but we have discussed what is a reasonable time frame in the past
17:14:11 but we need to have some sort of rules, waiting more than 8 hours isn't useful
17:15:44 jroll, and about the pass/fail rates, it is not mentioned in the spec, but if all drivers update their CI account in DriverLog, it would be easier to check for that in Stackalytics.
17:16:05 watanabe_isao: cool
17:16:57 irmc has done that, it is pretty easy.
17:17:23 yeah, krtaylor had updated some of them, I think
17:19:12 anything else on this topic?
17:19:15 Anything else?
17:19:22 jroll, sambetts, about the reasonable time,
17:19:33 it changed day by day.
17:20:06 what changed?
17:20:08 do we have a max patch set per day target to clear for the 8/4 hours time?
17:20:19 From the spec: Tests will be expected to complete and report back to gerrit within 8 hours of the patch submission until the end of Newton release and within 4 hours by the end of Ocata development cycle.
17:21:00 watanabe_isao: what do you mean by max patch set per day?
17:21:03 watanabe_isao: I would assume it should be able to keep up with the patches that are submitted. Not sure how many is normal per day.
17:21:07 watanabe_isao: it's every patch that touches code
17:21:23 there's no "max", just "all"
17:21:36 http://graphite.openstack.org/render/?width=586&height=308&_salt=1447868663.13&lineMode=connected&from=00:00_20141115&until=23:59_20151118&target=summarize%28stats_counts.zuul.pipeline.check.openstack.ironic.total_changes,%20%271d%27%29
17:21:46 sambetts: ++
17:22:02 sambetts, thanks, that is what I am asking.
17:22:30 the time scales on that graph need updating but that's what we based the spec on
17:23:12 so as you can see it needs to deal with spikes of 35+ patches and still return within 8 hours for all of them
17:23:22 http://graphite.openstack.org/render/?width=586&height=308&_salt=1447868663.13&lineMode=connected&from=00:00_20151115&until=23:59_20161118&target=summarize(stats_counts.zuul.pipeline.check.openstack.ironic.total_changes,%20%271d%27)
17:23:24 D:
17:23:25 that's 8hrs each btw
17:23:43 that has... dramatically increased
17:23:43 jroll: woah...
17:24:13 hm
17:24:18 * jroll needs to put some thought into this
17:24:35 that seems... high
17:24:37 jroll, sambetts, so documentation patches are included?
17:24:45 Of course, if someone uploads the same patch five times in a 5 minute period, it should kill the running job each time a new revision appears.
17:24:48 watanabe_isao: yes, that graph is all patches
17:27:11 jroll, thanks, however 600 per day is a big number...
17:27:46 I'm not sure that is right, there are definitely not 600 patch messages in the ironic irc channel
17:27:55 yeah, something is off there
17:28:03 watanabe_isao: like I said, I need to put some thought into this
17:28:21 jroll, sure.
17:28:25 it does seem we've been posting about 100 patchsets per day, though
17:29:00 though if you imagine one hour per CI job, that's only 4 per hour, so 4 simultaneous runs
17:29:14 I don't think that's unreasonable
17:30:16 jroll, if one driver has 3 sub drivers, it goes from 4 to 12 nodes.
17:30:37 watanabe_isao: right, I don't think that's an unreasonable amount of hardware
17:30:55 anyway, I will think about this a bit and try to write something up
17:31:32 watanabe_isao: oh, and if the rule is 4 hours, divide it by 4
17:31:38 so 1-3 nodes
17:31:47 * sambetts is going to have to drop, sorry guys, see you later
17:31:51 if runtimes can be kept to one hour
17:31:53 later sambetts :)
17:31:54 sambetts: Thanks!
17:32:06 sambetts, bye.
17:34:00 watanabe_isao: does that help? make sense?
17:34:07 we need to tweak the rules, I suspect
17:34:12 jroll, I don't quite get the divide it by 4, I was thinking you mean that 4 patchsets per hour come at max, when it is 100 patchsets per day?
17:34:54 watanabe_isao: yeah, thinking more, ignore that
17:36:20 jroll, and if a CI test can make sure it is done in 1 hour, then the 8 or 4 hour conditions are just the same.
17:36:55 watanabe_isao: yeah, I was wrong, I need more analysis to actually tell what this means (because patches aren't evenly distributed across time)
17:38:24 watanabe_isao: but otherwise, does this make sense
17:38:26 ?
17:38:37 jroll, yes, we are considering that the "100 patchsets" are posted in the American-time half day (12 hours), which means it may not be 4 per hour but 8 per hour.
17:39:00 right
17:39:45 Europe time zone pastes often too. Seems like more of a 12-16 hour period.
17:39:51 s/pastes/posts/
17:40:20 so beyond "jroll needs to analyze, think, and write about this" is there anything else on this topic?
17:41:07 jroll, I'm ok, and waiting for your answer about this. :)
17:41:18 ok, thanks for bringing it up :)
17:41:42 jlvillal: let's move on :)
17:41:48 jroll: Thanks
17:41:51 #topic Opens
17:41:55 Anyone have anything?
17:42:04 see dmitry's ML post?
17:42:12 people should read that thing :)
17:42:13 #info ironic gate is now working again. Unit tests are fixed.
17:42:41 * jlvillal looks and says Ah
17:43:21 #info dtantsur has proposed consolidating and extending Ironic CI jobs
17:44:22 http://lists.openstack.org/pipermail/openstack-dev/2016-October/105558.html
17:44:53 #info Jordan Pittier is trying to remove Ironic jobs from Tempest
17:45:12 http://lists.openstack.org/pipermail/openstack-dev/2016-October/105598.html
17:45:15 hrm
17:45:19 thiagop was working on that
17:45:26 whatever happened to him?
17:45:33 * jlvillal doesn't know
17:46:04 that email went to my spam O_o
17:46:11 :(
17:46:42 jroll: For me Gmail says: This message was not sent to Spam because of a filter you created. Edit Filters
17:46:59 * jlvillal had made filter that said never send openstack-dev email to spam.
17:47:18 oh, neat
17:47:34 Anything else to discuss?
17:47:42 * jroll suddenly realizes he isn't in -qa D:
17:47:45 I have nothing
17:47:53 heh
17:47:58 Okay. Thanks everyone!
17:48:05 #endmeeting
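
Note on the capacity arithmetic discussed under the 3rd Party CI topic: the figures quoted in the meeting (about 100 patchsets per day, one hour per CI job, 3 sub drivers, a 12 to 16 hour posting window) can be checked with a small sketch. This is only an illustration, not anything the team agreed on. The function names are hypothetical, the steady-state estimate assumes patchsets arrive evenly across the window (which jroll notes is not true in practice), and the burst estimate assumes the worst case where a whole spike arrives at once.

```python
import math


def concurrent_runs_needed(patchsets_per_day, active_hours, job_hours,
                           jobs_per_patchset=1):
    """Average number of CI jobs in flight needed to keep up.

    Assumption: arrivals are spread evenly over `active_hours`.
    Jobs in flight = arrival rate (per hour) * hours per job.
    """
    arrivals_per_hour = patchsets_per_day / active_hours
    return arrivals_per_hour * job_hours * jobs_per_patchset


def worst_case_turnaround_hours(burst_size, executors, job_hours):
    """Hours until the last job of a burst reports back.

    Assumption: all `burst_size` patchsets arrive at the same moment and
    each executor runs one job at a time.
    """
    rounds = math.ceil(burst_size / executors)
    return rounds * job_hours


if __name__ == "__main__":
    # Figures from the log; the log rounds the results to 4, 12, and 8.
    spread_over_day = concurrent_runs_needed(100, 24, 1)        # about 4.2
    three_sub_drivers = concurrent_runs_needed(100, 24, 1, 3)   # 12.5
    half_day_window = concurrent_runs_needed(100, 12, 1)        # about 8.3
    print(f"24h window, 1 job per patchset: {spread_over_day:.1f}")
    print(f"24h window, 3 sub drivers:      {three_sub_drivers:.1f}")
    print(f"12h window, 1 job per patchset: {half_day_window:.1f}")

    # A spike of 35 patchsets (the figure sambetts quotes) on 4 vs 5 executors.
    for executors in (4, 5):
        hours = worst_case_turnaround_hours(35, executors, 1)
        print(f"burst of 35 on {executors} executors: {hours}h")
```

Under these assumptions the reporting deadline (8 or 4 hours) does not change the steady-state count, which matches jroll's retraction of "divide it by 4"; the deadline only matters for how large a burst a given number of executors can absorb.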