17:01:38 #startmeeting qa 17:01:39 Here 17:01:39 Meeting started Thu Jun 20 17:01:38 2013 UTC. The chair is sdague. Information about MeetBot at http://wiki.debian.org/MeetBot. 17:01:40 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 17:01:42 The meeting name has been set to 'qa' 17:01:59 #topic Blueprint Status 17:02:16 please check in with any updated info on your blueprints using the #info tag 17:02:35 we'll give everyone a minute or two on that 17:03:34 sdague: no blueprint link this time? 17:03:45 #link https://blueprints.launchpad.net/tempest 17:03:52 sorry, I'm slow :) 17:04:08 #link https://launchpad.net/tempest/+milestone/havana-2 17:04:24 those are the havana 2 blueprints, which we are only 3 weeks away from, and there are a lot there 17:05:18 I'm going to try to get some progress on the stress tests. 17:05:19 davidkranz: what's need to close out the stress test one 17:05:23 ok, great 17:05:27 Volume was just added. 17:05:45 sdague: The most important thing is to create the job that will run it 17:05:55 sdague: And set a periodic build. 17:06:08 sdague: RIght now there is nothing making sure it actually works. 17:06:29 sdague: But there is an issue that these tests are really targeted to run in an environment not supported upstream. 17:06:30 davidkranz: ok, great, is that still doable for havana-2? 17:07:03 sdague: I am checking on that and will update accordingly. 17:07:07 davidkranz: well I think we had a policy that if we can't run it in CI some how (periodic or otherwise) it can't really be in tempest 17:07:13 because otherwise we get bit rot 17:07:31 that was a big issue with the old stress tests. 17:07:44 right, exactly 17:08:05 mtreinish: Unfortunately the reason the old stress tests could not be run still remains. 17:08:12 mtreinish: All the bogus log errors. 17:08:25 davidkranz: well we can run it as a non enforcing job 17:08:56 also, if there are specific bugs for each error in the logs, they are likely to get addressed 17:09:01 sdague: We need to do that anyway but the real issue if false positives if we can't check the log for legitimat errors 17:09:15 typing too fast... 17:10:00 sdague: I filed a whole bunch of bugs a few months ago 17:10:09 davidkranz: sure, but we can call out the false possitives to code around them, much like skips in our tempest code 17:10:15 sdague: Some were fixed, some not. 17:10:37 sdague: Yes, we can do that but it is very fragile. 17:10:38 ok, well lets leave it at the following 17:11:03 #action davidkranz to see if stress tests can be completed for h2 milestone 17:11:20 sdague: OK 17:11:32 mtreinish: I know I put testr later in the agenda, but you want to talk about it now, given that it's also a critical blueprint? 17:11:50 sdague: sure 17:12:05 mtreinish: Is there any issue with the admin tests running in parallel? 17:12:22 mtreinish: There is no tenant isolation there. 17:12:30 so I've got a patch pending for testr that partitions on test classes and it's giving runtime numbers that are what we expect 17:12:40 davidkranz: not that I've seen in my runs 17:12:43 mtreinish: Great! 17:13:30 mtreinish: any word from lifeless yet about his evaluation on the patch? 17:13:31 I'm waiting on lifeless to review the testr patch before I push out a change that moves run_tests and tox over to testr 17:13:40 ok, cool 17:14:05 once it gets merged we should be able use testr for everything 17:14:24 I'm sure it will shake loose some races but we debug those as they come up 17:14:43 yep 17:14:50 that will be great 17:14:51 but at least locally things have been fairly stable on my dev box 17:15:07 mtreinish: Maybe we should keep it work in progress and rerun it a bunch of times before merging? 17:15:09 mtreinish: you want to update that blueprint to good progress? 17:15:15 if anyone cares I've put some initial numbers up here: https://etherpad.openstack.org/testr-numbers 17:15:27 #link https://etherpad.openstack.org/testr-numbers initial tester numbers 17:15:46 also the testr patch is here if anyone wants to try it: https://code.launchpad.net/~treinish/testrepository/testrepository 17:15:50 #info https://blueprints.launchpad.net/tempest/+spec/speed-up-tempest now Good Progress, just waiting for test patch inclusion 17:15:51 mtreinish: I would hesitate to put that in the gate based on one successful run. 17:16:17 davidkranz: yeh, probably we'd want to make a separate tox rule for a full run 17:16:28 and run it in shadow mode for a week or two 17:16:32 just to make sure we are good 17:16:37 sdague: Sounds good to me. 17:16:41 davidkranz: yeah that's a fair point 17:16:50 sdague: yeah sounds like a good idea 17:17:09 ok, anything else on blueprints? 17:17:18 and, anyone here besides just us 3 :) 17:17:18 also testr has '--until-failure' and '--analyze-isolation' which we could maybe setup as a periodic too 17:17:47 mtreinish: Do you have a feel as to why the speedup falls sharply after 2x? 17:18:03 mtreinish: Of course 2x would be great :) 17:18:33 davidkranz: I think one thing that came up is that it's not using the timing database to optimize 17:18:41 hi 17:18:46 davidkranz: not really, I was happy with the numbers but I haven't looked into optimizing them further yet 17:18:53 I'm sure there is room for improvment 17:19:06 hey afazekas 17:19:13 mtreinish: Sure, just curious. At some point it will slow just due to load. 17:19:20 afazekas: anything from you on blueprints before we move to reviews? 17:19:21 hey sdague 17:20:21 ok, reviews 17:20:28 #topic Reviews that need attension 17:20:54 I will try to figure out why I have the issue with glance+swift+wc2, after that I will try to reproduce the real ec2 issue in a loop (as background job), at the same time I will work on the resource leak , and add some docs 17:21:03 looks like we actually have a lot of reviews with 1 +2 on them 17:21:14 s/wc2/ec2/ 17:21:16 sdague: Well, there is https://review.openstack.org/#/c/33689/ :) 17:21:39 sdague: I think a few are waiting for non-red-hat approval. 17:21:55 davidkranz: sdague started a ml thread on that one 17:21:55 davidkranz: sure, I'll take a look later today 17:22:25 mtreinish: I know. Should be fun. We've been down this road before. Swift folks just don't agree with the API stability policy. 17:22:28 davidkranz: yeh, on the swift one there is a thread here: http://lists.openstack.org/pipermail/openstack-dev/2013-June/010689.html 17:23:22 ok, I need to go figure out why my patch died on pg 17:23:30 didn't notice that one yet 17:23:41 ok, any other reviews of note? 17:24:16 ok, so the last thing on the agenda which we didn't yet talk about was multiple api versions - https://wiki.openstack.org/wiki/Meetings/QATeamMeeting#Weekly_QA_Team_meeting 17:24:28 #topic Multiple API versions 17:24:57 there was an ML thread on this, we think it's resolved enough there, or is there need for more discussion? 17:25:17 sdague: that fail should just be a recheck. I opened a bug for it here: https://bugs.launchpad.net/devstack/+bug/1192990 17:25:19 Launchpad bug 1192990 in devstack "Keystone certificate error with heat" [Undecided,New] 17:25:25 sdague: I don't think it is quite resolved there but we could continue the discussion there. 17:25:38 ok, we'll take the discussion there 17:26:01 My issue is how we test new and old versions that bhavior has not changed without duplicating the tests. 17:26:06 #topic Open Discussion 17:26:14 davidkranz: I don't think you can 17:26:30 sdague: That is a big problem given the size of nova, e.g. 17:26:32 duplicating is probably the right way from an issolation perspective, it also lets you remove old versions over time 17:26:33 davidkranz: yeah there is no guarentee something won't change between versions so we just have too test both 17:26:56 davidkranz: nova is duplicating the code paths internally for the same reason 17:27:05 mtreinish: Yes but I hope we can find a way to duplicate dynamically but not statically. 17:27:21 davidkranz: the resource structures are going to change as well 17:27:22 _interface='xml' 17:27:23 mtreinish: The test code will be the same if the behavior has not changed. 17:27:38 davidkranz: the reason for the new api is because it did change 17:27:47 davidkranz: not necessarily things like return codes will probably change 17:27:50 if it didn't change, you wouldn't need a bump 17:27:56 sdague: Every api changed? 17:28:01 I am not sure the current '_interface' solution is the nicest way how to avoid code duplication, but it works 17:28:31 the services could be responsible for the error code checking 17:28:35 We can continue this on the list but a real proposal is needed. 17:28:49 davidkranz: probably not every, but enough move around that it's easy to miss a test bug inside a complicated if / else around versions 17:28:56 'services' ie. rest_clients 17:29:09 sdague: OK, I did not realize it was this extensive. 17:29:12 things like tenant ids in urls going away 17:29:22 sdague: That is a localized change. 17:29:38 sdague: Obviously the rest client will change. 17:29:44 yeh, but the problem is that you can come from 2 points of view 17:29:57 assume it is all the same, and put complexity into the tests when it's different 17:30:09 or assume it's all different, and factor out common bits when it's the same 17:30:30 I feel like not knowing how same or different it will end up, option 2 is safer 17:30:41 sdague: I hope it is the first. Otherwise users who upgrade will suffer a lot. 17:30:56 davidkranz: that's why it's a major version bump 17:31:06 and why v2 will still be in havana 17:31:23 sdague: If history is a guide, once we get more stable there will be a lot of resistance to changes in major versions as well unless they are really necessary. 17:32:07 davidkranz: honestly, I don't expect v3 to be the final word, if the interfaces don't evolve over time, they don't stay relevant 17:32:18 but that's a meta discussion :) 17:32:28 sdague: Right. 17:32:34 ok, anything else from folks 17:33:07 Nope. 17:33:11 nothing from me 17:33:40 ok, thanks all 17:33:43 #endmeeting