17:00:19 #startmeeting qa 17:00:19 Meeting started Thu Aug 29 17:00:19 2013 UTC and is due to finish in 60 minutes. The chair is mtreinish. Information about MeetBot at http://wiki.debian.org/MeetBot. 17:00:20 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 17:00:22 The meeting name has been set to 'qa' 17:00:28 hi 17:00:31 hi 17:00:31 hi who's around for the meeting? 17:00:41 google hangout? https://plus.google.com/hangouts/_/0d06aa2bdbfda14c66339811d843480446a54ab3?hl=en ? 17:00:51 me 17:01:09 guys, I would prefer we not use hangout 17:01:12 here's today's agenda: https://wiki.openstack.org/wiki/Meetings/QATeamMeeting 17:01:15 #link https://wiki.openstack.org/wiki/Meetings/QATeamMeeting 17:01:24 just for video 17:01:26 ok np 17:01:45 lets get started I guess it's a pretty long agenda today 17:01:54 #topic testr status 17:02:01 Here 17:02:16 so the big news on testr is that we are now running in parallel for the gate 17:02:24 and everything has been made parallel by default 17:02:35 so at this point its just bug triage and fixes 17:02:45 mtreinish: Very cool! 17:02:45 to make things as stable as they were running serially 17:03:24 giulivo: you wanted to talk about a bug day related to testr? 17:03:48 mtreinish, yeah maybe we can send to the -dev list an announcement for a bug triage day? 17:04:12 let's say something in mid september? would that work for you guys? 17:04:29 giulivo: sure that's probably a good idea, it's been a while since we last had one 17:04:34 and the bug list is getting biggert 17:04:40 giulivo: Sure 17:05:02 ok I'll do that tomorrow at last 17:05:17 giulivo: do you want to take on the effort of organizing it and pushing that forward? 17:05:54 are there preferences for a particular day? a week day I suggest rather than during the weekend, apart from that, I'd pick something from the 2nd week of sept 17:06:34 giulivo: that's fine with me but we can discuss the details on -qa or the ML 17:06:46 Hi, sorry for being late 17:06:55 tkammer_: np, welcome 17:07:08 #action giulivo to setup a bug day for sometime in sept. 17:07:13 I am late as well, sorry 17:07:20 mtreinish, fine with me 17:07:42 ok and dkranz you want to talk about writing tests in parallel and a process for approving them in the gate 17:08:19 mtreinish: Some of the recent problems made it clear that it is not obvious some times 17:08:30 when a test will interfere with some other 17:08:46 We have locks within a test class, and tenant isolation, but other issues can occur 17:09:04 So my thought was to have a way to mark a test to not fail the build if that test fails 17:09:23 Then after some time it is removed and the test becomes part of the gate. 17:09:29 I think we will have the same problems for stress tests that run in parallel 17:09:42 mkoderer: But stress tests are not gating 17:09:49 thats right 17:09:53 mkoderer: It is the gate, and blocking builds, that concerns me. 17:10:02 dkranz: correct 17:10:18 I am going to send a message to the list with some guidance. 17:10:29 dkranz: I'm not opposed to having a nonvoting set of jobs first, it's just in my experience nonvoting jobs don't ever get any attention 17:10:43 dkranz: how to prevent adding racy codes to nova or neutron ? 17:11:03 and flaky tests are going to happen 17:11:19 afazekas: We can't, but nova developers are constantly aware of this issue. 17:11:35 In tempest we cover all apis together which is more difficult. 17:11:37 I'd rather see them gate and catch more things 17:11:56 even if it causes headaches sometimes with flaky fails 17:12:00 mtreinish: ok, as long as our fearless ptl signs on 17:12:08 dkranz, is the suggestion to avoid tagging the tests with 'gate' or to use some different particular tag? 17:12:25 giulivo: My idea was a different tag 17:12:44 dkranz: because flaky fails are bugs, albeit sometimes obscure ones 17:12:58 dkranz: ok yeah sdague can weigh in on this when he's back next week 17:13:11 mtreinish: I know. I guess I'm just a little more conservative 17:13:22 But I can go with the flow 17:13:34 dkranz: heh, ok 17:13:36 That's it 17:13:54 I still think moving forward we should have a wiki page or a section in the docs about potential parallel issues 17:13:58 I do not see this kind of issues fixed in fast way: https://bugs.launchpad.net/neutron/+bug/1211915, https://bugs.launchpad.net/tempest/+bug/1205344 17:13:59 Launchpad bug 1211915 in neutron "Connection to neutron failed: Maximum attempts reached" [High,Confirmed] 17:14:41 lets move on, afazekas we can talk about that during the neutron topic 17:14:50 #topic stress tests status 17:14:56 ok 17:15:02 mkoderer: you're up 17:15:09 so we have a decorator now @stresstest 17:15:29 what is left is to identify good candidates 17:15:43 to get this decorator 17:15:54 mkoderer: do you have a list of criteria for what makes a good stress test candidate 17:15:55 maybe someone could help me with this task 17:16:02 so people can help you find them 17:16:14 mtreinish: yes would make sense to have such a list 17:16:25 I will prepare it and put it into the README 17:16:31 mkoderer: https://bugs.launchpad.net/tempest/+bug/1205344 17:16:33 mkoderer, mtreinish maybe the common create_get_delete tests we have around for the various resources could be a start? 17:16:34 Launchpad bug 1205344 in nova "mkfs error in test_stamp_pattern" [High,Confirmed] 17:16:55 giulivo: yeah those would probably work well 17:17:06 mkoderer: yeah that's probably a good place to put it 17:17:19 The tests cases which caused flaky issues are usually good candidates IMHO 17:17:20 and maybe start a thread on the ML about doing this 17:17:31 mkoderer: If we think this is important there also might be things that are close but must need a tweak to be able to be stress candidates. 17:17:52 dkranz: yes right 17:17:53 mkoderer: "just need a tweak" 17:18:14 so ok first step preparing a definition 17:18:32 and I am happy about any patch that introduces new stress tests :) 17:18:47 mkoderer: ok cool 17:18:50 good work on this 17:18:53 is there anything else? 17:19:04 don't think so 17:19:14 ok then lets go to the next topic 17:19:20 #topic neutron status 17:19:26 mlavalle: you're up 17:19:50 mtreinish: I've been updating progress in https://etherpad.openstack.org/gate-tempest-devstack-vm-quantum-full 17:20:47 #link https://etherpad.openstack.org/gate-tempest-devstack-vm-quantum-full 17:20:55 We have fixed several things. By next week I will compare what is in the eteherpad with the logs 17:20:55 ok that's got a lot of details 17:21:14 and will provide a summary for the team in this meeting 17:21:31 mlavalle: ok, cool 17:21:45 so we can asses how far we are from fixing that gate job 17:22:19 one other thing is that since everything else is in parallel now we probably won't push this voting until that works with neutron 17:22:24 even if the issues are all fixed 17:22:34 mtreinish: Yes 17:22:43 cool with me 17:23:05 I will aslso be pushing code this weekend for managing networks in the isolation code 17:23:10 so https://bugs.launchpad.net/tempest/+bug/1216076 should probably be priority 17:23:12 Launchpad bug 1216076 in tempest "Neutron jobs won't pass tempest running in parallel" [Critical,Confirmed] 17:23:33 mlavalle: ok, cool hopefully that will be all thats required for making neutron work in parallel 17:23:48 i'm crossing my fingers 17:23:54 ;-) 17:24:35 ok is there anything else about neutron that needs to be brought up? 17:24:41 afazekas: you had a bug earlier? 17:24:43 nope. I'm done 17:26:11 mtreinish: Next topic? 17:26:18 ok yeah 17:26:26 #topic Update on slow heat gating job 17:26:29 mtreinish: there is mysql side lock wait issue like issue which causes gate issues on single thread 17:26:30 dkranz: this one is yours 17:26:45 afazekas: Are you looking at that? 17:27:54 dkranz: I did not seen myself yet.. 17:28:06 afazekas: Is there anything to discuss now?> 17:28:30 dkranz: no 17:28:37 afazekas: ok 17:28:43 let's continue with heat 17:28:51 The slow heat status is that everything is in place. 17:29:11 THere is a job on the new "experimental" queue. 17:29:25 dkranz: ok cool 17:29:28 The current problem is that the autoscale test cannot run because there is a missing image needed. 17:29:49 dkranz: is that a devstack issue? or a devstack-gate problem? 17:29:54 sbaker and lifeless are working on getting that image into devstack 17:30:21 dkranz: ok, once that gets solved is the plan to add things to gate queue? 17:30:26 Once the tests run we can move them into the gate for all projects 17:30:41 dkranz: ok very cool 17:30:44 I would recommend non-voting at first :) 17:31:00 good work on this, it'll be nice to get coverage for another project in the gate 17:31:18 mtreinish: Yes. sbaker will let us know when we can proceed 17:31:29 mtreinish: I also put the job as experimental in devstack for debugging 17:31:36 That's it. 17:31:56 dkranz: ok so one quick thing how do we run things in the experimental queue? 17:32:09 because this is new as of this week right? 17:32:19 mtreinish: With a "check experimental" in a review comment. Yes this is new. 17:32:40 mtreinish: You can see the definition in layout.yaml in zuul 17:32:48 #info to run experimental jobs leave a "check experimental" in a review comment. 17:32:57 dkranz: ok cool thanks 17:33:06 then lets go to the next topic 17:33:10 #topic critical reviews 17:33:21 ok does anyone have any reviews that they would like to bring up 17:34:13 mtreinish: Just that there are a lot of patches with +2 that are waiting for approval. 17:34:14 tkammer? 17:34:20 yeah 17:34:39 dkranz: ok yeah, I've been a bit lax with the reviews because of all the parallel triage stuff 17:34:47 I'll take some time later today to go through those 17:34:56 mtreinish: I understand. We are light reviewers 17:35:14 dkranz: well just 1 this week :) 17:35:15 afazekas, are we talking about the auto config review yet? 17:35:19 mtreinish: And a bunch are waiting for non-red-hat 17:35:25 and I'm being furloughed next week so I'll be out 17:36:06 ok if there aren't any reviews lets go to the next topic 17:36:07 mtreinish: Sorry to hear that. I will be around next week but out on Thursday so will miss next meeting. 17:36:18 tkammer_: this is the part if you have review related question, you can ask it. 17:36:29 #topic Testing client library compatibility 17:36:37 dkranz: this is yours too 17:36:40 o.k then, I want to bring up the review of 17:36:47 tkammer_: Go ahead 17:36:53 https://review.openstack.org/#/c/42920/ 17:37:02 I wanted to ask a couple of things, but first 17:37:16 I want to know what is the prefered way of implementing this 17:37:16 tkammer_: you've got the whole next topic about this 17:37:26 mtreinish, o.k then, I'll wait :) 17:37:56 So the library issue is what I sent to the stable-branch ml and got no response. 17:38:12 We say that libraries are supposed to work with older releases but nothing tests that. 17:38:37 I proposed to add a gate job to each client lib that runs the last stable release but with master for client libs. 17:38:59 And runs just the scenario tests or anything else that uses client libs. 17:39:12 dkranz: that sounds like a good thing to have 17:39:21 I think that is straightforward technically but wanted input. 17:39:26 that should just be the cli tests and scenario at this point 17:39:32 mtreinish: Yes 17:39:45 dkranz: I'm all for it 17:40:02 mtreinish: I wanted to make sure there were no gotchas with this approach. 17:40:26 dkranz: there is probably something on the devstack side with the global requirements stuff in there 17:40:27 mtreinish: I don't know whether to interpret silence as consent or just no one wanted to respond. 17:40:54 mtreinish: I think it is really just running a stable branch job but pointing the ENV for clients to master. 17:41:08 dkranz, I think it's great, it'd be nice to tests latest clients with latest two releases maybe rather than just the latest 17:41:26 giulivo: Well, that doesn't really scale. 17:41:39 giulivo: I also proposed to add this test to the bitrot jobs for all older releases. 17:41:39 guitarzan: dkranz mordred has been owrking on something like that 17:41:49 dkranz: you might want to start a separate ML thread on dev and stable maint about this to get a wider pool of opinions 17:41:52 when he gets back next week you may want to see what he had in mind 17:42:12 clarkb: OK. I didn't get any reponse on my mail to the stable-maint list 17:42:20 clarkb: Didn't know mordred was out 17:42:33 dkranz, for all older releases doesn't scale I understand but my idea is that someone has got the havana clients and logs on a grizzly release 17:42:52 giulivo: Yes, that is what this would be testing 17:43:09 ok sorry for the bad wording than 17:43:52 dkranz: ok is there anything else on this topic? 17:43:57 mtreinish: Nope 17:44:10 ok then lets move on 17:44:15 #topic Devstack independent tempest usage 17:44:24 afazekas: this is listed as yours but its about tkammer_ patch 17:44:30 #link https://review.openstack.org/#/c/42920/ 17:44:44 so tkammer_ go ahead 17:44:50 mtreinish, thanks 17:45:10 I want to know what is the preferred way of implementing this, Python vs BASH 17:45:29 I currently implemented it as BASH since I thought to go as the devstack guys did 17:45:43 but if there is no objections, Python is an option as well 17:45:57 tkammer_: well normally we do things in python. We only use bash if there is a particular reason to do it in bash 17:46:16 it makes it easier to review I think (at least for me) 17:46:17 mtreinish: looks like someone copied it from the prev agenda :) 17:46:28 O.k, that is why I wanted to ask in advance prior for me developing this into something more complex 17:46:30 afazekas: oops :) 17:46:59 tkammer_: at least for me complex bash code can be harder to read 17:47:16 tkammer_: also using python will let you call the client libs directly 17:47:35 +1 for python 17:47:58 True, but than again, there is not much difference between calling the clients and calling the CLI :) just easier (for me) to parse the data later on :) 17:48:39 tkammer_: You might find some useful utilities or ideas from https://github.com/stackforge/anvil 17:48:51 tkammer_: A python version of devstack basically 17:49:04 tkammer_: the cli output really isn't the easiest to parse. (Look at the cli tests) But calling the libs just gives you a python object. 17:49:21 dkehn, interesting, thanks! 17:49:25 mtreinish, I agree 17:49:31 I prefer myself python 17:49:44 but was not sure what is the correct way to implement this :) 17:50:08 O.k, I will convert this into python and start working on a more complex initialization 17:50:10 tkammer_: Well there are a few votes for python and none for bash... 17:50:39 dkranz, I think that says it all :) 17:50:57 IMHO we should start with basic python version, extended it step by step.. 17:51:11 tkammer_: ok is there anything else on this topic? 17:51:37 mtreinish, not on this topic, but I would like to comment about my initial use of tempest 17:51:45 is it alright to raise it now? 17:51:48 or should I wait? 17:52:13 tkammer_: we've got another topic to go and <10min so can you hold it for after the meeting? 17:52:18 tkammer_: I think sending that to the qa list would be good value 17:52:33 tkammer_: And easier to type in an email :) 17:52:42 dkranz, mtreinish o.k, I will wait with it :) 17:52:51 ok then lets go to last topic on the agenda 17:52:55 #topic BP proposal: rework "skip test" functionaly 17:52:57 tkammer_: Or discuss in #openstack-qa 17:53:04 mkoderer: this is yours 17:53:15 mtreinish: Sorry, I have to run out. 17:53:21 dkranz: ok np 17:53:30 I remeber that we discussed about skipping interface one of the last meetings 17:53:33 mtreinish: Talk to you in two weeks 17:53:41 dkranz: yep cya 17:53:43 and I had a disussion about it with giulivo 17:54:01 #link https://review.openstack.org/#/c/43490/ 17:54:25 so would it make sense to add a blueprint to change all needed things in that area 17:54:46 mkoderer: that commit is just to make skip messages easier to parse for the skip_tracker 17:54:48 e.g. instead of duplicating code like "@testtools.skip('Skipped until the Bug #1170718 is resolved.')" 17:54:50 Launchpad bug 1170718 in nova "nova list --ip filter shows all instances" [Undecided,Fix released] https://launchpad.net/bugs/1170718 17:55:11 I would like to have a decorator @skip(bug=12345) 17:55:42 mkoderer, I think that would be nice and could also help the accounting of the skips 17:55:57 mkoderer: that's an interesting idea, but I think having the flexibility in the message is needed 17:56:12 maybe we should ensure bug= accepts a list as argument :P 17:56:33 mtreinish: the field "bug" could be optional 17:56:42 and we add a field "message" 17:56:48 I think we could find a way 17:57:11 so maybe I will brain storm a bit what we can do in that area 17:57:17 and put it on the ML? 17:57:27 Sooner or later we will have zillion of decorators, can we have one decorator, can we have single decorator on all test classes for centralizing decorator magic ? 17:57:35 mkoderer: I'm not opposed to this idea, I like it. But we've got a lot of other things on the bp list already 17:57:47 afazekas: I'm fine with having lots of decorators 17:57:56 we may want to centralize their definition though 17:58:00 like a utils file 17:58:04 What if we would have json or csv file with test_method name, and bug numbers for skipping ? 17:58:13 mkoderer: yeah starting a ML thread would be the right way to do it 17:58:29 and you can open up the bp too for that discussion as well 17:58:34 mtreinish: yes it's a question of prioritization 17:59:03 mtreinish: ok 17:59:03 just wanted to know if you are interested in general 17:59:17 mkoderer: everyone 17:59:27 'is always interested in improvments 17:59:37 it's just a matter of time to get to everything 17:59:40 :) ok - I will put on the ML 17:59:53 ok so we're basically out of time 17:59:58 #endmeeting