17:00:19 #startmeeting qa 17:00:20 Meeting started Thu Oct 10 17:00:19 2013 UTC and is due to finish in 60 minutes. The chair is sdague. Information about MeetBot at http://wiki.debian.org/MeetBot. 17:00:21 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 17:00:23 The meeting name has been set to 'qa' 17:00:27 ok, how's around for the QA meeting? 17:00:38 hi 17:00:44 o/ 17:00:44 hi 17:00:48 #link https://wiki.openstack.org/wiki/Meetings/QATeamMeeting#Proposed_Agenda_for_October_10_2013 17:01:24 ok, first off, I still need to do the old blueprint purge, a few other things got in the way of that with things like jury duty this week 17:01:38 #topic Blueprints (sdague) 17:01:53 so I'll work on doing that before next meeting 17:02:04 anyone else have blueprint updates to share? 17:02:23 hi 17:03:06 Here 17:03:55 hi 17:03:58 ok, moving on from bluprints 17:04:11 dkranz: why don't we jump to your topic 17:04:17 #topic Lot's of new negative tests. Do we want that? (dkranz) 17:04:23 sdague: ok 17:04:30 thats a good one 17:04:46 I am concerned that we have seen a lot of new negative tests. 17:05:01 Last time this happened it was because they are easy low-hanging fruit for new people 17:05:24 This is fine but we should have a clearer line between what it in tempest and what is in unit tests 17:05:39 I added a few "sub-topics" trying to discern which are good and how they should be tagged 17:06:04 yes for me there are some that are typical unit tests 17:06:14 In an ideal world we would have abstractions where we describe the call to be made and it could be done either as unit or tempest 17:06:32 And there was the idea of "fuzz" testing as well which never went anywhere 17:06:48 dkranz: I'm fine with negative tests in tempest as long as they're testing api responses. 17:06:51 I remember marun actually had this idea at the last summit of tests in the neutron tree that tempest could also run in a real env 17:07:03 *sigh* yes 17:07:07 but I don't think that got any progress 17:07:09 it goes back to the reason we have api tests in general 17:07:09 sdague: Yes, I have discussed it with others as well 17:07:12 I did a proof of concept in may but haven't gotten farther 17:07:24 I should probably post the proof of concept so others can take over 17:07:51 i'll make sure to have something to show by the summit 17:08:01 I think we would want some kind of yaml or something that describes the actual test data 17:08:30 marun: could you paste a link to that poc? 17:08:48 The review team could spend all day reviewing negative tests and it might be better to work on making this easier to create and review. 17:08:49 mkoderer: I never polished it for public consumption, I'm afraid. 17:08:58 marun: ok I see 17:09:06 marun you going to be in HK? 17:09:08 mkoderer: I'll have to dig it up and get it working again. 17:09:10 sdague: yes 17:09:22 I think this would be a great summit session especially if you had that poc ready 17:09:37 sdague: I think I already added it to proposed on the etherpad 17:09:42 dkranz: great 17:09:52 honestly, right now, I'm torn on negative testing 17:10:04 on the one hand, it's adding coverage 17:10:10 sdague: Well, I think we should have it, but it needs to be easier to write and review 17:10:17 but it definitely could be done close to the projects 17:10:39 sdague: The projects could consider tempest to be "closer" than they currently do 17:10:49 sdague: Some are actually making movements to be closer 17:10:50 dkranz: so in the short term, I'd propose that we continue to let folks propose it 17:11:03 but that we ensure all the negative tests are in seperate files 17:11:12 so it would be easier to purge out later 17:11:18 sdague: That seems reasonable 17:11:29 and they should use the "negative" attr type 17:11:30 then we can make final decisions at summit 17:11:33 mkoderer: yep 17:11:36 sdague: sounds like another hacking file update 17:11:40 sdague: But we should also suggest that no one concentrate on that 17:11:40 people have been pretty good about it 17:11:51 dkranz: sure 17:11:58 ok, volunteers for the hacking update? 17:12:02 Once they are up to speed on tempest they should focus on higher-value tests 17:12:18 several negative test are not just simple malformed request, or a reference to a non existing resource 17:12:31 yeah I still have concerns about that, for instance 17:12:39 sdague: I guess I'll do it since I brought it up 17:12:41 - some are "validating strings", like passing long strings or checking for invalid vs. non existent uuids 17:12:47 shall we accept both tests? 17:12:51 #action mtreinish to update hacking 17:12:58 - some are "traversing" the components, like ensuring we can't attach resources to non existent servers 17:13:03 giulivo: Those cases are malformed requests 17:13:04 shall we accept such a kind of negative? 17:13:05 #action mtreinish to update hacking to describe negative tests in dedicated files 17:13:25 giulivo: yes that a point.. I saw all of them in one review 17:13:31 sorry dkranz why malformed? 17:13:43 giulivo: string too long 17:13:47 giulivo: negative is typically more fuzz testing of the API, throwing it garbage 17:14:19 afazekas: do you have a good more complicated example? 17:14:22 so if I get it right, non of those is acceptable 17:14:23 I'm fine with those if we don't have to write them long-hand 17:14:30 and review them 17:14:46 Do we want to remove the 'negative' flag from test cases which not just simple fuzz ? Like the auth related ones ? 17:14:48 dkranz: right, so this is about some kind of automation on fuzzing 17:14:57 afazekas: code example of that? 17:15:00 https://review.openstack.org/#/c/50217/5 17:15:04 maybe this one? 17:15:23 sdague: Yes but it is two parts: declarative test spec, and parameterized randomization 17:15:39 sdague: Not all cases have to be randomly generated 17:15:40 https://review.openstack.org/#/c/50122/ rejected 17:16:03 afazekas: That was rejected for bad spelling, not content 17:16:11 owner_accept 17:16:29 test_image_share_v2_owner_accept 17:17:00 dkranz: the 'rejected' test is not 'negative' sorry 17:17:02 dkranz: ok, so that sounds like actually another good topic, and slightly different from marun's one 17:17:28 sdague: Yes. A simple negative test could require setup that is non-trivial 17:17:34 yep 17:18:03 So for now perhaps we can just say that we will accept negative tests but not make them a priority 17:18:14 ANd come up with a better plan at the summit 17:18:30 good plan 17:18:38 dkranz: yes. and already we have good coverage 17:18:46 we should create a blueprint after the summit 17:19:08 mkoderer: did you get your travel approval for hk? 17:19:13 It would be good to have a ml discussion about this prior to the summit to make it more effective 17:19:16 This is a big topic 17:19:20 https://github.com/openstack/tempest/blob/master/tempest/api/compute/test_authorization.py looks like these does not have the negative flag 17:19:31 dkranz: yes, agreed 17:19:38 dkranz: you want to kick that thread off? 17:19:49 sdague: OK. I will discuss it with marun 17:19:53 mtreinish: most probably yes 17:19:54 great 17:20:25 #action dkranz to kick off the thread on openstack-dev list about approaches on negative and shared testing with projects 17:21:06 sdague: Were you thinking tagged with [qa] or not? 17:21:36 sdague: I was going to tag it 17:21:37 https://github.com/openstack/tempest/blob/master/tempest/api/compute/admin/test_flavors_access.py 17:21:48 yeh, it should be tagged 17:22:16 sdague: I think that's it for that topic 17:23:19 ok, great 17:23:37 mtreinish: how about you next 17:23:41 #topic Elastic Recheck Top issues (mtreinish) 17:23:53 oh, this is a weekly thing now 17:24:01 #link http://status.openstack.org/elastic-recheck/ 17:24:10 so we're actually doing really good since er went live 17:24:20 most of the big bugs have been fixed 17:24:27 most of the remaining issues are on the neutron side 17:24:48 there are some new ones popping up that we need to classify still 17:25:13 yeh, ER is already a pretty awesome tool 17:25:15 so if you find a nondeterministic failure and can identify it with a logstash query please submit a review to add it to the query list 17:26:00 #link https://github.com/openstack-infra/elastic-recheck 17:26:07 for people that haven't seen the code yet 17:26:37 Very cool 17:26:48 great work 17:26:54 ok, lets talk about a related issue, dkranz on whitelisting errors 17:27:06 #topic Rollout plan for failing builds that pass tempest but with spurious log ERRORs (dkranz) 17:27:16 sdague: So the patch is up https://review.openstack.org/#/c/50795/ 17:27:46 Many of the non-neutron runs are showing no non-whitelisted errors 17:27:59 I think the next steps are to 17:28:17 1. Use elastic recheck to track non-deterministic bogus error messages 17:28:26 2. Communicate all this to the dev list 17:28:35 3. Start failing builds 17:28:56 4. Build some kind of momentum to actually get people fixing the bugs 17:28:57 that sounds pretty reasonable 17:29:19 I have looked at a lot of errors in the past day and many of them look like real bugs to me 17:29:24 Not just bogus errors 17:29:28 dkranz: for step 1 you'll need to open bugs for all the non-deterministic bogus error messages you hit 17:29:34 But on one pays attention to the logs when a build succeeds 17:30:01 mtreinish: I was thinking of doing it in a slightly different way 17:30:07 dkranz: so something like http://status.openstack.org/elastic-recheck/ for showing the occurance of these might be helpful 17:30:27 Based on what appears in the tempest log 17:30:28 just those graphs seemed to make a difference to people in understanding how bad the problems were, and being able to see them go away 17:30:34 sdague: Agreed 17:31:10 So please comment on that patch when you get a chance 17:32:07 sdague: Ideally each group would have some one paying attention to their log errors 17:32:46 The ignoring of these errors is a huge weakness in the tempest methodology 17:33:02 yeh, but sun light will help 17:33:05 A test can pass but the system is screwed up afterword 17:33:10 sdague: +1 17:33:24 ok, I actually got to run away, dkranz can you take over the rest of the meeting? 17:33:33 sdague: Sure 17:33:37 sdague: Just a sec 17:34:07 We done with this? 17:34:20 I think so 17:34:23 #topic Stable Branch Timing 17:34:56 What happened to the topic bot? 17:35:00 dkranz: I think you need to be sdague to do that 17:35:07 since he started the meeting 17:35:14 Oh well. 17:35:19 ANyway, next topic 17:35:33 I had proposed branching as late as possible 17:35:44 Any other theories? 17:35:56 dkranz: I'm fine with that 17:36:16 mtreinish: So when would that be? Release day? 17:36:20 I don't think we should do it the same time as the other projects 17:36:27 so next week 17:36:36 mtreinish: RIght, haven't some already done it? 17:36:53 they've rc'd but I don't think anything is done yet 17:37:19 mtreinish: OK, so the release day is the 17th 17:37:24 I think branching it as late as possible is the easiest way 17:37:29 We could just do it then 17:37:52 do you want to do a milestone proposed branch or just go straight to a havana branch? 17:38:12 mtreinish, I was going to propose that, a milestone-proposed branch 17:38:30 but I'd also branch the same day as other projects then, why wait one week more? 17:38:31 mtreinish: Well, if we are waiting until release day I'm not sure why we need milestone-proposed 17:38:48 yeah that's how I feel too 17:38:59 dkehn, my idea is that people wanting to propose stuff for the stable branch will have to use milestone-proposed 17:39:05 giulivo: Because we are still writing havana tests they will have to be backported once we branch 17:39:05 dkranz, ^^ 17:39:06 ttx: do you have any thoughts on this topic? 17:40:15 dkranz, yeah to solve that very same issue, I'd use a milestone-proposed branch but branch the 17th 17:40:35 (branch our stable, on the 17th) 17:40:55 giulivo: there's no difference at that point from just doing a stable branch 17:41:01 giulivo: I don't see how that changes anything 17:41:14 we did a milestone proposed last cycle because we broke something for the rcs when we added a test (I think) 17:41:15 giulivo: The back port rule is our policy 17:41:15 so I think it's important branching the same day as other projects 17:41:28 and the backport rule remains in place for the future additions 17:41:30 giulivo: We can't because they don't all do it on the same day 17:42:25 but tests submitted closer to the branch date, if relevant for the release, can be posted to the milestone and avoid being skipped 17:42:40 I think there is little risk. The 17th is one week from now 17:42:47 dkranz: lets just say we'll cut the stable/havana right before next week's meeting 17:42:54 mtreinish: Works for me 17:43:43 mtreinish: I can't remember who actually does that. Infra? 17:43:43 #info stable/havana branch to be cut right before next meeting 17:43:51 dkranz: sdague will know 17:43:59 OK, any more comments on that? 17:44:54 #topic Design Summit Initial Planning 17:45:17 https://etherpad.openstack.org/icehouse-qa-session-planning 17:45:46 So we have 7 proposals and 13 slots 17:46:04 We don't have to use all of them, but please add to the etherpad if you have ideas. 17:46:34 dranz: is this the place to add under proposed ? 17:46:55 ravikumar_hp: Yes, the etherpad I pasted above 17:47:06 http://summit.openstack.org/ ? 17:47:22 no use the etherpad and put it under "Proposed" 17:47:29 ok 17:47:39 ravikumar_hp: I think the plan was to do it in the etherpad because it is better for communication 17:48:16 ravikumar_hp: Of course any one who is not participating in these meetings can propose a session on summit.openstack.org 17:48:46 ravikumar_hp: That site is a clumsy tool for collaboration 17:49:09 Any other comments on this topic? 17:49:20 btw it's my first summit, so if someone want to give me some word how a talk should look like 17:49:24 dkranz: thanks. i got it 17:49:27 would be great 17:49:30 mkoderer: No talks! 17:49:48 I mean I need to prepare something right? 17:49:51 mkoderer: The summit sessions are discussions. YOu can have a few intro slides, but that's it 17:50:08 dkranz: ok just a few slides good 17:50:13 mkoderer: And try to have enough discussion beforehand so it can be effective 17:50:25 dkranz: ok cool 17:50:26 mkoderer: just having an etherpad with an outline of discussion points is normally all that is done up front 17:50:57 ok fine 17:51:32 OK, any critical reviews before open discussion? 17:51:44 Other than my error log check :) 17:51:56 dkranz: aren't we going to talk about Neutron? I see it in the agenda 17:52:18 mlavalle: Yes! That had sean's name but please tell 17:52:49 dkranz: I've been working on this bug https://bugs.launchpad.net/swift/+bug/1224001 17:52:52 Launchpad bug 1224001 in neutron "test_network_basic_ops fails waiting for network to become available" [Critical,Fix released] 17:53:02 mlavalle: I saw a number of things in the neutron log files after succesful runs that looked like real bugs 17:53:21 dkranz: I've been doing this with marun and salv-orlando 17:53:25 dkranz: for reviews both: https://review.openstack.org/#/c/49559/ and https://review.openstack.org/#/c/49562/ 17:53:39 to split out the unit tests from the jenkins runs 17:54:20 mtreinish: doesn't everything run in jenkins? 17:54:38 dkranz: salv-orlando proposed a fix to neutron that should help with test_network_basic_ops fails 17:54:49 dkranz: I should have worded it more clearly. make the unit tests a separate jenkins job only for tempest 17:54:57 mtreinish: Oh, ok 17:55:00 instead of running it with everything else 17:55:23 mlavalle: That's great 17:55:37 we definitely need to address errors in the log on otherwise successful runs 17:55:37 mlavalle: looking at the elastic-recheck graph 1224001 is still happening 17:55:40 dkranz: I'll test tonight or tomorrow 17:55:52 it's confusing things when we have failures - hard to see the forest for the trees. 17:56:01 or do you think it's just an overzealous query 17:56:29 mtreinish: I don't know. I need to look at it carefully 17:57:04 mlavalle: http://status.openstack.org/elastic-recheck/ 17:57:08 mlavalle: ok 17:57:15 i've been seeing failures that gerrit attributes to 124001 that are not due to a neutron test failure. 17:57:20 mtreinish: thanks i'll look at it 17:57:21 We only have a few minutes left 17:57:33 dkranz: thanks 17:57:42 marun: it looks for 'tempest.scenario.test_network_basic_ops AssertionError: Timed out waiting for' in the test output 17:57:50 mlavalle, marun : anything else on neutron? 17:57:59 dkranz: i'm done 17:58:03 i'm done 17:58:14 mtreinish: i'll follow up offline about the bug detection 17:58:20 marun: ok 17:58:22 ok, anything else from any one? We had a full agenda today. 17:58:46 dkranz: can we renamed the Duplicate exception to Conflict ? 17:58:59 dkranz: can we rename the Duplicate exception to Conflict ? 17:59:28 let's discuss it after the meeting? 17:59:29 as 409 depicts Conflict 17:59:40 Yes, we are out of time. 17:59:45 sure 17:59:53 #endmeeting 18:00:13 fungi, clarkb, jeblair: ^^^ 18:00:13 mtreinish: Can sdague end the meeting now or is he really gone? 18:00:22 sdague is away how do we end the meeting 18:00:28 there is a timeout in meetbot 18:00:38 it is 60 minutes, so anyone can end it after that timeout 18:00:50 try it again 18:00:57 #endmeeting