13:59:50 #startmeeting glance 13:59:51 Meeting started Thu Jan 31 13:59:50 2019 UTC and is due to finish in 60 minutes. The chair is jokke_. Information about MeetBot at http://wiki.debian.org/MeetBot. 13:59:53 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 13:59:55 The meeting name has been set to 'glance' 13:59:57 #topic roll-call 13:59:59 o/ 14:00:40 #link https://etherpad.openstack.org/p/glance-team-meeting-agenda 14:01:16 o/ 14:01:23 o/ 14:01:47 o/ 14:02:26 Nice we have quorum 14:02:33 #topic updates 14:03:44 just a quick one. We did release b1 and as the rosmaita's test is merged in the master please note that the check and gate will fail until we get the alembic migrations version change in 14:04:08 (see links in open discussion on agenda) 14:04:32 So please do not spam recheck and load the gate for no reason, it will be solved once the change gets (hopefully) through gate 14:04:40 .o. 14:04:49 just got to the gate now, survived the initial check! 14:05:03 got kicked out last night, post failure and a timeout 14:05:07 yeap 14:05:31 so that's all from for now 14:05:39 #topic release updates 14:05:43 abhishekk: floor is yours 14:05:50 ack 14:06:05 subunit.parser error is back 14:06:11 arrrrgh! 14:06:28 periodic jobs for functional py3 are failing intermittently 14:06:46 jokke_, just covered release updates 14:06:53 Was this something we thought we had fixed or is this something that jusappeared for a while? 14:07:08 just disappeared 14:07:13 more about failure in point 3 added by rosmaita 14:07:24 jokke_, looks like that 14:07:31 yeah, it just stopped for a while 14:07:34 kk, lets hop on that 14:07:43 #topic subunit parser 14:08:07 yeah, so abhishekk answered my question 14:08:12 which was, is it back? 14:08:16 and the answer is yes 14:08:36 abhishekk has a patch up based on the neutron one he mentioned 14:08:44 abhishek's patch: 14:08:55 #link https://review.openstack.org/634235 14:09:08 i was looking at the cinder bug about this that smcginnis worked on 14:09:19 #link https://bugs.launchpad.net/cinder/+bug/1728640 14:09:20 Launchpad bug 1728640 in Cinder "py35 unit test subunit.parser failures" [Critical,Fix released] - Assigned to Sean McGinnis (sean-mcginnis) 14:09:21 it is very hard to reproduce, I have ran entire test suit by removing 1 by 1 potential failing test case but not able to reproduce 14:09:37 yeah, i don't know why it only seems to happen in the gate 14:09:46 It's very random. 14:10:03 And even though that is marked as "fixed", I just consider that a workaround. 14:10:12 that's what i was going to ask 14:10:28 but as long as it decreases the frequency, that would be better, i guess 14:10:32 There was something with the switch from testr to stestr that introduced this with python 3. 14:11:24 Treinish did a lot of debugging and tracing around the Sydney Summit for this but were never able to track down the root cause. 14:11:33 smcginnis: yeah I can't recall seeing these while we were still using testr 14:12:21 smcginnis: you mentioned on the bug that the changing the log level in the setup doesn't always help in the functional tests, something about the tests calling out to other stuff that winds up changing the log level? 14:12:24 I was banging my head against wall for last two days 14:13:11 because this thing is so random, i don't know how we can test abhishekk 14:13:13 's patch 14:13:29 maybe i can include it on all of my patches, i seem to hit it a lot! 14:13:46 rosmaita, no idea 14:13:57 and I am also not sure it will work or not 14:14:16 I'm wondering if it's resource starvation on gate 14:14:37 rosmaita: It's fuzzy now, but I did fix one thing where we were setting the log level higher but the code was going back and lowering it again. 14:14:52 ok 14:14:56 because I have just checked in zuul and found that py3-functional job is stalled and it will timed out later 14:14:57 which would explain why we see it in gate but locally when we have nuf power we never see it 14:15:00 The only "fix" seems to be really restricting how much log output happens during test runs. 14:15:55 abhishekk: that is what i'm seeing a lot, too -- subunit parser error shows up in output, a bunch of tests are listed as '...inprogress', and then eventually failure 14:16:22 yes, that's the case 14:16:59 i'm trying to think of a way we can run *only* func-py35 test in gate, but i don't think that's possible 14:17:10 we only control the jobs in our local zuul 14:17:20 the other templates are out of our control 14:18:03 i'd like to run abhishekk's change like 50x to see what happens, but that's going to run a lot of jobs 14:18:10 rosmaita: we should be able to drop the majority on test patch if we just add "*" to the irrelevant files 14:18:51 actually, maybe change tox.ini 14:18:58 on abhishekk's patch 14:19:06 rosmaita: on the same patch you want to try out. Just mark it DNM ;) 14:20:19 I guess we can modify yaml file and keep functional-py35 job only? 14:20:54 abhishekk: yes 14:21:00 i think in the project section 14:21:08 yes 14:21:20 abhishekk: that's what I meant. Just add "*" to all jobs there but the py3 job 14:21:37 that should drop the patch from running any of the other tests 14:22:02 i think you have to modify what templates are called too though 14:22:14 we can't define the irrelevant files for them only 14:22:28 anyway, key thing is if we can not have tempest or grenade run 14:22:36 those are the ones that are taking forever 14:22:56 so looks like jokke_ is right 14:23:12 we can '*' for tempest-full and neutron-grenade 14:23:30 do we need to take a look of the code of stestr to identify the root cause? 14:23:54 LiangFang: if someone has time 14:24:12 yes 14:24:13 mtreinish is the author, and he and smcginnis were trying to figure it out with no luck 14:24:44 LiangFang: if you have time to dig into that, I'm sure there would be lots of happy people also outside of glance ;) 14:24:51 ++ 14:24:55 +1 14:25:01 If someone can, my hunch was there was some string or stream handling that was getting encoded differently or something under python 3. 14:25:27 I will take a look tomorrow 14:25:43 but china new year is coming 14:25:52 so i think what we do now is abhishekk updates his patch to exclude tempest and grenade 14:25:57 I will take vacation for one week 14:26:14 and puts up another with the .zuul.yaml change but without his fixture change 14:26:22 and we run them both a bunch of times 14:26:33 should see way more errors in the "unfixed" patch 14:26:43 rosmaita, ok 14:26:49 rosmaita: sounds like a plan to me 14:26:53 i can help 14:27:06 because the problem is that you can't recheck a successful job 14:27:17 have to make some kind of change 14:27:22 You can abandon and restore. 14:27:28 ah, ok 14:27:33 or just poke some comment string 14:27:41 change a character 14:27:56 i like abandon and restore, just click some buttons! 14:28:01 :D 14:28:03 ok, sounds good 14:28:07 playing a videogame 14:28:17 Nice, we have a plan! 14:28:27 so I need to update current patch to remove temepst and grenade jobs 14:28:33 thanks abhishekk!! 14:28:54 and another patch without logging change and just functional py35 jobs 14:29:07 right 14:29:32 abhishekk: yeah, just add the "*" to irrelevant files on basically every job but the py3 job that keeps failing 14:29:50 that way we spawn minimal amount of jobs on the gate trying to find the reson for this 14:30:08 jokke_, ok 14:31:01 And please remember to remove those before we merge it if it helps :P 14:31:10 :D 14:31:29 hopefully, the reviewers will be paying attention! 14:31:39 only for check jobs right? 14:31:53 otherwise I will go to revert frenzy and pull out everything that was merged after when I find out :P 14:31:57 yeah, shouldn't hit the gate unless someone approves it 14:32:01 abhishekk: yeap 14:32:04 I will set WF to -1 to be safer side 14:32:17 ++ 14:32:31 ok, lets move on 14:32:45 #topic configuration test job 14:32:50 this should be quick 14:33:15 we have this new test that will break when glance version and the db migration CURRENT_VERSION dont match up 14:33:34 jokke_ pointed out that we don't want to run that locally, just in the gate 14:33:45 so my question is about separating it out 14:33:58 should i move it in the tree to like glance/tests/config/ or somethinig? 14:34:12 or just blacklist it in the normal tox definitions? 14:34:23 i'm sort of leaning toward moving it 14:34:35 but if there aren't any other tests like this, seems kind of dumb 14:34:35 yeah I like the idea of moving it 14:34:41 ok, that's all then 14:34:52 i will put up a patch soon-ish 14:34:56 that will make it possible to still run it locally, it just won't run by default 14:35:13 just needs to be done before stein release 14:35:39 ok, thanks, next topic 14:36:04 should we call the folder /glance/tests/gate ? that way if we in future want to move something else there as well, we do not need to keep doing new floders under our tests tree? 14:36:18 that sounds good 14:36:43 gr8, unless there is objections, now we can move on :) 14:36:56 action me! 14:37:40 #action rosmaita Move the alembic migration test under /glance/tests/gate and make new job definition for it that is not ran locally by default 14:37:44 happy with that? 14:37:45 ty 14:38:05 then i won't have to hunt for the name when i forget :) 14:38:09 #topic open discussion 14:38:10 :D 14:38:32 just wanted to point out that i updated the meeting info like we discussed last week 14:38:57 Quick one, I'll be travelling Fri & Mon (FOSDEM time!) so won't be too reachable before Tuesday 14:39:23 telegram is your best bet if you need me 14:39:33 also, jokke_ already mentioned holding off on rechecks until https://review.openstack.org/#/c/632185/1 merges 14:39:43 rosmaita: thanks, didn't realize that there was chair section on it as well :D 14:40:09 yeah, noticed that when i was looking for the eavesdrop link 14:40:27 abhishekk: you too, i put you as chair of the bug squad 14:40:35 rosmaita, ack 14:40:40 good good 14:40:52 you can change the time if you like, just put up a patch to infra like this one 14:41:03 #link https://review.openstack.org/#/c/633824/ 14:41:42 that's all i have, execpt i will miss next week's meeting 14:41:55 it will be quiet without me! 14:42:02 haha! 14:42:04 it will 14:42:18 * smcginnis will be buying rosmaita a drink (probably coffee at this time) 14:42:37 * rosmaita will be drinking an appropriate beverage 14:42:45 oh you guys will be meeting up? 14:42:53 cinder midcycle 14:43:25 something is wrong with my patch, I have removed a;ll the jobs except functional-py35 but still they are running 14:43:27 nice, so we can expect smcginnis not be around either 14:43:37 is something wrong I have done? 14:44:01 I should be online, but hopefully things go well with the midcycle and I am distracted by good discussions there. 14:44:09 abhishekk: did you remove some of the templates? 14:44:22 rosmaita, nope 14:44:23 i think some jobs run whether we want them to or not 14:44:34 just removed the jobs from check pipeline 14:44:37 abhishekk: yes, don't remove them, add "*" line on to the irrelevant-files section on each of the check jobs in .zuul.yaml 14:44:47 it has removed non-voting job though 14:44:56 jokke_, ok will do that 14:45:15 it may take some experimentation 14:45:29 you may want to ask in #openstack-infra for the best way to do this 14:45:33 that should bypass the template runs as zuul goes like "OK, nothing relevant has been changed in the repo" :P 14:45:35 they should be glad to help! 14:46:29 jokke_: i think because of our project status, there are some jobs that run that we have no control over and cannot override 14:47:00 rosmaita: yes, but they still honour the irrelevant-files override, that's why I said to use that ;) 14:47:37 i'm not sure about that, otherwise everyone could use '*' to turn off jobs 14:47:42 yup 14:48:02 anyway, ping AJaeger or fungi or cboylan in infra 14:48:04 I've been seriously tempted few times 14:48:24 jokke_: i think everyone has! i think that's why you can't do it for all the jobs! 14:48:48 rosmaita: and this might reveal the loophole which might end up to a patch in zuul :P 14:48:59 but for now it should work 14:49:35 abhishekk: it's late where you are, you can hold off for tomorrow 14:49:56 just shoot an email when you have the patches up, and i will abandon/restore all day tomorrow 14:49:56 rosmaita, tomorrow I am on leave 14:50:01 oops 14:50:14 i am too, actually 14:50:18 :) 14:50:18 abhishekk: I think the whole line should look like "- ^*$" 14:50:23 so I will do it and then take off 14:50:32 haha ... we're all out for tomorrow then :D 14:50:43 sounds good, i wlll monitor for the rest of today 14:51:16 ok, we can continue this on the #os-glance, lets end the meeting unless someone has something else for us 14:51:16 i think it must be the red hat pto expires on feb 15 situation 14:51:46 nothing from me, have a good week everyone! 14:52:08 Thanks all! 14:52:26 thank you all 14:52:33 bye 14:52:50 Oh LiangFang Have a very happy new year! 14:53:04 #endmeeting