Thursday, 2020-04-30

*** Goneri has quit IRC00:14
*** jmasud has joined #oooq03:48
*** jmasud has quit IRC04:06
*** ykarel|away is now known as ykarel04:15
*** ratailor has joined #oooq04:20
*** jmasud has joined #oooq04:40
*** skramaja has joined #oooq05:11
*** jmasud has quit IRC05:14
*** jmasud has joined #oooq05:16
*** jfrancoa has joined #oooq05:22
*** chem has quit IRC05:23
*** chem has joined #oooq05:24
*** udesale has joined #oooq05:41
*** ysandeep|away is now known as ysandeep05:42
*** marios has joined #oooq05:57
akahat|ruckchandankumar, o/06:10
*** jmasud has quit IRC06:16
*** jmasud has joined #oooq06:18
*** bogdando has joined #oooq06:27
*** jmasud has quit IRC06:57
*** jmasud has joined #oooq06:59
*** amoralej|off is now known as amoralej07:20
mariospanda|ruck: akahat|ruck: o/ hey folks rocky/queens gate borked?both blocking there TripleO CI Scrum07:23
mariosThursday, 30 April⋅16:00 – 17:0007:23
mariosWeekly on Monday, Thursday, until 31 Dec 202007:23
mariosmeet.google.com/oiv-geho-mai07:23
mariospanda|ruck: akahat|ruck: sorry, wrong paste ;) there https://review.opendev.org/#/c/722562/07:24
akahat|ruckmarios, hello.07:26
akahat|ruckmarios, yes few jobs are still broken07:26
akahat|ruckBZ is already there for it:https://bugs.launchpad.net/tripleo/+bug/187583307:27
openstackLaunchpad bug 1875833 in tripleo "The WebSocket timed out before the Workflow completed in rocky/stain jobs" [Critical,New] - Assigned to amolkahat (amolkahat)07:27
*** tosky has joined #oooq07:28
mariosthanks akahat|ruck07:30
*** matbu has quit IRC07:39
*** matbu has joined #oooq07:46
ykarelakahat|ruck, i have triggered openstack-periodic-wednesday-weekend pipeline as puppet-pacemaker is available in consistent repo07:46
ykareli see rocky job failed in -latest-released pipeline, didn't checked the reason07:47
akahat|ruckykarel, great. :)07:47
ykarelcan u check, if it's real failure07:47
ykarelif not a real failure, good to run a testproject change with rocky periodic jobs07:48
akahat|ruckykarel, could you please pass the link07:48
akahat|ruck?07:48
ykarelhttps://review.rdoproject.org/zuul/status07:48
*** jmasud has quit IRC07:48
*** jmasud has joined #oooq07:49
akahat|ruckykarel, thanks :)07:50
*** ratailor is now known as ratailor|lunch07:50
*** jpena|off is now known as jpena07:57
zbrmarios: i guess i could increase my productivity by starting the day very early, zuul queues are decent till ~noon GTM.07:58
zbri managed to find an older CR that would make install-deps use venv under py3, which avoids virtualenv bugs: https://review.opendev.org/#/c/707215/808:02
zbrany reasons for not doing it? it would have saved us a good number of man-work-days with recent virtualenv issues08:03
marioszbr: since venv is the 'standard' way for py3 ... personally don't object to that... i see you have included a test so it will fall back to virtualenv if not available08:06
zbrin fact the implementation is doing the opposite08:07
zbrmarios: haha! you did not read the code. that is what it does.08:07
marioszbr:            if ! $(python_cmd) -m venv ${OPTS:-} ${OPT_WORKDIR}; then virtuaelenv...08:07
marioszbr: i see you have included a test so it will fall back to virtualenv if not available08:08
marioszbr: ? don't forr what you mean by 11:07 < zbr> marios: haha! you did not read the code. that is what it does.08:08
*** ysandeep is now known as ysandeep|lunch08:08
zbrmarios: ahh, i was the one not reading carefully the ms :p08:09
marioszbr: ack08:09
zbrif that code would not work, we would se a lot of red from builds.08:10
*** ccamacho has joined #oooq08:12
*** jbadiapa has joined #oooq08:14
*** ratailor|lunch is now known as ratailor08:30
*** ykarel is now known as ykarel|lunch08:39
*** yolanda has joined #oooq08:41
*** apetrich has joined #oooq08:59
chemhey, so I'm in pain with https://review.opendev.org/#/c/721292/18 the molecule testing (docker) and role addition check are consistently failing08:59
chemakahat|ruck: ^08:59
chemakahat|ruck: hey, what are my options here ?08:59
chemakahat|ruck: can we move them out of the way https://zuul.opendev.org/t/openstack/builds?job_name=tripleo-ansible-centos-8-role-addition09:01
chemand that https://zuul.opendev.org/t/openstack/builds?job_name=tripleo-ansible-centos-8-molecule-tripleo_redhat_enforce#09:02
*** ysandeep|lunch is now known as ysandeep09:04
zbrarxcruz: what is happening with https://review.opendev.org/#/c/681211/ ?09:06
zbrweshay|ruck: is https://review.opendev.org/#/c/712105/7 still needed?09:11
zbrmarios: https://review.opendev.org/#/c/708380/ still need to be open or we can abandon?09:16
zbrnever mind, looked at it and decided to abandon, i am sure you will agree.09:17
akahat|ruckchem, looking.09:18
chemakahat|ruck: thanks09:18
marioszbr: ack thanks agree09:20
akahat|ruckmarios, zbr do you have an idea what's going on with tox ^^09:22
*** dtantsur|afk is now known as dtantsur09:23
zbrakahat|ruck: i do not see any link to a tox job09:24
*** derekh has joined #oooq09:25
mariosakahat|ruck: i don't but looking09:28
marioszbr: in there https://review.opendev.org/#/c/721292/1809:28
mariosmodule 'setuptools.build_meta' has no attribute '__legacy__09:29
zbrthat has nothing to do with tox09:31
mariosakahat|ruck: zbr: maybe that https://github.com/pypa/setuptools/issues/1694#issuecomment-466023567   and https://github.com/cloudnull/ansible-tripleo_sdk/commit/0bdafbd21118540e970b74a34adb282feb11002d09:31
zbrit is a pip/setuptools issue09:31
zbrLMGFY sends me to https://github.com/pypa/setuptools/issues/169409:33
zbri am inclined to believe the error has nothing to do with the tested change09:34
mariosakahat|ruck: so if its blocking us then file a new bug for starters... looks like that change from cloudnull can provide a hint for the fix cc panda|ruck09:36
akahat|ruckmarios, yes it's blocking us09:36
akahat|ruckchem, ^^ that might help.09:39
akahat|ruckmarios, zbr thanks.. I'll talk to cloudnull. :)09:39
akahat|ruckmarios, cloudnull is he online now?09:40
zbrakahat|ruck: i bet the reason is sitepackages = True09:41
chemakahat|ruck: not sure what could help, should I fill the lp ?09:41
akahat|ruckchem, it's blocking us from merging then yes.. this will help.09:43
akahat|ruckzbr, did not got your point. :(09:46
zbrhttps://review.opendev.org/#/c/724627/09:48
akahat|ruckzbr, oh.. got it. :)09:50
arxcruzzbr: nothing, it just not priority right now09:52
zbrarxcruz: mark it somehow that it does not need attention, -W or a "wip" prefix.09:53
marioszbr: arxcruz: chem: i think we just need to disable pep517?09:53
marioszbr: --no-use-pep517 but i can't find the right pip task to add it to09:53
mariosfor the fix i mean.09:53
marioswell 'fix' at least workaround09:53
zbrmarios: hah, pep517 is not something you can disable09:53
marioszbr: well apparently there is a --no-use-pep517 for pip extra_args09:54
zbror remove it, the future is that everyone will switch to pep517, and drop support for old setup.py09:54
zbris not something you can afford to ignore09:55
marioszbr: well you can see here it is where the error is coming from09:55
marioszbr: 2020-04-30 03:41:04.887579 | centos-8 |     File "/home/zuul/src/opendev.org/openstack/tripleo-ansible/.tox/role-addition/lib/python3.6/site-packages/pip/_vendor/pep517/_in_process.py", line 76, in _build_backend09:55
marios2020-04-30 03:41:04.887592 | centos-8 |   AttributeError: module 'setuptools.build_meta' has no attribute '__legacy__'09:55
marioshttps://f90bbe801532b1321206-1f2089e8a25dfa4acbcd47153d041690.ssl.cf2.rackcdn.com/721292/18/check/tripleo-ansible-centos-8-role-addition/f39b625/job-output.txt09:55
mariosakahat|ruck: did you file a LP ?09:55
zbrplease file a bug and add me to it, if that comes from one of our repos/packages i could help, i happen to know the pep517 subject09:57
chemakahat|ruck: marios https://bugs.launchpad.net/tripleo/+bug/187607309:57
openstackLaunchpad bug 1876073 in tripleo "Zuul CI is giving false positive on role-addition and molecule consistently " [Critical,New]09:57
mariosthanks chem ;)09:57
akahat|ruckchem, thank you :)09:57
akahat|ruckpanda|ruck, ^^09:58
chemakahat|ruck: marios it's only basic reporting, I never run molecule and everything in that job is from the default templates09:58
chemakahat|ruck: marios beside the 15line of ansible I needed09:58
chemakahat|ruck: marios On my defense, docker doensn't run out of the box on fedora 31 ... podman is  coming :)09:59
marioschem: akahat|ruck: panda|ruck: adding alert and critical to that one10:00
zbrit works even on fedora 32, but with some hacking needed, any redhat OS after centos-7 needed hacks to make it work.10:00
marioszbr: akahat|ruck: panda|ruck: looks like it is coming from the invocation there https://opendev.org/zuul/zuul-jobs/src/branch/master/roles/tox/tasks/siblings.yaml ie zuul-jobs so we can't workaround it i think10:15
*** jmasud has quit IRC10:16
*** jaosorior has quit IRC10:17
zbri do not see a bug there, what is wrong on siblings?10:18
marioszbr: no that is where the invocation starts i mean and where we'd have to disable pep51710:18
marioszbr: just from going on the logs trace10:18
marioszbr: https://bugs.launchpad.net/tripleo/+bug/1876073/comments/210:19
openstackLaunchpad bug 1876073 in tripleo "Zuul CI is giving false positive on role-addition and molecule consistently " [Critical,Triaged]10:19
zbrmarios: here is my proof: https://review.opendev.org/#/c/724627/10:23
*** ykarel|lunch is now known as ykarel10:25
*** derekh has quit IRC10:41
*** derekh has joined #oooq10:42
akahat|ruckzbr, your patch worked: https://review.opendev.org/#/c/724627/10:44
zbrnot surprised, there is another failed build, which is supposed has similar reason.10:45
zbrakahat|ruck: can you take care of making a fix on this and ping me if getting stuck?10:46
akahat|ruckzbr, yeah sure.10:47
akahat|ruckarxcruz, hey o/10:59
*** udesale_ has joined #oooq11:12
*** udesale has quit IRC11:15
*** ysandeep is now known as ysandeep|brb11:18
arxcruzakahat|ruck: hey, ho, let's go11:27
arxcruzakahat|ruck: sorry, was in a meeting, what's up ?11:27
akahat|ruckarxcruz, it's okay. I have few jobs from openstack-periodic-wednesday-weekend pipeline.11:28
akahat|ruckthose jobs are breaking on one specific test.11:29
akahat|ruckarxcruz, this is test: - tempest.scenario.test_network_basic_ops.TestNetworkBasicOps11:29
akahat|ruckall of them are queens jobs.11:29
akahat|ruckarxcruz, - https://logserver.rdoproject.org/openstack-periodic-wednesday-weekend/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset010-queens/998a119/logs/undercloud/home/zuul/tempest/tempest.html.gz11:29
akahat|ruckust nee your help to verify this.. is it really issue ?11:30
arxcruzakahat|ruck: yes, it is, I sugest to open a promotion blocker, so the bug will be added on the cix board, and the network team can take a look11:32
arxcruzthe vm is up, running, but the network is failing11:32
akahat|ruckarxcruz, okay. I'll open bug.11:33
panda|ruckakahat|ruck: they're forming a straight line11:34
panda|ruckarxcruz: they're forming a straight line11:34
*** pojadhav is now known as pojadhav|brb11:34
arxcruzpanda|ruck: didn't get it...11:35
akahat|ruckpanda|ruck, what are you trying to say?11:35
*** jpena is now known as jpena|lunch11:36
panda|ruckarxcruz: you can't start with "hey, oh, let's go" without knowing the rest of the lyrics.11:39
weshay|ruckbhagyashris, http://dashboard-ci.tripleo.org/d/Z_UNB29Wz/third-party-dependency-check?orgId=111:39
arxcruzpanda|ruck: lol, i know the music, but i wasn't connectiong it, sorry, i still in my fist cup of coffee :D11:40
arxcruzeven though it's late here already11:40
panda|ruckarxcruz: I hope you're on you are in your third can of beer11:41
arxcruzpanda|ruck: Last time I drink was when marios set a meeting11:42
mariosweshay|ruck: fyi tomorrow i am out cos may 1st public holiday so taking the day11:43
mariosweshay|ruck: (noted during planning, i think quite a few folks are out tomorrow )11:44
akahat|ruckarxcruz, there is on more thing: https://logserver.rdoproject.org/openstack-periodic-master/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-scenario010-ovn-provider-standalone-master/d6660dd/logs/undercloud/home/zuul/tempest/tempest.html.gz11:47
akahat|rucklooks like simple :P11:47
arxcruzakahat|ruck: seems to be the same issue, but without the console log output11:48
akahat|ruckarxcruz, it's RHEL8 job11:49
akahat|ruckarxcruz, sorry.. centos811:49
arxcruzakahat|ruck: probably the console output option in temepst is off11:49
arxcruztempest*11:49
arxcruzbut pretty much the same thing11:49
weshay|ruckbhagyashris, https://github.com/rdo-infra/ci-config/tree/master/ci-scripts/infra-setup/roles/rrcockpit11:49
weshay|ruckbhagyashris, https://github.com/rdo-infra/ci-config/blob/master/ci-scripts/infra-setup/roles/rrcockpit/files/development_script.sh11:49
akahat|ruckarxcruz, okay.11:50
arxcruzakahat|ruck: oh, wait11:50
arxcruzakahat|ruck: it's a different test11:50
arxcruzwell, still, it tries to ssh to the vm, and get a timeout11:50
arxcruzThis might be impact other tests as well11:50
akahat|ruckarxcruz, yes.. it says password is None.11:50
akahat|ruckarxcruz, yes.11:50
rfolcoykarel, hi o/11:51
ykarelrfolco, hi, was about to ping you :)11:51
rfolcoykarel, link for the demo: https://meet.google.com/fgn-rnkp-sqt11:51
ykarelrfolco, ack i joined11:52
rfolcoykarel, if you have any slides or material, please paste here or add to the training page https://hackmd.io/OXNWWIShSBaPXPC8d7kFgQ11:53
ykareladding11:54
*** ysandeep|brb is now known as ysandeep11:54
rfolcoakahat|ruck, bhagyashris pojadhav|brb soniya29 ysandeep >> ping virtual training session - https://meet.google.com/fgn-rnkp-sqt11:55
rfolcoplease join 2 min earlier so we can get started on time11:55
arxcruzakahat|ruck: also, there's no errors in nova log11:58
arxcruzi would sugest you to create a promotion blocker to discuss this on cix call11:58
*** pojadhav|brb is now known as pojadhav11:59
akahat|ruckarxcruz, for this centos8: octavia_tempest_plugin.tests.scenario.v2.test_traffic_ops12:00
akahat|ruck?12:00
*** ratailor has quit IRC12:10
*** rlandy has joined #oooq12:22
rlandypojadhav: hi there12:25
pojadhavrlandy, in training12:25
rlandypojadhav: can we meet after scrum?12:26
pojadhavrlandy, yup12:26
weshay|ruckrfolco, rec12:29
weshay|ruckis that on?12:29
weshay|ruckcan't tell12:29
weshay|ruck:)12:29
rfolcoweshay|ruck, recording is on12:29
rfolco:)12:29
*** derekh has quit IRC12:32
*** amoralej is now known as amoralej|lunch12:33
weshay|ruckpanda|ruck, akahat|ruck looking at queens failures..12:36
weshay|ruckhttps://logserver.rdoproject.org/openstack-periodic-wednesday-weekend/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset016-queens/5449d05/logs/tempest.html.gz12:36
weshay|ruckprobably going to have to bug / skip12:36
weshay|ruckqueens is fooked12:36
weshay|ruckhttps://logserver.rdoproject.org/openstack-periodic-wednesday-weekend/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset010-queens/998a119/logs/tempest.html.gz12:36
rfolcoykarel, don't know if you can hear us, there is one more question from soniya2912:37
rfolcoykarel, okay, so why do we even need to add packages for non openstack dependency?12:37
weshay|ruckhttps://logserver.rdoproject.org/openstack-periodic-wednesday-weekend/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001-queens/2d7d8bd/logs/tempest.html.gz12:37
ykarelrfolco, i can't hear right now12:37
akahat|ruckweshay|ruck, alredy file for network error: https://bugs.launchpad.net/tripleo/+bug/187608712:37
openstackLaunchpad bug 1876087 in tripleo "Network tests are failing on queens jobs" [Critical,Triaged] - Assigned to amolkahat (amolkahat)12:37
akahat|ruckweshay|ruck, will file for the volume_boot_pattern.12:38
weshay|ruckakahat|ruck, k.. updated the bug name https://bugs.launchpad.net/tripleo/+bug/187608712:38
openstackLaunchpad bug 1876087 in tripleo "Queens, tempest.scenario.test_network_basic_ops.TestNetworkBasicOps failing. Timeout" [Critical,Triaged] - Assigned to amolkahat (amolkahat)12:38
*** derekh has joined #oooq12:38
*** jpena|lunch is now known as jpena12:38
weshay|ruckakahat|ruck++12:38
arxcruzykarel: we can't hear you12:39
weshay|ruckykarel, question in chat... re-review current -> consistent12:39
arxcruzakahat|ruck: weshay|ruck both octavia and this are the same issue12:39
arxcruzssh timeout12:39
weshay|ruckarxcruz, ya.. we're talking about queens though12:39
arxcruzthe message are different because they are different tests, but the root cause seems to be equals12:39
arxcruzok12:40
weshay|ruckarxcruz, will need ur help in queens as most issues are in tempest atm  soniya29 ^12:40
weshay|ruckarxcruz, octavia is train+ no?12:40
weshay|ruckI'm not clear on the connection you are drawing other than the error message is the same.. but it's generic.. right?12:41
weshay|ruckor am I misunderstanding12:41
weshay|ruckakahat|ruck, add these failed tempest tests to the skip list12:41
arxcruzweshay|ruck: i check the logs, haven't see any error on neutron side12:41
arxcruzthe message are different because it uses different methods to ssh into the machine, but both are timeing out12:42
weshay|ruckakahat|ruck, https://opendev.org/openstack/tripleo-quickstart-extras/src/branch/master/roles/validate-tempest/vars/tempest_skip_queens.yml12:42
arxcruzssh into the vm*12:42
weshay|ruckarxcruz++ please assist akahat|ruck12:42
arxcruzweshay|ruck: akahat|ruck oh, found something in openvswitch...12:43
arxcruz2020-04-30 10:06:37.256 80095 ERROR neutron.agent.linux.async_process [-] Error received from [ovsdb-client monitor tcp:127.0.0.1:6640 Interface name,ofport,external_ids --format=json]: net_mlx5: cannot load glue library: libibverbs.so.1: cannot open shared object file: No such file or directory12:43
akahat|ruckarxcruz, those packages are missing again?12:44
arxcruzhttps://logserver.rdoproject.org/openstack-periodic-wednesday-weekend/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset010-queens/998a119/logs/subnode-1/var/log/containers/neutron/openvswitch-agent.log.txt.gz12:44
arxcruzakahat|ruck: i remember i saw these errors before...12:44
akahat|ruckthose packages did not get pulled as dependency for neutron?12:44
akahat|rucks/neutron/openvswith/12:45
*** dpawlik has joined #oooq12:46
*** dpawlik has quit IRC12:46
*** dpawlik has joined #oooq12:46
chemmarios: I'm wondering if the last comment in the lp means that something must be changed in the review ?12:49
weshay|ruckakahat|ruck, panda|ruck \0/ https://review.rdoproject.org/zuul/build/521209cf497444a98144fd0eca84c9b312:50
chemmarios: basically, we are downporting this today ... so if it needs another round, the earlier I know the better :)12:50
weshay|ruckchem, the job should be marked nv imho12:53
weshay|ruckpanda|ruck, can you review train for promotion12:53
weshay|ruckpanda|ruck, let's pull the trigger it to fix pacemaker12:54
marioschem: looking12:54
weshay|ruckzbr, marios mark it non-voting w/ a bug included in the patch12:54
mariosakahat|ruck: zbr: didn't zbr fix that?12:55
zbrakahat|ruck: marios: wait, cloudnull is working on a fix12:55
marioshttps://review.opendev.org/#/c/724627/ with that12:55
marioschem: sorry i thought it was fixed cos i saw that discussion earlier here ^^^12:55
mariospanda|ruck: akahat|ruck: so even if cloudnull is working on a fix we can mark non voting to unblock chem for now12:57
akahat|ruckmarios, yes. i think we can do that as quick fix.12:57
weshay|ruckpanda|ruck, ?12:57
weshay|ruckyou with me?12:57
mariosakahat|ruck: are you on that or would you like me to?12:58
akahat|ruckmarios, could you please.. I'm busy with some other stuff.12:58
mariosakahat|ruck: k doing12:59
akahat|ruckmarios, thanks13:00
panda|ruckweshay|ruck: I am now13:02
weshay|ruckpanda|ruck, see what you think about train.. periodic results13:02
rfolcoscrum is delayed a few min13:03
panda|ruckweshay|ruck: looking13:05
rfolcoykarel, you are muted13:06
ysandeeprfolco, do you have the hackmd link handy where yatin have created for demo.13:09
pojadhavykarel, thanks for the session :)13:10
ykarelysandeep, it's in presentation also, u can check https://hackmd.io/lzjXK_FmQfeL7Wr_J-3V9A13:10
rfolcoysandeep, https://hackmd.io/OXNWWIShSBaPXPC8d7kFgQ click on ykarel's presentation13:10
rfolcoscrum time13:10
ysandeepykarel, thank you for the session, It was very informative o/13:10
ysandeepykarel++13:10
ykarelplease go through exercises, it shouldn't take much time, but will help to understand better :)13:11
ysandeeprfolco, thanks!13:11
pojadhavykarel, sure13:11
rfolcoarxcruz, rfolco, zbr, panda, sshnaidm, rlandy, marios, ysandeep, bhagyashris, soniya29, pojadhav, akahat, weshay, chandankumar13:12
rfolcoping scrum13:12
rfolcohttps://meet.google.com/oiv-geho-mai?authuser=113:12
weshay|ruckin another mtg atm13:12
*** ykarel is now known as ykarel|afk13:14
panda|ruckweshay|ruck: I think we are good to go on train, want me to propose the waive13:15
panda|ruck?13:15
weshay|ruckpanda|ruck, yes.. please propose a waive :)13:17
weshay|ruckthank you13:17
weshay|ruckI'll review and merge :)13:18
*** amoralej|lunch is now known as amoralej13:19
*** skramaja has quit IRC13:22
*** skramaja has joined #oooq13:23
*** TrevorV has joined #oooq13:23
panda|ruckweshay|ruck: that job is not in promotion criteria, so train should have promoted by now.13:27
marios10:27 < akahat|ruck> BZ is already there for it: https://bugs.launchpad.net/tripleo/+bug/187583313:30
openstackLaunchpad bug 1875833 in tripleo "The WebSocket timed out before the Workflow completed in rocky/stain jobs" [Critical,New] - Assigned to amolkahat (amolkahat)13:30
mariosrfolco: ^13:30
panda|ruckweshay|ruck: but it didn't13:34
mariosarxcruz: 10:27 < akahat|ruck> BZ is already there for it: https://bugs.launchpad.net/tripleo/+bug/187583313:47
openstackLaunchpad bug 1875833 in tripleo "The WebSocket timed out before the Workflow completed in rocky/stain jobs" [Critical,New] - Assigned to amolkahat (amolkahat)13:47
arxcruzmarios: efcharistó13:48
mariosarxcruz: bitteschon13:51
*** ysandeep is now known as ysandeep|afk13:56
*** ykarel|afk is now known as ykarel14:09
panda|ruckweshay|ruck: I was mistaken, so the fs039 failed last, but a bunch of other jobs failed 1 hour before. fs002 failed, so we can't promote without overcloud images..14:09
* weshay|ruck looks14:10
*** Goneri has joined #oooq14:10
weshay|ruckpanda|ruck, ah.. the current one is passing https://logserver.rdoproject.org/openstack-periodic-latest-released/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-1ctlr_1comp-featureset002-train-upload/57f75bd/14:11
weshay|ruckso perhaps hold for that one14:11
panda|ruckweshay|ruck: yep14:11
weshay|ruckpanda|ruck, and fs20 passed14:11
weshay|ruckpanda|ruck, fs001/35 also passing14:12
panda|ruckweshay|ruck: ah ye the current one looks good14:12
weshay|ruckmight promote on it's own14:12
weshay|rucksoniya29, arxcruz fyi.. https://trello.com/c/niYGK63i/1475-cixlp1876087tripleociproa-queens-tempestscenariotestnetworkbasicopstestnetworkbasicops-failing-timeout14:16
arxcruzweshay|ruck: I saw it14:17
weshay|ruckdanka14:17
soniya29weshay|ruck, I will have a look over it14:19
*** dtantsur has quit IRC14:24
weshay|ruckthanks!14:24
rfolcoakahat|ruck, do we have a fix for https://bugs.launchpad.net/tripleo/+bug/1875833 already?14:26
openstackLaunchpad bug 1875833 in tripleo "The WebSocket timed out before the Workflow completed in rocky/stain jobs" [Critical,New] - Assigned to amolkahat (amolkahat)14:26
akahat|ruckrfolco, no not yet.14:26
rfolcoworkarounds? anything? marios panda|ruck ^14:27
weshay|ruckakahat|ruck, panda|ruck train is promoting14:27
mariosrfolco: was not involved there at all i just got the bug from seeing those jobs fail on a few reviews this morning14:27
rfolcomarios, ack14:28
mariosrlandy: so do you think i should file a new BZ for https://sf.hosted.upshift.rdu2.redhat.com/logs/66/198466/1/check/periodic-tripleo-build-containers-rhel-8-master/845202a/logs/containers-failed-to-build.log14:33
rlandyarxcruz: tls and tempest ... I see in validate tempest there is an option to work with tls - is there something similar in os_tempest?14:33
mariosrlandy: maybe just one for all of them ?14:33
*** dtantsur has joined #oooq14:33
rlandymarios: there are a few14:33
mariosrlandy: yeah its like 414:33
rlandymarios: one per BZ is probably clearer14:33
rlandyif you have the time14:33
mariosrlandy: ack14:34
rlandymarios: wrt fs039, are you running os_tempest there?14:34
mariosrlandy: yeah14:34
mariosrlandy: the problem right now though isn't tempest there its that https://bugs.launchpad.net/tripleo/+bug/1875353 which is a race14:34
openstackLaunchpad bug 1875353 in tripleo "(inconsistent/race) periodic centos-8-ovb featureset039-master fails overcloud ipa-client install No such file or directory: /var/lib/ipa-client/sysrestore" [High,Triaged]14:34
rlandymarios: are you adding anything special?14:34
mariosrlandy: i.e. fails on deployment14:35
rlandymarios: ack - but standalone ipa has a tempest failure ... TASK [os_tempest : Ping router ip address]14:35
rlandyfails14:35
rlandyand I wanted to know if I needed to add another option to deal with tls14:35
rlandyhttps://sf.hosted.upshift.rdu2.redhat.com/logs/98/195798/5/check/tripleo-ci-centos-8-standalone-on-multinode-ipa-internal/a66d9a5/job-output.txt14:36
rlandymarios: arxcruz: ^^?14:36
rlandyhttps://github.com/openstack/tripleo-quickstart-extras/blob/master/roles/validate-tempest/templates/run-tempest.sh.j2#L9414:36
zbrrlandy: marios see https://review.opendev.org/#/c/724664/14:36
mariosrlandy: i updated fs39 to use the os-tempest stuff https://github.com/openstack/tripleo-quickstart/blob/701271b11c8f793600726434b168f3cea44feb7b/config/general_config/featureset039.yml#L14014:36
zbrpanda|ruck: ^14:37
mariosrlandy: ack hadn't seen that one in centos8/fs39 i.e. the ping fail from os-tempest14:37
rlandymarios: ok - and that's working fine for you14:38
mariosrlandy: well when deployment passes then yeah we get green job with tempest14:38
rlandyok14:38
mariosrlandy: but some kind of race cos it is inconsistent14:38
rlandyI'll look at that later14:38
rlandyarxcruz: what does tempest_os_cloud refer to?14:44
arxcruzrlandy: in which context?14:51
arxcruzfrom the name, i believe it's the oscloud name for the yaml file containing the credentials14:52
rlandyarxcruz: in creating projects ... https://github.com/openstack/openstack-ansible-os_tempest/blob/ad4841f70a0ad4ed8090fc3f4c83812204cb59c0/tasks/tempest_resources.yml#L8314:52
rlandyhmmm ... where do I find that?14:53
mariosrlandy: k going to put ceilo-base and swift-proxy-server together cos they have the same dependency missing14:53
rlandyok14:53
rlandyhttps://sf.hosted.upshift.rdu2.redhat.com/logs/98/195798/5/check/tripleo-ci-centos-8-standalone-on-multinode-ipa-internal/a66d9a5/logs/undercloud/home/zuul/.config/openstack/clouds.yaml14:54
rlandyarxcruz: ^^ that?14:54
rlandyarxcruz: what I think I need is something like: https://github.com/openstack/tripleo-quickstart-extras/blob/master/roles/validate-tempest/templates/run-tempest.sh.j2#L9414:55
arxcruzrlandy: https://opendev.org/openstack/tripleo-quickstart-extras/src/branch/master/roles/validate-tempest/templates/configure-tempest.sh.j2#L2014:56
arxcruzbasically, it's the name of the credential in the clouds.yaml14:56
rlandyarxcruz: ok - that seems fine14:57
rlandyarxcruz: just trying to figure out why it can't ping router14:57
*** bogdando has quit IRC14:59
rlandymarios: sorry - another tempest question ...15:11
rlandyhttps://github.com/openstack/tripleo-quickstart/blob/master/config/general_config/featureset039.yml - the tempest setting you added here ...15:11
rlandythey are not in fs05215:11
rlandyI think those settings are elsewhere?15:12
mariosrlandy: filed filed new https://bugzilla.redhat.com/show_bug.cgi?id=1829918 https://bugzilla.redhat.com/show_bug.cgi?id=1829921 https://bugzilla.redhat.com/show_bug.cgi?id=1829924 https://bugzilla.redhat.com/show_bug.cgi?id=182992715:12
openstackbugzilla.redhat.com bug 1829918 in releng "Cannot build aodh-base container for rhel8-osp17 - missing python3-ceilometerclient" [Unspecified,New] - Assigned to rhos-maint15:12
openstackbugzilla.redhat.com bug 1829921 in releng "Cannot build designate-base container for rhel8-osp17 - missing python3-edgegrid" [Unspecified,New] - Assigned to rhos-maint15:12
openstackbugzilla.redhat.com bug 1829924 in releng "Cannot build octavia-base container for rhel8-osp17 - missing python3-tenacity" [Unspecified,New] - Assigned to rhos-maint15:12
openstackbugzilla.redhat.com bug 1829927 in releng "Cannot build ceilometer-base swift-proxy-server container for rhel8-osp17 - missing python3-tenacity" [Unspecified,New] - Assigned to rhos-maint15:12
mariosso that was fun15:12
mariosrlandy: added to https://tree.taiga.io/project/tripleo-ci-board/task/1656 & back to in progress15:12
mariosi'll refrain from sending the cix mails though unless lon tells us he need them? wdyt15:13
mariosdo we need those on the trello board is the question15:13
mariosrlandy: reading back on fs5215:13
mariosrlandy: 18:11 < rlandy> they are not in fs05215:13
mariosrlandy: not sure what you mean?15:14
mariosrlandy: the fs39 things are not in fs 52 ?15:14
rlandymarios: thanks re: BZ's ... I'll chat with lon as soon as I am done debugging tempest here15:14
mariosrlandy: i'm out tomorrow its a public holiday here15:16
rlandymarios: ah - May 1st - labor day everywhere expect here15:17
panda|ruckalmost everywhere.15:30
mariosrlandy: and then on monday we get 'upgraded' we get 3 sms per day \o/ for outing :D - joking aside its progress... stage 1 of lifting restrictions some shops will open etc15:31
rlandymarios: enjoy - go do three fun things15:31
*** skramaja has quit IRC15:32
*** matbu has quit IRC15:34
rfolcoakahat|ruck, panda|ruck: this was the last time tripleo-ci-centos-7-containers-multinode-rocky reported success: 2020-04-27T16:17:3115:54
*** marios is now known as marios|out16:09
weshay|ruckpanda|ruck, akahat|ruck before you guys go.. do we have patches up to the skip list?16:14
*** matbu has joined #oooq16:20
*** jmasud has joined #oooq16:24
*** marios|out has quit IRC16:30
*** ykarel is now known as ykarel|afk16:31
rlandyweshay|ruck: hey ...16:33
rlandyweshay|ruck: we hit build dependency failures again in downstream containers per the BZs marios added above16:33
rlandyweshay|ruck: chatted with lon about fixing those missing deps - some of which we need to wait until monday to check with  tvignaud16:34
weshay|ruckfun16:34
rlandyweshay|ruck: what do you think about adding those failing containers to the exclude list and just pushing the ones that succeed16:34
weshay|ruckrlandy, any luck w/ pushing?16:35
rlandyso that we can test push?16:35
weshay|ruckrlandy, ya.. that's fine as a temp16:35
weshay|ruck+116:35
rlandyweshay|ruck: you don't get to push if build fails16:35
weshay|ruckya.. I know unless you try it manually16:35
weshay|ruck:)16:35
rlandyweshay|ruck: alter, if easy enough, I'll patch openstack-tripleo-common to exclude those deps16:35
rlandybut as lon said, they may actually get added16:36
rlandychicken-egg problem16:36
*** ykarel|afk is now known as ykarel|away16:36
weshay|ruckzbr,16:38
weshay|ruckmolecule-container-push-delegated-centos-7FAILURE in 4m 08s16:38
weshay|ruckmolecule-tripleo-common-delegated-centos-7FAILURE in 3m 23s16:38
weshay|ruckmolecule-delegated-promote-images-delegated-centos-7FAILURE in 2m 00s16:38
weshay|ruckthose have been red for several days now16:38
weshay|ruckpanda|ruck, ^16:39
*** udesale_ has quit IRC16:39
weshay|ruckrlandy, https://review.rdoproject.org/r/#/c/27117/16:42
rlandyfine by me16:43
weshay|ruckhttps://review.opendev.org/724703 https://review.opendev.org/724704  <--- keep an eye on16:45
zbrgoing to have a brief look now16:50
panda|ruckzbr: it's clearer in this job https://logserver.rdoproject.org/22/27022/1/check/molecule-delegated-promote-images-delegated-centos-7/da19134/job-output.txt17:04
zbrpanda|ruck: yep, remove the pinning of molecule, latest version does not have this bug (caused by a bad release of "sh")17:05
zbruse >3.0,<3.117:05
zbr3.0.4 fixed that17:05
*** dtantsur is now known as dtantsur|afk17:06
zbri already mentioned that twice yesterday17:06
panda|ruckmentioned to whom ?17:08
panda|ruckzbr: ^ ?17:08
zbrahh, I forgot that ruck does not join scrums17:20
*** chem has quit IRC17:22
*** jpena is now known as jpena|off17:24
zbrpanda|ruck: see https://pypi.org/project/sh/#history -- for molecule 2.x you must manually add <1.1317:24
*** chem has joined #oooq17:24
*** amoralej is now known as amoralej|off17:36
*** dpawlik has quit IRC17:38
*** jmasud has quit IRC17:43
*** jbadiapa has quit IRC18:03
*** ccamacho has quit IRC18:08
*** jmasud has joined #oooq18:11
zbrweshay|ruck: rlandy fix for promoter https://review.rdoproject.org/r/#/c/27118/ (passed)18:14
*** jmasud has quit IRC18:17
*** jmasud has joined #oooq18:18
rfolcoakahat|ruck, panda|ruck: filed this bug as it seems to be different issue: https://bugs.launchpad.net/tripleo/+bug/187616018:27
openstackLaunchpad bug 1876160 in tripleo "pacemaker: Failed to call refresh on containers-multinode-{rocky,queens}" [Critical,Triaged]18:27
weshay|ruckzbr, thanks.. move to nv abandonded 27118 wf18:49
weshay|ruckrfolco, what are you hitting?18:50
rfolcoweshay|ruck, bug above is blocking release file patch18:50
weshay|ruckrfolco, dupe18:51
rfolcoweshay|ruck, https://review.opendev.org/#/c/72184218:51
rfolcodupe of ?18:51
*** jmasud has quit IRC18:53
*** rlandy is now known as rlandy|biab19:00
*** jmasud has joined #oooq19:08
weshay|ruckrfolco, we know :)19:09
rfolcoweshay|ruck, any fixes or workarounds ?19:10
*** jfrancoa has quit IRC19:25
*** derekh has quit IRC19:25
*** rlandy|biab is now known as rlandy19:32
rlandyweshay|ruck: do you still have not merged patches for get-hash or are all your changes in?19:32
rlandyfixing for downstream19:33
weshay|ruckrfolco, rlandy https://review.rdoproject.org/r/2711919:37
weshay|ruckthreading needles here19:38
weshay|ruckrlandy, I don't know :)19:38
rlandyweshay|ruck: well - this is where we have come to - running one job in promotion19:39
rlandy+219:39
weshay|ruckheh.. weeeeeee19:39
weshay|ruckrlandy, and turning tempest off19:39
weshay|ruckrlandy, https://review.opendev.org/#/c/724703/19:40
weshay|ruckshit hit the fan19:40
rfolcoweshay|ruck, I don't want to slow you down, I just don't understand how relaxing promotion criteria helps to fix the issue19:40
rlandyweshay|ruck: ho4it19:43
rlandygo4it19:43
weshay|ruckrfolco, ya.. well sign me up for another section in https://hackmd.io/OXNWWIShSBaPXPC8d7kFgQ and I'll walk through it19:43
weshay|ruckbecause this will happen again19:43
rfolcounless we switch with some pre-scheduled one, next available slot is June 18th19:45
rfolcoweshay|ruck, ^19:45
weshay|ruckfine.. it shouldn't happen again before that19:45
rfolcok19:46
rfolcoweshay|ruck, how do you entitle this ?19:47
rfolcoliving dangerous19:47
rfolcodo not do this at home19:47
rfolcoI still know what you did last summer19:48
*** jmasud has quit IRC19:48
weshay|ruckWhat to do went CentOS breaks your shit19:48
rfolcok19:48
*** saneax has quit IRC19:51
*** jmasud has joined #oooq19:52
rlandyweshay|ruck: we're close here ... tls deploy is passing ... falling over on tempest now - can't ping the router... I tried adding the ca_cert, changing to use the public interface ... no dice20:16
rlandyany other ideas?20:16
rlandywould love to close this out20:16
weshay|ruckrlandy, this is master?20:17
rlandyweshay|ruck: ack20:17
rlandyhttps://zuul.openstack.org/stream/e16e76d5c8ab49fd8b4077a5f95b5417?logfile=console.log20:17
rlandyhttps://github.com/openstack/openstack-ansible-os_tempest/blob/master/tasks/tempest_resources.yml#L28320:18
rlandydies there ^^20:18
weshay|ruckrlandy, will have to look at the logs20:18
rlandyweshay|ruck: https://zuul.opendev.org/t/openstack/build/fbdf8a89dda8464d8938b5d53bb6210b20:19
rlandyneed to chat with chandankumar when he gets back20:19
rlandythere were settings to deal with tls in validate tempest20:19
weshay|ruck+ exec /usr/bin/networking-ovn-metadata-agent --config-file /etc/neutron/neutron.conf --config-file /etc/neutron/plugins/networking-ovn/networking-ovn-metadata-agent.ini --log-file=/var/log/neutron/ovn-metadata-agent.log20:25
weshay|rucknet_mlx5: cannot load glue library: libibverbs.so.1: cannot open shared object file: No such file or directory20:25
weshay|rucknet_mlx5: cannot initialize PMD due to missing run-time dependency on rdma-core libraries (libibverbs, libmlx5)20:25
weshay|ruckPMD: net_mlx4: cannot load glue library: libibverbs.so.1: cannot open shared object file: No such file or directory20:25
weshay|ruckPMD: net_mlx4: cannot initialize PMD due to missing run-time dependency on rdma-core libraries (libibverbs, libmlx4)20:25
weshay|ruckrlandy, ^20:25
weshay|ruckhttps://802b28d13e59f2e68859-fef50fbf80bf7a8da07d35d3242576b8.ssl.cf1.rackcdn.com/706288/25/check/tripleo-ci-centos-8-standalone-on-multinode-ipa/fbdf8a8/logs/undercloud/var/log/extra/podman/containers/ovn_metadata_agent/stdout.log20:25
weshay|ruckcomparing to https://0743aa731054346d94ba-c5bb35ae23b4f1517e6ef394b01c7761.ssl.cf1.rackcdn.com/722109/4/check/tripleo-ci-centos-8-standalone/c5504de/logs/undercloud/var/log/containers/neutron/ovn-metadata-agent.log20:25
rlandyweshay|ruck: hmmm ...not following20:26
rlandyhow would that be diff in tls?20:27
weshay|ruckrlandy, well.. I would at least run it twice20:31
weshay|ruckit could be you just had bad luck that go20:31
rlandyweshay|ruck: I did20:31
weshay|rucksame error both times?20:31
rlandythe second run failed in the same place20:31
rlandyand this run on internal20:32
rlandyhttps://sf.hosted.upshift.rdu2.redhat.com/logs/98/195798/5/check/tripleo-ci-centos-8-standalone-on-multinode-ipa-internal/a66d9a5/logs/20:32
rlandyweshay|ruck: somehow fs039 runs20:32
weshay|ruckrlandy, this is the internal port version?20:33
rlandyweshay|ruck: what do you mean by internal port version?20:33
weshay|ruckthe work around you referred to yesterday20:34
rlandyweshay|ruck: no, this is the proper - no-hack-me version20:35
rlandythe hacked version passed tempest20:36
rlandyso here's another question ...20:36
rlandytempest_interface_name20:36
rlandypublic or internal?20:36
rlandyI think that should be public20:36
rlandyI tried setting it but no idea if that setting took20:36
weshay|ruckhrm20:38
weshay|ruckrlandy, it shouldn't be different than the normal tempest job..20:38
weshay|rucknormal standalone rather20:39
rlandyhttps://2bd0765af3260bed3084-e3e0586d119e861007b7a5c43dc05934.ssl.cf2.rackcdn.com/706288/26/check/tripleo-ci-centos-8-standalone-on-multinode-ipa/e16e76d/logs/undercloud/var/log/containers/neutron/ovn-metadata-agent.log20:40
rlandy^^ clean log - still fails to ping20:40
rlandyhttps://review.opendev.org/#/c/724729/ made no diff20:41
rlandyweshay|ruck: ^^20:41
rlandytrying turning off ssl validation altogether20:41
rlandyto see if that helps20:41
rlandyhttps://github.com/openstack/openstack-ansible-os_tempest/blob/master/tasks/tempest_resources.yml#L25420:42
weshay|ruckrlandy, afaict neutron did not come up properly20:42
* weshay|ruck looks at internal 20:42
weshay|ruckah20:42
weshay|ruckur saying that one is clean20:42
rlandyyep20:43
rlandysecond run20:43
weshay|ruckrlandy, nope20:44
weshay|ruckrlandy, ur looking at the wrong log :)  https://2bd0765af3260bed3084-e3e0586d119e861007b7a5c43dc05934.ssl.cf2.rackcdn.com/706288/26/check/tripleo-ci-centos-8-standalone-on-multinode-ipa/e16e76d/logs/undercloud/var/log/extra/podman/containers/ovn_metadata_agent/stdout.log20:44
weshay|rucksame issue20:45
weshay|ruckhttps://2bd0765af3260bed3084-e3e0586d119e861007b7a5c43dc05934.ssl.cf2.rackcdn.com/706288/26/check/tripleo-ci-centos-8-standalone-on-multinode-ipa/e16e76d/logs/undercloud/var/log/extra/podman/podman_allinfo.log20:45
rlandyweshay|ruck> comparing to https://0743aa731054346d94ba-c5bb35ae23b4f1517e6ef394b01c7761.ssl.cf1.rackcdn.com/722109/4/check/tripleo-ci-centos-8-standalone/c5504de/logs/undercloud/var/log/containers/neutron/ovn-metadata-agent.log20:46
rlandy^^ two diff logs :) - followed your comment20:46
rlandyweshay|ruck: same problem here  https://320c106ad4452b72d3ea-9dead46848380cd5333c94a2972396ab.ssl.cf5.rackcdn.com/724436/2/gate/tripleo-ci-centos-8-standalone/1df4a52/logs/undercloud/var/log/extra/podman/containers/ovn_metadata_agent/stdout.log20:48
rlandyjob passes fine20:48
weshay|ruckrlandy, hold the nodes in a test project20:49
weshay|ruckwill be easier20:49
rlandyI wish - will have to wait until tomorrow when internal clears from its phantom jobs - blocked there20:50
rlandynhicher said he'd bounce that tomorrow20:50
*** cgoncalves has quit IRC20:53
*** kopecmartin has quit IRC20:53
*** whoami-rajat has quit IRC20:53
*** pojadhav has quit IRC20:53
*** beagles has quit IRC20:53
*** whoami-rajat has joined #oooq20:54
*** b3nt_pin has joined #oooq20:54
*** pojadhav has joined #oooq20:55
*** cgoncalves has joined #oooq20:55
*** jmasud has quit IRC21:21
rlandyweshay|ruck: rfolco:  https://review.rdoproject.org/r/27120 Upadte dlrn_report role for downstream usage21:36
weshay|ruckrlandy, can we remove     - fedora_centos_mixed_distribution_job|default(false)21:37
weshay|ruckthat's long gone21:37
rlandyweshay|ruck: k21:37
rlandystill in the upstream playbooks though21:37
rlandyweshay|ruck: https://review.rdoproject.org/r/#/c/27120/ updated21:48
weshay|ruckrlandy, one more21:54
rlandyweshay|ruck: link?21:55
weshay|ruckcomment21:55
*** TrevorV has quit IRC22:00
rlandyweshay|ruck: is train rhel-8 still a thing?22:09
weshay|ruckrlandy, if you could kill those jobs, I'd appreciate it22:09
rlandyweshay|ruck: I ma just asking for the conditions on using get-hash22:10
rlandyeither way, this role is not yet used upstream22:10
*** Tengu has quit IRC22:11
rlandyhttps://review.rdoproject.org/r/#/c/27120/3/roles/dlrn-report/tasks/dlrn-vars-setup.yml22:11
rlandyweshay|ruck: ^^ that will fail with rhle-8 train ... but it's fine22:11
rlandythis role is never used there22:11
*** Tengu has joined #oooq22:12
*** rfolco has quit IRC22:37
*** jmasud has joined #oooq22:38
*** jmasud has quit IRC22:39
*** jmasud has joined #oooq22:43
*** tosky has quit IRC23:11
*** jmasud has quit IRC23:44
*** jmasud has joined #oooq23:49

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!