15:00:15 #startmeeting qa
15:00:15 Meeting started Tue Apr 19 15:00:15 2022 UTC and is due to finish in 60 minutes. The chair is kopecmartin. Information about MeetBot at http://wiki.debian.org/MeetBot.
15:00:15 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.
15:00:15 The meeting name has been set to 'qa'
15:00:29 #link https://wiki.openstack.org/wiki/Meetings/QATeamMeeting#Weekly_QA_Team_meeting
15:00:33 agenda ^^
15:02:21 #topic Announcement and Action Item (Optional)
15:02:24 none from my side
15:02:30 #topic Zed Priority Items progress
15:02:35 o/
15:02:36 #link https://etherpad.opendev.org/p/qa-zed-priority
15:02:41 hi gmann
15:03:22 topic: Unstable tests in Tempest monitoring
15:03:36 I found out that we have 3 tests decorated as unstable ones
15:04:16 ok
15:05:26 test_create_object_with_transfer_encoding is associated with the following bug
15:05:28 kopecmartin: can you please remove the +W , i forgot to keep it in -W as I am still testing it https://review.opendev.org/c/openstack/tempest/+/837777
15:05:57 gmann: sure, sorry
15:06:08 thanks
15:06:18 so the above test is associated with the following bug:
15:06:21 #link https://bugs.launchpad.net/tempest/+bug/1905432
15:06:34 it's in progress, we can leave the test as is right now
15:06:41 although the other test
15:06:45 test_container_synchronization
15:06:54 should be fixed already
15:06:56 #link https://bugs.launchpad.net/tempest/+bug/1317133
15:07:09 do we want to remove the unstable decorator there and watch what happens?
15:07:53 I think we can remove and watch, its start of cycle so good to try
15:08:40 ack
15:08:49 the last one test_server_connectivity_cold_migration_revert
15:08:53 has an expired test
15:08:57 *bug
15:09:05 #link https://bugs.launchpad.net/neutron/+bug/1836595
15:09:23 should we skip the test permanently?
15:10:13 humm skip test permanently is not good
15:10:42 yeah, ok, maybe let me dig in a bit and find out where/what jobs execute the test
15:10:50 let's see how it behaves
15:12:14 yeah, we can re-open it for neutron
15:12:41 sounds good
15:12:50 any other updates?
15:13:12 i'm gonna push a review soon to switch to ecdsa keys by default in tempest
15:15:34 #topic OpenStack Events Updates and Planning
15:15:38 nothing specific here
15:15:43 #topic Gate Status Checks
15:15:51 https://review.opendev.org/q/label:Review-Priority%253D%252B2+status:open+(project:openstack/tempest+OR+project:openstack/patrole+OR+project:openstack/devstack+OR+project:openstack/grenade)
15:16:04 no reviews there
15:16:10 i have a couple of patches failing in barbican-tempest-plugin with the same error
15:16:25 https://aad416e65f3b6b410df4-779f7e98ba8ab2ef93c3580c1febb3f0.ssl.cf2.rackcdn.com/834000/3/check/barbican-tempest-plugin-simple-crypto-victoria/07b40f8/job-output.txt
15:16:36 yeah, they fail due to this
15:16:38 #link https://bugs.launchpad.net/devstack/+bug/1968798
15:17:20 even though some workaround patches have been merged, it still fails somewhere
15:17:30 e.g. barbican-tempest-plugin or python-tempestconf
15:17:36 i don't know why
15:18:09 soniya29: kopecmartin yeah it should be fixed now. or did you see the error after the fixes were merged?
15:18:13 kopecmartin, my patches are blocked because of it :)
15:18:24 gmann, i am seeing these errors today
15:18:45 yep, it failed today, see my comment
15:18:51 https://bugs.launchpad.net/devstack/+bug/1968798/comments/5
15:18:58 soniya29: this is a victoria job which should be fixed too
15:19:20 gmann, it is failing on xena, ussuri as well
15:19:34 https://review.opendev.org/c/openstack/barbican-tempest-plugin/+/834000
15:19:47 ussuri it will fail as we fixed only till stable/victoria
15:20:08 https://review.opendev.org/c/openstack/barbican-tempest-plugin/+/833801
15:21:52 i wonder why this fails as it uses master :/
15:21:53 #link https://zuul.opendev.org/t/openstack/build/025e36991b6e472ba0c204fede267104
15:23:36 I haven't opened the links but any use of git as a different user than the owner of the repo can cause this
15:23:37 Eduardo Olivares proposed openstack/tempest master: Validate network downtime during live migration - part II https://review.opendev.org/c/openstack/tempest/+/838518
15:24:00 the patches to devstack have flagged the repos that devstack clones as safe to address this but if a plugin or job manually clones and doesn't do this they can still break
15:24:14 or if they interact with the zuul cloned repos directly as not the zuul user
15:24:29 etc etc. You'll need to dig into why they are still broken, particularly on master where the workarounds should be sufficient
15:26:20 clarkb: does this count as interacting with it directly as not zuul user?
15:26:21 #link https://opendev.org/openinfra/python-tempestconf/src/branch/master/roles/install-plugins/tasks/main.yaml#L5
15:27:20 kopecmartin: no that should run out of the tempest clone done by devstack which gets the flag set. The next task may be though as that pip installs requirements as root out of the zuul clone
15:27:41 er wait that doesn't pip install requirements it just gets upper constraints
15:27:50 it pip installs some list of plugins but I'm not sure where those plugins are located
15:28:11 it fails on the Prepare tempest venv task anyway
15:28:14 before installing plugins
15:28:21 ok, i'll try to dig more into that
15:29:11 btw, plugins_paths is just a var which references the plugins locations where they were cloned by devstack
15:29:34 so the workaround should work on that too
15:29:55 specifically the issue is running git commands in a git repo as a user other than the user that owns the files in the git repo
15:30:17 reading the files directly or navigating the repo with cd and that sort of interaction is fine. It is git itself policing this when you run git commands against a repo
15:30:35 installing things causes this to happen because pbr runs git commands to determine version info
15:33:07 i see, thanks, i'll check the tasks executed in more detail
15:33:59 #topic Periodic jobs Status Checks
15:34:08 stable:
15:34:09 #link https://zuul.openstack.org/builds?job_name=tempest-full-yoga&job_name=tempest-full-xena&job_name=tempest-full-wallaby-py3&job_name=tempest-full-victoria-py3&job_name=tempest-full-ussuri-py3&pipeline=periodic-stable
15:34:17 master:
15:34:19 #link https://zuul.openstack.org/builds?project=openstack%2Ftempest&project=openstack%2Fdevstack&pipeline=periodic
15:35:19 tempest-full-ussuri-py3 failed a couple of days ago with the same pbr error
15:35:21 #link tempest-full-ussuri-py3
15:35:27 #link https://zuul.openstack.org/build/e5fe6f0121924271b820c1ac32882640
15:35:44 which makes sense as the fix/workaround wasn't backported to ussuri
15:36:20 #link https://review.opendev.org/c/openstack/devstack/+/837749
15:37:28 #topic Distros check
15:37:33 centos 8/9 stream
15:37:40 #link https://zuul.openstack.org/builds?job_name=tempest-full-centos-9-stream&job_name=tempest-full-py3-centos-8-stream&job_name=devstack-platform-centos-8-stream&job_name=devstack-platform-centos-9-stream&skip=0
15:38:06 fedora
15:38:08 #link https://zuul.openstack.org/builds?job_name=devstack-platform-fedora-latest&skip=0
15:38:17 oh, that's really bad
15:38:28 openEuler
15:38:29 #link https://zuul.openstack.org/builds?job_name=devstack-platform-openEuler-20.03-SP2+&skip=0
15:38:42 debian
15:38:44 #link https://zuul.openstack.org/builds?job_name=devstack-platform-debian-bullseye&skip=0
15:39:41 #topic Sub Teams highlights
15:39:47 Changes with Review-Priority == +1
15:39:56 #link https://review.opendev.org/q/label:Review-Priority%253D%252B1+status:open+(project:openstack/tempest+OR+project:openstack/patrole+OR+project:openstack/devstack+OR+project:openstack/grenade)
15:40:08 no reviews
15:40:09 #topic Open Discussion
15:40:16 anything for the open discussion?
15:43:05 very quickly, I have a question
15:43:20 as you may know, there is an ongoing effort to switch the ceph deployment to cephadm (in devstack-plugin-cephadm)
15:43:33 while helping with testing some patches, we realized we don't test RGW in place of swift at all
15:43:42 so I've tried to enable it, and I hit an issue
15:43:44 the problem we are facing now is that when you disable swift, tempest verify-config complains because it finds the swift endpoint but swift is false in the service_enabled section.
15:43:48 So I'm not sure what to do: change the definitions of the services so that "s-*" means "the swift services" and "swift" generically means "any swift interface", and the latter can be set to true while the former to false?
15:43:55 or something else
15:43:59 let me dig the error
15:45:14 https://57f0df93d98f0ba67cf4-594c8dbcc2892c36fc68c95ad460eff0.ssl.cf5.rackcdn.com/837842/1/check/devstack-plugin-ceph-tempest-py3/5444202/job-output.txt
15:45:22 from https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/837842
15:45:53 (the error is not related to cephadm, that patch doesn't use it, it's an issue we have had for a long while)
15:47:02 Merged openstack/devstack stable/yoga: Write safe.directory items to system git config https://review.opendev.org/c/openstack/devstack/+/838352
15:47:04 Merged openstack/devstack stable/xena: Write safe.directory items to system git config https://review.opendev.org/c/openstack/devstack/+/838423
15:47:07 Merged openstack/devstack stable/wallaby: Write safe.directory items to system git config https://review.opendev.org/c/openstack/devstack/+/838426
15:47:10 Merged openstack/devstack stable/victoria: Write safe.directory items to system git config https://review.opendev.org/c/openstack/devstack/+/838428
15:48:04 * kopecmartin checking the links
15:51:20 what I'd like to avoid is to introduce another variable for devstack/tempest which really means "a generic swift interface is available", but maybe it's the only way
15:52:53 it sounds like a bug in verify_tempest_config .. i'm trying to figure out whether it's possible to make it smarter
15:53:17 .. without a new option
15:54:13 i forgot how the tool works, i need to check whether it's possible to deduce that rgw is enabled and skip the recommendation to enable swift
15:54:17 but is swift enabled in service_available option?
15:54:30 no, it's not
15:54:38 https://57f0df93d98f0ba67cf4-594c8dbcc2892c36fc68c95ad460eff0.ssl.cf5.rackcdn.com/837842/1/check/devstack-plugin-ceph-tempest-py3/5444202/controller/logs/tempest_conf.txt
15:56:16 and how we disabled the swift, it should be disabled in devstack in that case
15:57:18 Dan Smith proposed openstack/devstack master: Gather performance data after tempest https://review.opendev.org/c/openstack/devstack/+/837139
15:58:35 gmann: https://review.opendev.org/c/openstack/devstack-plugin-ceph/+/837842/1/.zuul.yaml
15:58:39 because if you are disabling the swift in tempest but enabled it in devstack then it will complain so you need to disable it on the devstack side also so that devstack would not create the swift endpoint
16:00:23 gmann: that seems like a suboptimal solution
16:00:35 tosky: kopecmartin it checks both 'swift' or 's-*' https://github.com/openstack/devstack/blob/676dcaf94487665882be048cfe1f3206d6807e0f/functions-common#L2083
16:00:38 gmann: we might need swift for some feature but not want to actually test it with devstack
16:01:01 gmann: for example we talked about using swift to store tpm data for shelve at one point
16:01:45 if we wanted to test that in the nova gate we would need to enable swift but we might not want to run the swift tempest tests
16:01:46 sean-k-mooney: yeah so in that case you just do not run the swift test.
16:02:04 right so we should be able to have swift enabled in devstack and disable it in tempest
16:02:21 verify-config tool is mainly to tell if anything mismatches in tempest config from what your cloud has
16:02:42 sure but could it not be a warning instead of an error
16:02:44 sean-k-mooney: not disable in tempest, just do not run the swift test via regex
16:03:18 gmann: right i'm saying using the regex is not good from a ux point of view
16:03:35 it works but if i disable it in the tempest config i should not have to use the regex
16:03:53 i understand that conflicts with verify-config's usecase
16:03:58 yeah but making verify-config less restrictive loses the overall goal of this tool
16:04:01 yeah
16:04:27 for current error, not sure why s-* are not working to disable service in devstack even though we have this https://github.com/openstack/devstack/blob/676dcaf94487665882be048cfe1f3206d6807e0f/functions-common#L2083
16:04:35 ignoring the tool, if i disable swift in tempest is that enough to disable the tests
16:05:13 gmann: is this related to ceph? is it the rados gateway?
16:05:19 gmann: I think disabling s-* is working - the point of the job is that RGW is enabled instead
16:05:24 gmann: so the endpoint is populated
16:05:30 right
16:05:42 so tempest should treat RGW as swift
16:05:53 the problem is that tempest tries to set service_enabled.swift based on s-* and swift value and it found it false and it sets it to false
16:05:57 RGW is meant to support swift's api
16:06:27 and then verify-config is called, before everything else could forcibly set service_enable.swift to true, and fails
16:06:56 ya so verify-config either needs to check the ceph plugin flag
16:07:07 basically everything would work if "swift" meant just "swift-like interface" and then the real availability of the services was handled by s-* variables
16:07:09 I see, I think we talked about RGW support in tempest/devstack testing but I need to check if it was something we talked about in a bug or so
16:07:12 or it needs to look at the keystone endpoints in preference to local.conf
16:07:25 sean-k-mooney: but that would be a reverse dependency to devstack-plugin-ceph
16:07:43 not entirely
16:07:59 it does not depend on it, it can simply check if the value is set or not
16:08:06 unless you have a variable which says "expect to find a swift interface, regardless of the implementation"
16:08:08 so that can happen if any other deployment is adding things which we told devstack/tempest not to and things fail
16:08:22 but then we could use 'swift' in SERVICE_ENABLED for that, as I said before
16:08:43 tosky: yes and do not run test if you do not need.
16:09:11 tosky: I mean enable it in tempest and do not run test
16:09:26 tosky: https://github.com/openstack/devstack-plugin-ceph/blob/master/devstack/lib/ceph#L115=
16:09:33 i was hoping there would be something like that
16:09:40 which enables or disables the RGW
16:09:45 that it could check
16:09:48 sean-k-mooney: yes, but that's not the problem
16:09:58 tempest shouldn't need to check that value
16:09:59 oh there is ENABLE_CEPH_RGW
16:10:18 right it could use the keystone catalog
16:10:32 i guess it is and comparing it to its enabled services
16:10:38 based on its config
16:11:04 so you could add a test_config section to the devstack_plugin_ceph
16:11:10 it's too late
16:11:12 which would enable swift
16:11:20 ok well post_config
16:11:29 it happens before any service is started
16:11:45 if you use post_config jobs can still override it with test config
16:11:50 too late :) the section in lib/tempest has no breaks between setting swift false and running verify-config
16:12:46 oh ok
16:12:51 well that seems simple to fix
16:13:03 hence the suggestion of redefining the meaning of 'swift'
16:13:03 just move verify-config to after test_config phase
16:13:08 or that
16:13:22 I was trying to find a minimally-invasive change
16:13:31 but it probably doesn't exist :)
16:13:34 so I'm here
16:14:28 this is where it is failing yes https://github.com/openstack/devstack/blob/676dcaf94487665882be048cfe1f3206d6807e0f/lib/tempest#L661=
16:16:04 so ya i think the simple solution would be to move that to its own function and run it either after post_config (and before test_config) preserving the current validation scope but allowing other plugins to modify the config
16:16:19 or put it after test_config having it verify the final config
16:17:11 need to disappear for a while, but open to any solution
16:17:11 right now it's happening in install https://github.com/openstack/devstack/blob/676dcaf94487665882be048cfe1f3206d6807e0f/doc/source/plugins.rst#pluginsh-contract=
16:17:50 moving it to extra or test-config would be what i would suggest
16:18:04 probably extra
16:19:40 thank you for the discussion, i need to end the meeting now :)
16:19:48 we may continue next time
16:19:56 #endmeeting
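
Note on the open discussion: the mismatch described there (an object-store endpoint is present in the catalog because RGW registered it, while the swift flag was set to false because s-* is disabled) can be illustrated with a small sketch. This is not the code of the real verify-config tool. The function name, the mapping table, and the data shapes are hypothetical and chosen only for illustration; the only facts taken from the log are that the tool compares the tempest config with what the cloud exposes and fails on a mismatch.

```python
# Hypothetical mapping from a Keystone catalog service type to the
# service flag it is compared against (illustrative subset only).
CATALOG_TYPE_TO_FLAG = {
    "compute": "nova",
    "image": "glance",
    "object-store": "swift",
}


def find_mismatches(catalog_types, service_flags):
    """Return (flag, in_catalog, configured) for every flag that disagrees
    with the catalog. Both arguments are plain Python values here; a real
    tool would read them from the cloud and from the config file."""
    mismatches = []
    for service_type, flag in sorted(CATALOG_TYPE_TO_FLAG.items()):
        in_catalog = service_type in catalog_types
        configured = service_flags.get(flag, False)
        if in_catalog != configured:
            mismatches.append((flag, in_catalog, configured))
    return mismatches


# Situation from the failing job: s-* is disabled, so the flag is false,
# but RGW has registered an object-store endpoint.
catalog = {"compute", "image", "object-store"}
flags = {"nova": True, "glance": True, "swift": False}
print(find_mismatches(catalog, flags))  # [('swift', True, False)]

# If the check runs only after a later phase has set the flag to true,
# the same catalog no longer produces a mismatch.
flags["swift"] = True
print(find_mismatches(catalog, flags))  # []
```

The second call corresponds to the suggestion of running the verification after plugins have had a chance to adjust the config. Redefining 'swift' to mean "a swift-like interface is available" would instead change how the flag is computed in the first place.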