14:59:39 #startmeeting openstack_ansible_meeting 14:59:39 Meeting started Tue Oct 3 14:59:39 2023 UTC and is due to finish in 60 minutes. The chair is noonedeadpunk. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:59:39 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:59:39 The meeting name has been set to 'openstack_ansible_meeting' 14:59:45 * NeilHanlon needs more coffee 14:59:48 #topic rollcall 14:59:55 Yeah, apparently me too :D 14:59:58 hehe 15:00:02 we're just a bit early today 15:00:36 hey ! 15:01:01 \o/ 15:01:06 o/ hello 15:01:15 hi! 15:01:28 #topic bug triage 15:01:30 To be frank - I in fact don't have much updates/to say from the last meeting. Was mostly occupied with internal things 15:01:44 But, I've submitted couple of new bugs we've found during the upgrade 15:02:53 But lets start from this one 15:02:56 #link https://bugs.launchpad.net/openstack-ansible/+bug/2036421 15:03:17 I;ve replied to it, but I'm not sure if we wanna do smth there or just mark as invalid 15:03:30 I want to see if Horizon has been translated into Farsi or not?! 15:03:34 As I see how this release note can be confusing, but I'm not sure what can be done 15:04:18 openstack-ansib: you can see horizon translations here: https://translate.openstack.org/iteration/view/horizon/master?dswid=7334 15:05:14 regarding bug - I'm not sure what we can do at this point... We could add another release note back in the days, but not I guess it's useless 15:05:20 *now 15:06:40 So I guess I will mark it as invalid if no objections 15:07:02 that seems reasonable 15:08:29 #link https://bugs.launchpad.net/openstack-ansible/+bug/2037554 15:08:35 this one is actually really bad 15:08:46 and it's not limited to what is described there... 15:09:07 I've catched another phase of this bug yesterday 15:10:05 should we make that a separate play? 15:10:05 So it failed on us when we were deploying new compute node 15:10:13 just to make it simple 15:10:31 As it attempted to do db migration against compute 15:10:40 jrosser: yeah, I was thinking exactly about that tbh 15:11:08 except we have quite some complexity in the playbook for nova... I guess that stopped me from doing this originally 15:12:41 as I was thinking to add some post_tasks or do tasks_from, but that doesn't look too trivial with how we do things here https://opendev.org/openstack/openstack-ansible/src/branch/master/playbooks/os-nova-install.yml#L54-L58 15:12:43 There is another bug here 15:12:45 https://bugs.launchpad.net/osops/+bug/2038315 15:13:26 Ok, moved it to the correct project 15:14:22 common-playbooks/nova.yml is kind of nasty too 15:14:24 sorry for not describing bugs in a good way - was pretty much in a rush and needed not to forget about them 15:14:42 as it attempts to be common, then a whole bunch of the tasks are conditional for computes only 15:14:46 yup, it pretty much is 15:14:47 so that could be simplified too 15:14:57 like too much optimisation there makes a mess 15:15:09 I wonder if we need to have that common playbook 15:15:18 true 15:15:32 perhaps some tidying here would make a neat place to deal with that bug 15:15:36 as it may be way more plain when moved to os-nova-install 15:15:50 ok, sounds like a plan then 15:16:17 hopefully, it will be backportable enough... 15:16:23 #link https://bugs.launchpad.net/openstack-ansible/+bug/2038315 15:16:40 So, we have some stable branches borked due to ansible.netcommon breaking galaxy 15:16:55 I didn't have time to look there today, but we're catching failures internally as well 15:17:01 Except 2023.1 - it's working fine 15:17:12 (as we moved to github from galaxy there) 15:17:24 ayush: did workaround I've posted you helped? 15:17:46 ie - create user-collection-requirements.yml with https://paste.openstack.org/show/bC5oFVEOBoOdB9yMb9Dg/ ? 15:18:14 i can probably find some time to look at the galaxy problem today/tomorrow 15:18:17 damiandabrowski: actually, you've catched that yesterday as well 15:18:42 what actually is the bug? 15:18:55 that bootstrap fails of some stable branches 15:19:12 oh sure but is there a new netcommon that breaks our old stuff? 15:19:15 because it can't install ansible.netcommon for $reasons 15:19:20 ahha ok 15:19:30 yeah, I believe it just breaks galaxy overall 15:20:00 I believe that's related to them having `v` in tags, while not having in versions or smth like that 15:20:14 (as ansible wasn't happy either in some scenarios) 15:20:31 "unexpected error when getting available versions of collection" 15:20:38 such a helpful message 🙃 15:20:46 Skipping Galaxy server https://galaxy.ansible.com/api/. Got an unexpected error when getting available versions of collection ansible.netcommon 15:20:47 yeah 15:21:18 #link https://github.com/ansible-collections/ansible.netcommon/issues/562 15:21:24 @noonedeadpunk : i have not tried, i will try and let you know the results by tmrw 15:21:29 mrng 15:22:12 But, I see no new commits in collection 15:22:21 And web ui of galaxy doesn't seem broken 15:22:26 https://galaxy.ansible.com/ui/repo/published/ansible/netcommon/ 15:22:45 So... it's either already fixed... Or API != UI 15:22:50 ope 15:22:51 # https://github.com/ansible/awx/issues/14496 15:22:56 this is "galaxy ng" isnt it? 15:22:57 #link https://github.com/ansible/awx/issues/14496 15:23:13 "We've had similiar issue today but with 'ansible.netcommon' collection" 15:23:20 " Removal of default Galaxy credential from organisation settings helped us mitigate the problem." 15:23:24 > However the old-galaxy.ansible.com settings seems to have solved the issue for us ;-) 15:23:40 Lol 15:23:53 * NeilHanlon adds liquor to his coffee 15:23:59 This reminds me of some company.... that maintains CentOS Stream... 15:24:08 What the name was that... 15:24:12 Anyway 15:24:38 Might be we can replace galaxy with github or just wait for the fix... 15:25:17 OR, we need to update ansible version... 15:25:24 i'm gonna ping greg sutcliffe.. get his thoughts :) 15:26:52 Yeah, would be nice to hear 15:26:57 seems like new ansible won't help 15:27:05 #link https://bugs.launchpad.net/openstack-ansible/+bug/2037649 15:27:08 #link https://forum.ansible.com/t/awx-cannot-download-collections-roles-from-the-new-galaxy/1251/4?u=neil 15:27:52 that looks really messy... 15:28:11 What makes it worse, that in branches like Xena we also had issues with rabbitmq versions... 15:28:19 :( 15:28:19 yeah 15:29:09 so 2037649 is another race condition bug, but thankfully it affects only vpnaas 15:29:09 rabbitmq is a mess.. 15:29:29 jamesdenton: you might be interested in https://bugs.launchpad.net/openstack-ansible/+bug/2037649 as well 15:29:58 I will check what's the best approach, but I guess it would be change of the destination directory to the one inside the venv 15:30:10 so that symlinking would be not a problem 15:30:18 #topic office hours 15:30:34 I failed to book a PTG slots until now :( 15:30:42 Will try to do that during this week 15:31:11 Another thing is that we likely should switch from master to tracking stable/2023.2 since all projects have branched already 15:31:40 Will propose patches soonish 15:31:48 #action noonedeadpunk book PTG slots 15:32:03 do we have a TODO list for bobcat 15:32:03 #action noonedeadpunk propose patch to track 2023.2 15:32:10 like actual patches to chase and deal with 15:32:19 seems to be many many open 15:32:54 i think we also need to give the CI some tlc 15:33:11 jobs seem to be running slowly, upgrades are TIMEOUT regularly atm 15:33:32 I think it's Debian 12, quorum queues at minimum 15:33:39 then there is a fair number of failing to retrieve u-c with https 15:33:39 we can discuss rest, like Skyline 15:34:15 and I see a number of Rocky repo errors every day still 15:34:33 But what we tried to scope at the beginning of cycle is here 15:34:35 #link https://etherpad.opendev.org/p/osa-bobcat-ptg 15:35:02 Ah, we have openstack-resources role 15:35:26 #link https://review.opendev.org/q/topic:osa%252Fopenstack_resources 15:35:58 Ugh, it's failing with same galaxy issue 15:36:33 bleh 15:36:36 oh no curse of the tests repo 15:36:50 yeah-yeah... 15:37:01 I will look into rocky issues, jrosser. i know of a few lingering problems I'm trying to snuff out.. but also wondering based on what you're saying and I'm seeing if I just need to give the CDN a good ol' flush. 15:37:06 and modular libvirt would be also a thing nice to have 15:37:13 yea, i have an update on that 15:37:28 It's basically confirmed to be the only thing available in EL10 15:37:50 and ELN composes have begun for what will become CentOS Stream 10 15:37:58 https://composes.stream.centos.org/stream-10/ 15:38:31 so we need to get to bottom of that sooner or later anyway 15:38:49 And it seems that Debian is not going the same path, at least not with Debian 12? 15:40:32 yeah. i am going to schedule myself some time to come up with a plan, especially around how to migrate/upgrade to modular libvirt 15:41:38 well. if it's only EL10... 15:41:57 Maybe no migration needed - you still can't upgrade 9->10 15:42:08 and then we can wait for 10 for that 15:42:10 true, but i wonder about folks running mixed environments? 15:42:21 make up some conditions? 15:42:23 i.e., 9 and 10 15:42:41 maybe you're right--we can just not worry about it until we support EL10 15:42:43 like define services to restart in vars per distro 15:42:54 yeah 15:43:06 like we have already here: https://opendev.org/openstack/openstack-ansible-os_nova/src/branch/master/vars/redhat.yml#L19-L21 15:43:15 right right 15:43:25 But I bet it would be way-way more complicated.... 15:43:58 As I'm not sure I understand logic behind this modular thing and how to make tls/non-tls things to work... 15:44:10 And the order to start/restart 15:45:13 yeah 15:45:23 i'm not looking forward to it, honestly. lol 15:48:13 so, does the list of Debian 12, quorum queues support, and openstack-resource role (possibly) sounds good enough? 15:48:35 I was planning also to work on PKI improvement to add vault support, but I'm not sure if I will be able to fit 15:48:46 And also would better release earlier... 15:50:06 that sounds good to me 15:50:13 would rather less things we're more confident with 15:50:57 I will also try to work on documentation review. I've already posted some changes, inlcuding some OVN drawing here https://review.opendev.org/c/openstack/openstack-ansible/+/894384 15:51:07 which would be great to get some feedbacks on... 15:51:24 And was going to make openstack_user_config examples OVN-friendly 15:51:41 As I guess most users copying them and running into issues afterwards 15:53:10 ok, so if that's it 15:53:17 #endmeeting