Tuesday, 2022-04-26

clarkbanyone else here for the meeting? We'll get started in a couple of minutes18:59
fungisounds like a blast, count me in19:00
ianwo/19:00
clarkb#startmeeting infra19:01
opendevmeetMeeting started Tue Apr 26 19:01:18 2022 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:01
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:01
opendevmeetThe meeting name has been set to 'infra'19:01
fricklero/19:01
clarkb#link https://lists.opendev.org/pipermail/service-discuss/2022-April/000333.html Our Agenda19:01
clarkb#topic Announcements19:01
clarkb#link https://lists.opendev.org/pipermail/service-announce/2022-April/000037.html Sent an OpenDev Update19:01
clarkbWanted to call out this email I sent. Many of your reviewed it too.19:02
clarkbBasic idea is to try and expose more of the user facing things we have done and plan to do on a ~monthlyish time frame19:02
clarkbHopefully in the next one we'll be able to talk about how jammy is now our jam and gerrit 3.5 upgrade is done or will be shortly :)19:02
clarkbAlso worth noting that the ELK services have been removed and the servers have been shut down19:03
clarkbif anyone is looking for that tooling the openstack project has an opensearch that people cna look at instead19:03
clarkb#topic Actions from last meeting19:03
clarkb#link http://eavesdrop.openstack.org/meetings/infra/2022/infra.2022-04-12-19.01.txt minutes from last meeting19:03
clarkbIts been a couple of weeks since we had a meeting, but no actions to go over.19:03
clarkb#topic Topics19:03
clarkb#topic Improving OpenDev CD Throughput19:04
clarkbI've reviewed most of the Zuul ansible upgrade and security stance change stack19:04
clarkbthe last change I need to review in that stack is the 2k line change that adds ansible 5 support.19:04
clarkbOverall it is basically as expected. I did ask that the bwrap isolation get a bit of testing in the zuul test framework so that we are confident in some of the assertions we are makign with bwrap and those tests got added19:05
clarkbI would encourage others to at least familiarize themselves with the major aspects of the change and compare that against their understanding of our CD model to ensure there aren't any major gaps we need to plug19:06
clarkbBut reviewing the 2k line change is probably overkill for everyone :)19:06
ianw++ thanks for keeping a close eye on it from an opendev infra perspective19:07
fungito be fair, it's mostly file deletions ;)19:07
clarkbfungi: not the ansible 5 support addition19:07
clarkbits like +1.8k-20019:07
clarkbanything else to add on this topic?19:08
fungioh, sorry, i meant the ansible fork removal19:08
clarkb#topic Container Maintenance19:09
clarkbI did some work around this recently to trim our base images down to a more manageable set19:10
clarkbAll of our buster base images are now not being updated as they were removed fro mconfig management19:10
clarkbdocker hub will keep hosting the builds of the buster images until they get timed out due to no use and get deleted19:11
clarkbI also removed the python3.7 bullseye images as all our stuff builds against python3.8 or newer now19:11
clarkbAnd finally python3.10 images were added.19:11
clarkbIf you notice anything using buster (I checked and moved what I could find to bullseye) that should get updated to bullseye instead of buster19:12
clarkbThen once we've got jammy up (to be discussed soon) and things have python3.10 testing we can start shifting to the new python3.10 images19:12
clarkb#topic Spring Cleaning Old Reviews19:12
clarkb#link https://review.opendev.org/q/project:opendev/system-config+status:open+topic:system-config-cleanup could use second reviews19:12
clarkbThat topic could use second reviews on a few changes. I think ianw reviewed most/all of what remains19:13
clarkbAlso if you've got a spare half hour looking at your old changes and abandoning anything no longer relevant or stale DNM changes etc would help trim the list down19:13
clarkbI've got a number of repo retirements proposed at https://review.opendev.org/q/topic:retire-elk19:14
clarkbfungi: ^ last time I did a bunch of retirements you used gertty to mass abandon changes on the repos. Any chance you'd be willing to do that for this set here?19:14
clarkb#link https://review.opendev.org/q/topic:retire-elk More repo retirements which means more change abandonments19:14
fungigladly!19:14
clarkbthanks!19:15
fungii'll put that on my agenda for after dinner19:15
clarkb#topic Support for Jammy Jellyfish19:15
clarkbThanks to the hard work of frickler we've got jammy jellyfish images built with nodepool and the instances boot (at least on some clouds)19:16
fungijam and jelly are similar, but not the same19:16
clarkbThe mirroring of updates and security repos is possibly not happy based on some checking that frickler has done so that may need investigating19:16
clarkbWe've also not mirrored arm64 jammy in ubuntu-ports yet or the docker repo19:16
clarkb#link https://review.opendev.org/c/opendev/system-config/+/839422 Docker package mirroring for Jammy19:16
fungiworst case, we could probably rebuild mirror-update.o.o on jammy19:17
frickleralso need to set up wheels builds I guess19:17
fungiyeah, wheel builds will need working nodes of course19:17
clarkbfrickler: ++ I suspect there will be a number of these small things that we need to address over time. We don't need to solve it all at once19:17
clarkb#topic Pruning AFS to make room for Jammy ports and more19:18
fricklerwhom should I nag about getting a nodepool release with the fixes needed to build jammy? my comment in #zuul hasn't received much attention19:18
clarkbfrickler: corvus  typically handles those19:18
clarkbFor the topic of jammy ports mirroring I think we may need to make more room on the openafs fileservers19:19
clarkbrecently I did a bunch of pruning of our various rpm mirrors and that actually got us about 320 GB iirc19:19
clarkbcurrently jammy x86_64 is only using about half that so there isn't significant pressure to clean things up but its good hygiene19:19
clarkbOne thing fungi and noticed today is that we mirror source pacakges for ubuntu and debian repos19:20
clarkbbut basically nothing should use source packges in our ci system and if a one off needs it pulling from upstream is probably ok19:20
clarkb#link https://review.opendev.org/c/opendev/system-config/+/839421 Drop source pacakges for ubuntu ports mirror19:20
ianwfrickler: i'm looking at nodepool now, unfortunately gate looks broken with boto3 changes.  i'll look into it today19:20
clarkbThis is a change to try removing source packages from the ports repos as a first step19:20
clarkbif that ends up being helpful and not problematic we shoudl do the same for ubuntu and debian mirrors19:20
ianw#link https://review.opendev.org/c/opendev/system-config/+/83763719:21
clarkbOne thing to note is that the zuul-jobs configure-mirrors role configures deb-src entries for debian hosts (but not ubuntu)19:21
ianwi have that to cleanup some extra fedora too19:21
clarkboh cool19:21
clarkbI thought that had landed but I guess not yet19:21
clarkbI figure once we've removed source packages we can take stock and decide if removing xenial is necesasry or if we should add more disk to the fileservers19:22
ianwi can watch it through today, didn't want to walk away from it incase it deleted more than i hoped :)19:22
clarkbadding disk is not a problem now that we have deleted the elasticsearch volumes19:22
clarkbbut if we don't need to add disk that is preferable19:22
ianwi guess we also have f36 coming soonish; i think maybe we currently only carry one fedora because 34 was broken19:22
clarkbso ya, lets try cleaning up fedora a bit more and clean out source packages and see where we are19:22
clarkbianw: yup that seems correct based on my memory19:22
fricklerdo we have a plan to retire xenial? that would also have the potential for some cleanup19:22
fungi#link https://review.opendev.org/839421 Drop source package mirroring for ubuntu-ports19:23
fricklerand it is eol for quite some time now19:23
clarkbfrickler: yes, it was brought up with the openstack tc as they apparently use it for old stable branch jobs19:23
clarkbthe main issue is that opendev is using it too19:23
fungioh, clarkb already linked that earlier19:23
fungi#undo19:23
clarkbfrickler: I think that we should figure out what of oepndev's stuff would potentially break with xenial removal and if nothing super important remains (puppet jobs?) we can remove it and switch those jobs to upstream mirrors19:24
clarkband communicate to openstack etc that they are going away19:24
fungiworth noting, the frequency of openstack stable branch jobs which use xenial is fairly small, we could look into swapping out the mirror urls for xenoal with official mirrors19:24
clarkbfungi: ++19:24
fricklerthe issue might be that upstream repos no longer exist?19:24
clarkbubuntu keeps them around unlike red hat distros19:24
fungioh, good point, might need paid support to get to them19:24
clarkbthere is a secondary qusetion of when do we remove the images themselves but this can be a staged removal starting with the mirror19:25
clarkbfungi: you don't19:25
fungiright, okay, so only if you want newer updates than when they reached eol19:25
clarkbubuntu keeps really old package mirrors up for a really long time19:25
clarkbyou just don't get security updates etc19:25
clarkbyes, that is my understanding19:25
clarkbBut ya doing a staged removal starting with the xenial mirror and "fixing" what breaks to use the upstream mirrors makes sense to me19:26
clarkbthe major thing is identifying what would break in opendev land if that happened just to avoid wedging ourselves19:26
clarkband then we can check if adding disk to afs makes sense after all the cleanups19:26
ianwfungi: on 839421; it looks to me that configure-mirrors is by default adding deb-src lines?19:27
clarkbianw: only for debian not ubuntu if I read the ansible correctly19:28
clarkbianw: they each use different source.list templates19:28
clarkb(why I do not know)19:28
clarkband yes removing deb-src from the debian source.list templates would be a good next step too19:28
clarkbor making it a toggleable flag19:28
ianwhrm, isn't ubuntu pulling in from https://opendev.org/zuul/zuul-jobs/src/branch/master/roles/configure-mirrors/tasks/mirror/Ubuntu.yaml#L10 ?19:30
ianwi guess checking on an actual node would be easy ...19:30
clarkbianw: yup which is https://opendev.org/zuul/zuul-jobs/src/branch/master/roles/configure-mirrors/templates/apt/etc/apt/sources.list.j219:30
clarkband that file doesn't have deb-src in it19:30
clarkbhttps://opendev.org/zuul/zuul-jobs/src/branch/master/roles/configure-mirrors/tasks/mirror/Debian.yaml#L10-L14 those all have deb-src19:30
fungidon't even need to check a node, just look at random job output19:31
ianwoh how confusing, ubuntu uses sources.list.j2 and debian sources.list.d19:32
clarkbyup19:32
fungihttps://zuul.opendev.org/t/openstack/build/e88573e7fad746e798fb6b9e1ad47d9b/console#1/0/16/ubuntu-focal19:32
fungiit only "get"s the deb lines, no source19:33
ianwok, thanks19:33
clarkbanyway I think we've got a few good avenues forward here. Will just take some time to see what sort of impact we get out of changes and make our next decisions based on that19:33
clarkbAnything else on the topic of AFS disk pruning?19:33
fungier, i meant to link https://zuul.opendev.org/t/openstack/build/e88573e7fad746e798fb6b9e1ad47d9b/console#0/4/23/ubuntu-focal19:34
ianwcool, i've approved that, but i have a feeling we may need a manual run to actually delete things19:35
clarkbianw: yup that was another concern. We're learning as we go :)19:35
clarkbI suppose if that happens rm'ing all the .xz files isn't too hard with find. Just annoying19:35
ianwi'll have to look it up, but when we've dropped things before there's some clearrepo type commands ...19:36
clarkbianw: in ubuntu-ports I think we can clear out the xenial cruft too19:37
clarkbwe don't have xenial packagesi n the pool in ports but do have the indexes which are now stale and not related to an actual mirror we have19:37
ianwit was debian-security from my notes when we did some cleanups19:37
fungithe idea behind 839421 is that we'll be able to see if it does or doesn't remove the source packages after the next run19:37
ianwand the stretch removals19:37
fungiyes, removing entire releases defniitely included manual commands19:38
funginot sure if the same goes for removing architectures19:38
ianwlooks like i didn't log *what* i did, just that i did something :) https://meetings.opendev.org/irclogs/%23opendev/%23opendev.2021-11-12.log.html#t2021-11-12T00:54:3219:39
clarkbheh19:39
ianw"ianw: #status log debian-stretch has been yeeted from nodepool and AFS mirrors"19:39
clarkbAlright we dno't have to solve the details of cleanup now. I'm happy to help work through it though. Just let me know19:39
clarkblets move on19:40
ianw++19:40
clarkb#topic Deleting the subunit2sql MySQL Trove instance19:40
clarkbAs mentioned earlier I deleted the ELK services and servers. At the same time I cleaned up the openstack health and subunit worker servers that were shutdown a few months ago19:40
clarkbThe health and subunit workers sat in front of a subunit2sql mysql trove instance in rax19:40
clarkbnothing is using that database right now and it is quite large: 286GB19:40
fungii can't speak for the qa team, but it seems like that data is getting increasingly stale so keeping it around is unlikely to be much benefit19:40
clarkbya the questions is can we just yeet the trove db entirely and delete it?19:41
clarkbdoes anyone here have concerns withdoing that?19:41
clarkbor do we need to make a backup of the data first? I lean towards deleting it since no one was using the service for a while after it broke which is what led us to turning it off19:41
clarkbI can triple check with gmann before deleting it too if we are happy with doing so ourselves19:41
* frickler had to look up "yeet" in a dictionary19:42
fricklerbut no objection19:42
fungii hope it was a recent dictionary19:43
fungidict(1) doesn't return any results19:43
clarkbianw: if you don't have any objection I'll take that as the plan to gmann (just a straight delete)19:43
fricklerduckduck found some urban dict refs19:43
ianwyeah that seems fine, i can't imagine there is anything useful now19:44
clarkbgreat, I'll proceed with that being the plan and double check with gmann. Thanks everyone!19:44
clarkb#topic Open Discussion19:44
ianwmy urban dictionary is a 10yo boy :)19:44
clarkbAnd now a few minutes for anything else we might have to bring up19:44
fricklerjust in case you hadn't seen it, nova has dropped py36 support19:45
fricklerso devstack is failing on centos 8 now19:45
fungii expect the rest of openstack to follow suit before the end of zed (and hopefully before milestone 1)19:45
ianwyeah i saw the removal change, i think 9-stream is in a state to take over19:45
ianw(from an infra pov, builds are stable)19:46
fricklerwell I still think either is stable in devstack, but that's another discussion19:46
clarkbsupposedly centos 9 stream should be better about addressing issues than 8 is/was19:46
fungiapparently rhel 9 is still in beta19:46
fricklers/either/neither/19:46
clarkbI think that if projects want a long term stable python 3.6 then rocky linux may be the thing we offer best suited to that19:47
clarkbbut sounds like they are in a hurry to leave 3.6 behind so not a big deal for us19:47
ianwwell except for old branches, but centos jobs have never run there i don't think19:48
clarkbI pushed up a gerrit 3.5 stack of changes at https://review.opendev.org/c/opendev/system-config/+/839250/ and child19:49
clarkbI noticed our builds were a bit out of date compared to upstream and also wanted to set a config option bsed on upstream ml discussion to improve memory use19:49
ianwoh good, probably can start thinking about preparing for that upgrade (checklists, etc.)?19:50
clarkbya thats my goal to try and start actively planning the upgrade19:50
clarkbOne thing I wanted to check is if 3.5 can downgrade to 3.419:50
ianwi'm probably in a better position to drive an upgrade at some point soon-ish than i have been previously this year, if we like19:50
clarkbsince that has been a nice get out of jail card we have had for the last couple of upgrades19:51
clarkbianw: that would be great. I'm also happy to help as I've been more plugged into upstream lately19:51
fungiyeah, i'm always happy to pitch in as well19:52
ianwok, we've had downgrade instructions in the checklists before, so maybe we should start a new checklist page as a first step?19:52
clarkbianw: ++19:52
ianwi can put that near the top of the todo19:52
ianw#link https://review.opendev.org/q/topic:loop-no-log-json19:52
ianw^ that's a couple of changes after we found that yum/dnf/package+yum/dnf was not showing up in console19:52
ianwzuul console.  i've added a test-case so hopefully that addresses your comment clarkb?19:53
clarkbcool I'll take a look and see19:53
ianw#link https://review.opendev.org/c/zuul/zuul/+/83745819:54
ianwis another low-priority one that a user reported problems with their substitution strings and turns out i think zuul can do better19:54
clarkbI'll do my best to get to the various reviews but I promised corvus that ansible 5 zuul chagne review today so that will be the priority.19:56
ianwno probs, just background things compared to that :)19:56
clarkbAnd that takes us basically to the end of our hour. I'm hungry so I'll call it here :)19:57
clarkbthank you everyone. As always feel free to reach out in #opendev or at service-discuss@lists.opendev.rog19:57
clarkbexcept its opendev.org not opendev.rog19:57
clarkb#endmeeting19:57
opendevmeetMeeting ended Tue Apr 26 19:57:34 2022 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)19:57
opendevmeetMinutes:        https://meetings.opendev.org/meetings/infra/2022/infra.2022-04-26-19.01.html19:57
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/infra/2022/infra.2022-04-26-19.01.txt19:57
opendevmeetLog:            https://meetings.opendev.org/meetings/infra/2022/infra.2022-04-26-19.01.log.html19:57
fungithanks clarkb!19:57

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!