Tuesday, 2022-01-18

clarkbHello, it is weekly meeting time19:00
clarkb#startmeeting infra19:01
opendevmeetMeeting started Tue Jan 18 19:01:08 2022 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:01
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:01
opendevmeetThe meeting name has been set to 'infra'19:01
clarkbI expect this meeting to be another quicker one as we're all still sort of rolling into the new year19:01
ianwo/19:01
clarkb#link https://lists.opendev.org/pipermail/service-discuss/2022-January/000314.html Our Agenda19:01
clarkbBut we do have an agenda to run trhough19:01
clarkbianw: good morning!19:01
clarkb#topic Announcements19:01
clarkbService coordinator nominations run January 25, 2022 - February 8, 2022 (that starts next week)19:01
clarkbAnother reminder that service coordinator nominations will begin soon19:02
clarkbOpenInfra Summit CFP and programming committee need your input: https://openinfra.dev/summit/19:02
clarkbthe OpenInfra Summit CFP is accepting papers and if you'd like to help program the event you can nominate yourself to be on the programming committee19:02
clarkbthere are a number of tracks including a CI/CD track that we might find interesting19:02
clarkbDeadlines for both are fast approaching (I think early february for CFP and end of the week ish for programming committee?019:04
clarkb#topic Actions from last meeting19:04
clarkb#link http://eavesdrop.openstack.org/meetings/infra/2022/infra.2022-01-11-19.01.txt minutes from last meeting19:04
clarkbLast meeting was pretty easy going. No actions recorded19:05
clarkb#topic Topics19:05
clarkb#topic Improving OpenDev's CD throughput19:05
clarkbI have on my todo list an item that says I should review this spec but I haven't yet19:06
clarkb#link https://review.opendev.org/c/opendev/infra-specs/+/821645 -- spec outlining some of the issues with secrets19:06
clarkbinfra-root ^ would be good to look at that in the near future. I'll try to prioritize it myself once I've flushed a bit more of the backlog (getting really close)19:06
clarkbianw: anything else to add on this topic? I suspect you are in backlog catch up mode too and this may not be near the top19:07
ianwyeah nothing more on this one yet19:07
clarkb#topic Container maintenance19:08
clarkb#link https://etherpad.opendev.org/p/opendev-container-maintenance19:08
clarkbGood news: all buster based images should be updated to bullseye now. Thank you to everyone who helped me push that along19:09
fungicontrats!19:09
clarkbI've got a change that came out of that to stop running a limnoria fork at https://review.opendev.org/c/opendev/system-config/+/821331 which I updated today to consume from a tag instead of tip of master19:09
fungiand thanks for all the work on it19:09
clarkbI think that might want to be landed on a friday when there are fewer meetings19:09
ianw++ thanks for pushing that through!19:10
clarkbjentoio I'll be reaching out sometime soon to start looking at the dedicated user work with our containers as that is the next thing on the container todo list19:11
jentoiosounds good, thanks19:12
fungii'd love to find the time to work on deploying the storyboard containers we're already building19:12
clarkbThere are also a few other large todos on that etherpad including zookeeper upgrades, mariadb upgrades, and intermediate registry pruning if anyone gets bored and wants to take a look :)19:12
fungialso we'll have container deployment work involved in the upcoming mailman3 effort19:13
clarkbfungi: yup. Re storyboard converting it to a newer mariadb may be a good idea as a first step on the mariadb upgrades planning19:13
fungioh, in the dockerfile?19:14
clarkbI think the zk upgrade may be as simple as shifting the version ahead and letting ansible do its thing as newer zk is supposedly compat with older19:14
clarkbfungi: in the docker compose file19:14
fungier, that's what i meant, of course ;)19:14
clarkbfor zk we should probably test that assumption a bit before going ahead with it though19:14
clarkbAnyway that was all I had. Happy to help others that dig into this and I plan to pull tasks off the list as I'm able myself19:15
clarkb#topic Nodepool image cleanups19:15
clarkbTumbleweed is gone. Haven't heard any complains19:15
clarkbCentOS 8 removal has begun. I pushed up a bunch of changes to infra related repos as well as openstack requirements and zuul-jobs to remove the use of centos819:15
clarkbWe should no longer be updating the wheel mirror for centos 8 as well19:16
clarkbI sent email to service-discuss and openstack-discuss warning people and some specific projects of our plan to remove centos 819:16
clarkbThose emails indicated we would remove the label from nodepool/zuul entirely around the end of the month19:17
clarkbOverall it seems people have been receptive and are converting jobs to centos 8 stream or removing centos entirely. Which is basically what we were looking for so yay19:17
clarkbHowever, `ping` is currently broken for non root users on centos 8 stream. Something to be aware of if conversion happen and don't work. That distro needs to revert its most recent iputils package update or update systemd to systemd-239-55 to correct this19:18
clarkbA couple of projects like devstack and tripleo have worked around this by adding sysctl to mimic the system-239-55 update19:18
clarkbianw: for fedora 34 frickler indicated that neutron is using that platform for testing. Do you know what is required to move them ahead to fedora 35?19:19
clarkbfedora 34 is the other distro in nodepool that would be good to remove in the near future as that one doesn't boot reliably19:20
fungiyeah, i think we have it booting successfully in only two providers?19:20
ianwi think we need to go through a few bootloader things and cut a dib release19:20
clarkbianw: to remove fedora 34 or to fix it?19:21
ianwsorry, to fully fix fedora 35 booting and keep 9-stream happy, etc.19:22
ianwjust context switching back in, sorry19:23
clarkbGotcha general rhel system improvements then. But no reason to keep fedora 34 around if we can sort out neutron? and even then fedora has a short shelf life so neutron may need to accept it is going away soon either way19:23
ianwso f35 wasn't booting, https://review.opendev.org/c/openstack/diskimage-builder/+/818851 fixed that and i think is released now19:23
ianwno, it's just merged19:24
ecsantos[m]Oh, sorry to butt in, just a quick note about openSUSE. SUSE dropped OpenStack in 2019: https://www.zdnet.com/article/suse-drops-openstacks/19:25
ecsantos[m]That's why the mirrors were broken, apparently. I have a change semi-ready to address it19:25
ianwit caused a regression in 9-stream "centos" element -- fixed with https://review.opendev.org/c/openstack/diskimage-builder/+/82177219:25
ianwthere's a couple of other changes in the queue related now19:26
ianwso i think we go through those and try to find a coherent point to make a 3.17.0 release.  that should free up the fedora work19:27
clarkbianw: sounds good, thanks19:27
clarkb#topic Scheduling Gerrit Project Renames19:27
clarkblast week we penciled in January 24 for this as that seemed to fit the openstack release cycle well and shouldn't interfere with zuul much either19:28
clarkbfungi: I think you were thinking later in the day UTC time. Like 22:00 UTC?19:28
fungii plan to be on hand all day, so am happy to help as long as it's while i'm still awake19:28
fungivery early like in the 00:00-03:00 UTC range would also work for me19:29
ianwshould we perhaps schedule the 3.4 upgrade at the same time?19:29
ianw(i need to finish the donwgrade/revert section, but i think it's good to go)19:30
clarkbI'm happy to try the 3.4 upgrade around then too. But is ~6 days enough advancewarning for a gerrit upgrade?19:30
clarkbI guess the delta here is much smaller. The biggest gap is probably zuul working with gerrit19:30
fungiit seems tight, but i'm okay with the idea19:30
clarkbIf we do it that way I think we do the renames first19:30
clarkbthat way if we rever the 3.4 upgrade we'll have completed the renames19:30
ianw++19:31
fungialso we should plan enough time in the window to test and roll back19:31
clarkbfungi: ++19:31
clarkbianw: when is a good time to start all that for you?19:31
clarkbis 22:00 UTC too early?19:31
ianwno, that is fine and probably is better for EOD in US time19:32
ianw(than earlier)19:33
clarkbok I can put together an email today announcing that we'll plan to rename projects starting at 22:00 UTC January 24 as well as upgrade gerrit to 3.4 with an expected end time around 00:00 UTC (I think 2 hours should be enough, an hour for each action)19:33
ianw#link https://etherpad.opendev.org/p/gerrit-upgrade-3.419:34
clarkbinfra-root ^ does that proposed plan seem reasonable?19:34
fungiwfm19:34
ianwsounds excellent, thanks19:34
clarkbalright I think we can go to the next topic then19:34
clarkb#topic Spring Cleaning for Old Reviews19:34
clarkbfrickler: put this on the agenda and poinst out that repos like system-config have >300 open reviews most of which are more than a year old and have merge conflicts19:35
clarkbshould we (auto) abandon these?19:35
clarkbMy initial thought is that maybe we should all try to spend an hour going through the backlogs and manually prune but that is a lot of effort. Maybe we can do a better breakdown and see how many are puppet era and abandon those at least (since we aren't really puppeting anymore)19:36
clarkbthings from a more modern ansible+docker era are likely worth double checking for relevance before abandoning19:37
ianwi'm not opposed19:38
ianwi also see old things as "ideas that may be kind of valid but never quite floated to the top at the time they were proposed, but may one day" and it doesn't bother me so much having them around19:39
clarkboff the top of my head anything >3 years seems very safe to abandon. Anything >2 is probably fine too. But I see at least one change in my backlog from september 2020 that is probably worth another look at19:39
clarkbianw: yup that is my hesitation whcih is why I'm ok with clearing stuff that we are reasonably sure is related to the old puppet way of doing things19:40
clarkbsince they are very unlikely to apply any longer19:40
clarkbI guess think on this a bit and we can propose a plan next week? I do think that having cleaner gerrit dashboards would be nice for getting myself organized19:41
ianwagree19:41
clarkb#topic Open Discussion19:42
fungii guess if i see a notification from gerrit about anything i've proposed getting abandoned and i think it's still worthwhile, i'll revisit and restore it19:42
clarkbfungi: that seems reasonable too. We can always restore later19:42
clarkbMy Gerrit 3.3.9 update change just landed. ianw re your autohold may be worth rerunning anyway to ensure you're on the latest images for the revert testing19:43
clarkbianw: I do wonder if the autohold hit a scale otu scheduler bug. Might need to go look at zuul debug logs19:43
clarkbBut I bring this up because I'd like to do a much smaller gerrit restart today. Probably after the kids get back from school this afternoon and things are quiet19:43
fungialso i'm getting ready to turn the jitsi-meet servers back on once the meeting wraps (new stable images without any log4j in them were finally tagged a few hours ago)19:44
clarkb\o/19:44
clarkbI'll give this a few more minutes before calling it a meeting.19:44
jentoioa comment on opendev docs, specifically opendev/system-confg19:45
clarkbjentoio: go for it19:45
jentoioas I read them I'm finging lots of outdated info. For example, major systems, logstash19:46
jentoioI guess its fine to update them as I find outdated info, is my question19:46
fungifor logstash, probably better to not spend time updating those docs19:46
jentoiois it worth working on19:47
clarkbjentoio: yup pushing changes to update out of date info is greatly appreciated. Even if you just push a note warning that indicates the info is out of date19:47
fungiwe've got an active effort underway which should result in us removing all the logstash docs anyway19:47
clarkblogstash is a special case beacuse we are in the process of removing it entirely. But in general pointing things out with a warning blob or updating out of date info to be correct would be great19:47
fungithough sure, a warning banner on the logstash docs can't hurt19:47
jentoiolots of lionks with http not http(s)19:47
fungiwe've added https to a lot of our services/sites recently, so yes any of them that work via https would be great to switch in hyperlinks19:48
jentoiookay, i'm hearing that pushing updates to updated docs is fine19:48
clarkb++19:48
jentoioexcept logstash19:48
fungiianw's design for broad automation of lets encrypt has allowed us to be far less choosy about what gets a cert19:48
clarkbSounds like that may be all. Thank you everyone. I'll give you about 10 minutes back for $meal :)19:50
clarkbI myself need some lunch19:50
clarkb#endmeeting19:51
opendevmeetMeeting ended Tue Jan 18 19:51:02 2022 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)19:51
opendevmeetMinutes:        https://meetings.opendev.org/meetings/infra/2022/infra.2022-01-18-19.01.html19:51
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/infra/2022/infra.2022-01-18-19.01.txt19:51
opendevmeetLog:            https://meetings.opendev.org/meetings/infra/2022/infra.2022-01-18-19.01.log.html19:51

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!