Tuesday, 2023-09-26

clarkbI'm going to be a minute or three late to the meeting19:00
clarkbI'd liek to finish this train of thought with gerrit19:00
clarkb#startmeeting infra19:02
opendevmeetMeeting started Tue Sep 26 19:02:07 2023 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:02
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:02
opendevmeetThe meeting name has been set to 'infra'19:02
clarkb#link https://lists.opendev.org/archives/list/service-discuss@lists.opendev.org/thread/JUUNKEA2W2T4IA64DHBIYKUHBXOH7W3D/ Our Agenda19:02
clarkb#topic Announcements19:03
clarkbOpenStack is going to be making final release candidates this week and the actual Bobcat release should occur next week19:04
clarkbplease be aware of that as we make changes19:04
clarkb#topic Mailman 319:06
clarkbfungi: should we jump right into planning for the final mailman3 migration?19:06
fungisure, i had a proposed date on the etherpad... just a sec19:06
fungi#link https://etherpad.opendev.org/p/mm3migration19:07
fungiline 26519:08
fungitentative maintenance 15:30-19:30 utc on thursday 2023-10-12 (week after release)19:08
fungithat's for lists.openstack.org which is the only remaining site on the old server19:08
clarkbthat date works for me19:09
fungiif folks generally find that acceptable i'll announce it on openstack-discuss this week and start putting together a detailed maintenance plan based on the previous 6 sites we've migrated19:09
frickler+119:09
fungii've started a handful of notes below there on things we'll want to be mindful of for this specific maintenance, either lessons learned from earlier migrations or due to the size of the data19:10
fungii'll take those into account when drafting the full plan19:10
fungiaside from that, last week's maintenance went smoothly19:11
fungiwe've had a couple of requests for clarification on new urls for starlingx mailing lists, but no problems reported19:11
fungisounds like we can proceed with the date and time indicated, not hearing any objections19:12
fungii didn't have anything else on this topic, but am happy to answer questions19:13
clarkb#topic Server upgrades19:13
clarkbNothing new to add here19:14
clarkb#topic Nodepool Image Build and Upload Status19:14
clarkbhas anyone checked if the uploads in various clouds (particularly rax iad?) are looking good since our config chagnes (ending with the increase of the upload timeout about a week ago)19:15
fungii have not19:15
fricklerI just check rax-iad19:15
fricklerlooking as expected, all uploads succeeded at the first attempt19:16
fungiyay!19:16
fungii guess we can drop this from the agenda?19:16
frickleryes, we will handle inmotion as a different topic I think19:16
fungiagreed19:16
fricklerexcept maybe check for leftover images one last time?19:17
clarkbfeel free :)19:17
frickleror did you check last week?19:17
clarkbI did not do a pass since I cleaned up the images I last cleaned up whcih was more than a week ago19:17
fungii don't remember any more, but i can take a look19:17
clarkbthanks19:20
clarkb#topic OpenMetal19:20
clarkbHvaen't heard back from Yuriy since I responded last week. Did anyone else see email that I overlooked?19:20
clarkbI probably need to prod him on that19:20
fricklernope19:21
fungii did no19:22
fungit19:22
clarkbok ya I'll try to remember to send a followup again then19:23
clarkb#topic Zuul PCRE regexes are deprecated19:23
clarkbI don't think there is anything new to cover on this19:23
clarkbBut happy to be wrong :)19:23
fricklerI saw some teams doing patches in reaction to the announcement19:24
fricklerso IMO we can give it some weeks maybe and then recheck the remainders19:24
clarkbsounds good19:25
clarkb#topic Python container image updates19:25
clarkb#link https://review.opendev.org/q/(topic:bookworm-python3.11+OR+hashtag:bookworm)+status:open19:25
clarkbAs noted previously we'll defer Gerrit until after the release. As a result I pushed up more changes19:25
clarkbreviews welcome. Be aware that zuul/zuul-registry's image doesn't have a straightfowrard move to bookworm because it currently relies on openssl 1.119:26
clarkbwe may end up leaving that on bullseye for now and then keeping on bullseye image longer term for python3.11?19:26
clarkbin any case we can sort that out later. Reviews are welcome on the other changes19:29
clarkb#topic Etherpad 1.9.3 Upgrade19:29
fungiyeah, that's an unfortunate situation19:29
clarkbfungi: want to fill is in on where this neded up yseterday?19:30
fungi#link https://review.opendev.org/896454 Upgrade Etherpad to 1.9.319:30
fungiclarkb and i tested a held upgraded node yesterday19:30
fungii didn't observe any problems, though i think you noticed some weirdness with cached something in chrom*?19:31
clarkbya my chrome browser kept reconencting to the etherpad19:32
fungithe changelog is short and doesn't seem to touch anything our deployment should care about19:32
clarkbwell and before I did a hard refresh it got an error for some unfound symbol19:32
clarkbbut switching into incognito mode made the problems go away so almost certainly something was cached badly19:32
fricklersounds like a 3rd test might be helpful to decide how reproducible this is19:34
fricklerI'll try to do that this week, then19:34
clarkbfrickler: ++ if anyone other than fungi or myself have time for that. the clarkb-test pad is where we were testing and it has content already19:34
clarkb#topic Gitea 1.21.0 Upgrade19:35
clarkbGitea 1.21.0-rc0 is out now19:35
clarkbThere is no changelog yet so this isn't urgent, but in the past I've tried to push up an upgrade testing chagne once RCs are available in order to start working through problems19:36
clarkbIt would probably be good for someone other than myself to have a go at it. Any interest in doing a 1.21.0 upgrade?19:36
clarkbThe typical process I use is 1) check if go and/or nodejs need to be updated 2) upate our templates to keep in sync with updates upsteram 3) sort out any other items called out in the changelog19:37
clarkb3) is what usually takes the most time as you need to understand chagnes if they intersect with our use cases and occasionally write tests to cover the updates19:38
clarkbbut even just having 1 and 2 done can help rule out a bunch of stuff pretty quickly in our test system19:38
clarkbif there is interest say something in #opendev so that we don't end up doing duplicate work19:39
clarkb#topic Gerrit Replication Task Leaks Fix19:40
clarkb#link https://gerrit-review.googlesource.com/c/plugins/replication/+/38731419:40
clarkbI've written a chagne that when tested by hand on a held node seems to do what we want. Somewhat predictably gerrit maintainers have asked me to write test cases though. I've spent the morning doing my best to pretend I understand what is going on there and the end result is my latest comment19:41
clarkbtl;dr I think i have two test cases that are very close but haven't figured out how to run them locally yet and a third is basically missing a major piece of testing in the replication plugin (replicating when permissiosn say no) and I am currently completely lost in Gerrit's internal models for permissions19:42
clarkbI'm hopeful we can get that merged by the time we want to restart Gerrit though so that we can fix the problem for good19:42
clarkb#topic PTGBot Webserver crashing19:45
clarkbseems to be running right now. Did anyone have to restart it since we restarted it yseterday?19:45
fricklernot me19:46
fungii don't see any indication it's been restarted19:46
fungiSep 25 19:20:43 eavesdrop01 docker-ptgbot[646]: DEBUG:root:Debugging on19:46
fungithat was the last thing it logged19:46
fungiwhich is when i started it yesterday19:47
fricklerps also says running since yesterday19:47
fungiyeah, so i think it has not crashed (yet anyway)19:47
clarkbok so no new information to look at there. Did we figure out if the webserver logging was broken?19:48
fungii think it's just not instrumented for request logging19:48
fungiit's logging an explicit line proving that debug logging was enabled at start19:48
fungii have no idea if it's sufficient to log exceptions/tracebacks though19:49
fricklerso that'll need some further testing once someone has time, but it doesn't seem urgent for now19:50
clarkbok sounds good19:50
clarkb#topic Open Discussion19:50
clarkbAnything else?19:50
fungii did check for leaked nodes in rax-iad and found 1919:51
fricklernodes or images?19:51
funginot sure how old they are19:51
fungisorry, images19:51
fungione i just looked at was from 2023-09-0119:52
fungiso they might not be very recent19:52
fricklerso that's before the timeout bump19:52
clarkb~2023-09-18 is when we got the config where we wanted it19:52
clarkbI would probably clean anything before the 18th up and anything after can be subject to debugging19:52
fungilooping through to get dates now19:54
fungimost are from 2023-08-30 and 2023-09-0119:54
clarkbya so possibly timed out uploads that didn't clean up proeprly19:54
fungii think 2023-09-13 is the most recent19:55
fungiso all from before the change19:55
fungii think that indicates it's been successful, and we can go ahead and mop up these remnants19:55
fungii'll delete them now19:55
clarkb++19:55
fricklerack19:56
fungiand now that's done too19:57
fricklernext tuesday is a bank holiday here, so I might not be around19:58
clarkbenjoy the day off19:58
clarkbI don't think we have any holidays here until november19:58
fungialso corvus and i looked closely at one and they're completely missing metadata because the sdk adds the metadata after the image import tasks complete, so we can't use metadata to indicate they're safe to clean up19:58
fungithough maybe we could look for a complete lack of metadata, i dunno19:59
clarkband we are just about at time. Thank you everyone. We'll be back next week19:59
clarkbfungi: complete lack of metadata is normal for user uploads which is the risk there19:59
fungiright, exactly19:59
clarkbyou might delete somethign someone has uploaded iirc19:59
clarkb#endmeeting19:59
opendevmeetMeeting ended Tue Sep 26 19:59:42 2023 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)19:59
opendevmeetMinutes:        https://meetings.opendev.org/meetings/infra/2023/infra.2023-09-26-19.02.html19:59
opendevmeetMinutes (text): https://meetings.opendev.org/meetings/infra/2023/infra.2023-09-26-19.02.txt19:59
opendevmeetLog:            https://meetings.opendev.org/meetings/infra/2023/infra.2023-09-26-19.02.log.html19:59
corvusyeah, works for us manually, but not automatable19:59

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!