Tuesday, 2021-06-08

*** pleimer has joined #opendev-meeting13:14
*** pleimer_ has joined #opendev-meeting13:53
*** pleimer has quit IRC13:59
clarkbAnyone else here for the meeting?19:00
clarkbwe'll get started shortly19:00
ianwo/19:00
clarkb#startmeeting infra19:01
opendevmeetMeeting started Tue Jun  8 19:01:16 2021 UTC and is due to finish in 60 minutes.  The chair is clarkb. Information about MeetBot at http://wiki.debian.org/MeetBot.19:01
opendevmeetUseful Commands: #action #agreed #help #info #idea #link #topic #startvote.19:01
opendevmeetThe meeting name has been set to 'infra'19:01
clarkb#link http://lists.opendev.org/pipermail/service-discuss/2021-June/000253.html Our Agenda19:01
clarkbThe agenda isn't very big today. Hopefully I'll be able to give you some time back19:01
clarkb#topic Announcements19:01
clarkbThis didn't make the agenda, but it occurred to me while prepping for this meeting. I've got family in town starting tonight and I may not be able to run the next two meetings. I think I'll be around next week but definitely not the week after19:02
clarkbI'll try to figure that out more accurately once i can discuss their plans tonight/tomorrow19:02
clarkb#topic Actions from last meeting19:03
clarkb#link http://eavesdrop.openstack.org/meetings/infra/2021/infra.2021-06-01-19.01.txt minutes from last meeting19:03
clarkbThere were no recorded actions19:03
clarkb#topic Specs19:04
clarkb#link https://review.opendev.org/c/zuul/zuul/+/793669 Zuul spec to use matrix instead of irc19:04
clarkbThis isn't our spec but it is worth keeping an eye on and commenting on if you have input19:04
fungii think corvus is hoping to make the decision on it later this week19:04
fungiso feedback sooner would be better19:05
clarkb++19:05
clarkb#topic Topics19:05
clarkb#topic Switch to OFTC19:05
clarkbianw has been working to redo our irc bot deployment stuff. Including spinning up a new server and running bots out of containers there19:06
fungiand a new eavesdrop server19:06
clarkbmeetbot and ptgbot are the bots left out in that process as they aren't currently containerized19:06
fungier, you did say new server, yeah19:06
clarkb#link https://review.opendev.org/q/topic:%22limnoria%22+status:open Limnoria bot rewrite19:06
clarkbThis topic is a set of changes that convert meetbot from supybot to limnoria and containerize that bot so it can run on the new server too19:07
fungistatusbot, btw, is still running foreground in a screen session, hasn't crashed again :/19:07
clarkbI think I've gotten through that topic for reviews but should double check19:07
clarkb#link https://review.opendev.org/c/openstack/ptgbot/+/793792 Update PTGbot for OFTC19:07
clarkbThis change updates ptgbot to make it possible for it to connect to oftc, but we need to double check if it needs python2 or if it can run under python3. If it can run under python3 then a good followup would be containerizing it. If not then converting to python3 then containerizing it would be good19:07
fungioh also, i think the openstack tc has decided friday of this week is when they'll change channel topics on freenode, not sure if anyone else is following suit or if we want to do something similar for our handful of opendev channels there19:08
ianwso the bits that want review now are the deployment19:08
ianw#link https://review.opendev.org/q/topic:%22meetbot-container%22+(status:open%20OR%20status:merged)19:08
clarkbah ok I'll have to review those then19:09
ianwi am still working on statusbot in19:09
ianw#link https://review.opendev.org/c/opendev/system-config/+/79521319:09
clarkbfungi: I'm happy to update our topic for #opendev and #opendev-meeting though I dropped off of freenode today. Neither channel had any real traffic since the switch though19:09
ianwthat's the only bit i would consider hacky.  i'm installing it in the limnoria container, even though it's not a plug-in, on the theory that it *should* be19:09
fungiyeah, i can do topic updates, i'm still joined there19:10
ianwbut realistically, i doubt anyone is that interested in rewriting it; but if they did, that's where it should go19:10
clarkbianw: you mean rather than using a separate statusbot container?19:10
clarkbianw: is there a reason to combine them rather than do process per container? I think we have tried to stick to process per container as much as possible19:11
clarkbI'll try to dig into that when I get to reviewing the changes19:13
ianwwell, to avoid having to make another container and build process for it all, for something that we don't really care that much about19:13
clarkbI see19:13
clarkbAnother thing I wanted to point out is that zuul is using a matrix element oftc connection url instead of oftc's webchat as its easy connect mode. If we've got similar links that need updating I think that worked reasonably well19:14
clarkbmight also suggest it to other users like openstack19:14
clarkbanything else on the subject of IRC?19:15
ianwi guess ptgbot i wanted to call out19:16
ianwi've -1'd19:16
ianw#link https://review.opendev.org/c/opendev/puppet-ptgbot/+/79379419:17
fungiwe have time to solve ptgbot, there's no looming ptg for months thankfully19:17
ianwexplaining that the puppet no longer runs on eavesdrop01.openstack.org (because we're worried pip will explode on itself)19:17
clarkbianw: oh I didn't realize there was a puppet change already too. I agree we should convert it over. Particularly since we have time for that bot19:17
fungiso if it's offline for a while to get containerified, it's not a huge deal19:17
ianwso the future for this, if there is any, is for someone to make a container and migrate it to ansible, and deploy on eavesdrop0119:18
clarkb++19:18
ianwat one time, per the irc spec, the future was to rewrite it as a limnoria plugin19:18
ianwbut i honestly don't think anyone is going to do that now19:18
fungii have no idea how much of a forklift that would actually be, i expect the internal logic is mostly isolated and would probably not be that hard to translate to a limnoria plugin, but i can't say i've looked at it with that in mind19:19
ianwand i would not be surprised if it prompts a think about its future anyway19:19
clarkbfungi: on the flip side I think installing it to a container and bind mounting the dir where it stores info is probably super trivial19:20
fungiyep19:20
fungiwe already just pip install it anyway19:20
corvuso/19:21
clarkbwe can bring it up with diablo_rojo_phone and ttx to see what they think and take it from there19:21
clarkbI agree with ianw though that we shouldn't try to keep the puppetry going given what happened with pip on the old server when we switched19:21
ianwi doubt containerising it would be too difficult.  i'd just like someone else to own it :)19:22
clarkb++19:22
clarkbI suspect diablo_rojo_phone and/or ttx would be happy to help with that19:22
clarkbLet's move on19:22
clarkb#topic Gerrit Updates19:22
ianwi'm happy to help point people in the right direction to get it all going19:22
clarkbianw: I can let them know. I can probably help too19:23
clarkb#link https://review.opendev.org/c/opendev/system-config/+/775961 MariaDB container for review02 accountPatchReviewDB19:23
clarkbianw has been working on ^ as one of the last pieces before we can schedule a cut over to review0219:23
clarkbI found one more thing reviewing it this morning that would be good to double check19:23
ianwok i'll loop back.19:24
clarkbBut otherwise I think it is just about there. It would be good if other infra-root can review it too since it has the potential to make review01 unhappy (though the change attempts to avoid changing anything on that)19:24
clarkbI also remembered that I had a change up to unfork gerrit19:24
clarkb#link https://review.opendev.org/c/opendev/system-config/+/791995 Stop adding special x/ path handling to gerrit19:24
fungiyeah, that's fixed and backported to the minor series we're on19:25
clarkbIt passes testing, though we may want to double check we don't need to update plugin versions on the image build?19:25
clarkbApplying that will require a gerrit restart, but I think getting off of our fork is a good thing overall and something we haven't done for years :)19:25
clarkbPlease carefully review that one too.19:25
ianwit would be good to get the latest zuul summary plugin in with better timestamps19:26
ianwi have19:26
ianw#link https://etherpad.opendev.org/p/gerrit-upgrade-202119:26
ianwtracking general upgrade notes19:26
clarkbianw: landing 791995 should rebuild with the latest zuul summary plugin too.19:26
ianwwhen we have the accountPatchReviewDB situation sorted, i think we can more or less migrate at will19:27
clarkbianw: was reverse dns set up? Seeing that on your etherpad and remembering that is important for email19:27
ianwahh, that's a good one, i should ping mnaser on that one19:27
clarkb++19:28
clarkbanything else on gerrit updates?19:28
ianwi think review02 will be the host now, especially with the address pinning we've done19:28
fungii suppose the same mariadb container can be useful for a future zuul scheduler, and storyboard containerization as well19:28
clarkbfungi: yup, we also already do similar for gitea and maybe something else too19:29
ianwyeah, it's just the upstream container, but the deployment bits may be helpful to copy19:29
fungioh, right, forgot we already had that implemented for gitea since ages19:29
clarkb#topic Server Upgrades19:30
clarkbWe have already discussed the eavesdrop upgrade and tasks for the gerrit upgrade19:30
clarkbI'm still slowly poking at the listserv upgrades. I'm hoping I'll have time to boot a test node and upgrade it in place during the second half of this week19:30
clarkbI'll be using the snapshot that fungi took about a month ago to avoid needing to wait for and slow down the server again19:31
ianw#link https://review.opendev.org/c/opendev/system-config/+/79520719:31
ianw^ that removes ask.o.o which is causing deployment issues as it's now shutdown19:31
clarkbianw: fungi: should we stash a db dump from the ask db as part of that?19:31
ianwwe do already have that on backup hosts19:32
clarkboh excellent19:32
clarkbthe redirect with LE cert seems to be working well too19:32
fungiit's a local redis, right?19:32
clarkbno askbot is postgres or mysql. I forget which19:32
fungiahh, okay, but regardless it's not in a trove instance19:33
ianwiirc it was a mysql and dumped out with mysqldump as a stream19:33
clarkbfor that I am not sure19:33
clarkbit might be trove if it was mysql19:33
fungii'll check, if it's trove i'll make an archival snapshot and then delete the instance19:33
clarkbthanks!19:33
clarkbAny other server upgrades or cleanups to call out?19:33
ianwwe must be getting pretty thin on xenial now?19:34
clarkbOh it would be good if infra-root can check the test node listservs that I upgraded to see if anything stands out as broken19:34
clarkbianw: slowly but surely :)19:34
ianwpost listserv, eavesdrop, review what is left?19:34
ianwpaste i guess19:34
clarkbone thing I notice about the inplace upgrades of the listservs is we'll end up with sysv init scripts and unit files for mailman19:34
funginot wiki, it's still trusty! ;)19:34
clarkbianw: translate which we need to figure out because zanata, health, elasticsearch, logstash19:35
clarkbwe said we wouldn't just turn off ELK things to give people time to try and figure that out. We should probably periodically check in on that and work with openstack to draw a line in the sand so that we aren't just keeping it out there forever19:36
fungino trove instance for askbot, btw19:36
clarkb#action clarkb followup with openstack on ELK retirement planning19:37
clarkbI can do that19:37
ianwzanata i forget, was that a dead project?19:37
fungi(we have trove instances for one or more of cacti, grafana, paste, refstack, review, subunit2sql, translate, wiki, zuul)19:37
clarkbyup19:37
fungione or more each i mean19:38
clarkbianw: and there was talk about transitioning to a hosted open source alternative, but it kept getting deprioritized on my end because other things kept popping up19:38
fungi(e.g., review/review-dev/review-test)19:38
clarkbianw: transitioning that would actually be a good project for someone in the community if we can find someone interested in it since all the data is exportable as a user aiui19:38
ianwoh, cacti must be another one that needs updating19:38
clarkbcorvus had talked about switching to some prometheus + influxdb + grafana type setup19:39
ianwyeah, i remember discussion on translate that seemed to stall19:39
fungiif memory serves, the proposal was to have prometheus query snmpd and then store the data in its own timeseries database?19:39
corvusi recommend skipping influxdb19:40
corvusya19:40
clarkbThe upside to that is we can plug services like gitea into it19:40
corvusgrafana+prometheus snmp_exporter+grafana19:40
ianwi was reading last night about https://grafana.com/docs/grafana/latest/whatsnew/whats-new-in-v8-0/#grafana-v80-alerts19:40
clarkband get service level as well as host level metrics19:40
fungiand i guess if we want history, we can use rrdtool to dump the existing graphs and ingest them into prom19:40
corvusi haven't run that yet; but i have run influxdb and don't think it's a good fit right now19:41
clarkbmaybe we should write up a small spec for that? As I do think replacing cacti with prometheus gets us more functionality like the gitea service metrics which will be nice to have19:42
clarkband I suspect that starting fresh might be simpler than uplifting cacti (but I'm not super familiar with either tool so could be wrong about that)19:43
fungialso i guess it allows us to have one grafana interface into things19:43
fungiso we can mix-n-match stuff from graphite and prometheus into the same dashboards19:43
clarkbwe could include zuul system metrics alongside the service metric :)19:44
fungiprecisely19:44
corvusyep.  the trick will be getting actually correct memory graphs :)19:44
clarkb#action Someone Write up a spec to replace cacti with prometheus19:44
clarkbThis way I won't forget it's a thing19:44
clarkbI can probably give that a go too, but not promising that at the moment19:45
clarkbAny other upgrade items before we move on?19:45
clarkb#topic Removing PPC packages from CentOS 8 mirrors19:46
clarkbYesterday we discovered that our centos mirrors had stopped updating because we ran out of afs quota room19:46
clarkbit was discovered that we are mirroring ppc packages but we can't find any users of that (not surprising as we don't have ppc hardware)19:46
fungiapparently we intentionally included ppc64le packages in our centos-7 mirroring, and it's lumped in with 8 and 8-stream by default19:47
ianwdid blame show me/tonyb adding it explicitly at some point?19:47
fungiianw: if you did, it was many years back19:47
clarkbI wanted to bring this up here as a last call for objections if there is some reason to not make that change19:47
ianwlooks like https://review.opendev.org/c/opendev/system-config/+/66272719:48
clarkbotherwise I think we should update all our mirrors to drop ppc19:48
fungifor 8 and 8-stream we just mirror all the official architectures, which are presently x86-64, aarch64 and ppc64le19:48
clarkbThen update afs quotas as appropriate19:48
ianwhrw just posted a note not long ago about dropping ppc from kolla iirc19:48
clarkbya yoctozepto confirmed nothing ever used ppc kolla in our CI systems19:48
clarkbthey had some ppc support but never exercised by us19:48
fungii have a feeling we'd cut nearly a third of our content from that volume if we dropped ppc64le packages/images19:49
clarkbhas anyone written a change to do that cleanup yet? Does anyone want to write it?19:49
ianwi can do that19:50
fungii expect it to be a fair number of exclusions, since there are more than a few subtrees for it19:50
*** mordred has joined #opendev-meeting19:50
clarkbfungi: I think we can just remove ppc64le from altarchs and add it to altarches_ignored?19:50
clarkbianw: thanks19:51
fungifor centos-7 yes, but it's not in the altarches tree in 8 and 8-stream19:51
clarkbfungi: aha19:51
fungiit's an official architecture so it's in-tree along with x86-64 and aarch6419:51
clarkb#action ianw Push change to cleanup ppc packages in our CentOS mirrors19:51
fungiso, yeah, i welcome someone more familiar with rhel/centos repository structure tackling that19:52
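A minimal sketch of the exclusion idea discussed above, assuming the mirror is synced with rsync (the log does not say which tool is used) and that ppc64le content lives in directories named `ppc64le` at various depths of the 8 and 8-stream trees. The function names and the sample paths are hypothetical and are not taken from the actual mirror scripts.

```python
def rsync_exclude_args(dropped_arches):
    """Build rsync --exclude arguments for each dropped architecture.

    A pattern such as "ppc64le/" has no leading or inner slash, so rsync
    matches it against a directory of that name at any depth of the tree.
    """
    return [f"--exclude={arch}/" for arch in dropped_arches]


def is_excluded(path, dropped_arches):
    """Simplified local check: true if any path component names a dropped arch.

    This only approximates rsync's matching and is meant for eyeballing
    which sample paths would be skipped.
    """
    parts = path.strip("/").split("/")
    return any(part in dropped_arches for part in parts)


if __name__ == "__main__":
    dropped = {"ppc64le"}
    print(rsync_exclude_args(sorted(dropped)))
    # Hypothetical sample paths, for illustration only.
    for sample in (
        "8/BaseOS/ppc64le/os/Packages/example.rpm",
        "8/BaseOS/x86_64/os/Packages/example.rpm",
    ):
        print(sample, is_excluded(sample, dropped))
```

Because the architecture sits in-tree for 8 and 8-stream, a depth-independent pattern avoids listing every subtree by hand, though the real change would still need to be checked against the actual repository layout.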
clarkb#topic Scheduling Project Renames19:52
clarkbfungi: you had volunteered to update the rename playbook(s), do we have any change(s) for that yet?19:52
fungii can't remember, did we determine this will be our first attempt at renaming a project since the 3.2 upgrade?19:52
clarkbthis is our first attempt yes, other than what we did by hand on the test node to confirm it is possible19:53
fungialso not surprisingly i forgot i volunteered for that, will need to revisit what we determined needed to be done for it19:53
fungijust remove the mysql bits i suppose?19:53
clarkbfungi: we need to drop all the db stuff from the playbook and only do the on disk renames. Otherwise I think it is very similar19:53
clarkbyup19:53
clarkband we can possibly test it using our gerrit functional test jobs19:54
fungiokay, i'll try to leave myself a less forgettable reminder this time19:54
clarkbby adding that playbook to the list of test playbooks and confirming a project has moved within gerrit19:54
clarkbbut we can do that as a followon19:54
clarkbthanks!19:54
clarkbonce we have those updates up then I think we can think about scheduling the renames as we'll have the tools to do it19:54
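A rough sketch of the "on disk rename only" step mentioned above, assuming bare repositories stored as `<name>.git` under a single root directory and that the service is stopped while files move. The real rename playbook is Ansible and covers more than this; the function name, arguments, and paths here are hypothetical illustrations.

```python
from pathlib import Path


def rename_repo_on_disk(git_root, old_name, new_name):
    """Move <git_root>/<old_name>.git to <git_root>/<new_name>.git.

    No database is touched; only the directory is renamed.
    """
    src = Path(git_root) / f"{old_name}.git"
    dst = Path(git_root) / f"{new_name}.git"
    if not src.is_dir():
        raise FileNotFoundError(f"no repository at {src}")
    if dst.exists():
        raise FileExistsError(f"{dst} already exists")
    # The new namespace directory may not exist yet.
    dst.parent.mkdir(parents=True, exist_ok=True)
    src.rename(dst)
    return dst
```

A functional test along the lines discussed could call this against a throwaway directory and then confirm the project is reachable under its new name.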
clarkb#topic Open Discussion19:54
fungii'm happy to include the functional testing in my change, but will appreciate some guidance on the validating bits19:54
clarkbI can help with that19:55
fungi(when i reach that point)19:55
clarkbI've poked at similar for other changes recently19:55
clarkbWe did end up using our time after all. Was there anything else to bring up in the last 5 minutes here?19:55
fungii have nothing19:55
fungineed to shift to prepping dinner in a few minutes19:55
clarkbWhy don't we all return to eating $meal then :)19:56
clarkbthanks everyone!19:56
clarkb#endmeeting19:56
opendevmeetMeeting ended Tue Jun  8 19:56:47 2021 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)19:56
opendevmeetMinutes:        http://eavesdrop.openstack.org/meetings/infra/2021/infra.2021-06-08-19.01.html19:56
opendevmeetMinutes (text): http://eavesdrop.openstack.org/meetings/infra/2021/infra.2021-06-08-19.01.txt19:56
opendevmeetLog:            http://eavesdrop.openstack.org/meetings/infra/2021/infra.2021-06-08-19.01.log.html19:56
fungithanks clarkb!19:57
ianw++19:57
*** pleimer_ has quit IRC22:09
