Monday, 2024-03-04

tkajinamfrickler, sorry I was out because of some personal matters03:17
tkajinamfrickler, so far telemetry is blocked by it (we attempted to fix it in https://review.opendev.org/c/openstack/telemetry-tempest-plugin/+/910572 but are hitting different neutron problem it seems...)03:20
tkajinamthere are a few more patches related to tripleo removal currently blocked. quickly checking these, some of thedse may be different problems but I'm unsure if we can fully complete these before the release timing03:21
fricklertkajinam: well I looked at that tempest failures and I really fail to see how they could be related to the dib-utils removal. so I'm pretty convinced the re-adding it will not help and the tempest team will fix their CI even after that06:50
frickler*will need to fix06:55
tkajinamyeah. I agree with that07:37
opendevreviewMerged openstack/project-config master: Retire OpenStack-Chef: remove project from infra  https://review.opendev.org/c/openstack/project-config/+/91010007:53
opendevreviewEttore Simone proposed openstack/diskimage-builder master: Cleanup uses of DIB_IMAGE_ROOT_FS_UUID variable  https://review.opendev.org/c/openstack/diskimage-builder/+/90555611:10
*** tosky_ is now known as tosky11:36
opendevreviewClark Boylan proposed opendev/system-config master: Add more info to afs fileserver recovery docs  https://review.opendev.org/c/opendev/system-config/+/91066216:22
clarkbfungi: ianw  ^ that should address your comments16:22
fungithanks!16:22
clarkbfor mariadb upgrades we did the easy one :). Any opinions on which we should do next out of gerrit, mailman3, etherpad, refstack, and gitea? I'm leaning towards etherpad and refstack. I think gitea can be done as well, but given the recent db stuff there I think it may deserve a bit more thought. Gerrit would be easy if it didn't imply a service outage.16:26
fungifor gitea, we could try upgrading one backend and then if it exhibits problems just take it out of the haproxy pool while we work through that. or we could try holding a test node and importing a recent backup mysqldump, then upgrading if we want a preliminary sniff test16:35
fungiactually, for partial production upgrade, we could take a backend down in haproxy first, upgrade it, test it, then put it back into the pool if everything seems okay, and exercise it for a while looking for errors before we upgrade the others16:35
clarkbfungi: ya I think testing on a copy of the db is what I'd like to do. My main concern is that the utf8 stuff seems to get stricter in newer mariadb versiosn and I'd like to ensure we understand if that affects anything in the upgrade (it really shouldn't only system tables are touched by the upgrade not application tables, but still)16:36
fungimakes sense, yep16:41
clarkbfor some reason I thought today is tomorrow and that I'd have a bunch of meetings.16:44
clarkbI'm going to send an email to openstack-discuss about centos 7 cleanup. The vast majority of cleanup needs to be done in openstack-ansible, swift, devstack, freezer, keystone etc17:30
fungisounds reasonable17:31
clarkbI can handle the zuul-jobs cleanup and the stuff within opendev and haev already pushed a change for starlingx. But not sure I can wait around for all of those to cleanup17:31
clarkbhoping an email gets things moving17:31
fungii think we can just announce a date and then rip it out17:32
clarkbya do we think March 15 is too early?17:32
clarkbI know it may impact the release aprticularly with the concurrent tripleo cleanup. But at the same time I would really like to get this done before we add any new images17:33
clarkblike ubuntu 24.0417:33
fungithat's rc1 deadline for the openstack release, but i suppose it should only impact stable branches anyway?17:33
clarkbfungi: no all of the things I listed above are master using centos 717:34
fungioh wow17:34
clarkbthis is why I keep saying openstack need to clean stuff up at release time 17:34
clarkbrather than at eol time17:34
clarkbfungi: ya the flip side of letting this go to RC time is that we'll have another branch that needs cleanup17:37
clarkbif we deal with it now then its less work overall17:37
clarkbbut I suspect most people would like us to not impact the release at all then just ignore the problem17:37
fungithankfully the number of job references for https://codesearch.opendev.org/?q=centos-7 seems fairly limited in the openstack/ namespace17:38
clarkbso ya an objections to saying we'll drop things March 15?17:41
clarkbnoonedeadpunk: mnasiadka fyi ^ a coupel of openstack ansible repos show centos 7 still17:44
clarkbI know you indicated this wasn't an issue, but would be good to clean that up to avoid problems in the first place /me will send email shortly too17:44
fungiskimming the search results, it looks like most of the affected job definitions aren't in any pipelines, they stopped running them but missed cleaning up where they were defined?17:45
clarkbseems likely17:45
fungii'm trying to put together a picture of which affected jobs are actually still running on master vs just unused cruft configuration17:46
jrosserfor OSA //review.opendev.org/c/openstack/openstack-ansible-ops/+/91097217:47
jrosserhttps://review.opendev.org/c/openstack/openstack-ansible-ops/+/91097217:47
clarkbjrosser: ansible-plugin-container-connection is the other repo with centos-7. Also thanky ou17:47
jrosserthats not one i recognise immediately17:48
clarkbjrosser: in 910972 you deleted a - job: line which made zuul unhappy17:48
clarkbbut otherwise that lgtm. It also cleans up other old stuff whcih is great17:49
noonedeadpunkopenstack/openstack-ansible-ops is our crap17:49
fungilooks like some jobs are referenced in project-templates which are no longer applied in any projects, e.g. https://codesearch.opendev.org/?q=ansible-role-functional-jobs-centos717:49
jrosseri suspect ansible-plugin-container-connection is very dead17:49
jrosserit would be nice if we could search with a wildcard in the zuul errors page17:50
jrossermonorepo == easy there17:50
jrosserdozens of repos not so easy17:50
noonedeadpunkyeah, unfortunatelly zuul doesn't have parentproject concept17:51
clarkbjrosser: note they aren't errors yet17:51
jrosserno, but i did make a bunch of patches for errors last week17:51
clarkbah gotcha17:51
jrosserhttps://review.opendev.org/q/topic:%22osa/zuul-errors%2217:51
jrosserand i suspect that some intervention will be needed to merge the bulk of those17:52
clarkbspot checking it looks like typical bitrot in jobs. Could mark them non voting if you think there is interest in fixing them or just remove them if not17:53
clarkbI've pushed a change to delete a job in pbr for similar reasons this mornign17:53
clarkbok I'm going to send the march 15 date and then see if there is much screaming. I suspect the impact is small enough we can make that happen17:53
jrosseractually ansible-plugin-container-connection looks like it in the Ansible SIG https://opendev.org/openstack/governance/src/branch/master/reference/sigs-repos.yaml#L517:55
fungii guess the concern is what happens if, say, 2024.1 rc1 is tagged for a number of affected projects that week, and then we remove the configuration and don't notice that jobs have broken for them until we go to tag the final release a couple weeks later17:55
clarkbfungi: yes though maybe that is the kick we need to actually clean this up before we make releases17:56
fungifair17:56
clarkbfungi: the major issue here is that we don't have a post or pre release task to go and cleanup all the old cruft we don't need or want anymore17:56
clarkbfungi: so we've ended up with centos 7 in master somehow17:56
clarkbwhen it should've been removed most places years ago17:56
fungialso zuul should call out the stable/2024.1 branch as having errors in those projects and we can identify and correct them before release day17:56
clarkbI'll make a note that this can impact the release and I'd welcome feedback17:58
clarkbat the same time we've got people wanting ubuntu 24.04 though17:58
clarkband I really feel like this is a give and take situation where we clear room and then add new stuff17:58
jrosserfrom a deployment project POV we quite often stretch support for an OS for some cycles after the main projects move forward17:59
jrosserwe did that fairly spectacularly with centos-7 and bionic17:59
clarkbjrosser: yup and we've tried to be accomodating in the CI system too17:59
clarkbcentos 7 is 10 years old17:59
jrosseroh sure yes this was ages ago17:59
clarkband will EOL very soon18:00
fungiand about to be eol by rh18:00
fungilooking at the search results, i think swift will be the hardest hit. they have 5 jobs they're currently running on master which they'll need to drop18:02
jrosserslurp releases have made things stretch more too18:02
timburkefungi, not for long we won't ;-) https://review.opendev.org/c/openstack/swift/+/910829 is approved; i'll work on some backports to clean out stable branches18:02
clarkbfungi: yup and timburke is already on it (I reached out to them first because they did seem to have the biggest usage)18:02
fungiswift-probetests-centos-7, swift-tox-py27-centos-7, swift-tox-func-py27-centos-7, swift-tox-func-encryption-py27-centos-7, swift-tox-func-ec-py27-centos-718:03
clarkbemail sent18:04
clarkbthinking out loud here I suspect that we can drop centos-7 platform testing in zuul-jobs and stop wheel cache/mirror builds prior to that date18:07
clarkbbut then wait fro teh 15th to merge the base-jobs nodeset removal and actual image and node cleanups in nodepool18:07
fungiyep. i can try pushing some removals to, like, devstack and openstack-zuul-jobs to see what errors they report, if nobody's done that yet18:10
clarkbfungi: I don't think anyone has. I'm starting with ozj and project-cnfig and will try to excise what looks safe18:10
opendevreviewSteve Baker proposed openstack/diskimage-builder master: Call grub2-mkconfig with --update-bls-cmdline  https://review.opendev.org/c/openstack/diskimage-builder/+/91097718:11
fungiclarkb: is there a common review topic string you're using?18:11
clarkbfungi: 'drop-foo' is what I'e been doing so 'drop-centos-7' should work18:12
opendevreviewClark Boylan proposed openstack/project-config master: CentOS 7 removal prep changes  https://review.opendev.org/c/openstack/project-config/+/91097818:18
clarkbI believe ^ and https://review.opendev.org/c/openstack/openstack-zuul-jobs/+/910979 should be fairly safe as long as zuul doesn't get made about removing anything there18:20
opendevreviewClark Boylan proposed openstack/project-config master: CentOS 7 removal prep changes  https://review.opendev.org/c/openstack/project-config/+/91097818:24
opendevreviewClark Boylan proposed openstack/project-config master: CentOS 7 removal prep changes  https://review.opendev.org/c/openstack/project-config/+/91097818:26
fungihttps://review.opendev.org/c/openstack/devstack/+/910980 Drop the devstack-single-node-centos-7 nodeset18:28
fungicurious to see what errors zuul reports on that18:28
clarkbthe keystone job likely uses that nodeset18:31
clarkband maybe octavias too18:31
fungii'm expecting mostly problems from stable branches18:31
fungibut there are master branch references in zuul configs for keystone, neutron-vpnaas, solum, and x/networking-opencontrail18:33
fungiwhich leaves me wondering, if there's nobody to approve a cleanup change in x/networking-opencontrail do we drop it from the tenant config so that the devstack cleanup change can merge?18:33
fungioh, you know what, the devstack master branch cleanup isn't going to fail because zuul will just use the nodeset definition from one of devstack's many stable branches18:36
fungiis there a specific order they need to be cleaned up in?18:38
fungi(oldest first, latest first, doesn't matter)18:38
fungilooks like devstack currently has 7 other branches this needs to be cleaned up in18:40
clarkbI don't think order matters because the definitions all have to match otherwise it is an error?18:41
clarkband yes removing networking-opencontrail from zuul is what I would do 970978 does this to x/collectd-openstack-plugins18:42
fungii also just realized this intersects with the stable to unmaintained transition for wallaby and xena branches18:58
fungiso far only stable/yoga has been replaced by unmaintained/yoga, so if i propose nodeset cleanup changes to stable/wallaby or stable/xena they're going to get auto-closed when the unmaintained/wallaby and unmaintained/xena branches are created19:00
fungiand if we don't actually merge the nodeset cleanup across every open branch, we'll never see the errors from other projects referencing that nodeset19:00
clarkbyes I didn't udnersatnd why the oldest branches weren't udpated first19:01
clarkbfor unmaintained I mean19:01
fungielodilles: ^ perhaps you know?19:03
clarkbfungi: https://review.opendev.org/c/openstack/project-config/+/910978 should be good to go regardless of all the other changes to rip out centos 721:25
clarkband with that in we can recheck the ozj change that won't pass zuul config checks until after x/collectd-openstack-plugins is out of zuul21:26
opendevreviewClark Boylan proposed opendev/system-config master: Upgrade Refstack's MariaDB to 10.11  https://review.opendev.org/c/opendev/system-config/+/91099921:28
opendevreviewClark Boylan proposed opendev/system-config master: Upgrade Etherpad's MariaDB to 10.11  https://review.opendev.org/c/opendev/system-config/+/91100021:30
clarkbCI should give us an idea if the database version change presents a compatibility problem (it won't test the upgrade itself)21:31
fungithanks21:32
opendevreviewClark Boylan proposed opendev/system-config master: Add more info to afs fileserver recovery docs  https://review.opendev.org/c/opendev/system-config/+/91066221:48
clarkband that addresses frickler's comment21:48
opendevreviewMerged openstack/project-config master: CentOS 7 removal prep changes  https://review.opendev.org/c/openstack/project-config/+/91097821:49
clarkbreminder to get your team meeting agenda items onto the agenda today. I've got to do a school run in a few but then I'll be putting an agenda together when I get back22:03
opendevreviewMerged opendev/system-config master: Add more info to afs fileserver recovery docs  https://review.opendev.org/c/opendev/system-config/+/91066222:07
corvusI'm going to perform a rolling restart of the entire zuul cluster to pick up a recently merged change which should fix a regression in executor job distribution.22:20
corvusi'm going to do that by running the same command we use for the weekly restart cron in a screen on bridge22:21
corvus#status log began graceful restart of zuul cluster22:23
opendevstatuscorvus: finished logging22:23
fungithanks corvus!22:36
clarkbfungi: shoudl we go ahead and approve teh git-review updates for the commit msg hook?23:03
clarkbthen I can drop it from the meeting agenda (i'll probably do that either way as we got reviews)23:03
fungii need to double-check whether i got all the comments addresses23:03
fungiaddressed23:03
clarkbpretty sure you addressed my concerns23:04
fungimm, yeah timburke had some suggestions on the chmod change i was going to address23:04
timburkespeaking of git review, is there anything else i should do for https://review.opendev.org/c/opendev/git-review/+/907097 and https://review.opendev.org/c/opendev/git-review/+/907101 ?23:06
opendevreviewClark Boylan proposed opendev/system-config master: Upgrade Etherpad's MariaDB to 10.11  https://review.opendev.org/c/opendev/system-config/+/91100023:12
clarkbfound a super minor incompatibility afterall. Just a test thing though23:12
clarkbtimburke: I went ahead and approved the classifier one. On the importlib vs pkg_resources change maybe fungi can double check it for comaptibility and the napprove23:14
opendevreviewJeremy Stanley proposed opendev/git-review master: Don't make hook script read-only  https://review.opendev.org/c/opendev/git-review/+/91026823:14
opendevreviewJeremy Stanley proposed opendev/git-review master: Vendor a copy of Gerrit's commit-msg Git hook  https://review.opendev.org/c/opendev/git-review/+/91027523:14
fungilooks like we don't have a test of git review --version but i can whip one up real quick23:20
clarkbmy agenda edits are in. I'll send that out in about half an hour23:26
ianwclarkb: speaking of afs; https://review.opendev.org/c/opendev/system-config/+/906013 is some fixups to the client role that i think are all useful23:26
clarkboh yes having a clear "the kernel is too old" message would be helpful for debugging23:30
opendevreviewJeremy Stanley proposed opendev/git-review master: Test that the --version option returns something  https://review.opendev.org/c/opendev/git-review/+/91101023:30
opendevreviewMerged opendev/git-review master: Add classifiers for Python 3.10 and 3.11  https://review.opendev.org/c/opendev/git-review/+/90709723:37
opendevreviewJeremy Stanley proposed opendev/git-review master: Test that the --version option returns something  https://review.opendev.org/c/opendev/git-review/+/91101023:39

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!