Wednesday, 2022-04-27

ianwok, done as much on the nodepool gate breakage as i can, going to have a quick crack at ^ now04:20
ianwi've got a root screen up on mirror-update04:22
*** ysandeep|out is now known as ysandeep04:31
ianwi'm doing the deleteunreferenced now04:53
ianwthat took us from 395gb down to 256gb, so roughly 140gb freed ... not bad06:19
ianwi've dropped the lock, but will leave the root screen up just in case we need scrollback on what i did06:19
ianw#status log To save mirror volume space, we have removed source packages from the ubuntu-ports repository06:20
opendevstatusianw: finished logging06:20
*** jpena|off is now known as jpena07:27
*** ysandeep is now known as ysandeep|lunch08:12
*** ysandeep|lunch is now known as ysandeep08:52
jrosseri kicked the tyres on the ubuntu jammy image and I think the disk may not be expanded? https://zuul.opendev.org/t/openstack/build/23ce4c4d007a478da4067e48c0bd463809:58
*** rlandy|out is now known as rlandy10:21
ianwjrosser: hrm, interesting.  the nodepool functional test does actually test our growroot grows.  not to say that it's impossible, but it has slipped past testing at least :)10:26
ianwhttps://opendev.org/zuul/nodepool/src/branch/master/tools/functional-test-check.sh#L35 is where we do that10:28
fricklerianw: jrosser: I'm not sure if that's disk full on the node actually, or log size on the executor reaching the limit? also note that our jammy repo mirror may still be broken 10:31
frickleractually devstack got along much further today than yesterday. but indeed I see no growroot being done there, either https://1ae34c77782edc957e3d-ea76e3e710a8219c6c723d407750283e.ssl.cf1.rackcdn.com/839389/2/check/devstack-platform-ubuntu-jammy/2dfd8e4/job-output.txt10:33
fricklerah, that's because devstack trims the log. the root partition size looks fine here https://1ae34c77782edc957e3d-ea76e3e710a8219c6c723d407750283e.ssl.cf1.rackcdn.com/839389/2/check/devstack-platform-ubuntu-jammy/2dfd8e4/controller/logs/df.txt10:36
jrosseraaaah ok i misunderstood the error message there10:36
ianwfrickler: i think it's a service, so will happen on boot and not be captured in job-output.txt; syslog dumps on devstack may have it i guess10:37
ianwoh, but it seems it's probably growing.  and yeah, i'd agree that if zuul is reporting DISK_FULL it's from an executor POV, as it won't know the disk is full on the executing node side10:38
*** dviroel|rover|out is now known as dviroel|rover11:16
funginote that we also record things like df output at the start of most jobs with a common role and capture that in a log like zuul-info/zuul-info-ubuntu-jammy.txt11:34
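
A minimal sketch of the kind of check discussed above: confirm from the node itself that the root filesystem is at least as large as expected, rather than inferring it from a DISK_FULL result reported by the executor. This is not the nodepool functional-test-check.sh logic; the helper name `root_looks_grown` and the 20 GiB threshold are assumptions made for illustration.

```python
import shutil

GIB = 1024 ** 3


def root_looks_grown(total_bytes, min_expected_gib):
    """Return True if a filesystem of total_bytes is at least min_expected_gib."""
    return total_bytes >= min_expected_gib * GIB


if __name__ == "__main__":
    # shutil.disk_usage reports total/used/free bytes for the filesystem at "/".
    usage = shutil.disk_usage("/")
    print(f"root fs total: {usage.total / GIB:.1f} GiB, free: {usage.free / GIB:.1f} GiB")
    # 20 GiB is a hypothetical threshold, not a value taken from the log.
    print("grown:", root_looks_grown(usage.total, min_expected_gib=20))
```

Run on the test node, this gives the same information as the df.txt and zuul-info logs mentioned above, just reduced to a single pass/fail answer.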
*** pojadhav is now known as pojadhav|afk11:48
gthiemongeHi Folks, we got a weird failure in the CI this morning: https://zuul.opendev.org/t/openstack/build/7443aa7e7fbd4948b199f79fd6b7faee11:52
fungigthiemonge: we occasionally see that when the cloud has a rogue virtual machine squatting some ip address nova/neutron think is available and have allocated to a new instance12:00
fungithe gateway's arp table ends up with that entry repeatedly overwritten by the new and old mac, so ~half of connections to the ip address go to the rogue vm12:01
fungiif i can manage to ssh into the old node (by randomly trying that same ip address over and over), i'll shut it down12:01
fungiof course, that assumes the old vm was also one of our test nodes (it may belong to another tenant entirely)12:02
gthiemongefungi: ack12:04
fungii managed to catch it hitting the old vm i think, because i just saw the host key change on me, but attempts to ssh into it are hanging12:05
fungiand when i finally managed to get in, i wound up back at the new instance again12:05
fungiyeah, i think it wasn't one of our test nodes. i finally ended up getting a password prompt instead of my admin key logging me straight in12:07
fungianyway, the provider (rackspace in this case) seems to run periodic cleanup sweeps for such situations so hopefully it'll disappear on its own soon enough, but if not we can open a support ticket12:08
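
The ARP behaviour fungi describes can be illustrated with a small simulation, assuming a gateway table that simply keeps the most recent announcement for an address. The IP and MAC values are made up, and strict alternation between the two hosts is an assumption; in a real network the split is only approximately half.

```python
from collections import Counter


def simulate_arp_flap(announcements, connections_per_announcement=1):
    """Model a last-writer-wins ARP table and count where connections land."""
    arp_table = {}
    delivered = Counter()
    for ip, mac in announcements:
        # The newest ARP announcement overwrites the previous entry for that IP.
        arp_table[ip] = mac
        for _ in range(connections_per_announcement):
            # Each connection goes to whichever MAC currently owns the entry.
            delivered[arp_table[ip]] += 1
    return delivered


IP = "203.0.113.10"              # documentation-range address, hypothetical
NEW_MAC = "fa:16:3e:00:00:01"    # the freshly booted instance, hypothetical
ROGUE_MAC = "fa:16:3e:00:00:02"  # the leftover vm still using the address

# Both hosts keep announcing the same IP, taking turns.
announcements = [(IP, NEW_MAC), (IP, ROGUE_MAC)] * 50
result = simulate_arp_flap(announcements)
print(dict(result))
```

With the two hosts taking turns, the connections split evenly between the new instance and the rogue vm, which is why repeated ssh attempts sometimes land on the old host and sometimes on the new one.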
*** pojadhav|afk is now known as pojadhav12:26
*** ysandeep is now known as ysandeep|afk12:42
*** ysandeep|afk is now known as ysandeep13:00
*** ysandeep is now known as ysandeep|out14:21
*** marios is now known as marios|out15:28
*** dviroel|rover is now known as dviroel|rover|lunch15:29
opendevreviewClark Boylan proposed openstack/diskimage-builder master: Revert "Fallback to persistent netifs names with systemd"  https://review.opendev.org/c/openstack/diskimage-builder/+/83886315:41
mnasiadkaHello - is someone working on ubuntu-jammy-arm64? Only x86_64 is available at the moment from what I see15:54
clarkbmnasiadka: we're taking it one step at a time right now. x86 jammy isn't fully functional yet I don't think15:54
clarkbmnasiadka: the next steps for arm64 are pruning our AFS disk usage a bit more so that we can add the arm64 jammy mirror. And then we can add the image15:55
mnasiadkagreat, thanks - just asking15:55
clarkbyesterday we removed ubuntu-ports source packages which pruned about 140GB of disk. Next we should prune ubuntu x86 mirror source packages15:55
clarkbfungi: ^ do you know if there is a change for that assuming we're happy with the ports results?15:55
fungithere isn't yet, but i did double-check things this morning and they look good after ianw manually pruned the mirror15:56
fungii'll get one pushed up shortly i hope, today's just been full of other distractions15:56
clarkbthanks!15:57
clarkband ya that was a good improvement on the ports side, would be good to get that applied to ubuntu and eventually debian once we're set with zuul-jobs' configure-mirrors15:57
mtreinishgmann: I think we'll have to update the tempest removal process. The data was missing from subunit2sql anyway the last time I looked because the prune cron job deleted everything after the newer oslo-db/sqlalchemy version got pulled into the worker and broke the updates16:03
mtreinishso there shouldn't be anything blocking deleting the db16:03
fungithanks for confirming, mtreinish!16:04
gmannmtreinish: thanks, yeah I will update the tempest process. 16:07
opendevreviewClark Boylan proposed zuul/zuul-jobs master: Enable deb-src on Debian with a flag in configure-mirrors  https://review.opendev.org/c/zuul/zuul-jobs/+/83959216:17
opendevreviewClark Boylan proposed zuul/zuul-jobs master: Switch enable_src_repos to False in configure-mirrors  https://review.opendev.org/c/zuul/zuul-jobs/+/83959316:17
opendevreviewClark Boylan proposed zuul/zuul-jobs master: Add enable_deb_src_repos support for Ubuntu  https://review.opendev.org/c/zuul/zuul-jobs/+/83959416:17
*** dviroel|rover|lunch is now known as dviroel|rover16:18
opendevreviewClark Boylan proposed zuul/zuul-jobs master: Add enable_deb_src_repos support for Ubuntu  https://review.opendev.org/c/zuul/zuul-jobs/+/83959416:18
opendevreviewMaksim Malchuk proposed openstack/diskimage-builder master: Revert "Fallback to persistent netifs names with systemd"  https://review.opendev.org/c/openstack/diskimage-builder/+/83886316:29
corvusclarkb: fungi any zuul weirdness overnight?16:34
clarkbcorvus: the only thing I saw was the cloud arp fight between instances. Nothing on the zuul end16:34
corvusthe buildsets page looks a lot more sane to me (which is the main thing i was brown-bag-fixing)16:36
corvusi'll go ahead and push the tag now16:36
opendevreviewJeremy Stanley proposed openstack/diskimage-builder master: Switch to release-notes-jobs-python3  https://review.opendev.org/c/openstack/diskimage-builder/+/83959916:43
fungicorvus: i concur, nothing out of the "ordinary" (or what passes for ordinary in these parts)16:45
clarkbmtreinish: thank you for confirming16:50
clarkbinfra-root I'm going to proceed with deleting the subunit2sql trove mysql db now16:50
clarkb5d0dc5ac-7a3e-4568-9acb-e62406129666 subunit2sql-MySQL <- that is the instance that will be deleted16:53
clarkbit hosts a mysql db named subunit2sql with query and subunit2sql users. That all looks like the correct one to me so I'm going to click the button now16:54
clarkband done16:55
clarkbit doesn't show up in the last anymore16:55
clarkbs/last/list/16:55
clarkbinfra-root any objection to me approving the puppet config management retirements for ELK and subunit2sql/health now?16:56
clarkbfungi reviewed them and abandoned changes16:56
opendevreviewMerged zuul/zuul-jobs master: Enable deb-src on Debian with a flag in configure-mirrors  https://review.opendev.org/c/zuul/zuul-jobs/+/83959216:58
*** jpena is now known as jpena|off17:01
clarkb#status log Deleted the old subunit2sql database now that the OpenStack CI Health dashboard and subunit2sql workers have been removed.17:02
opendevstatusclarkb: finished logging17:02
fungiclarkb: feel free to self-approve, or i'm happy to single-core approve them17:04
TheJuliaWhat would it take to get two VMs part of a multinode job held to permit manual investigation of the state?17:07
clarkbTheJulia: you just need to let us know what job and change, then we mark it and you recheck if necessary. Then once failed we can add your ssh key to the hosts17:07
clarkbfungi: ya I'll go ahead and start approving them17:07
TheJuliachange 839086,9, job ironic-grenade-multinode-multitenant. Presently running at the moment.17:09
fungii can add a hold for that17:09
TheJuliaMuch appreciated!17:09
opendevreviewMerged opendev/puppet-subunit2sql master: Retire this project  https://review.opendev.org/c/opendev/puppet-subunit2sql/+/83924217:09
opendevreviewMerged opendev/puppet-logstash master: Retire this project  https://review.opendev.org/c/opendev/puppet-logstash/+/83924017:09
opendevreviewMerged opendev/puppet-log_processor master: Retire this project  https://review.opendev.org/c/opendev/puppet-log_processor/+/83923917:09
opendevreviewMerged opendev/puppet-elasticsearch master: Retire this project  https://review.opendev.org/c/opendev/puppet-elasticsearch/+/83923817:09
opendevreviewMerged opendev/puppet-kibana master: Retire this project  https://review.opendev.org/c/opendev/puppet-kibana/+/83923717:10
TheJuliaUnfortunately it looks like traffic just doesn't make it across the vxlan tunnel, and I've solved the most recent two errors afaik, and it still looks like from what has scrolled across the screen that this is the case... :\17:10
clarkbvxlan itself should be fairly stable considering the number of jobs that make use of it. Wouldn't surprise me if it is a route or firewall issue (and holding nodes in that case can definitely help)17:11
TheJuliayeah, that is kind of what I'm thinking at this point17:12
clarkbfungi: https://review.opendev.org/c/openstack/project-config/+/839243 is the change to finalize those repo retirements17:12
fungi| 0000000040 | openstack | opendev.org/openstack/ironic | ironic-grenade-multinode-multitenant | refs/changes/86/839086/9 |     1     | TheJulia troubleshooting XVLAN connectivity problems |17:13
fungiTheJulia: when that build fails, let me know what ssh key(s) you want added to the nodes for it17:13
TheJuliafungi: awesome17:14
TheJuliafungi: ended https://paste.openstack.org/show/bHHmaYhfkP2YOm1qCy1q/18:16
fungiTheJulia: ssh root@198.72.124.82 and 198.72.124.12218:20
TheJuliafungi: much appreciated, I'll let you know when I'm done18:20
TheJuliahopefully this will be quick and easy18:20
fungimy pleasure. have "fun"18:20
TheJulialol18:21
clarkbfungi: I'm thinking for cleanup of the debian source packages we can probably wait until we flip the default to false in zuul-jobs? Or do you think it is prudent to update our base job?18:33
clarkbI suspect that clearing out ubuntu and ubuntu-ports source packages will be sufficient to free up plenty of room for arm64 jammy packages18:33
clarkbjust based on the numbers we already got from cleaning up -ports18:33
fungiclarkb: i have it noted in the commit message, just a moment18:38
opendevreviewJeremy Stanley proposed opendev/system-config master: Stop mirroring source packages for debian-docker  https://review.opendev.org/c/opendev/system-config/+/83962118:38
opendevreviewJeremy Stanley proposed opendev/system-config master: Stop mirroring source packages for ubuntu  https://review.opendev.org/c/opendev/system-config/+/83962218:38
opendevreviewJeremy Stanley proposed opendev/system-config master: Stop mirroring source packages for debian  https://review.opendev.org/c/opendev/system-config/+/83962318:38
fungisince i don't see a wip change to switch the default yet, i'll wip that last change until there's something it can depends-on18:39
fungibut to answer your question, i would wait for the zuul-jobs change unless we want to serve as a canary and add an override in our base job18:41
fungii agree we're probably in no real hurry for the debian change, ubuntu should free plenty already18:42
fungithe only reason i can think of to override in base now is to make sure the non-default value works. but it's ultimately 3 changes instead of just 1 so i wouldn't unless we're worried it won't18:43
fungi(3 changes to override in base-jobs, switch the default in zuul-jobs, and then clean up the redundant override in base-jobs)18:44
Clark[m]fungi: https://review.opendev.org/c/zuul/zuul-jobs/+/839593 is the change you can depends on18:44
fungimore than 3 changes if we also want to be cautious and exercise one or both base-jobs changes in base-test too18:44
fungiaha, thanks, i missed 83959318:44
opendevreviewJeremy Stanley proposed opendev/system-config master: Stop mirroring source packages for debian  https://review.opendev.org/c/opendev/system-config/+/83962318:45
*** artom__ is now known as artom18:53
clarkbfungi: I think https://review.opendev.org/c/opendev/system-config/+/839621/1/playbooks/roles/reprepro/files/debian-security/config/distributions was meant to go in the last change of the stack not the first?19:09
clarkbfungi: if you have time for https://review.opendev.org/c/openstack/project-config/+/839243 that would be great too19:18
fungiclarkb: oops, thanks!19:27
opendevreviewJeremy Stanley proposed opendev/system-config master: Stop mirroring source packages for debian-docker  https://review.opendev.org/c/opendev/system-config/+/83962119:31
opendevreviewJeremy Stanley proposed opendev/system-config master: Stop mirroring source packages for ubuntu  https://review.opendev.org/c/opendev/system-config/+/83962219:31
opendevreviewJeremy Stanley proposed opendev/system-config master: Stop mirroring source packages for debian  https://review.opendev.org/c/opendev/system-config/+/83962319:31
clarkbthat's weird, the diff between patchsets doesn't show a diff because the parent always had the change before19:34
opendevreviewGage Hugo proposed opendev/irc-meetings master: Update security sig chair details  https://review.opendev.org/c/opendev/irc-meetings/+/83963219:34
clarkbunfortunate that it makes updates like this difficult to see but at least it makes sense19:34
fungiyeah19:39
fungirebases are always a bit... wonky in gerrit19:39
fungiyou should be able to see the file removed from the patchset diff in the first change, and nothing different in the third change19:40
clarkbya the file isn't listed in the first two patchsets but then shows up in the third but no delta between ps1 and ps3 or ps2 and ps3 due to the parent carrying those updates previously19:42
opendevreviewMerged openstack/project-config master: Finalize ELK puppetry retirement  https://review.opendev.org/c/openstack/project-config/+/83924319:44
*** rlandy is now known as rlandy|mtg20:28
*** dviroel|rover is now known as dviroel|rover|biab21:12
*** rlandy|mtg is now known as rlandy21:27
*** rlandy is now known as rlandy|bbl22:10
TheJuliafungi: you can take back those vms now, thanks!22:24
fungithanks TheJulia! done22:25
*** dviroel|rover|biab is now known as dviroel|rover22:31
*** dviroel|rover is now known as dviroel|rover|out22:54
