Wednesday, 2022-03-30

*** marios is now known as marios|ruck05:06
ttxwoohoo, tests passed, and PTLs +1s are piling up06:19
elodillesyepp, so far so good \o/06:42
elodilleszuul load is moderate, though we would count with some load (right now i see multiple tripleo patches on the gate)06:46
elodilles* we should06:46
elodillesyesterday the gate passed for yoga-final around ~1.5 hrs. adding the post release jobs and some final tasks... i think 11:30 UTC (the original starting time we aimed for) could work: we'll have 3.5 hrs to finish the tasks06:50
elodillesmaybe if we are not that patient we can start a bit earlier :] we will see06:52
ttxyeah I would not mind pushing the button early if we have good confidence the gate is in good shape07:07
elodilles++07:17
*** amoralej|off is now known as amoralej07:19
opendevreviewAlfredo Moralejo proposed openstack/releases master: Create new bug release in openstacksdk for wallaby  https://review.opendev.org/c/openstack/releases/+/83578808:07
*** marios|ruck is now known as marios|ruck|lunch10:01
*** marios|ruck|lunch is now known as marios|ruck10:30
fungii'm not quite around yet, but can be if the button-pushing is starting earlier than discussed10:33
elodillesfungi: ack, thx10:35
elodillessince we passed 10:30 UTC we have less than an hour for starting the machinery :]10:38
elodillesif we start somewhat sooner, around 11:00 UTC then we have even less time :-o10:38
ttxI +2ed it10:41
fungizuul looks reasonably quiet and i don't see any obvious problems indicated on the status graphs10:42
fungiyeah, stuff is still passing jobs and merging10:47
fungiand there's no real backup10:47
elodillesttx fungi : roger!10:49
elodillesthis also looks good so far: https://status.python.org/10:50
fungiyou can always try to merge a release-test release as a canary to make sure, but i wouldn't worry about it since we were tagging release candidates as late as yesterday10:55
elodillesyepp, unfortunately (& fortunately) that was not so long ago :S10:56
ttxlet's go!11:02
elodillesso we have passed 11:00 UTC & zuul & python infrastructure looks good11:03
elodilleslet's start then!11:03
ttxyay!11:03
fungisounds good11:03
fungii'm caffeinating now11:03
elodilles+2+W'd11:04
elodilleszuul has picked it up! it's on the gate!11:05
fungiyep, lookin' good so far11:05
elodillesjobs are in the queue: https://zuul.opendev.org/t/openstack/status#83532211:05
fungiand nodes are getting assigned now11:10
ttxI have a meeting at 12:30utc that should last about an hour11:10
elodillesttx: ack11:11
fungii guess you'll have to turn down the release soundtrack for a little while11:11
elodillesthe job is expected to finish around that time and post-release jobs will start afterwards11:12
*** dviroel|out is now known as dviroel11:17
opendevreviewMerged openstack/releases master: Yoga final releases for cycle-with-rc projects  https://review.opendev.org/c/openstack/releases/+/83532211:36
fungiw00t11:36
fungiand now, the tags11:37
fungitag-releases is already running11:38
elodillesoh, it was quite fast!11:40
elodillesi guess because we don't need the list-changes job here11:40
elodilleslet's see how the tagging goes11:40
fungiit's pushed a bunch of them already11:41
elodillesyepp, patches are gathering in 'release' queue11:43
fungihttps://pypi.org/project/adjutant-ui/ "4.0.0 Released: 1 minute ago"11:44
ttxit's happening11:51
elodilles:]11:57
elodillestag & release queues are quite busy11:57
fungiper discussion in #opendev the iweb nodes (~20% of our quota) are impacted by api-related issues at the moment so i've emergency approved a change to stop booting new nodes there while they work on fixing it11:58
fungibut it looks like we have a lot of available capacity right now anyway so it shouldn't impact things11:58
fungiand the problem in the provider is dns-related leading to inaccessibility of the openstack apis, so shouldn't impact running builds11:58
fungijust wanted to let everyone know11:59
fungiunfortunate timing, but it is what it is11:59
fungiin good news. zuul doesn't even seem to be breaking a sweat processing all these tags12:02
elodillesfungi: ack, thanks for the heads up12:04
elodillestag queue is still highly loaded with 'publish-openstack-releasenotes-python3' jobs for releases (67 at the moment)12:06
elodillesbut release queue size is slowly decreasing (26 at the moment)12:07
elodillessoon i should start the missing-releases command if i'm not mistaken12:08
fungii think the releasenotes jobs ended up using a semaphore to serialize them so they wouldn't step on one another, right? i'll double-check that's why they're piling up12:08
elodillesno, they are not interfere with each other, they are just added in the queue :)12:12
elodilleseverything seem OK so far. fingers crossed :X12:13
fungithe disruption in iweb is over, according to mgagne12:13
elodilles\o/12:14
elodillesrelease queue is estimated to be empty in a minute12:14
fungiconfirmed, https://zuul.opendev.org/t/openstack/job/publish-openstack-releasenotes-python3 shows the builds all share a semaphore called publish-releasenotes12:15
fungithat's why it's only running one of them at a time12:15
*** amoralej is now known as amoralej|lunch12:16
fungithat semaphore was added ~1.5 years ago in https://review.opendev.org/75655312:16
elodilleswe are not in a hurry, so that's OK :)12:17
fungicommit message confirms it was added to address publication races12:17
elodillesone last 'announce-release' job is running (~3 mins left)12:18
elodillesstarted to run the 'missing-releases' command locally12:20
fungiseems the releasenotes jobs spend a lot of time at "TASK [submit-logstash-jobs : Submit logstash processing jobs to log processors]" which we're due to rip out when we take the old logstash system offline12:20
fungishould have been gone already but there were delays with the replacement12:21
elodillesno worries we are still in time i guess :)12:23
elodilleshmmm, what I see so far, 2 missing wheels:12:23
elodillesdid not find python 3 wheel https://tarballs.openstack.org/ansible-role-atos-hsm/ansible_role_atos_hsm-3.0.0-py3-none-any.whl12:23
elodillesdid not find python 3 wheel https://tarballs.openstack.org/ansible-role-thales-hsm/ansible_role_thales_hsm-3.0.0-py3-none-any.whl12:23
fungithose are from this release?12:23
elodillesthis might be related to the setuptools issue? :/12:23
elodillesyepp12:24
elodillesand one more:12:24
elodilles  did not find python 3 wheel https://tarballs.openstack.org/heat-tempest-plugin/heat_tempest_plugin-1.5.0-py3-none-any.whl12:24
fungii don't see them listed at https://lists.openstack.org/pipermail/release-job-failures/2022-March/thread.html12:25
elodilleshowever 'found python 2/3 wheel' is there for them12:25
fungiyeah, that's probably a project that forgot to drop universal from their setup12:25
elodilleshopefully they're just false negative signals12:25
elodillesfungi: hmm, that's quite possible12:26
fungiansible-role-atos-hsm 3.0.0 was tagged today at 11:40z12:26
fungihttps://pypi.org/project/ansible-role-atos-hsm/#files shows a ansible_role_atos_hsm-3.0.0-py2.py3-none-any.whl12:27
fungiso same situation12:27
elodillesyepp, probably all the same12:28
fungiyeah, just checked, same for all12:28
fungiso those projects need to fix their setup.cfg12:28
funginothing brok in the release jobs12:28
fungier, broke12:28
elodilles++12:31
fungii assume we don't need to wait for all the releasenotes builds to complete i order to announce the release, if it comes down to that?12:35
fungier, in order12:36
fungiat the current run rate we're probably looking at half a day for those to complete12:40
elodillesyes, i think so, though we are in time, aren't we?12:40
elodillesfungi: oh, i see12:40
fungithey're taking around 5-10 minutes each, plus some delay between, and we have >60 still to go12:41
elodillesi thought the jobs take 1-2 minutes each12:41
fungisee the duration column at https://zuul.opendev.org/t/openstack/builds?job_name=publish-openstack-releasenotes-python312:42
fungithey may each be taking a couple minutes extra due to whatever's going on with the old logstash systems12:42
fungiwhat's especially unfortunate though is that the semaphore was added to that job in order to protect against publication races when two tags are pushed at the same time for different branches of the same project. there's really no need to serialize these across different projects12:43
elodillesthe missing-release command is finished, we have 3 issues (the above mentioned ones doesn't seem to appear there, so the script knew they are false negatives): https://paste.opendev.org/show/btkRzLl3C1h9SstBLQdc/12:43
elodillesthe problem is with tap-as-a-service12:44
elodillesif i'm not mistaken it was manually released in the past12:44
elodillesfungi: any idea what we could do regarding tap-as-a-service12:46
fungii'm looking12:46
fungithis was the python release build for it: https://zuul.opendev.org/t/openstack/build/bb62e96739f54f869a55e46553a5edc412:47
elodillesi don't see failing job in the log :-o 12:48
fungioh, https://tarballs.openstack.org/tap-as-a-service/ is redirecting to https://tarballs.opendev.org/x/tap-as-a-service/ because it was moved out of openstack originally12:50
elodillesyes, it was moved ~half a year ago if i'm not mistaken :-o12:51
fungiright, and the old redirect for it never got undone12:52
fungiyou can see the new releases here: https://static.opendev.org/project/tarballs.opendev.org/openstack/tap-as-a-service/12:52
fungii'll push up a change to take out the redirect, and i'll separately copy the old releases from x/tap-as-a-service to openstack/tap-as-a-service12:53
elodillesfungi: thanks \o/12:53
fungihttps://review.opendev.org/835829 should solve it13:00
fungiand thinking about it more, i probably shouldn't copy the old tarballs into the new (or new again) location, since those weren't made when the project was an official part of openstack anyway13:01
*** amoralej|lunch is now known as amoralej13:03
elodilleshmm13:03
fungithe releases made as part of openstack will be reachable once that redirect removal merges and deploys though13:06
elodillesawesome, thanks \o/13:07
elodillesshould we continue with the next tasks?13:08
fungii expect so, unless you just want to wait for as many release notes to be updated as possible before the deadline13:09
elodillesthe www update & mark series as released ones13:10
elodillesthis: https://review.opendev.org/c/openstack/openstack-manuals/+/83536713:10
elodillesand this: https://review.opendev.org/c/openstack/releases/+/835323/13:10
fungibut i doubt anyone will be confused if some of them aren't updated until after the announcement (i doubt anyone will even notice)13:10
fungiyeah, seems like those should be safe to merge now13:10
elodillesfor the www update patch i wrote: "(short) before 14:00 UTC update the doc pages with: [www] Set yoga as released (the pages needs ~1 hrs to refresh, so needs to be done well in time)"13:11
fungido you have approval permissions for openstack-manuals, or do we need to find someone?13:11
elodillesfungi: i guess if i remove my -W then Andreas' +W will start the job13:12
elodillesif i'm not mistaken :)13:12
fungioh, yep that works. you may need to add a recheck since i'm not sure the workflow -1 to 0 switch will enqueue it into the gate13:13
elodilleslet's find it out13:13
fungithough if rechecking is necessary it has to be in a separate comment anyway (recheck comment when adding or changing a vote gets ignored because it fails to match the comment regex for the pipeline trigger)13:13
elodillesi'll wait 1 or 2 mins and recheck if zuul doesn't pick it up13:15
elodilles-W removed13:15
fungilooks like it's in the gate, so that worked13:19
elodilleswell, Andreas -W & +W'd it :)13:19
fungiyep, just saw, so that's what actually did it13:19
fungibetter than a recheck since we don't have to wait for it to go back through check13:20
elodillesyepp :)13:22
elodillesttx: in case the meeting is over, then please review this: https://review.opendev.org/c/openstack/releases/+/835323/13:22
elodilleswe are in time with this i think13:23
elodillesso far :X fingers crossed13:23
fungii'm going to step away momentarily to get a shower, since things seem to be on track13:35
elodillesfungi: ack13:35
elodilleswww patch has merged. we just need to wait until page is refreshing and we'll see it on docs.o.o13:36
ttxI'm back13:37
ttx+2 on the "released" patch13:39
ttxelodilles: did you run missing-releases?13:40
ttxquickly running a link checker to be extra sure13:42
elodillesttx: yes, i ran it and found tap-as-a-service missing (for which fungi tracked down the root cause and started to apply fixes, see the discussion ^^^)13:49
elodillesttx: i'm about to +W the 'released' patch now then13:51
ttx+113:51
elodilleshttps://docs.openstack.org has been refreshed! it shows now Yoga as the current supported release \o/13:54
fungiawesome13:54
ttxfungi: my link checker only found broken links in the header/footer. Quite a few though14:03
opendevreviewMerged openstack/releases master: Mark Yoga as released  https://review.opendev.org/c/openstack/releases/+/83532314:05
elodillesttx: which ones? tap-as-a-service or the ansible-role-*-hsm wheels?14:05
ttxhttps://etherpad.opendev.org/p/xJvP9GdInAl69JMw7mf014:05
ttxelodilles: no outside of the release content, in the footer and header14:05
ttxI think those are in some theme we need to fix14:06
elodilleshmm, i see.14:07
ttxonly 3 are really broken14:09
ttxI'll push a fixfor them14:11
elodilles(meanwhile, releases patch has merged, but releases.o.o is not yet updated)14:14
elodillesi don't even find the missing pages. security i guess is https://security.openstack.org but don't know the others :S14:22
opendevreviewThierry Carrez proposed openstack/releases master: Fix links in releases.o.o footer  https://review.opendev.org/c/openstack/releases/+/83583414:23
ttxThis syncs it with the one used in openstack.org14:23
elodillesreleases.o.o has been refreshed as well: https://releases.openstack.org/14:23
ttxLet's wait for the docs build to approve it14:23
ttxI think we are all set14:24
ttxFeel free to send emails, I'm preparing a tweet :)14:24
elodillesttx: your patch won't fix the links in docs.o.o, right?14:25
elodillesas i see it has the same wrong links :S https://docs.openstack.org14:26
elodillesin the footer14:26
ttxI only care about releases.o.o :) Yes it will only fix the customized copy of the theme we use on releases-side14:27
elodillesack :)14:27
elodillesso i'm preparing the announcement mail \o/14:28
elodillesttx: "Remind Foundation staff to switch openstack.org Yoga messaging on" do we need this? or you've already done that? o:)14:29
elodillesttx: i see that you added 'confirmed ok' to 'Coordinate the timing of the email with the press release from the Foundation staff'14:29
ttxChecking14:30
elodilles(i'm ready to send the announcement mail meanwhile)14:31
fungiand i'm available to approve it through the moderation queue14:37
elodillesfungi: ack14:41
elodilleshmmm, the 'stay in touch' part of the generated docs in the footer looks weird: https://309a0a1a1f63db90b41e-6e6a98e42a02be15877850d4946fa43f.ssl.cf2.rackcdn.com/835834/1/check/openstack-tox-docs/679b370/docs/index.html14:42
elodillesthough i don't know whether it is just wrong there or will be the same once we merge the 'fix links' patch14:42
ttxYou can announce elodilles 14:44
ttxLet me know when the mail is sent so I can push my tweet :)14:44
gmann\o/. thanks release team for all your effort.  14:45
elodillesfungi: mail is sent to openstack-announce14:46
elodillesgmann: \o/ thanks too \o/14:47
fungiwaiting for it to arrive at the listserv14:47
fungiseems like it may be taking a minute14:47
fungielodilles: you're subscribed to openstack-announce, right?14:48
elodillesfungi: nope :S14:48
fungioh, then your post may be automatically rejected14:48
ttxelodilles: do you have a twitter handle I can tag?14:48
elodillesfungi: let me subscribe quickly14:49
fungielodilles: yeah, looks like mailman sent you a rejection notice14:49
fungi:q14:50
fungihah, you're not my vi process14:50
elodillesi've subscribed now14:51
elodillesi mean i guess i need some verification first :)14:51
fungiyeah, it should send you a confirmation token by e-mail14:52
fungicheck your junk mail if it's not in your inbox14:52
elodillesok, verified, resending the announcement14:52
fungiawesome, standing by to approve14:52
elodillessent14:54
fungiyep, got it. double-checking the sample really quickly14:54
elodillesttx: well, i don't have twitter so i'll rely on you about twitter announcement o:)14:54
elodillesfungi: ack14:55
ttxalright :)14:55
fungiapproved it14:55
elodillesfungi: \o/14:55
elodilleshere we have it: http://lists.openstack.org/pipermail/openstack-announce/2022-March/002059.html14:55
fungiyep. looks good14:56
fungiright on schedule14:56
fungiho hum, yet another on-time release14:56
elodillesjust. in. time. :D14:56
ttxyay!14:57
gmann /o\14:58
elodillesthanks everyone! \o/14:59
elodillesttx: thanks for the tweet :]15:00
*** marios|ruck is now known as marios15:00
*** dviroel is now known as dviroel_15:04
*** dviroel_ is now known as dviroel15:04
fungimaybe it should have been "ho hum, a 25th on-time release"15:06
elodilles:]15:06
fungithe narrow terminal for my mail client truncates the announcement subject at "OpenStack Yoga is off"15:08
fungii thought that was fun15:09
*** marios is now known as marios|ruck15:10
elodilles:S15:11
elodilles:D15:11
fungii'm headed out to my appointment in a few minutes, but looks like all the excitement here is done15:39
fungifollowing clarkb's suggestion i restarted the geard process on the logstash server which has eliminated the several-minute delay for that task during each releasenotes job, which has sped things up some, but we're probably still looking at a few hours until the remaining releasenotes builds complete15:40
elodillesfungi: ack, thanks for the help!15:42
*** dviroel is now known as dviroel|lunch15:44
clarkbfeel free to ping me if something comes up in the next hour while fungi is out too15:45
*** amoralej is now known as amoralej|off15:50
elodillesclarkb: ack, thanks in advance!15:52
*** marios|ruck is now known as marios|out16:20
fungiokay, i'm back16:54
fungi27 releasenotes jobs in the tag pipeline still16:59
*** dviroel|lunch is now known as dviroel17:00
elodillesit's progressing slowly but steady17:20
opendevreviewElod Illes proposed openstack/releases master: Fix links in releases.o.o footer  https://review.opendev.org/c/openstack/releases/+/83583418:26
fungithe very last releasenotes build just started. ~8.75 hours start to finish for all of those to run serially20:39
elodillesit really took a while :)20:41
elodillesbut we are almost there :D20:41
fungiand done!20:42
elodilles\o/20:42
fungiwe should really figure out if there's a better way to not have rsync writes collide so those can safely be run in parallel instead21:04
*** dviroel is now known as dviroel|out21:16
fungilooking at the rsync manpage, it's possible we could use --temp-dir and make sure to avoid --delay.* and --partial.* options21:59
clarkbfungi: aiui it was fine before it just raised a failure and that made people concerned. Maybe have a check of the content after the rsync to see if it is complete and not fail if so?21:59
fungiclarkb: the underlying problem is that rsync creates tempdirs in the same directories as the files being replaced, and --delete wants to remove another concurrent processes tempdirs in the same tree22:00
fungiif we told it not to create tempdirs in the same place as where files are being deleted, we could probably work around it (except in the rare case where two releasenotes builds try to update the same index.html at the exact same moment)22:01
fungi--inplace might be another option, as i think that may avoid creating tempfiles/tempdirs entirely22:02
fungimost of these features are trying to solve problems we don't have since we write to a read-write afs volume and then separately release that to the read-only replicas22:03
fungiso things like atomic file replacement are less critical as we have atomic file*system* replacement22:04
clarkbah yup since we do that on the whole tree22:04
clarkbthat makes sense22:04
fungianother approach might be to have rsync wait if it sees evidence another rsync is already updating the same tree (might be toctou racy tho) or have it retry on error (might lead to hysteresis/livelock between processes?)22:07
fungibut probably asking rsync to do less is the simplest solution22:07
fungiit's smart about some things we want, but its attempts at being smart about other things we don't need are getting in the way22:08
*** ianw_pto is now known as ianw22:24

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!