Wednesday, 2021-11-10

<opendevreview> Ian Wienand proposed openstack/diskimage-builder master: containerfile: handle errors better  https://review.opendev.org/c/openstack/diskimage-builder/+/817139 [00:01]
<opendevreview> Ian Wienand proposed openstack/diskimage-builder master: Revert "centos 9-stream: make non-voting for mirror issues"  https://review.opendev.org/c/openstack/diskimage-builder/+/817313 [00:01]
<opendevreview> Ian Wienand proposed openstack/diskimage-builder master: containerfile: fix tar extraction  https://review.opendev.org/c/openstack/diskimage-builder/+/817317 [00:01]
*** mazzy5098811 is now known as mazzy509881 [00:10]
<opendevreview> Ian Wienand proposed openstack/project-config master: Pause Fedora 34 builds  https://review.opendev.org/c/openstack/project-config/+/817318 [00:11]
<clarkb> I went ahead and fast approved ^ since I had looked over the related work [00:12]
*** mazzy5098812 is now known as mazzy509881 [00:18]
<clarkb> rosmaita: jrosser_: note I left some comments on your changes unrelated to the zuul trouble [00:24]
<opendevreview> Merged openstack/project-config master: Pause Fedora 34 builds  https://review.opendev.org/c/openstack/project-config/+/817318 [00:24]
*** mazzy5098814 is now known as mazzy509881 [00:42]
<opendevreview> Ian Wienand proposed openstack/diskimage-builder master: centos 9-stream: make non-voting for mirror issues  https://review.opendev.org/c/openstack/diskimage-builder/+/817312 [00:44]
<opendevreview> Ian Wienand proposed openstack/diskimage-builder master: containerfile: fix tar extraction  https://review.opendev.org/c/openstack/diskimage-builder/+/817317 [00:44]
<opendevreview> Ian Wienand proposed openstack/diskimage-builder master: containerfile: handle errors better  https://review.opendev.org/c/openstack/diskimage-builder/+/817139 [00:44]
<opendevreview> Ian Wienand proposed openstack/diskimage-builder master: Revert "centos 9-stream: make non-voting for mirror issues"  https://review.opendev.org/c/openstack/diskimage-builder/+/817313 [00:44]
<corvus> clarkb: i think we don't call loadTPCs in the layout update path [00:58]
<corvus> i may need to include an event with branch cache ltimes in my test case [01:00]
<clarkb> I would agree that scheduler.py seems to loadTPCs when validating, priming and reconfiguring [01:01]
<clarkb> but those are the only instances of the loadTPCs calls [01:02]
<corvus> clarkb: and i think the cacheConfig call you were looking at is called once every time, but it's caching things onto the tpcs, so if we're reusing them when we expect to have empty ones... [01:03]
<corvus> still can't repro locally though [01:04]
<corvus> i think i reproduced it [01:19]
<Clark[m]> Yay [01:20]
<corvus> throwing in a loadTPCs call does fix it.  now, i need to try to clean this test up; it's a mess. [01:28]
<corvus> (it's just a big pile of changes, reconfigurations, and sleeps that i threw at it until it broke) [01:28]
<Clark[m]> Kitchen sink debugging [01:40]
*** diablo_rojo is now known as Guest5475 [01:59]
<opendevreview> Takashi Kajinami proposed openstack/project-config master: Retire puppet-senlin - Step 1: End project Gating  https://review.opendev.org/c/openstack/project-config/+/817324 [02:14]
<opendevreview> Takashi Kajinami proposed openstack/project-config master: Retire puppet-senlin - Step 3: Remove Project  https://review.opendev.org/c/openstack/project-config/+/817327 [02:20]
<opendevreview> chandan kumar proposed opendev/system-config master: Enable mirroring of centos stream 9 contents  https://review.opendev.org/c/opendev/system-config/+/817136 [03:31]
<ianw> https://review.opendev.org/c/openstack/diskimage-builder/+/817312/2 just failed with "Nodeset ubuntu-bionic-2-node already defined" [04:01]
<Clark[m]> ianw: the fix for that is corvus' most recent change pushed to zuul [04:03]
<Clark[m]> Hopefully we can restart with that fix tomorrow once people review it [04:03]
<ianw> thanks, i figured that one, wasn't sure if we'd seen it on other changes.  probably a good sign to walk away :) [04:05]
<opendevreview> Ian Wienand proposed openstack/project-config master: Set debian-stretch to min-ready: 0  https://review.opendev.org/c/openstack/project-config/+/817338 [04:11]
<opendevreview> Ian Wienand proposed openstack/project-config master: Remove debian-stretch nodes and builds  https://review.opendev.org/c/openstack/project-config/+/817339 [04:11]
<opendevreview> Ian Wienand proposed opendev/system-config master: reprepro: stop mirroring Debian stretch  https://review.opendev.org/c/opendev/system-config/+/817340 [04:12]
*** ysandeep|out is now known as ysandeep [05:33]
<akahat|rover> hello [08:38]
<akahat|rover> on zuul.opendev.org/ queue: tripleo is stuck [08:39]
<akahat|rover> we can see there are some jobs which are pending since 11 hrs [08:39]
<akahat|rover> some jobs in queue * [08:39]
*** ysandeep is now known as ysandeep|lunch [08:41]
<soniya29|ruck> tripleo-ci-centos-8-containers-multinode and tripleo-ci-centos-8-standalone-upgrade-victoria are few of them [08:41]
<soniya29|ruck> here is the console log:- https://zuul.openstack.org/stream/e3488a9056d2422983bbdc140b6f487e?logfile=console.log [08:43]
*** akahat|rover is now known as akahat|lunch [08:44]
*** ykarel is now known as ykarel|lunch [08:51]
*** akahat|lunch is now known as akahat|rover [09:13]
*** ysandeep|lunch is now known as ysandeep [09:23]
<frickler> corvus: I'm seeing an empty "Queue:" header for every patch in check and other pipelines, is this a known issue? for gate, "Queue: internal" etc. looks o.k. [09:33]
<frickler> also that "nodeset already defined" seems to be happening quite often, hopefully we can get that fixed soon, otherwise we should maybe revert to one node until we can [09:35]
<frickler> akahat|rover: soniya29|ruck: I don't see anything being stuck, just a lot of patches plus gate resets due to failures [09:36]
<akahat|rover> frenzy_friday, https://zuul.opendev.org/t/openstack/status#817233, jobs: tripleo-ci-centos-8-containers-multinode, tripleo-ci-centos-8-standalone-upgrade-victoria [09:42]
<akahat|rover> frickler, ^^ [09:42]
<akahat|rover> it is running for 12 hr 37 mins [09:43]
<akahat|rover> 817106, 817260 this ids jobs already ran.. but they still are in queue. [09:46]
<frickler> hmm, the console logs for all the jobs that are still being shown as in progress in the ui for those jobs show "build id not found" [09:53]
<frickler> so something is indeed broken, maybe you want to abandon/restore the affected patches, otherwise we'll need to wait for corvus [09:55]
*** ykarel|lunch is now known as ykarel [09:57]
<akahat|rover> frickler, okay we will wait for corvus. [09:58]
*** melwitt is now known as Guest5508 [10:12]
<ysandeep> folks o/ https://zuul.openstack.org/status#heat Some of the jobs are waiting for too long for a node , "Build ID 8b3c99ddb0d94a999def904873717e1d not found" Do we have a known issue? [11:06]
*** dviroel|out is now known as dviroel [11:16]
<opendevreview> Tristan Cacqueray proposed opendev/statusbot master: Add Etherpad backend  https://review.opendev.org/c/opendev/statusbot/+/807946 [11:55]
*** soniya29|ruck is now known as soniya29|ruck|afk [12:23]
<Alex_Gaynor> Jobs on pyca/cryptography don't appear to be starting, on https://zuul.opendev.org/t/pyca/status/ I see our queue constantly hanging around at 8. Known issue? [12:29]
<frickler> Alex_Gaynor: we have some known issues currently, but I'm not sure whether this is related or now. clarkb, corvus ^^ [12:30]
<frickler> s/now/not/ [12:30]
*** ysandeep is now known as ysandeep|afk [12:32]
*** ysandeep|afk is now known as ysandeep [13:08]
*** jpena|off is now known as jpena [13:21]
*** soniya29|ruck|afk is now known as soniya29|ruck [13:33]
<noonedeadpunk> well and jobs are not scheduled for gates for us as well [13:37]
<noonedeadpunk> and all gate jobs looks like stuck ones atm [13:38]
*** rlandy|ruck is now known as rlandy|ruck|mtg [14:01]
<corvus> i'm looking into the error causing the stuck queues [14:15]
<tristanC> the cacti graphs for zookeeper seems correct, though there is a suspicious spike about 9h ago on the network page [14:17]
<fungi> frickler: there's already a fix for the "nodeset already defined" problem, hopefully the new patchset makes it in shortly ( https://review.opendev.org/817328 ) [14:22]
<fungi> tristanC: openstack periodic and periodic-stable pipelines trigger their jobs at 02:00-02:01 utc, which seems to line up with the start of the burst at http://cacti.openstack.org/cacti/graph.php?action=view&local_graph_id=70044&rra_id=all [14:24]
<corvus> i'm trying to save the debug info i need, but zk-shell isn't helping; i need to write a quick script [14:41]
<corvus> okay, i've saved a copy of the zk data, and i think we should restart now.  probably the best thing to do is restart on .4, then we'll switch back once the in-flight changes have landed.  sound good? [14:50]
<corvus> i'm going with that [14:52]
<corvus> stopped; deleting state now [14:53]
<tristanC> corvus: that sounds good. so you now have a dump of all the zk nodes? [14:53]
<fungi> thanks corvus, i agree [14:54]
<fungi> i expect we'll lose anything in the event queue with the downgrade to .4, as our saved change queues are all that will get reenqueued [14:55]
<corvus> tristanC: yes [14:57]
<corvus> starting now [14:57]
<corvus> here's my dump script: https://paste.opendev.org/show/810906/ [14:58]
<fungi> oh, that's handy [14:59]
<corvus> there was a deserialization error, and i'll need to look at the data to figure out what it was.  but i also don't know the path, so i grabbed the whole system. [15:01]
<corvus> it's something wrong with the config_errors patch i wrote :( [15:01]
<corvus> anyway, i don't think it'll be too hard to fix once i find the actual node with the error :) [15:01]
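[editor's note] The paste linked above is not reproduced in this log, so the following is only a rough sketch of what "grabbing the whole system" from ZooKeeper could look like, not corvus's actual script. It assumes a client object with kazoo-style get(path) and get_children(path) methods; the function name dump_tree and the base64 output format are made up for illustration.

```python
import base64
import json


def dump_tree(client, path="/"):
    """Recursively collect every znode at or below `path`.

    `client` is assumed to behave like kazoo's KazooClient:
    get(path) returns a (data, stat) tuple and get_children(path)
    returns a list of child names. Returns a dict mapping each
    znode path to its base64-encoded data, so binary payloads
    survive a round trip through JSON.
    """
    out = {}
    data, _stat = client.get(path)
    out[path] = base64.b64encode(data or b"").decode("ascii")
    for child in client.get_children(path):
        # Avoid a double slash when descending from the root.
        child_path = path.rstrip("/") + "/" + child
        out.update(dump_tree(client, child_path))
    return out


def save_dump(client, filename, path="/"):
    """Write the collected tree to `filename` as JSON (hypothetical helper)."""
    with open(filename, "w") as f:
        json.dump(dump_tree(client, path), f, indent=1)
```

With kazoo installed, something like KazooClient(hosts="localhost:2181") followed by start() would supply the client; the host string here is a placeholder. Dumping everything rather than one subtree matches the situation described above, where the path of the node that failed to deserialize was not yet known.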
*** ykarel is now known as ykarel|away [15:12]
<corvus> re-enqueing [15:27]
*** soniya29|ruck is now known as soniya29|ruck|dinner [15:38]
<outbrito_> Do I have to remove +W and re-add to get something back on the gate queue? [15:57]
<outbrito_> (btw, sorry for the newbie question) [15:57]
*** akahat|rover is now known as akahat|lunch [16:03]
*** akahat|lunch is now known as akahat|dinner [16:03]
<fungi> outbrito_: it depends on which tenant that's in. some tenants will send a rechecked change straight to the gate pipeline if they have sufficient approval votes, others will require a positive vote from the check pipeline and addition of a new workflow +1 [16:08]
<corvus> outbrito_: i re-enqueued all the changes that were there before; check the status page now, and if i missed yours, go ahead and reapprove or recheck [16:08]
<fungi> i think it's just the openstack tenant which will require a positive check vote before entering the gate pipeline, but also yes make sure it wasn't already reenqueued [16:09]
<fungi> it seems like there were a few events the schedulers didn't process, so never got enqueued [16:09]
*** soniya29|ruck|dinner is now known as soniya29|ruck [16:21]
*** ysandeep is now known as ysandeep|out [16:51]
*** akahat|dinner is now known as akahat|rover [16:52]
*** soniya29|ruck is now known as soniya29|ruck|out [17:01]
*** marios is now known as marios|out [17:03]
*** rlandy|ruck|mtg is now known as rlandy|ruck [17:09]
<opendevreview> Jeremy Stanley proposed opendev/statusbot master: Add use_ssl option  https://review.opendev.org/c/opendev/statusbot/+/807947 [17:10]
<opendevreview> Jeremy Stanley proposed opendev/statusbot master: Handle exception for unprivileged commands  https://review.opendev.org/c/opendev/statusbot/+/807948 [17:11]
<outbrito_> Yeah, mine was one of them. Just +W again to re-enqueue. Tks (btw, it was on starlingx/openstack-armada-app, openstack tenant) [17:18]
<outbrito_> thanks fungi corvus [17:18]
*** jpena is now known as jpena|off [17:34]
*** rlandy is now known as rlandy|ruck [17:40]
<opendevreview> Tristan Cacqueray proposed opendev/statusbot master: Introduce a BackendInterface  https://review.opendev.org/c/opendev/statusbot/+/807871 [19:27]
<opendevreview> Tristan Cacqueray proposed opendev/statusbot master: Add Etherpad backend  https://review.opendev.org/c/opendev/statusbot/+/807946 [19:27]
<opendevreview> Jeremy Stanley proposed opendev/statusbot master: Add use_ssl option  https://review.opendev.org/c/opendev/statusbot/+/807947 [19:59]
<opendevreview> Jeremy Stanley proposed opendev/statusbot master: Handle exception for unprivileged commands  https://review.opendev.org/c/opendev/statusbot/+/807948 [19:59]
<ianw> fungi/clarkb: https://review.opendev.org/c/opendev/system-config/+/817136 has some numbers on the 9-stream mirror, and including source/debug.  layout seems different to the way it was done before.  thoughts welcome either way [20:41]
<fungi> what, red hat changed things in a new release? ;) [20:42]
<fungi> and thanks, i've been trying to get around to taking a look at that one [20:42]
*** dviroel is now known as dviroel|out [21:15]
<opendevreview> Merged openstack/diskimage-builder master: centos 9-stream: make non-voting for mirror issues  https://review.opendev.org/c/openstack/diskimage-builder/+/817312 [21:51]
<opendevreview> Merged openstack/diskimage-builder master: containerfile: fix tar extraction  https://review.opendev.org/c/openstack/diskimage-builder/+/817317 [21:51]
<opendevreview> Merged openstack/diskimage-builder master: containerfile: handle errors better  https://review.opendev.org/c/openstack/diskimage-builder/+/817139 [21:56]
<opendevreview> Marco Vaschetto proposed openstack/diskimage-builder master: Allowing ubuntu element use local image  https://review.opendev.org/c/openstack/diskimage-builder/+/817481 [22:11]
<opendevreview> Merged opendev/statusbot master: Introduce a BackendInterface  https://review.opendev.org/c/opendev/statusbot/+/807871 [22:17]
<opendevreview> Merged opendev/statusbot master: Add Etherpad backend  https://review.opendev.org/c/opendev/statusbot/+/807946 [22:17]
<opendevreview> Merged opendev/statusbot master: Add use_ssl option  https://review.opendev.org/c/opendev/statusbot/+/807947 [22:22]
