Thursday, 2018-02-22

andreafpabelanger : thanks - trying that00:10
ianwhmm, subscribing to the vcs repo for nodepool/zuul install isn't quite right ... if install fails we don't detect that.  we really should use the pip provider i guess00:11
clarkbianw: you mean if the git update fails we don't detect that?00:12
ianwclarkb: no, if that install fails, when we re-run we don't try to reinstall because the git tree hasn't changed00:13
mnaseranyone against adding the link to infrastructure status in the topic?00:14
mnaseri feel like its a nice place for it there00:14
clarkbianw: gotcha00:15
fungiianw: problem with the pip package provider is that it lacks support for setting a non-default upgrade strategy, so will unconditionally fight with distro-installed packages of any of your dep tree00:15
ianwfungi: yeah, i think it's sufficiently corner case to not worry ... but how do i reset vcsrepo's idea of what it's done?00:16
fungiwe really need it to have some means of specifying --upgrade-strategy=only-if-needed00:16
fungiianw: easiest brute-force method is to git reset --hard HEAD^1 and then let puppet do its thing all over again00:17
ianwlarge hammer, i like it :)00:18
fungiianw: good news is seems to be targeted for pip 1000:18
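[editor's note] The failure mode ianw describes above (an install keyed only on "did the git tree change?" is never retried after a failed run) can be sketched as follows. This is a minimal illustration, not how puppet's vcsrepo or the nodepool/zuul manifests work; the marker file and the names needs_install, record_success and converge are hypothetical. The idea is to record the installed revision only after the install succeeds, so an unchanged tree with a failed install still triggers a retry.

```python
import json
import tempfile
from pathlib import Path


def needs_install(marker: Path, current_rev: str) -> bool:
    """Return True unless the marker records a successful install of current_rev."""
    if not marker.exists():
        return True
    state = json.loads(marker.read_text())
    return state.get("installed_rev") != current_rev


def record_success(marker: Path, current_rev: str) -> None:
    """Persist the revision that was installed successfully (hypothetical format)."""
    marker.write_text(json.dumps({"installed_rev": current_rev}))


def converge(marker: Path, current_rev: str, install) -> bool:
    """Run install() when needed; record the revision only if it succeeded.

    Returns True if an install ran, False if nothing had to be done.
    """
    if not needs_install(marker, current_rev):
        return False
    install()  # raises on failure, so the marker is left untouched
    record_success(marker, current_rev)
    return True


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        marker = Path(tmp) / "install-state.json"

        def failing_install():
            raise RuntimeError("simulated install failure")

        try:
            converge(marker, "abc123", failing_install)
        except RuntimeError:
            pass
        # The tree has not changed, but a retry is still due.
        print(needs_install(marker, "abc123"))
```

With this shape, the "git reset --hard HEAD^1" hammer fungi suggests is only needed when the trigger is the checkout itself rather than a separate success record.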
mnaserinfra-root: does anyone have a status update on the mirror rebuilding indices?  just to know if i can recheck or not yet :)00:19
pabelangerstill working on it00:19
mnaserpabelanger: not to rush things but any approximate ETA to know how long so i dont bug anyone :p00:19
pabelangermnaser: unknown, likely some time tomorrow00:20
pabelangermnaser: I'll ping once it is fixed00:20
fungipabelanger: is it still trying to resync the afsdb?00:20
mnaserpabelanger: ok cool, thank you so much and good luck :)00:20
pabelangerfungi: no, I've stopped it because there was an issue with AFS00:21
pabelangerabout to start it up again00:21
pabelanger| opensuse-tumbleweed-0000000001 | opensuse-tumbleweed | nb02    | qcow2,vhd | ready | 00:00:05:27 |00:21
openstackgerritPaul Belanger proposed openstack-infra/project-config master: Reduce duplicate data in nodepool configuration
mnaseri always thought tumbleweed was a super cool name for a rolling release00:25
ianwok, nb03 has puppetted itself, nodepool is there, config looks "right" (is probably wrong, but is deployed correctly)00:26
ianwnodepool-builder doesn't start, debugging now00:26
openstackgerritPaul Belanger proposed openstack-infra/project-config master: Bring online opensuse-tumbleweed images
pabelangerianw: clarkb: mnaser: ^next patches to boot opensuse-tumbleweed image.00:31
clarkbpabelanger: didn't we remove the images from citycloud?00:33
clarkbI would not add new images there at least00:33
pabelangerclarkb: from nodepool.yaml file, that stops them from getting uploaded00:33
pabelangerwhich is builders00:33
clarkbI see so we only did it to the builders, I guess that makes sense00:34
pabelangerclarkb: can you also get the parent too00:35
pabelangera clean up of data00:35
clarkbfungi pabelanger ianw we are about ~15 minutes away from what has become a daily gerrit slowdown. I've got a call I've got to be on but if you can maybe keep an eye out for unusual gerrit behavior and what might be causing this it would be helpful00:44
fungiclarkb: yup, not sure what to look for but i'll see if we can catch it00:45
ianwrun an iostat maybe?00:47
pabelangerfungi: up for 2 reviews? and nodepool changes00:47
clarkbfungi: ianw pabelanger my hunch is its something like stackalytics running a bunch of expensive queries00:49
clarkbthe behavior I have been able to isolate using melody is that we have a large spike in threads but not a spike in http requests00:49
fungiianw: good call. i have `iostat -t 10` going in a root screen session on review.o.o now00:50
fungiclarkb: i've got cacti and javamelody up too, though i assume those were of no help in previous instances00:51
clarkbnot really. The only thing melody helped with was showing a large spike in threads and that it didn't appear to be garbage collection related00:51
pabelangerI've restarted mirror-update.o.o to see if that help with AFS kernel modules00:54
pabelangerwe seem to be losing network access a fair bit00:54
clarkbpabelanger: do you see similar packet loss outside of afs?00:54
clarkbmtr is a nice tool to monitor that00:54
fungi83 tasks according to show-queue but a lot of those look related to replication for the ancient git tags which got pushed by release automation a little while ago00:54
fungif88e687b              22:23:28.880      [18949c8c] push cgit@git02.openstack.org00:55
clarkbcompletely unrelated but I think that grenade may be broken00:56
fungiso yeah, i think we have a replication backlog for nova for a while00:56
clarkbmy guess is because we use stestr now00:56
clarkbianw: ^00:56
pabelangerclarkb: I'll check00:56
mnaserpabelanger: ^00:56
clarkbmnaser: thanks00:56
mnaseri backported the master fix00:57
clarkbI don't appear to have approval rights on that maybe tonyb can help us00:57
ianwme either .. #openstack-qa?01:00
ianwoh, the call is out01:00
pabelangerokay, reboot seems to help, reprepro is now reporting db corruption. I'm going to copy good database from RO volume, into RW, then run reprepro again01:01
pabelangershould then add missing debs into the database01:01
clarkbtonyb: stable reviews for that01:01
clarkbtonyb: to make grenade happy with stsetr01:01
clarkb(I can't spell)01:01
fungiwho needs spilling?01:02
tonybGah no dice sorry01:02
fungigerrit queue has doubled to 14101:03
mnaseroh gerrit is fully dead01:04
mnaser"Reason: Error reading from remote server"01:04
tonybLooks like grenade is limited,members +,members01:04
ianwpabelanger: ok, i hope it's not necessary but we expanded after the last issues01:04
ianwthat's worryingly small for a project that runs so much01:06
pabelangerianw: yah, I'll fallback to that01:06
openstackgerritIan Wienand proposed openstack-infra/system-config master: Add linaro cloud to nodepool clouds.yaml
*** yamamoto has joined #openstack-infra01:10
funginot seeing any significant spikes in cacti graphs for review.o.o01:13
funginot heise-enough because mnaser reported it going out to lunch at 01:0401:14
fungiso cacti does show a relatively anomalous write spike to xvdb starting up right at 01:0001:18
fungisame for 24 hours ago01:18
clarkbis it bup maybe?01:19
fungixvdb is the pv for the main vg01:19
clarkbwhich hosts the git repos iirc01:19
clarkbsomeone doing clones maybe?01:19
fungiso something writing to /home/gerrit201:19
fungiit's write activity, not read01:20
fungiaccording to cacti01:20
clarkbpushes then? or logging?01:20
fungiinstead of ianw's iostat suggestion (which just told us what block devices were seeing activity) i should have had iotop running instead01:25
fungiwhich i do now, but it's probably too late to be useful01:26
fungibut perhaps tomorrow01:26
fungicurrently the bulk of the write activity is the gerrit jvm, it looks like01:27
clarkbcould it be the garbage collecting and packing?01:28
fungigerrit does that now, right? we don't cron it any longer01:28
clarkbya its the jvm doing it now iirc01:28
fungithough there is also this:01:28
fungino, wait, i was reading cron fields backwards01:29
fungiso yeah, very well may be what you're saying01:29
fungiwe don't seem to have any cronjobs around that timeframe at any rate01:29
pabelanger generating universe/Contents-amd64...01:30
pabelangerhopefully not much longer to finish01:31
pabelangerthen I'll manually vos release01:31
fungiclarkb: threadcount didn't really spike by more than maybe 20% at 01:0001:33
fungi(if even that much)01:33
clarkbseveral hundred percent according to melody01:34
fungii must be looking at a different graph01:35
fungithe "threads count" graph says it jumped from around 425 threads to 500 at 01:0001:35
clarkbsorry active threads01:36
fungioh, active threads01:36
clarkband it spiked here too01:36
fungiyes, though the bigger spike there was apparently closer to 01:2001:36
fungispiking up to around 55-ish01:37
*** slaweq has quit IRC01:39
*** salv-orlando has joined #openstack-infra01:41
openstackgerritIan Wienand proposed openstack-infra/system-config master: Add linaro cloud to nodepool clouds.yaml
fungiio ops to /home/gerrit2 are still pretty heavy and iotop says it's mostly the jvm01:45
fungiso daily git repacking is seeming a likely explanation01:45
*** salv-orlando has quit IRC01:46
*** ykarel|afk has joined #openstack-infra01:52
*** lbragstad has joined #openstack-infra01:56
ianw| ubuntu-xenial-arm64-0000000003 | ubuntu-xenial-arm64 | nb03    | qcow2     | building | 00:00:04:22 |01:58
*** kien-ha has joined #openstack-infra01:58
persiaThat looks promising :)01:58
ianwit's a start.  it's going to take a while to cache everything in02:02
*** annp has joined #openstack-infra02:02
*** olaph1 is now known as olaph02:12
*** yamamoto has joined #openstack-infra02:12
pabelangerokay, DB recreated, just checking pools now02:15
*** dklyle has joined #openstack-infra02:17
*** yamamoto has quit IRC02:18
openstackgerritMerged openstack-infra/project-config master: Reduce duplicate data in nodepool configuration
*** inc0 has joined #openstack-infra02:21
pabelangerreprepro successful02:22
pabelangerI'm going to vos release now02:22
fungiwoo! thanks for sticking with it02:22
pabelangerthe reboot did help, no AFS issues that time02:22
openstackgerritMerged openstack-infra/project-config master: Bring online opensuse-tumbleweed images
pabelangerReleased volume mirror.ubuntu successfully02:29
pabelanger#status log mirror.ubuntu reprepro has been repaired and back online02:30
openstackstatuspabelanger: finished logging02:30
pabelangermnaser: ^02:30
*** mriedem has quit IRC02:32
*** zhenguo has joined #openstack-infra02:32
melwittdoes anyone know if reviewday is still supported at ? I notice it says "Page refreshed at 2018-02-02"02:37
*** dhill__ has quit IRC02:40
clarkbmelwitt: it should be, its likely just broken for one reason or another02:41
melwittcool, thanks02:42
pabelanger| 0002664319 | inap-mtl01 | opensuse-tumbleweed | d7390c45-35c9-4c30-bb1e-4dc634f2a3d0 |  |                                        | ready    | 00:00:00:11 | unlocked |02:47
pabelangerclarkb: dirk: ^02:47
mnaserpabelanger: awesome thank you02:51
mnaser#thanks pabelanger and infra for getting ubuntu mirrors repaired and backup quickly!02:52
openstackstatusmnaser: Added your thanks to Thanks page (
openstackgerritPaul Belanger proposed openstack-infra/openstack-zuul-jobs master: Add opensuse-tumbleweed testing to ozj
openstackgerritPaul Belanger proposed openstack-infra/openstack-zuul-jobs master: Add opensuse-tumbleweed testing to ozj
*** wolverineav has quit IRC03:00
openstackgerritPaul Belanger proposed openstack-infra/openstack-zuul-jobs master: DNM
openstackgerritPaul Belanger proposed openstack-infra/openstack-zuul-jobs master: Add fedora-27 testing to ozj
openstackgerritPaul Belanger proposed openstack-infra/openstack-zuul-jobs master: DNM
*** yamamoto has joined #openstack-infra03:14
openstackgerritMatt Riedemann proposed openstack-infra/openstack-zuul-jobs master: Don't run neutron-grenade job on stable/ocata changes
*** yamamoto has quit IRC03:20
*** mriedem has joined #openstack-infra03:23
mriedemzuulv3 question: the neutron-grenade-multinode job is defined in neutron, but not in the stable/ocata branch for neutron. nova has the neutron-grenade-multinode job in its check queue via project-config,03:25
mriedemwe need to not run the grenade job on ocata changes since newton is eol,03:25
mriedembut i'm not sure if i need to add a branch restriction to project-config, or neutron's job def (on master?), or other03:25
mriedemneutron-grenade is defined in openstack-zuul-jobs and is part of the integrated-gate template, and is defined to not run on stable/newton (and i have a patch up to make it stop running it against ocata changes too)03:26
mriedemso a bit confused about where the branch restriction should live03:26
mriedemactually i guess it's also defined in neutron stable/ocata so that probably needs to be deleted,03:28
mriedemand nova's re-def of it in project-config also needs to go03:28
mriedemtonyb: ^ you might care since ocata is blocked atm03:28
tonybmriedem: Thanks.   I'd like to know how to avoid making this same mistake (if?)when ocata goes EOL03:30
mriedemwe've always had to just update branch restrictions for running grenade after a branch goes eol,03:31
tonybmriedem: but I have no idea.  I kinda thought it should be modified in openstack-zuul-jobs on the ocata branch (or as you say that should be deleted) but I'm just confused03:31
mriedembut now it's spread out more since the jobs can be defined in one place, and then re-defined (sort of) still in project-config per project03:31
mriedemi'm gonna remove the job def from neutron in ocata03:31
tonybmriedem: Yeah, this time I'd like to do it right after the branches get deleted but it's a matter of knowing what "it" is03:32
mriedemit is it03:32
mriedemwhat is it03:32
tonybback and forth I sway with the wind ?03:33
*** wolverineav has joined #openstack-infra03:33
mriedemah crap03:34
mriedemthat's master03:34
clarkbthe best place to put the exclusion is on the main job def if it will be global03:37
clarkband I think these grenade exclusions are effectively global right?03:38
*** dave-mccowan has joined #openstack-infra03:38
clarkbthat said I think the best thing here is to eventually get to having the job list on each branch in each project maybe?03:38
clarkbthen you just update the jobs a project wants to run and thats an easy direct exclusion03:39
mriedembut then you have to remove this job from every project that defines it once a branch goes eol, which sucks03:39
mriedemi'm not really sure what happens if we remove the job def from neutron's stable/ocata branch, will nova still attempt to run it against ocata changes and if so, which job def would it be using?03:40
mriedemthat's where nova says it wants to run that job right now03:41
clarkbthats a good question I think it would be an error without removing the job from other branches that use it. Grenade may be a special case  because we drop it a cycle early03:41
*** ramishra has quit IRC03:41
clarkbso adding exclusion to main job def is probably best for grenade03:42
mriedemdoes that go in the main job def in the neutron stable/ocata branch though?03:42
*** xarses has joined #openstack-infra03:43
mriedemb/c that seems bonkers - the job def in stable/ocata would have to say, don't run this in stable/ocata03:43
*** salv-orlando has joined #openstack-infra03:43
clarkbya thats a good point this may be a corner case we have to figure out03:43
*** andreww has joined #openstack-infra03:43
mriedemok so for now, i'm going to push a project-config patch to say don't run that job on stable/ocata changes03:43
*** andreww has quit IRC03:44
*** wolverineav has quit IRC03:44
*** andreww has joined #openstack-infra03:44
mriedemthere is still a newton branch for devstack, not sure why03:45
mriedemor if that works03:45
mriedemtonyb: ?03:45
*** xarses has quit IRC03:47
*** cuongnv has quit IRC03:47
*** salv-orlando has quit IRC03:49
openstackgerritMatt Riedemann proposed openstack-infra/project-config master: Don't run neutron-grenade-multinode on newton or ocata changes
mriedemok ^ says don't run that job on projects that have stable branches (excluded tempest since it's branchless)03:51
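[editor's note] The branch restriction mriedem is describing can be pictured as a regular expression that a change's target branch must match before the job is run. The sketch below assumes a regex-based matcher of that kind; the pattern and the name job_runs_on are illustrative only and are not copied from the project-config patch or from Zuul's code.

```python
import re

# Hypothetical pattern: match every branch except stable/newton and
# stable/ocata, using a negative lookahead anchored at the start.
BRANCH_PATTERN = re.compile(r"^(?!stable/(newton|ocata)).*$")


def job_runs_on(branch, pattern=BRANCH_PATTERN):
    """Return True if a change targeting `branch` should run the job."""
    return pattern.match(branch) is not None


if __name__ == "__main__":
    for name in ("master", "stable/pike", "stable/ocata", "stable/newton"):
        print(name, job_runs_on(name))
```

Keeping the exclusion in one central pattern is what makes the end-of-life update a one-line change, which is the trade-off clarkb and mriedem are weighing against per-branch job lists.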
*** wolverineav has joined #openstack-infra03:51
tonybmriedem: because $some projects are still using it.03:54
tonybmriedem: I'll look into which ones but I suspect we can drop it and clean up the mess later03:55
mriedemand ocata is marked for eol next week :)03:56
mriedemalright i'll check things out in the morning, gotta drop03:56
tonybmriedem: Yeah but we have a session at the PTG to work out if that's the reality03:57
*** wolverineav has quit IRC03:57
tonybI suspect it'll get another 3-6 months but that depends who shows up03:58
*** rlandy has joined #openstack-infra03:59
*** rlandy is now known as rlandy|rover03:59
*** slaweq has joined #openstack-infra03:59
*** rlandy|rover has quit IRC04:01
*** slaweq has quit IRC04:04
*** udesale_ has joined #openstack-infra04:07
*** dave-mccowan has quit IRC04:09
*** wolverineav has joined #openstack-infra04:17
*** wolverineav has quit IRC04:22
*** psachin has joined #openstack-infra04:34
*** zhenguo has quit IRC04:42
*** olaph has quit IRC04:42
*** salv-orlando has joined #openstack-infra04:45
*** salv-orlando has quit IRC04:49
*** ykarel|afk is now known as ykarel04:54
*** VW has quit IRC04:55
*** VW has joined #openstack-infra04:55
*** VW has quit IRC05:00
*** hongbin has quit IRC05:02
*** rosmaita has quit IRC05:06
*** wolverineav has joined #openstack-infra05:21
*** xarses has joined #openstack-infra05:24
*** slaweq has joined #openstack-infra05:25
ianwpabelanger: 485748 ... i just ... dunno.  which is why i guess this sat in review, because it's not *obviously* correct and so nobody wants to break odd things ...  hopefully based on that there's a convincing argument for why this and not
*** andreww has quit IRC05:27
*** janki has joined #openstack-infra05:27
*** slaweq has quit IRC05:30
openstackgerritTristan Cacqueray proposed openstack-infra/nodepool master: Add /label-list to the webapp
openstackgerritJens Harbott (frickler) proposed openstack-infra/devstack-gate master: Add neutron-tempest-plugin-api job
*** claudiub has joined #openstack-infra05:47
*** aeng has quit IRC05:49
*** salv-orlando has quit IRC05:50
openstackgerritTristan Cacqueray proposed openstack-infra/zuul-jobs master: bindep: use shell instead of command with executable
*** jogo has quit IRC06:03
openstackgerritOpenStack Proposal Bot proposed openstack-infra/project-config master: Normalize projects.yaml
*** tdasilva has quit IRC06:10
*** salv-orlando has joined #openstack-infra06:19
*** slaweq has joined #openstack-infra06:27
openstackgerritTristan Cacqueray proposed openstack-infra/zuul-jobs master: bindep: use shell instead of command with executable
dirkpabelanger: nice! I'm creating some test jobs today06:31
*** slaweq has quit IRC06:31
openstackgerritTristan Cacqueray proposed openstack-infra/zuul-jobs master: bindep: use shell instead of command with executable
*** salv-orlando has quit IRC06:51
*** salv-orlando has joined #openstack-infra06:51
*** salv-orlando has quit IRC06:56
openstackgerritTristan Cacqueray proposed openstack-infra/zuul-jobs master: ara: check for return code instead of succeed
*** bhujay has quit IRC06:57
*** bhujay has joined #openstack-infra07:07
openstackgerritAndreas Jaeger proposed openstack-infra/zuul-jobs master: Add abstract attribute to base jobs
*** hashar has joined #openstack-infra07:16
AJaegerdirk: see also , both tumbleweed tests fail07:17
openstackgerritMerged openstack-infra/project-config master: Normalize projects.yaml
openstackgerritAndreas Jaeger proposed openstack-infra/zuul-jobs master: Replace with ini lookup
*** snapiri1 has joined #openstack-infra07:28
openstackgerritAndreas Jaeger proposed openstack-infra/zuul-jobs master: Replace with ini lookup
*** andreas_s has joined #openstack-infra07:33
ameedaMorning :) , should I reverse os-xenapi version from 0.3.1 to 0.1.0 as this bug mentioned " "07:34
*** ykarel is now known as ykarel|lunch07:34
*** snapiri1 is now known as snapiri-07:35
*** pcaruana has joined #openstack-infra07:37
*** slaweq has joined #openstack-infra07:45
*** slaweq_ has joined #openstack-infra07:47
AJaegermordred: I fixed your change - could you double check it, please?07:48
*** armaan has quit IRC07:48
AJaegerameeda: there's an #openstack-nova channel for nova discussion and questions.07:49
*** slaweq has joined #openstack-infra07:49
ameedaAJaeger: Thanks for reply :)07:50
*** salv-orlando has joined #openstack-infra07:52
*** slaweq_ has quit IRC07:52
*** Miouge has joined #openstack-infra07:54
*** kjackal has quit IRC07:56
*** salv-orlando has quit IRC07:56
*** slaweq_ has joined #openstack-infra08:00
*** slaweq_ has quit IRC08:04
*** threestrands_ has joined #openstack-infra08:15
*** liusheng has quit IRC08:17
*** liusheng has joined #openstack-infra08:18
*** ykarel|lunch is now known as ykarel08:28
*** slaweq_ has joined #openstack-infra08:28
andreaftobiash corvus mordred ianw AJaeger some more improvements on stage-output
*** electrofelix has joined #openstack-infra08:40
*** florianf has joined #openstack-infra08:41
*** pblaho has joined #openstack-infra08:44
*** slaweq_ has quit IRC08:45
*** jpena|off is now known as jpena08:50
*** kjackal has quit IRC08:51
*** kjackal has joined #openstack-infra08:52
*** salv-orlando has joined #openstack-infra08:52
*** salv-orlando has quit IRC08:57
*** salv-orlando has joined #openstack-infra08:57
*** alexchadin has quit IRC09:03
*** oidgar has joined #openstack-infra09:04
slaweqhi all09:12
slaweqon I see error "status.json: Not Found" and no jobs are displayed09:13
slaweqis it some already known issue or should I search on my side?09:13
*** amoralej|off is now known as amoralej09:13
slaweqAJaeger: thx, that helped :)09:15
*** dtantsur|afk is now known as dtantsur09:17
*** mgoddard_ has joined #openstack-infra09:31
*** kjackal has quit IRC09:32
*** kjackal has joined #openstack-infra09:32
*** jaosorior has joined #openstack-infra09:39
openstackgerritHoang Trung Hieu proposed openstack-infra/system-config master: Fix dead link
openstackgerritJean-Philippe Evrard proposed openstack-infra/project-config master: Add OSA os_panko repo base jobs
openstackgerritJean-Philippe Evrard proposed openstack-infra/project-config master: Add OSA nspawn host/container repo base jobs
AJaegerevrardjp: note we still have not recovered from your broken imports, hope corvus can help later today. We might need a Zuul restart.10:05
evrardjpResubmitting -- I guess I could :)10:05
evrardjpOh darn I thought it was done during the night, silently!10:05
evrardjpduring my night at least :)10:05
evrardjpI am sorry for that then10:06
AJaegerevrardjp: I rechecked one change this morning and it was still failing.10:06
evrardjpoh I haven't seen the rechecks, that's why I rebased.10:06
evrardjpProbably need moar coffee then.10:06
AJaegerand I didn't see any mention in IRC that it's fixed either10:06
AJaegerevrardjp: yeah, drink some coffee - and ask the US based team later today again, please.10:07
*** jpich has quit IRC10:07
evrardjpyup will do.10:07
*** cuongnv has quit IRC10:08
evrardjpsorry, it wasn't my intent to offend .10:08
*** links has quit IRC10:09
*** links has joined #openstack-infra10:10
AJaegerevrardjp: no offense taken or understood - I just wanted to explain since it looked like you weren't away.10:10
* AJaeger needs better typing skills ;(10:11
*** olaph1 has joined #openstack-infra10:16
*** lpetrut has joined #openstack-infra10:17
*** olaph has quit IRC10:17
*** rossella_s has quit IRC10:20
*** tosky has joined #openstack-infra10:22
*** annp has quit IRC10:23
*** kjackal has quit IRC10:26
*** kjackal has joined #openstack-infra10:27
danpawlikcoreycb, zigo, beisner: Hi, I have small question to you. Is possible to create Octavia packages for Ubuntu? I create a storyboard for it!/story/200156610:30
zigodanpawlik: Sure it's possible, do you have the needed skills?10:31
zigodanpawlik: I can sponsor the upload in Debian if you want, then it will migrate to Ubuntu.10:31
danpawlikI guess I don't have such skills :(10:32
*** zoli is now known as zoli|lunch10:33
*** zoli|lunch is now known as zoli10:33
*** alexchadin has joined #openstack-infra10:34
danpawlikzigo: maybe I will try do it tomorrow10:34
zigodanpawlik: My advice would be to just look how other packages are made.10:35
zigodanpawlik: For Queens, everything is uploaded in Experimental until the final releases, so just use Sid + Experimental.10:35
zigodanpawlik: FYI, at the moment, I'm almost finished with Queens, and I'm uploading services in RC1 versions.10:36
*** pcrews has joined #openstack-infra10:36
zigo(packages are already prepared, doing the final uploads ...)10:36
*** spiffxp has joined #openstack-infra10:36
danpawlikzigo: nice :)10:37
danpawlikzigo: so I try to do it tomorrow base on some other queens package10:37
zigodanpawlik: So I use the Debian BTS, and as a Git repository.10:39
danpawlikah, ok10:39
zigodanpawlik: I can add you in the team if you create an account there.10:40
zigoGive me your account, and I'll add you there.10:40
zigoWe're using a Git tag packaging workflow, I'm not sure if you know how this work ...10:40
zigodanpawlik: How many source package will you need?10:41
danpawlikzigo: user @daniel.pawlik-guest10:42
danpawlikzigo: I know the workflow on RDO10:42
danpawlikbut here not10:43
danpawlikzigo: what do you mean "How many source package will you need?"10:43
danpawlikzigo: I try to create debianize for Octavia10:43
danpawlikOpenstack Octavia service10:43
zigodanpawlik: So, just one?10:43
*** panda|off is now known as panda|sick10:44
*** kjackal has joined #openstack-infra10:44
zigoThere's already python-octaviaclient that I did.10:44
danpawlikzigo: but its just a client :P10:44
danpawlikjust one10:44
zigoI'm creating the repo for you.10:44
danpawlikzigo: thanks10:45
*** dhajare_ is now known as dhajare10:46
zigodanpawlik: You can also file the ITP bug. Do you know how to do that?10:48
danpawlikzigo: maybe come on priv10:48
*** panda has joined #openstack-infra10:57
*** purp_too has joined #openstack-infra10:57
*** scarpino has joined #openstack-infra10:58
*** Zara has joined #openstack-infra10:59
*** scarpino is now known as ilpianista10:59
*** olaph1 has quit IRC11:00
*** links has joined #openstack-infra11:00
*** EmilienM has joined #openstack-infra11:00
*** salv-orl_ has joined #openstack-infra11:00
*** kambiz has quit IRC11:01
*** purp_too has quit IRC11:01
*** salv-orlando has quit IRC11:03
*** namnh has quit IRC11:03
*** yamamoto has joined #openstack-infra11:06
*** pcaruana has quit IRC11:07
*** yamamoto has quit IRC11:13
openstackgerritSimon Westphahl proposed openstack-infra/zuul master: Allow using remote refs to find commits for change
*** panda is now known as panda|sick11:17
openstackgerritSimon Westphahl proposed openstack-infra/zuul master: Allow using remote refs to find commits for change
*** Zara_ has joined #openstack-infra11:25
*** kaisers1 has joined #openstack-infra11:25
odyssey4meMorning all - apologies for the drama yesterday with the various new repo imports. I've learned some things which will hopefully make it a smoother process next time.11:31
odyssey4meAJaeger evrardjp I see that we're still not quite done as the base jobs aren't yet importing. I guess we're waiting for the US crew to wake to determine next steps?11:32
*** efried is now known as efried_rollin11:32
*** Faster-Fanboi_ has joined #openstack-infra11:32
*** olaph has quit IRC11:34
*** Zara has quit IRC11:34
*** kaisers has quit IRC11:34
*** mattoliverau has quit IRC11:34
*** rbergeron has quit IRC11:34
*** Qiming has quit IRC11:34
*** bradm has quit IRC11:34
*** Faster-Fanboi has quit IRC11:34
*** ldnunes has joined #openstack-infra11:34
*** rbergeron has joined #openstack-infra11:35
*** aviau has quit IRC11:39
*** aviau has joined #openstack-infra11:39
*** Zara_ is now known as Zara11:44
*** udesale_ has quit IRC11:50
*** armaan has joined #openstack-infra11:57
*** jpena is now known as jpena|lunch11:57
*** snapiri1 has quit IRC11:58
*** danpawlik has quit IRC12:00
*** danpawlik has joined #openstack-infra12:05
*** yamamoto has joined #openstack-infra12:09
*** pcaruana has joined #openstack-infra12:09
*** thorre_se has joined #openstack-infra12:09
*** thorre has quit IRC12:13
*** thorre_se is now known as thorre12:13
*** yamamoto has quit IRC12:14
openstackgerritHoang Trung Hieu proposed openstack-infra/zuul-jobs master: Update and replace http with https for doc links
*** dave-mccowan has joined #openstack-infra12:27
*** janki has quit IRC12:27
*** yamamoto has joined #openstack-infra12:27
*** links has quit IRC12:29
evrardjpodyssey4me: yes12:30
*** pcichy has quit IRC12:32
*** pcichy has joined #openstack-infra12:48
*** zul has quit IRC12:51
*** wxy has joined #openstack-infra12:52
*** salv-orl_ has quit IRC13:10
*** salv-orlando has joined #openstack-infra13:11
*** amoralej is now known as amoralej|lunch13:13
*** ying_zuo has quit IRC13:16
coreycbdanpawlik: happy to help you to get octavia into ubuntu as well. it would need to get into ubuntu bionic by march 1st for queens.13:16
*** hamzy_ is now known as hamzy13:16
*** ying_zuo has joined #openstack-infra13:17
*** olaph1 has joined #openstack-infra13:17
openstackgerritOmer Anson proposed openstack-infra/project-config master: Dragonflow: Add requirement for neutron-dynamic-routing in tests
openstackgerritSimon Westphahl proposed openstack-infra/zuul master: Allow using remote refs to find commits for change
*** jpena|lunch is now known as jpena13:27
*** zhipeng has joined #openstack-infra13:28
*** vivsoni has quit IRC13:28
*** vivsoni has joined #openstack-infra13:28
*** ralonsoh_ has joined #openstack-infra13:32
*** udesale has joined #openstack-infra13:32
*** yamamoto has quit IRC13:34
*** ykarel is now known as ykarel|away13:35
*** efried_rollin is now known as efried13:35
*** rlandy has joined #openstack-infra13:36
*** rlandy is now known as rlandy|ruck13:36
fungievrardjp: odyssey4me: it's possible approving another new project will get things back on track, or a manual zuul reconfig13:38
fungii'll see if i can figure something out in a sec13:38
AJaegergood morning, fungi!13:42
*** mriedem has joined #openstack-infra13:42
openstackgerritSimon Westphahl proposed openstack-infra/zuul master: Allow using remote refs to find commits for change
openstackgerritSimon Westphahl proposed openstack-infra/zuul master: Allow using remote refs to find commits for change
fungiAJaeger: thanks! that saved me looking one up13:44
evrardjpgood morning fungi :)13:44
*** yamamoto has joined #openstack-infra13:44
*** yamamoto has quit IRC13:44
AJaegerfungi: if you do approve it, please review it's sibling as well ;)13:44
openstackgerritSimon Westphahl proposed openstack-infra/zuul master: Allow using remote refs to find commits for change
fungiAJaeger: will do13:46
fungiAJaeger: mmm... do you think this is safe to import?
fungii'm looking now for where that template is defined13:47
AJaegerfungi: let's recheck - it should fail ;)13:48
*** shardy has joined #openstack-infra13:48
fungiyeah, i believe that is yet another broken zuul config13:48
* fungi looks for a new project not importing prewritten zuul configuration13:49
*** pgadiya has quit IRC13:49
*** kgiusti has joined #openstack-infra13:50
*** annp has joined #openstack-infra13:50
AJaeger ?13:50
AJaegerhas no imports ;)13:50
*** VW has joined #openstack-infra13:51
*** briancurtin has quit IRC13:52
AJaegerfungi, 537802 has no zuul config13:52
AJaegermy test did not work on tatu-dashboard - why?13:53
*** arxcruz|ruck is now known as arxcruz|rover13:53
*** eernst has joined #openstack-infra13:55
odyssey4memorning fungi :) apologies for all that drama yesterday... I guess this is how we learn, and how zuul development gets its feedback :)13:55
fungiAJaeger: 537802 has
fungisimilarly broken too i think13:55
fungiodyssey4me: yep!13:55
*** ralonsoh_ is now known as ralonsoh13:55
*** oidgar has quit IRC13:55
*** mwhahaha has joined #openstack-infra13:56
AJaegerfungi: care to +2A simple typo, please?13:57
AJaegerthat should block the two reviews you mentioned now13:57
*** robcresswell has quit IRC13:57
*** zxiiro has joined #openstack-infra13:58
*** amoralej|lunch is now known as amoralej13:58
*** Kevin_Zheng has joined #openstack-infra14:02
*** clayg has quit IRC14:02
*** ihrachys has joined #openstack-infra14:06
openstackgerritMerged openstack-infra/project-config master: Add new project 'osel'
*** Goneri has joined #openstack-infra14:06
openstackgerritMerged openstack-infra/project-config master: Fix typo in tools/
fungionce puppet updates zuul01.o.o for the osel addition we should hopefully be back on track14:06
* fungi needs to step away and do morning things for a few minutes14:07
*** alexchadin has quit IRC14:08
*** lbragstad has joined #openstack-infra14:10
*** zhipeng has quit IRC14:12
*** lathiat has quit IRC14:13
*** zhipeng has joined #openstack-infra14:14
*** derekh has joined #openstack-infra14:15
pabelangerspacex launch now, FYI14:19
pabelangerAJaeger: dirk: we'll also need mirrors for opensuse-tumbleweed14:20
pabelangerthat's likely why the job is failing14:20
dirkpabelanger: ehm, sorry, which job is failing?14:20
dirkgood morning btw14:20
pabelangerdirk: see
pabelangerzypper looks to be configured wrong14:22
*** myoung|afk is now known as myoung14:23
*** vabada has quit IRC14:27
*** agopi has joined #openstack-infra14:27
*** vabada has joined #openstack-infra14:27
openstackgerritDavid Shrewsbury proposed openstack-infra/nodepool master: Add additional builder debug logging
*** baoli has quit IRC14:28
*** baoli has joined #openstack-infra14:29
*** dhill__ has joined #openstack-infra14:29
pabelangermnaser: fungi: ubuntu mirror looks good this morning too, we've released with no issues overnight. And AFS errors (lost connection to cell) are all but gone today.  We haven't lost connection so far14:30
*** pblaho has quit IRC14:30
mnaserpabelanger: sweet. I think my jobs went through without a hitch too14:31
*** esberglu has joined #openstack-infra14:31
pabelangermnaser: great14:32
*** olaph has joined #openstack-infra14:35
*** Guest96947 has quit IRC14:36
*** sree_ has joined #openstack-infra14:36
*** sree_ is now known as Guest6481014:37
openstackgerritHoang Trung Hieu proposed openstack-infra/zuul master: WIP: Update and replace http with https for doc links
*** caphrim007_ has quit IRC14:39
*** caphrim007 has joined #openstack-infra14:40
*** Guest64810 has quit IRC14:41
*** pcichy has quit IRC14:42
*** yamamoto has joined #openstack-infra14:45
*** eernst has quit IRC14:45
*** psachin has quit IRC14:49
*** ameeda has quit IRC14:51
*** yamamoto has quit IRC14:51
*** auristor has joined #openstack-infra14:54
*** eernst has joined #openstack-infra14:54
thingeehi infra, just one more +2 on adding a core group for the project sphinx-feature-classification. already has the ptl +1 too14:54
*** e0ne has quit IRC14:57
mnaserthingee: voila14:59
*** wolverineav has joined #openstack-infra15:01
odyssey4mepabelanger fungi yeah, thanks for sorting out the ubuntu mirrors - any idea why the update took so long?15:02
fungiodyssey4me: there had apparently been some disconnect with the afs backend db, and the resulting recovery was causing writes to go slowly. couple that with a largeish set of security updates for ubuntu and a mirror sync ran longer than the allotted time for its kerberos ticket so got aborted leaving a lockfile behind15:04
odyssey4meAJaeger are we good to go to recheck & now?15:04
*** annp has quit IRC15:05
pabelangerYup, when the database becomes corrupt in reprepro, we need to reindex everything again from AFS. And if info isn't in cache, we depend on network bandwidth to fetch new file info15:05
thingeemnaser: thank you15:05
fungithen cleaning up the lockfile and rerunning manually we noticed the slowness, realized the situation with the cache repair wasn't progressing as expected, rebooted the server which seemed to get the cache sync back underway...15:05
odyssey4melooks like the previously mentioned tests added are now working to validate a repository contents before allowing a merge15:05
*** baoli has quit IRC15:05
*** wolverineav has quit IRC15:05
odyssey4mefungi ah, makes sense - thanks all for following that through15:06
*** baoli has joined #openstack-infra15:06
fungiit was mostly pabelanger15:06
fungiodyssey4me: AJaeger: yeah, job working as designed!
fungithanks for adding that test, AJaeger!15:07
pabelangeras for 546573 and 546294, I think we need a full reload of the zuul configuration. I was holding off from doing that until corvus was online, in case he wanted to debug why it still fails15:07
fungipabelanger: i approved another project addition a little while ago15:08
fungipabelanger: hoping to see if that triggers a reconfig when puppet kicks it again15:08
openstackgerritGraham Hayes proposed openstack-infra/project-config master: Add certbot-dns-openstack repo
fungipabelanger: for the record, that was
AJaegerfungi: great!15:09
*** tpsilva has joined #openstack-infra15:09
odyssey4meok, good - looks like the plan right now is to simply block *any* pre-implemented zuul config from being imported - which works... that's certainly easier than any other options I can think of15:10
openstackgerritAndreas Jaeger proposed openstack-infra/project-config master: Add zuul config for new project 'osel'
openstackgerritGraham Hayes proposed openstack-infra/project-config master: Add zuul entry for certbot-dns-openstack
pabelangerodyssey4me: there's also been some discussion in #zuul about maybe not loading a project in zuul, if the configuration is bad too. Will be a topic at PTG I believe.15:11
AJaegerodyssey4me: yes, big axe - let's import these files via a normal review step.15:11
odyssey4meI support this course of action.15:16
fungiaha, i think the big gcc update we saw in ubuntu was the retpoline enablement backport15:16
odyssey4meOf course ideally zuul could protect itself without requiring an external test, but I can understand that's hard to do and certainly the axe works as a stop gap to buy more time.15:17
*** udesale has quit IRC15:19
*** mriedem has joined #openstack-infra15:20
*** links has quit IRC15:20
pabelangerdmsimard: are you aware of an issue with fedora-26 jobs in ozj?
pabelangerTimeout (32s) waiting for privilege escalation prompt:15:22
pabelangerwhen reloading iptables15:22
dmsimardpabelanger: yes, I have held nodes to troubleshoot it15:22
dmsimardpabelanger: Haven't gotten to the bottom of it yet15:22
AJaegerfungi, we're green again with the new repos!15:22
dmsimardpabelanger: If it's blocking something, we can put it non-voting15:23
AJaegerdmsimard, fungi , pabelanger, could you review to refactor static publish jobs as well as and to mark base jobs as abstract, please?15:23
pabelangerdmsimard: okay, I don't fully understand what the job is doing. Not blocking, wanted to add fedora-27 nodes, because we want to delete fedora-2615:24
dmsimardpabelanger: that particular test is for iptables rules persistence integration testing -- we run the multinode roles from zuul-jobs, flush iptables and then restart it to see if the rules are back15:25
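The persistence check dmsimard describes (record the rules, flush, restart, confirm the rules came back) can be sketched roughly as below. This is only an illustration and not the actual openstack-zuul-jobs test: it assumes the rules are captured as text in the `iptables-save` style, where appended rules start with `-A`, and the helper names `rule_lines` and `missing_after_restart` are hypothetical. It also ignores which table a rule belongs to.

```python
def rule_lines(save_output):
    """Return only the appended rules ("-A ...") from iptables-save style text."""
    return [
        line.strip()
        for line in save_output.splitlines()
        if line.strip().startswith("-A ")
    ]


def missing_after_restart(before, after):
    """List rules present before the flush that did not return after the restart."""
    after_rules = set(rule_lines(after))
    return [rule for rule in rule_lines(before) if rule not in after_rules]
```

An empty result would mean every rule seen before the flush was restored.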
dmsimardpabelanger: I don't have any objection to move forward with f2715:25
*** dtantsur|brb is now known as dtantsur15:26
*** alexchadin has quit IRC15:26
fungiodyssey4me: yep, i believe the plan is that zuul will skip loading repos with invalid configuration rather than reverting to an old layout15:27
openstackgerritPaul Belanger proposed openstack-infra/openstack-zuul-jobs master: Set openstack-infra-multinode-integration-fedora-26 non-voting
pabelangerdmsimard: ^15:27
*** alexchadin has joined #openstack-infra15:27
AJaegerfungi: and that's what happened in this case. The repo was completely ignored - but that also meant, we could not run any tests to merge anything using the gerrit workflow.15:29
*** slaweq has quit IRC15:30
*** slaweq has joined #openstack-infra15:30
*** alexchadin has quit IRC15:31
*** e0ne has quit IRC15:34
*** slaweq has quit IRC15:35
*** e0ne has joined #openstack-infra15:35
fungiyeah, i guess the solution is more nuanced than that, but for discussing in #zuul15:36
*** alexchadin has joined #openstack-infra15:37
*** krtaylor has quit IRC15:38
*** yamahata has joined #openstack-infra15:40
*** zhipeng has quit IRC15:42
openstackgerritPaul Belanger proposed openstack-infra/glean master: Testing
openstackgerritGraham Hayes proposed openstack-infra/project-config master: Add certbot-dns-openstack repo
openstackgerritGraham Hayes proposed openstack-infra/project-config master: Add zuul entry for certbot-dns-openstack
corvuspabelanger: debug why what still fails?15:45
openstackgerritAndrea Frittoli proposed openstack-infra/devstack-gate master: Add stable/pike Tempest bitrot job
*** hongbin has joined #openstack-infra15:46
pabelangercorvus: ya, we force merged changes to remove the bad zuul.d folders, however zuul still complained about project conflict. It looks like we got a full reconfiguration @ Last reconfigured: Thu Feb 22 2018 09:46:54 GMT-0500 (EST). So, I guess we recheck and see if it works15:47
*** yamamoto has joined #openstack-infra15:47
pabelangerwasn't sure if that was the required workflow or some other issue15:47
pabelangerokay, is good15:48
pabelangerso, guess we needed a reload then15:48
corvuspabelanger: yes, zuul only reloads its tenant config file on a full reconfiguration.15:49
pabelangercorvus: ack, thanks15:49
*** david-lyle has joined #openstack-infra15:50
openstackgerritAndrea Frittoli proposed openstack-infra/project-config master: Remove legacy tempest bitrot jobs for pike
*** rossella_s has quit IRC15:51
openstackgerritSean McGinnis proposed openstack-infra/project-config master: Limt jobs on release-tools presentations
openstackgerritAndrea Frittoli proposed openstack-infra/openstack-zuul-jobs master: Cleanup tempest pike legacy jobs.
*** VW has quit IRC15:56
*** VW has joined #openstack-infra15:57
pabelanger20 seconds now15:58
fungipabelanger: yeah, memory utilization is climbing in the past hour too16:00
pabelangerso it is16:01
andreafAJaeger fungi corvus pabelanger tosky tonyb three patches to replace legacy tempest jobs with zuulv3 jobs for pike (on stable and bitrot jobs) now that devstack patches have been backported to pike
pabelangerzxiiro: we wrote zuul-launcher which loaded JJB files at run time, and converted them into ansible-playbooks. Then the launchers run ansible-playbook locally16:02
fungizxiiro: no, we switched from jenkins ssh plugin running jobs to zuul v2's experimental ansible launcher which basically replaced our jenkins masters16:03
fungior what pabelanger also said16:03
zxiiroah ok. so even before that they were basically regular freestyle jobs with shell16:04
fungithe ansible solution in v2 was our replacement/alternative for jenkins, basically16:04
pabelangerzxiiro: yah, you can look at 2.6.0 release of zuul to see how it worked.16:04
pabelanger(not that I recommend running it in production)16:04
pabelangerbut like fungi said, jenkins was gone at that point16:05
openstackgerritSean McGinnis proposed openstack-infra/project-config master: Switch notifications for stable branches to match all
fungiright, we limited our jenkins plugin utilization as much as possible, anticipating a future where we switched to something else and wouldn't want to have to reimplement various jenkins plugins in some other framework16:05
*** rossella_s has joined #openstack-infra16:05
zxiirogood to know. thanks. mostly just curious. I'm exploring ways to make Jenkins launch Ansible. The Ansible Jenkins plugin seems to be the current frontrunner.16:06
openstackgerritMerged openstack-infra/release-tools master: stub in presentation for rocky ptg
openstackgerritMerged openstack-infra/release-tools master: draft of presentation for rocky ptg
fungiwhich made translating our mostly-shell job definitions from jjb to ansible playbooks tractable16:06
VWhey folks - our AUP department is getting a takedown request for this
*** caphrim007 has joined #openstack-infra16:06
VWsupposedly the decompiler offered there is a no no16:07
hughsaundersHey, is there a timescale for a nodepool release post zuulv3 branch becoming master?16:07
pabelangerzxiiro: you might also be interested in which is some ideas on how zuulv3 might integrate with jenkins16:07
pabelangerhughsaunders: already done! we merged feature/zuulv3 into master last month16:07
fungiVW: i can likely delete it from the database, but we also tell search engines not to index that service so i'm curious how it was even found16:07
pabelangerhughsaunders: but, v3.0 release should be coming in the next few weeks16:07
VWyeah - that I don't know, fungi16:08
pabelangerhughsaunders:!/board/53 is last items to finish off16:08
hughsaunderspabelanger: ahh, ok I was going to say 0.5.0 was 9th Jan and merge was after (18th?)16:08
VWI just got hit up because I'm listed as the primary on the account you all use16:08
hughsaundersThanks for storyboard link16:08
fungiVW: oh! our robots.txt isn't being served now for some reason. that might explain it16:08
VWwell, there you go16:08
*** eventingmonkey has joined #openstack-infra16:09
pabelangerhughsaunders: yes, that sounds right. Let me see if I can find the review16:09
fungiVW: thanks! cleaning that up now and also figuring out why robots.txt is missing16:09
VWcool - thanks fungi16:09
VWI'll inform the Brute Squad16:09
hughsaundersyeah thats the one16:10
hughsaundersI just noticed theres quite a large delta between last release and master (due to that merge). Will use a SHA for now while waiting for the v3 tag.16:11
*** ramishra has quit IRC16:11
andreafpabelanger not using delegate_to and doing push/pull was the working solution
*** sshnaidm is now known as sshnaidm|off16:12
pabelangerhughsaunders: yah, there is some breaking changes to have pre feature/zuulv3 nodepool works too. The configuration format will be a breaking change.16:12
hughsaundersnot to mention the disappearance of zmq...16:13
pabelangerandreaf: good to know16:14
*** olaph has quit IRC16:14
*** olaph has joined #openstack-infra16:14
pabelangerhughsaunders: yah, 0.5.0 does support zookeeper for builders, IIRC. But yah, zmq dropped too16:14
*** shoogz has joined #openstack-infra16:15
*** sshnaidm|off has quit IRC16:17
*** rossella_s has quit IRC16:20
*** rossella_s has joined #openstack-infra16:23
pabelangerfor some reason, reloads are taking a little longer now. Which means we are not starting builds as fast:
openstackgerritJavier Peña proposed openstack-infra/system-config master: Move AFS mirror code to puppet-openstackci
smcginnisGerrit UI seems slow too. Network issues?16:26
*** agopi has quit IRC16:29
fungi#status log deleted from lodgeit openstack.pastes table (paste_id=665906) due to provider aup violation/takedown notice16:30
openstackstatusfungi: finished logging16:30
fungiVW: ^16:30
*** yolanda has quit IRC16:30
openstackgerritJeremy Stanley proposed openstack-infra/puppet-httpd master: Allow access to docroot for proxy exclusions
*** yolanda has joined #openstack-infra16:30
fungiinfra-puppet-core: ^ should make our robots.txt work on the paste.o.o site16:31
*** tosky has quit IRC16:31
VWthanks fungi!16:32
fungiVW: thanks for bringing it to our attention! we had intentionally disallowed crawlers on that server for precisely that reason16:32
fungiplease do16:33
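As a rough illustration of why the missing robots.txt mattered, the sketch below uses Python's standard `urllib.robotparser` to show how a well-behaved crawler would treat a site-wide disallow versus an empty rule set. The robots.txt body, the user agent and the URL are made-up examples, not the real paste.o.o configuration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt body that disallows every crawler site-wide.
DISALLOW_ALL = """\
User-agent: *
Disallow: /
"""


def crawler_allowed(robots_body, agent, url):
    """Return True if a crawler honouring robots_body may fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_body.splitlines())
    return parser.can_fetch(agent, url)
```

With `DISALLOW_ALL` the parser refuses every path, while an empty body places no restriction on crawlers, which is roughly what happens when the file stops being served.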
*** armaan has quit IRC16:33
*** armaan has joined #openstack-infra16:33
*** oidgar has quit IRC16:35
*** d0ugal has quit IRC16:35
*** slaweq has joined #openstack-infra16:40
*** d0ugal has joined #openstack-infra16:40
*** david-lyle is now known as dklyle16:42
*** slaweq has quit IRC16:45
openstackgerritFabien Boucher proposed openstack-infra/zuul master: Tenant config can be read from an external script
*** e0ne has quit IRC16:46
*** andreas_s_ has quit IRC16:47
openstackgerritMerged openstack-infra/project-config master: Add new project for Tatu (SSH as a Service) Horizon Plugin.
*** yamamoto has joined #openstack-infra16:48
*** cmurphy has joined #openstack-infra16:48
openstackgerritFabien Boucher proposed openstack-infra/zuul master: Tenant config can be read from an external script
*** eernst has quit IRC16:49
*** oidgar has joined #openstack-infra16:50
*** pblaho has joined #openstack-infra16:51
*** andreas_s has quit IRC16:51
openstackgerritMatthieu Huin proposed openstack-infra/nodepool master: Refactor status functions, add web endpoints, allow params
*** yamamoto has quit IRC16:52
*** eernst has joined #openstack-infra16:54
*** wolverineav has joined #openstack-infra16:55
*** wolverineav has quit IRC16:55
*** ihrachys has quit IRC16:56
dirkpabelanger: is there a way for me to download the qcow2 file (opensuse-tumbleweed) somehow?16:56
*** ihrachys has joined #openstack-infra16:56
dirkpabelanger: I am interested what is in there16:56
*** dklyle has quit IRC16:56
*** rossella_s has quit IRC16:57
pabelangerdirk: nothing public by default, I can expose it manually. Something your DIB doesn't have?16:57
*** rossella_s has joined #openstack-infra16:59
*** VW_ has joined #openstack-infra17:00
*** VW has quit IRC17:00
openstackgerritMatthieu Huin proposed openstack-infra/nodepool master: Refactor status functions, add web endpoints, allow params
openstackgerritMatthieu Huin proposed openstack-infra/nodepool master: Add separate modules for management commands
openstackgerritMatthieu Huin proposed openstack-infra/nodepool master: webapp: add optional admin endpoint
*** david-lyle has joined #openstack-infra17:01
*** links has joined #openstack-infra17:04
fungiZara: good question, i'll find out who you should contact17:05
Zarathanks :)17:07
clarkbgood morning17:07
openstackgerritGraham Hayes proposed openstack-infra/project-config master: Add certbot-dns-openstack repo
*** zoli is now known as zoli|gone17:11
*** zoli|gone is now known as zoli17:11
fungiZara: kendall waters said it shouldn't charge you, but suggested you e-mail and she can help you out17:11
*** armaan has quit IRC17:12
*** armaan has joined #openstack-infra17:13
*** agopi_ has joined #openstack-infra17:18
*** agopi has quit IRC17:20
*** agopi_ has quit IRC17:22
*** baoli has quit IRC17:24
*** baoli has joined #openstack-infra17:24
Zarafungi: great, thanks, will do :)17:27
*** dprince has quit IRC17:28
*** slaweq has joined #openstack-infra17:29
jlvillalHey clarkb. I did the puppet-gerritbot change to install via git. It is passing Zuul. But I'm mostly cargo-culting my changes. So a review from someone who knows puppet is appreciated :)
openstackgerritMerged openstack-infra/nodepool master: Refactor playbooks/nodepool-zuul-functional/pre.yaml
mnaserclarkb: jlvillal ok if i review from a puppet pov? :)17:34
jlvillalmnaser, Yes please :)17:35
clarkbmnaser: yes!17:35
fungimnaser: speaking of puppet module changes, is semi-urgent17:36
clarkbjlvillal: left some thoughts but you may want to wait for mnaser to comment too17:36
jlvillalclarkb, thanks17:37
*** olaph1 has joined #openstack-infra17:38
*** olaph has quit IRC17:39
mnaserjlvillal: left a few comments but it's mostly ok! :)17:40
AJaegerconfig-core, could you review a refactor of our static publish jobs, please?
jlvillalmnaser, Thanks!17:40
AJaegermnaser: could you review these initial jobs for repos:
AJaeger, please?17:43
mnaserAJaeger: sure, ill have a look and fungi it looks good to me but cant +A :)17:44
fungimnaser: no problem, just appreciate the look!17:44
*** agopi has joined #openstack-infra17:44
*** olaph1 is now known as olaph17:45
mnaserAJaeger: all done17:45
*** lpetrut has quit IRC17:46
mnaser is an easy one for any config-core :)17:46
AJaegermnaser: time for a job removal as well, please?
AJaegermnaser: will check...17:47
fungiinfra-puppet-core reviewers: i'm looking for a second +2/approval on 547062 to fix serving robots.txt on paste.o.o so spam/scams/illegal content indexed there stop getting picked up by search engines17:47
*** slaweq has quit IRC17:48
*** yamamoto has joined #openstack-infra17:48
clarkbfungi: oh interesting this is if it uses the built in template rather than providing one of its own (which we typically do instead)17:50
clarkbfungi: I've approved it17:50
fungithanks clarkb17:51
fungiand yeah, the built-in one was basically broken (at least for apache 2.4 on xenial, may have been working-ish on 2.2 without explicit access allowances)17:51
*** jpena is now known as jpena|off17:52
*** armaan has joined #openstack-infra17:52
openstackgerritJohn L. Villalovos proposed openstack-infra/puppet-gerritbot master: Change gerritbot to install from git
fungimy guess is it worked on an implicit lack of acl until we upgraded paste.o.o to trusty17:52
fungier, to xenial17:52
fungiat which point we stopped serving robots.txt and it reverted to being a cesspool of cut-n-paste links to illicit sites, eventually drawing complaints to our provider17:54
*** yamamoto has quit IRC17:54
*** dprince has joined #openstack-infra17:55
*** mgoddard_ has quit IRC17:55
*** efoley has joined #openstack-infra17:55
fungihuh... kde started using phabricator for hosting/reviewing source code?17:56
*** derekh has quit IRC18:00
mnaserjlvillal: any reason behind the decision to install deps then gerritbot after?18:02
jlvillalmnaser, cargo-cult from puppet-zuul18:02
openstackgerritMerged openstack-infra/project-config master: Add OSA os_panko repo base jobs
fungimnaser: we want to upgrade gerritbot but not unconditionally upgrade its deps if what is already installed is satisfactory18:03
mnaserjlvillal: i hate thinking of a suggestion after reviewing but i think this is really useful for you -
mnaserstatusbot deploys from master afaik18:03
fungithis will get simpler with pip 10 since --upgrade-strategy=only-if-needed becomes the new default18:04
jlvillalmnaser, looking...18:04
AJaegerany config-core around to review to simplify our static publish jobs , please?18:04
jlvillalmnaser, What am I looking for?18:04
mnaserjlvillal: drawing general inspiration for future maybe :P but it's useful code to look at in comparison, pretty much same idea18:05
mnaserthe change looks fine to me if fungi mentioned the whole pip install thing is ok18:05
jlvillalmnaser, Okay. Thanks!18:05
fungiwell, mainly just explaining why it's done in such a roundabout way in some of our modules right now18:05
fungi"okay" is not the term i'd use, but it's a necessary evil in at least some situations18:06
fungiparticularly if you want to mix distro-packaged python modules and pypi18:06
clarkbAJaeger: part of me wonders if we want to do it as it may become easier to sneak a change that dumps the secret through since its not immediately obvious that the secret is being used in the child jobs18:07
clarkbbut that could be me being overly paranoid of potential situations18:07
* jlvillal remembers only the paranoid survive :)18:07
fungias pip historically (and still for a little while longer) will download newer versions of your deps from pypi when you specify --upgrade/-U even if you have locally preinstalled sufficient versions18:07
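The difference between the two upgrade strategies fungi describes can be modelled with a small sketch. This is a deliberately simplified model and not pip's resolver: versions are plain tuples, the function name `packages_to_upgrade` and its arguments are hypothetical, and only one level of dependencies is considered. The strategy names match pip's `--upgrade-strategy` values, `eager` and `only-if-needed`.

```python
def packages_to_upgrade(target, dep_minimums, installed, latest, strategy):
    """Return the packages an upgrade of `target` would reinstall.

    dep_minimums: {dependency: minimum version required by target}
    installed:    {package: currently installed version}
    latest:       {package: newest version on the index}
    """
    if strategy not in ("eager", "only-if-needed"):
        raise ValueError("unknown upgrade strategy: %s" % strategy)

    chosen = []
    # The requested package itself is upgraded whenever a newer one exists.
    if installed.get(target) != latest[target]:
        chosen.append(target)

    for dep, minimum in sorted(dep_minimums.items()):
        have = installed.get(dep)
        if have is None or have < minimum:
            # Both strategies must install a missing or too-old dependency.
            chosen.append(dep)
        elif strategy == "eager" and have < latest[dep]:
            # Only "eager" replaces a dependency that is already sufficient.
            chosen.append(dep)
    return chosen
```

In this model a distro-installed dependency that already satisfies the minimum is left alone under `only-if-needed` but replaced under `eager`, which is the conflict with distro packages discussed above.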
*** pcichy has joined #openstack-infra18:08
clarkbspeaking of pip 10, are they still planning to break the world by not uninstalling things globally?18:08
funginot sure18:08
clarkbI should resurrect my change to test it18:08
AJaegerclarkb: according to the docs, the child has no access to it ;)18:09
clarkbfungi: new patchset that undoes some of my earlier workarounds so that we can test more vanilla pip 10 behavior18:11
fungiclarkb: thanks!18:11
*** pcichy has quit IRC18:12
clarkbfungi: in our case pyyaml and python-psutil were breaking in devstack18:12
*** pcichy has joined #openstack-infra18:12
clarkbfungi: because we install pyyaml in devstack-gate for features.yaml support from the distro and not pypi and python-psutil is part of devstack's debs and rpms lists but then python packages we pip install dep on it too and try to upgrade it18:13
clarkbfungi: we should know soon enough how broken it is based on 513825's results18:13
openstackgerritMerged openstack-infra/project-config master: Switch notifications for stable branches to match all
*** oidgar has quit IRC18:14
clarkbanyone else want to review 545234 for AJaeger before I approve it?18:14
fungiclarkb: given that the default upgrade strategy is now only-if-needed in 10, that may no longer happen18:15
clarkbfungi: ya though it was broken on pip 10 when I first tested it. Do you know if that is a recent pip 10 change?18:15
clarkboh actually I bet its constraints burning us there18:15
AJaegeryeah, we can get rid of jenkins/data now - infra-root, one bindep change: and one for config-core on openstack-zuul-jobs: - please review18:15
*** slaweq has joined #openstack-infra18:15
*** gfidente has quit IRC18:16
AJaegeronce those two are in, we can merge
*** ykarel|away has quit IRC18:16
fungiclarkb: oh, yes, so the upgrade strategy default changed in decemberish i think, but also constraints is likely an issue if we specify a different version than the distro is packaging18:16
fungiclarkb: i stand corrected. looks like it may have gone in as early as may of last year
clarkbassuming pip doesn't change this behavior they are effectively saying you must use virtualenvs18:17
fungiso yeah, probably constraints is the issue18:17
mordredclarkb, fungi: maybe we should stop installing python things from distro packages in devstack or devstack-gate18:17
clarkbmordred: problem is you can't completely avoid it in all cases18:18
clarkbbecause cloud init for example18:18
clarkbwe don't have it but other people running devstack will18:18
clarkbyou may as well just error if not installing to a virtualenv now18:18
*** dklyle has joined #openstack-infra18:18
mordredoh - well, no arguments from me there18:18
fungii'm still not convinced that `sudo pip install ...` is a sane strategy anyway. using virtualenvs and avoiding mixing package managers makes a lot more sense to me but that train has sailed i guess18:18
clarkbor give us a flag to say we really know what we are doing and leave us alone18:19
pabelangerfungi: yah, agree18:19
clarkbfungi: ya the problem is it has been a supported configuration for as long as pip has existed18:19
mordredI just meant in our repos we should basically never install python from distro packages because we install things from pip globally as well - and doing both is *always* going to break something18:19
clarkbyou can't just remove that functionality imo18:19
clarkbmaybe disable it by default18:19
*** dhajare has quit IRC18:19
pabelangerinstall everything in to virtualenv, update systemd unit files as needed, that's worked well so far with testing I've been doing18:19
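clarkb's earlier idea of erroring out when not installing into a virtualenv could look something like the sketch below. It is an assumption-laden illustration, not anything pip or devstack actually does: it relies on `sys.prefix` differing from `sys.base_prefix` inside a venv, on older virtualenv releases setting `sys.real_prefix`, and the function names are hypothetical.

```python
import sys


def is_virtualenv(prefix, base_prefix, real_prefix=None):
    """Decide from interpreter prefixes whether this is an isolated environment."""
    if real_prefix is not None:
        # Older virtualenv releases record the system prefix here.
        return True
    return prefix != base_prefix


def running_in_virtualenv():
    """Apply the check to the current interpreter."""
    return is_virtualenv(
        sys.prefix,
        getattr(sys, "base_prefix", sys.prefix),
        getattr(sys, "real_prefix", None),
    )


def require_virtualenv():
    """Refuse to continue when running against the system interpreter."""
    if not running_in_virtualenv():
        raise RuntimeError("refusing to install outside a virtualenv")
```

An install wrapper could call `require_virtualenv()` first, so that distro-managed packages are never touched by a global pip run.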
fungiclarkb: a "supported" configuration where the upstream pip maintainers for the past decade have basically said "you really shouldn't do this but..."18:19
clarkbfungi: then they shouldn't have added the feature :P18:20
clarkbfungi: the problem is the ship sailed once it was allowed and people used it18:20
fungii agree on that point18:20
fungii don't think they realized what a problem it would grow into18:20
clarkbpabelanger: there are weird corner cases where it doesn't work like what we had with dib18:20
clarkbpabelanger: these are often fixable its just not something you can rely on when pulling python packages18:21
fungiand once they did and better solutions were devised, they couldn't easily take it back18:21
clarkbpabelanger: also does that mean you are volunteering to fix devstack? :P18:21
*** yamahata has quit IRC18:22
clarkbfwiw on debuntu it really isn't an issue18:22
mordredclarkb: you know - I wonder if devstack would work if we just added --user to the pip install commands18:22
clarkbmordred: you'd have to update all the unit file paths too for example but ya its doable18:23
clarkband then fix any dib sort corner cases if you run into them18:23
clarkb(the joy of systemd requiring fully rooted paths)18:23
clarkbalso this will break how we install zuul and nodepool etc etc etc18:24
clarkbits going to be a fairly painful transition18:24
openstackgerritMerged openstack-infra/puppet-httpd master: Allow access to docroot for proxy exclusions
openstackgerritMerged openstack-infra/openstack-zuul-jobs master: remove legacy-rally-dsvm-fakevirt-heat
*** slaweq has quit IRC18:25
mordredclarkb: I'm still fully catching up - what does it break about our zuul/nodepool installs?18:25
mordredclarkb: or, rather, what's the behavior change?18:26
clarkbmordred: the breakage is if you pip install nodepool and it needs to upgrade some package that is system installed pip says no I won't do it and your install fails18:26
openstackgerritFabien Boucher proposed openstack-infra/zuul master: Tenant config can be read from an external script
clarkbsince we install zuul and nodepool globally and can't completely control the fact that there are common python system libs we lose18:26
clarkbbut nothing stops $package from pulling things in18:27
clarkbI expect that this will be particularly painful on centos18:27
clarkbbecause yum18:27
clarkbfedora's switch to dnf in theory makes this better?18:27
*** tesseract has quit IRC18:28
fungidoes dnf use different installation paths?18:28
openstackgerritJohn L. Villalovos proposed openstack-infra/puppet-gerritbot master: Change gerritbot to install from git
fungiaha, got it18:28
clarkbon ubuntu our biggest headache has been cloud init which we personally do not use but others do18:28
pabelangerclarkb: ha, I mean adding it to devstack would be a good adventure18:28
fungiso saying yum itself depends on a bunch of rpms of python libs18:28
clarkbfungi: ya18:28
*** gfidente has joined #openstack-infra18:29
fungithis is one of the reasons red hat has persisted in stating that you shouldn't run any non-rh-packaged python applications using the distro-installed interpreter18:29
pabelangerclarkb: but, I'd be willing to try using virtualenv directly in devstack.  Maybe find some time at PTG to test it out18:30
fungiand instead have your own python build with your own lib search path and install your python apps and deps in there18:30
*** lamt has quit IRC18:30
clarkbpabelanger: ya in theory devstack supports it but I have no idea if it works or if it was updated to use unit files18:30
fungithe party line, at least for years, seems to have been that the packaged python interpreter and libraries are only there to support packaged python tools and applications shipped as a part of the distro18:31
fungiwhich i think is also how they justified not having python 3 available even as recently as rhel 718:32
clarkbpersonally I think the best approach would be to disable the uninstall behavior by default with a warning that pip 11 will completely remove the functionality18:32
clarkbthen give people ability to fix things on pip 10 by enabling the feature18:32
*** claudiub has quit IRC18:41
*** slaweq has quit IRC18:43
clarkbmordred: that shows you how pip 10 will break18:47
clarkband ^ confirms the behavior has not changed so we should be prepared for much of our world to be broken once pip 10 happens18:47
openstackgerritMerged openstack-infra/project-config master: Add OSA nspawn host/container repo base jobs
mnasergerrit just 502'd18:48
mnaserfor what its worth18:48
clarkblooks like could be GC related looking at melody18:49
clarkbif it persists we'll probably have to restart the server18:49
mnaserit doesn't bother me to f5 but i just want to give a heads up in case you're tracking anything18:49
fungiclarkb: what's the temporary workaround flag for the refusal to uninstall distutils-based packages?18:49
clarkbfungi: I don't think there is one18:49
mpetersonhey, I have a set of jobs that are not generating the ARA html, I think it might be a bug of the role definition but I don't have enough visibility in the logs to be able to see what ansible is doing.. can someone check? any of the functional or fullstack jobs18:50
clarkbmpeterson: we only generate them on failure18:50
fungiclarkb: oh, thought you said there was a temporary one which would be removed in pip 1118:50
clarkbfungi: no I'm saying thats what pip should do but as far as I know is not doing that18:50
*** yamamoto has joined #openstack-infra18:50
clarkbmpeterson: and looks like functional and fullstack are successful job runs so won't have ara output. We did this to conserve inodes on the log filesystem :/ there is work to have ara use a database file rather than many tiny files18:51
clarkbdmsimard: ^ is that work done? maybe we can enable ara on successes now?18:51
*** olaph has joined #openstack-infra19:17
clarkbpabelanger: ^ maybe a lock was held?19:17
mnaserclarkb: oh i thought mirrors were every 24 hours for some reason, my bad19:17
AJaegerpabelanger: thanks19:19
clarkbmnaser: that said do we even mirror lxc images?19:19
clarkboh wait I get it19:19
*** yamamoto has quit IRC19:19
mnaserclarkb: :)19:19
clarkbits the new images which are not mirrored using new packages that we don't have so updates don't work19:19
mnaseri was investigating mirroring them right now19:20
mnaserand how much space that costs19:20
mnaserlooks like its a ~100M or so pull for every job19:20
*** rossella_s has quit IRC19:20
mnaserclarkb: that would probably make things much easier then, because the urls are timestamped for the images so easy invalidation19:21
AJaegermnaser: could I trouble you for another review, please? is ready...19:23
pabelangerclarkb: no, I think we are spending a lot of time reloading in scheduler, which is starving CPU and preventing new builds from starting21:02
clarkbpabelanger: ah that could be too though we are definitely having longish periods where executors stop accepting jobs21:02
pabelangeryou can see we are using more CPU21:03
pabelangernot sure why however21:03
clarkb~5 minutes at a time21:03
AJaegerafter my last approval it took 7 minutes until the change was queued in gate21:03
dmsimardpabelanger: actually wait_for_connection is a thing now btw :p
dmsimardpabelanger: but yeah, +1, I'll use that instead21:04
pabelangerdmsimard: neat, TIL21:04
dmsimardpabelanger: yeah it's more convenient when testing for ssh connectivity21:04
clarkblooks like cpu increase started about 1500UTC?21:05
AJaegerconfig-core, thanks for the reviews - we still have some backlog on the projects, reviews welcome!21:05
* AJaeger waves good night21:05
AJaegerclarkb: and memory as well21:05
clarkboh ya21:06
clarkbso some config change maybe?21:06
AJaegerno idea - I glanced at zuul.o.o at that time but didn't see anything obvious ;/21:06
*** r-daneel has quit IRC21:06
*** r-daneel has joined #openstack-infra21:06
*** mgoddard_ has joined #openstack-infra21:07
*** hashar has quit IRC21:07
dmsimardI think it'd be useful to install some diagnostic tools by default, namely iotop and sysstat (for iostat)21:08
AJaegerconfig-core, some change that would be nice to have in are and
* AJaeger really leaves for today21:08
pabelangerclarkb: AJaeger: that's around the time when fungi approved the change to clear our zuul errors for OSA project21:08
dmsimardpabelanger: wasn't that yesterday ?21:08
*** slaweq_ has joined #openstack-infra21:09
AJaegerpabelanger: the change was approving another new project creation which then triggered the reconfig21:09
pabelangerdmsimard: there was some this morning too21:09
pabelangeryah, that21:09
pabelangerso, zuul did full reconfiguration21:09
*** eharney has quit IRC21:10
dmsimardThere's been quite a few exceptions if you look at just today's logs
jamespagefungi: in order to ease the overhead of backporting >300 source packages back to xenial, we've backported a newer debhelper so that any new bits landing in bionic which require debhelper >= 10 don't require us to hold a patch to downgrade it21:11
jamespagefungi: 3-4 pkgs vs 30-40 patches if you see what I mean21:12
odyssey4mejamespage :) evening21:12
dmsimardwow that's a nasty exception21:12
*** r-daneel_ has joined #openstack-infra21:13
dmsimardexception from the poll loop error:
dmsimardThat's almost java levels of stack trace21:13
*** r-daneel has quit IRC21:13
*** r-daneel_ is now known as r-daneel21:13
*** slaweq_ has quit IRC21:14
odyssey4mewe hit some trouble with that in OSA, although we discovered that we didn't really know why debhelper was being installed in the first place so we have patches up to remove it... and we're not entirely sure why the installs didn't work... we add UCA/queens on xenial, we try to install debhelper, kerblooey21:14
openstackgerritMerged openstack-infra/project-config master: Move jenkins/data/bindep-fallback.txt (2/2)
clarkbdmsimard: the unknown job error is causing the other error (no job means no attribute)21:17
clarkbdmsimard: I think that may imply we still have configs that are not quite correct?21:17
clarkband we are trying to run a job that doesn't exist?21:17
clarkbit would be nice if unknown job error had the job name too21:18
*** dprince has joined #openstack-infra21:18
* clarkb goes to poke at gear to see if that is doable21:18
dmsimardI'm not sure, there's a lot of other errors as well21:18
dmsimardNot familiar enough to tell if any of those are "normal", recoverable or fatal21:18
jamespagehi odyssey4me21:19
dmsimardI've seen this one before and it's happening a lot: "ERROR zuul.Scheduler: Unable to process autohold for None" I formally created a story for it so we don't forget21:19
dmsimardThere's also "ERROR zuul.Scheduler: Exception reporting runtime stats", "ERROR zuul.nodepool: Error unlocking node" and "ERROR zuul.nodepool: Node <Node 0002678661 ['primary']:fedora-26> is not locked"21:20
openstackgerritPaul Belanger proposed openstack-infra/project-config master: Remove jenkins-slave element from DIB images
pabelangerAJaeger: clarkb: ^now we can remove the jenkins-slave element from nodes21:20
*** iyamahat has joined #openstack-infra21:22
openstackgerritClark Boylan proposed openstack-infra/gear master: Include job name in UnknownJobError
clarkbsomething like ^ may help in debugging this21:23
pabelangerianw: brings online fedora-27 for tempest / devstack21:23
jamespageodyssey4me: hmmm did something break? we pushed the rc's from last week through to -updates PM today21:24
jamespagethat may have had a number of these toolchain type things with it21:24
odyssey4mejamespage ok, so something did change today21:24
*** rossella_s has quit IRC21:24
jamespageit was quite a large number of packages so they might not have all gone in during the same sync cycle, which would create some installability issues for an hour or so21:24
odyssey4meI'm tempted to fire up a test now to see if the issue happens outside of infra.21:25
clarkbreading the code more though its getting a status_res packet and then not finding the job associated with that packet so that it can update the job21:25
dmsimardclarkb: do we want "handle" or "job" ?21:25
clarkbdmsimard: job not existing is why we error, handle is the key we are looking for21:25
jamespageodyssey4me: issue is with debhelper installability right?21:25
dmsimardclarkb: makes sense21:25
clarkbthinking about why this may happen maybe the client is cancelling the jobs but that state hasn't made it all the way through21:25
clarkbI still think the name would help us understand why it happens21:25
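A minimal sketch of the idea clarkb describes, under stated assumptions: the lookup is keyed by the handle from the status packet, and the error carries that handle so the log shows what was being looked up. The class and function below are hypothetical stand-ins for illustration, not the gear library's actual code.

```python
class UnknownJobError(Exception):
    """Raised when a status packet names a handle with no known job.

    Hypothetical stand-in for illustration; not the gear library's class.
    """

    def __init__(self, handle):
        self.handle = handle
        super().__init__("Unknown job for handle %r" % (handle,))


def find_job(jobs_by_handle, handle):
    """Look up a job by its handle, reporting the handle if it is missing."""
    try:
        return jobs_by_handle[handle]
    except KeyError:
        # The job no longer exists (for example it was cancelled), so the
        # handle is the only identifying detail left to put in the error.
        raise UnknownJobError(handle) from None
```

With the handle in the message, a log line for this error can be matched against earlier submit or cancel entries for the same handle.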
odyssey4mejamespage mind if we switch into #openstack-ansible? I'd rather not interfere with the other conversation going on here.21:26
jamespageodyssey4me: ok21:26
ianwpabelanger: tbh, i'd prefer if we could get the stack @ in21:27
*** jtomasek has quit IRC21:27
openstackgerritMerged openstack-infra/openstack-zuul-jobs master: Finish golang job removal
*** rossella_s has joined #openstack-infra21:29
openstackgerritDavid Moreau Simard proposed openstack-infra/openstack-zuul-jobs master: Attempt to rescue lost SSH connection when restarting iptables
pabelangerianw: okay, I've given my +121:31
clarkbdmsimard: thinking about restarting iptables if using iptables persistent I wonder if a stop iptables implies a drop all connections on fedora? maybe we can just start iptables and avoid that?21:32
pabelangerianw: can you comment on the parts you have issue with in ? some jobs are using legacy-fedora-26, so we'll need to land legacy-fedora-27 for the moment21:32
ianwthanks, just need some eyes on the non-trivial stuff below that21:32
pabelangerit's possible that patch will just turn into adding nodesets21:32
dmsimardclarkb: we flush iptables rules on purpose to check if restarting iptables loads them back as a means of testing that the persistence works -- I wonder why the other distros don't have that issue21:33
*** kgiusti has left #openstack-infra21:34
clarkbI would look at the unit file and any scripts it is calling21:34
clarkbmy guess is its doing something smart21:34
*** mgoddard_ has quit IRC21:35
clarkbI doubt its the kernel since they are pretty steadfast about no user noticeable breakages but I guess its possible the kernel is doing something21:35
ianwpabelanger: if f27 runs devstack with python3, the updown job doesn't serve much purpose any more.  it's really not worth running it with py2 on f27 i wouldn't think21:36
dmsimardyeah, I'll look after we figure out what's going on with Zuul :p21:36
dmsimardso I see that the full reconfigure appears to have been done at 14:39, starting with 2018-02-22 14:39:31,197 INFO zuul.TenantParser: Loading previously parsed configuration from openstack-infra/project-config21:37
dmsimardthat seems to correlate with the higher cpu usage21:37
odyssey4mefungi not sure if you're available, but it's plausible that the last UCA repo update happened in the middle of the UCA being updated... are there logs for when UCA last updated?21:37
clarkbdmsimard: and with the memory use jump21:37
*** camunoz has quit IRC21:37
odyssey4meor pabelanger ^ not sure who can help21:37
*** pcaruana has quit IRC21:37
clarkbodyssey4me: yes navigate to the mirror with your browser and there will be a timestamp file you can check21:38
pabelangerianw: k, is there somebody else that could confirm if we don't need it. If so, then lets delete21:38
pabelangerodyssey4me: I can look21:38
pabelangerbut it seems to be fine21:38
pabelanger1 sec21:38
ianwpabelanger: well i just put it in as a smoke test, not sure anyone but me ever cared :)21:38
pabelangerit should be running again in 22mins however21:38
pabelangerianw: ah, okay. then lets remove it21:39
ianwjust make sure the other job is doing USE_PYTHON321:39
pabelangerodyssey4me: have time to wait 20mins?21:40
odyssey4meclarkb pabelanger ah you mean
clarkbodyssey4me: for uca it will be a different path, but yes21:40
odyssey4meta, thanks - happy to help ourselves do more digging :)21:40
odyssey4metimestamps are UTC?21:40
*** iyamahat_ has joined #openstack-infra21:40
odyssey4mek thx21:41
pabelangerthey update every 2hours21:41
*** iyamahat has quit IRC21:41
odyssey4meis that timestamp a file that comes from the source, or placed there by the sync process when the sync is done?21:41
dmsimardclarkb: it looks like there's two reconfigure in a row..21:42
openstackgerritPaul Belanger proposed openstack-infra/project-config master: Remove legacy-devstack-dsvm-py36-updown-fedora-26
dmsimardSeeing a lot of "WARNING zuul.GerritConnection: Unable to get change for [...]"21:44
*** slaweq_ has joined #openstack-infra21:46
*** olaph1 has joined #openstack-infra21:48
openstackgerritPaul Belanger proposed openstack-infra/openstack-zuul-jobs master: Add fedora-27 devstack / tempest jobs
openstackgerritPaul Belanger proposed openstack-infra/openstack-zuul-jobs master: Remove fedora-26 devstack / tempest testing
openstackgerritPaul Belanger proposed openstack-infra/openstack-zuul-jobs master: Remove devstack-dsvm-py36-updown-fedora-26 job
*** dave-mccowan has quit IRC21:49
*** olaph has quit IRC21:49
openstackgerritPaul Belanger proposed openstack-infra/project-config master: Switch devstack / tempest testing to fedora-27
pabelangerianw: ^okay, removed the updown job21:50
clarkbdmsimard: whats the ... in there?21:50
*** slaweq_ has quit IRC21:50
clarkbdmsimard: looking at the code for the gerrit connection I expect its just content we are effectively ignoring because it isn't a change or a ref21:51
odyssey4mefungi clarkb pabelanger we got to the bottom of the package install issue - briefly: jamespage pushed a bunch of updates to the PPA for UCA, which then somewhere while it wasn't done got pulled into UCA, which then also got pulled in by the infra mirror process21:51
dmsimardclarkb: looks like this:
odyssey4mesince then, further syncs have been done and the issue is resolved21:52
clarkbdmsimard: ya those are I think just noise, we are effectively ignoring the ref-replicated events this way21:52
pabelangerodyssey4me: you can see how we generate the timestamp:
clarkbsomething we might want to clean up in the logs so it doesn't create distractions21:52
pabelangerodyssey4me: I guess we could expand it based on which phase it is on21:52
*** yamamoto has joined #openstack-infra21:53
dmsimardclarkb: there's a LOT of them, 19k in today's log21:53
*** dave-mcc_ has joined #openstack-infra21:53
dmsimardI'll create a story, I've created a few so far:!/project/67921:53
pabelangerodyssey4me: good to know21:53
odyssey4methanks, as always, for the help and support :)21:53
clarkbdmsimard: ya anytime a ref gets replicated by gerrit we'll get one (so x 8 for git mirror + 1 for github)21:53
clarkbdmsimard: 19000k/9 refs roughly I think21:54
clarkber no k there21:54
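The estimate above works out as follows. This is only the arithmetic from the conversation (roughly 19k warnings, with each replicated ref producing one event per target: 8 git mirrors plus GitHub), not a measurement.

```python
# Figures quoted in the conversation, not measured here.
warnings_logged = 19000        # "19k in today's log"
targets_per_ref = 8 + 1        # 8 git mirrors + 1 GitHub replication target

# Each replicated ref yields one ignored event per target, so the number
# of refs behind the warnings is roughly the total divided by the targets.
refs_replicated = warnings_logged // targets_per_ref
print(refs_replicated)
```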
pabelangeryah, those warnings have been around for a while :)21:55
*** dhill__ has quit IRC21:56
*** yamamoto has quit IRC21:57
evrardjphello again21:59
pabelangerpushing 30mins since zuul has launched any new builds. Guess we have a lot of zuul.yaml files in pipelines right now21:59
evrardjpis there someone that can add the openstack-ansible-core group into the group,members (see also the review
clarkbpabelanger: or possibly the gearman issues that dmsimard found are causing us to not launch jobs22:00
clarkbis gearman running?22:00
dmsimardif you look at the web status22:00
dmsimardsome changes' jobs have all finished running but they're just staying there22:00
clarkbdmsimard: thats "normal" if you look at the queue counts at the top of the page22:01
pabelangerdmsimard: right, zuul hasn't processed them yet22:01
clarkb624 results to process which are probably your changes22:01
pabelangerand there it goes now22:01
dmsimardah, okay yeah it just cleared it22:01
clarkblooks like ya it just went22:01
*** yamamoto has joined #openstack-infra22:01
pabelangerI think we just have a lot of zuul.yaml files right now22:01
pabelangerand with reloads taking 20 seconds, it's taking longer22:02
dmsimardseeing fingergw error22:03
*** r-daneel_ has joined #openstack-infra22:05
* fungi is back again, catching up22:05
dmsimardI'm tailing logs and I'm not seeing anything obvious22:06
*** r-daneel has quit IRC22:06
*** r-daneel_ is now known as r-daneel22:06
clarkbI think we may just be seeing a further regression from the memory improvements?22:06
clarkbzuul is operating just more slowly22:06
pabelangerit was working great, up until this morning memory increase22:07
dmsimardexecutors are very spiky22:07
clarkbfungi: seems to be correlated to fixing the osa stuff?22:07
pabelangerdmsimard: starting builds graph is the one to watch, if that is flat, no new jobs are starting22:07
dmsimardpabelanger: yeah I realize that but executors *are* picking up new builds22:08
TheJuliapabelanger: sure, can it wait a day or two? Looks like it didn't take to fedora27 very well. Likely a fix, but I won't have the requisite brain cells for at least another day.22:08
*** danpawlik has quit IRC22:08
pabelangerdmsimard: yah, once builds are started, executors run them without issue22:08
pabelangerTheJulia: Sure, just wanted to make sure you were aware22:09
*** jcoufal has quit IRC22:09
dmsimardnodepool is showing over 100 nodes ready, at least according to
dmsimardso zuul is just not picking up the nodes from nodepool to assign them to builds or something ?22:10
*** dhill_ has joined #openstack-infra22:10
TheJuliapabelanger: awesome, thanks!22:10
pabelangerdmsimard: zuul is busy doing dynamic reloads, and events are backing up in queue. see top left of zuul.o.o. When reloading, no new events are processed22:11
pabelangerso, nodepool will finish bringing nodes online, then wait to move them in-use by zuul22:12
dmsimardpabelanger: thus the bursts of executors picking up new jobs we've been seeing22:12
*** armaan has quit IRC22:13
fungiclarkb: if what i did this morning (approving another new project creation change) is what triggered the current performance regression then we have some reason to be concerned, i suppose22:13
pabelangerreloads are taking 20 seconds now, up from 12 seconds yesterday. not sure why22:13
pabelangerfungi: clarkb: isn't there a way we can dump current layouts in memory and see if we've leaked something?22:16
pabelangerI'm not sure how that worked22:16
clarkbI think it required doing the repl socket22:17
clarkbnot sure if that is still enabled22:17
*** agopi has joined #openstack-infra22:18
*** slaweq_ has joined #openstack-infra22:18
dmsimardSo I think this isn't a perfect query but it shows recent changes for zuul files*%2524%2222:19
*** rlandy|ruck is now known as rlandy|biab22:21
*** slaweq has quit IRC22:22
*** slaweq has joined #openstack-infra22:22
*** slaweq_ has quit IRC22:23
dmsimardIn zuul-web, there's a lot of these: "Submitting job zuul:status_get with data {'tenant': 'openstack'}" and sometimes they don't come back immediately -- I suppose that's the /status endpoint.. is the apache caching working ?22:23
dmsimardI'm also not sure why that is but in the apache error log, there's this error but for like... every repo "Not a git repository: '/var/lib/zuul/git/openstack/neutron-vpnaas'"22:25
dmsimard(/var/lib/zuul/git is indeed empty on zuul01)22:25
dmsimardI have no idea where I'm going, just thinking out loud and hoping it will make someone knowledgeable think of something :D22:26
clarkbI think that may be old jobs looking for zuul refs?22:26
clarkbnot sure22:27
*** slaweq has quit IRC22:27
smcginnisclarkb: Do I remember right that one of the lunch presos next week will be about zuulv3?22:29
smcginnisclarkb: Or is that its own session?22:29
fungismcginnis: yes, tuesday (infra/qa joint update, much of which will be zuul related on both fronts)22:30
smcginnisfungi: Great, thanks22:30
dmsimardNever realized there was so many queries for status/status.json:
*** esberglu has quit IRC22:32
*** dprince has quit IRC22:33
*** kjackal has quit IRC22:34
*** tonyb has quit IRC22:35
*** tonyb has joined #openstack-infra22:36
dmsimardThere's an IP in particular that's hammering status.json, it's owned by RAX but doesn't seem to be one of our servers22:37
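One way to find a client like that is to tally requests per address from the access log. The sketch below assumes Apache's common or combined log format (client address first, request line in the first quoted field); the function name is made up for illustration and is not what was actually run on the server.

```python
from collections import Counter


def top_status_clients(lines, path_fragment="status.json", limit=5):
    """Count requests per client address for paths containing path_fragment."""
    counts = Counter()
    for line in lines:
        # In common/combined format the request line is the first quoted field.
        parts = line.split('"')
        if len(parts) < 2 or path_fragment not in parts[1]:
            continue
        # The client address is the first whitespace-separated field.
        client = line.split(None, 1)[0]
        counts[client] += 1
    return counts.most_common(limit)
```

Feeding it an open log file, for example `top_status_clients(open(path))` with `path` pointing at the access log, returns the busiest addresses first.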
*** rcernin has joined #openstack-infra22:38
dmsimardI have no idea22:39
*** sshnaidm has joined #openstack-infra22:40
*** sshnaidm is now known as sshnaidm|off22:40
*** tosky has joined #openstack-infra22:41
*** shoogz has quit IRC22:44
fungiprobably someone's personal vm22:44
*** dave-mcc_ has quit IRC22:44
fungimaybe running a command-line status display22:44
mnasersneak in a quick rewrite rule for their ip only to show jobs saying 'pls-contact-infra'22:47
*** Goneri has quit IRC22:49
*** lpetrut has quit IRC22:50
dmsimardthey're getting 404's so I don't suppose it's hurting us too much22:58
openstackgerritDavid Moreau Simard proposed openstack-infra/zuul master: Add debug information when failing to autohold
clarkbis corvus around today? wondering if we shouldn't restart the server to see if reconfig time drops back down to 12s but worried that might destroy debugging ability23:01
corvusclarkb: i'm here23:01
clarkboh hi23:01
corvuswhat's up?23:01
clarkbcorvus: tl;dr is reconfigures are apparently up to 20s now and we're seeing higher cpu use on zuul.o.o since ~1500UTC which also came along with a jump in memory use23:02
clarkbthat time roughly correlates to when fungi fixed the osa repo23:02
corvusor rather, the reconfiguration after it was fixed i guess?23:02
clarkbas for user impacts zuul "feels" slow. The status graphs show it as being far more bursty than it was before23:02
clarkbcorvus: ya23:02
*** yamamoto has quit IRC23:02
corvushave we added any new projects other than osa since the last restart?23:03
* clarkb checks for github emails23:03
dmsimardcorvus: I created a few stories about the different errors/exceptions I've been seeing which might or might not be related:!/project/67923:03
clarkbyes tatu-dashboard and possibly osel23:03
corvusdid those have memory/cpu bumps?23:04
*** VW has quit IRC23:05
dmsimardneed to step away for now, I'll catch up later23:05
* clarkb does timestamp maths23:05
openstackgerritIan Wienand proposed openstack/diskimage-builder master: GPT partitioning support
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Fail if two elements provide the same thing
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Add block-device defaults
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Choose appropriate bootloader for block-device
openstackgerritIan Wienand proposed openstack/diskimage-builder master: arm64: use HWE kernel and fix console
clarkb9:15 was when I got the tatu-dashboard email23:05
clarkbwhich would be 1715UTC23:05
*** VW has joined #openstack-infra23:05
clarkbno memory jump for that one23:06
clarkbosel was at 1430UTC23:06
clarkbdoesn't look like a jump for that one either23:07
*** rossella_s has quit IRC23:07
corvusso it sounds like there may be something amiss with successful full reconfigurations after a failed full reconfiguration23:07
clarkbthe memory increase appears to start at 151023:07
*** rossella_s has joined #openstack-infra23:09
corvusi don't think there's much more we can do right now with the current running process; i think we should take that as a hypothesis, go ahead and restart, and, since we're not going to get to fixing it right away on account of travel, keep an eye out for any more data points that could confirm/disprove that for the next little bit.23:10
*** VW has quit IRC23:10
clarkbok I'm loading my ssh key now. Anyone else able to help with that?23:12
corvusclarkb: i'm deep into another problem but can switch if you need me23:13
clarkbok I'll ping if I run into trouble23:13
clarkbchecking there aren't any release jobs queued23:14
*** rlandy|biab is now known as rlandy|ruck23:14
clarkbya no release jobs so I'll go ahead and save queues and restart the scheduler and zuul-web23:14
fungiclarkb: corvus: approving the osel project creation is what i did to "fix" the osa repos (i figured approving another project addition would trigger a full reconfig, and that seems to have done the trick)23:14
*** VW has joined #openstack-infra23:15
clarkbcorvus: ^ still want me to restart knowing that?23:15
clarkbI'm not sure it changes the hypothesis though the timing is less tight23:16
corvusclarkb, fungi: then the 1430 approval probably translated into a 15.. something reload23:16
clarkbok moving ahead with restart then23:16
fungisounds likely since that part is apparently still puppet-driven23:16
corvus14:46 reload.  so there was a bit of a lag.23:17
clarkbI've told zuul to stop now just waiting for it to stop23:18
clarkb(its not an immediate response but the log did show it got the stop command on the socket)23:19
*** yee37935 has quit IRC23:20
corvusif it's in a slow reconfiguration queue run, it should stop when it gets to the end23:20
*** dbecker has quit IRC23:20
*** VW has quit IRC23:20
*** yee379 has joined #openstack-infra23:20
*** rossella_s has quit IRC23:21
*** rossella_s has joined #openstack-infra23:22
clarkblooks like it is currently processing queue items23:24
*** apetrich has quit IRC23:25
clarkblots of zuul.Pipeline.openstack.check: debug messages currently23:26
corvusthat seems strange23:26
corvusthere are no other pipelines it's processing23:28
clarkbya I've not seen any23:28
corvusclarkb: i suspect you should just kill it23:28
clarkbcorvus: sigint or sigkill?23:29
*** rossella_s has quit IRC23:29
clarkbI guess I can start with sigint23:29
fungii take it sigterm is what the initscript sent?23:29
corvusfungi: i think it uses the socket23:29
clarkbfungi: no it sent a command on the command socket23:29
fungikill uses sigterm by default, iirc23:30
clarkboh right not int23:30
clarkbI can start with term then23:30
clarkbhere goes23:30
*** rossella_s has joined #openstack-infra23:31
corvusit's stuck because it's not actually removing 547073,5 from the queue for some reason23:31
fungisigint is what happens when you ctrl-c in most foreground applications23:31
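The escalation being discussed can be sketched generically: send SIGTERM first (the signal `kill` sends when none is named), and fall back to SIGKILL only if the process lingers. This is an illustration on POSIX with a hypothetical helper name, not how the zuul scheduler or its init script is actually stopped.

```python
import signal
import subprocess


def stop_process(proc, grace_seconds=5.0):
    """Ask a child process to exit with SIGTERM, escalating to SIGKILL."""
    # SIGTERM can be caught, so the process gets a chance to clean up.
    proc.send_signal(signal.SIGTERM)
    try:
        proc.wait(timeout=grace_seconds)
    except subprocess.TimeoutExpired:
        # On POSIX, kill() sends SIGKILL, which cannot be caught or ignored.
        proc.kill()
        proc.wait()
    # On POSIX a negative return code is the number of the fatal signal.
    return proc.returncode
```

SIGINT, the Ctrl-C signal, is left out on purpose: for a daemon with no controlling terminal, SIGTERM is the conventional polite request.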
clarkbthat seems to have stopped the queue processing but not stopped the process23:31
clarkband now its gone23:31
clarkbstarting it up again now23:31
*** efried has quit IRC23:32
fungi547073,5 looks like just a bog-standard tempest patch23:33
*** slaweq has joined #openstack-infra23:33
corvusyeah, i don't know what's special there23:34
fungigit dependency on another tempest patch which is also typical-looking23:34
clarkbonce these cat jobs finish I'll reenqueue23:34
*** hongbin has quit IRC23:34
andreafclarkb corvus fungi is there any chance that jobs that were in progress will complete and still get their logs uploaded?23:36
*** VW has joined #openstack-infra23:36
clarkbandreaf: if they finished before zuul released the nodesets yes otherwise its a race to how quickly nodepool can clean them up23:36
corvusthe executors will abort the jobs too once they lose the connection to the gearman server23:37
corvusit all happens very fast, so i'd say the chance is slim.23:37
andreafclarkb corvus ok np - can I recheck now?23:38
openstackgerritlifeless proposed openstack/gertty master: Don't lose sync requests that get bad responses
corvusandreaf: yep23:38
*** slaweq has quit IRC23:39
clarkbyes zuul is up and running now23:39
clarkbI am re-enqueuing too which may catch your thing23:39
*** tosky has quit IRC23:39
*** rosmaita has quit IRC23:40
andreafclarkb ok thanks23:40
*** VW has quit IRC23:41
lifelessfungi: ^ that might be a thing to merge and cut a release. It could explain a lot of 'wtf did my change go' moments :)23:41
lifelessfungi: there's also a bunch of other gertty stuff sitting there with multiple +1s etc :(23:42
*** VW has joined #openstack-infra23:42
corvuslifeless: thanks, i'll try to take a look soon23:42
lifelesscorvus: ta!23:42
lifelesscorvus: bug for it!/story/200157223:43
*** slaweq has joined #openstack-infra23:43
openstackgerritIan Wienand proposed openstack-infra/project-config master: Add arm64 nodes to launcher
fungilifeless: cool! i think i've probably hit that too but didn't manage to hunt down the cause23:45
*** abelur has quit IRC23:45
fungialso, while i am a sometimes reviewer/contributor to gertty, it's thoroughly a corvus production23:46
*** VW has quit IRC23:46
lifelessfungi: Its been a while, I just looked at the 'reviewers' list and grabbed the first name I remembered ;)23:46
lifelessfungi: corvus: there may be other cases where syncs are lost - checkResponse for instance may want to look for 400's and 5xx's more generically. But this specific case wouldn't be fixed by that either, so..23:47
*** slaweq has quit IRC23:48
andreafcorvus how difficult it is to setup a local zuul and get it running with a nodepool running a couple of local virtualbox vms ?23:48
andreafcorvus it would be nice to be able to do some quicker local testing23:49
ianwarm64 getting soooo close.  wouldn't mind eyes on which is a quick one to add linaro cloud to nodepool clouds conf ... one less hand-applied thing23:50
clarkbandreaf: there is a quick start guide in review somewhere23:50
ianwbuilds are working & uploading ... just a few tweaks to get console/config-drive working23:50
clarkbandreaf: but ya it shouldn't be too hard let me try and find it23:51
*** rlandy|ruck is now known as rlandy|bbl23:51
clarkblooks like it doesn't go over nodepool but for that I'd configure nodepool to use some static nodes provided by whatever (virtualbox for example)23:53
corvusandreaf: if you do that, you can use the static driver in nodepool23:53
corvusandreaf, clarkb: you may prefer which is a more thorough step-by-step walkthrough for a quick example23:54
corvus(it does cover nodepool, but not static nodes (yet))23:54
*** caphrim007_ has joined #openstack-infra23:54
corvusandreaf: but if you just want to do quick local testing, why not just run ansible directly?23:55
*** caphrim007 has quit IRC23:57
andreafcorvus: that doesn't go e2e from the job definition - so I need to parse my job definition into an inventory file to supply ansible23:57
andreafcorvus merging the dicts and so23:57
corvusandreaf: you can grab an inventory file from a previously run job23:58
corvusmodify it to use your host, setup the git repos on your test node, then run the devstack playbook.  that should be about it.23:59
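The "modify it to use your host" step could look roughly like this. It is a sketch under assumptions: the inventory has already been loaded into a dict with the usual Ansible YAML layout (`all` -> `hosts` -> host name -> variables), and the helper name is hypothetical. Loading and saving the file, setting up the git repos, and running the playbook are left out.

```python
import copy


def retarget_inventory(inventory, new_addresses):
    """Return a copy of the inventory with ansible_host replaced per host.

    Assumes the layout {"all": {"hosts": {name: {vars...}}}}; new_addresses
    maps existing host names to the address of your own test node.
    """
    updated = copy.deepcopy(inventory)
    hosts = updated.get("all", {}).get("hosts", {})
    for name, address in new_addresses.items():
        if name not in hosts:
            raise KeyError("host %r not in inventory" % (name,))
        # Keep every other variable recorded for the host; change only
        # the address Ansible connects to.
        host_vars = dict(hosts[name] or {})
        host_vars["ansible_host"] = address
        hosts[name] = host_vars
    return updated
```

Working on a copy keeps the inventory from the original job run intact for comparison.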
